
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Hermann Ney, "On the Probabilistic Interpretation of Neural Network Classifiers and Discriminative Training Criteria," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 17, no. 2, pp. 107119, February, 1995.  
BibTex  x  
@article{ 10.1109/34.368176, author = {Hermann Ney}, title = {On the Probabilistic Interpretation of Neural Network Classifiers and Discriminative Training Criteria}, journal ={IEEE Transactions on Pattern Analysis and Machine Intelligence}, volume = {17}, number = {2}, issn = {01628828}, year = {1995}, pages = {107119}, doi = {http://doi.ieeecomputersociety.org/10.1109/34.368176}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Pattern Analysis and Machine Intelligence TI  On the Probabilistic Interpretation of Neural Network Classifiers and Discriminative Training Criteria IS  2 SN  01628828 SP107 EP119 EPD  107119 A1  Hermann Ney, PY  1995 KW  Statistical pattern recognition KW  neural networks KW  discriminant functions KW  training criteria KW  speech recognition. VL  17 JA  IEEE Transactions on Pattern Analysis and Machine Intelligence ER   
[1] M. A. Aizerman,E. M. Braverman,L. I. Rozonoer,“Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning,” Automation and Remote Control, vol. 25, pp. 821837, June. 1964; “The Probability Problem of Pattern Recognition Learning and the Method of Potential Functions,” Automation and Remote Control, vol. 25, pp. 11751193, Sept. 1964.
[2] H. Asoh,N. Otsu,“Nonlinear Data Analysis and Multilayer Perceptrons,” Proc. IEEE Int. Joint Conf. on Neural Networks,Washington, DC, vol. II, pp. 411415, June 1989.
[3] J. K Baker,“Stochastic Modeling for Automatic Speech Understanding,” Reddy, D.R. (ed.): Speech Recognition, Academic Press, New York, pp. 512542, 1975.
[4] L. R. Bahl,P. F. Brown,P.V. de Souza,R. L. Mercer,“Maximum Mutual Information Estimation of Hidden Markov Model Parameters,” Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Proc.,Tokyo, pp. 4952, April 1986.
[5] L. R. Bahl,P. F. Brown,P.V. de Souza,R. L. Mercer,“A New Algorithm for the Estimation of Hidden Markov Model Parameters,” Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Proc.,New York, NY, pp. 493497, April 1988.
[6] H. Bourlard,Y. Kamp,“AutoAssociation by Multilayer Perceptrons and Singular Value Decomposition,” Biological Cybernetics, vol. 59, pp. 291294, 1988.
[7] C. C. Blaydon,“On a Pattern Recognition Result of Aizerman, Braverman and Rozonoer,” IEEE Trans. on Information Theory, vol. IT12, pp. 8283, Jan. 1966.
[8] J. S. Bridle,“Probabilistic Interpretation of Feedforward Classification Network Outputs with Relationships to Statistical Pattern Recognition,” F. FogelmanSoulie, J. Herault (eds.): Neurocomputing: Algorithms, Architectures and Applications, NATO ASI Series in Systems and Computer Science, Springer, 1989.
[9] H. Bourlard,C. J. Wellekens,“Links between Markov Models and Multilayer Perceptrons,” D.S. Touretzky (ed.): “Advances in Neural Information Processing Systems I”, Morgan Kaufmann Pub., San Mateo, CA, pp. 502507, 1989.
[10] R. O. Duda,P. E. Hart,“Pattern Classification and Scene Analysis”,John Wiley&Sons, New York, 1973.
[11] A. ElJaroudi,J. Makhoul,“A New Error Criterion for Posterior Probability Estimation with Neural Nets,” Proc. IEEE Int. Joint Conf. on Neural Networks,Washington, DC, vol. III, pp. 185192, June 1990.
[12] K. Fukunaga, Introduction to Statistical Pattern Recognition, second edition. Academic Press, 1990.
[13] H. Gish,“A Probabilistic Approach to the Understanding and Training of Neural Network Classifiers,” Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Proc.,Albuquerque, NM, pp. 13611365, April 1990.
[14] P. Gallinari,S. Thiria,F. Badran,F. FogelmanSoulie,“On the Relations between Discriminant Analysis and Multilayer Perceptrons,” Neural Networks, vol. 4, pp. 349360, 1991.
[15] D. J. Hand,“Kernel Discriminant Analysis”, Research Studies Press, Wiley, Chichester, 1982.
[16] J. B. Hampshire,A. H. Waibel,“A Novel Objective Function for Improved Phoneme Recognition Using Time Delay Neural Networks,” Proc. IEEE Int. Joint Conf. on Neural Networks,Washington, DC, vol. I, pp. 235241, June 1989.
[17] F. Jelinek,“Speech Recognition by Statistical Methods,” Proc. of the IEEE, vol. 64, pp. 532556, April 1976.
[18] B. H. Juang,S. Katagiri,“Discriminative Learning for Minimum Error Classification,” IEEE Trans. on Signal Processing, vol. 40, pp. 30433054, Dec. 1992.
[19] T. Kohonen, "SelfOrganization and Associated Memory," Berlin Heidelberg. New York: SpringerVerlag, 1988.
[20] T. Kohonen,G. Barna,R. Chrisley,“Statistical Pattern Recognition with Neural Networks: Benchmark Studies,” Proc. IEEE Int. Conf. on Neural Networks,San Diego, CA, pp. I6168, July 1988.
[21] A. Ljolje,Y. Ephraim,L. R. Rabiner,“Estimation of Hidden Markov Model Parameters by Minimizing Empirical Error Rate,” Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Proc.,Albuquerque, NM, pp. 709712, April 1990.
[22] K. F. Lee,S. Mahajan,“Corrective and Reinforcement Learning for SpeakerIndependent Continuous Speech Recognition,” Computer Science Dept., CarnegieMellon University, Pittsburgh, PA, Rep. CMUCS89100, Jan. 1989.
[23] D. Lowe,A. R. Webb,“Optimized Feature Extraction and the Bayes Decision in FeedForward Classifier Networks,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 13, pp. 355364, April 1991.
[24] A. Nadas,“Optimal Solution of a Training Problem in Speech Recognition,” IEEE Trans. on Acoustics, Speech and Signal Proc., vol. ASSP33, pp. 326329, Feb. 1985.
[25] L. T. Niles,H. F. Silverman,M. A. Bush,“Neural Networks, Maximum Mutual Information Training, and Maximum Likelihood Training,” Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Proc.,Albuquerque, NM, pp. 493496, April 1990.
[26] T. Poggio and F. Girosi, Networks for Approximation and Learning Proc. IEEE, vol. 78, pp. 14811497, 1990.
[27] J. D. Patterson,B. F. Womack,“An Adaptive Pattern Classification Scheme,” IEEE Trans. on Systems, Science and Cybernetics, vol. SSC2, pp. 6267, Aug. 1966.
[28] C. R. Rao,“Linear Statistical Inference and its Applications,”John Wiley&Sons, New York, 1965.
[29] D.E. Rumelhart, G.E. Hinton, and R.J. Williams, "Learning Internal Representations by Error Propagation," Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol. 1: Foundations, D.E. Rumelhart and J.L. McClelland et al., eds., chapter 8, pp. 318362.Cambridge, Mass.: MIT Press, 1986.
[30] M. D. Richard,R. P. Lippmann,“Neural Network Classifiers Estimate Bayesian a posteriori Probabilities,” Neural Computation, vol. 3, pp. 461483, 1991.
[31] S. Renals,R. Rohwer,“Phoneme Classification Experiments Using Radial Basis Functions,” Proc. IEEE Int. Joint Conf. on Neural Networks,Washington, DC, vol. I., pp. 461467, June 1989.
[32] S. A. Solla,E. Levin,M. Fleisher,M:“Accelerated Learning in Layered Neural Networks,” Complex Systems, vol. 2, pp. 625639, 1988.
[33] D.F. Specht, "Probabilistic Neural Networks," Neural Networks, vol. 3, no. 1, pp. 109118, 1990.
[34] N. Tishby,S. Levin,S. A. Solla,“Consistent Inference of Probabilities in Layered Neural Networks,” Proc. IEEE Int. Joint Conf. on Neural Networks,Washington, DC, vol. II, pp. 4034o9, June 1989.
[35] A. J. Viterbi,J. K. Omura,“Principles of Digital Communication and Coding,”McGrawHill, Tokyo, 1979.
[36] A. Waibel,T. Hanazawa,G. Hinton,K. Shikano,K. L. Lang,“Phoneme Recognition: Neural Networks vs. Hidden Markov Models,” Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Proc.,New York, NY, pp. 107110, April 1988.
[37] H. White,“Learning in Artificial Neural Networks: A Statistical Perspective,” Neural Computation, vol. 1, pp. 425464, 1990.
[38] A. R. Webb,D. Lowe,“The Optimised Internal Representation of Multilayer Classifier Networks Performs Nonlinear Discriminant Analysis,” Neural Networks, vol. 3, pp. 367375, 1990.