This Article 
 Bibliographic References 
 Add to: 
Person Identification Using Multiple Cues
October 1995 (vol. 17 no. 10)
pp. 955-966

Abstract—This paper presents a person identification system based on acoustic and visual features. The system is organized as a set of non-homogeneous classifiers whose outputs are integrated after a normalization step. In particular, two classifiers based on acoustic features and three based on visual ones provide data for an integration module whose performance is evaluated. A novel technique for the integration of multiple classifiers at an hybrid rank/measurement level is introduced using HyperBF networks. Two different methods for the rejection of an unknown person are introduced. The performance of the integrated system is shown to be superior to that of the acoustic and visual subsystems. The resulting identification system can be used to log personal access and, with minor modifications, as an identity verification system.

[1] T. Poggio and L. Stringa,“A project for an intelligent system: Vision and learning,” Int’l J. Quantum Chemistry, vol. 42, pp. 727-739, 1992.
[2] F.K. Soong and A.E. Rosenberg,“On the use of instantaneous and transitional spectral information in speaker recognition,” IEEE Trans Acoustic, Speech, and Signal Processing, vol. 36, no. 6, pp. 871-879, 1988.
[3] S. Furui,“Cepstrum analysis technique for automatic speaker verification,” IEEE Trans Acoustic, Speech, and Signal Processing, vol. 29, no. 1, pp. 254-272, 1981.
[4] A.E. Rosenberg,J. DeLong,C.H. Lee,B.H. Juang,, and F.K. Soong,“The use of cohort normalized scores for speaker verification,” Proc. ICSLP, vol. 1, pp. 599-602,Banff, Canada, Oct. 1992.
[5] T. Matsui and S. Furui,“Similarity normalization method for speaker verification based on a posteriori probability,” Proc. ESCA Workshop on Automatic Speaker Recognition Identification Verification, pp. 59-62,Martigny, Switzerland, Apr. 1994.
[6] R. Brunelli,D. Falavigna,T. Poggio,, and L. Stringa,“A recognition system, particularly for recognizing people,” patent no. 93112738, 1993, Priority IT/11.08.92/IT TO920695.
[7] R. Brunelli,T. Poggio,D. Falavigna,, and L. Stringa,“Automatic person recognition by using acoustic and geometric features,” Tech. Report 9307-43, I.R.S.T., 1993, to appear in Machine Vision and Applications.
[8] G.R. Doddington,“Speaker recognition, identifying people by their voices,” Proc. IEEE, vol. 73, no. 11, 1985.
[9] A.E. Rosenberg and F.K. Soong,“Evaluation of a vector quantization talker recognition system in text independent and text dependent modes,” Computer Speech and Language, vol. 2, no. 3-4, pp. 143-157, 1987.
[10] H. Gish, J. Makhoul,S. Roucos,“Vector quantization in speech coding,” Proc. IEEE, vol. 73, no. 11, pp. 1551-1588, 1985.
[11] D. O’Shaughnessy,Speech communication.Addison-Wesley, 1987.
[12] P. Melmerstein and S.B. Davis,“Comparison of parametric representations for monosyllabic word recognition in continuosly spoken sentences,” IEEE Trans Acoustic, Speech, and Signal Processing, vol 28, no. 4, pp. 357-366, 1980.
[13] G. Carli and R. Gretter, “A start-end point detection algorithm for a real-time acoustic front-end based on dsp32c vme board,” Proc. ICSPAT, pages 1,011-1,017,Boston, Nov. 1992.
[14] R. Brunelli and T. Poggio, "Face Recognition: Features vs. Templates," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 1,042-1,053, Oct. 1993.
[15] R. Brunelli,“Estimation of pose and illuminant direction for face processing,” A.I. Memo No. 1499, Massachusetts Inst. of Tech nology, 1994. P.J. Burt,“Smart sensing within a pyramid vision machine,” Proc. IEEE, vol. 76, no. 8, pp. 1,006-1,015, 1988.
[16] P. W. Hallinan,“Recognizing human eyes,” SPIE Proc., vol. 1570, pp. 214-226, 1991.
[17] L. Stringa,“Eyes detection for face recognition,” Applied Artificial Intelligence, vol. 7, pp. 365-382, 1993.
[18] D.H. Ballard and C.M. Brown, Computer Vision, Prentice Hall, Upper Saddle River, N.J., 1982.
[19] R. Brunelli and S. Messelodi,“Robust estimation of correlation: With application to computer vision,” Pattern Recognition, vol. 28, no. 6, pp. 833-861, 1995.
[20] L. Stringa,“Automatic face recognition using directional derivatives,” Tech. Report 9205-04, I.R.S.T., 1991.
[21] L. Xu, A. Krzyzak, and C.Y. Suen, “Methods of Combining Multiple Classifiers and Their Applications to Handwriting Recognition,” IEEE Trans. Systems, Man, and Cybernetics, vol. 22, no. 3, pp. 418-435, 1992.
[22] F.R. Hampel,P.J. Rousseeuw,E.M. Ronchetti,, and W.A. Stahel,Robust Statistics: The Approach Based on Influence Functions. John Wiley&Sons, 1986.
[23] P. J. Huber,Robust Statistics.Wiley, 1981.
[24] P.B. Bonissone and K.S. Decker,“Selecting uncertainty calculi and granularity: An experiment in trading off precision and complexity,” J.F. Lemmer, L.N. Karnak, eds., Uncertainty in Artificial Intelligence, pp. 217-247,North Holland, 1986.
[25] P.B. Bonissone,S.S. Gans,, and K.S. Decker,“Rum: A layered architecture for reasoning with uncertainty,” Proc. 10th Int’l Joint Conf. on Artificial Intelligence, pp. 891-898,Milan, Aug. 1987.
[26] R.V. Hogg and A.T. Craig,Intro. to Statistics.Collier-Macmillan, 1978.
[27] R.O. Duda and P.E. Hart,Pattern Recognition and Scene Analysis.New York: Wiley, 1973.
[28] Y.-H. Pao,Adaptive Pattern Recognition and Neural Networks. Reading, Mass: Addison-Wesley, 1989.
[29] R. Brunelli. On training neural nets through stochastic minimization, Neural Networks, vol. 7, no. 9, pp. 1405-1412, 1996.
[30] R. Brunelli and G. Tecchiolli. Stochastic minimization with adaptive memory, , J. of Computational and Applied Mathematics, pp. 329-343, 1995.
[31] K. Fukunaga, Introduction to Statistical Pattern Recognition, second edition. Academic Press, 1990.
[32] T. Poggio and F. Girosi, Regularization algorithms for learning that are equivalent to multilayer networks, Science, vol. 247, pp. 978-982, 1990.

Index Terms:
Template matching, robust statistics, correlation, face recognition, speaker recognition, learning, classification.
Roberto Brunelli, Daniele Falavigna, "Person Identification Using Multiple Cues," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 17, no. 10, pp. 955-966, Oct. 1995, doi:10.1109/34.464560
Usage of this product signifies your acceptance of the Terms of Use.