loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2006 IEEE International Conference on Multimedia and Expo
Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches
Toronto, ON, Canada
July 09-July 12
ISBN: 1-4244-0366-7
Margarita Kotti, Department of Informatics, Aristotle Univ. of Thessaloniki, Box 451, Thessaloniki 541 24, Greece. E-mail: mkotti@zeus.csd.auth.gr
Luis P. M. Martins, INESC Porto, Porto, Portugal. E-mail: lmartins@inescporto.pt
Emmanouil Benetos, Department of Informatics, Aristotle Univ. of Thessaloniki, Box 451, Thessaloniki 541 24, Greece. E-mail: empeneto@zeus.csd.auth.gr
Jaime Cardoso, INESC Porto, Porto, Portugal. E-mail: jsc@inescporto.pt
Constantine Kotropoulos, Department of Informatics, Aristotle Univ. of Thessaloniki, Box 451, Thessaloniki 541 24, Greece. E-mail: costas@zeus.csd.auth.gr
This paper addresses the problem of unsupervised speaker change detection. Three systems based on the Bayesian Information Criterion (BIC) are tested. The first system investigates the AudioSpectrumCentroid and the AudioWaveformEnvelope features, implements a dynamic thresholding followed by a fusion scheme, and finally applies BIC. The second method is a real-time one that uses a metric-based approach employing the line spectral pairs and the BIC to validate a potential speaker change point. The third method consists of three modules. In the first module, a measure based on second-order statistics is used; in the second module, the Euclidean distance and T2 Hotelling statistic are applied; and in the third module, the BIC is utilized. The experiments are carried out on a dataset created by concatenating speakers from the TIMIT database, that is referred to as the TIMIT data set. A comparison between the performance of the three systems is made based on t-statistics.
Citation:
Margarita Kotti, Luis P. M. Martins, Emmanouil Benetos, Jaime Cardoso, Constantine Kotropoulos, "Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches," icme, pp.1101-1104, 2006 IEEE International Conference on Multimedia and Expo, 2006
Usage of this product signifies your acceptance of the Terms of Use.