18th International Conference on Pattern Recognition (ICPR'06) Volume 4 Efficient Gaussian Mixture for Speech Recognition Hong Kong August 20-August 24 ISBN: 0-7695-2521-0
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICPR.2006.475
This article presents a clustering algorithm to determine the optimal number of components in a Gaussian mixture. The principle is to start from an important number of mixture components then group the multivariate normal distributions into clusters using the divergence, a weighted symmetric, distortion measure based on the Kullback-Leibler distance. The optimal cut in the tree, i.e. the clustering, satisfies criteria based on either the minimum amount of available training data or dissimilarities between clusters. The performance of this algorithm is compared favorably against a reference system and a likelihood loss based clustering system. The tree cutting criteria are also discussed. About an hour of Ester, a French broadcast News database is used for the recognition experiments. Performance are significantly improved and the word error rate decreases by about 4.8%, where the confidence interval is 1%.
Citation:
Leila , Gerard Chollet, "Efficient Gaussian Mixture for Speech Recognition," icpr, vol. 4, pp.294-297, 18th International Conference on Pattern Recognition (ICPR'06) Volume 4, 2006 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||