Issue No. 09 - September (1990 vol. 12)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/34.57686
<p>The problem of using a small amount of speech data to adapt a set of Gaussian HMMs (hidden Markov models) that have been trained on one speaker to recognize the speech of another is considered. The authors experimented with a phoneme-dependent spectral mapping for adapting the mean vectors of the multivariate Gaussian distributions (a method analogous to the confusion matrix method that has been used to adapt discrete HMMs), and a heuristic for estimating covariance matrices from small amounts of data. The best results were obtained by training the mean vectors individually from the adaptation data and using the heuristic to estimate distinct covariance matrices for each phoneme.</p>
Gaussian HMM speech recognition; speaker adaptation; hidden Markov models; phoneme-dependent spectral mapping; heuristic; covariance matrices; Markov processes; matrix algebra; spectral analysis; speech recognition
P. Mermelstein, P. Kenny and M. Lennig, "Speaker Adaptation in a Large-Vocabulary Gaussian HMM Recognizer," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 12, no. , pp. 917-920, 1990.