16th International Conference on Pattern Recognition (ICPR'02) - Volume 4 Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition Quebec City, QC, Canada August 11-August 15 ISBN: 0-7695-1695-X
In standard automatic speech recognition (ASR), hidden Markov models (HMMs) calculate their emission probabilities by an artificial neural network (ANN) or a Gaussian distribution conditioned only upon the hidden state variable. Recent work [12 ] showed the benefit of conditioning the emission distributions also upon a discrete auxiliary variable, which is observed in training and hidden in recognition. Related work [3 ] has shown the utility of conditioning the emission distributions on a continuous auxiliary variable. We apply mixed Bayesian networks (BNs) to extend these works by introducing a continuous auxiliary variable that is observed in training but is hidden in recognition. We find that an auxiliary pitch variable conditioned itself upon the hidden state can degrade performance unless the auxiliary variable is also hidden. The performance, furthermore, can be improved by making the auxiliary pitch variable independent of the hidden state.
Citation:
Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard, "Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition," icpr, vol. 4, pp.40293, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 4, 2002 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||