15th International Conference on Pattern Recognition (ICPR'00) - Volume 3
A Markov Random Field Model for Automatic Speech Recognition
Barcelona, Spain
September 3-8, 2000
ISBN: 0-7695-0750-6
Speech can be represented as a time/frequency distribution of energy using a multi-band filter bank. A Markov random field model, which takes into account possible time asynchrony across the bands, is estimated for each segmental unit to be recognized. The probability law of the speech process is given by a parametric Gibbs distribution, and a maximum likelihood parameter estimation algorithm is developed. Experiments are conducted on an isolated word recognition problem. In the mono-band case, the new model performs comparably to standard HMM techniques. In the multi-band case, modeling inter-band synchrony is shown to be a promising way to improve performance as the number of bands increases.
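As a rough illustration of the idea summarized in the abstract, the following toy sketch builds a Gibbs distribution over a small bands-by-frames lattice of binary labels. It is not the authors' model: the binary labels, the pairwise potentials, the parameter names theta_time and theta_band, and the brute-force normalization are all assumptions made for this example. It only shows how an inter-band coupling term can make configurations whose bands switch state at the same frame more probable than configurations whose bands switch at different frames.

```python
import itertools
import math


def energy(x, theta_time, theta_band):
    """Gibbs energy of a label lattice x (tuple of bands, each a tuple of frames).

    Hypothetical potentials: theta_time penalizes a label change between
    consecutive frames of one band; theta_band penalizes a label mismatch
    between adjacent bands at the same frame (inter-band asynchrony).
    """
    n_bands, n_frames = len(x), len(x[0])
    u = 0.0
    for b in range(n_bands):
        for t in range(n_frames):
            if t + 1 < n_frames and x[b][t] != x[b][t + 1]:
                u += theta_time
            if b + 1 < n_bands and x[b][t] != x[b + 1][t]:
                u += theta_band
    return u


def gibbs_distribution(n_bands, n_frames, theta_time, theta_band):
    """Exact P(x) = exp(-U(x)) / Z by enumerating every binary lattice.

    Enumeration is feasible only for tiny lattices; it stands in here for
    whatever inference scheme a real system would use.
    """
    configs = []
    for flat in itertools.product((0, 1), repeat=n_bands * n_frames):
        rows = tuple(
            tuple(flat[b * n_frames:(b + 1) * n_frames]) for b in range(n_bands)
        )
        configs.append(rows)
    weights = [math.exp(-energy(x, theta_time, theta_band)) for x in configs]
    z = sum(weights)  # partition function
    return {x: w / z for x, w in zip(configs, weights)}


dist = gibbs_distribution(2, 3, theta_time=1.0, theta_band=0.5)

# Both bands switch state at the same frame.
aligned = ((0, 0, 1), (0, 0, 1))
# The second band switches one frame earlier than the first.
lagged = ((0, 0, 1), (0, 1, 1))

print(dist[aligned] > dist[lagged])
```

In this sketch, setting theta_band to zero makes the bands independent, while larger values penalize asynchrony between bands more heavily.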
Citation:
Guillaume Gravier, Marc Sigelle, Gérard Chollet, "A Markov Random Field Model for Automatic Speech Recognition," 15th International Conference on Pattern Recognition (ICPR'00), vol. 3, p. 3258, 2000