Acoustics, Speech, and Signal Processing, IEEE International Conference on (2000)
June 5, 2000 to June 9, 2000
G. Zweig, IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale speech recognition tasks with hundreds of thousands of Gaussians and tens of millions of training frames. We report small but consistent improvements in both frame recognition accuracy and word error rate.
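For readers unfamiliar with the classic AdaBoost algorithm named in the abstract, the following is a minimal sketch of its standard binary form: each round fits a weak classifier on weighted data, then up-weights the examples that classifier got wrong. It is not the parallel, hierarchical, or restricted variants developed in the paper, and it uses threshold stumps rather than Gaussian mixtures. The function names (`train_stump`, `adaboost`, `predict`) and the ten-point toy data set are hypothetical, chosen only for illustration.

```python
import math

def stump_predict(x, thr, pol):
    """Threshold stump: returns pol if x >= thr, otherwise -pol."""
    return pol if x >= thr else -pol

def train_stump(xs, ys, w):
    """Return (weighted_error, thr, pol) for the stump with the lowest weighted error."""
    best = None
    for thr in sorted(set(xs)):
        for pol in (1, -1):
            err = sum(wi for wi, x, y in zip(w, xs, ys)
                      if stump_predict(x, thr, pol) != y)
            if best is None or err < best[0]:
                best = (err, thr, pol)
    return best

def adaboost(xs, ys, rounds):
    """Classic binary AdaBoost with labels in {-1, +1}; returns [(alpha, thr, pol), ...]."""
    n = len(xs)
    w = [1.0 / n] * n  # start from uniform example weights
    ensemble = []
    for _ in range(rounds):
        err, thr, pol = train_stump(xs, ys, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)  # vote weight of this stump
        ensemble.append((alpha, thr, pol))
        # Misclassified examples (y * h(x) = -1) are scaled up, correct ones down.
        w = [wi * math.exp(-alpha * y * stump_predict(x, thr, pol))
             for wi, x, y in zip(w, xs, ys)]
        total = sum(w)
        w = [wi / total for wi in w]  # renormalize to a distribution
    return ensemble

def predict(ensemble, x):
    """Sign of the alpha-weighted vote over all stumps."""
    score = sum(alpha * stump_predict(x, thr, pol) for alpha, thr, pol in ensemble)
    return 1 if score >= 0 else -1

# Hypothetical toy data: no single stump separates these labels.
xs = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
ys = [1, 1, 1, -1, -1, -1, 1, 1, 1, -1]
ensemble = adaboost(xs, ys, rounds=3)
training_errors = sum(predict(ensemble, x) != y for x, y in zip(xs, ys))
```

On this toy set the best single stump misclassifies three of the ten points, while the three-round ensemble classifies all ten correctly. This illustrates the reweighting mechanism only and says nothing about the accuracy gains reported in the paper.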
G. Zweig and M. Padmanabhan, "Boosting Gaussian mixtures in an LVCSR system," Acoustics, Speech, and Signal Processing, IEEE International Conference on (ICASSP), Istanbul, Turkey, 2000, pp. 1527-1530.