Acoustics, Speech, and Signal Processing, IEEE International Conference on (2000)
June 5, 2000 to June 9, 2000
M. Lieb , Philips GmbH Forschungslab., Aachen, Germany
Amongst several data driven approaches for designing filters for the time sequence of spectral parameters, the linear discriminant analysis (LDA) based method has been proposed for automatic speech recognition. Here we apply LDA-based filter design to cepstral features, which better match the inherent assumption of this method that feature vector components are uncorrelated. Extensive recognition experiments have been conducted both on the standard TIMIT phone recognition task and on a proprietary 130-words command word task under various adverse environmental conditions, including reverberant data with real-life room impulse responses and data processed by acoustic echo cancellation algorithms. Significant error rate reductions have been achieved when applying the novel long-range feature filters compared to standard approaches employing cepstral mean normalization and delta and delta-delta features, in particular when facing acoustic echo cancellation scenarios and room reverberation. For example, the phone accuracy on reverberated TIMIT data could be increased from 50.7% to 56.0%.
R. Haeb-Umbach and M. Lieb, "LDA derived cepstral trajectory filters in adverse environmental conditions," Acoustics, Speech, and Signal Processing, IEEE International Conference on(ICASSP), Istanbul, Turkey, 2000, pp. II1105-II1108.