21st International Conference on Data Engineering Workshops (ICDEW'05)
Improved Noise Spectra Estimation and Log-spectral Regression for In-car Speech Recognition
Tokyo, Japan
April 05-April 08
ISBN: 0-7695-2657-8
DOI Bookmark:
http://doi.ieeecomputersociety.org/10.1109/ICDE.2005.229
In this paper, we present a two-stage noise spectra estimation approach. After the first-stage noise estimation using the improved minima controlled recursive averaging (IMCRA) method, the second-stage noise estimation is performed by employing a maximum a posteriori (MAP) noise amplitude estimator. We also develop a regression-based speech enhance system by approximating the clean speech with the estimated noise and original noisy speech. Evaluation experiments show that the proposed two-stage noise estimation method results in lower estimation error for all test noise types. Compared to original noisy speech, the proposed regression-based approach obtains an average relative word error rate (WER) reduction of 65% in our isolated word recognition experiments conducted in 12 real car environments.
Citation:
Weifeng Li, Katunobu Itou, Kazuya Takeda, Fumitada Itakura, "Improved Noise Spectra Estimation and Log-spectral Regression for In-car Speech Recognition," icdew, pp.1206, 21st International Conference on Data Engineering Workshops (ICDEW'05), 2005
Usage of this product signifies your acceptance of the
Terms of Use.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||