The Community for Technology Leaders
Acoustics, Speech, and Signal Processing, IEEE International Conference on (2009)
Taipei, Taiwan
Apr. 19, 2009 to Apr. 24, 2009
ISBN: 978-1-4244-2353-8
pp: 3713-3716
Xiao Li , Microsoft Research, One Microsoft Way, Redmond, WA, 98052, U.S.A.
Patrick Nguyen , Microsoft Research, One Microsoft Way, Redmond, WA, 98052, U.S.A.
Geoffrey Zweig , Microsoft Research, One Microsoft Way, Redmond, WA, 98052, U.S.A.
Dan Bohus , Microsoft Research, One Microsoft Way, Redmond, WA, 98052, U.S.A.
ABSTRACT
A voice search system requires a speech interface that can correctly recognize spoken queries uttered by users. The recognition performance strongly relies on a robust language model. In this work, we present the use of multiple data sources, with the focus on query logs, in improving ASR language models for a voice search application. Our contributions are three folds: (1) the use of text queries from web search and mobile search in language modeling; (2) the use of web click data to predict query forms from business listing forms; and (3) the use of voice query logs in creating a positive feedback loop. Experiments show that by leveraging these resources, we can achieve recognition performance comparable to, or even better than, that of a previously deploy system where a large amount of spoken query transcripts are used in language modeling.
INDEX TERMS
CITATION

G. Zweig, Xiao Li, D. Bohus and P. Nguyen, "Leveraging multiple query logs to improve language models for spoken query recognition," Acoustics, Speech, and Signal Processing, IEEE International Conference on(ICASSP), Taipei, Taiwan, 2009, pp. 3713-3716.
doi:10.1109/ICASSP.2009.4960433
182 ms
(Ver 3.3 (11022016))