Acoustics, Speech, and Signal Processing, IEEE International Conference on (2009)
Taipei, Taiwan
Apr. 19, 2009 to Apr. 24, 2009
ISBN: 978-1-4244-2353-8
pp: 4697-4700
Benoit Favre, International Computer Science Institute, Berkeley, USA
Dilek Hakkani-Tur, International Computer Science Institute, Berkeley, USA
Elizabeth Shriberg, International Computer Science Institute, Berkeley, USA
ABSTRACT
Providing punctuation in speech transcripts not only improves readability but also helps downstream text processing such as information extraction or machine translation. In this paper, we introduce syntactic features inspired by the role of commas as described in linguistic studies, improving the accuracy of comma prediction in English broadcast news by 7%. We analyze the impact of those features on other subsets of features (prosody, words…) when combined through conditional random fields (CRFs). The syntactic cues can help characterize large syntactic patterns such as appositions and lists which are not necessarily marked by prosody.
CITATION

E. Shriberg, B. Favre and D. Hakkani-Tur, "Syntactically-informed models for comma prediction," Acoustics, Speech, and Signal Processing, IEEE International Conference on (ICASSP), Taipei, Taiwan, 2009, pp. 4697-4700.
doi:10.1109/ICASSP.2009.4960679