loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th International Conference on Pattern Recognition (ICPR'02) - Volume 3
Constructing Speech Processing Systems on Universal Phonetic Codes Accompanied with Reference Acoustic Models
Quebec City, QC, Canada
August 11-August 15
ISBN: 0-7695-1695-X
Kazuyo Tanaka, National Institute of Advanced Industrial Science and Technology and University of Library and Information Science
Hiroaki Kojima, National Institute of Advanced Industrial Science and Technology
Nahoko Fujimura, National Institute of Advanced Industrial Science and Technology
Yoshiaki Itoh, Iwate Prefectural University
This paper proposes a novel speech processing framework, where all of the speech data are once encoded into universal phonetic code (UPC) sequences and speech processing systems, such as speech recognition, retrieval, digesting, are constructed on this UPC domain. First of all, we introduce an IPA-based sub-phonetic segment (SPS) set as the UPC to deal with multilingual speech. In the UPC (SPS) domain, each UPC accompanies a reference acoustic model which is independent of real acoustic models used in the encoding process. Processing, such as recognition, in the UPC domain is conducted based on the distance between UPC sequences estimated by using the reference acoustic models. We confirm the proposed framework by constructing a speech recognition and a vocabulary-free speech retrieval system on the SPS domain. We show several experimental results on these systems, using Japanese and English speech data sets.
Citation:
Kazuyo Tanaka, Hiroaki Kojima, Nahoko Fujimura, Yoshiaki Itoh, "Constructing Speech Processing Systems on Universal Phonetic Codes Accompanied with Reference Acoustic Models," icpr, vol. 3, pp.30728, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 3, 2002
Usage of this product signifies your acceptance of the Terms of Use.