This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2009 13th Panhellenic Conference on Informatics
Using Hybrid HMM-Based Speech Segmentation to Improve Synthetic Speech Quality
Corfu, Greece
September 10-September 12
ISBN: 978-0-7695-3788-7
The automatic phonetic time-alignment of speech databases is essential for the development cycle of a Text-to-Speech (TTS) system. Furthermore, the quality of the synthesized speech signals is strongly related to the precision of the produced alignment. In the present work we study the performance of a new HMM-based speech segmentation method. The method is based on hybrid embedded and isolated-unit trained models, and has proved to improve the phonetic segmentation accuracy in the multiple speaker task. Here it is employed on the single speaker segmentation task, utilizing a Greek-speech database. The evaluation of the method showed significant improvement in terms of phonetic segmentation accuracy as well as in the perceptual quality of synthetic speech, when compared to the baseline system.
Citation:
Iosif Mporas, Alexandros Lazaridis, Todor Ganchev, Nikos Fakotakis, "Using Hybrid HMM-Based Speech Segmentation to Improve Synthetic Speech Quality," pci, pp.118-122, 2009 13th Panhellenic Conference on Informatics, 2009
Usage of this product signifies your acceptance of the Terms of Use.