loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
7th IEEE International Conference on Computer and Information Technology (CIT 2007)
Speech and Song Search on the Web: System Design and Implementation
Aizu-Wakamatsu City, Fukushima, Japan
October 16-October 19
ISBN: 0-7695-2983-6
Yuichi Yaguchi, University of Aizu
Yoshiyuki Watanabe, University of Aizu
Keitaro Naruse, University of Aizu
Ryuichi Oka, University of Aizu
This paper proposes a novel search system for speech and song segments. The amount of accumulated video data in the World Wide Web is expanding and its content is varied. Video content includes natural voices and singing voices, and these differ in their phoneme lengths. Our system uses frame-wise phoneme recognition and Continuous Dynamic Programming (CDP). First, each target and query waveform is divided into fixed short-time frames; second, each frame of the waveform is used to estimate a phoneme label using Bayes estimation; third, the query sequences of phoneme labels are searched from target sequences by time-robustness CDP; and, finally, this system gets candidate answers. This method is robust along the time dimension, and thus has a great advantage for natural voice as well as song. This paper also introduces an implementation of this system, which is published on the Web, as a secondary search engine for Youtube data.
Citation:
Yuichi Yaguchi, Yoshiyuki Watanabe, Keitaro Naruse, Ryuichi Oka, "Speech and Song Search on the Web: System Design and Implementation," cit, pp.270-275, 7th IEEE International Conference on Computer and Information Technology (CIT 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.