loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Eighth IEEE International Symposium on Multimedia (ISM'06)
A New Multimedia Content Skimming Method Based on Speech Emphasis Extraction and Its Application to Content Variations
San Diego, CA
December 11-December 13
ISBN: 0-7695-2746-9
Kota Hidaka, NTT East Corp., Japan
Shinya Nakajima, NTT East Corp., Japan
Yasuyuki Niihara, NTT East Corp., Japan
We propose Choco-Para, a multimedia content skimming technique; its application to a variety of content types is described. Based on automatic speech emphasis extraction, Choco-Para extracts speech attributes, prosodic parameters such as pitch, power, and speaking rate, and uses the data to estimate the degree of emphasis of each spoken phrase. By computing the degree of the emphasis curve, Choco- Para can generate a skimmed edition at an arbitrary skimming rate by selecting emphasized speech portions via dynamic threshold logic. Choco-Para uses three types of prosodic parameters and both short term and long term deviation. Experiments assess the contributions of each prosodic parameter and deviation type. They show that estimation accuracy is optimized by using both short and long term deviation with regard to pitch, power, and speaking rate. The results confirm that Choco-Para supports a wide variety of multimedia content.
Citation:
Kota Hidaka, Shinya Nakajima, Yasuyuki Niihara, "A New Multimedia Content Skimming Method Based on Speech Emphasis Extraction and Its Application to Content Variations," ism, pp.716-719, Eighth IEEE International Symposium on Multimedia (ISM'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.