2005 IEEE International Conference on Multimedia and Expo
Extracting vocal melody from karaoke music audio
Amsterdam, Netherlands
July 06-July 06
ISBN: 0-7803-9331-7
Y. Zhu, Inst. for Infocomm Res., A-STAR, Singapore, Singapore
S. Gao, Inst. for Infocomm Res., A-STAR, Singapore, Singapore
Extracting the melody from polyphonic musical audio is a nontrivial research problem. This paper presents an approach for extracting the vocal melody from dual-channel Karaoke music audio. The extracted melody corresponds to the singing voice in the original performance channel and can then be used for melody-based music retrieval. The proposed technique analyzes the audio signals from both the accompaniment channel and the original performance channel. First, the note partials are extracted from each signal, which is represented in the constant-Q transform frequency domain. Next, the volume balance between the two channels is estimated by signal approximation in the sub-bands. Finally, the pitch corresponding to the singing voice is identified from the note partial differences between the two channels. The extracted melody is represented as a sequence of pitch values. The technique assumes that, apart from the singing voice, the two channels contain similar accompaniment instrument performances. In experiments on 40 Karaoke music recordings, the pitch extraction rate is above 70%, and the melody retrieval accuracy on an 800-tune database is 90%.
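The last two steps of the abstract (volume balance estimation and pitch selection from channel differences) can be illustrated for a single analysis frame. The following is a minimal sketch under stated assumptions, not the authors' implementation: it assumes the constant-Q magnitudes of both channels are already available as plain lists, and the function names, the sub-band size, the least-squares gain, the median across sub-bands, and the MIDI mapping are all hypothetical choices made for illustration.

```python
import statistics


def estimate_balance(original, accompaniment, band_size=4):
    """Estimate the gain g such that original ~= g * accompaniment.

    Assumption: one least-squares gain is fitted per sub-band and the
    median is taken, so sub-bands dominated by the voice have less
    influence on the estimate.
    """
    gains = []
    for start in range(0, len(original), band_size):
        o = original[start:start + band_size]
        a = accompaniment[start:start + band_size]
        den = sum(x * x for x in a)
        if den > 0:
            gains.append(sum(x * y for x, y in zip(o, a)) / den)
    # Fall back to unit gain when the accompaniment frame is silent.
    return statistics.median(gains) if gains else 1.0


def vocal_pitch_bin(original, accompaniment, band_size=4):
    """Return (bin index, residual magnitude) of the strongest partial
    that is present in the original channel but not explained by the
    gain-matched accompaniment channel."""
    g = estimate_balance(original, accompaniment, band_size)
    residual = [max(o - g * a, 0.0) for o, a in zip(original, accompaniment)]
    best = max(range(len(residual)), key=residual.__getitem__)
    return best, residual[best]


def bin_to_midi(bin_index, lowest_midi=36, bins_per_semitone=1):
    """Map a constant-Q bin to a MIDI note number (hypothetical layout)."""
    return lowest_midi + bin_index / bins_per_semitone


# Toy frame: the original channel is the accompaniment at half volume
# plus one extra partial (the "voice") in bin 3.
accompaniment = [1.0, 0.5, 2.0, 0.0, 1.0, 0.5, 2.0, 1.0]
original = [0.5 * a for a in accompaniment]
original[3] += 3.0

bin_index, strength = vocal_pitch_bin(original, accompaniment)
midi_note = bin_to_midi(bin_index)
```

Running the selection frame by frame would give a sequence of pitch values, which is the melody representation the abstract describes. A real system would also need the note partial extraction step and some handling of frames in which no voice is present, both of which are omitted here.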
Index Terms:
pitch extraction rate, vocal melody extraction, polyphonic music audio, dual channel Karaoke system, melody-based music retrieval, audio signal, constant-Q transform, frequency domain analysis, subband signal approximation
Citation:
Y. Zhu, S. Gao, "Extracting vocal melody from karaoke music audio," 2005 IEEE International Conference on Multimedia and Expo (ICME), 4 pp., 2005