Multimedia and Ubiquitous Engineering, International Conference on (2007)
Seoul, Korea
Apr. 26, 2007 to Apr. 28, 2007
ISBN: 0-7695-2777-9
pp: 366-371
Byeong-jun Han, Korea University, Seoul, Korea
Seungmin Rho, Ajou University, Suwon, Korea
Eenjun Hwang, Korea University, Seoul, Korea
In this paper, we propose a new scheme that automatically transcribes sung or hummed queries into a sequence of pitch and duration pairs for efficient music retrieval. More specifically, we present two novel methods for acoustic signals: WAE (Windowed Average Energy) for note segmentation, and a dynamic threshold method for ADF onsets for onset/offset detection. The former improves on previous energy-based approaches such as AE by defining small but coherent windows with local and global threshold values. The latter improves on the traditional global/local threshold method. Through various experiments on our prototype music retrieval system, we show the effectiveness of the proposed scheme.
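
The abstract does not give the WAE formulas, so the following is only a minimal sketch of the general idea of energy-based note segmentation with a global and a local threshold; it is not the authors' method. The function names, the non-overlapping windows, and the parameters `global_ratio`, `local_ratio`, and `context` are all assumptions made for illustration.

```python
from typing import List, Tuple


def windowed_average_energy(samples: List[float], window: int) -> List[float]:
    """Mean squared amplitude of each non-overlapping window of `window` samples."""
    return [
        sum(x * x for x in samples[i:i + window]) / window
        for i in range(0, len(samples) - window + 1, window)
    ]


def segment_notes(
    energy: List[float],
    global_ratio: float = 0.1,  # hypothetical: fraction of the peak energy
    local_ratio: float = 0.5,   # hypothetical: fraction of the neighbourhood mean
    context: int = 2,           # hypothetical: windows on each side for the local mean
) -> List[Tuple[int, int]]:
    """Return (onset, offset) window indices; the offset index is exclusive."""
    if not energy:
        return []
    global_threshold = global_ratio * max(energy)

    # A window counts as active only if it exceeds both thresholds.
    active = []
    for i, e in enumerate(energy):
        lo, hi = max(0, i - context), min(len(energy), i + context + 1)
        local_threshold = local_ratio * sum(energy[lo:hi]) / (hi - lo)
        active.append(e > global_threshold and e > local_threshold)

    # Group consecutive active windows into (onset, offset) segments.
    segments, start = [], None
    for i, is_active in enumerate(active):
        if is_active and start is None:
            start = i
        elif not is_active and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(active)))
    return segments
```

In this sketch, a signal is first reduced to per-window energies with `windowed_average_energy`, and `segment_notes` then reports each run of active windows as one candidate note. Pitch estimation and the conversion of window indices to durations are left out.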

E. Hwang, S. Rho and B. Han, "An Efficient Voice Transcription Scheme for Music Retrieval," 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07), Seoul, 2007, pp. 366-371.