2006 IEEE International Conference on Multimedia and Expo
Musical Signal Type Discrimination based on Large Open Feature Sets
Toronto, ON, Canada
July 09-July 12
ISBN: 1-4244-0366-7
Bjorn Schuller, Institute for Human-Machine Communication, Technische Universit?t M?nchen, Germany. Schuller@tum.de
Frank Wallhoff, Institute for Human-Machine Communication, Technische Universit?t M?nchen, Germany. Wallhoff@tum.de
Dejan Arsic, Institute for Human-Machine Communication, Technische Universit?t M?nchen, Germany. Arsic@tum.de
Gerhard Rigoll, Institute for Human-Machine Communication, Technische Universit?t M?nchen, Germany. Rigoll@tum.de
Automatic discrimination of musical signal types as speech, singing, music, genres or drumbeats within audio streams is of great importance e. g. for radio broadcast stream segmentation. Yet, feature sets are largely discussed. We therefore suggest a large open feature set approach starting with systematical generation of 7k hi-level features based on MPEG-7 Low-Level-Descriptors and further feature contours. A subsequent fast Gain Ratio reduction followed by wrapper-based Floating Search leads to a strong basis of relevant features. Next, features are added by alteration and combination within genetic search. For classification we use Support-Vector-Machines proven reliable for this task. Test-runs are carried out on two task-specific databases and the public Columbia SMD database and show significant improvements for each step of the suggested novel concept.
Citation:
Bjorn Schuller, Frank Wallhoff, Dejan Arsic, Gerhard Rigoll, "Musical Signal Type Discrimination based on Large Open Feature Sets," icme, pp.1089-1092, 2006 IEEE International Conference on Multimedia and Expo, 2006