loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
International Conference on Computational Inteligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce (CIMCA'06)
Optimal Short-Time Features for Music/Speech Classification of Compressed Audio Data
Sydney Australia
November 28-December 01
ISBN: 0-7695-2731-0
A. Rizzi, University of Rome
M. Buccino, University of Rome
M. Panella, University of Rome
A. Uncini, University of Rome
This paper deals with the Music/Speech classification problem, starting from a set of features extracted directly from compressed audio data. The proposed classification system is able to label audio sequences stored as compressed MPEG layer III files. Decoding and analyzing in a unique stage is a fundamental tool for audio streaming applications, such as real time classification. Moreover, the techniques described herein provide useful tools in the management (data tagging, summarization, etc.) of a digital music library. The adopted set of short-time features are computed from the spectral information available in the decoding stage. In this paper, we show that for the classification problem at hand this set of features is redundant and can be dramatically pruned. To this aim we used an optimization strategy based on principal component analysis and genetic algorithms. The results show a very interesting classification accuracy using just one short-time feature.
Citation:
A. Rizzi, M. Buccino, M. Panella, A. Uncini, "Optimal Short-Time Features for Music/Speech Classification of Compressed Audio Data," cimca, pp.210, International Conference on Computational Inteligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce (CIMCA'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.