loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
10th International Multimedia Modelling Conference
Audio Segmentation and Classification based on a Selective Analysis Scheme
Brisbane, Australia
January 05-January 07
ISBN: 0-7695-2084-7
Shahrokh Ghaemmaghami, Sharif University of Technology, Tehran, Iran
This paper addresses a new approach to segmentation and classification of audio through analysis of a smaller set of selective frames, which are identified by temporal decomposition (TD). These frames are located at the most steady instants, or event centroids, within a given block of the signal, which yield the maximal diversity over the set of selected features. Based on this selection scheme, the number of frames used in the analysis is reduced by at least 40%, while the temporal resolution is doubled as compared to that in typical audio classifiers. By constructing a classification system to segment audio into speech, music, speech-music, and others, it is shown that the proposed method outperforms the typical classifiers in most cases. In addition, by using hierarchical TD for frame selection, it is made possible to adapt the audio classifier with other segmentation schemes, e.g., visual classification based on motion picture analysis, for accurate audio-visual segmentation of multimedia data.
Citation:
Shahrokh Ghaemmaghami, "Audio Segmentation and Classification based on a Selective Analysis Scheme," mmm, pp.42, 10th International Multimedia Modelling Conference, 2004
Usage of this product signifies your acceptance of the Terms of Use.