2001 IEEE International Conference on Multimedia and Expo (ICME'01)
CLASSIFYING MOTION PICTURE SOUNDTRACK FOR VIDEO INDEXING
Tokyo, Japan
August 22-August 25
ISBN: 0-7695-1198-8
We investigate a method for classification of patterns with temporal support. This method combines the ability of a nonlinear-kernel based classifier (in the form of a support vector machine) to discriminate and the ability of a first order Markov chain to model temporal transitions. We apply this to the task of classifying motion picture soundtrack. Experiments with classification of the soundtrack into speech and non-speech audio patterns reveal improvement in classification performance using this proposed method over HMM-based classification as well as SVM-based classification. Using a normalized margin obtained from the SVM and mapping it to a non-negative confidence measure bounded by 1, we attempt to alter the classification of patterns close to the separating boundary, by using the constraints on the transition between the two classes. Sound track classification with semantic classes can help browse and index a video efficiently.
Citation:
Milind R. Naphade, Roy Wang, Thomas S. Huang, "CLASSIFYING MOTION PICTURE SOUNDTRACK FOR VIDEO INDEXING," icme, pp.242, 2001 IEEE International Conference on Multimedia and Expo (ICME'01), 2001