2006 IEEE International Conference on Multimedia and Expo
Evolutionary Feature Generation in Speech Emotion Recognition
Toronto, ON, Canada
July 9-12, 2006
ISBN: 1-4244-0366-7
Bjorn Schuller, Institute for Human-Machine Communication, Technische Universität München. Schuller@tum.de
Stephan Reiter, Institute for Human-Machine Communication, Technische Universität München. Reiter@tum.de
Gerhard Rigoll, Institute for Human-Machine Communication, Technische Universität München. Rigoll@tum.de
Feature sets are widely discussed in speech emotion recognition by acoustic analysis. While popular filter-based and wrapper-based search methods help to retrieve relevant features, we argue that generating features automatically allows more flexibility throughout the search. The basis is formed by dynamic Low-Level Descriptors covering intonation, intensity, formants, spectral information, and others. Next, prosodic, articulatory, and voice-quality high-level functionals are derived systematically by descriptive statistical analysis. From there, feature alterations are performed automatically to find an optimal representation in feature space with respect to a target classifier. To avoid an NP-hard exhaustive search, we suggest the use of evolutionary programming. A significant overall performance improvement over earlier work is reported on two public databases.
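The pipeline outlined in the abstract (contours of Low-Level Descriptors, then statistical functionals, then an evolutionary search over altered features) can be illustrated with a small sketch. This is not the authors' implementation. The operator set in OPS, the toy separation score in fitness, and all function names are assumptions chosen for illustration. A mutation-only loop stands in for evolutionary programming, and a real system would score candidates with the target classifier instead.

```python
import random
import statistics

def functionals(contour):
    """Map a variable-length LLD contour (e.g. pitch per frame) to fixed statistics."""
    return {
        "mean": statistics.fmean(contour),
        "std": statistics.pstdev(contour),
        "min": min(contour),
        "max": max(contour),
        "range": max(contour) - min(contour),
    }

# Hypothetical alteration operators that combine two existing features.
OPS = {
    "add": lambda a, b: a + b,
    "sub": lambda a, b: a - b,
    "mul": lambda a, b: a * b,
}

def apply_candidate(cand, row):
    """A candidate is (operator name, feature index i, feature index j)."""
    op, i, j = cand
    return OPS[op](row[i], row[j])

def fitness(cand, rows, labels):
    """Toy two-class score (assumption): squared mean gap over summed variances."""
    values = [apply_candidate(cand, r) for r in rows]
    a = [v for v, y in zip(values, labels) if y == 0]
    b = [v for v, y in zip(values, labels) if y == 1]
    gap = (statistics.fmean(a) - statistics.fmean(b)) ** 2
    return gap / (statistics.pvariance(a) + statistics.pvariance(b) + 1e-9)

def mutate(cand, n_features, rng):
    """Randomly change one part of the candidate: operator or one operand."""
    op, i, j = cand
    choice = rng.randrange(3)
    if choice == 0:
        op = rng.choice(sorted(OPS))
    elif choice == 1:
        i = rng.randrange(n_features)
    else:
        j = rng.randrange(n_features)
    return (op, i, j)

def evolve(rows, labels, generations=30, pop_size=12, seed=0):
    """Mutation-only evolutionary loop that keeps the better half each generation."""
    rng = random.Random(seed)
    n = len(rows[0])
    pop = [(rng.choice(sorted(OPS)), rng.randrange(n), rng.randrange(n))
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda c: fitness(c, rows, labels), reverse=True)
        parents = pop[: pop_size // 2]
        children = [mutate(rng.choice(parents), n, rng)
                    for _ in range(pop_size - len(parents))]
        pop = parents + children
    return max(pop, key=lambda c: fitness(c, rows, labels))

if __name__ == "__main__":
    # Toy data: each row holds two functionals per utterance; labels are two classes.
    rows = [[1.0, 5.0], [1.2, 5.1], [3.0, 5.0], [3.1, 4.9]]
    labels = [0, 0, 1, 1]
    print(functionals([110.0, 120.0, 130.0]))
    print(evolve(rows, labels))
```

Because the better half of the population survives unchanged, the best score never decreases between generations. The search visits only a sample of the candidate space, which is the point of avoiding exhaustive enumeration.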
Citation:
Bjorn Schuller, Stephan Reiter, Gerhard Rigoll, "Evolutionary Feature Generation in Speech Emotion Recognition," 2006 IEEE International Conference on Multimedia and Expo (ICME), pp. 5-8, 2006.