CSDL Home IEEE Transactions on Pattern Analysis & Machine Intelligence 2010 vol.32 Issue No.07 - July
Classification of Complex Information: Inference of Co-Occurring Affective States from Their Expressions in Speech
Issue No.07 - July (2010 vol.32)
Tal Sobol-Shikler , Ben-Gurion University of the Negev, Beer-Sheva
Peter Robinson , University of Cambridge, Cambridge
We present a classification algorithm for inferring affective states (emotions, mental states, attitudes, and the like) from their nonverbal expressions in speech. It is based on the observations that affective states can occur simultaneously and different sets of vocal features, such as intonation and speech rate, distinguish between nonverbal expressions of different affective states. The input to the inference system was a large set of vocal features and metrics that were extracted from each utterance. The classification algorithm conducted independent pairwise comparisons between nine affective-state groups. The classifier used various subsets of metrics of the vocal features and various classification algorithms for different pairs of affective-state groups. Average classification accuracy of the 36 pairwise machines was 75 percent, using 10-fold cross validation. The comparison results were consolidated into a single ranked list of the nine affective-state groups. This list was the output of the system and represented the inferred combination of co-occurring affective states for the analyzed utterance. The inference accuracy of the combined machine was 83 percent. The system automatically characterized over 500 affective state concepts from the Mind Reading database. The inference of co-occurring affective states was validated by comparing the inferred combinations to the lexical definitions of the labels of the analyzed sentences. The distinguishing capabilities of the system were comparable to human performance.
Affective computing, human perception, cognition, affective states, emotions, speech, machine learning, intelligent systems, multiclass, multilabel.
Tal Sobol-Shikler, Peter Robinson, "Classification of Complex Information: Inference of Co-Occurring Affective States from Their Expressions in Speech", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.32, no. 7, pp. 1284-1297, July 2010, doi:10.1109/TPAMI.2009.107