Issue No.09 - September (2008 vol.20)
Hsinchun Chen , University of Arizona, Tucson
Sven Thoms , University of Arizona, Tucson
Ahmed Abbasi , University of Arizona, Tucson
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2008.51
Analysis of affective intensities in computer mediated communication is important in order to allow a better understanding of online users? emotions and preferences. Despite considerable research on textual affect classification, it is unclear which features and techniques are most effective. In this study we compared several feature representations for affect analysis, including learned n-grams and various automatically and manually crafted affect lexicons. We also proposed the support vector regression correlation ensemble (SVRCE) method for enhanced classification of affect intensities. SVRCE uses an ensemble of classifiers each trained using a feature subset tailored towards classifying a single affect class. The ensemble is combined with affect correlation information to enable better prediction of emotive intensities. Experiments were conducted on four test beds encompassing web forums, blogs, and online stories. The results revealed that learned n-grams were more effective than lexicon based affect representations. The findings also indicated that SVRCE outperformed comparison techniques, including Pace regression, semantic orientation, and WordNet models. Ablation testing showed that the improved performance of SVRCE was attributable to its use of feature ensembles as well as affect correlation information. A brief case study was conducted to illustrate the utility of the features and techniques for affect analysis of large archives of online discourse.
Text mining, Discourse, Machine learning, Linguistic processing
Hsinchun Chen, Sven Thoms, Ahmed Abbasi, "Affect Analysis of Web Forums and Blogs Using Correlation Ensembles", IEEE Transactions on Knowledge & Data Engineering, vol.20, no. 9, pp. 1168-1180, September 2008, doi:10.1109/TKDE.2008.51