The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.09 - September (2008 vol.20)
pp: 1168-1180
Hsinchun Chen , University of Arizona, Tucson
Sven Thoms , University of Arizona, Tucson
Ahmed Abbasi , University of Arizona, Tucson
ABSTRACT
Analysis of affective intensities in computer mediated communication is important in order to allow a better understanding of online users? emotions and preferences. Despite considerable research on textual affect classification, it is unclear which features and techniques are most effective. In this study we compared several feature representations for affect analysis, including learned n-grams and various automatically and manually crafted affect lexicons. We also proposed the support vector regression correlation ensemble (SVRCE) method for enhanced classification of affect intensities. SVRCE uses an ensemble of classifiers each trained using a feature subset tailored towards classifying a single affect class. The ensemble is combined with affect correlation information to enable better prediction of emotive intensities. Experiments were conducted on four test beds encompassing web forums, blogs, and online stories. The results revealed that learned n-grams were more effective than lexicon based affect representations. The findings also indicated that SVRCE outperformed comparison techniques, including Pace regression, semantic orientation, and WordNet models. Ablation testing showed that the improved performance of SVRCE was attributable to its use of feature ensembles as well as affect correlation information. A brief case study was conducted to illustrate the utility of the features and techniques for affect analysis of large archives of online discourse.
INDEX TERMS
Text mining, Discourse, Machine learning, Linguistic processing
CITATION
Hsinchun Chen, Sven Thoms, Ahmed Abbasi, "Affect Analysis of Web Forums and Blogs Using Correlation Ensembles", IEEE Transactions on Knowledge & Data Engineering, vol.20, no. 9, pp. 1168-1180, September 2008, doi:10.1109/TKDE.2008.51
REFERENCES
[1] A. Abbasi, H. Chen, and A. Salem, “Sentiment Analysis in Multiple Languages: Feature Selection for Opinion Classification in Web Forums,” ACM Trans. Information Systems, vol. 26, no. 3,article 12, July 2008.
[2] S. Argamon, C. Whitelaw, P. Chase, S.R. Hota, N. Garg, and S. Levitan, “Stylistic Text Classification Using Functional Lexical Features,” J. Am. Soc. for Information Science and Technology, vol. 58, no. 6, pp. 802-822, 2007.
[3] Z. Chuang and C. Wu, “Multi-Modal Emotion Recognition from Speech and Text,” Computational Linguistics and Chinese Language Processing, vol. 9, no. 2, pp. 45-62, 2004.
[4] K.J. Cherkauer, “Human Expert-Level Performance on a Scientific Image Analysis Task by a System Using Combined Artificial Neural Networks,” Working Notes of the AAAI Workshop Integrating Multiple Learned Models, P. Chan, ed., pp. 15-21, 1996.
[5] Y.H. Cho and K.J. Lee, “Automatic Affect Recognition Using Natural Language Processing Techniques and Manually Built Affect Lexicon,” IEICE Trans. Information Systems, vol. E89, no. 12, pp. 2964-2971, 2006.
[6] T.G. Dietterich, “Ensemble Methods in Machine Learning,” Proc. First Int'l Workshop Multiple Classifier Systems (MCS '00), pp. 1-15, 2000.
[7] J. Donath, K. Karahalio, and F. Viegas, “Visualizing Conversation,” Proc. 32nd Conf. Computer-Human Interaction (CHI), 1999.
[8] WordNet: An Electronic Lexical Database. MIT Press, 1998.
[9] G. Grefenstette, Y. Qu, D.A. Evans, and J.G. Shanahan, “Validating the Coverage of Lexical Resources for Affect Analysis and Automatically Classifying New Words Along Semantic Axes,” Proc. AAAI Spring Symp. Exploring Attitude and Affect in Text: Theories and Applications (AAAI-EAAT '04), Y. Qu, J. Shanahan, and J. Wiebe, eds., pp. 71-78, 2004.
[10] G. Grefenstette, Y. Qu, J.G. Shanahan, and D.A. Evans, “Coupling Niche Browsers and Affect Analysis for an Opinion Mining Application,” Proc. 12th Int'l Conf. Recherche d'Information Assistee par Ordinateur (RIAO '04), pp. 186-194, 2004.
[11] M.A. Hearst, “Direction-Based Text Interpretation as an Information Access Refinement,” Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval, P.Jacobs, ed., Lawrence Erlbaum Assoc., 1992.
[12] M. Jiang, E. Jensen, S. Beitzel, and S. Argamon, “Choosing the Right Bigrams for Information Retrieval,” Proc. Meeting of the Int'l Federation of Classification Soc., 2004.
[13] S. Kim and E. Hovy, “Determining the Sentiment of Opinions,” Proc. 20th Int'l Conf. Computational Linguistics (COLING '04), pp.1367-1373, 2004.
[14] H. Liu, H. Lieberman, and T. Selker, “A Model of Textual Affect Sensing Using Real-World Knowledge,” Proc. Eighth Int'l Conf. Intelligent User Interfaces, 2003.
[15] C. Ma, H. Prendinger, and M. Ishizuka, “Emotion Estimation and Reasoning Based on Affective Textual Interaction,” Proc. First Int'l Conf. Affective Computing and Intelligent Interaction (ACII '05), pp.622-628, 2005.
[16] G. Mishne, “Experiments with Mood Classification,” Proc. First Workshop Stylistic Analysis of Text for Information Access Workshop (Style), 2005.
[17] G. Mishne and M. de Rijke, “Capturing Global Mood Levels Using Blog Posts,” Proc. AAAI Spring Symp. Computational Approaches to Analysing Weblogs (AAAI-CAAW), 2006.
[18] K. Nigam and M. Hurst, “Towards a Robust Metric of Opinion,” Proc. AAAI Spring Symp. Exploring Attitude and Affect in Text (AAAI-EAAT), 2004.
[19] B. Pang, L. Lee, and S. Vaithyanathain, “Thumbs Up? Sentiment Classification Using Machine Learning Techniques,” Proc. Empirical Methods in Natural Language Processing (EMNLP '02), pp. 79-86, 2002.
[20] B. Pang and L. Lee, “Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales,” Proc. Ann. Meeting on Assoc. for Computational Linguistics (ACL '05), pp.115-124, 2005.
[21] R.W. Picard, Affective Computing. MIT Press, 1997.
[22] R.W. Picard, E. Vyzas, and J. Healey, “Toward Machine Emotional Intelligence: Analysis of Affective Physiological State,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 10, pp. 1179-1191, Oct. 2001.
[23] J. Read, “Recognizing Affect in Text Using Point-Wise Mutual Information,” master's thesis, 2004.
[24] R. Schumaker and H. Chen, “Textual Analysis of Stock Market Prediction Using Financial News Articles,” Proc. 11th Americas Conf. Information System (AMCIS), 2006.
[25] E. Stamatatos and G. Widmer, “Music Performer Recognition Using an Ensemble of Simple Classifiers,” Proc. 15th European Conf. Artificial Intelligence (ECAI), 2002.
[26] P. Subasic and A. Huettner, “Affect Analysis of Text Using Fuzzy Semantic Typing,” IEEE Trans. Fuzzy Systems, vol. 9, no. 4, pp. 483-496, 2001.
[27] P.D. Turney and M.L. Littman, “Measuring Praise and Criticism: Inference of Semantic Orientation from Association,” ACM Trans. Information Systems, vol. 21, no. 4, pp. 315-346, 2003.
[28] A. Valitutti, C. Strapparava, and O. Stock, “Developing Affective Lexical Resources,” PsychNology J., vol. 2, no. 1, pp. 61-83, 2004.
[29] J. Wiebe, “Tracking Point of View in Narrative,” Computational Linguistics, vol. 20, no. 2, pp. 233-287, 1994.
[30] J. Wiebe, T. Wilson, R. Bruce, M. Bell, and M. Martin, “Learning Subjective Language,” Computational Linguistics, vol. 30, no. 3, pp.277-308, 2004.
[31] I.H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, second ed. Morgan Kauffman, 2005.
[32] C. Wu, Z. Chuang, and Y. Lin, “Emotion Recognition from Text Using Semantic Labels and Separable Mixture Models,” ACM Trans. Asian Language Information Processing, vol. 5, no. 2, pp. 165-182, 2006.
[33] Y. Yang and J.O. Pederson, “A Comparative Study on Feature Selection in Text Categorization,” Proc. 14th Int'l Conf. Machine Learning (ICML '97), pp. 412-420, 1997.
[34] G. Mishne and N. Glance, “Predicting Movie Sales from Blogger Sentiment,” Proc. AAAI Spring Symp. Computational Approaches to Analysing Weblogs (AAAI-CAAW), 2006.
[35] V. Vapnik, The Nature of Statistical Learning Theory. Springer, 1995.
[36] A. Webb, Statistical Pattern Recognition. John Wiley & Sons, 2002.
[37] K.R. Muller, A.J. Smola, G. Ratsch, B. Scholkopf, J. Kohlmorgen, and V. Vapnik, “Predicting Time Series with Support Vector Machines,” Proc. 15th Int'l Joint Conf. Artificial Intelligence (IJCAI), 1997.
[38] A.L. Prodromidis and S.J. Stolfo, “A Comparative Evaluation of Meta-Learning Strategies over Large and Distributed Data Sets,” Proc. 16th Int'l Conf. Machine Learning (ICML '99), pp. 18-27, 1999.
[39] K.M. Ting and I.H. Witten, “Stacked Generalization: When Does It Work?” Proc. 15th Int'l Joint Conf. Artificial Intelligence (IJCAI), 1997.
18 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool