The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - April-June (2013 vol.4)
pp: 211-225
Carmen Banea , University of North Texas, Denton
Rada Mihalcea , University of North Texas, Denton
Janyce Wiebe , University of Pittsburgh, Pittsburgh
ABSTRACT
Subjectivity analysis focuses on the automatic extraction of private states in natural language. In this paper, we explore methods for generating subjectivity analysis resources in a new language by leveraging on the tools and resources available in English. Given a bridge between English and the selected target language (e.g., a bilingual dictionary or a parallel corpus), the methods can be used to rapidly create tools for subjectivity analysis in the new language.
INDEX TERMS
Dictionaries, Availability, Natural language processing, Manuals, Semantics, Tagging, Engines, multilingual subjectivity, Natural language processing, text analysis, sentiment and subjectivity analysis
CITATION
Carmen Banea, Rada Mihalcea, Janyce Wiebe, "Porting Multilingual Subjectivity Resources across Languages", IEEE Transactions on Affective Computing, vol.4, no. 2, pp. 211-225, April-June 2013, doi:10.1109/T-AFFC.2013.1
REFERENCES
[1] C.O. Alm, D. Roth, and R. Sproat, "Emotions from Text: Machine Learning for Text-Based Emotion Prediction," Proc. Conf. Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP '05), pp. 579-586, 2005.
[2] L. Lloyd, D. Kechagias, and S. Skiena, "Lydia: A System for Large-Scale News Analysis," Proc. 12th Int'l Conf. String Processing and Information Retrieval, pp. 161-166, 2005.
[3] K. Balog, G. Mishne, and M.D. Rijke, "Why Are They Excited? Identifying and Explaining Spikes in Blog Mood Levels," Proc. 11th Conf. European Chapter of the Assoc. Computational Linguistics: Posters and Demonstrations (EACL '06), 2006.
[4] M. Hu and B. Liu, "Mining and Summarizing Customer Reviews," Proc. ACM 10th SIGKDD Int'l Conf. Knowledge Discovery and Data Mining (SIGKDD '04), pp. 168-177, 2004.
[5] H. Yu and V. Hatzivassiloglou, "Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentence," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP '03), pp. 129-136, 2003.
[6] G. Carenini, R.T. Ng, and X. Zhou, "Summarizing Emails with Conversational Cohesion and Subjectivity," Proc. 46th Ann. Meeting Assoc. Computational Linguistics: Human Language Technologies (ACL-HLT '08), pp. 353-361, 2008.
[7] N. Kobayashi, K. Inui, K. Tateishi, and T. Fukushima, "Collecting Evaluative Expressions for Opinion Extraction," Proc. First Int'l Joint Conf. Natural Language Processing (IJCNLP '04), pp. 596-605, 2004.
[8] H. Takamura, T. Inui, and M. Okumura, "Latent Variable Models for Semantic Orientations of Phrases," Proc. Conf. European Chapter of the Assoc. Computational Linguistics (EACL '06), pp. 201-208, 2006.
[9] H. Kanayama and T. Nasukawa, "Fully Automatic Lexicon Expansion for Domain-Oriented Sentiment Analysis," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP '06), pp. 355-363, 2006.
[10] Y. Hu, J. Duan, X. Chen, B. Pei, and R. Lu, "A New Method for Sentiment Classification in Text Retrieval," Proc. Second Int'l Joint Conf. Natural Language Processing (IJCNLP '05), pp. 1-9, 2005.
[11] S.-M. Kim and E. Hovy, "Identifying and Analyzing Judgment Opinions," Proc. Main Conf. Human Language Technology Conf. North Am. Chapter of the Assoc. Computational Linguistics (HLT-NAACL '06), pp. 200-207, 2006.
[12] Y. Seki, D. Evans, L. Ku, H. Chen, N. Kando, and C. Lin, "Overview of Opinion Analysis Pilot Task at NTCIR-6," Proc. Sixth NTCIR Workshop Meeting Evaluation of Information Access Technologies: Information Retrieval, Question Answering, and Cross-Lingual Information Access, pp. 265-278, 2007.
[13] C. Banea, R. Mihalcea, and J. Wiebe, "Multilingual Subjectivity: Are More Languages Better?" Proc. 23rd Int'l Conf. Computational Linguistics (COLING '10), pp. 28-36, 2010.
[14] S. Hassan, C. Banea, and R. Mihalcea, "Measuring Semantic Relatedness Using Multilingual Representations," Proc. Workshop Semantic Evaluations (SemEval '12), 2012.
[15] W. Wei, J. Gulla, and Z. Fu, "Enhancing Negation-Aware Sentiment Classification on Product Reviews via Multi-Unigram Feature Generation," Proc. Sixth Int'l Conf. Advanced Intelligent Computing Theories and Applications, vol. 6215/2010, pp. 380-391, 2010.
[16] X. Wan, "Co-Training for Cross-Lingual Sentiment Classification," Proc. Joint Conf. 47th Ann. Meeting ACL and Fourth Int'l Joint Conf. Natural Language Processing of the AFNLP (ACL-IJCNLP '09), vol. 1, pp. 235-243, 2009.
[17] J. Wiebe, T. Wilson, and C. Cardie, "Annotating Expressions of Opinions and Emotions in Language," Language Resources and Evaluation, vol. 39, no. 2/3, pp. 165-210, 2005.
[18] R. Quirk, S. Greenbaum, G. Leech, and J. Svartvik, A Comprehensive Grammar of the English Language. Longman, 1985.
[19] J. Wiebe and R. Mihalcea, "Word Sense and Subjectivity," Proc. 21st Int'l Conf. Computational Linguistics and 44th Ann. Meeting Assoc. Computational Linguistics (COLING-ACL '06), pp. 1065-1072, 2006.
[20] T. Wilson, J. Wiebe, and P. Hoffmann, "Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis," Proc. Conf. Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP '05), pp. 347-354, 2005.
[21] A. Andreevskaia and S. Bergler, "Mining WordNet for Fuzzy Sentiment: Sentiment Tag Extraction from WordNet Glosses," Proc. Conf. European Chapter of the Assoc. Computational Linguistics (EACL '06), pp. 209-216, 2006.
[22] A. Esuli and F. Sebastiani, "Determining Term Subjectivity and Term Orientation for Opinion Mining," Proc. Conf. European Chapter of the Assoc. Computational Linguistics (EACL '06), vol. 2, pp. 193-200, 2006.
[23] B. Pang and L. Lee, "A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts," Proc. 42nd Ann. Meeting Assoc. Computational Linguistics (ACL '04), pp. 271-278, 2004.
[24] P.J. Stone, M.S. Smith, D.M. Ogilivie, and D.C. Dumphy, The General Inquirer: A Computer Approach to Content Analysis, first ed. MIT Press, 1966.
[25] V. Hatzivassiloglou and K.R. McKeown, "Predicting the Semantic Orientation of Adjectives," Proc. Conf. European Chapter of the Assoc. Computational Linguistics (EACL '97), pp. 174-181, 1997.
[26] P.D. Turney and M.L. Littman, "Unsupervised Learning of Semantic Orientation from a Hundred-Billion-Word Corpus," Technical Report ERB-1094, Nat'l Research Council Canada, Inst. of Information Tech nology, 2002.
[27] M.A. Gamon and A. Aue, "Automatic Identification of Sentiment Vocabulary: Exploiting Low Association with Known Sentiment Terms," Proc. ACL Workshop Feature Eng. for Machine Learning in Natural Language Processing, pp. 57-64, 2005.
[28] E. Riloff, J. Wiebe, and T. Wilson, "Learning Subjective Nouns Using Extraction Pattern Bootstrapping," Proc. Seventh Conf. Natural Language Learning (CoNLL '03), pp. 25-32, 2003.
[29] B. Pang, L. Lee, and S. Vaithyanathan, "Thumbs Up? Sentiment Classification Using Machine Learning Techniques," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP '02), vol. 10, pp. 79-86, 2002.
[30] J. Wiebe and T. Wilson, "Learning to Disambiguate Potentially Subjective Expressions," Proc. Conf. Natural Language Learning (CoNLL '02), pp. 1-7, 2002.
[31] J. Yi, T. Nasukawa, R. Bunescu, and W. Niblack, "Sentiment Analyzer: Extracting Sentiments about a Given Topic Using Natural Language Processing Techniques," Proc. IEEE Int'l Conf. Data Mining (ICDM '03), pp. 427-434, 2003.
[32] E. Riloff and J. Wiebe, "Learning Extraction Patterns for Subjective Expressions," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP '03), pp. 105-112, 2003.
[33] K. Bloom, N. Garg, and S. Argamon, "Extracting Appraisal Expressions," Proc. Human Language Technologies/North Am. Assoc. Computational Linguists (NAACL-HLT '07), pp. 308-315, 2007.
[34] H. Takamura, T. Inui, and M. Okumura, "Extracting Semantic Orientations of Phrases from Dictionary," Proc. Human Language Technologies/North Am. Assoc. Computational Linguists (NAACL-HLT '07), pp. 292-299, 2007.
[35] T. Zagibalov and J.A. Carroll, "Automatic Seed Word Selection for Unsupervised Sentiment Classification of Chinese Text," Proc. 22nd Int'l Conf. Computational Linguistics (COLING '08), vol. 1, pp. 1073-1080, 2008.
[36] T. Wilson, P. Hoffmann, S. Somasundaran, J. Kessler, J. Wiebe, Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan, "OpinionFinder: A System for Subjectivity Analysis," Proc. Conf. Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP) on Interactive Demonstrations, pp. 34-35, 2005.
[37] J. Wiebe and E. Riloff, "Creating Subjective and Objective Sentence Classifiers from Unannotated Texts," Proc. Sixth Int'l Conf. Computational Linguistics and Intelligent Text Processing (CICLing '05), pp. 486-497, 2005.
[38] G.A. Miller, C. Leacock, T. Randee, and R.T. Bunker, "A Semantic Concordance," Proc. Workshop Human Language Technology (HLT '93), pp. 303-308, 1993.
[39] A. Esuli, F. Sebastiani, and I.C. Urciuoli, "Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank," Proc. Int'l Conf. Language Resources and Evaluation (LREC '08), pp. 2247-2251, 2008.
[40] L.-W. Ku, Y.-T. Liang, and H.-H. Chen, "Opinion Extraction, Summarization and Tracking in News and Blog Corpora," Proc. AAAI Spring Symp. Computational Approaches to Analyzing Weblogs, pp. 100-107, 2006.
[41] Y. Wu, "Classifying Attitude by Topic Aspect for English and Chinese Document Collections," PhD dissertation, Univ. of Maryland, College Park, MD, 2008.
[42] R. Mihalcea, C. Banea, and J. Wiebe, "Learning Multilingual Subjective Language via Cross-Lingual Projections," Proc. 45th Ann. Meeting. Assoc. Computational Linguistics (ACL '07), pp. 976-983, 2007.
[43] J. Blitzer, M. Dredze, and F. Pereira, "Biographies, Bollywood, Boom-Boxes and Blenders: Domain Adaptation for Sentiment Classification," Proc. 45th Ann. Meeting. Assoc. Computational Linguistics (ACL '07), pp. 440-447, 2007.
[44] C. Banea, R. Mihalcea, J. Wiebe, and S. Hassan, "Multilingual Subjectivity Analysis Using Machine Translation," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP '08), pp. 127-135, 2008.
[45] M. Bautin, L. Vijayarenu, and S. Skiena, "International Sentiment Analysis for News and Blogs," Proc. Int'l Conf. Weblogs and Social Media (ICWSM '08), pp. 19-26, 2008.
[46] J. Wiebe, "Learning Subjective Adjectives from Corpora," Proc. 17th Nat'l Conf. Artificial Intelligence and 12th Conf. Innovative Applications of Artificial Intelligence (AAAI '00), pp. 735-740, 2000.
[47] G.A. Miller, "WordNet: A Lexical Database for English," Comm. ACM, vol. 38, no. 11, pp. 39-41, 1995.
[48] D. Rao and D. Ravichandran, "Semi-Supervised Polarity Lexicon Induction," Proc. 12th Conf. Empirical Methods in Natural Language Processing (EACL '09), pp. 675-682, 2009.
[49] P. Prettenhofer and B. Stein, "Cross-Language Text Classification Using Structural Correspondence Learning," Proc. 48th Ann. Meeting Assoc. Computational Linguistics (ACL '10), pp. 1118-1127, 2010,
[50] J. Boyd-Graber and P. Resnik, "Holistic Sentiment Analysis across Languages: Multilingual Supervised Latent Dirichlet Allocation," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP '10), pp. 45-55, 2010.
[51] B. Lu, C. Tan, C. Cardie, and B.K. Tsou, "Joint Bilingual Sentiment Classification with Unlabeled Parallel Corpora," Proc. 49th Ann. Meeting Assoc. Computational Linguistics: Human Language Technologies (ACL-HLT '11), pp. 320-330, 2011,
[52] P.D. Turney, "Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL," Proc. 12th European Conf. Machine Learning (ECML '01), pp. 491-502, 2001.
[53] S.T. Dumais, G.W. Furnas, T.K. Landauer, S. Deerwester, and R. Harshman, "Using Latent Semantic Analysis to Improve Access to Textual Information," Proc. SIGCHI Conf. Human Factors in Computing Systems (CHI '88), pp. 281-285, 1988.
[54] T.K. Landauer, P.W. Foltz, and D. Laham, "An Introduction to Latent Semantic Analysis," Discourse Processes, vol. 25, pp. 259-284, 1998.
[55] C. Banea, R. Mihalcea, and J. Wiebe, "A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources," Proc. Int'l Conf. Language Resources and Evaluation (LREC '08), 2008.
[56] R. Ghani, R. Jones, and D. Mladenic, "Mining the Web to Create Minority Language Corpora," Proc. 10th Int'l Conf. Information and Knowledge Management (CIKM '01), 2001.
[57] V. Vapnik, The Nature of Statistical Learning Theory. Springer, 1995.
[58] T. Joachims, "Text Categorization with Support Vector Machines: Learning with Many Relevant Features," Proc. European Conf. Machine Learning (ECML '98), pp. 137-142, 1998.
[59] Y. Yang and X. Liu, "A Reexamination of Text Categorization Methods," Proc. 22nd Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval (ACM-SIGIR '99), 1999.
[60] G. Salton and C. Buckley, "Term Weighting Approaches in Automatic Text Retrieval," Readings in Information Retrieval, Morgan Kaufmann, 1997.
[61] Y. Li, K. Bontcheva, and H. Cunningha, "Experiments of Opinion Analysis on two Corpora MPQA and NTCIR-6," Proc. NTCIR Workshop Meeting. Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access (NTCIR '07), pp. 323-329, 2007.
[62] J. Kim, J.-J. Li, and J.-H. Lee, "Evaluating Multilanguage-Comparability of Subjectivity Analysis Systems," Proc. Ann. Meeting Assoc. Computational Linguistics (ACL '10), pp. 595-603, 2010.
[63] I. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, 2005.
23 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool