This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
An Unsupervised Approach for Person Name Bipolarization Using Principal Component Analysis
Nov. 2012 (vol. 24 no. 11)
pp. 1963-1976
Chien Chin Chen, National Taiwan University, Taipei
Zhong-Yong Chen, National Taiwan University, Taipei
Chen-Yuan Wu, National Taiwan University, Taipei
A topic is usually associated with a specific time, place, and person(s). Generally, topics that involve bipolar or competing viewpoints are attention getting and are thus reported in a large number of documents. Identifying the association between important persons mentioned in numerous topic documents would help readers comprehend topics more easily. In this paper, we propose an unsupervised approach for identifying bipolar person names in a set of topic documents. Specifically, we employ principal component analysis (PCA) to discover bipolar word usage patterns of person names in the documents, and show that the signs of the entries in the principal eigenvector of PCA partition the person names into bipolar groups spontaneously. To reduce the effect of data sparseness, we introduce two techniques, called the weighted correlation coefficient and off-topic block elimination. We also present a timeline system that shows the intensity and activeness development of the identified bipolar person groups. Empirical evaluations demonstrate the efficacy of the proposed approach in identifying bipolar person names in topic documents, while the generated timelines provide comprehensive storylines of topics.
Index Terms:
Principal component analysis,Correlation,Symmetric matrices,Hidden Markov models,Web pages,Matrix decomposition,Internet,bipolar timeline,Topic mining,sentiment analysis
Citation:
Chien Chin Chen, Zhong-Yong Chen, Chen-Yuan Wu, "An Unsupervised Approach for Person Name Bipolarization Using Principal Component Analysis," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 11, pp. 1963-1976, Nov. 2012, doi:10.1109/TKDE.2011.177
Usage of this product signifies your acceptance of the Terms of Use.