This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2012 IEEE 12th International Conference on Data Mining Workshops
Sentiment Polarity Classification Using Statistical Data Compression Models
Brussels, Belgium Belgium
December 10-December 10
ISBN: 978-1-4673-5164-5
With growing availability and popularity of user generated content, the discipline of sentiment analysis has come to the attention of many researchers. Existing work has mainly focused on either knowledge based methods or standard machine learning techniques. In this paper we investigate sentiment polarity classification based on adaptive statistical data compression models. We evaluate the classification performance of the loss less compression algorithm Prediction by Partial Matching (PPM) as well as compression based measures using PPM-like character n-gram frequency statistics. Comprehensive experiments on three corpora show that compression based methods are efficient, easy to apply and can compete with the accuracy of sophisticated classifiers such as support vector machines.
Index Terms:
Accuracy,Training,Entropy,Support vector machines,Compression algorithms,Frequency measurement,Computational modeling,Prediction by Partial Matching,sentiment analysis,opinion mining,text classification,data compression
Citation:
Dominique Ziegelmayer, Rainer Schrader, "Sentiment Polarity Classification Using Statistical Data Compression Models," icdmw, pp.731-738, 2012 IEEE 12th International Conference on Data Mining Workshops, 2012
Usage of this product signifies your acceptance of the Terms of Use.