2012 IEEE 12th International Conference on Data Mining Workshops (2012)
Brussels, Belgium Belgium
Dec. 10, 2012 to Dec. 10, 2012
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDMW.2012.43
With growing availability and popularity of user generated content, the discipline of sentiment analysis has come to the attention of many researchers. Existing work has mainly focused on either knowledge based methods or standard machine learning techniques. In this paper we investigate sentiment polarity classification based on adaptive statistical data compression models. We evaluate the classification performance of the loss less compression algorithm Prediction by Partial Matching (PPM) as well as compression based measures using PPM-like character n-gram frequency statistics. Comprehensive experiments on three corpora show that compression based methods are efficient, easy to apply and can compete with the accuracy of sophisticated classifiers such as support vector machines.
Accuracy, Training, Entropy, Support vector machines, Compression algorithms, Frequency measurement, Computational modeling, Prediction by Partial Matching, sentiment analysis, opinion mining, text classification, data compression
Dominique Ziegelmayer, Rainer Schrader, "Sentiment Polarity Classification Using Statistical Data Compression Models", 2012 IEEE 12th International Conference on Data Mining Workshops, vol. 00, no. , pp. 731-738, 2012, doi:10.1109/ICDMW.2012.43