2014 IEEE International Conference on Data Mining (ICDM) (2014)
Dec. 14, 2014 to Dec. 17, 2014
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2014.98
Threaded debate forums have become one of the major social media platforms. Usually people argue with one another using not only claims and evidences about the topic under discussion but also language used to organize them, which we refer to as shell. In this paper, we study how to separate shell from topical contents using unsupervised methods. Along this line, we develop a latent variable model named Shell Topic Model (STM) to jointly model both topics and shell. Experiments on real online debate data show that our model can find both meaningful shell and topics. The results also show the effectiveness of our model by comparing it with several baselines in shell phrases extraction and document modeling.
Hidden Markov models, Data models, Data mining, Media, Educational institutions, Noise measurement, Training
J. Du, J. Jiang, L. Yang, D. Song and L. Liao, "Shell Miner: Mining Organizational Phrases in Argumentative Texts in Social Media," 2014 IEEE International Conference on Data Mining (ICDM), Shenzhen, China, 2014, pp. 797-802.