The Community for Technology Leaders
Green Image
Issue No. 01 - Jan. (2014 vol. 26)
ISSN: 1041-4347
pp: 120-130
Toshimitsu Takahashi , The University of Tokyo, Tokyo
Ryota Tomioka , The University of Tokyo, Tokyo
Kenji Yamanishi , The University of Tokyo, Tokyo
ABSTRACT
Detection of emerging topics is now receiving renewed interest motivated by the rapid growth of social networks. Conventional-term-frequency-based approaches may not be appropriate in this context, because the information exchanged in social-network posts include not only text but also images, URLs, and videos. We focus on emergence of topics signaled by social aspects of theses networks. Specifically, we focus on mentions of users--links between users that are generated dynamically (intentionally or unintentionally) through replies, mentions, and retweets. We propose a probability model of the mentioning behavior of a social network user, and propose to detect the emergence of a new topic from the anomalies measured through the model. Aggregating anomaly scores from hundreds of users, we show that we can detect emerging topics only based on the reply/mention relationships in social-network posts. We demonstrate our technique in several real data sets we gathered from Twitter. The experiments show that the proposed mention-anomaly-based approaches can detect new topics at least as early as text-anomaly-based approaches, and in some cases much earlier when the topic is poorly identified by the textual contents in posts.
INDEX TERMS
Social network services, Maximum likelihood estimation, Encoding, Hidden Markov models, Density functional theory, Training,burst detection, Topic detection, anomaly detection, social networks, sequentially discounted normalized maximum-likelihood coding
CITATION
Toshimitsu Takahashi, Ryota Tomioka, Kenji Yamanishi, "Discovering Emerging Topics in Social Streams via Link-Anomaly Detection", IEEE Transactions on Knowledge & Data Engineering, vol. 26, no. , pp. 120-130, Jan. 2014, doi:10.1109/TKDE.2012.239
164 ms
(Ver )