The Community for Technology Leaders
International Workshop on Challenges in Web Information Retrieval and Integration (2005)
Tokyo, Japan
Apr. 8, 2005 to Apr. 9, 2005
ISBN: 0-7695-2414-1
pp: 122-127
Masafumi Hamamoto , Graduate School of Systems and Information Engineering
Hiroyuki Kitagawa , Center for Computational Science, University of Tsukuba
Jia-Yu Pan , Computer Science Department Carnegie Mellon University
Christos Faloutsos , Computer Science Department Carnegie Mellon University
ABSTRACT
<p>Topic detection is an important subject when voluminous text data is sent continuously to a user. We examine a method to detect topics in text data using feature vectors. Feature vectors represent the main distribution of data and they are obtained by various data analysis methods. This paper examines three methods: Singular Value Decomposition (SVD), clustering, and Independent Component Analysis (ICA). SVD and clustering are popular existing methods. Clustering, especially, is applied to many topic detection methods. ICA was recently developed in signal processing research. In applications related to text data, however, ICA has not been compared with SVD and clustering, nor has its relationship with them been explored. This paper reports comparative experiments for these three methods and then shows properties as they apply to text data.</p>
INDEX TERMS
null
CITATION

M. Hamamoto, C. Faloutsos, H. Kitagawa and J. Pan, "A Comparative Study of Feature Vector-Based Topic Detection Schemes A Comparative Study of Feature Vector-Based Topic Detection Schemes," International Workshop on Challenges in Web Information Retrieval and Integration(WIRI), Tokyo, Japan, 2005, pp. 122-127.
doi:10.1109/WIRI.2005.1
92 ms
(Ver 3.3 (11022016))