This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fifth IEEE International Conference on Data Mining (ICDM'05)
Text Representation: From Vector to Tensor
Houston, Texas
November 27-November 30
ISBN: 0-7695-2278-5
Ning Liu, Tsinghua University
Benyu Zhang, Microsoft Research Asia
Jun Yan, Peking University
Zheng Chen, Microsoft Research Asia
Wenyin Liu, City University of Hong Kong
Fengshan Bai, Tsinghua University
Leefeng Chien, Academia Sinica
In this paper, we propose a text representation model, Tensor Space Model (TSM), which models the text by multilinear algebraic high-order tensor instead of the traditional vector. Supported by techniques of multilinear algebra, TSM offers a potent mathematical framework for analyzing the multifactor structures. TSM is further supported by certain introduced particular operations and presented tools, such as the High-Order Singular Value Decomposition (HOSVD) for dimension reduction and other applications. Experimental results on the 20 Newsgroups dataset show that TSM is constantly better than VSM for text classification.
Citation:
Ning Liu, Benyu Zhang, Jun Yan, Zheng Chen, Wenyin Liu, Fengshan Bai, Leefeng Chien, "Text Representation: From Vector to Tensor," icdm, pp.725-728, Fifth IEEE International Conference on Data Mining (ICDM'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.