loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
22nd International Conference on Data Engineering Workshops (ICDEW'06)
Text Mining using PrefixSpan constrained by Item Interval and Item Attribute
Atlanta, Georgia
April 03-April 07
ISBN: 0-7695-2571-7
Issei Sato, Waseda University, Japan
Yu Hirate, Waseda University, Japan
Hayato Yamana, National Institute of Infomatices, Japan
Applying conventional sequential pattern mining methods to text data extracts many uninteresting patterns, which increases the time to interpret the extracted patterns. To solve this problem, we propose a new sequential pattern mining algorithm by adopting the following two constraints. One is to select sequences with regard to item intervals--the number of items between any two adjacent items in a sequence--and the other is to select sequences with regard to item attributes. Using Amazon customer reviews in the book category, we have confirmed that our method is able to extract patterns faster than the conventional method, and is better able to exclude uninteresting patterns while retaining the patterns of interest.
Citation:
Issei Sato, Yu Hirate, Hayato Yamana, "Text Mining using PrefixSpan constrained by Item Interval and Item Attribute," icdew, pp.x118, 22nd International Conference on Data Engineering Workshops (ICDEW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.