loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06)
Efficient Clustering for Orders
Hong Kong, China
December 18-December 22
ISBN: 0-7695-2702-7
Toshihiro Kamishima, National Institute of Advanced Industrial Science and Technology (AIST), Japan
Shotaro Akaho, National Institute of Advanced Industrial Science and Technology (AIST), Japan
Lists of ordered objects are widely used as representational forms. Such ordered objects include Web search results or best-seller lists. Clustering is a useful data analysis technique for grouping mutually similar objects. To cluster orders, hierarchical clustering methods have been used together with dissimilarities defined between pairs of orders. However, hierarchical clustering methods cannot be applied to large-scale data due to their computational cost in terms of the number of orders. To avoid this problem, we developed an k-o?means algorithm. This algorithm successfully extracted grouping structures in orders, and was computationally efficient with respect to the number of orders. However, it was not efficient in cases where there are too many possible objects yet. We therefore propose a new method (k-o?means-EBC), grounded on a theory of order statistics. We further propose several techniques to analyze acquired clusters of orders.
Citation:
Toshihiro Kamishima, Shotaro Akaho, "Efficient Clustering for Orders," icdmw, pp.274-278, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.