This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases
Aug. 2013 (vol. 25 no. 8)
pp. 1772-1786
Vincent S. Tseng, National Cheng Kung University, Tainan City
Bai-En Shie, National Cheng Kung University, Tainan City
Cheng-Wei Wu, National Cheng Kung University, Tainan City
Philip S. Yu, University of Illnois at Chicago, Chicago and King Abdulaziz University, Jeddah
Mining high utility itemsets from a transactional database refers to the discovery of itemsets with high utility like profits. Although a number of relevant algorithms have been proposed in recent years, they incur the problem of producing a large number of candidate itemsets for high utility itemsets. Such a large number of candidate itemsets degrades the mining performance in terms of execution time and space requirement. The situation may become worse when the database contains lots of long transactions or long high utility itemsets. In this paper, we propose two algorithms, namely utility pattern growth (UP-Growth) and UP-Growth+, for mining high utility itemsets with a set of effective strategies for pruning candidate itemsets. The information of high utility itemsets is maintained in a tree-based data structure named utility pattern tree (UP-Tree) such that candidate itemsets can be generated efficiently with only two scans of database. The performance of UP-Growth and UP-Growth+ is compared with the state-of-the-art algorithms on many types of both real and synthetic data sets. Experimental results show that the proposed algorithms, especially UP-Growth+, not only reduce the number of candidates effectively but also outperform other algorithms substantially in terms of runtime, especially when databases contain lots of long transactions.
Index Terms:
Itemsets,Algorithm design and analysis,Association rules,Data structures,data mining,Candidate pruning,frequent itemset,high utility itemset,utility mining
Citation:
Vincent S. Tseng, Bai-En Shie, Cheng-Wei Wu, Philip S. Yu, "Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases," IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 8, pp. 1772-1786, Aug. 2013, doi:10.1109/TKDE.2012.59
Usage of this product signifies your acceptance of the Terms of Use.