The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.09 - September (2004 vol.16)
pp: 1052-1069
ABSTRACT
In classical association rules mining, a minimum support threshold is assumed to be available for mining frequent itemsets. However, setting such a threshold is typically hard. In this paper, we handle a more practical problem; roughly speaking, it is to mine N k-itemsets with the highest supports for k up to a certain k_{max} value. We call the results the N-most interesting itemsets. Generally, it is more straightforward for users to determine N and k_{max}. We propose two new algorithms, LOOPBACK and BOMO. Experiments show that our methods outperform the previously proposed Itemset-Loop algorithm, and the performance of BOMO can be an order of magnitude better than the original FP-tree algorithm, even with the assumption of an optimally chosen support threshold. We also propose the mining of "N-most interesting k-itemsets with item constraints.” This allows user to specify different degrees of interestingness for different itemsets. Experiments show that our proposed Double FP-trees algorithm, which is based on BOMO, is highly efficient in solving this problem.
INDEX TERMS
Association rules, N-most interesting itemsets, FP-tree, item constraints.
CITATION
Yin-Ling Cheung, Ada Wai-Chee Fu, "Mining Frequent Itemsets without Support Threshold: With and without Item Constraints", IEEE Transactions on Knowledge & Data Engineering, vol.16, no. 9, pp. 1052-1069, September 2004, doi:10.1109/TKDE.2004.44
19 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool