Pacific-Asia Workshop on Computational Intelligence and Industrial Application, IEEE (2008)
Dec. 19, 2008 to Dec. 20, 2008
This paper analyzes the performance of the Apriori algorithm and identifies its performance bottleneck. Existing algorithms for mining association rules tend to focus on either efficiency or accuracy alone, and the two goals conflict. To resolve this conflict, a novel algorithm based on probability estimation and least squares estimation is proposed to mine association rules with high correlation and high confidence from a database. Probability estimation reduces the number of database scans and so increases efficiency; least squares estimation rests on a rigorous, classical mathematical model and so enhances accuracy. Furthermore, we derive a recurrence formula to handle K-itemsets. Experimental results demonstrate that our algorithm is not only efficient but also preserves the completeness of the frequent items.
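To make the scanning bottleneck concrete, here is a minimal sketch of textbook Apriori that counts its passes over the database, followed by a naive support estimate. The function names (`apriori`, `estimate_support`), the toy data layout, and the independence assumption in the estimator are illustrative assumptions; the abstract does not specify the paper's actual probability estimator, its least squares step, or its recurrence formula, so none of those are reproduced here.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Textbook Apriori over a list of sets.

    Returns (frequent, scans): a dict mapping each frequent itemset
    (a frozenset) to its support, and the number of counting passes
    made over the database.
    """
    n = len(transactions)
    scans = 0
    frequent = {}
    # Level-1 candidates: every distinct item. Collecting the item
    # universe is not counted as a scan in this sketch.
    candidates = {frozenset([item]) for t in transactions for item in t}
    k = 1
    while candidates:
        # One full pass over the database per itemset size k.
        scans += 1
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        level = {c: cnt / n for c, cnt in counts.items()
                 if cnt / n >= min_support}
        frequent.update(level)
        # Join: unite pairs of frequent k-itemsets into (k+1)-itemsets.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune: keep a candidate only if all its (k-1)-subsets are frequent.
        candidates = {c for c in joined
                      if all(frozenset(s) in level
                             for s in combinations(c, k - 1))}
    return frequent, scans


def estimate_support(itemset, item_support):
    """Hypothetical estimator: product of single-item supports.

    Assumes items occur independently, which is NOT claimed to be the
    paper's model; it only shows how an estimate can avoid a rescan.
    """
    estimate = 1.0
    for item in itemset:
        estimate *= item_support[item]
    return estimate
```

On a database whose longest frequent itemset has size k, this baseline makes up to k + 1 counting passes, whereas an estimate of this kind needs only the single-item supports from the first pass. The trade-off is accuracy, which is the gap the abstract says the least squares step is meant to close.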
association rules, probability estimate, least square estimate
C. Li, W. Chen and Y. Yu, "Association Rules Mining Based on an Optimized Probability and Statistics Estimate Model," 2008 Pacific-Asia Workshop on Computational Intelligence and Industrial Application. PACIIA 2008(PACIIA), Wuhan, 2008, pp. 3-7.