19th IEEE International Conference on Tools with Artificial Intelligence - Vol.2 (ICTAI 2007)
Mining Prevalence-Based Ratio Patterns
Paris, France
October 29-October 31
ISBN: 0-7695-3015-X
Association rule mining aims to discover sets of features that occur together. A variation of association rule min- ing is ratio rule mining. A ratio rule is an eigenvector of the database that describes ratios of features. However, ra- tio rules are sensitive to outliers. In this work, we design a prevalence-based model for mining ratio patterns from a database. Our model is more robust to noises, and ratio patterns in our model have clear statistic meanings. We develop an algorithm to quickly determine the sets of fea- tures and their ratios that satisfy the prevalence require- ment. Data structures, such as hash table and hash tree are utilized to further improve the efficiency of the algorithm. Experiments on synthetic data indicates the efficiency and scalability of the proposed algorithm. We also present a case study on US census data.
Citation:
Minghua Zhang, Wynne Hsu, Mong Li Lee, "Mining Prevalence-Based Ratio Patterns," ictai, vol. 2, pp.140-147, 19th IEEE International Conference on Tools with Artificial Intelligence - Vol.2 (ICTAI 2007), 2007