22nd International Conference on Data Engineering Workshops (ICDEW'06) Mining Frequent Itemsets from Noisy Data Atlanta, Georgia April 03-April 07 ISBN: 0-7695-2571-7
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDEW.2006.90
As we face huge amounts of varied information, data mining, which helps us discover hidden features or rules from voluminous data systematically, has become more important [3, 4, 6, 10]. However, real world data is often dirty, including noise such as missing or irrelevant values. The information mined from such noisy data may be incorrect. We model noisy data with probabilities, assuming that noise is mixed with data statistically. We also propose a way to find frequent itemsets [2] by estimating supports on noiseless data from noisy data. An algorithm using FP-tree [6, 10] is also presented to mine frequent itemsets efficiently.
Citation:
Kasuyo Narita, Hiroyuki Kitagawa, "Mining Frequent Itemsets from Noisy Data," icdew, pp.x117, 22nd International Conference on Data Engineering Workshops (ICDEW'06), 2006 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||