loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
22nd International Conference on Data Engineering Workshops (ICDEW'06)
Mining Frequent Itemsets from Noisy Data
Atlanta, Georgia
April 03-April 07
ISBN: 0-7695-2571-7
Kasuyo Narita, University of Tsukuba, Japan
Hiroyuki Kitagawa, University of Tsukuba, Japan
As we face huge amounts of varied information, data mining, which helps us discover hidden features or rules from voluminous data systematically, has become more important [3, 4, 6, 10]. However, real world data is often dirty, including noise such as missing or irrelevant values. The information mined from such noisy data may be incorrect. We model noisy data with probabilities, assuming that noise is mixed with data statistically. We also propose a way to find frequent itemsets [2] by estimating supports on noiseless data from noisy data. An algorithm using FP-tree [6, 10] is also presented to mine frequent itemsets efficiently.
Citation:
Kasuyo Narita, Hiroyuki Kitagawa, "Mining Frequent Itemsets from Noisy Data," icdew, pp.x117, 22nd International Conference on Data Engineering Workshops (ICDEW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.