loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Third IEEE International Conference on Data Mining (ICDM'03)
Applying Noise Handling Techniques to Genomic Data: A Case Study
Melbourne, Florida
November 19-November 22
ISBN: 0-7695-1978-4
Choh Man Teng, Institute for Human and Machine Cognition, Pensacola FL
Osteogenesis Imperfecta (OI) is a genetic collagenous disease associated with mutations in one or both of the genes COLIA1 and COLIA2. There are at least four known phenotypes of OI, of which type II is the severest and often lethal. We identified three approaches to noise handling, namely, robust algorithms, filtering, and polishing, and evaluated their effectiveness when applied to the problem of classifying the disease OI based on a data set of amino acid sequences and associated information of point mutations of COLIA1. Preliminary results suggest that each noise handling mechanism can be useful under different circumstances. Filtering is stable across all cases. Pruning with robust c4.5 increased the classification accuracy in some cases, and polishing gave rise to some additional improvement in classifying the lethal OI phenotype.
Citation:
Choh Man Teng, "Applying Noise Handling Techniques to Genomic Data: A Case Study," icdm, pp.743, Third IEEE International Conference on Data Mining (ICDM'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.