Third IEEE International Conference on Data Mining (ICDM'03) Applying Noise Handling Techniques to Genomic Data: A Case Study Melbourne, Florida November 19-November 22 ISBN: 0-7695-1978-4
Osteogenesis Imperfecta (OI) is a genetic collagenous disease associated with mutations in one or both of the genes COLIA1 and COLIA2. There are at least four known phenotypes of OI, of which type II is the severest and often lethal. We identified three approaches to noise handling, namely, robust algorithms, filtering, and polishing, and evaluated their effectiveness when applied to the problem of classifying the disease OI based on a data set of amino acid sequences and associated information of point mutations of COLIA1. Preliminary results suggest that each noise handling mechanism can be useful under different circumstances. Filtering is stable across all cases. Pruning with robust c4.5 increased the classification accuracy in some cases, and polishing gave rise to some additional improvement in classifying the lethal OI phenotype.
Citation:
Choh Man Teng, "Applying Noise Handling Techniques to Genomic Data: A Case Study," icdm, pp.743, Third IEEE International Conference on Data Mining (ICDM'03), 2003 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||