Integrated Analysis of Gene Expression and Copy Number Data on Gene Shaving Using Independent Component Analysis
Issue No. 06 - November/December (2011 vol. 8)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2011.71
Hong-Wen Deng , Tulane University, New Orleans
Yu-Ping Wang , Tulane University, New Orleans
Jinhua Sheng , Indiana University, Indianapolis
Vince D. Calhoun , The Mind Research Network, New Mexico
DNA microarray gene expression and microarray-based comparative genomic hybridization (aCGH) have been widely used for biomedical discovery. Because of the large number of genes and the complex nature of biological networks, various analysis methods have been proposed. One such method is "gene shaving,” a procedure which identifies subsets of the genes with coherent expression patterns and large variation across samples. Since combining genomic information from multiple sources can improve classification and prediction of diseases, in this paper we proposed a new method, "ICA gene shaving” (ICA, independent component analysis), for jointly analyzing gene expression and copy number data. First we used ICA to analyze joint measurements, gene expression and copy number, of a biological system and project the data onto statistically independent biological processes. Next, we used these results to identify patterns of variation in the data and then applied an iterative shaving method. We investigated the properties of our proposed method by analyzing both simulated and real data. We demonstrated that the robustness of our method to noise using simulated data. Using breast cancer data, we showed that our method is superior to the Generalized Singular Value Decomposition (GSVD) gene shaving method for identifying genes associated with breast cancer.
Clustering technique, comparative genomic hybridization (CGH), copy number variation (CNV), generalized singular value decomposition (GSVD), gene expression, gene shaving, independent component analysis (ICA).
Hong-Wen Deng, Yu-Ping Wang, Jinhua Sheng, Vince D. Calhoun, "Integrated Analysis of Gene Expression and Copy Number Data on Gene Shaving Using Independent Component Analysis", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. , pp. 1568-1579, November/December 2011, doi:10.1109/TCBB.2011.71