Issue No. 01 - Jan.-Feb. (2013 vol. 10)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.166
Xiaowei Zhou , Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Can Yang , Dept. of Biostat., Yale Univ., New Haven, CT, USA
Xiang Wan , Dept. of Comput. Sci., Hong Kong Baptist Univ., Hong Kong, China
Hongyu Zhao , Dept. of Biostat., Yale Univ., New Haven, CT, USA
Weichuan Yu , Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
DNA copy number variation (CNV) accounts for a large proportion of genetic variation. One commonly used approach to detecting CNVs is array-based comparative genomic hybridization (aCGH). Although many methods have been proposed to analyze aCGH data, it is not clear how to combine information from multiple samples to improve CNV detection. In this paper, we propose to use a matrix to approximate the multisample aCGH data and minimize the total variation of each sample as well as the nuclear norm of the whole matrix. In this way, we can make use of the smoothness property of each sample and the correlation among multiple samples simultaneously in a convex optimization framework. We also developed an efficient and scalable algorithm to handle large-scale data. Experiments demonstrate that the proposed method outperforms the state-of-the-art techniques under a wide range of scenarios and it is capable of processing large data sets with millions of probes.
Spectral analysis, Optimization, Convex optimization
Xiaowei Zhou, Can Yang, Xiang Wan, Hongyu Zhao and Weichuan Yu, "Multisample aCGH Data Analysis via Total Variation and Spectral Regularization," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. 1, pp. 230-235, 2013.