The Community for Technology Leaders
Green Image
Issue No. 02 - March/April (2012 vol. 9)
ISSN: 1545-5963
pp: 499-510
Wenji Ma , Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
Yong Yang , Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
Zhi-Zhong Chen , Dept. of Inf. Syst. Design, Tokyo Denki Univ., Tokyo, Japan
Lusheng Wang , Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
ABSTRACT
Linkage analysis serves as a way of finding locations of genes that cause genetic diseases. Linkage studies have facilitated the identification of several hundreds of human genes that can harbor mutations which by themselves lead to a disease phenotype. The fundamental problem in linkage analysis is to identify regions whose allele is shared by all or almost all affected members but by none or few unaffected members. Almost all the existing methods for linkage analysis are for families with clearly given pedigrees. Little work has been done for the case where the sampled individuals are closely related, but their pedigree is not known. This situation occurs very often when the individuals share a common ancestor at least six generations ago. Solving this case will tremendously extend the use of linkage analysis for finding genes that cause genetic diseases. In this paper, we propose a mathematical model (the shared center problem) for inferring the allele-sharing status of a given set of individuals using a database of confirmed haplotypes as reference. We show the NP-completeness of the shared center problem and present a ratio-2 polynomial-time approximation algorithm for its minimization version (called the closest shared center problem). We then convert the approximation algorithm into a heuristic algorithm for the shared center problem. Based on this heuristic, we finally design a heuristic algorithm for mutation region detection. We further implement the algorithms to obtain a software package. Our experimental data show that the software is both fast and accurate. The package is available at >;http://www.cs.cityu.edu.hk/~lwang/software/LDWP/ for noncommercial use.
INDEX TERMS
Algorithm design and analysis, Couplings, Approximation algorithms, Software algorithms, Biological cells, Heuristic algorithms, Inference algorithms,and approximation algorithm., Haplotype inference, linkage analysis, pedigree, allele-sharing status
CITATION
Wenji Ma, Yong Yang, Zhi-Zhong Chen, Lusheng Wang, "Mutation Region Detection for Closely Related Individuals without a Known Pedigree", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 9, no. , pp. 499-510, March/April 2012, doi:10.1109/TCBB.2011.134
232 ms
(Ver 3.1 (10032016))