|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information
Nov.-Dec. 2012 (vol. 9 no. 6)
pp. 1582-1594
| ASCII Text | x | ||
| Y. Pirola, G. D. Vedova, S. Biffani, A. Stella, P. Bonizzoni, "A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 9, no. 6, pp. 1582-1594, Nov.-Dec., 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/TCBB.2012.100, author = {Y. Pirola and G. D. Vedova and S. Biffani and A. Stella and P. Bonizzoni}, title = {A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information}, journal ={IEEE/ACM Transactions on Computational Biology and Bioinformatics}, volume = {9}, number = {6}, issn = {1545-5963}, year = {2012}, pages = {1582-1594}, doi = {http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.100}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics TI - A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information IS - 6 SN - 1545-5963 SP1582 EP1594 EPD - 1582-1594 A1 - Y. Pirola, A1 - G. D. Vedova, A1 - S. Biffani, A1 - A. Stella, A1 - P. Bonizzoni, PY - 2012 KW - genomics KW - bioinformatics KW - computability KW - constraint handling KW - incomplete information KW - genotype phasing KW - genotype imputation KW - combinatorial formulation KW - pedigrees KW - minimum-recombinant haplotype configuration with bounded errors problem KW - MRHCE KW - satisfiability problem KW - constraint programming literature KW - biological soundness KW - phasing model KW - genotyping errors KW - erroneous information KW - Bioinformatics KW - Genetics KW - Algorithm design and analysis KW - Computational biology KW - Genomics KW - recombinations KW - Haplotype inference KW - pedigrees KW - genotyping errors KW - missing data VL - 9 JA - IEEE/ACM Transactions on Computational Biology and Bioinformatics ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.100
Web Extra: View Supplemental Materials (PDF)
The MINIMUM-RECOMBINANT HAPLOTYPE CONFIGURATION problem (MRHC) has been highly successful in providing a sound combinatorial formulation for the important problem of genotype phasing on pedigrees. Despite several algorithmic advances that have improved the efficiency, its applicability to real data sets has been limited since it does not take into account some important phenomena such as mutations, genotyping errors, and missing data. In this work, we propose the MINIMUM-RECOMBINANT HAPLOTYPE CONFIGURATION WITH BOUNDED ERRORS problem (MRHCE), which extends the original MRHC formulation by incorporating the two most common characteristics of real data: errors and missing genotypes (including untyped individuals). We describe a practical algorithm for MRHCE that is based on a reduction to the well-known Satisfiability problem (SAT) and exploits recent advances in the constraint programming literature. An experimental analysis demonstrates the biological soundness of the phasing model and the effectiveness (on both accuracy and performance) of the algorithm under several scenarios. The analysis on real data and the comparison with state-of-the-art programs reveals that our approach couples better scalability to large and complex pedigrees with the explicit inclusion of genotyping errors into the model.
Index Terms:
genomics,bioinformatics,computability,constraint handling,incomplete information,genotype phasing,genotype imputation,combinatorial formulation,pedigrees,minimum-recombinant haplotype configuration with bounded errors problem,MRHCE,satisfiability problem,constraint programming literature,biological soundness,phasing model,genotyping errors,erroneous information,Bioinformatics,Genetics,Algorithm design and analysis,Computational biology,Genomics,recombinations,Haplotype inference,pedigrees,genotyping errors,missing data
Citation:
Y. Pirola, G. D. Vedova, S. Biffani, A. Stella, P. Bonizzoni, "A fast and practical approach to genotype phasing and imputation on a pedigree with erroneous and incomplete information," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 9, no. 6, pp. 1582-1594, Nov.-Dec. 2012, doi:10.1109/TCBB.2012.100
Usage of this product signifies your acceptance of the Terms of Use.

