IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2013, vol. 10, Issue No. 02 - March-April

pp: 372-382

Jiaoyun Yang, Anhui Province & the Sch. of Comput. Sci., Univ. of Sci. & Technol. of China, Hefei, China

Yun Xu, Anhui Province & the Sch. of Comput. Sci., Univ. of Sci. & Technol. of China, Hefei, China

Xiaohui Yao, Anhui Province & the Sch. of Comput. Sci., Univ. of Sci. & Technol. of China, Hefei, China

Guoliang Chen, Anhui Province & the Sch. of Comput. Sci., Univ. of Sci. & Technol. of China, Hefei, China

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2013.18

ABSTRACT

An enormous amount of sequence data has been generated with the development of new DNA sequencing technologies, which presents great challenges for computational biology problems such as haplotype phasing. Although arduous efforts have been made to address this problem, current methods still cannot efficiently deal with the incoming flood of large-scale data. In this paper, we propose a flow network model to tackle the haplotype phasing problem, and explain some classical haplotype phasing rules based on this model. By incorporating the heuristic knowledge obtained from these classical rules, we design an algorithm, FNphasing, based on the flow network model. Theoretically, the time complexity of our algorithm is $O(n^{2}m+m^{2})$, which is better than that of 2SNP, one of the most efficient algorithms currently available. Tests of FNphasing on several simulated data sets show that, when applied to large-scale data sets, our algorithm is significantly faster than the state-of-the-art Beagle algorithm. FNphasing also achieves equal or superior accuracy compared with other approaches.

INDEX TERMS

Hidden Markov models, Algorithm design and analysis, Accuracy, Phylogeny, Computational biology, Bioinformatics, Data models, Flow network, Heuristic methods, Haplotype phasing

CITATION

Jiaoyun Yang, Yun Xu, Xiaohui Yao, Guoliang Chen, "FNphasing: A Novel Fast Heuristic Algorithm for Haplotype Phasing Based on Flow Network Model", *IEEE/ACM Transactions on Computational Biology and Bioinformatics*, vol. 10, no. 2, pp. 372-382, March-April 2013, doi:10.1109/TCBB.2013.18

REFERENCES