CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2012 vol.9 Issue No.02 - March/April

Subscribe

Issue No.02 - March/April (2012 vol.9)

pp: 548-559

Biing-Feng Wang , Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2011.112

ABSTRACT

The focus of this paper is the problem of finding all nested common intervals of two general sequences. Depending on the treatment one wants to apply to duplicate genes, Blin et al. introduced three models to define nested common intervals of two sequences: the uniqueness, the free-inclusion, and the bijection models. We consider all the three models. For the uniqueness and the bijection models, we give O(n + N

_{out})-time algorithms, where N_{out}denotes the size of the output. For the free-inclusion model, we give an O(n^{1+ε}+ N_{out})-time algorithm, where ε >; 0 is an arbitrarily small constant. We also present an upper bound on the size of the output for each model. For the uniqueness and the free-inclusion models, we show that N_{out}= O(n^{2}). Let C = Σ_{gϵΓ}o_{1}(g)o_{2}(5), where Γ is the set of distinct genes, and o_{1}(g) and o_{2}(g) are, respectively, the numbers of copies of g in the two given sequences. For the bijection model, we show that N_{out}= O(Cn). In this paper, we also study the problem of finding all approximate nested common intervals of two sequences on the bijection model. An O(δn + N_{out})-time algorithm is presented, where δ denotes the maximum number of allowed gaps. In addition, we show that for this problem N_{out}is O(δn^{3}).INDEX TERMS

Approximation algorithms, Algorithm design and analysis, Bioinformatics, Biological system modeling, Computational modeling, Genomics, Computational biology,conserved gene clusters., Algorithms, data structures, common intervals, comparative genomics

CITATION

Biing-Feng Wang, "Output-Sensitive Algorithms for Finding the Nested Common Intervals of Two General Sequences",

*IEEE/ACM Transactions on Computational Biology and Bioinformatics*, vol.9, no. 2, pp. 548-559, March/April 2012, doi:10.1109/TCBB.2011.112REFERENCES