Publication 2013 Issue No. 6 - Nov.-Dec. Abstract - Protein Chain Pair Simplification under the Discrete Fréchet Distance
Protein Chain Pair Simplification under the Discrete Fréchet Distance
Nov.-Dec. 2013 (vol. 10 no. 6)
pp. 1372-1383
 ASCII Text x Tim Wylie, Binhai Zhu, "Protein Chain Pair Simplification under the Discrete Fréchet Distance," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. 6, pp. 1372-1383, Nov.-Dec., 2013.
 BibTex x @article{ 10.1109/TCBB.2013.17,author = {Tim Wylie and Binhai Zhu},title = {Protein Chain Pair Simplification under the Discrete Fréchet Distance},journal ={IEEE/ACM Transactions on Computational Biology and Bioinformatics},volume = {10},number = {6},issn = {1545-5963},year = {2013},pages = {1372-1383},doi = {http://doi.ieeecomputersociety.org/10.1109/TCBB.2013.17},publisher = {IEEE Computer Society},address = {Los Alamitos, CA, USA},}
 RefWorks Procite/RefMan/Endnote x TY - JOURJO - IEEE/ACM Transactions on Computational Biology and BioinformaticsTI - Protein Chain Pair Simplification under the Discrete Fréchet DistanceIS - 6SN - 1545-5963SP1372EP1383EPD - 1372-1383A1 - Tim Wylie, A1 - Binhai Zhu, PY - 2013KW - ProteinsKW - Approximation methodsKW - BioinformaticsKW - VisualizationKW - Dynamic programmingKW - Approximation algorithmsKW - NP-completeKW - Protein structure alignmentKW - protein structure simplification and visualizationKW - Discrete Fréchet distanceKW - Approximation algorithmsKW - dynamic programmingVL - 10JA - IEEE/ACM Transactions on Computational Biology and BioinformaticsER -
Tim Wylie, Montana State University, Bozeman
Binhai Zhu, Montana State University, Bozeman
For protein structure alignment and comparison, a lot of work has been done using RMSD as the distance measure, which has drawbacks under certain circumstances. Thus, the discrete Fréchet distance was recently applied to the problem of protein (backbone) structure alignment and comparison with promising results. For this problem, visualization is also important because protein chain backbones can have as many as 500-600 $(\alpha)$-carbon atoms, which constitute the vertices in the comparison. Even with an excellent alignment, the similarity of two polygonal chains can be difficult to visualize unless the chains are nearly identical. Thus, the chain pair simplification problem (CPS-3F) was proposed in 2008 to simultaneously simplify both chains with respect to each other under the discrete Fréchet distance. The complexity of CPS-3F is unknown, so heuristic methods have been developed. Here, we define a variation of CPS-3F, called the constrained CPS-3F problem ($({\rm CPS\hbox{-}3F}^+)$), and prove that it is polynomially solvable by presenting a dynamic programming solution, which we then prove is a factor-2 approximation for CPS-3F. We then compare the $({\rm CPS\hbox{-}3F}^+)$ solutions with previous empirical results, and further demonstrate some of the benefits of the simplified comparisons. Chain pair simplification based on the Hausdorff distance (CPS-2H) is known to be NP-complete, and here we prove that the constrained version ($(\rm CPS\hbox{-}2H^+)$) is also NP-complete. Finally, we discuss future work and implications along with a software library implementation, named the Fréchet-based Protein Alignment & Comparison Toolkit (FPACT).
Index Terms:
Proteins,Approximation methods,Bioinformatics,Visualization,Dynamic programming,Approximation algorithms,NP-complete,Protein structure alignment,protein structure simplification and visualization,Discrete Fr&#x00E9;chet distance,Approximation algorithms,dynamic programming
Citation:
Tim Wylie, Binhai Zhu, "Protein Chain Pair Simplification under the Discrete Fréchet Distance," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. 6, pp. 1372-1383, Nov.-Dec. 2013, doi:10.1109/TCBB.2013.17