CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2008 vol.5 Issue No.01 - January-March

Subscribe

Issue No.01 - January-March (2008 vol.5)

pp: 25-41

ABSTRACT

This paper presents two in-depth studies on RnaPredict, an evolutionary algorithm for RNA secondary structure prediction. The first study is an analysis of the performance of two thermodynamic models, INN and INN-HB. The correlation between the free energy of predicted structures and the sensitivity is analyzed for 19 RNA sequences. Although some variance is shown, there is a clear trend between a lower free energy and an increase in true positive base pairs. With increasing sequence length, this correlation generally decreases. In the second experiment, the accuracy of the predicted structures for these 19 sequences are compared against the accuracy of the structures generated by the mfold dynamic programming algorithm (DPA) and also to known structures. RnaPredict is shown to outperform the minimum free energy structures produced by mfold and has comparable performance when compared to sub-optimal structures produced by mfold.

INDEX TERMS

RNA Secondary Structure Prediction, Evolutionary Computation, RnaPredict

CITATION

Kay C. Wiese, Andrew G. Hendriks, "RnaPredict—An Evolutionary Algorithm for RNA Secondary Structure Prediction",

*IEEE/ACM Transactions on Computational Biology and Bioinformatics*, vol.5, no. 1, pp. 25-41, January-March 2008, doi:10.1109/tcbb.2007.1054REFERENCES

- [2] K.C. Wiese and E. Glen, “A Permutation Based Genetic Algorithm for RNA Secondary Structure Prediction,”
Soft Computing Systems, series Frontiers in Artificial Intelligence and Applications, A.Abraham, J.R. del Solar, and M. Koppen, eds., vol. 87, chapter4, pp. 173-182, IOS Press, 2002.- [3] K.C. Wiese and E. Glen, “A Permutation-Based Genetic Algorithm for the RNA Folding Problem: A Critical Look at Selection Strategies, Crossover Operators and Representation Issues,”
BioSystems, special issue on computational intelligence in bioinformatics, vol. 72, pp. 29-41, 2003.- [8] T. Bäck,
Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms. Oxford Univ. Press, 1996.- [10] B.A. Shapiro and J.C. Wu, “An Annealing Mutation Operator in the Genetic Algorithms for RNA Folding,”
Computer Applications in the Biosciences, vol. 12, no. 3, pp. 171-180, 1996.- [14] B.A. Shapiro, D. Bengali, and W. Kasprzak, “Determination of RNA Folding Pathway Functional Intermediates Using a Massively Parallel Genetic Algorithm,”
Proc. ACM SIGKDD Workshop Data Mining in Bioinformatics (BIOKDD '01), p. 1, citeseer.ist.psu.edushapiro01determination.html , 2001.- [17] K.M. Currey and B.A. Shapiro, “Secondary Structure Computer Prediction of the Poliovirus 5' Non-Coding Region Is Improved by a Genetic Algorithm,”
Computer Applications in the Biosciences, vol. 13, no. 1, pp. 1-12, 1997.- [18] S.J. Chen and K.A. Dill, “RNA Folding Energy Landscapes,”
Proc. Nat'l. Academy of Sciences, vol. 97, pp. 646-651, 2000.- [19] I.I. Titov, D.G. Vorobiev, V.A. Ivanisenko, and N.A. Kolchanov, “A Fast Genetic Algorithm for RNA Secondary Structure Analysis,”
Russian Chemical Bull. vol. 51, no. 7, pp. 1135-1144, 2002.- [21] K. Wiese and S.D. Goodwin, “Keep-Best Reproduction: A Local Family Competition Selection Strategy and the Environment It Flourishes In,”
Constraints, vol. 6, no. 4, pp. 399-422, 2001.- [28] R. Nussinov, G. Pieczenik, J.R. Griggs, and D.J. Kleitman, “Algorithms for Loop Matchings,”
SIAM J. Applied Math., vol. 35, pp. 68-82, 1978.- [32] M. Zuker, “Prediction of RNA Secondary Structure by Energy Minimization,”
Computer Analysis of Sequence Data, A.M. Griffin and H.G. Griffin, eds., pp. 267-294, Humana Press, July 1994.- [33] M. Zuker, D.H. Mathews, and D.H. Turner, “Algorithms and Thermodynamics for RNA Secondary Structure Prediction: A Practical Guide,”
RNA Biochemistry and Biotechnology, series NATO ASI Series, J. Barciszewski and B. Clark, eds., Kluwer Academic Publishers, 1999.- [35] J.A. Jaeger, D.H. Turner, and M. Zuker, “Improved Predictions of Secondary Structures for RNA,”
Biochemistry, vol. 86, pp. 7706-7710, Oct. 1989.- [37] D.H. Mathews, T.C. Andre, J. Kim, D.H. Turner, and M. Zuker, “An Updated Recursive Algorithm for RNA Secondary Structure Prediction with Improved Free Energy Parameters,”
Am. Chemical Soc., N.B. Leontis and J. SantaLucia Jr., eds., chapter 15, pp. 246-257, Am. Chemical Soc., 1998.- [38] D.H. Mathews, M.D. Disney, J.L. Childs, S.J. Schroeder, M. Zuker, and D.H. Turner, “Incorporating Chemical Modification Constraints into a Dynamic Programming Algorithm for Prediction of RNA Secondary Structure,”
Proc. Nat'l Academy of Sciences, vol. 101, pp. 7287-7292, 2004.- [39] S.M. Freier, R. Kierzek, J.A. Jaeger, N. Sugimoto, M.H. Caruthers, T. Neilson, and D.H. Turner, “Improved Free-Energy Parameters for Predictions of RNA Duplex Stability,”
Proc. Nat'l Academy of Sciences, vol. 83, pp. 9373-9377, 1986.- [41] T. Xia, J. John SantaLucia, M.E. Burkard, R. Kierzek, S.J. Schroeder, X. Jiao, C. Cox, and D.H. Turner, “Thermodynamic Parameters for an Expanded Nearest-Neighbor Model for Formation of RNA Duplexes with Watson-Crick Base Pairs,”
Biochemistry, vol. 37, pp.14719-14735, 1998.- [43] S.M. Freier, B.J. Burger, D. Alkema, T. Neilson, and D.H. Turner, “Effects of 3' Dangling End Stacking on the Stability of GGCC and CCGG Double Helixes,”
Biochemistry, vol. 22, no. 26, pp. 6198-6206, 1983.- [44] R. Kierzek, M.H. Caruthers, C.E. Longfellow, D. Swinton, D.H. Turner, and S.M. Freier, “Polymer-Supported RNA Synthesis and Its Application to Test the Nearest Neighbor Model for Duplex Stability,”
Biochemistry, vol. 25, pp. 7840-7846, June 1986.- [46] N. Sugimoto, R. Kierzek, S.M. Freier, and D.H. Turner, “Energetics of Internal GU Mismatches in Ribooligonucleotide Helixes,”
Biochemistry, vol. 25, no. 19, pp. 5755-5759, 1986.- [47] L. He, R. Kierzek, J. SantaLucia Jr., A.E. Walter, and D.H. Runer, “Nearest-Neighbor Parameters for GU Mismatches: GU/UG is Destabilizing in the Contexts CGUG/GUGC, UGUA/AUGU but Stabillizing in GGUC/CUGG,”
Biochemistry, vol. 30, pp. 11124-11132, 1991.- [48] M. Wu, J.A. McDowell, and D.H. Turner, “A Periodic Table of Symmetric Tandem Mismatches in RNA,”
Biochemistry, vol. 34, pp. 3204-3211, 1995.- [49] T. Xia, J.A. McDowell, and D.H. Turner, “Thermodynamics of Nonsymmetric Tandem Mismatches Adjacent to GC Base Pairs in RNA,”
Biochemistry, vol. 36, pp. 12486-12497, 1997.- [50] A. Deschenes, “A Genetic Algorithm for RNA Secondary Structure Prediction Using Stacking Energy Thermodynamic Models,” master's thesis, Simon Fraser Univ., 2005.
- [51] I.M. Oliver, D.J. Smith, and J.R.C. Holland, “A Study of Permutation Crossover Operators on the Traveling Salesman Problem,”
Proc. Second Int'l Conf. Genetic Algorithms (ICGA '87), pp.224-230, 1987.- [52] G. Syswerda, “Schedule Optimization Using Genetic Algorithms,”
Handbook of Genetic Algorithms, L. Davis, ed., Van Nostrand Reinhold, 1991.- [53] D.E. Goldberg and R. Lingle, Jr, “Alleles, Loci and the Travelling Salesman Problem,”
Proc. First Int'l Conf. Genetic Algorithms, J.Grefenstette, ed., pp. 154-159, 1985.- [54] J.J. Cannone, S. Subramanian, M.N. Schnare, J.R. Collett, L.M. D'Souza, Y. Du, B. Feng, N. Lin, L.V. Madabusi, K.M. Müller, N. Pande, Z. Shang, N. Yu, and R.R. Gutell, “The Comparative RNA Web (CRW) Site: An Online Database of Comparative Sequence and Structure Information for Ribosomal, Intron and Other RNAs,”
BMC Bioinformatics, vol. 3, 2002.- [55] N.A. Weiss,
Elementary Statistics. Addison-Wesley, 1999. |