
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Ankit Agrawal, Xiaoqiu Huang, "Pairwise Statistical Significance of Local Sequence Alignment Using SequenceSpecific and PositionSpecific Substitution Matrices," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 1, pp. 194205, JanuaryFebruary, 2011.  
BibTex  x  
@article{ 10.1109/TCBB.2009.69, author = {Ankit Agrawal and Xiaoqiu Huang}, title = {Pairwise Statistical Significance of Local Sequence Alignment Using SequenceSpecific and PositionSpecific Substitution Matrices}, journal ={IEEE/ACM Transactions on Computational Biology and Bioinformatics}, volume = {8}, number = {1}, issn = {15455963}, year = {2011}, pages = {194205}, doi = {http://doi.ieeecomputersociety.org/10.1109/TCBB.2009.69}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE/ACM Transactions on Computational Biology and Bioinformatics TI  Pairwise Statistical Significance of Local Sequence Alignment Using SequenceSpecific and PositionSpecific Substitution Matrices IS  1 SN  15455963 SP194 EP205 EPD  194205 A1  Ankit Agrawal, A1  Xiaoqiu Huang, PY  2011 KW  Database statistical significance KW  homologs KW  pairwise statistical significance KW  positionspecific scoring matrices (PSSMs) KW  sequence alignment KW  substitution matrices. VL  8 JA  IEEE/ACM Transactions on Computational Biology and Bioinformatics ER   
[1] W.R. Pearson and D.J. Lipman, "Improved Tools for Biological Sequence Comparison," Proc. Nat'l Academy of Sciences USA vol. 85, no. 8, pp. 24442448, http://www.pnas.org/cgi/content/abstract/ 85/82444, 1988.
[2] S.F. Altschul, W. Gish, W. Miller, E.W. Myers, and D.J. Lipman, "Basic Local Alignment Search Tool.," J. Molecular Biology, vol. 215, no. 3, pp. 403410, http://dx.doi.org/10.1006jmbi.1990.9999, 1990.
[3] S.F. Altschul, T.L. Madden, A.A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D.J. Lipman, "Gapped BLAST PSIBLAST: A New Generation of Protein Database Search Programs," Nucleic Acids Research, vol. 25, no. 17, pp. 33893402, http://dx.doi.org/10.1093/nar25.17.3389, 1997.
[4] T.F. Smith and M.S. Waterman, "Identification of Common Molecular Subsequences," J. Molecular Biology, vol. 147, no. 1, pp. 195197, http://view.ncbi.nlm.nih.gov/pubmed7265238 , 1981.
[5] O. Gotoh, "An Improved Algorithm for Matching Biological Sequences," J. Molecular Biology, vol. 162, no. 3, pp. 705708, Dec. 1982.
[6] P.H. Sellers, "Pattern Recognition in Genetic Sequences by Mismatch Density," Bull. of Math. Biology, vol. 46, no. 4, pp. 501514, http://www.springerlink.com/content2v4477481102w030 , 1984.
[7] W.R. Pearson, "Effective Protein Sequence Comparison," Methods in Enzymology, vol. 266, pp. 227259, 1996.
[8] W.R. Pearson, "Flexible Sequence Similarity Searching with the FASTA3 Program Package," Methods in Molecular Biology, vol. 132, pp. 185219, 2000.
[9] B. Ma, J. Tromp, and M. Li, "PatternHunter: Faster and More Sensitive Homology Search," Bioinformatics, vol. 18, no. 3, pp. 440445, 2002.
[10] M. Li, B. Ma, D. Kisman, and J. Tromp, "PatternHunter II: Highly Sensitive and Fast Homology Search," J. Bioinformatics and Computational Biology, vol. 2, no. 3, pp. 417439, 2004.
[11] K.M. Chao, "Calign: Aligning Sequences with Restricted Affine Gap Penalties," Bioinformatics, vol. 15, no. 4, pp. 298304, 1999.
[12] X. Huang and K.M. Chao, "A Generalized Global Alignment Algorithm," Bioinformatics, vol. 19, no. 2, pp. 228233, 2003.
[13] X. Huang and D.L. Brutlag, "Dynamic Use of Multiple Parameter Sets in Sequence Alignment," Nucleic Acids Research, vol. 35, no. 2, pp. 678686, http://nar.oxfordjournals.org/cgi/content/ abstract/35/2678, 2007.
[14] R. Mott, "Alignment: Statistical Significance," Encyclopedia of Life Science, http://mrw.interscience.wiley.com/emrw/9780470015902/ els/article/a0005264/current abstract, 2005.
[15] S.F. Altschul, M.S. Boguski, W. Gish, and J.C. Wootton, "Issues in Searching Molecular Sequence Databases," Nature Genetics, vol. 6, no. 2, pp. 119129, 1994.
[16] S. Karlin and S.F. Altschul, "Methods for Assessing the Statistical Significance of Molecular Sequence Features by Using General Scoring Schemes," Proc. Nat'l Academy of Sciences USA, vol. 87, no. 6, pp. 22642268, http://www.pnas.org/cgi/content/ abstract/ 87/62264, 1990.
[17] M.S. Waterman and M. Vingron, "Rapid, Accurate Estimates of Statistical Significance for Sequence Database Searches," Proc. Nat'l Academy of Sciences USA, vol. 91, no. 11, pp. 46254628, http://www.pnas.org/cgi/content/abstract/ 91/114625, 1994.
[18] S.F. Altschul and W. Gish, "Local Alignment Statistics," Methods in Enzymology, vol. 266, pp. 46080, 1996.
[19] W.R. Pearson, "Empirical Statistical Estimates for Sequence Similarity Searches," J. Molecular Biology, vol. 276, pp. 7184, 1998.
[20] R. Mott and R. Tribe, "Approximate Statistics of Gapped Alignments," J. Computational Biology, vol. 6, no. 1, pp. 91112, 1999.
[21] R. Mott, "Accurate Formula for PValues of Gapped Local Sequence and Profile Alignments," J. Molecular Biology, vol. 300, pp. 649659, 2000.
[22] R. Bundschuh, "Rapid Significance Estimation in Local Sequence Alignment with Gaps," Proc. Fifth Ann. Int'l Conf. Research in Computational Molecular Biology (RECOMB '01), pp. 7785, 2001.
[23] S.F. Altschul, R. Bundschuh, R. Olsen, and T. Hwa, "The Estimation of Statistical Parameters for Local Alignment Score Distributions," Nucleic Acids Research, vol. 29, no. 2, pp. 351361, 2001.
[24] A.A. Schäffer, L. Aravind, T.L. Madden, S. Shavirin, J.L. Spouge, Y.I. Wolf, E.V. Koonin, and S.F. Altschul, "Improving the Accuracy of PSIBLAST Protein Database Searches with CompositionBased Statistics and Other Refinements," Nucleic Acids Research, vol. 29, no. 14, pp. 29943005, 2001.
[25] S. Sheetlin, Y. Park, and J.L. Spouge, "The Gumbel PreFactor $k$ for Gapped Local Alignment Can Be Estimated from Simulations of Global Alignment," Nucleic Acids Research, vol. 33, no. 15, pp. 49874994, 2005.
[26] A. Poleksic, J.F. Danzer, K. Hambly, and D.A. Debe, "Convergent Island Statistics: A Fast Method for Determining Local Alignment Score Significance," Bioinformatics, vol. 21, no. 12, pp. 28272831, 2005.
[27] Y.K. Yu, E.M. Gertz, R. Agarwala, A.A. Schäffer, and S.F. Altschul, "Retrieval Accuracy, Statistical Significance and Compositional Similarity in Protein Sequence Database Searches," Nucleic Acids Research, vol. 34, no. 20, pp. 59665973, 2006.
[28] A. Agrawal, V.P. Brendel, and X. Huang, "Pairwise Statistical Significance and Empirical Determination of Effective Gap Opening Penalties for Protein Local Sequence Alignment," Int'l J. Computational Biology and Drug Design, vol. 1, no. 4, pp. 347367, 2008.
[29] A. Agrawal and X. Huang, "Conservative, NonConservative and Average Pairwise Statistical Significance of Local Sequence Alignment," Proc. IEEE Int'l Conf. Bioinformatics and Biomedicine, pp. 433436, 2008.
[30] M. Kschischo, M. Lässig, and Y.K. Yu, "Toward an Accurate Statistics of Gapped Alignments," Bull. of Math. Biology, vol. 67, pp. 169191, 2004.
[31] S. Grossmann and B. Yakir, "Large Deviations for Global Maxima of Independent Superadditive Processes with Negative Drift and an Application to Optimal Sequence Alignments," Bernoulli, vol. 10, no. 5, pp. 829845, 2004.
[32] M. Pagni and C.V. Jongeneel, "Making Sense of Score Statistics for Sequence Alignments," Briefings in Bioinformatics, vol. 2, no. 1, pp. 5167, 2001.
[33] W.R. Pearson and T.C. Wood, "Statistical Significance in Biological Sequence Comparison," Handbook of Statistical Genetics, D. J. Balding, M. Bishop, and C. Cannings, eds., pp. 3966, Wiley, 2001.
[34] A.Y. Mitrophanov and M. Borodovsky, "Statistical Significance in Biological Sequence Analysis," Briefings in Bioinformatics, vol. 7, no. 1, pp. 224, 2006.
[35] Y.K. Yu and S.F. Altschul, "The Construction of Amino Acid Substitution Matrices for the Comparison of Proteins with NonStandard Compositions," Bioinformatics, vol. 21, no. 7 pp. 902911, 2005.
[36] S.R. Eddy, "Maximum Likelihood Fitting of Extreme Value Distributions," unpublished work, citeseer.ist.psu.edu370503.html, 1997.
[37] A. Agrawal and X. Huang, "Pairwise Statistical Significance of Local Sequence Alignment Using Multiple Parameter Sets and Empirical Justification of Parameter Set Change Penalty," BMC Bioinformatics, vol. 10, suppl. 3, p. S1, 2009.
[38] A. Agrawal and X. Huang, "Pairwise Statistical Significance of Local Sequence Alignment Using Substitution Matrices with SequencePairSpecific Distance," Proc. Int'l Conf. Information Technology, (ICIT '08), pp. 9499, 2008.
[39] M.L. Sierk and W.R. Pearson, "Sensitivity and Selectivity in Protein Structure Comparison," Protein Science, vol. 13, no. 3, pp. 773785, 2004.
[40] S. Kotz and S. Nadarajah, Extreme Value Distributions: Theory and Applications, ch. 1, pp. 34. Imperial College Press, 2000.
[41] S. Wolfsheimer, B. Burghardt, and A.K. Hartmann, "Local Sequence Alignments Statistics: Deviations from Gumbel Statistics in the RareEvent Tail," Algorithms for Molecular Biology, vol. 2, p. 9, 2007.
[42] A.K. Hartmann, "Sampling Rare Events: Statistics of Local Sequence Alignments," Physical Rev. E, vol. 65, no. 5, p. 056102, 2002.
[43] R. Olsen, R. Bundschuh, and T. Hwa, "Rapid Assessment of Extremal Statistics for Gapped Local Alignment," Proc. Seventh Int'l Conf. Intelligent Systems for Molecular Biology, pp. 211222, 1999.
[44] R.F. Mott, "MaximumLikelihood Estimation of the Statistical Distribution of Smith Waterman Local Sequence Similarity Scores," Bull. of Math. Biology, vol. 54, pp. 5975, 1992.
[45] S.R. Eddy, "Where did the Blosum62 Alignment Score Matrix Come from?," Nature Biotechnology, vol. 22, no. 8, pp. 10351036, Aug. 2004.
[46] C.A. Orengo, A.D. Michie, S. Jones, D.T. Jones, M.B. Swindells, and J.M. Thornton, "CATH—A Hierarchic Classification of Protein Domain Structures," Structure, vol. 28, no. 1, pp. 10931108, 1997.
[47] J. Rocha, F. Rosselló, and J. Segura, "Compression Ratios Based on the Universal Similarity Metric Still Yield Protein Distances Far from CATH Distances," CoRR, vol. abs/qbio/0603007, 2006.
[48] D.S. Hirschberg, "A Linear Space Algorithm for Computing Maximal Common Subsequences," Comm. ACM, vol. 18, no. 6, pp. 341343, 1975.
[49] S. Altschul and B. Erickson, "Optimal Sequence Alignment Using Affine Gap Costs," Bull. of Math. Biology, vol. 48, no. 5, pp. 603616, Sept. 1986.