Issue No.02 - March-April (2013 vol.10)
pp: 494-503
Tsung-Heng Tsai , Dept. of Electr. & Comput. Eng., Georgetown Univ., Washington, DC, USA
Mahlet G. Tadesse , Dept. of Math. & Stat., Georgetown Univ., Washington, DC, USA
Yue Wang , Dept. of Electr. & Comput. Eng., Virginia Tech, Arlington, VA, USA
Habtom W. Ressom , Dept. of Oncology, Georgetown Univ., Washington, DC, USA
ABSTRACT
A Bayesian alignment model (BAM) is proposed for alignment of liquid chromatography-mass spectrometry (LC-MS) data. BAM belongs to the category of profile-based approaches, which are composed of two major components: a prototype function and a set of mapping functions. Appropriate estimation of these functions is crucial for good alignment results. BAM uses Markov chain Monte Carlo (MCMC) methods to draw inference on the model parameters and improves on existing MCMC-based alignment methods through 1) the implementation of an efficient MCMC sampler and 2) an adaptive selection of knots. A block Metropolis-Hastings algorithm that mitigates the problem of the MCMC sampler getting stuck at local modes of the posterior distribution is used for the update of the mapping function coefficients. In addition, a stochastic search variable selection (SSVS) methodology is used to determine the number and positions of knots. We applied BAM to a simulated data set, an LC-MS proteomic data set, and two LC-MS metabolomic data sets, and compared its performance with the Bayesian hierarchical curve registration (BHCR) model, the dynamic time-warping (DTW) model, and the continuous profile model (CPM). The advantage of applying appropriate profile-based retention time correction prior to performing a feature-based approach is also demonstrated through the metabolomic data sets.
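The block Metropolis-Hastings idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the BAM sampler: the target below is a toy correlated bivariate normal standing in for a posterior over mapping function coefficients, and the names `log_target`, `block_mh`, `blocks`, and `step` are hypothetical choices made for illustration. The point it shows is only that coefficients in the same block are proposed and accepted or rejected jointly rather than one at a time.

```python
import math
import random


def log_target(theta, rho=0.9):
    """Toy log density (up to a constant): zero-mean bivariate normal, correlation rho.

    Assumption: this stands in for a posterior over two correlated coefficients.
    """
    x, y = theta
    return -(x * x - 2.0 * rho * x * y + y * y) / (2.0 * (1.0 - rho * rho))


def block_mh(log_post, init, blocks, n_iter=5000, step=0.5, seed=0):
    """Random-walk Metropolis-Hastings that updates parameters block by block.

    blocks is a list of index lists; every index in a block is perturbed in the
    same proposal, so the block is accepted or rejected as a whole.
    Returns (samples, acceptance_rate).
    """
    rng = random.Random(seed)
    theta = list(init)
    current = log_post(theta)
    samples = []
    accepted = 0
    for _ in range(n_iter):
        for block in blocks:
            proposal = list(theta)
            for j in block:
                proposal[j] += rng.gauss(0.0, step)  # symmetric Gaussian step
            candidate = log_post(proposal)
            # Symmetric proposal, so the acceptance probability is
            # min(1, target(proposal) / target(current)).
            if rng.random() < math.exp(min(0.0, candidate - current)):
                theta, current = proposal, candidate
                accepted += 1
        samples.append(tuple(theta))
    return samples, accepted / (n_iter * len(blocks))


if __name__ == "__main__":
    # One block containing both coordinates: a joint update.
    joint, joint_rate = block_mh(log_target, [0.0, 0.0], blocks=[[0, 1]])
    # Two single-coordinate blocks: component-wise updates, for comparison.
    single, single_rate = block_mh(log_target, [0.0, 0.0], blocks=[[0], [1]])
    print("joint acceptance rate:", round(joint_rate, 3))
    print("component-wise acceptance rate:", round(single_rate, 3))
```

Passing `blocks=[[0, 1]]` updates both coordinates together, while `blocks=[[0], [1]]` recovers a component-wise sampler. How BAM actually forms its blocks and proposals is described in the paper itself and is not reproduced here.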
INDEX TERMS
Bayes methods, Stochastic processes, Monte Carlo methods, Chromatography-mass spectrometry, stochastic search variable selection (SSVS), Alignment, Bayesian inference, block Metropolis-Hastings algorithm, liquid chromatography-mass spectrometry (LC-MS), Markov chain Monte Carlo (MCMC)
CITATION
Tsung-Heng Tsai, Mahlet G. Tadesse, Yue Wang, Habtom W. Ressom, "Profile-Based LC-MS Data Alignment - A Bayesian Approach", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.10, no. 2, pp. 494-503, March-April 2013, doi:10.1109/TCBB.2013.25