CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2012 vol.9 Issue No.05 - Sept.-Oct.

Subscribe

Issue No.05 - Sept.-Oct. (2012 vol.9)

pp: 1352-1365

Sucheendra K. Palaniappan , Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore

S. Akshay , IRISA, ENS Cachan Bretagne, Rennes, France

Bing Liu , Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore

Blaise Genest , IRISA, Rennes, France

P. S. Thiagarajan , Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.60

ABSTRACT

Dynamic Bayesian Networks (DBNs) can serve as succinct probabilistic dynamic models of biochemical networks [1]. To analyze these models, one must compute the probability distribution over system states at a given time point. Doing this exactly is infeasible for large models; hence one must use approximate algorithms. The Factored Frontier algorithm (FF) is one such algorithm [2]. However FF as well as the earlier Boyen-Koller (BK) algorithm [3] can incur large errors. To address this, we present a new approximate algorithm called the Hybrid Factored Frontier (HFF) algorithm. At each time slice, in addition to maintaining probability distributions over local states-as FF does-HFF explicitly maintains the probabilities of a number of global states called spikes. When the number of spikes is 0, we get FF and with all global states as spikes, we get the exact inference algorithm. We show that by increasing the number of spikes one can reduce errors while the additional computational effort required is only quadratic in the number of spikes. We validated the performance of HFF on large DBN models of biopathways. Each pathway has more than 30 species and the corresponding DBN has more than 3,000 nodes. Comparisons with FF and BK show that HFF is a useful and powerful approximate inferencing algorithm for DBNs.

INDEX TERMS

probability, belief networks, bioinformatics, bioinformatics, hybrid factored frontier algorithm, dynamic Bayesian networks, biopathway application, probabilistic distribution dynamic models, biochemical networks, Boyen-Koller algorithm, spikes, global states, DBN models, Biological system modeling, Mathematical model, Trajectory, Approximation algorithms, Probability distribution, Approximation methods, Computational modeling, life and medical sciences—biology and genetics., Probability and statistics, symbolic and algebraic manipulation—algorithms

CITATION

Sucheendra K. Palaniappan, S. Akshay, Bing Liu, Blaise Genest, P. S. Thiagarajan, "A Hybrid Factored Frontier Algorithm for Dynamic Bayesian Networks with a Biopathways Application",

*IEEE/ACM Transactions on Computational Biology and Bioinformatics*, vol.9, no. 5, pp. 1352-1365, Sept.-Oct. 2012, doi:10.1109/TCBB.2012.60REFERENCES

- [1] B. Liu, D. Hsu, and P.S. Thiagarajan, "Probabilistic Approximations of ODEs Based Bio-Pathway Dynamics,"
Theoretical Computer Science, vol. 412, pp. 2188-2206, 2011.- [2] K.P. Murphy and Y. Weiss, "The Factored Frontier Algorithm for Approximate Inference in DBNs,"
Proc. 17th Int'l Conf. Uncertainty in Artificial Intelligence (UAI '01), pp. 378-385, 2001.- [3] X. Boyen and D. Koller, "Tractable Inference for Complex Stochastic Processes,"
Proc. 14th Int'l Conf. Uncertainty in Artificial Intelligence (UAI '98), pp. 33-42, 1998.- [4] B. Liu, J. Zhang, P.Y. Tan, D. Hsu, A.M. Blom, B. Leong, S. Sethi, B. Ho, J.L. Ding, and P.S. Thiagarajan, "A Computational and Experimental Study of the Regulatory Mechanisms of the Complement System,"
PLoS Computational Biology, vol. 7, no. 1, p. e1001059, 2011.- [5] G. Koh, D. Hsu, and P.S. Thiagarajan, "Incremental Signaling Pathway Modeling by Data Integration,"
Proc. Int'l Conf. Research in Computational Molecular Biology (RECOMB '10), pp. 281-296, 2010.- [6] C. Langmead, S. Jha, and E. Clarke, "Temporal Logics as Query Languages for Dynamic Bayesian Networks: Application to D. Melanogaster Embryo Development," technical report, Carnegie Mellon Univ., 2006.
- [7] D. Koller and N. Friedman,
Probabilistic Graphical Models: Principles and Techniques. MIT Press, 2009.- [8] K.S. Brown, C.C. Hill, G.A. Calero, C.R. Myers, K.H. Lee, and R.A. Cerione, "The Statistical Mechanics of Complex Signaling Networks: Nerve Growth Factor Signaling,"
Physical Biology, vol. 1, pp. 184-195, 2004.- [9] M. Schilling, T. Maiwald, S. Hengl, D. Winter, C. Kreutz, W. Kolch, W.D. Lehmann, J. Timmer, and U. Klingmüller, "Theoretical and Experimental Analysis Links Isoform-Specific ERK Signalling to Cell Fate Decisions,"
Molecular Systems Biology, vol. 5, p. 334, 2009.- [10] P.F. Felzenszwalb and D.P. Huttenlocher, "Efficient Belief Propagation for Early Vision,"
Int'l J. Computer Vision, vol. 70, pp. 41-54, 2006.- [11] R.J. Mceliece, D.J.C. Mackay, and J.-F. Cheng, "Turbo Decoding as an Instance of Pearl's "Belief Propagation" Algorithm,"
IEEE J. Selected Areas in Comm., vol. 16, no. 2, pp. 140-152, Feb. 1998.- [12] N. Friedman, "Inferring Cellular Networks Using Probabilistic Graphical Models,"
Science, vol. 303, pp. 799-805, 2004.- [13] B. Bidyuk and R. Dechter, "An Anytime Scheme for Bounding Posterior Beliefs,"
Proc. 21st Nat'l Conf. Artificial Intelligence (AAAI '06), pp. 1-6, 2006.- [14] J. Bilmes and H. Lin, "Online Adaptive Learning for Speech Recognition Decoding,"
Proc. Ann. Conf. Int'l Speech Comm. Assoc. (Interspeech '10), pp. 1958-1961, 2010.- [15] M.Z. Kwiatkowska, G. Norman, and D. Parker, "PRISM: Probabilistic Symbolic Model Checker,"
Proc. 12th Int'l Conf. Modelling Techniques and Tools for Computer Performance Evaluation (TOOLS '02), pp. 200-204, 2002.- [16] M. Calder, V. Vyshemirsky, D. Gilbert, and R.J. Orton, "Analysis of Signalling Pathways Using Continuous Time Markov Chains,"
Trans. Computational Systems Biology, vol. 4220, pp. 44-67, 2006.- [17] W.S. Hlavacek, J.R. Faeder, M.L. Blinov, R.G. Posner, M. Hucka, and W. Fontana, "Rules for Modeling Signal-Transduction Systems,"
Science STKE, vol. 2006, p. re6, 2006.- [18] V. Danos, J. Feret, W. Fontana, R. Harmer, and J. Krivine, "Rule-Based Modelling of Cellular Signalling,"
Proc. 18th Int'l Conf. Concurrency Theory (CONCUR '07), pp. 17-41, 2007.- [19] B. Liu, P.S. Thiagarajan, and D. Hsu, "Probabilistic Approximations of Signaling Pathway Dynamics,"
Proc. Seventh Int'l Conf. Computational Methods in Systems Biology (CMSB '09), pp. 251-265, 2009.- [20] T.A. Henzinger, M. Mateescu, and V. Wolf, "Sliding Window Abstraction for Infinite Markov Chains,"
Proc. 21th Int'l Conf. Computer Aided Verification (CAV '09), pp. 337-352, 2009.- [21] S.K. Jha, E.M. Clarke, C.J. Langmead, A. Legay, A. Platzer, and P. Zuliani, "A Bayesian Approach to Model Checking Biological Systems,"
Proc. Seventh Int'l Conf. Computational Methods in Systems Biology (CMSB '09), pp. 218-234, 2009.- [22] M.Z. Kwiatkowska, G. Norman, and D. Parker, "Probabilistic Model Checking for Systems Biology,"
Symbolic Systems Biology, Jones and Bartlett, 2010.- [23] R. Grosu and S.A. Smolka, "Monte Carlo Model Checking,"
Proc. 11th Int'l Conf. Tools and Algorithms for Construction and Analysis of Systems (TACAS '05), pp. 271-286, 2005.- [24] F. Fages and A. Rizk, "On the Analysis of Numerical Data Time Series in Temporal Logic,"
Proc. Int'l Conf. Computational Methods in Systems Biology (CMSB '07), pp. 48-63, 2007.- [25] R. Donaldson and D. Gilbert, "A Model Checking Approach to the Parameter Estimation of Biochemical Pathways,"
Proc. Sixth Int'l Conf. Computational Methods in Systems Biology (CMSB '08), pp. 269-287, 2008.- [26] F. Ciocchetta, A. Degasperi, J. Hillston, and M. Calder, "Some Investigations Concerning the CTMC and the ODE Model Derived from Bio-PEPA,"
Electronic Notes in Theoretical Computer Science, vol. 229, no. 1, pp. 145-163, 2009.- [27] F. Didier, T.A. Henzinger, M. Mateescu, and V. Wolf, "Approximation of Event Probabilities in Noisy Cellular Processes,"
Proc. Seventh Int'l Conf. Computational Methods in Systems Biology (CMSB '09), pp. 173-188, 2009.- [28] "Supplementary Materials," http://www.comp.nus.edu.sg/~sucheeHFF/, 2011.
- [29] S.K. Palaniappan, S. Akshay, B. Genest, and P.S. Thiagarajan, "A Hybrid Factored Frontier Algorithm for Dynamic Bayesian Network Models of Biopathways,"
Proc. Ninth Int'l Conf. Computational Methods in Systems Biology (CMSB '11), pp. 35-44, 2011.- [30] B.B. Aldridge, J.M. Burke, D.A. Lauffenburger, and P.K. Sorger, "Physicochemical Modelling of Cell Signalling Pathways,"
Nature Cell Biology, vol. 8, pp. 1195-1203, 2006.- [31] P. Bremaud,
Markov Chains: Gibbs Fields, Monte Carlo Simulation and Queues. Springer, 2010.- [32] N. Le Novere, B. Bornstein, A. Broicher, M. Courtot, M. Donizelli, H. Dharuri, L. Li, H. Sauro, M. Schilstra, B. Shapiro, J. Snoep, and M. Hucka, "BioModels Database: A Free, Centralized Database of Curated, Published, Quantitative Kinetic Models of Biochemical and Cellular Systems,"
Nucleic Acids Research, vol. 34, pp. D689-D691, 2006.- [33] K.P. Murphy, "Bayes Net Toolbox for Matlab," http:/bnt. googlecode.com, 2012.
- [34] B.N. Kholodenko, "Untangling the Signalling Wires,"
Nature Cell Biology, vol. 9, pp. 247-249, 2007. |