Issue No.02 - March/April (2011 vol.8)
pp: 353-367
Ramesh Ram, Monash University, Churchill
Madhu Chetty, Monash University, Churchill
ABSTRACT
An efficient two-step Markov blanket method is presented for modeling and inferring complex regulatory networks from large-scale microarray data sets. The inferred gene regulatory network (GRN) is based on time series gene expression data that capture the underlying gene interactions. To construct a highly accurate GRN, the proposed method performs: 1) discovery of a gene's Markov blanket (MB), 2) formulation of a flexible measure to determine the network's quality, 3) efficient searching with the aid of a guided genetic algorithm, and 4) pruning to obtain a minimal set of correct interactions. Investigations are carried out using both synthetic and yeast cell cycle gene expression data sets. The realistic synthetic data sets validate the robustness of the method under varying topology, sample size, time delay, noise, vertex in-degree, and the presence of hidden nodes. The proposed approach is shown to have excellent inferential capabilities and high accuracy even in the presence of noise. The gene network inferred from yeast cell cycle data is examined for biological relevance using well-known interactions, sequence analysis, motif patterns, and GO data. Further, novel interactions are predicted for the unknown genes of the network, and their influence on other genes is discussed.
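For readers unfamiliar with the term, the Markov blanket of a node in a directed network consists of its parents, its children, and the other parents of its children. The short Python sketch below only illustrates that definition on a known graph; it is not the discovery procedure of the paper, which infers the blanket from expression data. The function name `markov_blanket`, the dictionary layout, and the gene labels A to F are hypothetical and chosen purely for illustration.

```python
from typing import Dict, Set


def markov_blanket(parents: Dict[str, Set[str]], gene: str) -> Set[str]:
    """Return the parents, children, and co-parents of `gene` in a directed graph.

    `parents` maps every gene to the set of its direct regulators.
    """
    # Children are the genes that list `gene` among their regulators.
    children = {g for g, ps in parents.items() if gene in ps}

    # Co-parents (spouses) are the other regulators of those children.
    spouses: Set[str] = set()
    for child in children:
        spouses |= parents[child]

    blanket = set(parents.get(gene, set())) | children | spouses
    blanket.discard(gene)  # a gene is never part of its own blanket
    return blanket


# Hypothetical toy network (assumption, not from the paper):
# F -> A, A -> C, B -> C, C -> D, E -> D
toy_network = {
    "A": {"F"},
    "B": set(),
    "C": {"A", "B"},
    "D": {"C", "E"},
    "E": set(),
    "F": set(),
}

print(sorted(markov_blanket(toy_network, "C")))
```

In this toy network the blanket of C contains its regulators A and B, its target D, and E, which co-regulates D. Conditioned on those four genes, C is independent of the remaining gene F, which is why restricting the search to a blanket can shrink the candidate set of interactions.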
INDEX TERMS
Cause-effect analysis, causal modeling, gene regulatory network, genetic algorithms, microarray gene expression data, network inference.
CITATION
Ramesh Ram, Madhu Chetty, "A Markov-Blanket-Based Model for Gene Regulatory Network Inference," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 2, pp. 353-367, March/April 2011, doi:10.1109/TCBB.2009.70