The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.01 - Jan.-Feb. (2013 vol.10)
pp: 37-49
Carlos H. A. Higa , Coll. of Comput., Fed. Univ. of Mato Grosso do Sul, Campo Grande, Brazil
Tales P. Andrade , Inst. of Math. & Stat., Univ. of Sao Paulo, Sao Paulo, Brazil
Ronaldo F. Hashimoto , Inst. of Math. & Stat., Univ. of Sao Paulo, Sao Paulo, Brazil
ABSTRACT
Models of gene regulatory networks (GRN) have been proposed along with algorithms for inferring their structure. By structure, we mean the relationships among the genes of the biological system under study. Despite the large number of genes found in the genome of an organism, it is believed that a small set of genes is responsible for maintaining a specific core regulatory mechanism (small subnetworks). We propose an algorithm for inference of subnetworks of genes from a small initial set of genes called seed and time series gene expression data. The algorithm has two main steps: First, it grows the seed of genes by adding genes to it, and second, it searches for subnetworks that can be biologically meaningful. The seed growing step is treated as a feature selection problem and we used a thresholded Boolean network with a perturbation model to design the criterion function that is used to select the features (genes). Given that the reverse engineering of GRN is a problem that does not necessarily have one unique solution, the proposed algorithm has as output a set of networks instead of one single network. The algorithm also analyzes the dynamics of the networks which can be time-consuming. Nevertheless, the algorithm is suitable when the number of genes is small. The results showed that the algorithm is capable of recovering an acceptable rate of gene interactions and to generate regulatory hypotheses that can be explored in the wet lab.
INDEX TERMS
Biological system modeling, Inference algorithms, Computational modeling, Bioinformatics, Gene expression, Boolean functions, Time series analysis,reverse engineering, Gene regulatory networks, Boolean networks, inference
CITATION
Carlos H. A. Higa, Tales P. Andrade, Ronaldo F. Hashimoto, "Growing Seed Genes from Time Series Data and Thresholded Boolean Networks with Perturbation", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.10, no. 1, pp. 37-49, Jan.-Feb. 2013, doi:10.1109/TCBB.2012.169
REFERENCES
[1] S.A. Kauffman, “Metabolic Stability and Epigenesis in Randomly Constructed Genetic Nets,” J. Theoretical Biology, vol. 22, pp. 437-467, 1969.
[2] I. Shmulevich, E.R. Dougherty, S. Kim, and W. Zhang, “Probabilistic Boolean Networks: A Rule-Based Uncertainty Model for Gene Regulatory Networks,” Bioinformatics, vol. 18, no. 2, pp. 261-274, 2002.
[3] N. Friedman, M. Linial, I. Nachman, and D. Péer, “Using Bayesian Networks to Analyze Expression Data,” J. Computational Biology, vol. 7, pp. 601-620, 2000.
[4] B. Goodwin, Temporal Organization in Cells; A Dynamic Theory of Cellular Control Process. Academic Press, 1963.
[5] M.A. Savageau, Biochemical Systems Analysis. Addison-Wesley, 1970.
[6] G. Karlebach and R. Shamir, “Modelling and Analysis of Gene Regulatory Networks,” Nature, vol. 9, pp. 770-780, 2008.
[7] M. Hecker, S. Lambeck, S. Toepfer, E. van Someren, and E. Guthke, “Gene Regulatory Network Inference: Data Integration in Dynamic Models—A Review,” BioSystems, vol. 96, pp. 86-103, 2009.
[8] S. Liang, S. Fuhrman, and R. Somogyi, “Reveal, a General Reverse Engineering Algorithm for Inference of Genetic Network Architectures,” Proc. Pacific Symp. Biocomputing, pp. 18-29, 1998.
[9] R. Pal, I. Ivanov, A. Datta, M.L. Bittner, and E.R. Dougherty, “Generating Boolean Networks with a Prescribed Attractor Structure,” Bioinformatics, vol. 21, no. 21, pp. 4021-4025, 2005.
[10] E.R. Dougherty and Y. Xiao, “Design of Probabilistic Boolean Networks under the Requirement of Contextual Data Consistency,” IEEE Trans. Signal Processing, vol. 54, no. 9, pp. 3601-3613, Sept. 2006.
[11] A. Margolin, I. Nemenman, K. Basso, C. Wiggins, G. Stolovitzky, R. Favera, and A. Califano, “ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context,” BMC Bioinformatics, vol. 7, no. Suppl. 1, article S7, 2006.
[12] F. Lopes, E. de Oliveira, and R. Cesar, “Inference of Gene Regulatory Networks from Time Series by Tsallis Entropy,” BMC Systems Biology, vol. 5, no. 1, p. 61, 2011.
[13] P.M. Das, K. Ramachandran, J. van Wert, and R. Singal, “Chromatin Immunoprecipitation Assay,” Biotechniques, vol. 37, no. 6, pp. 961-969, 2004.
[14] M. Zheng, L.O. Barrera, B. Ren, and Y.N. Wu, “ChIP-Chip: Data, Model, and Analysis,” Biometrics, vol. 63, no. 3, pp. 787-796, 2007.
[15] M. Selbach and M. Mann, “Protein Interaction Screening by Quantitative Immunoprecipitation Combined with Knockdown (QUICK),” Nature Methods, vol. 3, no. 12, pp. 981-983, 2006.
[16] T. Akutsu, S. Miyano, and S. Kuhara, “Identification of Genetic Networks from a Small Number of Gene Expression Patterns under the Boolean Network Model,” Proc. Pacific Symp. Biocomputing, pp. 17-28, 1999.
[17] M. Maucher, B. Kracher, M. Kühl, and H.A. Kestler, “Inferring Boolean Network Structure via Correlation,” Bioinformatics, vol. 27, no. 11, pp. 1529-1536, 2011.
[18] G.J. Hickman and T.C. Hodgman, “Inference of Gene Regulatory Networks Using Boolean-Network Inference Methods,” J. Bioinformatics and Computational Biology, vol. 7, no. 6, pp. 1013-1029, 2009.
[19] C. Sima, J. Hua, and S. Jung, “Inference of Gene Regulatory Networks Using Time-Series Data: A Survey,” Current Genomics, vol. 10, no. 6, pp. 416-429, 2009.
[20] R.F. Hashimoto, S. Kim, I. Shmulevich, W. Zhang, M.L. Bittner, and E.R. Dougherty, “Growing Genetic Regulatory Networks from Seed Genes,” Bioinformatics, vol. 20, no. 8, pp. 1241-1247, 2004.
[21] E.R. Dougherty, S. Kim, and Y. Chen, “Coefficient of Determination in Nonlinear Signal Processing,” Signal Processing, vol. 80, pp. 2219-2235, 2000.
[22] R.F. Hashimoto, E.R. Dougherty, M. Brun, Z. Zhou, M.L. Bittner, and J.M. Trent, “Efficient Selection of Feature Sets Possessing High Coefficients of Determination Based on Incremental Determinations,” Signal Processing, vol. 83, pp. 695-712, 2003.
[23] J.M. Peña, J. Bjorkegren, and J. Tegnér, “Growing Bayesian Network Models of Gene Networks from Seed Genes,” Bioinformatics, vol. 21, no. Suppl. 2, pp. ii224-ii229, 2005.
[24] S. Nakariyakul and D.P. Casasent, “An Improvement on Floating Search Algorithms for Feature Subset Selection,” Pattern Recognition, vol. 42, no. 9, pp. 1932-1940, 2009.
[25] E. Tsang, Foundations of Constraint Satisfaction. Academic Press, 1993.
[26] C.H.A. Higa, V.H.P. Louzada, T.P. Andrade, and R.F. Hashimoto, “Constraint-Based Analysis of Gene Interactions Using Restricted Boolean Networks and Time-Series Data,” BMC Proc., vol. 5, no. Suppl 2, article S5, 2011.
[27] S.A. Kauffman, The Origins of Order: Self-Organization and Selection in Evolution. Oxford Univ. Press, 1993.
[28] S. Huang, “Gene Expression Profiling, Genetic Networks, and Cellular States: An Integrating Concept for Tumorigenesis and Drug Discovery,” J. Molecular Medicine, vol. 77, no. 6, pp. 469-480, 1999.
[29] F. Li, T. Long, Y. Lu, Q. Ouyang, and C. Thang, “The Yeast Cell-Cycle Network Is Robustly Designed,” Proc. Nat'l Academy of Sciences USA, vol. 101, no. 14, pp. 4781-4786, 2004.
[30] M.I. Davidich and S. Bornholdt, “Boolean Network Model Predicts Cell Cycle Sequence of Fission Yeast,” PLoS ONE, vol. 3, no. 2, p. e1672, 2008.
[31] C. Darabos, F. Di Cunto, M. Tomassini, J.H. Moore, P. Provero, and M. Giacobini, “Additive Functions in Boolean Models of Gene Regulatory Network Modules,” PLoS ONE, vol. 6, no. 11, p. e25110, 2011.
[32] I. Shmulevich, E.R. Dougherty, and W. Zhang, “Gene Perturbation and Intervention in Probabilistic Boolean Networks,” Bioinformatics, vol. 18, no. 10, pp. 1319-1331, 2002.
[33] Y. Zhang, M. Qian, Q. Ouyang, M. Deng, F. Li, and C. Tang, “Stochastic Model of Yeast Cell-Cycle Network,” Physica D Nonlinear Phenomena, vol. 219, pp. 35-39, 2006.
[34] N.W. Trepode, H.A. Armelin, M. Bittner, J. Barrera, M.D. Gubitoso, and R.F. Hashimoto, “A Robust Structural PGN Model for Control of Cell-Cycle Progression Stabilized by Negative Feedbacks,” EURASIP J. Bioinformatics and Systems Biology, vol. 2007, pp. 1-11, 2007.
[35] R. Pal, A. Datta, M.L. Bittner, and E.R. Dougherty, “Intervention in Context-Sensitive Probabilistic Boolean Networks,” Bioinformatics, vol. 21, pp. 1211-1218, 2005.
[36] R. Pal, “Context-Sensitive Probabilistic Boolean Networks: Steady-State Properties, Reduction, and Steady-State Approximation,” IEEE Trans. Signal Processing, vol. 58, no. 2, pp. 879-890, Feb. 2010.
[37] I. Shmulevich, E.R. Dougherty, and W. Zhang, “From Boolean to Probabilistic Boolean Networks as Models of Genetic Regulatory Networks,” Proc. IEEE, vol. 90, no. 11, pp. 1778-1792, Nov. 2002.
[38] A.L. Barabási and R. Albert, “Emergence of Scaling in Random Networks,” Science, vol. 286, pp. 509-512, 1999.
[39] C. Espinosa-Soto, P. Padilla-Longoria, and E.R. Alvarez-Buylla, “A Gene Regulatory Network Model for Cell-Fate Determination during Arabidopsis Thaliana Flower Development That Is Robust and Recovers Experimental Gene Expression Profiles,” Plant Cell, vol. 16, no. 22, pp. 2923-2939, 2004.
[40] Y. Xiao, “A Tutorial on Analysis and Simulation of Boolean Gene Regulatory Networks Models,” Current Genomics, vol. 10, no. 7, pp. 511-525, 2009.
[41] S. Vajda, C.E. Shannon, and W. Weaver, “The Mathematical Theory of Communication,” The Math. Gazette, vol. 34, no. 310, p. 312, 1950.
[42] P. Pudil, J. Novovicova, and J. Kittler, “Floating Search Methods in Feature-Selection,” Pattern Recognition Letters, vol. 15, no. 11, pp. 1119-1125, 1994.
[43] A.W. Whitney, “A Direct Method of Nonparametric Measurement Selection,” IEEE Trans. Computer, vol. C-20, no. 9, pp. 1100-1103, Sept. 1971.
[44] T. Marill and D. Green, “On the Effectiveness of Receptors in Recognition Systems,” IEEE Trans. Information Theory, vol. IT-9, no. 1, pp. 11-17, Jan. 1963.
[45] J. Ellson, E. Gansner, L. Koutsofios, S. North, and G. Woodhull, “Graphviz—Open Source Graph Drawing Tools,” Proc. Int'l Symp. Graph Drawing, P. Mutzel, M. Junger, and S. Leipert, eds., pp. 594-597, 2002.
[46] E.R. Dougherty, “Validation of Inference Procedures for Gene Regulatory Networks,” Current Genomics, vol. 8, no. 6, pp. 351-359, 2007.
[47] M.L. Whitfield, G. Sherlock, A.J. Saldanha, J.I. Murray, C.A. Ball, K.E. Alexander, J.C. Matese, C.M. Perou, M.M. Hurt, P.O. Brown, and D. Botstein, “Identification of Genes Periodically Expressed in the Human Cell Cycle and Their Expression in Tumors,” Molecular Biology of the Cell, vol. 13, no. 6, pp. 1977-2000, 2002.
[48] Y. Li, L. Liu, X. Bai, H. Cai, W. Ji, D. Guo, and Y. Zhu, “Comparative Study of Discretization Methods of Microarray Data for Inferring Transcriptional Regulatory Networks,” BMC Bioinformatics, vol. 11, no. 1, article 520, 2010.
[49] R. Fagan, K.J. Flint, and N. Jones, “Phosphorylation of E2F-1 Modulates Its Interaction with the Retinoblastoma Gene Product and the Adenoviral E4 19 kDa Protein,” Cell, vol. 78, no. 5, pp. 799-811, 1994.
[50] M. Brun, E.R. Dougherty, and I. Shmulevich, “Steady-State Probabilities for Attractors in Probabilistic Boolean Networks,” Signal Processing, vol. 85, pp. 1993-2013, 2005.
[51] M. Hopfensitz, C. Müssel, C. Wawra, M. Maucher, M. Kühl, H. Neumann, and H.A. Kestler, “Multiscale Binarization of Gene Expression Data for Reconstructing Boolean Networks,” IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 9, no. 2, pp. 487-498, Mar./Apr. 2012.
[52] X. Quian, N. Ghaffari, I. Ivanov, and E.R. Dougherty, “State Reduction for Network Intervention in Probabilistic Boolean Networks,” Bioinformatics, vol. 26, no. 24, pp. 3098-3104, 2010.
[53] B. Derrida and Y. Pomeau, “Random Networks of Automata: A Simple Annealed Approximation,” Europhysics Letters, vol. 1, pp. 45-49, 1986.
19 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool