This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Inferring Adaptive Regulation Thresholds and Association Rules from Gene Expression Data through Combinatorial Optimization Learning
October-December 2007 (vol. 4 no. 4)
pp. 624-634
There is a need to design computational methods to support the prediction of gene regulatory networks. Such models should offer both biologically-meaningful and computationally-accurate predictions, which in combination with other techniques may improve large-scale, integrative studies. This paper presents a new machine learning method for the prediction of putative regulatory associations from expression data, which exhibit properties never or only partially addressed by other techniques recently published. The method was tested on a Saccharomyces cerevisiae gene expression dataset. The results were statistically validated and compared with the relationships inferred by two machine learning approaches to gene regulatory network prediction. Furthermore, the resulting predictions were assessed using domain knowledge. The proposed algorithm may be able to accurately predict relevant biological associations between genes. One of the most relevant features of this new method is the prediction of adaptive regulation thresholds for the discretization of gene expression values, which is required prior to the rule association learning process. Moreover, an important advantage consists of its low computational cost to infer association rules. The proposed system may significantly support exploratory, large-scale studies of automated identification of potentially-relevant gene expression associations.

[1] H. Bolouri and E.H. Davidson, “Modeling Transcriptional Regulatory Networks,” BioEssays, vol. 24, pp. 1118-1129, 2002.
[2] M.P. Styczynski and G. Stephanopoulos, “Overview of Computational Methods for the Inference of Gene Regulatory Networks,” Computers and Chemical Eng., vol. 29, pp. 519-534, 2005.
[3] H. De Jong, “Modeling and Simulation of Genetic Regulatory Systems: A Literature Review,” J. Computational Biology, vol. 9, pp.67-103, 2002.
[4] C. Pridgeon and D. Corne, “Genetic Network Reverse-Engineering and Network Size; Can We Identify Large GRNs?” Proc. 2004 IEEE Symp. Computational Intelligence in Bioinformatics and Computational Biology, pp. 32-36, Oct. 2004.
[5] J.L. De Risi, V.R. Iyer, and P.O. Broma, “Exploring the Metabolic and Genetic Control of Gene Expression on a Genomic Scale,” Science, vol. 278, pp. 680-686, 1997.
[6] M.B. Eisen, P.T. Spellman, P.O. Brown, and D. Botstein, “Cluster Analysis and Display of Genome-Wide Expression Patterns,” Proc. Nat'l Academy of Sciences USA, vol. 95, pp. 14863-14868, 1998.
[7] S. Liang, S. Fuhrman, and R. Somogyi, “REVEAL, A General Reverse Engineering Algorithm for Inference of Genetic Network Architectures,” Proc. Pacific Symp. Biocomputing, vol. 3, pp. 18-29, Jan. 1998.
[8] T. Akutsu, S. Miyano, and S. Buhara, “Identification of Genetic Networks from a Small Number of Gene Expression Patterns under the Boolean Network Model,” Proc. Pacific Symp. Biocomputing, vol. 4, pp. 17-28, Jan. 1998.
[9] S. Mehra, W.-S. Hu, and G. Karypis, “G: A Boolean Algorithm for Reconstructing the Structure of Regulatory Networks,” Metabolic Eng., vol. 6, pp. 326-339, 2004.
[10] L.A. Soinov, M.A. Krestyaninova, and A. Brazma, “Towards Reconstruction of Gene Networks from Expression Data by Supervised Learning,” Genome Biology, vol. 4, Article R6, 2003.
[11] N. Friedman, M. Linial, I. Nachman, and D. Péer, “Using Bayesian Networks to Analyze Expression Data,” J. Computational Biology, vol. 7, pp. 601-620, 2000.
[12] S. Bulashevska and R. Eils, “Inferring Genetic Regulatory Logic from Expression Data,” Bioinformatics, vol. 21, pp. 2706-2713, 2005.
[13] M. Zou and S.D. Conzen, “A New Dynamic Bayesian Network (DBN) Approach for Identifying Gene Regulatory Networks from Time Course Microarray Data,” Bioinformatics, vol. 21, pp. 71-79, 2005.
[14] A.E. Eiben and J.E. Smith, Introduction to Evolutionary Computing. Springer, 2003.
[15] S. Ando, E. Sakamoto, and H. Iba, “Evolutionary Modeling and Inference of Gene Network,” Information Sciences, vol. 145, pp. 225-236, 2002.
[16] H. Iba and A. Mimura, “Inference of a Gene Regulatory Network by Means of Interactive Evolutionary Computing,” Information Sciences, vol. 145, pp. 225-236, 2002.
[17] J. Hallinan and J. Wiles, “Evolving Genetic Regulatory Networks Using an Artificial Genome,” Proc. Second Asia-Pacific Bioinformatics Conf., 2004.
[18] T. Reil, “Dynamics of Gene Expression in an Artificial Genome: Implications for Biological and Artificial Ontogeny,” Proc. Fifth European Conf. Artificial Life, D. Floreano, F. Mondada, and J.D.Nicoud, eds., pp. 457-466, 1999.
[19] J.R. Quinlan, C4.5: Programs for Machine Learning. Morgan Kaufmann, 1992.
[20] S. Ruggieri, “Efficient C4.5,” IEEE Trans. Knowledge and Data Eng., vol. 14, pp. 438-444, 2002.
[21] J. Ihmels, G. Friedlander, S. Bergmann, O. Sarig, Y. Ziv, and N. Barkai, “Revealing Modular Organization in the Yeast Transcriptional Network,” Nature Genetics, vol. 31, pp. 370-377, 2002.
[22] E. Segal, M. Shapira, A. Regev, D. Pe'er, D. Botstein, D. Koller, and N. Friedman, “Module Networks: Identifying Regulatory Modules and Their Condition-Specific Regulators from Gene Expression Data,” Nature Genetics, vol. 34, pp. 166-176, 2003.
[23] P.H. Lee and D. Lee, “Inferring Genetic Regulatory Logic from Expression Data,” Bioinformatics, vol. 21, pp. 2739-2747, 2005.
[24] B. Snel and M.A. Huynen, “Quantifying Modularity in the Evolution of Biomolecular Systems,” Genome Research, vol. 14, pp. 391-397, 2004.
[25] T. Schlitt and A. Brazma, “Modelling Gene Networks at Different Organisational Levels,” FEBS Letters, vol. 579, pp. 1859-1866, 2005.
[26] T. Schlitt and A. Brazma, “Modelling in Molecular Biology: Describing Transcription Regulatory Networks at Different Scales,” Philosophical Trans. Royal Soc. of London Series B, Biological Sciences, vol. 361, no. 1467, pp. 483-494, 2006.
[27] C.-H. Yeang and T. Jaakkola, “Physical Network Models and Multi-Source Data Integration,” Proc. Seventh Ann. Int'l Conf. Research in Computational Molecular Biology, pp. 312-321, Apr. 2003.
[28] P.T. Spellman, G. Sherlock, M.Q. Zhang, V.R. Iyer, K. Andres, B. Eisen, P.O. Brown, D. Botstein, and B. Futcher, “Comprehensive Identification of Cell Cycle-Regulated Genes of the Yeast Saccharomyces cerevisiae by Microarray Hybridization,” Molecular Biology of the Cell, vol. 9, pp. 3273-3297, 1998.
[29] T. Mitchel, Machine Learning, chapter 3. WCB/McGraw-Hill, 1997.
[30] R. Kohani, “Wrappers for Performance Enhancement and Oblivious Decision Graphs,” PhD dissertation, Computer Science Dept., Stanford Univ., 1995.
[31] D.R. Carvalho and A.A. Freitas, “A Hybrid Decision Tree/Genetic Algorithm Method for Data Mining,” Information Sciences, vol. 163, pp. 13-35, 2004.
[32] F. Provost, D. Jensen, and T. Oates, “Efficient Progressive Sampling,” Proc. Fifth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, Paper ID 442, Aug. 1999.
[33] R.J. Cho, M.J. Campbell, E.A. Winzeler, L. Steinmetz, A. Conway, L. Wodicka, T.G. Wolfsberg, A.E. Gabrielian, D. Landsman, D.J. Lockhart, and R.W. Davis, “A Genome-Wide Transcriptional Analysis of the Mitotic Cell Cycle,” Molecular Cell, vol. 2, pp. 65-73, 1998.
[34] C. Kuhne and P. Linder, “A New Pair of B-Type Cyclins from Saccharomyces cerevisiae that Function Early in the Cell Cycle,” European Molecular Biology Organization J., vol. 12, pp. 3437-3447, 1993.
[35] K.C. Chen, A. Csikasz-Nagy, B. Gyorffy, J. Val, B. Novak, and J.J. Tyson, “Kinetic Analysis of a Molecular Model of the Budding Yeast Cell Cycle,” Molecular Biology of the Cell, vol. 11, pp. 369-391, 2000.
[36] L.H. Hwang, L.F. Lau, D.L. Smith, C.A. Mistrot, K.G. Hardwick, E.S. Hwang, A. Amon, and A.W. Murray, “Budding Yeast CDC20: A Target of the Spindle Checkpoint,” Science, vol. 279, pp. 1041-1044, 1998.
[37] C.B. Epstein and F.R. Cross, “CLB5: A Novel B Cyclin from Budding Yeast with a Role in S Phase,” Genes and Development, vol. 6, pp. 1695-1706, 1992.
[38] K. Levine, K. Huang, and F.R. Cross, “Saccharomyces cerevisiae G1 Cyclins Differ in Their Intrinsic Functional Specificities,” Molecular and Cellular Biology, vol. 16, pp. 6794-6803, 1996.
[39] H. Althoefer, A. Schleiffer, K. Wassmann, A. Nordheim, and G. Ammerer, “McmI Is Required to Coordinate G2-Specific Transcription in Saccharomyces cerevisiae,” Molecular and Cellular Biology, vol. 15, pp. 5917-5928, 1995.
[40] M. Koranda, A. Schleiffer, L. Endler, and G. Ammerer, “Forkhead-Like Transcription Factors Recruit NddI to the Chromatin of G2/M-Specific Promoters,” Nature, vol. 406, pp. 94-98, 2000.
[41] C.J. Loy, D. Lydall, and U. Surana, “NDDI, a High-Dosage Suppressor of cdc28-I N, Is Essential for Expression of a Subset of Late-S-Phase-Specific Genes in S. cerevisiae,” Molecular and Cellular Biology, vol. 19, pp. 3312-3327, 1999.
[42] A. Amon, M. Tyers, B. Futcher, and K. Nasmyth, “Mechanisms that Help the Yeast Cell Cycle Clock Tick: G2 Cyclins Transcriptionally Activate G2 Cyclins and Repress G1 Cyclins,” Cell, vol. 74, pp. 993-1007, 1993.
[43] B. Andrews and V. Measday, “The Cyclin Family of Budding Yeast: Abundant Use of a Good Idea,” Trends in Genetics, vol. 14, pp. 66-72, 1998.
[44] B. Schneider, E. Patton, S. Lanker, M. Mendenhall, C. Wittenberg, B. Futcher, and M. Tyers, “Yeast GI Cyclins Are Instable in GI Phase,” Nature, vol. 395, pp. 86-89, 1998.
[45] J.H. Toyn, A.L. Johnson, J.D. Donovan, W.M. Toone, and L.H. Johnston, “The Swi5 Transcription Factor of Saccharomyces cerevisiae Has a Role in Exit from Mitosis through Induction of the Cdk-Inhibitor SicI in Telophase,” Genetics, vol. 145, pp. 85-96, 1997.
[46] S. Prinz, E.S. Hwang, R. Visintin, and A. Amon, “The Regulation of Cdc20 Proteolysis Reveals a Role for the APC Components Cdc23 and Cdc27 during S Phase and Early Mitosis,” Current Biology, vol. 8, pp. 750-760, 1998.
[47] K.C. Chen, A. Csikasz-Nagy, B. Gyorffy, J. Val, B. Novak, and J.J. Tyson, “Kinetic Analysis of a Molecular Model of the Budding Yeast Cell Cycle,” Molecular Biology of the Cell, vol. 11, pp. 369-391, 2000.
[48] J.C. Igual, W.M. Toone, and L.H. Johnston, “A Genetic Screen Reveals a Role for the Late G1-Specific Transcription Factor Swi4p in Diverse Cellular Functions Including Cytokinesis,” J. Cell Science, vol. 110, pp. 1647-1654, 1997.
[49] “The Gene Ontology Consortium: Creating the Gene Ontology Resource: Design and Implementation,” Genome Research, vol. 11, pp. 1425-1433, 2001.
[50] H. Wang, F.J. Azuaje, O. Bodenreider, and J. Dopazo, “Gene Expression Correlation and Gene Ontology-Based Similarity: An Assessment of Quantitative Relationships,” Proc. IEEE 2004 Symp. Computational Intelligence in Bioinformatics and Computational Biology, pp. 25-31, 2004.

Index Terms:
combinatorial optimization, genetic regulatory networks, machine-learning, gene expression data, decision trees
Citation:
Ignacio Ponzoni, Francisco Azuaje, Juan Augusto, David Glass, "Inferring Adaptive Regulation Thresholds and Association Rules from Gene Expression Data through Combinatorial Optimization Learning," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 4, no. 4, pp. 624-634, Oct.-Dec. 2007, doi:10.1109/tcbb.2007.1049
Usage of this product signifies your acceptance of the Terms of Use.