The Community for Technology Leaders
RSS Icon
Issue No.05 - Sept.-Oct. (2012 vol.9)
pp: 1293-1300
Richard Rottger , Comput. Syst. Biol. Group, Max Planck Institutefor Inf., Saarbrucken, Germany
Ulrich Ruckert , Dept. of Electr. Eng. & Comput. Sci., Univ. of California, Berkeley, CA, USA
Jan Taubert , Dept. of Comput. & Syst. Biol., Rothamsted Res., Harpenden, UK
Jan Baumbach , Comput. Syst. Biol. Group, Max Planck Institutefor Inf., Saarbrucken, Germany
The National Center for Biotechnology Information (NCBI) recently announced the availability of whole genome sequences for more than 1,000 species. And the number of sequenced individual organisms is growing. Ongoing improvement of DNA sequencing technology will further contribute to this, enabling large-scale evolution and population genetics studies. However, the availability of sequence information is only the first step in understanding how cells survive, reproduce, and adjust their behavior. The genetic control behind organized development and adaptation of complex organisms still remains widely undetermined. One major molecular control mechanism is transcriptional gene regulation. The direct juxtaposition of the total number of sequenced species to the handful of model organisms with known regulations is surprising. Here, we investigate how little we even know about these model organisms. We aim to predict the sizes of the whole-organism regulatory networks of seven species. In particular, we provide statistical lower bounds for the expected number of regulations. For Escherichia coli we estimate at most 37 percent of the expected gene regulatory interactions to be already discovered, 24 percent for Bacillus subtilis, and <;3% human, respectively. We conclude that even for our best researched model organisms we still lack substantial understanding of fundamental molecular control mechanisms, at least on a large scale.
microorganisms, bioinformatics, biological techniques, cellular biophysics, DNA, genetics, genomics, bioinformatics, gene regulatory networks, National Center for Biotechnology Information, genome sequences, DNA sequencing technology, cell survival, cell reproduction, transcriptional gene regulation, direct juxtaposition, Escherichia coli, Bacillus subtilis, fundamental molecular control mechanisms, Estimation, Databases, Bioinformatics, Robustness, Genomics, Humans, transcriptional gene regulatory networks., Computational biology, network statistics
Richard Rottger, Ulrich Ruckert, Jan Taubert, Jan Baumbach, "How Little Do We Actually Know? On the Size of Gene Regulatory Networks", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.9, no. 5, pp. 1293-1300, Sept.-Oct. 2012, doi:10.1109/TCBB.2012.71
[1] E.W. Sayers et al., "Database Resources of the National Center for Biotechnology Information," Nucleic Acids Research, vol. 38, pp. D5-D16, Jan. 2010.
[2] M.L. Metzker, "Sequencing Technologies-the Next Generation," Nature Rev. Genetics, vol. 11, pp. 31-46, Jan. 2010.
[3] J. Shendure and H. Ji, "Next-Generation DNA Sequencing," Nature Biotechnology, vol. 26, pp. 1135-1145, Oct. 2008.
[4] T.M. Venancio and L. Aravind, "Reconstructing Prokaryotic Transcriptional Regulatory Networks: Lessons from Actinobacteria," J. Biology, vol. 8, p. 29, 2009.
[5] J. Baumbach et al., "Towards the Integrated Analysis, Visualization and Reconstruction of Microbial Gene Regulatory Networks," Briefings in Bioinformatics, vol. 10, p. 75, 2009.
[6] C.O. Pabo and R.T. Sauer, "Transcription Factors: Structural Families and Principles of DNA Recognition," Ann. Rev. Biochemistry, vol. 61, pp. 1053-1095, 1992.
[7] M.M. Babu and L. Aravind, "Adaptive Evolution by Optimizing Expression Levels in Different Environments," Trends in Microbiology, vol. 14, pp. 11-14, Jan. 2006.
[8] M.M. Babu et al., "Methods to Reconstruct and Compare Transcriptional Regulatory Networks," Methods in Molecular Biology, vol. 541, pp. 163-180, 2009.
[9] M.M. Babu et al., "Structure and Evolution of Transcriptional Regulatory Networks," Current Opinion in Structural Biology, vol. 14, pp. 283-291, 2004.
[10] J. Baumbach et al., "Integrated Analysis and Reconstruction of Microbial Transcriptional Gene Regulatory Networks Using CoryneRegNet," Nature Protocols, vol. 4, pp. 992-1005, 2009.
[11] A.H. van Vliet, "Next Generation Sequencing of Microbial Transcriptomes: Challenges and Opportunities," FEMS Microbiol Letters, vol. 302, pp. 1-7, Nov. 2009.
[12] L.M. Hellman and M.G. Fried, "Electrophoretic Mobility Shift Assay (EMSA) for Detecting Protein-Nucleic acid Interactions," Nature Protocols, vol. 2, pp. 1849-1861, 2007.
[13] D.J. Galas and A. Schmitz, "DNAse Footprinting: A Simple Method for the Detection of Protein-DNA Binding Specificity," Nucleic Acids Research, vol. 5, pp. 3157-3170, Sept. 1978.
[14] L.V. Sun et al., "Protein-DNA Interaction Mapping Using Genomic Tiling Path Microarrays in Drosophila," Proc. Nat'l Academy of Sciences USA, vol. 100, pp. 9428-9433, Aug. 2003.
[15] R. Jothi et al., "Genome-Wide Identification of in vivo Protein-DNA Binding Sites from Chip-Seq Data," Nucleic Acids Research, vol. 36, pp. 5221-5231, Sept. 2008.
[16] R. Bonneau, "Learning Biological Networks: From Modules to Dynamics," Nature Chemical Biology, vol. 4, pp. 658-664, Nov. 2008.
[17] M.J. Herrgard et al., "Reconstruction of Microbial Transcriptional Regulatory Networks," Current Opinion in Biotechnology, vol. 15, pp. 70-77, Feb. 2004.
[18] M. Hucka and A. Finney, "Escalating Model Sizes and Complexities Call for Standardized Forms of Representation," Molecular Systems Biology, vol. 1, article 2005.0011, 2005.
[19] M. Hucka et al., "The Systems Biology Markup Language (SBML): A Medium for Representation and Exchange of Biochemical Network Models," Bioinformatics, vol. 19, pp. 524-531, Mar. 2003.
[20] S. Gama-Castro et al., "RegulonDB (Version 6.0): Gene Regulation Model of Escherichia Coli K-12 beyond Transcription, Active (Experimental) Annotated Promoters and Textpresso Navigation," Nucleic Acids Research, vol. 36, pp. D120-D124, Jan. 2008.
[21] I.M. Keseler et al., "EcoCyc: A Comprehensive View of Escherichia Coli Biology," Nucleic Acids Research, vol. 37, pp. D464-D470, Jan. 2009.
[22] N. Sierro et al., "DBTBS: A Database of Transcriptional Regulation in Bacillus Subtilis Containing Upstream Intergenic Conservation Information," Nucleic Acids Research, vol. 36, pp. D93-D96, 2008.
[23] S.K. Palaniswamy et al., "AGRIS and AtRegNet. a Platform to Link Cis-Regulatory Elements and Transcription Factors into Regulatory Networks," Plant Physiology, vol. 140, pp. 818-829, 2006.
[24] H. Salgado et al., "RegulonDB (Version 5.0): Escherichia coli K-12 Transcriptional Regulatory Network, Operon Organization, and Growth Conditions," Nucleic Acids Research, vol. 34, pp. D394-D397, Jan. 2006.
[25] M.P. Stumpf et al., "Estimating the Size of the Human Interactome," Proc. Nat'l Academy of Sciences USA, vol. 105, pp. 6959-6964, May. 2008.
[26] P. Bakke et al., "Evaluation of Three Automated Genome Annotations for Halorhabdus Utahensis," PLoS One, vol. 4, p. e6291, 2009.
[27] D. Wilson et al., "DBD-Taxonomically Broad Transcription Factor Predictions: New Content and Functionality," Nucleic Acids Research, vol. 36, pp. D88-D92, 2008.
[28] G. Balazsi et al., "The Temporal Response of the Mycobacterium Tuberculosis Gene Regulatory Network During Growth Arrest," Molecular Systems Biology, vol. 4, p. 225, 2008.
[29] M. Guo et al., "Dissecting Transcription Regulatory Pathways through a New Bacterial One-Hybrid Reporter System," Genome Research, vol. 19, pp. 1301-1308, July 2009.
[30] T.J. DiCiccio and B. Efron, "Bootstrap Confidence Intervals," Statistical Science, vol. 11, pp. 189-212, 1996.
[31] B. Efron et al., An Introduction to the Bootstrap. Chapman and Hall/CRC, 1993.
[32] K. Kaufmann et al., "Target Genes of the MADS Transcription Factor SEPALLATA3: Integration of Developmental and Hormonal Pathways in the Arabidopsis flower," PLoS Biology, vol. 7, p. e1000090, 2009.
[33] E. Oh et al., "Genome-Wide Analysis of Genes Targeted by PHYTOCHROME INTERACTING FACTOR 3-LIKE5 during Seed Germination in Arabidopsis," The Plant Cell, vol. 21, pp. 403-419, 2009.
[34] Y. Zheng et al., "Global Identification of Targets of the Arabidopsis MADS Domain Protein AGamous-Like15," The Plant Cell, vol. 21, pp. 2563-2577, 2009.
237 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool