Applying Automatically Derived Gene-Groups to Automatically Predict and Refine Metabolic Pathways
July/August 2003 (vol. 15 no. 4)
pp. 883-894

Abstract: This paper describes an automated technique for predicting integrated pathways and refining existing metabolic pathways. It uses automatically derived, functionally similar gene-groups and orthologs (functionally equivalent genes) obtained by comparing complete microbial genomes archived in GenBank. The described method integrates automatically derived orthologous and homologous gene-groups with the biochemical pathway template available at the KEGG database, the enzyme information derived from the SwissProt enzyme database, and the Ligand database. The technique refines existing pathways (based upon the network of enzyme reactions) by associating corresponding nonenzymatic and regulatory proteins with enzymes and operons and by identifying substituting homologs. The technique is suitable for building and refining integrated pathways using evolutionarily diverse organisms. A methodology and the corresponding algorithm are presented. The technique is illustrated by comparing the genomes of E. coli and B. subtilis with M. tuberculosis. The findings about integrated pathways are briefly discussed.
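
The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea of laying ortholog and homolog gene-groups over a pathway template. It assumes the template is a list of reaction steps, each optionally mapped to an enzyme gene in a reference genome, and that ortholog and homolog tables for a target genome are already available. The function name `annotate_pathway`, the status labels, and all step and gene identifiers are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set, Tuple


def annotate_pathway(
    template_steps: List[str],
    reference_enzymes: Dict[str, str],
    orthologs: Dict[str, str],
    homologs: Dict[str, Set[str]],
) -> Dict[str, Tuple[str, List[str]]]:
    """Label each template step with the target-genome genes that may fill it.

    template_steps:    ordered reaction steps of the pathway template
    reference_enzymes: step -> enzyme gene in the reference genome
    orthologs:         reference gene -> orthologous gene in the target genome
    homologs:          reference gene -> similar (non-orthologous) target genes
    """
    annotation: Dict[str, Tuple[str, List[str]]] = {}
    for step in template_steps:
        ref_gene = reference_enzymes.get(step)
        if ref_gene is None:
            # The template step has no known enzyme in the reference genome.
            annotation[step] = ("unassigned", [])
        elif ref_gene in orthologs:
            # A functionally equivalent gene exists in the target genome.
            annotation[step] = ("ortholog", [orthologs[ref_gene]])
        elif homologs.get(ref_gene):
            # No ortholog, but similar genes are candidate substitutes.
            annotation[step] = ("homolog", sorted(homologs[ref_gene]))
        else:
            # Nothing similar was found: a candidate missing enzyme.
            annotation[step] = ("missing", [])
    return annotation


if __name__ == "__main__":
    # Toy data with made-up identifiers, for illustration only.
    steps = ["step1", "step2", "step3", "step4"]
    enzymes = {"step1": "refA", "step2": "refB", "step3": "refC"}
    ortho = {"refA": "tgt1"}
    homo = {"refB": {"tgt7", "tgt3"}}
    for step, (status, genes) in annotate_pathway(steps, enzymes, ortho, homo).items():
        print(step, status, genes)
```

Steps labeled "homolog" or "missing" are the ones such a comparison would flag for closer inspection. Associating nonenzymatic and regulatory proteins with enzymes and operons, as the abstract describes, is outside the scope of this sketch.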

Index Terms:
Automation, bacteria, drug-discovery, enzymes, gene-groups, homologs, metabolic pathway, microbes, operons, orthologs, pathogenicity, pathway.
Arvind K. Bansal, Christopher J. Woolverton, "Applying Automatically Derived Gene-Groups to Automatically Predict and Refine Metabolic Pathways," IEEE Transactions on Knowledge and Data Engineering, vol. 15, no. 4, pp. 883-894, July-Aug. 2003, doi:10.1109/TKDE.2003.1209006