CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2008 vol.5 Issue No.02 - April-June

Subscribe

Issue No.02 - April-June (2008 vol.5)

pp: 262-274

ABSTRACT

Recently, the concept of mutual information has been proposed for infering the structure of genetic regulatory networks from gene expression profiling. After analyzing the limitations of mutual information in inferring the gene-to-gene interactions, this paper introduces the concept of conditional mutual information and based on it proposes two novel algorithms to infer the connectivity structure of genetic regulatory networks. One of the proposed algorithms exhibits a better accuracy while the other algorithm excels in simplicity and flexibility. By exploiting the mutual information and conditional mutual information, a practical metric is also proposed to assess the likeliness of direct connectivity between genes. This novel metric resolves a common limitation associated with the current inference algorithms, namely the situations where the gene connectivity is established in terms of the dichotomy of being either connected or disconnected. Based on the data sets generated by synthetic networks, the performance of the proposed algorithms is compared favorably relative to existing state-of-the-art schemes. The proposed algorithms are also applied on realistic biological measurements, such as the cutaneous melanoma data set, and biological meaningful results are inferred.

INDEX TERMS

Biology and genetics, information theory, genetic regulatory network, DNA microarray

CITATION

Wentao Zhao, Erchin Serpedin, Edward R. Dougherty, "Inferring Connectivity of Genetic Regulatory Networks Using Information-Theoretic Criteria",

*IEEE/ACM Transactions on Computational Biology and Bioinformatics*, vol.5, no. 2, pp. 262-274, April-June 2008, doi:10.1109/TCBB.2007.1067REFERENCES

- [1] J.D. Watson et al.,
Molecular Biology of the Gene, fifth ed. Benjamin Cummings, 2004.- [2] S. Liang, S. Fuhrman, and R. Somogyi, “REVEAL, a General Reverse-Engineering Algorithm for Inference of Genetic Network Architectures,”
Proc. Pacific Symp. Biocomputing (PSB '98), pp. 18-29, 1998.- [4] I. Huang and Y. Wang, “Bayesian Inference of Cell Cycle Regulatory Networks,”
Proc. IEEE Int'l Workshop Genomic Signal Processing and Statistics (GENSIPS '05), May 2005.- [7] T. Dean and K. Kanazawa, “A Model for Reasoning About Persistence and Causation,”
Computational Intelligence, vol. 5, pp.142-150, 1989.- [9] K. Murphy and S. Mia, “Modeling Gene Expression Data Using Dynamic Bayesian Networks,” technical report, Berkeley Univ., 1999.
- [11] I. Shmulevich and O. Yli-Harja, “Inference of Genetic Regulatory Networks under the Best-Fit Extension Paradigm,”
Proc. Fifth IEEE-EURASIP Workshop Nonlinear Signal and Image Processing (NSIP '01), June 2001.- [16] J. Yu et al., “Using Bayesian Network Inference Algorithms to Recover Molecular Genetic Regulatory Networks,”
Proc. Third Int'l Conf. Systems Biology (ICSB '02), Dec. 2002.- [17] J. Pearl,
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, 1988.- [18] A.J. Butte and I.S. Kohane, “Mutual Information Relevance Networks: Functional Genomic Clustering Using Pairwise Entropy Measurements,”
Proc. Pacific Symp. Biocomputing (PSB '00), pp. 418-429, Jan. 2000.- [19] G.F. Cooper, “A Simple Constraint-Based Algorithm for Efficiently Mining Observational Databases for Causal Relationships,”
Data Mining and Knowledge Discovery, vol. 1, pp. 203-224, 1997.- [22] T.M. Cover and J.A. Thomas,
Elements of Information Theory. Wiley-Interscience, 1991.- [24] M. Gustafsson, M. Hornquist, and A. Lombardi, “Constructing and Analyzing a Large-Scale Gene-to-Gene Regulatory Network: Lasso-Constrained Inference and Biological Validation,”
IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 2, no. 3, July-Sept. 2005.- [25] D.W. Scott,
Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley & Sons, 1992.- [26] A. Ihler, “Kernel Density Estimation Software,” http://www. ics.uci.edu/~ihlercode/, 2006.
- [27] J. Beirlant, E. Dudewicz, L. Gyorfi, and E. van der Meulen, “Nonparametric Entropy Estimation: An Overview,”
Int'l J. Math. and Statistical Sciences, vol. 6, pp. 17-39, 1997.- [29] L. Paninski, “Estimating Entropy on $m$ Bins Given Fewer than $m$ Samples,”
IEEE Trans. Information Theory, vol. 50, pp. 2200-2203, 2004.- [34] P. Mendes, “Artificial Genetic Networks,” http://mendes.vbi. vt.edu/AGN/Centuryindex.html , 2006.
- [35] T.V.D. Bulcke et al., “SynTReN: A Generator of Synthetic Gene Expression Data for Design and Analysis of Structure Learning Algorithms,”
BMC Bioinformatics, vol. 7, p. 43, 2006.- [36] M Bittner et al., “Molecular Classification of Cutaneous Malignant Melanoma by Gene Expression Profiling,”
Nature, vol. 406, pp.536-540, 2000.- [39] F. Al-Ejeh, D. Croucher, and M. Ranson, “Kinetic Analysis of Plasminogen Activator Inhibitor Type-2: Urokinase Complex Formation and Subsequent Internalization by Carcinoma Cell Lines,”
Experimental Cell Research, vol. 297, pp. 259-271, 2004.- [40] E.T. Ifon et al., “U94 Alters FN1 and ANGPTL4 Gene Expression and Inhibits Tumorigenesis of Prostate Cancer Cell Line PC3,”
Cancer Cell Int'l, vol. 22, pp. 5-19, 2005.- [41] R.S. Watnick et al., “Ras Modulates Myc Activity to Repress Thrombospondin-1 Expression and Increase Tumor Angiogenesis; Ras Modulates Myc Activity to Repress Thrombospondin-1 Expression and Increase Tumor Angiogenesis,”
Cancer Cell, vol. 3, pp. 219-231, 2003.- [43] B.C. Fuchs and J.C. Perez et al., “Inducible Antisense RNA Targeting Amino Acid Transporter ATB0/ASCT2 Elicits Apoptosis in Human Hepatoma Cells,”
Am. J. Physiology—, Gastrointestinal and Liver Physiology, vol. 286, pp. G467-G478, 2003.- [44] Q.X. Li, N. Ke, R. Sundaram, and F. Wong-Staal, “NR4A1, 2, 3-An Orphan Nuclear Hormone Receptor Family Involved in Cell Apoptosis and Carcinogenesis,”
Histology and Histopathology, vol. 21, pp. 533-540, 2006. |