This Article 
 Bibliographic References 
 Add to: 
Information-Theoretic Model of Evolution over Protein Communication Channel
January-February 2011 (vol. 8 no. 1)
pp. 143-151
Liuling Gong, University of Illinois at Chicago, Chicago
Nidhal Bouaynaya, University of Arkansas at Little Rock, Little Rock
Dan Schonfeld, University of Illinois at Chicago, Chicago
In this paper, we propose a communication model of evolution and investigate its information-theoretic bounds. The process of evolution is modeled as the retransmission of information over a protein communication channel, where the transmitted message is the organism's proteome encoded in the DNA. We compute the capacity and the rate distortion functions of the protein communication system for the three domains of life: Archaea, Bacteria, and Eukaryotes. The tradeoff between the transmission rate and the distortion in noisy protein communication channels is analyzed. As expected, comparison between the optimal transmission rate and the channel capacity indicates that the biological fidelity does not reach the Shannon optimal distortion. However, the relationship between the channel capacity and rate distortion achieved for different biological domains provides tremendous insight into the dynamics of the evolutionary processes of the three domains of life. We rely on these results to provide a model of genome sequence evolution based on the two major evolutionary driving forces: mutations and unequal crossovers.

[1] A. Bruce, D. Bray, J. Lewis, M. Raff, K. Roberts, and J.D. Watson, Molecular Biology of the Cell, third ed. Garland Publishing, 1994.
[2] N. Bouaynaya and D. Schonfeld, "Protein Communication System: Evolution and Genomic Structure," Algorithmica, vol. 48, no. 4, pp. 375-397, Aug. 2007.
[3] C. Shannon, "A Mathematical Theory of Communication," Bell System Technical J., vol. 27, pp. 379-423, July 1948.
[4] T.M. Cover and J.A. Thomas, Elements of Information Theory. Wiley Interscience, 1991.
[5] T. Berger, Rate Distortion Theory: A Mathematical Basis for Data Compression. Prentice Hall, 1971.
[6] L. Gong, N. Bouaynaya, and D. Schonfeld, "Information-Theoretic Bounds of Evolutionary Processes Modeled as a Protein Communication System," Proc. IEEE Statistical Signal Processing Workshop, pp. 1-5, Aug. 2007.
[7] M. Dayhoff, R. Schwartz, and B. Orcutt, "A Model of Evolutionary Change in Proteins," Altas of Protein Sequence and Structure, vol. 5, no. 3, pp. 345-352, Nat'l Biomedical Research Foundation, 1978.
[8] R. Blahut, "Computation of Channel Capacity and Rate-Distortion Functions," IEEE Trans. Information Theory, vol. 18, no. 4, pp. 460-472, July 1972.
[9] T. Ohta, "Evolution of Gene Families," Gene, vol. 259, pp. 45-52, 2000.
[10] C. Zimmer, "Did DNA Come from Viruses," Science, vol. 312, no. 5775, pp. 870-872, May 2006.
[11] T. Ohta, "Simulating Evolution by Gene Duplication," Genetics, vol. 115, no. 1, pp. 207-213, Jan. 1987.
[12] D. Snyder and M. Miller, Random Point Processes in Time and Space, second ed. Springer, 1991.
[13] A. Papoulis and S.U. Pillai, Probabilities, Random Variables and Stochastic Processes. McGraw-Hill, 1991.
[14] J.A. Jacques, "Tracer Kinetics," Principles of Nuclear Medicine, H.N. Wagner, 1968.
[15] X. Song, X.J. Jordanides, M.J. Lang, and G.R. Fleming, "Solvation Dynamics in Protein Environments Studied by Photon Echo Spectroscopy," J. Physical Chemistry B, vol. 103, no. 37, pp. 7995-8005, 1999.
[16] J. Ervin, J. Sabelko, and M. Gruebele, "Observation of Strange Kinetics in Protein Folding," Proc. Nat'l Academy of Sciences USA, vol. 96, no. 11, pp. 6031-6036, 1999.
[17] Z. Bajzer and F.G. Prendergast, "A Model for MultiExponential Tryptophan Fluorescence Intensity Decay in Proteins," Biophysical J., vol. 65, no. 6, pp. 2313-2323, 1993.
[18] N. Bogatryreva, A. Finkelstein, and O. Galzitskaya, "Trend of Amino Acid Composition of Proteins of Different Taxa," J. Bioinformatics and Computational Biology, vol. 4, no. 2, pp. 597-608, 2006.
[19] P. Higgs and T. Attwood, Bioinformatics and Molecular Evolution. Blackwell Publishing, 2005.
[20] A. Wyner and J. Ziv, "Bounds on the Rate-Distortion Function for Stationary Sources with Memory," IEEE Trans. Information Theory, vol. 17, no. 5, pp. 508-513, Sept. 1971.
[21] C. Woese, "The Universal Ancestor," Proc. Nat'l Academy of Sciences USA, vol. 95, no. 12, pp. 6854-6859, June 1998.
[22] W. Gilbert, "Why Genes in Pieces?" Nature, vol. 271, p. 501, Feb. 1978.
[23] W. Gilbert, "Gene Structure and Evolutionary Theory," New Perspective on Evolution, pp. 155-163, Wiley-Liss, 1991.
[24] W. Gilbert, S.D. Souza, and M. Long, "Origin of Genes," Proc. Nat'l Academy of Sciences USA, vol. 94, no. 15, pp. 7698-7703, July 1997.
[25] S.J. Gould, "Evolution's Erratic Pace," Natural History, vol. 86, no. 5, pp. 12-16, May 1977.
[26] N.W. Gillham, "Evolution by Jumps: Francis Galton and William Bateson and the Mechanism of Evolutionary Change," Genetics, vol. 159, no. 4, pp. 1383-1392, Dec. 2001.
[27] J. Zhang, D.M. Webb, and O. Podlaha, "Accelerated Protein Evolution and Origins of Human-Specific Features: Foxp2 as an Example," Genetics, vol. 162, no. 4, pp. 1825-1835, Dec. 2002.
[28] N. Eldredge and S.J. Gould, "Punctuated Equilibria: An Alternative to Phyletic Gradualism," Models in Paleobiology, T.J.M. Schopf, ed., pp. 82-115, Freeman, 1972.
[29] E. Mayr, "Speciational Evolution or Punctuated Equilibria," The Dynamics of Evolution, A. Somit and S.A. Peterson, eds., pp. 21-48, Cornel Univ. Press, 1992.
[30] S.J. Gould, The Structure of Evolutionary Theory. Harvard Univ. Press, 2002.

Index Terms:
Protein communication system, channel capacity, rate distortion theory, nonhomogeneous Poisson process.
Liuling Gong, Nidhal Bouaynaya, Dan Schonfeld, "Information-Theoretic Model of Evolution over Protein Communication Channel," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 1, pp. 143-151, Jan.-Feb. 2011, doi:10.1109/TCBB.2009.1
Usage of this product signifies your acceptance of the Terms of Use.