This Article 
 Bibliographic References 
 Add to: 
A Powerful Approach for Effective Finding of Significantly Differentially Expressed Genes
July-September 2006 (vol. 3 no. 3)
pp. 220-231
The problem of identifying significantly differentially expressed genes for replicated microarray experiments is accepted as significant and has been tackled by several researchers. Patterns from Gene Expression (PaGE) and q-values are two of the well-known approaches developed to handle this problem. This paper proposes a powerful approach to handle this problem. We first propose a method for estimating the prior probabilities used in the first version of the PaGE algorithm. This way, the problem definition of PaGE stays intact and we just estimate the needed prior probabilities. Our estimation method is similar to Storey's estimator without being its direct extension. Then, we modify the problem formulation to find significantly differentially expressed genes and present an efficient method for finding them. This formulation increases the power by directly incorporating Storey's estimator. We report the preliminary results on the BRCA data set to demonstrate the applicability and effectiveness of our approach.

[1] O. Abul, R. Alhajj, F. Polat, and K. Barker, “Finding Differentially Expressed Genes: Pattern Generation Using q-Values,” Proc. ACM Symp. Applied Computing (ACM SAC '04), pp. 138-142 Mar. 2004.
[2] A.A. Alizadeh et al., “Distinct Types of Diffuse Large b-Cell Lymphoma Identified by Gene Expression Profiling,” Nature, vol. 403, no. 6769, pp. 503-511, 2000.
[3] D.P. Bertsekas, Dynamic Programming: Deterministic and Stochastic Models. Englewood Cliffs, N.J.: Prentice-Hall, 1987.
[4] G.A. Churchill and M.K. Kerr, “Bootstrapping Cluster Analysis: Assessing the Reliability of Conclusions from Microarray Experiments,” PNAS Genetics, vol. 98, no. 16, pp. 8961-8965, 2001.
[5] J-M. Claverie, “Computational Methods for the Identification of Differential and Coordinated Gene Expression,” Human Molecular Genetics, vol. 8, no. 10, pp. 1821-1832, 1999.
[6] I. Dozmorov and M. Centola, “An Associative Analysis of Gene Expression Array Data,” Bioinformatics, vol. 19, no. 2, pp. 204-211, 2003.
[7] S. Dudoit, Y.H. Yang, M.J. Callow, and T.P. Speed, “Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments,” Statistica Sinica, vol. 12, no. 1, pp. 111-139, 2002.
[8] S. Dudoit, Y.H. Yang, M. Callow, and T.P. Speed, “Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments,” Technical Report #578, Univ. of California, Berkeley, 2000.
[9] G.R. Grant, J. Liu, and C.J. Stoeckert Jr, “A Practical False Discovery Rate Approach to Identifying Patterns of Differential Expression in Microarray Data,” Bioinformatics, vol. 21, no. 11, pp. 2684-2690, June 2005.
[10] G.R. Grant, E. Manduchi, and C.J. Stoeckert, “Using Non-Parametric Methods in the Context of Multiple Testing to Identify Differentially Expressed Genes,” Proc. Critical Assessment of Techniques for Microarray Data Analysis (CAMDA), pp. 37-55, Dec. 2000.
[11] I. Hedenfalk et al., “Gene-Expression Profiles in Hereditary Breast Cancer,” J. Medicine, vol. 344, no. 8, pp. 539-48, 2001.
[12] W.W. Hines, D.C. Montgomery, D.M. Goldsman, and C.M. Borror, Probability and Statistics in Engineering. John Wiley & Sons, 2003.
[13] E. Manduchi, G.R. Grant, S.E. McKenzei, G.C. Overton, S. Surrey, and C.J. Stoeckert, “Generation of Patterns from Gene Expression Data by Assigning Confidence to Differentially Expressed Genes,” Bioinformatics, vol. 16, no. 8, pp. 685-698, 2000.
[14] M. Schena, D. Shalon, R. Heller, A. Chai, P.O. Brown, and R.W. Davis, “Parallel Human Genome Analysis: Microarray-Based Expression Monitoring of 1000 Genes,” Proc. Nat'l Academy of Science, vol. 93, no. 20, pp. 10614-10619, 1996.
[15] T.P. Speed, S. Dudoit, and J. Fridlyand, “Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data,” J. Am. Statistical Assoc., vol. 97, no. 457, pp. 77-87, Mar. 2002.
[16] J.D. Storey, “False Discovery Rates: Theory and Applications to DNA Microarrays,” PhD thesis, Dept. of Statistics, Stanford Univ., 2002.
[17] J.D. Storey, J.E. Taylor, and D. Siegmund, “Strong Control, Conservative Point Estimation and Simultaneous Conservative Consistency of False Discovery Rates: A Unified Approach,” J. Royal Statistical Soc., vol. 66, no. 1, pp. 187-205, 2004.
[18] J.D. Storey and R. Tibshirani, “Statistical Significance for Genome-Wide Experiments,” manuscript, 2003.
[19] J.D. Storey and R. Tibshirani, “Statistical Significance for GenomeWide Studies,” PNAS Genetics, vol. 100, no. 16, pp. 9440-9445, 2003.

Index Terms:
q-values, significantly differentially expressed genes, microarray, pattern generation, PaGE.
Osman Abul, Reda Alhajj, Faruk Polat, "A Powerful Approach for Effective Finding of Significantly Differentially Expressed Genes," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 3, no. 3, pp. 220-231, July-Sept. 2006, doi:10.1109/TCBB.2006.29
Usage of this product signifies your acceptance of the Terms of Use.