Issue No.06 - Nov.-Dec. (2013 vol.10)
pp: 1422-1431
Ayshwarya Subramanian, Dept. of Biol. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Stanley Shackney, Oncotherapeutics, Pittsburgh, PA, USA
Russell Schwartz, Dept. of Biol. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
ABSTRACT
Computational cancer phylogenetics seeks to enumerate the temporal sequences of aberrations in tumor evolution, thereby delineating possible tumor progression pathways, molecular subtypes, and mechanisms of action. We previously developed a pipeline for constructing phylogenies that describe evolution between major recurring cell types computationally inferred from whole-genome tumor profiles. The accuracy and detail of these phylogenies, however, depend on identifying accurate, high-resolution molecular markers of progression, i.e., reproducible regions of aberration that robustly distinguish subtypes and stages of progression. Here, we present a novel hidden Markov model (HMM) scheme for inferring such phylogenetically significant markers through joint segmentation and calling of multisample tumor data. Our method takes sets of genome-wide DNA copy number measurements and, at each probe, partitions the samples into normal (diploid) or amplified. It differs from similar HMM methods in being designed specifically for the needs of tumor phylogenetics: it seeks robust markers of progression that are conserved across a set of copy number profiles. We compare our method with other methods on both synthetic and real tumor data; the analysis confirms its effectiveness for tumor phylogeny inference and suggests avenues for future advances.
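To make the per-probe calling step concrete, the following is a minimal sketch of a generic two-state HMM decoded with the Viterbi algorithm on a single copy number profile. It is not the multisample scheme presented in the paper: the function name `viterbi_two_state`, the Gaussian emission model, and all parameter values (state means, `sigma`, `p_stay`) are illustrative assumptions, and the input is assumed to be log2 copy number ratios ordered along the genome.

```python
import math


def viterbi_two_state(log_ratios, mu=(0.0, 0.58), sigma=0.25, p_stay=0.95):
    """Label each probe 0 (normal) or 1 (amplified).

    Hypothetical single-sample illustration, not the paper's method.
    mu:     assumed mean log2 ratio per state (0.58 is about log2(3/2),
            i.e. a single-copy gain); sigma: assumed shared noise SD;
    p_stay: assumed probability of keeping the same state at the next probe.
    """
    if not log_ratios:
        return []

    states = (0, 1)
    log_stay = math.log(p_stay)
    log_switch = math.log(1.0 - p_stay)

    def log_emit(x, s):
        # Gaussian log-density, dropping the constant shared by both states.
        z = (x - mu[s]) / sigma
        return -0.5 * z * z

    # Uniform prior over the two states at the first probe.
    score = [math.log(0.5) + log_emit(log_ratios[0], s) for s in states]
    back = []  # back[t][s] = best predecessor of state s at probe t + 1

    for x in log_ratios[1:]:
        new_score, ptr = [], []
        for s in states:
            cands = [score[r] + (log_stay if r == s else log_switch)
                     for r in states]
            best = max(states, key=lambda r: cands[r])
            ptr.append(best)
            new_score.append(cands[best] + log_emit(x, s))
        score = new_score
        back.append(ptr)

    # Trace the most probable state path back from the last probe.
    state = max(states, key=lambda s: score[s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


profile = [0.02, -0.05, 0.10, 0.61, 0.55, 0.70, 0.60, 0.03, -0.02]
calls = viterbi_two_state(profile)
print(calls)  # [0, 0, 0, 1, 1, 1, 1, 0, 0]
```

With these assumed parameters, the four consecutive elevated probes are called amplified, while an isolated elevated probe in an otherwise flat profile would be smoothed over as noise because two state switches cost more than one probe's emission gain. A multisample method of the kind the abstract describes would additionally share information across profiles, which this sketch does not attempt.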
INDEX TERMS
Hidden Markov models, Phylogeny, Tumors, Cancer, Bioinformatics, Genomics, Data models, Segmentation, Biology and genetics, Health, Trees
CITATION
Ayshwarya Subramanian, Stanley Shackney, Russell Schwartz, "Novel Multisample Scheme for Inferring Phylogenetic Markers from Whole Genome Tumor Profiles", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. 6, pp. 1422-1431, Nov.-Dec. 2013, doi:10.1109/TCBB.2013.33