The Community for Technology Leaders
2018 IEEE 8th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) (2018)
Las Vegas, NV, USA
Oct. 18, 2018 to Oct. 20, 2018
ISSN: 2164-229X
ISBN: 978-1-5386-8521-1
pp: 1
Simone Ciccolella , Department of Computer Systems and Communication, University of Milano-Bicocca, Milan, Italy
Mauricio Soto Gomez , Department of Computer Systems and Communication, University of Milano-Bicocca, Milan, Italy
Murray Patterson , Department of Computer Systems and Communication, University of Milano-Bicocca, Milan, Italy
Gianluca Della Vedova , Department of Computer Systems and Communication, University of Milano-Bicocca, Milan, Italy
Iman Hajirasouliha , Institute for Computational Biomedicine, Weill Cornell, Medicine of Cornell University, NY, USA
Paola Bonizzoni , Department of Computer Systems and Communication, University of Milano-Bicocca, Milan, Italy
ABSTRACT
BackgroundCancer progression reconstruction is an important development stemming from the phylogenetics field, where the goal is to infer the mutational history of a set of tumor cells carrying these cancerous mutations. Many methods have been developed in recent years for inferring such a history from bulk-sequencing data, where they construct a perfect phylogeny of the mutations: a mutation which is gained in a history is never lost. Single Cell Sequencing (SCS) technologies are an emerging alternative which offers a much higher resolution, providing evidence of the existence of back mutations in cancer: a phenomenon which is currently widely ignored, and is not modeled by a perfect phylogeny.ResultsFor these reasons we present gpps, an approach which combines Integer Linear Programming (ILP) with a Hill Climbing approach for reconstructing a tumor phylogeny from SCS data according to a more general model than the perfect phylogeny: allowing each mutation to be lost at most a fixed number of times, thus modeling back mutations. We test gpps on real data as well as synthetic data which simulates the error rates of SCS technologies, comparing to state-of-the-art tumor phylogeny inference methods. Here we reveal that gpps performs as well as or better than any of the tools, even on measures which do not take into account mutation losses. Most notably, we have been able to analyze a ER+ breast cancer dataset consisting of 40 somatic mutations over 47 cells, confirming the driver mutations of the original study.
INDEX TERMS
Phylogeny, Cancer, History, Tumors, Back, Data models, Tools
CITATION

S. Ciccolella, M. S. Gomez, M. Patterson, G. D. Vedova, I. Hajirasouliha and P. Bonizzoni, "gpps: an ILP-based approach for inferring cancer progression with mutation losses from single cell data," 2018 IEEE 8th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Las Vegas, NV, USA, 2018, pp. 1.
doi:10.1109/ICCABS.2018.8542058
618 ms
(Ver 3.3 (11022016))