Resumen
We present Epiclomal, a probabilistic clustering method arising from a hierarchical mixture model to simultaneously cluster sparse single-cell DNA methylation data and impute missing values. Using synthetic and published single-cell CpG datasets, we show that Epiclomal outperforms non-probabilistic methods and can handle the inherent missing data characteristic that dominates single-cell CpG genome sequences. Using newly generated single-cell 5mCpG sequencing data, we show that Epiclomal discovers sub-clonal methylation patterns in aneuploid tumour genomes, thus defining epiclones that can match or transcend copy number-determined clonal lineages and opening up an important form of clonal analysis in cancer. Epiclomal is written in R and Python and is available at https://github.com/shahcompbio/Epiclomal.
Idioma original | English |
---|---|
Número de artículo | 1008270 |
Publicación | PLoS Computational Biology |
Volumen | 16 |
N.º | 9 |
DOI | |
Estado | Published - sep. 2020 |
Publicado de forma externa | Sí |
Nota bibliográfica
Funding Information:We acknowledge generous funding support provided by the BC Cancer Foundation. S. A. is supported by grants from CIHR, Terry Fox Research Institute, Canadian Cancer Society Research Institute, and the Breast Cancer Research Foundation. S.P.S. is supported by CIHR, Terry Fox Research Institute (grant 1082) and the Canadian Cancer Society (grant 705636). C.P.E.d.S. is supported by the National Science and Engineering Research Council of Canada. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Publisher Copyright:
© 2020 P. E. de Souza et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
ASJC Scopus Subject Areas
- Ecology, Evolution, Behavior and Systematics
- Ecology
- Modelling and Simulation
- Molecular Biology
- Genetics
- Cellular and Molecular Neuroscience
- Computational Theory and Mathematics