{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:00Z","timestamp":1772138040229,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2020,6,27]],"date-time":"2020-06-27T00:00:00Z","timestamp":1593216000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100010663","name":"H2020 European Research Council","doi-asserted-by":"publisher","award":["766030"],"award-info":[{"award-number":["766030"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010663","name":"H2020 European Research Council","doi-asserted-by":"publisher","award":["609883"],"award-info":[{"award-number":["609883"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,12,8]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>The high resolution of single-cell DNA sequencing (scDNA-seq) offers great potential to resolve intratumor heterogeneity (ITH) by distinguishing clonal populations based on their mutation profiles. 
However, the increasing size of scDNA-seq datasets and technical limitations, such as high error rates and a large proportion of missing values, complicate this task and limit the applicability of existing methods.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here, we introduce BnpC, a novel non-parametric method to cluster individual cells into clones and infer their genotypes based on their noisy mutation profiles. We benchmarked our method comprehensively against state-of-the-art methods on simulated data using various data sizes, and applied it to three cancer scDNA-seq datasets. On simulated data, BnpC compared favorably against current methods in terms of accuracy, runtime and scalability. Its inferred genotypes were the most accurate, especially on highly heterogeneous data, and it was the only method able to run and produce results on datasets with 5000 cells. On tumor scDNA-seq data, BnpC was able to identify clonal populations missed by the original cluster analysis but supported by supplementary experimental data. 
With ever growing scDNA-seq datasets, scalable and accurate methods such as BnpC will become increasingly relevant, not only to resolve ITH but also as a preprocessing step to reduce data size.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>BnpC is freely available under MIT license at https:\/\/github.com\/cbg-ethz\/BnpC.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa599","type":"journal-article","created":{"date-parts":[[2020,6,19]],"date-time":"2020-06-19T07:14:32Z","timestamp":1592550872000},"page":"4854-4859","source":"Crossref","is-referenced-by-count":20,"title":["BnpC: Bayesian non-parametric clustering of single-cell mutation profiles"],"prefix":"10.1093","volume":"36","author":[{"given":"Nico","family":"Borgsm\u00fcller","sequence":"first","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Z\u00fcrich , Basel 4058, Switzerland"},{"name":"SIB, Swiss Institute of Bioinformatics , Basel 4058, Switzerland"}]},{"given":"Jose","family":"Bonet","sequence":"additional","affiliation":[{"name":"Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology , Barcelona 08028, Spain"},{"name":"Research Program on Biomedical Informatics, Universitat Pompeu Fabra , Barcelona, Catalonia 08002, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8993-7320","authenticated-orcid":false,"given":"Francesco","family":"Marass","sequence":"additional","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Z\u00fcrich , Basel 4058, Switzerland"},{"name":"SIB, Swiss Institute of Bioinformatics , 
Basel 4058, Switzerland"}]},{"given":"Abel","family":"Gonzalez-Perez","sequence":"additional","affiliation":[{"name":"Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology , Barcelona 08028, Spain"},{"name":"Research Program on Biomedical Informatics, Universitat Pompeu Fabra , Barcelona, Catalonia 08002, Spain"}]},{"given":"Nuria","family":"Lopez-Bigas","sequence":"additional","affiliation":[{"name":"Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology , Barcelona 08028, Spain"},{"name":"Instituci\u00f3 Catalana de Recerca i Estudis Avan\u00e7ats (ICREA) , Barcelona 08010, Spain"}]},{"given":"Niko","family":"Beerenwinkel","sequence":"additional","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Z\u00fcrich , Basel 4058, Switzerland"},{"name":"SIB, Swiss Institute of Bioinformatics , Basel 4058, Switzerland"}]}],"member":"286","published-online":{"date-parts":[[2020,6,27]]},"reference":[{"key":"2023062408062844200_btaa599-B1","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1038\/nature12625","article-title":"The causes and consequences of genetic heterogeneity in cancer evolution","volume":"501","author":"Burrell","year":"2013","journal-title":"Nature"},{"key":"2023062408062844200_btaa599-B2","author":"Ciccolella","year":"2018"},{"key":"2023062408062844200_btaa599-B3","author":"Ciccolella","year":"2019"},{"key":"2023062408062844200_btaa599-B4","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1016\/j.bbcan.2017.01.003","article-title":"Tumor evolution: linear, branching, neutral or punctuated?","volume":"1867","author":"Davis","year":"2017","journal-title":"Biochim. Biophys. Acta Rev. 
Cancer"},{"key":"2023062408062844200_btaa599-B5","doi-asserted-by":"crossref","first-page":"i671","DOI":"10.1093\/bioinformatics\/bty589","article-title":"SPhyR: tumor phylogeny estimation from single-cell sequencing data under loss and error","volume":"34","author":"El-Kebir","year":"2018","journal-title":"Bioinformatics"},{"key":"2023062408062844200_btaa599-B6","doi-asserted-by":"crossref","first-page":"577","DOI":"10.1080\/01621459.1995.10476550","article-title":"Bayesian density estimation and inference using mixtures","volume":"90","author":"Escobar","year":"1995","journal-title":"J. Am. Stat. Assoc"},{"key":"2023062408062844200_btaa599-B7","author":"Est\u00e9vez-G\u00f3mez","year":"2018"},{"key":"2023062408062844200_btaa599-B8","doi-asserted-by":"crossref","first-page":"956","DOI":"10.1158\/2159-8290.CD-13-0879","article-title":"EGFR variant heterogeneity in glioblastoma resolved through single-nucleus sequencing","volume":"4","author":"Francis","year":"2014","journal-title":"Cancer Discov"},{"key":"2023062408062844200_btaa599-B9","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1214\/09-BA414","article-title":"Improved criteria for clustering based on the posterior similarity matrix","volume":"4","author":"Fritsch","year":"2009","journal-title":"Bayesian Anal"},{"key":"2023062408062844200_btaa599-B10","doi-asserted-by":"crossref","first-page":"17947","DOI":"10.1073\/pnas.1420822111","article-title":"Dissecting the clonal origins of childhood acute lymphoblastic leukemia by single-cell genomics","volume":"111","author":"Gawad","year":"2014","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023062408062844200_btaa599-B11","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1038\/nrc3298","article-title":"Evolutionary dynamics of carcinogenesis and why targeted therapy does not work","volume":"12","author":"Gillies","year":"2012","journal-title":"Nat. Rev. 
Cancer"},{"key":"2023062408062844200_btaa599-B12","doi-asserted-by":"crossref","DOI":"10.1186\/s13059-016-0936-x","article-title":"Tree inference for single-cell data","volume":"17","author":"Jahn","year":"2016","journal-title":"Genome Biol"},{"key":"2023062408062844200_btaa599-B13","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1198\/1061860043001","article-title":"A split-merge Markov chain Monte Carlo procedure for the Dirichlet process mixture model","volume":"13","author":"Jain","year":"2004","journal-title":"J. Comput. Graph. Stat"},{"key":"2023062408062844200_btaa599-B14","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1214\/07-BA219","article-title":"Splitting and merging components of a nonconjugate Dirichlet process mixture model","volume":"2","author":"Jain","year":"2007","journal-title":"Bayesian Anal"},{"key":"2023062408062844200_btaa599-B15","doi-asserted-by":"crossref","first-page":"1860","DOI":"10.1101\/gr.234435.118","article-title":"PhISCS: a combinatorial approach for subperfect tumor phylogeny reconstruction via integrative use of single-cell and bulk sequencing data","volume":"29","author":"Malikic","year":"2019","journal-title":"Genome Res"},{"key":"2023062408062844200_btaa599-B16","doi-asserted-by":"crossref","first-page":"758","DOI":"10.1038\/ng.3573","article-title":"Divergent modes of clonal spread and intraperitoneal mixing in high-grade serous ovarian cancer","volume":"48","author":"McPherson","year":"2016","journal-title":"Nat. Genet"},{"key":"2023062408062844200_btaa599-B17","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1080\/10618600.2000.10474879","article-title":"Markov chain sampling methods for Dirichlet process mixture models","volume":"9","author":"Neal","year":"2000","journal-title":"J. Comput. Graph. 
Stat"},{"key":"2023062408062844200_btaa599-B18","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1007\/BF01213386","article-title":"Exchangeable and partially exchangeable random partitions","volume":"102","author":"Pitman","year":"1995","journal-title":"Probab. Theory Relat. Fields"},{"key":"2023062408062844200_btaa599-B19","first-page":"410","author":"Rosenberg","year":"2007"},{"key":"2023062408062844200_btaa599-B20","doi-asserted-by":"crossref","DOI":"10.1186\/s13059-016-0929-9","article-title":"OncoNEM: inferring tumor evolution from single-cell sequencing data","volume":"17","author":"Ross","year":"2016","journal-title":"Genome Biol"},{"key":"2023062408062844200_btaa599-B21","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1038\/nmeth.3867","article-title":"Clonal genotype and population structure inference from single-cell tumor sequencing","volume":"13","author":"Roth","year":"2016","journal-title":"Nat. Methods"},{"key":"2023062408062844200_btaa599-B22","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1038\/nrg.2016.170","article-title":"The evolution of tumour phylogenetics: principles and practice","volume":"18","author":"Schwartz","year":"2017","journal-title":"Nat. Rev. 
Genet"},{"key":"2023062408062844200_btaa599-B23","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1016\/j.cell.2018.03.043","article-title":"Deterministic evolutionary trajectories influence primary tumor growth: TRACERx renal","volume":"173","author":"Turajlic","year":"2018","journal-title":"Cell"},{"key":"2023062408062844200_btaa599-B24","author":"Vats","year":"2018"},{"key":"2023062408062844200_btaa599-B25","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1038\/nature13600","article-title":"Clonal evolution in breast cancer revealed by single nucleus genome sequencing","volume":"512","author":"Wang","year":"2014","journal-title":"Nature"},{"key":"2023062408062844200_btaa599-B26","volume-title":"The Biology of Cancer","year":"2014"},{"key":"2023062408062844200_btaa599-B27","doi-asserted-by":"crossref","first-page":"2857","DOI":"10.1038\/onc.2016.438","article-title":"Evolution and heterogeneity of non-hereditary colorectal cancer revealed by single-cell exome sequencing","volume":"36","author":"Wu","year":"2017","journal-title":"Oncogene"},{"key":"2023062408062844200_btaa599-B28","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1186\/s13059-015-0592-6","article-title":"BitPhylogeny: a probabilistic framework for reconstructing intra-tumor phylogenies","volume":"16","author":"Yuan","year":"2015","journal-title":"Genome Biol"},{"key":"2023062408062844200_btaa599-B29","doi-asserted-by":"crossref","DOI":"10.1186\/s13059-017-1311-2","article-title":"SiFit: inferring tumor trees from single-cell sequencing data under finite-sites models","volume":"18","author":"Zafar","year":"2017","journal-title":"Genome Biol"},{"key":"2023062408062844200_btaa599-B30","doi-asserted-by":"crossref","first-page":"1847","DOI":"10.1101\/gr.243121.118","article-title":"SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing 
data","volume":"29","author":"Zafar","year":"2019","journal-title":"Genome Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa599\/33830604\/btaa599.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/19\/4854\/50692587\/btaa599.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/19\/4854\/50692587\/btaa599.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,7]],"date-time":"2024-08-07T20:36:06Z","timestamp":1723062966000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/19\/4854\/5864024"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,6,27]]},"references-count":30,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2020,12,8]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa599","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.01.15.907345","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,10,1]]},"published":{"date-parts":[[2020,6,27]]}}}