{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:11Z","timestamp":1772138051927,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"9","license":[{"start":{"date-parts":[[2022,3,17]],"date-time":"2022-03-17T00:00:00Z","timestamp":1647475200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"AIRC under MFAG","award":["2020-ID. 24913"],"award-info":[{"award-number":["2020-ID. 24913"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,4,28]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Cancers are composed by several heterogeneous subpopulations, each one harbouring different genetic and epigenetic somatic alterations that contribute to disease onset and therapy response. In recent years, copy number alterations (CNAs) leading to tumour aneuploidy have been identified as potential key drivers of such populations, but the definition of the precise makeup of cancer subclones from sequencing assays remains challenging. In the end, little is known about the mapping between complex CNAs and their effect on cancer phenotypes.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We introduce CONGAS, a Bayesian probabilistic method to phase bulk DNA and single-cell RNA measurements from independent assays. CONGAS jointly identifies clusters of single cells with subclonal CNAs, and differences in RNA expression. The model builds statistical priors leveraging bulk DNA sequencing data, does not require a normal reference and scales fast thanks to a GPU backend and variational inference. We test CONGAS on both simulated and real data, and find that it can determine the tumour subclonal composition at the single-cell level together with clone-specific RNA phenotypes in tumour data generated from both 10\u00d7 and Smart-Seq assays.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>CONGAS is available as 2 packages: CONGAS (https:\/\/github.com\/caravagnalab\/congas), which implements the model in Python, and RCONGAS (https:\/\/caravagnalab.github.io\/rcongas\/), which provides R functions to process inputs, outputs and run CONGAS fits. The analysis of real data and scripts to generate figures of this paper are available via RCONGAS; code associated to simulations is available at https:\/\/github.com\/caravagnalab\/rcongas_test.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btac143","type":"journal-article","created":{"date-parts":[[2022,3,16]],"date-time":"2022-03-16T08:31:27Z","timestamp":1647419487000},"page":"2512-2518","source":"Crossref","is-referenced-by-count":12,"title":["A Bayesian method to cluster single-cell RNA sequencing data using copy number alterations"],"prefix":"10.1093","volume":"38","author":[{"given":"Salvatore","family":"Milite","sequence":"first","affiliation":[{"name":"Department of Mathematics and Geosciences, University of Trieste , Trieste 34127, Italy"}]},{"given":"Riccardo","family":"Bergamin","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Geosciences, University of Trieste , Trieste 34127, Italy"}]},{"given":"Lucrezia","family":"Patruno","sequence":"additional","affiliation":[{"name":"Department of Informatics, Systems and Communication, University of Milano-Bicocca , Milano 20125, Italy"}]},{"given":"Nicola","family":"Calonaci","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Geosciences, University of Trieste , Trieste 34127, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4240-3265","authenticated-orcid":false,"given":"Giulio","family":"Caravagna","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Geosciences, University of Trieste , Trieste 34127, Italy"}]}],"member":"286","published-online":{"date-parts":[[2022,3,17]]},"reference":[{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"1923","DOI":"10.1038\/s41467-020-15596-z","article-title":"Exploiting evolutionary steering to induce collateral drug sensitivity in cancer","volume":"11","author":"Acar","year":"2020","journal-title":"Nat. Commun"},{"key":"2023041402563023500_","first-page":"1","article-title":"Pyro: deep universal probabilistic programming","volume":"20","author":"Bingham","year":"2019","journal-title":"J. Mach. Learn. Res"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1080\/01621459.2017.1285773","article-title":"Variational inference: a review for statisticians","volume":"112","author":"Blei","year":"2017","journal-title":"J. Am. Stat. Assoc"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1186\/s13059-019-1645-z","article-title":"clonealign: statistical integration of independent single-cell RNA and DNA sequencing data from human cancers","volume":"20","author":"Campbell","year":"2019","journal-title":"Genome Biol"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"20200075","DOI":"10.1515\/sagmb-2020-0075","article-title":"Measuring evolutionary cancer dynamics from genome sequencing, one patient at a time","volume":"19","author":"Caravagna","year":"2020","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"E4025","DOI":"10.1073\/pnas.1520213113","article-title":"Algorithmic methods to infer the evolutionary trajectories in cancer progression","volume":"113","author":"Caravagna","year":"2016","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"707","DOI":"10.1038\/s41592-018-0108-x","article-title":"Detecting repeated cancer evolution from multi-region tumor sequencing data","volume":"15","author":"Caravagna","year":"2018","journal-title":"Nat. Methods"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"898","DOI":"10.1038\/s41588-020-0675-5","article-title":"Subclonal reconstruction of tumors by using machine learning and population genetics","volume":"52","author":"Caravagna","year":"2020","journal-title":"Nat. Genet"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"422","DOI":"10.1038\/nature13952","article-title":"Dynamics of genomic clones in breast cancer patient xenografts at single-cell resolution","volume":"518","author":"Eirew","year":"2015","journal-title":"Nature"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"1217","DOI":"10.1101\/gr.228080.117","article-title":"Linking transcriptional and genetic tumor heterogeneity through allele analysis of single-cell RNA-seq data","volume":"28","author":"Fan","year":"2018","journal-title":"Genome Res"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"1058","DOI":"10.1038\/nmeth.3578","article-title":"Interactive analysis and assessment of single-cell copy-number variations","volume":"12","author":"Garvin","year":"2015","journal-title":"Nat. Methods"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1038\/nature10762","article-title":"Clonal evolution in cancer","volume":"481","author":"Greaves","year":"2012","journal-title":"Nature"},{"key":"2023041402563023500_","author":"Househam","year":"2021"},{"key":"2023041402563023500_","author":"Kuipers","year":"2020"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/s13059-020-1926-6","article-title":"Eleven grand challenges in single-cell data science","volume":"21","author":"L\u00e4hnemann","year":"2020","journal-title":"Genome Biol"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1038\/nmeth.3370","article-title":"G&T-seq: parallel sequencing of single-cell genomes and transcriptomes","volume":"12","author":"Macaulay","year":"2015","journal-title":"Nat. Methods"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"1262","DOI":"10.1038\/s41588-018-0179-8","article-title":"Copy number signatures and mutational processes in ovarian carcinoma","volume":"50","author":"Macintyre","year":"2018","journal-title":"Nat. Genet"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1186\/s13073-019-0648-4","article-title":"Somatic mutation and clonal expansions in human tissues","volume":"11","author":"Martincorena","year":"2019","journal-title":"Genome Med"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"880","DOI":"10.1126\/science.aaa6806","article-title":"Tumor evolution. High burden and pervasive positive selection of somatic mutations in normal human skin","volume":"348","author":"Martincorena","year":"2015","journal-title":"Science"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1016\/j.ccell.2014.12.001","article-title":"Biological and therapeutic impact of intratumor heterogeneity in cancer evolution","volume":"27","author":"McGranahan","year":"2015","journal-title":"Cancer Cell"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1016\/j.cell.2017.01.018","article-title":"Clonal heterogeneity and tumor evolution: past, present, and the future","volume":"168","author":"McGranahan","year":"2017","journal-title":"Cell"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1186\/s13059-017-1267-2","article-title":"ReMixT: clone-specific genomic structure estimation in cancer","volume":"18","author":"McPherson","year":"2017","journal-title":"Genome Biol"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"1396","DOI":"10.1126\/science.1254257","article-title":"Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma","volume":"344","author":"Patel","year":"2014","journal-title":"Science"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1038\/nprot.2014.006","article-title":"Full-length RNA-seq from single cells using Smart-seq2","volume":"9","author":"Picelli","year":"2014","journal-title":"Nat. Protoc"},{"key":"2023041402563023500_","first-page":"451","author":"Rozenblatt-Rosen","year":"2017"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"770","DOI":"10.1038\/s41588-021-00873-4","article-title":"Separating measurement and expression models clarifies confusion in single-cell RNA sequencing analysis","volume":"53","author":"Sarkar","year":"2021","journal-title":"Nat. Genet."},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1038\/s41467-019-13779-x","article-title":"CaSpER identifies and visualizes CNV events by integrative analysis of single-cell or bulk RNA-sequencing data","volume":"11","author":"Serin Harmanci","year":"2020","journal-title":"Nat. Commun"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1038\/s41576-019-0114-6","article-title":"Resolving genetic heterogeneity in cancer","volume":"20","author":"Turajlic","year":"2019","journal-title":"Nat. Rev. Genet"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"920","DOI":"10.1126\/science.aao2774","article-title":"Patient-derived organoids model treatment response of metastatic gastrointestinal cancers","volume":"359","author":"Vlachogiannis","year":"2018","journal-title":"Science"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"731","DOI":"10.1093\/bib\/bbx004","article-title":"DNA copy number profiling using single-cell sequencing","volume":"19","author":"Wang","year":"2018","journal-title":"Brief. Bioinform"},{"key":"2023041402563023500_","first-page":"253","article-title":"Direct comparative analyses of 10x genomics chromium and Smart-seq2","author":"Wang","year":"2021"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1038\/s41586-020-2698-6","article-title":"Pervasive chromosomal instability and karyotype order in tumour evolution","volume":"587","author":"Watkins","year":"2020","journal-title":"Nature"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1038\/s41587-020-0661-6","article-title":"Characterizing allele- and haplotype-specific copy numbers in single cells with CHISEL","volume":"39","author":"Zaccaria","year":"2020","journal-title":"Nat. Biotechnol"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1186\/s13059-017-1305-0","article-title":"Splatter: simulation of single-cell RNA sequencing data","volume":"18","author":"Zappia","year":"2017","journal-title":"Genome Biol"},{"key":"2023041402563023500_","doi-asserted-by":"crossref","first-page":"2762","DOI":"10.1182\/blood-2017-08-803353","article-title":"Single-cell RNA-seq reveals a distinct transcriptome signature of aneuploid hematopoietic cells","volume":"130","author":"Zhao","year":"2017","journal-title":"Blood"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btac143\/43152825\/btac143.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/9\/2512\/49874020\/btac143.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/9\/2512\/49874020\/btac143.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,18]],"date-time":"2023-11-18T17:18:06Z","timestamp":1700327886000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/9\/2512\/6550058"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,3,17]]},"references-count":36,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2022,4,28]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btac143","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.02.02.429335","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,5,1]]},"published":{"date-parts":[[2022,3,17]]}}}