{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T21:49:54Z","timestamp":1773006594778,"version":"3.50.1"},"reference-count":23,"publisher":"Oxford University Press (OUP)","issue":"24","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Most tumor samples are a heterogeneous mixture of cells, including admixture by normal (non-cancerous) cells and subpopulations of cancerous cells with different complements of somatic aberrations. This intra-tumor heterogeneity complicates the analysis of somatic aberrations in DNA sequencing data from tumor samples.<\/jats:p>\n               <jats:p>Results: We describe an algorithm called THetA2 that infers the composition of a tumor sample\u2014including not only tumor purity but also the number and content of tumor subpopulations\u2014directly from both whole-genome (WGS) and whole-exome (WXS) high-throughput DNA sequencing data. This algorithm builds on our earlier Tumor Heterogeneity Analysis (THetA) algorithm in several important directions. These include improved ability to analyze highly rearranged genomes using a variety of data types: both WGS sequencing (including low \u223c7\u00d7 coverage) and WXS sequencing. We apply our improved THetA2 algorithm to WGS (including low-pass) and WXS sequence data from 18 samples from The Cancer Genome Atlas (TCGA). We find that the improved algorithm is substantially faster and identifies numerous tumor samples containing subclonal populations in the TCGA data, including in one highly rearranged sample for which other tumor purity estimation algorithms were unable to estimate tumor purity.<\/jats:p>\n               <jats:p>Availability and implementation: An implementation of THetA2 is available at http:\/\/compbio.cs.brown.edu\/software<\/jats:p>\n               <jats:p>Contact: \u00a0layla@cs.brown.edu or braphael@brown.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu651","type":"journal-article","created":{"date-parts":[[2014,10,9]],"date-time":"2014-10-09T01:49:44Z","timestamp":1412819384000},"page":"3532-3540","source":"Crossref","is-referenced-by-count":110,"title":["Quantifying tumor heterogeneity in whole-genome and whole-exome sequencing data"],"prefix":"10.1093","volume":"30","author":[{"given":"Layla","family":"Oesper","sequence":"first","affiliation":[{"name":"1 Department of Computer Science and 2 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"}]},{"given":"Gryte","family":"Satas","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and 2 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"}]},{"given":"Benjamin J.","family":"Raphael","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and 2 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"},{"name":"1 Department of Computer Science and 2 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"}]}],"member":"286","published-online":{"date-parts":[[2014,10,8]]},"reference":[{"key":"2023012712044312100_btu651-B1","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1038\/ng1215","article-title":"Chromosome aberrations in solid tumors","volume":"34","author":"Albertson","year":"2003","journal-title":"Nat. Genet."},{"key":"2023012712044312100_btu651-B2","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1093\/bioinformatics\/btt622","article-title":"Expands: expanding ploidy and allele frequency on nested subpopulations","volume":"30","author":"Andor","year":"2014","journal-title":"Bioinformatics"},{"key":"2023012712044312100_btu651-B3","doi-asserted-by":"crossref","first-page":"e72","DOI":"10.1093\/nar\/gks001","article-title":"Summarizing and correcting the GC content bias in high-throughput sequencing","volume":"40","author":"Benjamini","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023012712044312100_btu651-B4","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1038\/nature11412","article-title":"Comprehensive molecular portraits of human breast tumours","volume":"490","author":"Cancer Genome Atlas Network","year":"2012","journal-title":"Nature"},{"key":"2023012712044312100_btu651-B5","doi-asserted-by":"crossref","first-page":"2059","DOI":"10.1056\/NEJMoa1301689","article-title":"Genomic and epigenomic landscapes of adult \n              de novo\n               acute myeloid leukemia","volume":"368","author":"Cancer Genome Atlas Research Network","year":"2013","journal-title":"N. Engl. J. Med."},{"key":"2023012712044312100_btu651-B6","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1038\/nbt.2203","article-title":"Absolute quantification of somatic DNA alterations in human cancer","volume":"30","author":"Carter","year":"2012","journal-title":"Nat. Biotechnol."},{"key":"2023012712044312100_btu651-B7","doi-asserted-by":"crossref","first-page":"R188","DOI":"10.1093\/hmg\/ddq391","article-title":"Analysis of next-generation genomic data in cancer: accomplishments and challenges","volume":"19","author":"Ding","year":"2010","journal-title":"Hum. Mol. Genet."},{"key":"2023012712044312100_btu651-B8","doi-asserted-by":"crossref","first-page":"883","DOI":"10.1056\/NEJMoa1113205","article-title":"Intratumor heterogeneity and branched evolution revealed by multiregion sequencing","volume":"366","author":"Gerlinger","year":"2012","journal-title":"N. Engl. J. Med."},{"key":"2023012712044312100_btu651-B9","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1038\/nature10762","article-title":"Clonal evolution in cancer","volume":"481","author":"Greaves","year":"2012","journal-title":"Nature"},{"key":"2023012712044312100_btu651-B10","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1093\/bioinformatics\/btr593","article-title":"Correcting for cancer genome size and tumour cell content enables better estimation of copy number alterations from next-generation sequence data","volume":"28","author":"Gusnanto","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012712044312100_btu651-B11","doi-asserted-by":"crossref","first-page":"i78","DOI":"10.1093\/bioinformatics\/btu284","article-title":"A combinatorial approach for analyzing intra-tumor heterogeneity from high-throughput sequencing data","volume":"30","author":"Hajirasouliha","year":"2014","journal-title":"Bioinformatics"},{"key":"2023012712044312100_btu651-B12","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1186\/1471-2105-15-35","article-title":"Inferring clonal evolution of tumors from single nucleotide somatic mutations","volume":"15","author":"Jiao","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023012712044312100_btu651-B13","doi-asserted-by":"crossref","first-page":"1888","DOI":"10.1093\/bioinformatics\/btt293","article-title":"Purbayes: estimating tumor cellularity and subclonality in next-generation sequencing data","volume":"29","author":"Larson","year":"2013","journal-title":"Bioinformatics"},{"key":"2023012712044312100_btu651-B14","doi-asserted-by":"crossref","first-page":"R120","DOI":"10.1186\/gb-2013-14-10-r120","article-title":"Excavator: detecting copy number variants from whole-exome sequencing data","volume":"14","author":"Magi","year":"2013","journal-title":"Genome Biol."},{"key":"2023012712044312100_btu651-B15","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1038\/nrg2841","article-title":"Advances in understanding cancer genomes through second-generation sequencing","volume":"11","author":"Meyerson","year":"2010","journal-title":"Nat. Rev. Genet."},{"key":"2023012712044312100_btu651-B16","doi-asserted-by":"crossref","first-page":"1377","DOI":"10.1126\/science.1164266","article-title":"Genomic analysis of the clonal origins of relapsed acute lymphoblastic leukemia","volume":"322","author":"Mullighan","year":"2008","journal-title":"Science"},{"key":"2023012712044312100_btu651-B17","doi-asserted-by":"crossref","first-page":"994","DOI":"10.1016\/j.cell.2012.04.023","article-title":"The life history of 21 breast cancers","volume":"149","author":"Nik-Zainal","year":"2012","journal-title":"Cell"},{"key":"2023012712044312100_btu651-B18","doi-asserted-by":"crossref","first-page":"R80","DOI":"10.1186\/gb-2013-14-7-r80","article-title":"THetA: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data","volume":"14","author":"Oesper","year":"2013","journal-title":"Genome Biol."},{"key":"2023012712044312100_btu651-B19","doi-asserted-by":"crossref","first-page":"396","DOI":"10.1038\/nmeth.2883","article-title":"Pyclone: statistical inference of clonal population structure in cancer","volume":"11","author":"Roth","year":"2014","journal-title":"Nat. Methods"},{"key":"2023012712044312100_btu651-B20","doi-asserted-by":"crossref","first-page":"2648","DOI":"10.1093\/bioinformatics\/btr462","article-title":"Exome sequencing-based copy-number variation and loss of heterozygosity detection: exomecnv","volume":"27","author":"Sathirapongsasuti","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012712044312100_btu651-B21","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1038\/nrc3655","article-title":"Paediatric and adult glioblastoma: multiform (epi)genomic culprits emerge","volume":"14","author":"Sturm","year":"2014","journal-title":"Nat. Rev. Cancer"},{"key":"2023012712044312100_btu651-B22","doi-asserted-by":"crossref","first-page":"E1128","DOI":"10.1073\/pnas.1110574108","article-title":"Copy number variation detection in whole-genome sequencing data using the bayesian information criterion","volume":"108","author":"Xi","year":"2011","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012712044312100_btu651-B23","article-title":"An assessment of computational methods for estimating purity and clonality using genomic data derived from heterogeneous tumor tissue samples","volume-title":"Brief. Bioinform","author":"Yadav","year":"2014"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/24\/3532\/48931377\/bioinformatics_30_24_3532.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/24\/3532\/48931377\/bioinformatics_30_24_3532.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T13:02:59Z","timestamp":1674824579000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/24\/3532\/2422230"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,10,8]]},"references-count":23,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2014,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu651","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2014,12,15]]},"published":{"date-parts":[[2014,10,8]]}}}