{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T04:48:15Z","timestamp":1770958095886,"version":"3.50.1"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"15","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Next-generation sequencing (NGS) has revolutionized the study of cancer genomes. However, the reads obtained from NGS of tumor samples often consist of a mixture of normal and tumor cells, which themselves can be of multiple clonal types. A prominent problem in the analysis of cancer genome sequencing data is deconvolving the mixture to identify the reads associated with tumor cells or a particular subclone of tumor cells. Solving the problem is, however, challenging because of the so-called \u2018identifiability problem\u2019, where different combinations of tumor purity and ploidy often explain the sequencing data equally well.<\/jats:p><jats:p>Results: We propose a new model to resolve the identifiability problem by integrating two types of sequencing information\u2014somatic copy number alterations and loss of heterozygosity\u2014within a unified probabilistic framework. We derive algorithms to solve our model, and implement them in a software package called PyLOH. We benchmark the performance of PyLOH using both simulated data and 12 breast cancer sequencing datasets and show that PyLOH outperforms existing methods in disambiguating the identifiability problem and estimating tumor purity.<\/jats:p><jats:p>Availability and implementation: The PyLOH package is written in Python and is publicly available at https:\/\/github.com\/uci-cbcl\/PyLOH .<\/jats:p><jats:p>Contact: \u00a0xhx@ics.uci.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu174","type":"journal-article","created":{"date-parts":[[2014,4,3]],"date-time":"2014-04-03T01:29:50Z","timestamp":1396488590000},"page":"2121-2129","source":"Crossref","is-referenced-by-count":33,"title":["Deconvolving tumor purity and ploidy by integrating copy number alterations and loss of heterozygosity"],"prefix":"10.1093","volume":"30","author":[{"given":"Yi","family":"Li","sequence":"first","affiliation":[{"name":"1 Department of Computer Science, 2 Institute for Genomics and Bioinformatics and 3 Center for Machine Learning and Intelligent Systems, University of California, Irvine, CA 92697, USA"}]},{"given":"Xiaohui","family":"Xie","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, 2 Institute for Genomics and Bioinformatics and 3 Center for Machine Learning and Intelligent Systems, University of California, Irvine, CA 92697, USA"},{"name":"1 Department of Computer Science, 2 Institute for Genomics and Bioinformatics and 3 Center for Machine Learning and Intelligent Systems, University of California, Irvine, CA 92697, USA"},{"name":"1 Department of Computer Science, 2 Institute for Genomics and Bioinformatics and 3 Center for Machine Learning and Intelligent Systems, University of California, Irvine, CA 92697, USA"}]}],"member":"286","published-online":{"date-parts":[[2014,4,2]]},"reference":[{"key":"2023012711323574800_btu174-B1","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1038\/nature11154","article-title":"Sequence analysis of mutations and translocations across breast cancer subtypes","volume":"486","author":"Banerji","year":"2012","journal-title":"Nature"},{"key":"2023012711323574800_btu174-B2","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1038\/nature11547","article-title":"Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes","volume":"491","author":"Biankin","year":"2012","journal-title":"Nature"},{"key":"2023012711323574800_btu174-B3","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1101\/gr.2012304","article-title":"High-resolution analysis of DNA copy number using oligonucleotide microarrays","volume":"14","author":"Bignell","year":"2004","journal-title":"Genome Res."},{"key":"2023012711323574800_btu174-B4","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1038\/ng.128","article-title":"Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing","volume":"40","author":"Campbell","year":"2008","journal-title":"Nat. Genet."},{"key":"2023012711323574800_btu174-B5","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1038\/nbt.2203","article-title":"Absolute quantification of somatic DNA alterations in human cancer","volume":"30","author":"Carter","year":"2012","journal-title":"Nat. Biotechnol."},{"key":"2023012711323574800_btu174-B6","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1038\/nmeth.1276","article-title":"High-resolution mapping of copy-number alterations with massively parallel sequencing","volume":"6","author":"Chiang","year":"2008","journal-title":"Nat. Methods"},{"key":"2023012711323574800_btu174-B7","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1038\/scientificamerican0307-50","article-title":"Mapping the cancer genome","volume":"296","author":"Collins","year":"2007","journal-title":"Sci. Am. Mag."},{"key":"2023012711323574800_btu174-B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster","year":"1977","journal-title":"J. R. Stat. Soc. B Methodol."},{"key":"2023012711323574800_btu174-B9","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1101\/gr.8.3.186","article-title":"Base-calling of automated sequencer traces using phred. II. Error probabilities","volume":"8","author":"Ewing","year":"1998","journal-title":"Genome Res."},{"key":"2023012711323574800_btu174-B10","doi-asserted-by":"crossref","first-page":"D945","DOI":"10.1093\/nar\/gkq929","article-title":"Cosmic: mining complete cancer genomes in the catalogue of somatic mutations in cancer","volume":"39","author":"Forbes","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012711323574800_btu174-B11","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1093\/biostatistics\/kxp045","article-title":"Picnic: an algorithm to predict absolute allelic copy number variation with microarray cancer data","volume":"11","author":"Greenman","year":"2010","journal-title":"Biostatistics"},{"key":"2023012711323574800_btu174-B12","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1093\/bioinformatics\/btr593","article-title":"Correcting for cancer genome size and tumour cell content enables better estimation of copy number alterations from next-generation sequence data","volume":"28","author":"Gusnanto","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012711323574800_btu174-B13","doi-asserted-by":"crossref","first-page":"993","DOI":"10.1038\/nature08987","article-title":"International network of cancer genome projects","volume":"464","author":"Hudson","year":"2010","journal-title":"Nature"},{"key":"2023012711323574800_btu174-B14","doi-asserted-by":"crossref","first-page":"568","DOI":"10.1101\/gr.129684.111","article-title":"Varscan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing","volume":"22","author":"Koboldt","year":"2012","journal-title":"Genome Res."},{"key":"2023012711323574800_btu174-B15","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/0888-7543(88)90007-9","article-title":"Genomic mapping by fingerprinting random clones: a mathematical analysis","volume":"2","author":"Lander","year":"1988","journal-title":"Genomics"},{"key":"2023012711323574800_btu174-B16","doi-asserted-by":"crossref","first-page":"1888","DOI":"10.1093\/bioinformatics\/btt293","article-title":"Purbayes: estimating tumor cellularity and subclonality in next-generation sequencing data","volume":"29","author":"Larson","year":"2013","journal-title":"Bioinformatics"},{"key":"2023012711323574800_btu174-B17","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1038\/79269","article-title":"Loss-of-heterozygosity analysis of small-cell lung carcinomas using single-nucleotide polymorphism arrays","volume":"18","author":"Lindblad-Toh","year":"2000","journal-title":"Nat. Biotechnol."},{"key":"2023012711323574800_btu174-B18","doi-asserted-by":"crossref","first-page":"R24","DOI":"10.1186\/gb-2013-14-3-r24","article-title":"Patchwork: allele-specific copy number analysis of whole genome sequenced tumor tissue","volume":"14","author":"Mayrhofer","year":"2013","journal-title":"Genome Biol."},{"key":"2023012711323574800_btu174-B19","doi-asserted-by":"crossref","first-page":"1126","DOI":"10.1101\/gr.10.8.1126","article-title":"Genome-wide detection of allelic imbalance using human SNPs and high-density DNA arrays","volume":"10","author":"Mei","year":"2000","journal-title":"Genome Res."},{"key":"2023012711323574800_btu174-B20","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1038\/nature09807","article-title":"Tumour evolution inferred by single-cell sequencing","volume":"472","author":"Navin","year":"2011","journal-title":"Nature"},{"key":"2023012711323574800_btu174-B21","doi-asserted-by":"crossref","first-page":"R80","DOI":"10.1186\/gb-2013-14-7-r80","article-title":"Theta: inferring intra-tumor heterogeneity from high-throughput DNA sequencing data","volume":"14","author":"Oesper","year":"2013","journal-title":"Genome Biol."},{"key":"2023012711323574800_btu174-B22","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1093\/biostatistics\/kxh008","article-title":"Circular binary segmentation for the analysis of array-based DNA copy number data","volume":"5","author":"Olshen","year":"2004","journal-title":"Biostatistics"},{"key":"2023012711323574800_btu174-B23","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1016\/j.mrrev.2008.05.004","article-title":"Many different tumor types have polyclonal tumor origin: evidence and implications","volume":"659","author":"Parsons","year":"2008","journal-title":"Mutat. Res."},{"key":"2023012711323574800_btu174-B24","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1038\/2524","article-title":"High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays","volume":"20","author":"Pinkel","year":"1998","journal-title":"Nat. Genet."},{"key":"2023012711323574800_btu174-B25","doi-asserted-by":"crossref","first-page":"R108","DOI":"10.1186\/gb-2011-12-10-r108","article-title":"Allele-specific copy number analysis of tumor samples with aneuploidy and tumor heterogeneity","volume":"12","author":"Rasmussen","year":"2011","journal-title":"Genome Biol."},{"key":"2023012711323574800_btu174-B26","doi-asserted-by":"crossref","first-page":"375","DOI":"10.2307\/1907835","article-title":"Identifiability of a linear relation between variables which are subject to error","volume":"18","author":"Reiers\u00f8l","year":"1950","journal-title":"Econometrica"},{"key":"2023012711323574800_btu174-B27","doi-asserted-by":"crossref","first-page":"2223","DOI":"10.1093\/bioinformatics\/btt375","article-title":"A comparative analysis of algorithms for somatic SNV detection in cancer","volume":"29","author":"Roberts","year":"2013","journal-title":"Bioinformatics"},{"key":"2023012711323574800_btu174-B28","doi-asserted-by":"crossref","first-page":"907","DOI":"10.1093\/bioinformatics\/bts053","article-title":"Jointsnvmix: a probabilistic model for accurate detection of somatic mutations in normal\/tumour paired next-generation sequencing data","volume":"28","author":"Roth","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012711323574800_btu174-B29","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1038\/35057149","article-title":"A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms","volume":"409","author":"Sachidanandam","year":"2001","journal-title":"Nature"},{"key":"2023012711323574800_btu174-B30","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1093\/nar\/29.1.308","article-title":"dbSNP: the NCBI database of genetic variation","volume":"29","author":"Sherry","year":"2001","journal-title":"Nucleic Acids Res."},{"key":"2023012711323574800_btu174-B31","doi-asserted-by":"crossref","first-page":"2265","DOI":"10.1093\/bioinformatics\/bts365","article-title":"Purityest: estimating purity of human tumor samples using next-generation sequencing data","volume":"28","author":"Su","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012711323574800_btu174-B32","doi-asserted-by":"crossref","first-page":"5365","DOI":"10.1093\/nar\/gkp493","article-title":"Integrated study of copy number states and genotype calls using high-density SNP arrays","volume":"37","author":"Sun","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012711323574800_btu174-B33","doi-asserted-by":"crossref","first-page":"16910","DOI":"10.1073\/pnas.1009843107","article-title":"Allele-specific copy number analysis of tumors","volume":"107","author":"Van Loo","year":"2010","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711323574800_btu174-B34","doi-asserted-by":"crossref","first-page":"E1128","DOI":"10.1073\/pnas.1110574108","article-title":"Copy number variation detection in whole-genome sequencing data using the bayesian information criterion","volume":"108","author":"Xi","year":"2011","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711323574800_btu174-B35","doi-asserted-by":"crossref","first-page":"2482","DOI":"10.1093\/bioinformatics\/btt416","article-title":"OncoSNP-SEQ: a statistical approach for the identification of somatic copy number alterations from next-generation sequencing of cancer genomes","volume":"29","author":"Yau","year":"2013","journal-title":"Bioinformatics"},{"key":"2023012711323574800_btu174-B36","doi-asserted-by":"crossref","first-page":"R92","DOI":"10.1186\/gb-2010-11-9-r92","article-title":"A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data","volume":"11","author":"Yau","year":"2010","journal-title":"Genome Biol."},{"key":"2023012711323574800_btu174-B37","doi-asserted-by":"crossref","DOI":"10.1126\/scitranslmed.3004330","article-title":"Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic profiling","volume":"4","author":"Yuan","year":"2012","journal-title":"Sci. Transl. Med."},{"key":"2023012711323574800_btu174-B38","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1038\/ng.2246","article-title":"Exome sequencing of gastric adenocarcinoma identifies recurrent somatic mutations in cell adhesion and chromatin remodeling genes","volume":"44","author":"Zang","year":"2012","journal-title":"Nat. Genet."},{"key":"2023012711323574800_btu174-B39","doi-asserted-by":"crossref","first-page":"3060","DOI":"10.1158\/0008-5472.CAN-03-3308","article-title":"An integrated view of copy number and allelic alterations in the cancer genome using single nucleotide polymorphism arrays","volume":"64","author":"Zhao","year":"2004","journal-title":"Cancer Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/15\/2121\/48926250\/bioinformatics_30_15_2121.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/15\/2121\/48926250\/bioinformatics_30_15_2121.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,25]],"date-time":"2024-05-25T16:06:11Z","timestamp":1716653171000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/15\/2121\/2390232"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,4,2]]},"references-count":39,"journal-issue":{"issue":"15","published-print":{"date-parts":[[2014,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu174","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2014,8,1]]},"published":{"date-parts":[[2014,4,2]]}}}