{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T10:28:31Z","timestamp":1760956111324,"version":"3.37.3"},"reference-count":22,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2021,3,14]],"date-time":"2021-03-14T00:00:00Z","timestamp":1615680000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01AG040211"],"award-info":[{"award-number":["R01AG040211"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004606","name":"Chang Gung Medical Foundation","doi-asserted-by":"publisher","award":["CLRPG3J0012","BMRPE97"],"award-info":[{"award-number":["CLRPG3J0012","BMRPE97"]}],"id":[{"id":"10.13039\/501100004606","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,29]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Summary<\/jats:title>\n                  <jats:p>In this article, we introduce a hierarchical clustering and Gaussian mixture model with expectation-maximization (EM) algorithm for detecting copy number variants (CNVs) using whole exome sequencing (WES) data. The R shiny package \u2018HCMMCNVs\u2019 is also developed for processing user-provided bam files, running CNVs detection algorithm and conducting visualization. Through applying our approach to 325 cancer cell lines in 22 tumor types from Cancer Cell Line Encyclopedia (CCLE), we show that our algorithm is competitive with other existing methods and feasible in using multiple cancer cell lines for CNVs estimation. In addition, by applying our approach to WES data of 120 oral squamous cell carcinoma (OSCC) samples, our algorithm, using the tumor sample only, exhibits more power in detecting CNVs as compared with the methods using both tumors and matched normal counterparts.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>HCMMCNVs R shiny software is freely available at github repository https:\/\/github.com\/lunching\/HCMM_CNVs.and Zenodo https:\/\/doi.org\/10.5281\/zenodo.4593371.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab183","type":"journal-article","created":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T20:11:24Z","timestamp":1615579884000},"page":"3026-3028","source":"Crossref","is-referenced-by-count":1,"title":["HCMMCNVs: hierarchical clustering mixture model of copy number variants detection using whole exome sequencing technology"],"prefix":"10.1093","volume":"37","author":[{"given":"Chi","family":"Song","sequence":"first","affiliation":[{"name":"Division of Biostatistics, Ohio State University , Columbus, OH 43210, USA"}]},{"given":"Shih-Chi","family":"Su","sequence":"additional","affiliation":[{"name":"Whole-Genome Research Core Laboratory of Human Diseases, Chang Gung Memorial Hospital , Keelung 204, Taiwan"}]},{"given":"Zhiguang","family":"Huo","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of Florida , Gainsville, FL 32611, USA"}]},{"given":"Suleyman","family":"Vural","sequence":"additional","affiliation":[{"name":"Memorial Sloan Kettering Cancer Center , New York, NY 10065, USA"}]},{"given":"James E","family":"Galvin","sequence":"additional","affiliation":[{"name":"Comprehensive Center for Brain Health, Department of Neurology, Miller School of Medicine, University of Miami , Miami, FL 33101, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8039-5350","authenticated-orcid":false,"given":"Lun-Ching","family":"Chang","sequence":"additional","affiliation":[{"name":"Department of Mathematical Sciences, Florida Atlantic University , Boca Raton, FL 33431, USA"}]}],"member":"286","published-online":{"date-parts":[[2021,3,14]]},"reference":[{"key":"2023061310564955600_btab183-B1","doi-asserted-by":"crossref","first-page":"1056","DOI":"10.1093\/bioinformatics\/btt759","article-title":"AbsCN-seq: a statistical method to estimate tumor purity, ploidy and absolute copy numbers from next-generation sequencing data","volume":"30","author":"Bao","year":"2014","journal-title":"Bioinformatics"},{"key":"2023061310564955600_btab183-B2","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1038\/nature11003","article-title":"The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity","volume":"483","author":"Barretina","year":"2012","journal-title":"Nature"},{"key":"2023061310564955600_btab183-B3","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1093\/bioinformatics\/btr670","article-title":"Control-FREEC: a tool for assessing copy number and allelic content using next-generation sequencing data","volume":"28","author":"Boeva","year":"2012","journal-title":"Bioinformatics"},{"key":"2023061310564955600_btab183-B4","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1038\/nbt.2203","article-title":"Absolute quantification of somatic DNA alterations in human cancer","volume":"30","author":"Carter","year":"2012","journal-title":"Nat. Biotechnol"},{"key":"2023061310564955600_btab183-B5","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1093\/annonc\/mdu479","article-title":"Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data","volume":"26","author":"Favero","year":"2015","journal-title":"Ann. Oncol"},{"key":"2023061310564955600_btab183-B6","doi-asserted-by":"crossref","first-page":"e39","DOI":"10.1093\/nar\/gku1363","article-title":"CODEX: a normalization and copy number variation detection method for whole exome sequencing","volume":"43","author":"Jiang","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023061310564955600_btab183-B7","doi-asserted-by":"crossref","first-page":"568","DOI":"10.1101\/gr.129684.111","article-title":"VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing","volume":"22","author":"Koboldt","year":"2012","journal-title":"Genome Res"},{"key":"2023061310564955600_btab183-B8","doi-asserted-by":"crossref","first-page":"17087","DOI":"10.1073\/pnas.1108745108","article-title":"Exome sequencing identifies a spectrum of mutation frequencies in advanced and lethal prostate cancers","volume":"108","author":"Kumar","year":"2011","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023061310564955600_btab183-B9","doi-asserted-by":"crossref","first-page":"1307","DOI":"10.1093\/bioinformatics\/bts146","article-title":"CONTRA: copy number analysis for targeted resequencing","volume":"28","author":"Li","year":"2012","journal-title":"Bioinformatics"},{"key":"2023061310564955600_btab183-B10","doi-asserted-by":"crossref","first-page":"R120","DOI":"10.1186\/gb-2013-14-10-r120","article-title":"EXCAVATOR: detecting copy number variants from whole-exome sequencing data","volume":"14","author":"Magi","year":"2013","journal-title":"Genome Biol"},{"key":"2023061310564955600_btab183-B11","doi-asserted-by":"crossref","first-page":"R41","DOI":"10.1186\/gb-2011-12-4-r41","article-title":"GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers","volume":"12","author":"Mermel","year":"2011","journal-title":"Genome Biol"},{"key":"2023061310564955600_btab183-B12","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1038\/ng.835","article-title":"Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations","volume":"43","author":"O'Roak","year":"2011","journal-title":"Nat. Genet"},{"key":"2023061310564955600_btab183-B13","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1093\/biostatistics\/kxh008","article-title":"Circular binary segmentation for the analysis of array-based DNA copy number data","volume":"5","author":"Olshen","year":"2004","journal-title":"Biostatistics"},{"key":"2023061310564955600_btab183-B14","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1093\/bioinformatics\/btv547","article-title":"CLAMMS: a scalable algorithm for calling common and rare copy number variants from exome sequencing data","volume":"32","author":"Packer","year":"2016","journal-title":"Bioinformatics"},{"key":"2023061310564955600_btab183-B15","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1038\/nature10945","article-title":"De novo mutations revealed by whole-exome sequencing are strongly associated with autism","volume":"485","author":"Sanders","year":"2012","journal-title":"Nature"},{"key":"2023061310564955600_btab183-B16","doi-asserted-by":"crossref","first-page":"2648","DOI":"10.1093\/bioinformatics\/btr462","article-title":"Exome sequencing-based copy-number variation and loss of heterozygosity detection: exomeCNV","volume":"27","author":"Sathirapongsasuti","year":"2011","journal-title":"Bioinformatics"},{"key":"2023061310564955600_btab183-B17","doi-asserted-by":"crossref","first-page":"1379","DOI":"10.1007\/s00439-019-02083-9","article-title":"Mutational signatures and mutagenic impacts associated with betel quid chewing in oral squamous cell carcinoma","volume":"138","author":"Su","year":"2019","journal-title":"Hum. Genet"},{"key":"2023061310564955600_btab183-B18","doi-asserted-by":"crossref","first-page":"1088","DOI":"10.7150\/thno.18551","article-title":"Exome sequencing of oral squamous cell carcinoma reveals molecular subgroups and novel therapeutic opportunities","volume":"7","author":"Su","year":"2017","journal-title":"Theranostics"},{"key":"2023061310564955600_btab183-B19","doi-asserted-by":"crossref","first-page":"16910","DOI":"10.1073\/pnas.1009843107","article-title":"Allele-specific copy number analysis of tumors","volume":"107","author":"Van Loo","year":"2010","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023061310564955600_btab183-B20","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1186\/s12859-018-2128-z","article-title":"nQuire: a statistical framework for ploidy estimation using next generation sequencing","volume":"19","author":"Wei\u00df","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023061310564955600_btab183-B21","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1038\/ng.788","article-title":"Exome sequencing identifies somatic mutations of DNA methyltransferase gene DNMT3A in acute monocytic leukemia","volume":"43","author":"Yan","year":"2011","journal-title":"Nat. Genet"},{"key":"2023061310564955600_btab183-B22","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1186\/s12859-016-1174-7","article-title":"CloneCNA: detecting subclonal somatic copy number alterations in heterogeneous tumor samples from whole-exome sequencing data","volume":"17","author":"Yu","year":"2016","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab183\/36651072\/btab183.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/18\/3026\/50579473\/btab183.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/18\/3026\/50579473\/btab183.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,13]],"date-time":"2023-06-13T10:58:58Z","timestamp":1686653938000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/18\/3026\/6170657"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,3,14]]},"references-count":22,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2021,9,29]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab183","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2021,9,15]]},"published":{"date-parts":[[2021,3,14]]}}}