{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,20]],"date-time":"2026-05-20T03:40:16Z","timestamp":1779248416632,"version":"3.51.4"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2021,6,10]],"date-time":"2021-06-10T00:00:00Z","timestamp":1623283200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R21 HG010925"],"award-info":[{"award-number":["R21 HG010925"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100008899","name":"University of South Carolina","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100008899","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,11,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Copy number variation has been identified as a major source of genomic variation associated with disease susceptibility. With the advent of whole-exome sequencing (WES) technology, massive WES data have been generated, allowing for the identification of copy number variants (CNVs) in the protein-coding regions with direct functional interpretation. We have previously shown evidence of the genomic correlation structure in array data and developed a novel chromosomal breakpoint detection algorithm, LDcnv, which showed significantly improved detection power through integrating the correlation structure in a systematic modeling manner. However, it remains unexplored whether the genomic correlation exists in WES data and how such correlation structure integration can improve the CNV detection accuracy. In this study, we first explored the correlation structure of the WES data using the 1000 Genomes Project data. Both real raw read depth and median-normalized data showed strong evidence of the correlation structure. Motivated by this fact, we proposed a correlation-based method, CORRseq, as a novel release of the LDcnv algorithm in profiling WES data. The performance of CORRseq was evaluated in extensive simulation studies and real data analysis from the 1000 Genomes Project. CORRseq outperformed the existing methods in detecting medium and large CNVs. In conclusion, it would be more advantageous to model genomic correlation structure in detecting relatively long CNVs. This study provides great insights for methodology development of CNV detection with NGS data.<\/jats:p>","DOI":"10.1093\/bib\/bbab215","type":"journal-article","created":{"date-parts":[[2021,6,5]],"date-time":"2021-06-05T03:15:51Z","timestamp":1622862951000},"source":"Crossref","is-referenced-by-count":3,"title":["Shall genomic correlation structure be considered in copy number variants detection?"],"prefix":"10.1093","volume":"22","author":[{"given":"Fei","family":"Qin","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xizhi","family":"Luo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guoshuai","family":"Cai","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Feifei","family":"Xiao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2021,6,10]]},"reference":[{"key":"2022011914551642900_ref1","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1136\/jmedgenet-2014-102588","article-title":"The clinical significance of small copy number variants in neurodevelopmental disorders","volume":"51","author":"Asadollahi","year":"2014","journal-title":"J Med Genet"},{"key":"2022011914551642900_ref2","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1017\/thg.2014.6","article-title":"Copy number variation distribution in six monozygotic twin pairs discordant for schizophrenia","volume":"17","author":"Castellani","year":"2014","journal-title":"Twin Res Hum Genet Off J Int Soc Twin Stud"},{"key":"2022011914551642900_ref3","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1038\/s41588-018-0288-4","article-title":"Neurodevelopmental disease genes implicated by de novo mutation and copy number variation morbidity","volume":"51","author":"Coe","year":"2019","journal-title":"Nat Genet"},{"key":"2022011914551642900_ref4","doi-asserted-by":"crossref","first-page":"1481","DOI":"10.1007\/s00439-012-1183-1","article-title":"Identification of germline genomic copy number variation in familial pancreatic cancer","volume":"131","author":"Al-Sukhni","year":"2012","journal-title":"Hum Genet"},{"key":"2022011914551642900_ref5","doi-asserted-by":"crossref","first-page":"384","DOI":"10.1016\/j.ajhg.2012.07.003","article-title":"A functional copy-number variation in MAPKAPK2 predicts risk and prognosis of lung cancer","volume":"91","author":"Liu","year":"2012","journal-title":"Am J Hum Genet"},{"key":"2022011914551642900_ref6","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1186\/s12859-017-1705-x","article-title":"An evaluation of copy number variation detection tools for cancer using whole exome sequencing data","volume":"18","author":"Zare","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2022011914551642900_ref7","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1016\/j.ajhg.2012.08.005","article-title":"Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth","volume":"91","author":"Fromer","year":"2012","journal-title":"Am J Hum Genet"},{"key":"2022011914551642900_ref8","doi-asserted-by":"crossref","first-page":"1525","DOI":"10.1101\/gr.138115.112","article-title":"Copy number variation detection and genotyping from exome sequence data","volume":"22","author":"Krumm","year":"2012","journal-title":"Genome Res"},{"key":"2022011914551642900_ref9","doi-asserted-by":"crossref","first-page":"R120","DOI":"10.1186\/gb-2013-14-10-r120","article-title":"EXCAVATOR: detecting copy number variants from whole-exome sequencing data","volume":"14","author":"Magi","year":"2013","journal-title":"Genome Biol"},{"key":"2022011914551642900_ref10","doi-asserted-by":"crossref","first-page":"e39","DOI":"10.1093\/nar\/gku1363","article-title":"CODEX: a normalization and copy number variation detection method for whole exome sequencing","volume":"43","author":"Jiang","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2022011914551642900_ref11","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1093\/biostatistics\/kxh008","article-title":"Circular binary segmentation for the analysis of array-based DNA copy number data","volume":"5","author":"Olshen","year":"2004","journal-title":"Biostatistics"},{"key":"2022011914551642900_ref12","doi-asserted-by":"crossref","first-page":"2648","DOI":"10.1093\/bioinformatics\/btr462","article-title":"Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV","volume":"27","author":"Sathirapongsasuti","year":"2011","journal-title":"Bioinformatics"},{"key":"2022011914551642900_ref13","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1186\/s13059-018-1578-y","article-title":"CODEX2: full-spectrum copy number variation detection by high-throughput DNA sequencing","volume":"19","author":"Jiang","year":"2018","journal-title":"Genome Biol"},{"key":"2022011914551642900_ref14","doi-asserted-by":"crossref","first-page":"1306","DOI":"10.1214\/12-AOAS539","article-title":"The screening and ranking algorithm to detect DNA copy number variations","volume":"6","author":"Niu","year":"2012","journal-title":"Ann Appl Stat"},{"key":"2022011914551642900_ref15","doi-asserted-by":"crossref","first-page":"2384","DOI":"10.1093\/bioinformatics\/btx212","article-title":"modSaRa: a computationally efficient R package for CNV identification","volume":"33","author":"Xiao","year":"2017","journal-title":"Bioinformatics"},{"key":"2022011914551642900_ref16","doi-asserted-by":"crossref","first-page":"2891","DOI":"10.1093\/bioinformatics\/bty1041","article-title":"An accurate and powerful method for copy number variation detection","volume":"35","author":"Xiao","year":"2019","journal-title":"Bioinformatics"},{"key":"2022011914551642900_ref17","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1093\/bioinformatics\/btaa737","article-title":"Integrating genomic correlation structure improves copy number variations detection","volume":"37","author":"Luo","year":"2020","journal-title":"Bioinformatics"},{"key":"2022011914551642900_ref18","doi-asserted-by":"crossref","DOI":"10.1002\/0471250953.bia03as18","article-title":"An introduction to hidden Markov models","author":"Schuster-B\u00f6ckler","year":"2007","journal-title":"Curr Protoc Bioinformatics"},{"key":"2022011914551642900_ref19","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"Auton","year":"2015","journal-title":"Nature"},{"key":"2022011914551642900_ref20","first-page":"1553","article-title":"Multiple change-point detection via a screening and ranking algorithm","volume":"23","author":"Hao","year":"2013","journal-title":"Stat Sin"},{"key":"2022011914551642900_ref21","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1111\/j.1541-0420.2006.00662.x","article-title":"A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data","volume":"63","author":"Zhang","year":"2007","journal-title":"Biometrics"},{"key":"2022011914551642900_ref22","first-page":"e154","article-title":"Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2","volume":"44","author":"D\u2019Aurizio","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2022011914551642900_ref23","doi-asserted-by":"crossref","first-page":"D986","DOI":"10.1093\/nar\/gkt958","article-title":"The database of genomic variants: a curated collection of structural variation in the human genome","volume":"42","author":"MacDonald","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2022011914551642900_ref24","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1093\/biostatistics\/kxp051","article-title":"A shifting level model algorithm that identifies aberrations in array-CGH data","volume":"11","author":"Magi","year":"2010","journal-title":"Biostatistics"},{"key":"2022011914551642900_ref25","doi-asserted-by":"crossref","first-page":"e65","DOI":"10.1093\/nar\/gkr068","article-title":"Detecting common copy number variants in high-throughput sequencing data by using JointSLM algorithm","volume":"39","author":"Magi","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2022011914551642900_ref26","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1093\/biostatistics\/kxq008","article-title":"A very fast and accurate method for calling aberrations in array-CGH data","volume":"11","author":"Benelli","year":"2010","journal-title":"Biostatistics"},{"key":"2022011914551642900_ref27","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1038\/nature09298","article-title":"The international HapMap 3 consortium. Integrating common and rare genetic variation in diverse human populations","volume":"467","author":"Altshuler","year":"2010","journal-title":"Nature"},{"key":"2022011914551642900_ref28","doi-asserted-by":"crossref","first-page":"704","DOI":"10.1038\/nature08516","article-title":"Origins and functional impact of copy number variation in the human genome","volume":"464","author":"Conrad","year":"2010","journal-title":"Nature"},{"key":"2022011914551642900_ref29","doi-asserted-by":"crossref","first-page":"1166","DOI":"10.1038\/ng.238","article-title":"Integrated detection and population-genetic analysis of SNPs and copy number variation","volume":"40","author":"McCarroll","year":"2008","journal-title":"Nat Genet"},{"key":"2022011914551642900_ref30","doi-asserted-by":"crossref","first-page":"1406","DOI":"10.1198\/jasa.2009.tm08332","article-title":"A factor model approach to multiple testing under dependence","volume":"104","author":"Friguet","year":"2009","journal-title":"J Am Stat Assoc"},{"key":"2022011914551642900_ref31","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1146\/annurev-med-051010-162644","article-title":"Human genome sequencing in health and disease","volume":"63","author":"Gonzaga-Jauregui","year":"2012","journal-title":"Annu Rev Med"},{"key":"2022011914551642900_ref32","doi-asserted-by":"crossref","first-page":"464","DOI":"10.1093\/bioinformatics\/btp708","article-title":"CMDS: a population-based method for identifying recurrent DNA copy number aberrations in cancer from high-resolution data","volume":"26","author":"Zhang","year":"2010","journal-title":"Bioinformatics"},{"key":"2022011914551642900_ref33","doi-asserted-by":"crossref","DOI":"10.1038\/s41598-020-64353-1","article-title":"CONY: a Bayesian procedure for detecting copy number variations from sequencing read depths","volume":"10","author":"Wei","year":"2020","journal-title":"Sci Rep"},{"key":"2022011914551642900_ref34","doi-asserted-by":"crossref","first-page":"S2","DOI":"10.1186\/1471-2105-14-S2-S2","article-title":"CoNVEX: copy number variation estimation in exome sequencing data using HMM","volume":"14","author":"Amarasinghe","year":"2013","journal-title":"BMC Bioinformatics"},{"key":"2022011914551642900_ref35","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1186\/s12859-017-1802-x","article-title":"PennCNV in whole-genome sequencing data","volume":"18","author":"Ara\u00fajo Lima","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2022011914551642900_ref36","doi-asserted-by":"crossref","first-page":"2102","DOI":"10.1214\/16-AOAS966","article-title":"The screening and ranking algorithm for change-points detection in multiple samples","volume":"10","author":"Song","year":"2016","journal-title":"Ann Appl Stat"},{"key":"2022011914551642900_ref37","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1016\/j.cels.2020.03.005","article-title":"SCOPE: a normalization and copy-number estimation method for single-cell DNA sequencing","volume":"10","author":"Wang","year":"2020","journal-title":"Cell Syst"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/22\/6\/bbab215\/42242162\/bbab215.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/22\/6\/bbab215\/42242162\/bbab215.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,30]],"date-time":"2022-12-30T00:16:00Z","timestamp":1672359360000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab215\/6295811"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,10]]},"references-count":37,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2021,11,5]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab215","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,11]]},"published":{"date-parts":[[2021,6,10]]},"article-number":"bbab215"}}