{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T22:57:47Z","timestamp":1774220267875,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2022,9,3]],"date-time":"2022-09-03T00:00:00Z","timestamp":1662163200000},"content-version":"vor","delay-in-days":2,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"Shaanxi\u2019s Natural Science Basic Research Program","award":["2020JC-01"],"award-info":[{"award-number":["2020JC-01"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,9,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Copy number variation (CNV) is a class of key biomarkers in many complex traits and diseases. Detecting CNV from sequencing data is a substantial bioinformatics problem and a standard requirement in clinical practice. Although many proposed CNV detection approaches exist, the core statistical model at their foundation is weakened by two critical computational issues: (i) identifying the optimal setting on the sliding window and (ii) correcting for bias and noise. We designed a statistical process model to overcome these limitations by calculating regional read depths via an exponentially weighted moving average strategy. A one-run detection of CNVs of various lengths is then achieved by a dynamic sliding window, whose size is self-adapted according to the weighted averages. We also designed a novel bias\/noise reduction model, accompanied by the moving average, which can handle complicated patterns and extend training data. This model, called PEcnv, accurately detects CNVs ranging from kb-scale to chromosome-arm level. The model performance was validated with simulation samples and real samples. 
Comparative analysis showed that PEcnv outperforms current popular approaches. Notably, PEcnv provided considerable advantages in detecting small CNVs (1\u00a0kb\u20131\u00a0Mb) in panel sequencing data. Thus, PEcnv fills the gap left by existing methods focusing on large CNVs. PEcnv may have broad applications in clinical testing where panel sequencing is the dominant strategy. Availability and implementation: Source code is freely available at https:\/\/github.com\/Sherwin-xjtu\/PEcnv<\/jats:p>","DOI":"10.1093\/bib\/bbac375","type":"journal-article","created":{"date-parts":[[2022,9,3]],"date-time":"2022-09-03T09:38:59Z","timestamp":1662197939000},"source":"Crossref","is-referenced-by-count":17,"title":["PEcnv: accurate and efficient detection of copy number variations of various lengths"],"prefix":"10.1093","volume":"23","author":[{"given":"Xuwen","family":"Wang","sequence":"first","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ying","family":"Xu","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruoyu","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an Jiaotong 
University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xin","family":"Lai","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8536-0854","authenticated-orcid":false,"given":"Yuqian","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shenjie","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xuanping","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an 
Jiaotong University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3862-6557","authenticated-orcid":false,"given":"Jiayin","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"},{"name":"Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi\u2019an Jiaotong University , Xi\u2019an 710049, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2022,9,2]]},"reference":[{"issue":"3","key":"2022092013233982300_ref1","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1016\/j.gde.2012.02.012","article-title":"Mechanisms for recurrent and complex human genomic rearrangements","volume":"22","author":"Liu","year":"2012","journal-title":"Curr Opin Genet Dev"},{"issue":"1","key":"2022092013233982300_ref2","first-page":"7.23.21","article-title":"Using XHMM software to detect copy number variation in whole-exome sequencing data","volume":"81","author":"Fromer","year":"2014","journal-title":"Curr Protoc Hum Genet"},{"issue":"8","key":"2022092013233982300_ref3","doi-asserted-by":"crossref","first-page":"949","DOI":"10.1101\/gr.3677206","article-title":"Copy number variation: new insights in genome diversity","volume":"16","author":"Freeman","year":"2006","journal-title":"Genome Res"},{"issue":"4","key":"2022092013233982300_ref4","first-page":"369","article-title":"Chromosome aberrations in solid tumors","volume":"34","author":"Albertson","year":"2003","journal-title":"Recent Results Cancer 
Res"},{"issue":"7118","key":"2022092013233982300_ref5","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/nature05329","article-title":"Global variation in copy number in the human genome","volume":"444","author":"Redon","year":"2006","journal-title":"Nature"},{"issue":"1","key":"2022092013233982300_ref6","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/j.neuron.2006.09.027","article-title":"Genomic rearrangements and gene copy-number alterations as a cause of nervous system disorders","volume":"52","author":"Lee","year":"2006","journal-title":"Neuron"},{"issue":"10","key":"2022092013233982300_ref7","doi-asserted-by":"crossref","first-page":"1307","DOI":"10.1093\/bioinformatics\/bts146","article-title":"CONTRA: copy number analysis for targeted resequencing","volume":"28","author":"Li","year":"2012","journal-title":"Bioinformatics"},{"issue":"2","key":"2022092013233982300_ref8","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1038\/ng.313","article-title":"Deletion of the late cornified envelope LCE3B and LCE3C genes as a susceptibility factor for psoriasis","volume":"41","author":"Cid","year":"2009","journal-title":"Nat Genet"},{"issue":"6","key":"2022092013233982300_ref9","doi-asserted-by":"crossref","first-page":"508","DOI":"10.1038\/ng.582","article-title":"Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci","volume":"42","author":"Stahl","year":"2010","journal-title":"Nat Genet"},{"issue":"6","key":"2022092013233982300_ref10","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1016\/j.ejmg.2009.09.002","article-title":"Challenges for CNV interpretation in clinical molecular karyotyping: lessons learned from a 1001 sample experience","volume":"52","author":"Buysse","year":"2009","journal-title":"Eur J Med Genet"},{"issue":"7","key":"2022092013233982300_ref11","doi-asserted-by":"crossref","first-page":"S16","DOI":"10.1038\/ng2028","article-title":"Methods and strategies for 
analyzing copy number variation using DNA microarrays","volume":"39","author":"Carter","year":"2007","journal-title":"Nat Genet"},{"issue":"10","key":"2022092013233982300_ref12","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1038\/nrg2841","article-title":"Advances in understanding cancer genomes through second-generation sequencing","volume":"11","author":"Meyerson","year":"2010","journal-title":"Nat Rev Genet"},{"issue":"9","key":"2022092013233982300_ref13","doi-asserted-by":"crossref","first-page":"e69","DOI":"10.1093\/nar\/gks003","article-title":"Cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate","volume":"40","author":"Klambauer","year":"2012","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2022092013233982300_ref14","doi-asserted-by":"crossref","first-page":"S3","DOI":"10.1016\/j.nbt.2010.01.291","article-title":"Next generation DNA sequencing techniques and applications","volume":"27","author":"Ansorge","year":"2010","journal-title":"N Biotechnol"},{"key":"2022092013233982300_ref15","doi-asserted-by":"crossref","first-page":"19","DOI":"10.7171\/jbt.15-2601-002","article-title":"An integrated approach for analyzing clinical genomic variant data from next-generation sequencing","volume":"26","author":"Crowgey","year":"2015","journal-title":"J Biomol Tech"},{"issue":"1","key":"2022092013233982300_ref16","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1186\/s12859-017-1705-x","article-title":"An evaluation of copy number variation detection tools for cancer using whole exome sequencing data","volume":"18","author":"Zare","year":"2017","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2022092013233982300_ref17","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1186\/s12859-020-3421-1","article-title":"Comparative study of whole exome sequencing-based copy number variation detection 
tools","volume":"21","author":"Zhao","year":"2020","journal-title":"BMC Bioinformatics"},{"issue":"5","key":"2022092013233982300_ref18","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1002\/humu.22969","article-title":"CoNVaDING: single exon variation detection in targeted NGS data","volume":"37","author":"Johansson","year":"2016","journal-title":"Hum Mutat"},{"issue":"4","key":"2022092013233982300_ref19","doi-asserted-by":"crossref","first-page":"e1004873","DOI":"10.1371\/journal.pcbi.1004873","article-title":"CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing","volume":"12","author":"Talevich","year":"2016","journal-title":"PLoS Comput Biol"},{"issue":"16","key":"2022092013233982300_ref20","doi-asserted-by":"crossref","first-page":"e131","DOI":"10.1093\/nar\/gkw520","article-title":"FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing","volume":"44","author":"Shen","year":"2016","journal-title":"Nucleic Acids Res"},{"issue":"S11","key":"2022092013233982300_ref21","doi-asserted-by":"crossref","first-page":"S1","DOI":"10.1186\/1471-2105-14-S11-S1","article-title":"Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives","volume":"14","author":"Zhao","year":"2013","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2022092013233982300_ref22","doi-asserted-by":"crossref","first-page":"76","DOI":"10.1186\/s12920-020-00731-y","article-title":"MetaCNV-a consensus approach to infer accurate copy numbers from low coverage data","volume":"13","author":"Friedrich","year":"2020","journal-title":"BMC Med Genomics"},{"issue":"16","key":"2022092013233982300_ref23","doi-asserted-by":"crossref","first-page":"e105","DOI":"10.1093\/nar\/gkn425","article-title":"Substantial biases in ultra-short read data sets from high-throughput DNA 
sequencing","volume":"36","author":"Dohm","year":"2008","journal-title":"Nucleic Acids Res"},{"issue":"6","key":"2022092013233982300_ref24","doi-asserted-by":"crossref","first-page":"e39","DOI":"10.1093\/nar\/gku1363","article-title":"CODEX: a normalization and copy number variation detection method for whole exome sequencing","volume":"43","author":"Jiang","year":"2015","journal-title":"Nucleic Acids Res"},{"issue":"16","key":"2022092013233982300_ref25","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2022092013233982300_ref26","doi-asserted-by":"crossref","first-page":"i639","DOI":"10.1093\/bioinformatics\/btu475","article-title":"cnvOffSeq: detecting intergenic copy number variation using off-target exome sequencing data","volume":"30","author":"Bellos","year":"2014","journal-title":"Bioinformatics"},{"issue":"3","key":"2022092013233982300_ref27","first-page":"239","article-title":"Control chart tests based on geometric moving averages","volume":"1","author":"Roberts","year":"1959","journal-title":"Technometrics"},{"issue":"04","key":"2022092013233982300_ref28","doi-asserted-by":"crossref","first-page":"1250065","DOI":"10.1142\/S0219519412500650","article-title":"Zero inflated poisson ewma control chart for monitoring rare health-related events","volume":"12","author":"Fatahi","year":"2012","journal-title":"Journal of Mechanics in Medicine and Biology"},{"key":"2022092013233982300_ref29","volume-title":"2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","author":"Wang","year":"2019"},{"issue":"2","key":"2022092013233982300_ref30","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbac049","article-title":"CoverageMaster: comprehensive CNV detection and visualization from NGS short reads for genetic medicine 
applications","volume":"23","author":"Rapti","year":"2022","journal-title":"Brief Bioinform"},{"issue":"1","key":"2022092013233982300_ref31","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1186\/s12859-015-0502-7","article-title":"SCNVSim: somatic copy number variation and structure variation simulator","volume":"16","author":"Qin","year":"2015","journal-title":"BMC Bioinformatics"},{"issue":"9","key":"2022092013233982300_ref32","doi-asserted-by":"crossref","first-page":"1141","DOI":"10.1038\/s41587-021-00994-5","article-title":"Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing","volume":"39","author":"Xiao","year":"2021","journal-title":"Nat Biotechnol"},{"issue":"7289","key":"2022092013233982300_ref33","doi-asserted-by":"crossref","first-page":"704","DOI":"10.1038\/nature08516","article-title":"Origins and functional impact of copy number variation in the human genome","volume":"464","author":"Conrad","year":"2010","journal-title":"Nature"},{"issue":"5","key":"2022092013233982300_ref34","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1038\/ng.555","article-title":"Discovery of common Asian copy number variants using integrated high-resolution array CGH and massively parallel DNA sequencing","volume":"42","author":"Park","year":"2010","journal-title":"Nat Genet"},{"issue":"21","key":"2022092013233982300_ref35","doi-asserted-by":"crossref","first-page":"e31","DOI":"10.1158\/0008-5472.CAN-17-0337","article-title":"Variant review with the integrative genomics viewer","volume":"77","author":"Robinson","year":"2017","journal-title":"Cancer Res"},{"key":"2022092013233982300_ref36","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1016\/j.mrrev.2019.02.005","article-title":"Free-access copy-number variant detection tools for targeted next-generation sequencing data","volume":"779","author":"Roca","year":"2019","journal-title":"Mutat Res Rev Mutat 
Res"},{"issue":"12","key":"2022092013233982300_ref37","doi-asserted-by":"crossref","first-page":"1606","DOI":"10.1016\/j.annonc.2020.08.2102","article-title":"ESMO recommendations on predictive biomarker testing for homologous recombination deficiency and PARP inhibitor benefit in ovarian cancer","volume":"31","author":"Miller","year":"2020","journal-title":"Ann Oncol"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac375\/45936840\/bbac375.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/5\/bbac375\/45936840\/bbac375.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,20]],"date-time":"2022-09-20T18:12:25Z","timestamp":1663697545000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac375\/6686740"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9]]},"references-count":37,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,9,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac375","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,9]]},"published":{"date-parts":[[2022,9]]},"article-number":"bbac375"}}