{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,18]],"date-time":"2026-04-18T06:47:48Z","timestamp":1776494868358,"version":"3.51.2"},"reference-count":23,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Algorithms and software for CNV detection have been developed, but they detect the CNV regions sample-by-sample with individual-specific breakpoints, while common CNV regions are likely to occur at the same genomic locations across different individuals in a homogenous population. Current algorithms to detect common CNV regions do not account for the varying reliability of the individual CNVs, typically reported as confidence scores by SNP-based CNV detection algorithms. General methodologies for identifying these recurrent regions, especially those directed at SNP arrays, are still needed.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>In this paper, we describe two new approaches for identifying common CNV regions based on (i) the frequency of occurrence of reliable CNVs, where reliability is determined by high confidence scores, and (ii) a weighted frequency of occurrence of CNVs, where the weights are determined by the confidence scores. In addition, motivated by the fact that we often observe partially overlapping CNV regions as a mixture of two or more distinct subregions, regions identified using the two approaches can be fine-tuned to smaller sub-regions using a clustering algorithm. We compared the performance of the methods with sequencing-based results in terms of discordance rates, rates of departure from Hardy-Weinberg equilibrium (HWE) and average frequency and size of the identified regions. The discordance rates as well as the rates of departure from HWE decrease when we select CNVs with higher confidence scores. We also performed comparisons with two previously published methods, STAC and GISTIC, and showed that the methods we consider are better at identifying low-frequency but high-confidence CNV regions.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>The proposed methods for identifying common CNV regions in multiple individuals perform well compared to existing methods. The identified common regions can be used for downstream analyses such as group comparisons in association studies.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-11-147","type":"journal-article","created":{"date-parts":[[2010,3,23]],"date-time":"2010-03-23T07:15:35Z","timestamp":1269328535000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Identification of recurrent regions of copy-number variants across multiple individuals"],"prefix":"10.1186","volume":"11","author":[{"given":"Teo Shu","family":"Mei","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Agus","family":"Salim","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stefano","family":"Calza","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ku Chee","family":"Seng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chia Kee","family":"Seng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yudi","family":"Pawitan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2010,3,22]]},"reference":[{"key":"3604_CR1","doi-asserted-by":"publisher","first-page":"557","DOI":"10.1093\/biostatistics\/kxh008","volume":"5","author":"AB Olshen","year":"2004","unstructured":"Olshen AB, Venkatraman ES, Lucito R, Wigler M: Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 2004, 5: 557\u2013572. 10.1093\/biostatistics\/kxh008","journal-title":"Biostatistics"},{"issue":"6","key":"3604_CR2","doi-asserted-by":"publisher","first-page":"e122","DOI":"10.1371\/journal.pcbi.0030122","volume":"3","author":"OM Rueda","year":"2007","unstructured":"Rueda OM, Diaz-Uriarte R: Flexible and accurate detection of genomic copy-number changes from aCGH. PLoS Computational Biology 2007, 3(6):e122. 10.1371\/journal.pcbi.0030122","journal-title":"PLoS Computational Biology"},{"key":"3604_CR3","doi-asserted-by":"publisher","first-page":"2143","DOI":"10.1093\/bioinformatics\/btn404","volume":"24","author":"C Erdman","year":"2008","unstructured":"Erdman C, Emerson JW: A fast Bayesian change point analysis for the segmentation of microarray data. Bioinformatics 2008, 24: 2143\u20132148. 10.1093\/bioinformatics\/btn404","journal-title":"Bioinformatics"},{"key":"3604_CR4","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1093\/bioinformatics\/btm601","volume":"24","author":"R Pique-Regi","year":"2008","unstructured":"Pique-Regi R, et al.: Sparse representation and Bayesian detection of genome copy number alterations from microarray data. Bioinformatics 2008, 24: 309\u20133182. 10.1093\/bioinformatics\/btm601","journal-title":"Bioinformatics"},{"issue":"10","key":"3604_CR5","doi-asserted-by":"publisher","first-page":"1223","DOI":"10.1093\/bioinformatics\/btp119","volume":"25","author":"R Pique-Regi","year":"2009","unstructured":"Pique-Regi R, et al.: Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA. Bioinformatics 2009, 25(10):1223\u20131230. 10.1093\/bioinformatics\/btp119","journal-title":"Bioinformatics"},{"key":"3604_CR6","doi-asserted-by":"publisher","first-page":"1665","DOI":"10.1101\/gr.6861907","volume":"17","author":"K Wang","year":"2007","unstructured":"Wang K, et al.: PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Research 2007, 17: 1665\u2013167. 10.1101\/gr.6861907","journal-title":"Genome Research"},{"key":"3604_CR7","doi-asserted-by":"publisher","first-page":"2013","DOI":"10.1093\/nar\/gkm076","volume":"35","author":"S Colella","year":"2007","unstructured":"Colella S, et al.: QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data. Nucleic Acids Research 2007, 35: 2013\u20132025. 10.1093\/nar\/gkm076","journal-title":"Nucleic Acids Research"},{"key":"3604_CR8","volume-title":"Collection of Biostatistics Research Archive","author":"OM Rueda","year":"2008","unstructured":"Rueda OM, Diaz-Uriarte R: Finding Recurrent Regions of Copy Number Variation: A Review. Collection of Biostatistics Research Archive 2008. Art42 Art42"},{"key":"3604_CR9","doi-asserted-by":"publisher","first-page":"1149","DOI":"10.1101\/gr.5076506","volume":"16","author":"SJ Diskin","year":"2006","unstructured":"Diskin SJ, et al.: STAC: A method for testing the significance of DNA copy number aberrations across multiple array-CGH experiments. Genome Research 2006, 16: 1149\u20131158. 10.1101\/gr.5076506","journal-title":"Genome Research"},{"key":"3604_CR10","doi-asserted-by":"publisher","first-page":"56","DOI":"10.1038\/nature06862","volume":"453","author":"JM Kidd","year":"2008","unstructured":"Kidd JM, et al.: Mapping and sequencing of structural variation from eight human genomes. Nature 2008, 453: 56\u201364. 10.1038\/nature06862","journal-title":"Nature"},{"issue":"50","key":"3604_CR11","doi-asserted-by":"publisher","first-page":"20007","DOI":"10.1073\/pnas.0710052104","volume":"104","author":"R Beroukhim","year":"2007","unstructured":"Beroukhim R, et al.: Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma. PNAS 2007, 104(50):20007\u201320012. 10.1073\/pnas.0710052104","journal-title":"PNAS"},{"issue":"3","key":"3604_CR12","doi-asserted-by":"publisher","first-page":"484","DOI":"10.1093\/biostatistics\/kxm048","volume":"9","author":"WN Van Wieringen","year":"2008","unstructured":"Van Wieringen WN, Wiel MA, Ylstra B: Weighted clustering of called array CGH data. Biostatistics 2008, 9(3):484\u2013500. 10.1093\/biostatistics\/kxm048","journal-title":"Biostatistics"},{"key":"3604_CR13","doi-asserted-by":"publisher","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","volume":"95","author":"MB Eisen","year":"1998","unstructured":"Eisen MB, et al.: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Acodemy of Sciences of the United States of America 1998, 95: 14863\u201314868. 10.1073\/pnas.95.25.14863","journal-title":"Proceedings of the National Acodemy of Sciences of the United States of America"},{"key":"3604_CR14","doi-asserted-by":"publisher","first-page":"1499","DOI":"10.1038\/sj.onc.1209919","volume":"26","author":"K Jong","year":"2007","unstructured":"Jong K, et al.: Cross-platform array comparative genomic hybridization meta-analysis separates hematopoietic and mesenchymal from epithelial tumors. Oncogene 2007, 26: 1499\u20131506. 10.1038\/sj.onc.1209919","journal-title":"Oncogene"},{"key":"3604_CR15","volume-title":"Cluster Analysis","author":"BS Everitt","year":"2001","unstructured":"Everitt BS, et al.: Cluster Analysis. 4th edition. Arnold; 2001.","edition":"4"},{"key":"3604_CR16","doi-asserted-by":"publisher","first-page":"1166","DOI":"10.1038\/ng.238","volume":"40","author":"SA McCarroll","year":"2008","unstructured":"McCarroll SA, et al.: Integrated detection and population-genetic analysis of SNPs and copy number variation. Nature Genetics 2008, 40: 1166\u20131174. 10.1038\/ng.238","journal-title":"Nature Genetics"},{"key":"3604_CR17","volume-title":"Nature","author":"DF Conrad","year":"2009","unstructured":"Conrad DF, et al.: Origins and functional impact of copy number variation in the human genome. Nature 2009."},{"issue":"18","key":"3604_CR18","doi-asserted-by":"publisher","first-page":"3413","DOI":"10.1093\/bioinformatics\/bth418","volume":"20","author":"P Hupe","year":"2004","unstructured":"Hupe P, et al.: Analysis of array CGH data: from signal ratio to gain and loss of DNA regions. Bioinformatics 2004, 20(18):3413\u20133422. 10.1093\/bioinformatics\/bth418","journal-title":"Bioinformatics"},{"issue":"8","key":"3604_CR19","doi-asserted-by":"publisher","first-page":"e143","DOI":"10.1371\/journal.pgen.0030143","volume":"3","author":"M Guttman","year":"2007","unstructured":"Guttman M, et al.: Assessing the significance of conserved genomic abberations using high resolution genomic microarrays. PLoS Genetics 2007, 3(8):e143. 10.1371\/journal.pgen.0030143","journal-title":"PLoS Genetics"},{"key":"3604_CR20","doi-asserted-by":"publisher","first-page":"275","DOI":"10.1086\/505653","volume":"79","author":"DP Locke","year":"2006","unstructured":"Locke DP, et al.: Linkage disequilibrium and heritability of copy-number polymorphisms within duplicated regions of the human genome. American Journal of Human Genetics 2006, 79: 275\u2013290. 10.1086\/505653","journal-title":"American Journal of Human Genetics"},{"key":"3604_CR21","doi-asserted-by":"publisher","first-page":"444","DOI":"10.1038\/nature05329","volume":"444","author":"R Redon","year":"2006","unstructured":"Redon R, et al.: Global variation in copy number in the human genome. Nature 2006, 444: 444\u2013454. 10.1038\/nature05329","journal-title":"Nature"},{"key":"3604_CR22","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1038\/79226","volume":"26","author":"KB Lane","year":"2000","unstructured":"Lane KB, Consortium TIP, et al.: Heterozygous germline mutations in BMPR2, encoding a TGF-beta receptor, cause familial primary pulmonary hypertension. Nature Genetics 2000, 26: 81\u201384. 10.1038\/79226","journal-title":"Nature Genetics"},{"key":"3604_CR23","doi-asserted-by":"publisher","first-page":"1639","DOI":"10.1093\/hmg\/6.10.1639","volume":"6","author":"M Hutton","year":"1997","unstructured":"Hutton M, Hardy J: The presenilins and Alzheimer's disease. Human Molecular Genetics 1997, 6: 1639\u20131646. 10.1093\/hmg\/6.10.1639","journal-title":"Human Molecular Genetics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-147.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T05:25:59Z","timestamp":1630473959000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-147"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,3,22]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["3604"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-147","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,3,22]]},"assertion":[{"value":"15 October 2009","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 March 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 March 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"147"}}