{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T07:19:36Z","timestamp":1776151176964,"version":"3.50.1"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Illumina Infinium whole genome genotyping (WGG) arrays are increasingly being applied in cancer genomics to study gene copy number alterations and allele-specific aberrations such as loss-of-heterozygosity (LOH). Methods developed for normalization of WGG arrays have mostly focused on diploid, normal samples. However, for cancer samples genomic aberrations may confound normalization and data interpretation. Therefore, we examined the effects of the conventionally used normalization method for Illumina Infinium arrays when applied to cancer samples.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We demonstrate an asymmetry in the detection of the two alleles for each SNP, which deleteriously influences both allelic proportions and copy number estimates. The asymmetry is caused by a remaining bias between the two dyes used in the Infinium II assay after using the normalization method in Illumina's proprietary software (BeadStudio). We propose a quantile normalization strategy for correction of this dye bias. We tested the normalization strategy using 535 individual hybridizations from 10 data sets from the analysis of cancer genomes and normal blood samples generated on Illumina Infinium II 300 k version 1 and 2, 370 k and 550 k BeadChips. We show that the proposed normalization strategy successfully removes asymmetry in estimates of both allelic proportions and copy numbers. Additionally, the normalization strategy reduces the technical variation for copy number estimates while retaining the response to copy number alterations.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>The proposed normalization strategy represents a valuable tool that improves the quality of data obtained from Illumina Infinium arrays, in particular when used for LOH and copy number variation studies.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-9-409","type":"journal-article","created":{"date-parts":[[2008,10,24]],"date-time":"2008-10-24T18:13:37Z","timestamp":1224872017000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":116,"title":["Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios"],"prefix":"10.1186","volume":"9","author":[{"given":"Johan","family":"Staaf","sequence":"first","affiliation":[]},{"given":"Johan","family":"Vallon-Christersson","sequence":"additional","affiliation":[]},{"given":"David","family":"Lindgren","sequence":"additional","affiliation":[]},{"given":"Gunnar","family":"Juliusson","sequence":"additional","affiliation":[]},{"given":"Richard","family":"Rosenquist","sequence":"additional","affiliation":[]},{"given":"Mattias","family":"H\u00f6glund","sequence":"additional","affiliation":[]},{"given":"\u00c5ke","family":"Borg","sequence":"additional","affiliation":[]},{"given":"Markus","family":"Ringn\u00e9r","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,10,2]]},"reference":[{"key":"2394_CR1","doi-asserted-by":"publisher","first-page":"331","DOI":"10.1146\/annurev.genom.6.080604.162140","volume":"6","author":"D Pinkel","year":"2005","unstructured":"Pinkel D, Albertson DG: Comparative genomic hybridization. Annu Rev Genomics Hum Genet 2005, 6: 331\u2013354. 10.1146\/annurev.genom.6.080604.162140","journal-title":"Annu Rev Genomics Hum Genet"},{"key":"2394_CR2","doi-asserted-by":"publisher","first-page":"338","DOI":"10.1038\/nature03099","volume":"432","author":"H Rajagopalan","year":"2004","unstructured":"Rajagopalan H, Lengauer C: Aneuploidy and cancer. Nature 2004, 432: 338\u2013341. 10.1038\/nature03099","journal-title":"Nature"},{"key":"2394_CR3","doi-asserted-by":"publisher","first-page":"109","DOI":"10.1038\/nmeth718","volume":"1","author":"H Matsuzaki","year":"2004","unstructured":"Matsuzaki H, Dong S, Loi H, Di X, Liu G, Hubbell E, Law J, Berntsen T, Chadha M, Hui H, Yang G, Kennedy GC, Webster TA, Cawley S, Walsh PS, Jones KW, Fodor SP, Mei R: Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays. Nat Methods 2004, 1: 109\u2013111. 10.1038\/nmeth718","journal-title":"Nat Methods"},{"key":"2394_CR4","doi-asserted-by":"publisher","first-page":"549","DOI":"10.1038\/ng1547","volume":"37","author":"KL Gunderson","year":"2005","unstructured":"Gunderson KL, Steemers FJ, Lee G, Mendoza LG, Chee MS: A genome-wide scalable SNP genotyping assay using microarray technology. Nat Genet 2005, 37: 549\u2013554. 10.1038\/ng1547","journal-title":"Nat Genet"},{"key":"2394_CR5","doi-asserted-by":"publisher","first-page":"1136","DOI":"10.1101\/gr.5402306","volume":"16","author":"DA Peiffer","year":"2006","unstructured":"Peiffer DA, Le JM, Steemers FJ, Chang W, Jenniges T, Garcia F, Haden K, Li J, Shaw CA, Belmont J, Cheung SW, Shen RM, Barker DL, Gunderson KL: High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping. Genome Res 2006, 16: 1136\u20131148. 10.1101\/gr.5402306","journal-title":"Genome Res"},{"key":"2394_CR6","unstructured":"Affymetrix[http:\/\/www.affymetrix.com]"},{"key":"2394_CR7","unstructured":"Illumina[http:\/\/www.illumina.com]"},{"key":"2394_CR8","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1038\/nmeth842","volume":"3","author":"FJ Steemers","year":"2006","unstructured":"Steemers FJ, Chang W, Lee G, Barker DL, Shen R, Gunderson KL: Whole-genome genotyping with the single-base extension assay. Nat Methods 2006, 3: 31\u201333. 10.1038\/nmeth842","journal-title":"Nat Methods"},{"key":"2394_CR9","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","volume":"19","author":"BM Bolstad","year":"2003","unstructured":"Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 2003, 19: 185\u2013193. 10.1093\/bioinformatics\/19.2.185","journal-title":"Bioinformatics"},{"key":"2394_CR10","doi-asserted-by":"publisher","first-page":"5914","DOI":"10.1093\/nar\/gki890","volume":"33","author":"M Barnes","year":"2005","unstructured":"Barnes M, Freudenberg J, Thompson S, Aronow B, Pavlidis P: Experimental comparison and cross-validation of the Affymetrix and Illumina gene expression analysis platforms. Nucleic Acids Res 2005, 33: 5914\u20135923. 10.1093\/nar\/gki890","journal-title":"Nucleic Acids Res"},{"key":"2394_CR11","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1093\/biostatistics\/kxl042","volume":"8","author":"B Carvalho","year":"2007","unstructured":"Carvalho B, Bengtsson H, Speed TP, Irizarry RA: Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data. Biostatistics 2007, 8: 485\u2013499. 10.1093\/biostatistics\/kxl042","journal-title":"Biostatistics"},{"key":"2394_CR12","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1186\/1471-2105-9-85","volume":"9","author":"MJ Dunning","year":"2008","unstructured":"Dunning MJ, Barbosa-Morais NL, Lynch AG, Tavare S, Ritchie ME: Statistical issues in the analysis of Illumina data. BMC Bioinformatics 2008, 9: 85. 10.1186\/1471-2105-9-85","journal-title":"BMC Bioinformatics"},{"key":"2394_CR13","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1101\/gr.5686107","volume":"17","author":"J Oosting","year":"2007","unstructured":"Oosting J, Lips EH, van Eijk R, Eilers PH, Szuhai K, Wijmenga C, Morreau H, van Wezel T: High-resolution copy number analysis of paraffin-embedded archival tissue using SNP BeadArrays. Genome Res 2007, 17: 368\u2013376. 10.1101\/gr.5686107","journal-title":"Genome Res"},{"key":"2394_CR14","doi-asserted-by":"publisher","first-page":"e15","DOI":"10.1093\/nar\/30.4.e15","volume":"30","author":"YH Yang","year":"2002","unstructured":"Yang YH, Dudoit S, Luu P, Lin DM, Peng V, Ngai J, Speed TP: Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res 2002, 30: e15. 10.1093\/nar\/30.4.e15","journal-title":"Nucleic Acids Res"},{"issue":"Suppl","key":"2394_CR15","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1038\/ng1032","volume":"32","author":"J Quackenbush","year":"2002","unstructured":"Quackenbush J: Microarray data normalization and transformation. Nat Genet 2002, (32 Suppl):496\u2013501. 10.1038\/ng1032","journal-title":"Nat Genet"},{"key":"2394_CR16","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1016\/S1046-2023(03)00155-5","volume":"31","author":"GK Smyth","year":"2003","unstructured":"Smyth GK, Speed T: Normalization of cDNA microarray data. Methods 2003, 31: 265\u2013273. 10.1016\/S1046-2023(03)00155-5","journal-title":"Methods"},{"key":"2394_CR17","doi-asserted-by":"publisher","first-page":"274","DOI":"10.1186\/1471-2105-6-274","volume":"6","author":"M Khojasteh","year":"2005","unstructured":"Khojasteh M, Lam WL, Ward RK, MacAulay C: A stepwise framework for the normalization of array CGH data. BMC Bioinformatics 2005, 6: 274. 10.1186\/1471-2105-6-274","journal-title":"BMC Bioinformatics"},{"key":"2394_CR18","doi-asserted-by":"publisher","first-page":"382","DOI":"10.1186\/1471-2164-8-382","volume":"8","author":"J Staaf","year":"2007","unstructured":"Staaf J, Jonsson G, Ringner M, Vallon-Christersson J: Normalization of array-CGH data: influence of copy number imbalances. BMC Genomics 2007, 8: 382. 10.1186\/1471-2164-8-382","journal-title":"BMC Genomics"},{"key":"2394_CR19","doi-asserted-by":"publisher","first-page":"264","DOI":"10.1186\/1471-2105-7-264","volume":"7","author":"P Neuvial","year":"2006","unstructured":"Neuvial P, Hupe P, Brito I, Liva S, Manie E, Brennetot C, Radvanyi F, Aurias A, Barillot E: Spatial normalization of array-CGH data. BMC Bioinformatics 2006, 7: 264. 10.1186\/1471-2105-7-264","journal-title":"BMC Bioinformatics"},{"key":"2394_CR20","doi-asserted-by":"publisher","first-page":"903","DOI":"10.1016\/j.ajhg.2008.01.012","volume":"82","author":"G Assie","year":"2008","unstructured":"Assie G, LaFramboise T, Platzer P, Bertherat J, Stratakis CA, Eng C: SNP arrays in heterogeneous tissue: highly accurate collection of both germline and somatic genetic information from unpaired single tumor samples. Am J Hum Genet 2008, 82: 903\u2013915. 10.1016\/j.ajhg.2008.01.012","journal-title":"Am J Hum Genet"},{"key":"2394_CR21","doi-asserted-by":"publisher","first-page":"1233","DOI":"10.1093\/bioinformatics\/bth069","volume":"20","author":"M Lin","year":"2004","unstructured":"Lin M, Wei LJ, Sellers WR, Lieberfarb M, Wong WH, Li C: dChipSNP: significance curve and clustering of SNP-array-based loss-of-heterozygosity data. Bioinformatics 2004, 20: 1233\u20131240. 10.1093\/bioinformatics\/bth069","journal-title":"Bioinformatics"},{"key":"2394_CR22","doi-asserted-by":"publisher","first-page":"R136","DOI":"10.1186\/gb-2008-9-9-r136","volume":"9","author":"J Staaf","year":"2008","unstructured":"Staaf J, Lindgren D, Vallon-Christersson J, Isaksson A, Goransson H, Juliusson G, Rosenquist R, Hoglund M, Borg A, Ringner M: Segmentation-based detection of allelic imbalance and loss-of-heterozygosity in cancer cells using whole genome SNP arrays. Genome Biol 2008, 9: R136. 10.1186\/gb-2008-9-9-r136","journal-title":"Genome Biol"},{"key":"2394_CR23","doi-asserted-by":"publisher","first-page":"697","DOI":"10.1002\/gcc.20575","volume":"47","author":"R Gunnarsson","year":"2008","unstructured":"Gunnarsson R, Staaf J, Jansson M, Ottesen AM, Goransson H, Liljedahl U, Ralfkiaer U, Mansouri M, Buhl AM, Smedby KE, Hjalgrim H, Syvanen AC, Borg A, Isaksson A, Jurlander J, Juliusson G, Rosenquist R: Screening for copy-number alterations and loss of heterozygosity in chronic lymphocytic leukemia-A comparative study of four differently designed, high resolution microarray platforms. Genes Chromosomes Cancer 2008, 47: 697\u2013711. 10.1002\/gcc.20575","journal-title":"Genes Chromosomes Cancer"},{"key":"2394_CR24","doi-asserted-by":"publisher","first-page":"1665","DOI":"10.1101\/gr.6861907","volume":"17","author":"K Wang","year":"2007","unstructured":"Wang K, Li M, Hadley D, Liu R, Glessner J, Grant SF, Hakonarson H, Bucan M: PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res 2007, 17: 1665\u20131674. 10.1101\/gr.6861907","journal-title":"Genome Res"},{"key":"2394_CR25","doi-asserted-by":"publisher","first-page":"10173","DOI":"10.1158\/0008-5472.CAN-07-2102","volume":"67","author":"J Greshock","year":"2007","unstructured":"Greshock J, Feng B, Nogueira C, Ivanova E, Perna I, Nathanson K, Protopopov A, Weber BL, Chin L: A comparison of DNA copy number profiling platforms. Cancer Res 2007, 67: 10173\u201310180. 10.1158\/0008-5472.CAN-07-2102","journal-title":"Cancer Res"},{"key":"2394_CR26","unstructured":"HapMap[http:\/\/www.hapmap.org]"},{"key":"2394_CR27","unstructured":"PennCNV[http:\/\/www.neurogenome.org\/cnv\/penncnv\/]"},{"key":"2394_CR28","unstructured":"SCIBLU Genomics, Lund University, Sweden[http:\/\/www.lth.se\/sciblu]"},{"key":"2394_CR29","unstructured":"SNP Technology Platform in Uppsala, Sweden[http:\/\/www.genotyping.se]"},{"key":"2394_CR30","unstructured":"The R project for statistical computing[http:\/\/www.r-project.org]"},{"key":"2394_CR31","unstructured":"BioConductor[http:\/\/www.bioconductor.org]"},{"key":"2394_CR32","doi-asserted-by":"publisher","first-page":"657","DOI":"10.1093\/bioinformatics\/btl646","volume":"23","author":"ES Venkatraman","year":"2007","unstructured":"Venkatraman ES, Olshen AB: A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics 2007, 23: 657\u2013663. 10.1093\/bioinformatics\/btl646","journal-title":"Bioinformatics"},{"key":"2394_CR33","unstructured":"Gene Expression Omnibus[http:\/\/www.ncbi.nlm.nih.gov\/geo\/]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-409.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T03:25:44Z","timestamp":1630466744000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-409"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,10,2]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2394"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-409","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,10,2]]},"assertion":[{"value":"3 June 2008","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 October 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 October 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"409"}}