{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T17:23:58Z","timestamp":1780507438699,"version":"3.54.1"},"reference-count":19,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>There are currently many different methods for processing and summarizing probe-level data from Affymetrix oligonucleotide arrays. It is of great interest to validate these methods and identify those that are most effective. There is no single best way to do this validation, and a variety of approaches is needed. Moreover, gene expression data are collected to answer a variety of scientific questions, and the same method may not be best for all questions. Only a handful of validation studies have been done so far, most of which rely on spike-in datasets and focus on the question of detecting differential expression. Here we seek methods that excel at estimating relative expression. We evaluate methods by identifying those that give the strongest linear association between expression measurements by array and the \"gold-standard\" assay.<\/jats:p><jats:p>Quantitative reverse-transcription polymerase chain reaction (qRT-PCR) is generally considered the \"gold-standard\" assay for measuring gene expression by biologists and is often used to confirm findings from microarray data. Here we use qRT-PCR measurements to validate methods for the components of processing oligo array data: background adjustment, normalization, mismatch adjustment, and probeset summary. An advantage of our approach over spike-in studies is that methods are validated on a real dataset that was collected to address a scientific question.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We initially identify three of six popular methods that consistently produced the best agreement between oligo array and RT-PCR data for medium- and high-intensity genes. The three methods are generally known as MAS5, gcRMA, and the dChip mismatch mode. For medium- and high-intensity genes, we identified use of data from mismatch probes (as in MAS5 and dChip mismatch) and a sequence-based method of background adjustment (as in gcRMA) as the most important factors in methods' performances. However, we found poor reliability for methods using mismatch probes for low-intensity genes, which is in agreement with previous studies.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>We advocate use of sequence-based background adjustment in lieu of mismatch adjustment to achieve the best results across the intensity spectrum. No method of normalization or probeset summary showed any consistent advantages.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-7-23","type":"journal-article","created":{"date-parts":[[2006,1,23]],"date-time":"2006-01-23T13:04:26Z","timestamp":1138021466000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":71,"title":["Evaluation of methods for oligonucleotide array data via quantitative real-time PCR"],"prefix":"10.1186","volume":"7","author":[{"given":"Li-Xuan","family":"Qin","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Richard P","family":"Beyer","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Francesca N","family":"Hudson","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nancy J","family":"Linford","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Daryl E","family":"Morris","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kathleen F","family":"Kerr","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2006,1,17]]},"reference":[{"issue":"4","key":"762_CR1","doi-asserted-by":"publisher","first-page":"701","DOI":"10.1111\/j.0006-341X.2002.00701.x","volume":"58","author":"DV Nguyen","year":"2002","unstructured":"Nguyen DV, Arpat AB, Wang N, Carroll RJ: DNA microarray experiments: biological and technological aspects. Biometrics 2002, 58(4):701\u2013717. 10.1111\/j.0006-341X.2002.00701.x","journal-title":"Biometrics"},{"issue":"4","key":"762_CR2","doi-asserted-by":"publisher","first-page":"e15","DOI":"10.1093\/nar\/gng015","volume":"31","author":"RA Irizarry","year":"2003","unstructured":"Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 2003, 31(4):e15. 10.1093\/nar\/gng015","journal-title":"Nucleic Acids Res"},{"issue":"9","key":"762_CR3","doi-asserted-by":"publisher","first-page":"943","DOI":"10.1038\/ng1422","volume":"36","author":"T Mehta","year":"2004","unstructured":"Mehta T, Tanik M, Allison DB: Towards sound epistemological foundations of statistical methods for high-dimensional biology. Nat Genet 2004, 36(9):943\u2013947. 10.1038\/ng1422","journal-title":"Nat Genet"},{"issue":"3","key":"762_CR4","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1093\/bioinformatics\/btg410","volume":"20","author":"LM Cope","year":"2004","unstructured":"Cope LM, Irizarry RA, Jaffee HA, Wu Z, Speed TP: A benchmark for Affymetrix GeneChip expression measures. Bioinformatics 2004, 20(3):323\u2013331. 10.1093\/bioinformatics\/btg410","journal-title":"Bioinformatics"},{"issue":"1","key":"762_CR5","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1186\/1471-2105-6-26","volume":"6","author":"K Shedden","year":"2005","unstructured":"Shedden K, Chen W, Kuick R, Ghosh D, Macdonald J, Cho KR, Giordano TJ, Gruber SB, Fearon ER, Taylor JM, Hanash S: Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data. BMC Bioinformatics 2005, 6(1):26. 10.1186\/1471-2105-6-26","journal-title":"BMC Bioinformatics"},{"issue":"2","key":"762_CR6","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","volume":"19","author":"BM Bolstad","year":"2003","unstructured":"Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 2003, 19(2):185\u2013193. 10.1093\/bioinformatics\/19.2.185","journal-title":"Bioinformatics"},{"issue":"2","key":"762_CR7","doi-asserted-by":"publisher","first-page":"R16","DOI":"10.1186\/gb-2005-6-2-r16","volume":"6","author":"SE Choe","year":"2005","unstructured":"Choe SE, Boutros M, Michelson AM, Church GM, Halfon MS: Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset. Genome Biol 2005, 6(2):R16. 10.1186\/gb-2005-6-2-r16","journal-title":"Genome Biol"},{"issue":"1","key":"762_CR8","doi-asserted-by":"publisher","first-page":"80","DOI":"10.1186\/1471-2105-6-80","volume":"6","author":"A Ploner","year":"2005","unstructured":"Ploner A, Miller LD, Hall P, Bergh J, Pawitan Y: Correlation test to assess low-level processing of high-density oligonucleotide microarray data. BMC Bioinformatics 2005, 6(1):80. 10.1186\/1471-2105-6-80","journal-title":"BMC Bioinformatics"},{"issue":"468","key":"762_CR9","doi-asserted-by":"publisher","first-page":"909","DOI":"10.1198\/016214504000000683","volume":"99","author":"Z Wu","year":"2004","unstructured":"Wu Z, Irizarry RA, Gentleman R, Martinez-Murillo F, Spencer F: A Model-Based Background Adjustment for Oligonucleotide Expression Arrays. Journal of the American Statistical Association 2004, 99(468):909\u2013917. 10.1198\/016214504000000683","journal-title":"Journal of the American Statistical Association"},{"issue":"4","key":"762_CR10","doi-asserted-by":"crossref","first-page":"618","DOI":"10.2144\/04364ST02","volume":"36","author":"W Etienne","year":"2004","unstructured":"Etienne W, Meyer MH, Peppers J, Meyer RAJ: Comparison of mRNA gene expression by RT-PCR and DNA microarray. Biotechniques 2004, 36(4):618\u201320, 622, 624\u20136.","journal-title":"Biotechniques"},{"issue":"1","key":"762_CR11","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1186\/1471-2164-6-16","volume":"6","author":"W Fan","year":"2005","unstructured":"Fan W, Pritchard JI, Olson JM, Khalid N, Zhao LP: A class of models for analyzing GeneChip gene expression analysis array data. BMC Genomics 2005, 6(1):16. 10.1186\/1471-2164-6-16","journal-title":"BMC Genomics"},{"issue":"9","key":"762_CR12","doi-asserted-by":"publisher","first-page":"e45","DOI":"10.1093\/nar\/29.9.e45","volume":"29","author":"MW Pfaffl","year":"2001","unstructured":"Pfaffl MW: A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res 2001, 29(9):e45. 10.1093\/nar\/29.9.e45","journal-title":"Nucleic Acids Res"},{"issue":"10","key":"762_CR13","doi-asserted-by":"publisher","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","volume":"5","author":"RC Gentleman","year":"2004","unstructured":"Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JY, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004, 5(10):R80. 10.1186\/gb-2004-5-10-r80","journal-title":"Genome Biol"},{"key":"762_CR14","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1080\/10618600.1996.10474713","volume":"5","author":"R Ihaka","year":"1996","unstructured":"Ihaka R, Gentleman R: R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics 1996, 5: 299\u2013314.","journal-title":"Journal of Computational and Graphical Statistics"},{"key":"762_CR15","unstructured":"Bolstad B: affy: Built-in Processing Methods.[http:\/\/www.bioconductor.org\/repository\/devel\/vignette\/builtinMethods.pdf]"},{"key":"762_CR16","volume-title":"Science","author":"SE Schriner","year":"2005","unstructured":"Schriner SE, Linford NJ, Martin GM, Treuting P, Ogburn CE, Emond M, Coskun PE, Ladiges W, Wolf N, Van Remmen H, Wallace DC, Rabinovitch PS: Extension of Murine Lifespan by Overexpression of Catalase Targeted to Mitochondria. Science 2005."},{"key":"762_CR17","unstructured":"Affymetrix statistical algorithms description document[http:\/\/www.affymetrix.com\/support\/technical\/whitepapers\/sadd_whitepaper.pdf]"},{"key":"762_CR18","doi-asserted-by":"publisher","first-page":"S96","DOI":"10.1093\/bioinformatics\/18.suppl_1.S96","volume":"18 Suppl 1","author":"W Huber","year":"2002","unstructured":"Huber W, von Heydebreck A, Sultmann H, Poustka A, Vingron M: Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 2002, 18 Suppl 1: S96\u2013104.","journal-title":"Bioinformatics"},{"issue":"1","key":"762_CR19","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1073\/pnas.98.1.31","volume":"98","author":"C Li","year":"2001","unstructured":"Li C, Wong WH: Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. Proc Natl Acad Sci U S A 2001, 98(1):31\u201336. 10.1073\/pnas.011404098","journal-title":"Proc Natl Acad Sci U S A"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-23.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,7]],"date-time":"2025-01-07T09:21:10Z","timestamp":1736241670000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-23"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,1,17]]},"references-count":19,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["762"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-23","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,1,17]]},"assertion":[{"value":"20 May 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 January 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 January 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"23"}}