{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T16:50:59Z","timestamp":1781628659990,"version":"3.54.5"},"reference-count":27,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Microarray technology has become a widely used tool in the biological sciences. Over the past decade, the number of users has grown exponentially, and with the number of applications and secondary data analyses rapidly increasing, we expect this rate to continue. Various initiatives such as the External RNA Control Consortium (ERCC) and the MicroArray Quality Control (MAQC) project have explored ways to provide standards for the technology. For microarrays to become generally accepted as a reliable technology, statistical methods for assessing quality will be an indispensable component; however, there remains a lack of consensus in both defining and measuring microarray quality.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We begin by providing a precise definition of microarray quality and reviewing existing Affymetrix GeneChip quality metrics in light of this definition. We show that the best-performing metrics require multiple arrays to be assessed simultaneously. While such <jats:italic>multi-array<\/jats:italic> quality metrics are adequate for bench science, as microarrays begin to be used in clinical settings, single-array quality metrics will be indispensable. To this end, we define a single-array version of one of the best multi-array quality metrics and show that this metric performs as well as the best multi-array metrics. We then use this new quality metric to assess the quality of microarry data available via the Gene Expression Omnibus (GEO) using more than 22,000 Affymetrix HGU133a and HGU133plus2 arrays from 809 studies.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>We find that approximately 10 percent of these publicly available arrays are of poor quality. Moreover, the quality of microarray measurements varies greatly from hybridization to hybridization, study to study, and lab to lab, with some experiments producing unusable data. Many of the concepts described here are applicable to other high-throughput technologies.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-137","type":"journal-article","created":{"date-parts":[[2011,5,7]],"date-time":"2011-05-07T18:15:17Z","timestamp":1304792117000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":74,"title":["Assessing affymetrix GeneChip microarray quality"],"prefix":"10.1186","volume":"12","author":[{"given":"Matthew N","family":"McCall","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Peter N","family":"Murakami","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Margus","family":"Lukk","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wolfgang","family":"Huber","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Rafael A","family":"Irizarry","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2011,5,7]]},"reference":[{"key":"4482_CR1","doi-asserted-by":"publisher","first-page":"731","DOI":"10.1038\/nmeth1005-731","volume":"2","author":"S Baker","year":"2005","unstructured":"Baker S, Bauer S, Beyer R, Brenton J, Bromley B, Burrill J, Causton H, Conley M, Elespuru R, Fero M, Foy C, Fuscoe J, Gao X, Gerhold D, Gilles P, Goodsaid F, Guo X, Hackett J, Hockett R, Ikonomi P, Irizarry R, Kawasaki E, Kaysser-Kranich T, Kerr K, Kiser G, Koch W, Lee K, Liu C, Liu Z, Lucas A, et al.: The External RNA Controls Consortium: a progress report. Nature Methods 2005, 2: 731\u2013734. 10.1038\/nmeth1005-731","journal-title":"Nature Methods"},{"key":"4482_CR2","doi-asserted-by":"publisher","first-page":"1151","DOI":"10.1038\/nbt1239","volume":"24","author":"M Consortium","year":"2006","unstructured":"Consortium M, Shi L, Reid L, Jones W, Shippy R, Warrington J, Baker S, Collins P, de Longueville F, Kawasaki E, Lee K, Luo Y, Sun Y, Willey J, Setterquist R, Fischer G, Tong W, Dragan Y, Dix D, Frueh F, Goodsaid F, Herman D, Jensen R, Johnson C, Lobenhofer E, Puri R, Schrf U, Thierry-Mieg J, Wang C, Wilson M, et al.: The MicroArray Quality Control (MAQC) project shows inter-and intraplatform reproducibility of gene expression measurements. Nature Biotechnology 2006, 24: 1151\u20131161. 10.1038\/nbt1239","journal-title":"Nature Biotechnology"},{"issue":"8","key":"4482_CR3","doi-asserted-by":"publisher","first-page":"827","DOI":"10.1038\/nbt.1665","volume":"28","author":"L Shi","year":"2010","unstructured":"Shi L, Campbell G, Jones W, Campagne F, Wen Z, Walker S, Su Z, Chu T, Goodsaid F, Pusztai L, Shaughnessy JJ, Oberthuer A, Thomas R, Paules R, Fielden M, Barlogie B, Chen W, Du P, Fischer M, Furlanello C, Gallas B, Ge X, Megherbi D, Symmans W, Wang M, Zhang J, Bitter H, Brors B, Bushel P, Bylesjo M, et al.: The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models. Nature biotechnology 2010, 28(8):827. 10.1038\/nbt.1665","journal-title":"Nature biotechnology"},{"key":"4482_CR4","unstructured":"American Society of Quality[http:\/\/asq.org\/glossary\/index.html]"},{"key":"4482_CR5","doi-asserted-by":"publisher","first-page":"911","DOI":"10.1038\/nmeth1102","volume":"4","author":"M Zilliox","year":"2007","unstructured":"Zilliox M, Irizarry R: A gene expression bar code for microarray data. Nature Methods 2007, 4: 911\u2013913. 10.1038\/nmeth1102","journal-title":"Nature Methods"},{"issue":"suppl 1","key":"4482_CR6","doi-asserted-by":"publisher","first-page":"D1011","DOI":"10.1093\/nar\/gkq1259","volume":"39","author":"M McCall","year":"2011","unstructured":"McCall M, Uppal K, Jaffee H, Zilliox M, Irizarry R: The Gene Expression Barcode: leveraging public data repositories to begin cataloging the human and murine transcriptomes. Nucleic Acids Research 2011, 39(suppl 1):D1011.","journal-title":"Nucleic Acids Research"},{"issue":"7","key":"4482_CR7","doi-asserted-by":"publisher","first-page":"466","DOI":"10.2174\/138920208786241199","volume":"9","author":"X Li","year":"2008","unstructured":"Li X, Quigg R, Zhou J, Gu W, Rao P, Reed E: Clinical utility of microarrays: Current status, existing challenges and future outlook. Current genomics 2008, 9(7):466. 10.2174\/138920208786241199","journal-title":"Current genomics"},{"issue":"4","key":"4482_CR8","doi-asserted-by":"publisher","first-page":"322","DOI":"10.1038\/nbt0410-322","volume":"28","author":"M Lukk","year":"2010","unstructured":"Lukk M, Kapushesky M, Nikkila J, Parkinson H, Goncalves A, Huber W, Ukkonen E, Brazma A: A global map of human gene expression. Nature biotechnology 2010, 28(4):322. 10.1038\/nbt0410-322","journal-title":"Nature biotechnology"},{"key":"4482_CR9","doi-asserted-by":"publisher","first-page":"271","DOI":"10.1186\/1471-2105-9-271","volume":"9","author":"X Liu","year":"2008","unstructured":"Liu X, Yu X, Zack D, Zhu H, Qian J: TiGER: a database for tissue-specific gene expression and regulation. BMC bioinformatics 2008, 9: 271. 10.1186\/1471-2105-9-271","journal-title":"BMC bioinformatics"},{"issue":"suppl 1","key":"4482_CR10","doi-asserted-by":"publisher","first-page":"D628","DOI":"10.1093\/nar\/gkj137","volume":"34","author":"O Ogasawara","year":"2006","unstructured":"Ogasawara O, Otsuji M, Watanabe K, Iizuka T, Tamura T, Hishiki T, Kawamoto S, Okubo K: BodyMap-Xs: anatomical breakdown of 17 million animal ESTs for cross-species comparison of gene expression. Nucleic Acids Research 2006, 34(suppl 1):D628.","journal-title":"Nucleic Acids Research"},{"issue":"7","key":"4482_CR11","doi-asserted-by":"publisher","first-page":"4465","DOI":"10.1073\/pnas.012025199","volume":"99","author":"A Su","year":"2002","unstructured":"Su A, Cooke M, Ching K, Hakak Y, Walker J, Wiltshire T, Orth A, Vega R, Sapinoso L, Moqrich A, Patapoutian A, Hampton G, Schultz P, Hogenesch J: Large-scale analysis of the human and mouse transcriptomes. Proceedings of the National Academy of Sciences of the United States of America 2002, 99(7):4465. 10.1073\/pnas.012025199","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"9","key":"4482_CR12","doi-asserted-by":"publisher","first-page":"1273","DOI":"10.1093\/bioinformatics\/btq109","volume":"26","author":"S Xiao","year":"2010","unstructured":"Xiao S, Zhang C, Zou Q, Ji Z: TiSGeD: a database for tissue-specific genes. Bioinformatics 2010, 26(9):1273. 10.1093\/bioinformatics\/btq109","journal-title":"Bioinformatics"},{"issue":"6","key":"4482_CR13","doi-asserted-by":"publisher","first-page":"557","DOI":"10.1089\/106652701753307485","volume":"8","author":"D Rocke","year":"2001","unstructured":"Rocke D, Durbin B: A model for measurement error for gene expression arrays. Journal of Computational Biology 2001, 8(6):557\u2013569. 10.1089\/106652701753307485","journal-title":"Journal of Computational Biology"},{"issue":"Suppl 1","key":"4482_CR14","doi-asserted-by":"publisher","first-page":"S96","DOI":"10.1093\/bioinformatics\/18.suppl_1.S96","volume":"18","author":"W Huber","year":"2002","unstructured":"Huber W, Von Heydebreck A, Sultmann H, Poustka A, Vingron M: Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 2002, 18(Suppl 1):S96. 10.1093\/bioinformatics\/18.suppl_1.S96","journal-title":"Bioinformatics"},{"issue":"2","key":"4482_CR15","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1214\/07-AOAS116","volume":"1","author":"Z Wu","year":"2007","unstructured":"Wu Z, Irizarry R: A statistical framework for the analysis of microarray probe-level data. Ann Appl Stat 2007, 1(2):333\u2013357. 10.1214\/07-AOAS116","journal-title":"Ann Appl Stat"},{"issue":"2","key":"4482_CR16","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1093\/biostatistics\/4.2.249","volume":"4","author":"R Irizarry","year":"2003","unstructured":"Irizarry R, Hobbs B, Collin F, Beazer-Barclay Y, Antonellis K, Scherf U, Speed T: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 2003, 4(2):249. 10.1093\/biostatistics\/4.2.249","journal-title":"Biostatistics"},{"key":"4482_CR17","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1016\/S0074-7742(04)60002-X","volume":"60","author":"B Bolstad","year":"2004","unstructured":"Bolstad B, Collin F, Simpson K, Irizarry R, Speed T: Experimental design and low-level analysis of microarray data. International review of neurobiology 2004, 60: 25.","journal-title":"International review of neurobiology"},{"key":"4482_CR18","volume-title":"GeneChip Expression Analysis: Data Analysis Fundamentals. Santa Clara, CA","author":"Affymetrix","year":"2002","unstructured":"Affymetrix: GeneChip Expression Analysis: Data Analysis Fundamentals. Santa Clara, CA. 2002."},{"issue":"2","key":"4482_CR19","doi-asserted-by":"publisher","first-page":"242","DOI":"10.1093\/biostatistics\/kxp059","volume":"11","author":"M McCall","year":"2010","unstructured":"McCall M, Bolstad B, Irizarry R: Frozen robust multiarray analysis (fRMA). Biostatistics 2010, 11(2):242. 10.1093\/biostatistics\/kxp059","journal-title":"Biostatistics"},{"issue":"26","key":"4482_CR20","doi-asserted-by":"publisher","first-page":"4236","DOI":"10.1200\/JCO.2006.05.6861","volume":"24","author":"K Hess","year":"2006","unstructured":"Hess K, Anderson K, Symmans W, Valero V, Ibrahim N, Mejia J, Booser D, Theriault R, Buzdar A, Dempsey P, Rouzier R, Sneige N, Ross J, Vidaurre T, Gomez H, Hortobagyi G, Pusztai L: Pharmacogenomic predictor of sensitivity to preoperative chemotherapy with paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide in breast cancer. Journal of clinical oncology 2006, 24(26):4236. 10.1200\/JCO.2006.05.6861","journal-title":"Journal of clinical oncology"},{"issue":"10","key":"4482_CR21","doi-asserted-by":"publisher","first-page":"6567","DOI":"10.1073\/pnas.082099299","volume":"99","author":"R Tibshirani","year":"2002","unstructured":"Tibshirani R, Hastie T, Narasimhan B, Chu G: Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proceedings of the National Academy of Sciences of the United states of America 2002, 99(10):6567. 10.1073\/pnas.082099299","journal-title":"Proceedings of the National Academy of Sciences of the United states of America"},{"key":"4482_CR22","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1093\/nar\/30.1.207","volume":"30","author":"R Edgar","year":"2002","unstructured":"Edgar R, Domrachev M, Lash A: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Research 2002, 30: 207. 10.1093\/nar\/30.1.207","journal-title":"Nucleic Acids Research"},{"issue":"suppl 1","key":"4482_CR23","doi-asserted-by":"publisher","first-page":"D868","DOI":"10.1093\/nar\/gkn889","volume":"37","author":"H Parkinson","year":"2009","unstructured":"Parkinson H, Kapushesky M, Kolesnikov N, Rustici G, Shojatalab M, Abeygunawardena N, Berube H, Dylag M, Emam I, Farne A, Holloway E, Lukk M, Malone J, Mani R, Pilicheva E, Rayner T, Rezwan F, Sharma A, Williams E, Bradley X, Adamusiak T, Brandizi M, Burdett T, Coulson R, Krestyaninova M, Kurnosov P, Maguire E, Neogi S, Rocca-Serra P, Sansone S, et al.: ArrayExpress update-from an archive of functional genomics experiments to the atlas of gene expression. Nucleic acids research 2009, 37(suppl 1):D868.","journal-title":"Nucleic acids research"},{"issue":"16","key":"4482_CR24","doi-asserted-by":"publisher","first-page":"2092","DOI":"10.1093\/bioinformatics\/btp354","volume":"25","author":"A Kauffmann","year":"2009","unstructured":"Kauffmann A, Rayner T, Parkinson H, Kapushesky M, Lukk M, Brazma A, Huber W: Importing arrayexpress datasets into r\/bioconductor. Bioinformatics 2009, 25(16):2092. 10.1093\/bioinformatics\/btp354","journal-title":"Bioinformatics"},{"issue":"3","key":"4482_CR25","doi-asserted-by":"publisher","first-page":"415","DOI":"10.1093\/bioinformatics\/btn647","volume":"25","author":"A Kauffmann","year":"2009","unstructured":"Kauffmann A, Gentleman R, Huber W: arrayQualityMetrics-a bioconductor package for quality assessment of microarray data. Bioinformatics 2009, 25(3):415. 10.1093\/bioinformatics\/btn647","journal-title":"Bioinformatics"},{"key":"4482_CR26","doi-asserted-by":"publisher","first-page":"138","DOI":"10.1016\/j.ygeno.2010.01.003","volume":"95","author":"A Kauffmann","year":"2010","unstructured":"Kauffmann A, Huber W: Microarray data quality control improves the detection of differentially expressed genes. Genomics 2010, 95: 138\u2013142. [NA] [NA] 10.1016\/j.ygeno.2010.01.003","journal-title":"Genomics"},{"key":"4482_CR27","doi-asserted-by":"publisher","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","volume":"5","author":"RC Gentleman","year":"2004","unstructured":"Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JYH, Zhang J: Bioconductor: Open software development for computational biology and bioinformatics. Genome Biology 2004, 5: R80. [http:\/\/genomebiology.com\/2004\/5\/10\/R80] 10.1186\/gb-2004-5-10-r80","journal-title":"Genome Biology"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-137.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T13:33:20Z","timestamp":1630503200000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-137"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,5,7]]},"references-count":27,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4482"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-137","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,5,7]]},"assertion":[{"value":"9 November 2010","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 May 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 May 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"137"}}