{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T13:32:28Z","timestamp":1777901548313,"version":"3.51.4"},"reference-count":54,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2020,8,13]],"date-time":"2020-08-13T00:00:00Z","timestamp":1597276800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer Institute","doi-asserted-by":"publisher","award":["P01-CA151135"],"award-info":[{"award-number":["P01-CA151135"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer Institute","doi-asserted-by":"publisher","award":["P50-CA05822"],"award-info":[{"award-number":["P50-CA05822"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer Institute","doi-asserted-by":"publisher","award":["U01-CA179715"],"award-info":[{"award-number":["U01-CA179715"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer Institute","doi-asserted-by":"publisher","award":["P01-CA142538"],"award-info":[{"award-number":["P01-CA142538"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer 
Institute","doi-asserted-by":"publisher","award":["P30-ES010126"],"award-info":[{"award-number":["P30-ES010126"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"National Cancer Institute","doi-asserted-by":"publisher","award":["3P30CA016086"],"award-info":[{"award-number":["3P30CA016086"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Komen Career Catalyst","award":["CCR16376756"],"award-info":[{"award-number":["CCR16376756"]}]},{"DOI":"10.13039\/100000057","name":"National Institute of General Medical Sciences","doi-asserted-by":"publisher","award":["1T32GM12274"],"award-info":[{"award-number":["1T32GM12274"]}],"id":[{"id":"10.13039\/100000057","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,5,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>The NanoString RNA counting assay for formalin-fixed paraffin embedded samples is unique in its sensitivity, technical reproducibility and robustness for analysis of clinical and archival samples. While commercial normalization methods are provided by NanoString, they are not optimal for all settings, particularly when samples exhibit strong technical or biological variation or where housekeeping genes have variable performance across the cohort. Here, we develop and evaluate a more comprehensive normalization procedure for NanoString data with steps for quality control, selection of housekeeping targets, normalization and iterative data visualization and biological validation. The approach was evaluated using a large cohort ($N=1649$) from the Carolina Breast Cancer Study, two cohorts of moderate sample size ($N=359$ and $130$) and a small published dataset ($N=12$). The iterative process developed here eliminates technical variation (e.g. 
from different study phases or sites) more reliably than the three other methods, including NanoString\u2019s commercial package, without diminishing biological variation, especially in long-term longitudinal multiphase or multisite cohorts. We also find that probe sets validated for nCounter, such as the PAM50 gene signature, are impervious to batch issues. This work emphasizes that systematic quality control, normalization and visualization of NanoString nCounter data are an imperative component of study design that influences results in downstream analyses.<\/jats:p>","DOI":"10.1093\/bib\/bbaa163","type":"journal-article","created":{"date-parts":[[2020,8,11]],"date-time":"2020-08-11T00:11:10Z","timestamp":1597104670000},"source":"Crossref","is-referenced-by-count":105,"title":["An approach for normalization and quality control for NanoString RNA expression data"],"prefix":"10.1093","volume":"22","author":[{"given":"Arjun","family":"Bhattacharya","sequence":"first","affiliation":[{"name":"University of North Carolina at Chapel Hill"}]},{"given":"Alina M","family":"Hamilton","sequence":"additional","affiliation":[{"name":"University of North Carolina at Chapel Hill"}]},{"given":"Helena","family":"Furberg","sequence":"additional","affiliation":[{"name":"Memorial Sloan Kettering Cancer Center"}]},{"given":"Eugene","family":"Pietzak","sequence":"additional","affiliation":[{"name":"Memorial Sloan Kettering Cancer Center"}]},{"given":"Mark P","family":"Purdue","sequence":"additional","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute"}]},{"given":"Melissa A","family":"Troester","sequence":"additional","affiliation":[{"name":"University of North Carolina at Chapel Hill"}]},{"given":"Katherine A","family":"Hoadley","sequence":"additional","affiliation":[{"name":"University of North Carolina at Chapel Hill"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8401-0545","authenticated-orcid":false,"given":"Michael 
I","family":"Love","sequence":"additional","affiliation":[{"name":"University of North Carolina at Chapel Hill"}]}],"member":"286","published-online":{"date-parts":[[2020,8,13]]},"reference":[{"key":"2021052110392617100_ref1","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1038\/nbt1385","article-title":"Direct multiplexed measurement of gene expression with color-coded probe pairs","volume":"26","author":"Geiss","year":"2008","journal-title":"Nat Biotechnol"},{"key":"2021052110392617100_ref2","doi-asserted-by":"crossref","first-page":"2587","DOI":"10.1158\/0008-5472.CAN-15-0262","article-title":"Evaluating robustness and sensitivity of the NanoString technologies nCounter platform to enable multiplexed gene expression analysis of clinical samples","volume":"75","author":"Veldman-Jones","year":"2015","journal-title":"Cancer Res"},{"key":"2021052110392617100_ref3","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1093\/jnci\/djx135","article-title":"Racial differences in PAM50 subtypes in the Carolina Breast Cancer Study","volume":"110","author":"Troester","year":"2018","journal-title":"J Natl Cancer Inst"},{"key":"2021052110392617100_ref4","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1186\/s12920-015-0129-6","article-title":"Development and verification of the PAM50-based Prosigna breast cancer gene signature assay","volume":"8","author":"Wallden","year":"2015","journal-title":"BMC Med Genomics"},{"key":"2021052110392617100_ref5","doi-asserted-by":"crossref","first-page":"248","DOI":"10.3389\/fmed.2018.00248","article-title":"An update on breast cancer multigene prognostic tests-emergent clinical biomarkers","volume":"5","author":"Vieira","year":"2018","journal-title":"Front Med"},{"key":"2021052110392617100_ref6","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1093\/biostatistics\/kxr034","article-title":"Using control genes to correct for unwanted variation in microarray 
data","volume":"13","author":"Gagnon-Bartsch","year":"2012","journal-title":"Biostatistics"},{"key":"2021052110392617100_ref7","doi-asserted-by":"crossref","first-page":"6073","DOI":"10.1093\/nar\/gkz433","article-title":"A new normalization for Nanostring nCounter gene expression data","volume":"47","author":"Molania","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2021052110392617100_ref8","doi-asserted-by":"crossref","first-page":"896","DOI":"10.1038\/nbt.2931","article-title":"Normalization of RNA-seq data using factor analysis of control genes or samples","volume":"32","author":"Risso","year":"2014","journal-title":"Nat Biotechnol"},{"key":"2021052110392617100_ref9","first-page":"5","article-title":"nSolverTM 4.0 Analysis Software","author":"NanoString Technologies","year":"2018"},{"key":"2021052110392617100_ref10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/gb-2002-3-7-research0034","article-title":"Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes","volume":"3","author":"Vandesompele","year":"2002","journal-title":"Genome Biol"},{"key":"2021052110392617100_ref11","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1186\/1471-2164-13-296","article-title":"ReadqPCR and NormqPCR: R packages for the reading, quality checking and normalisation of RT-qPCR quantification cycle (Cq) data","volume":"13","author":"Perkins","year":"2012","journal-title":"BMC Genomics"},{"key":"2021052110392617100_ref12","doi-asserted-by":"crossref","first-page":"1546","DOI":"10.1093\/bioinformatics\/bts188","article-title":"Gene expression NanoStringNorm: an extensible R package for the pre-processing of NanoString mRNA and miRNA data","volume":"28","author":"Waggott","year":"2012","journal-title":"Bioinforma Appl Note"},{"key":"2021052110392617100_ref13","doi-asserted-by":"crossref","first-page":"gkw677","DOI":"10.1093\/nar\/gkw677","article-title":"NanoStringDiff: a novel 
statistical method for differential expression analysis based on NanoString nCounter data","volume":"44","author":"Wang","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2021052110392617100_ref14","doi-asserted-by":"crossref","first-page":"1617","DOI":"10.1214\/19-AOAS1249","article-title":"Rcrnorm: an integrated system of random-coefficient hierarchical regression models for normalizing nanostring ncounter data","volume":"13","author":"Jia","year":"2019","journal-title":"Ann Appl Stat"},{"key":"2021052110392617100_ref15","doi-asserted-by":"crossref","first-page":"970","DOI":"10.1093\/bioinformatics\/btz647","article-title":"NACHO: an R package for quality control of NanoString nCounter data","volume":"36","author":"Canouil","year":"2020","journal-title":"Bioinformatics"},{"key":"2021052110392617100_ref16","doi-asserted-by":"crossref","first-page":"437","DOI":"10.1007\/s10549-015-3474-4","article-title":"Race-associated biological differences among luminal A breast tumors","volume":"152","author":"D\u2019Arcy","year":"2015","journal-title":"Breast Cancer Res Treat"},{"key":"2021052110392617100_ref17","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1093\/aje\/kwh331","article-title":"Comparative analysis of breast cancer risk factors among African-American women and white women","volume":"161","author":"Hall","year":"2005","journal-title":"Am J Epidemiol"},{"key":"2021052110392617100_ref18","doi-asserted-by":"crossref","first-page":"1912","DOI":"10.1038\/sj.bjc.6604761","article-title":"Tobacco smoking, body mass index, hypertension, and kidney cancer risk in central and eastern Europe","volume":"99","author":"Brennan","year":"2008","journal-title":"Br J Cancer"},{"key":"2021052110392617100_ref19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pgen.1002312","article-title":"Von Hippel-Lindau (VHL) inactivation in sporadic clear cell renal cancer: associations with germline VHL polymorphisms and etiologic risk 
factors","volume":"7","author":"Moore","year":"2011","journal-title":"PLoS Genet"},{"key":"2021052110392617100_ref20","doi-asserted-by":"crossref","first-page":"e0218674","DOI":"10.1371\/journal.pone.0218674","article-title":"Tumor- and cytokine-primed human natural killer cells exhibit distinct phenotypic and transcriptional signatures","volume":"14","author":"Sabry","year":"2019","journal-title":"PLoS One"},{"key":"2021052110392617100_ref21","article-title":"NanoStringQCPro: Quality metrics and data processing methods for NanoString mRNA gene expression data. R package version 1.20.0","author":"Nickles","year":"2015"},{"key":"2021052110392617100_ref22","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-21706-2","volume-title":"Modern Applied Statistics with S","author":"Venables","year":"2002"},{"key":"2021052110392617100_ref23","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1186\/1471-2105-11-94","article-title":"Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments","volume":"11","author":"Bullard","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2021052110392617100_ref24","doi-asserted-by":"crossref","first-page":"R106","DOI":"10.1186\/gb-2010-11-10-r106","article-title":"Differential expression analysis for sequence count data","volume":"11","author":"Anders","year":"2010","journal-title":"Genome Biol"},{"key":"2021052110392617100_ref25","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2021052110392617100_ref26","doi-asserted-by":"crossref","first-page":"e47","DOI":"10.1093\/nar\/gkv007","article-title":"Limma powers differential expression analyses for RNA-sequencing and microarray studies","volume":"43","author":"Ritchie","year":"2015","journal-title":"Nucleic Acids 
Res"},{"key":"2021052110392617100_ref27","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: a graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J Comput Appl Math"},{"key":"2021052110392617100_ref28","doi-asserted-by":"crossref","first-page":"1353","DOI":"10.1093\/bioinformatics\/bts163","article-title":"Gene expression matrix eQTL: ultra fast eQTL analysis via large matrix operations","volume":"28","author":"Shabalin","year":"2012","journal-title":"Bioinformatics"},{"issue":"1","key":"2021052110392617100_ref29","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1186\/s13059-020-1942-6","article-title":"A framework for transcriptome-wide association studies in breast cancer in diverse study populations","volume":"57","author":"Bhattacharya","year":"2020","journal-title":"Genome Biol"},{"key":"2021052110392617100_ref30","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple","volume":"57","author":"Benjamini","year":"1995","journal-title":"Source J R Stat Soc Ser B"},{"key":"2021052110392617100_ref31","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1200\/JCO.2008.18.1370","article-title":"Supervised risk predictor of breast cancer based on intrinsic subtypes","volume":"27","author":"Parker","year":"2009","journal-title":"J Clin Oncol"},{"key":"2021052110392617100_ref32","volume-title":"genefu: Computation of Gene Expression-Based Signatures in Breast Cancer. 
R package version 2.20.0","author":"Gendoo","year":"2020"},{"key":"2021052110392617100_ref33","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1146\/annurev-statistics-060116-054026","article-title":"The energy of data","volume":"4","author":"Sz\u00e9kely","year":"2017","journal-title":"Annu Rev Stat Its Appl"},{"key":"2021052110392617100_ref34","doi-asserted-by":"crossref","first-page":"1281","DOI":"10.7150\/jca.13141","article-title":"Cancer hallmarks, biomarkers and breast cancer molecular subtypes","volume":"7","author":"Dai","year":"2016","journal-title":"J Cancer"},{"key":"2021052110392617100_ref35","doi-asserted-by":"crossref","first-page":"2784","DOI":"10.1200\/JCO.2009.25.6529","article-title":"American Society of Clinical Oncology\/College of American Pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer","volume":"28","author":"Elizabeth","year":"2010","journal-title":"J Clin Oncol"},{"key":"2021052110392617100_ref36","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1038\/nature10983","article-title":"The genomic and transcriptomic architecture of 2000 breast tumours reveals novel subgroups","volume":"486","author":"Curtis","year":"2012","journal-title":"Nature"},{"key":"2021052110392617100_ref37","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1038\/35021093","article-title":"Molecular portraits of human breast tumours","volume":"406","author":"Perou","year":"2000","journal-title":"Nature"},{"key":"2021052110392617100_ref38","doi-asserted-by":"crossref","first-page":"8418","DOI":"10.1073\/pnas.0932692100","article-title":"Repeated observation of breast tumor subtypes in independent gene expression data sets","volume":"100","author":"S\u00f8rlie","year":"2003","journal-title":"Proc Natl Acad Sci USA"},{"key":"2021052110392617100_ref39","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1016\/j.cell.2018.03.022","article-title":"Cell-of-origin 
patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer","volume":"173","author":"Hoadley","year":"2018","journal-title":"Cell"},{"key":"2021052110392617100_ref40","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12864-019-5849-0","article-title":"Breast cancer PAM50 signature: correlation and concordance between RNA-Seq and digital multiplexed gene expression technologies in a triple negative breast cancer series","volume":"20","author":"Picornell","year":"2019","journal-title":"BMC Genomics"},{"key":"2021052110392617100_ref41","first-page":"209","article-title":"The detection of disease clustering and a generalized regression approach","volume":"27","author":"Mantel","year":"1967","journal-title":"Cancer Res"},{"key":"2021052110392617100_ref42","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3390\/ht7030023","article-title":"P-value histograms: inference and diagnostics","volume":"7","author":"Breheny","year":"2018","journal-title":"High-Throughput"},{"key":"2021052110392617100_ref43","doi-asserted-by":"crossref","first-page":"e47510","DOI":"10.1371\/journal.pone.0047510","article-title":"Housekeeping gene selection advisory: glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and \u03b2-actin are targets of miR-644a","volume":"7","author":"Sikand","year":"2012","journal-title":"PLoS One"},{"key":"2021052110392617100_ref44","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1152\/physiolgenomics.00025.2005","article-title":"GAPDH as a housekeeping gene: analysis of GAPDH mRNA expression in a panel of 72 human tissues","volume":"21","author":"Barber","year":"2005","journal-title":"Physiol Genomics"},{"key":"2021052110392617100_ref45","first-page":"773","article-title":"Adipose Tissue Gene Expression Associations Reveal Hundreds of Candidate Genes for Cardiometabolic Traits","volume-title":"Am J Hum 
Genet","author":"Raulerson","year":"2019"},{"key":"2021052110392617100_ref46","doi-asserted-by":"crossref","first-page":"204","DOI":"10.1038\/nature24277","article-title":"Genetic effects on gene expression across human tissues","volume":"550","author":"Aguet","year":"2017","journal-title":"Nature"},{"key":"2021052110392617100_ref47","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3389\/fgene.2018.00341","article-title":"Genome-wide expression quantitative trait loci analysis using mixed models","volume":"9","author":"Lee","year":"2018","journal-title":"Front Genet"},{"key":"2021052110392617100_ref48","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pone.0023192","article-title":"A robust statistical method for association-based eQTL analysis","volume":"6","author":"Jiang","year":"2011","journal-title":"PLoS One"},{"key":"2021052110392617100_ref49","doi-asserted-by":"crossref","first-page":"1909","DOI":"10.1534\/genetics.108.094201","article-title":"Accurate discovery of expression quantitative trait loci under confounding from spurious and genuine regulatory hotspots","volume":"180","author":"Hyun","year":"2008","journal-title":"Genetics"},{"key":"2021052110392617100_ref50","first-page":"1","article-title":"DataRemix: A Universal Data Transformation for Optimal Inference from Gene Expression Datasets","volume-title":"bioRxiv","author":"Mao","year":"2019"},{"key":"2021052110392617100_ref51","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1093\/biostatistics\/4.2.249","article-title":"Exploration, normalization, and summaries of high density oligonucleotide array probe level data","volume":"4","author":"Irizarry","year":"2003","journal-title":"Biostatistics"},{"key":"2021052110392617100_ref52","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1186\/s12859-015-0745-3","article-title":"Systematic noise degrades gene co-expression signals but can be 
corrected","volume":"16","author":"Freytag","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2021052110392617100_ref53","article-title":"bhattacharya-a-bt\/CBCS_normalization: Code and summary results for \u201cAn approach for normalization and quality control for NanoString RNA expression data\u201d (Version v1.0)","volume-title":"Zenodo","author":"Bhattacharya","year":"2020"},{"key":"2021052110392617100_ref54","article-title":"bhattacharya-a-bt\/CBCS_TWAS_Paper: Code, models, and results for CBCS TWAS Paper (Version v1.0)","author":"Bhattacharya","year":"2019","journal-title":"Zenodo"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa163\/37965522\/bbaa163.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa163\/37965522\/bbaa163.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,11]],"date-time":"2024-08-11T12:23:03Z","timestamp":1723378983000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaa163\/5891144"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,8,13]]},"references-count":54,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2021,5,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaa163","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.04.08.032490","asserted-by":"object"}]},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,5]]},"published":{"date-parts":[[2020,8,13]]},"article-number":"bbaa163"}}