{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T22:22:39Z","timestamp":1775082159950,"version":"3.50.1"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2022,4,7]],"date-time":"2022-04-07T00:00:00Z","timestamp":1649289600000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,5,13]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Reproducibility of results obtained using ribonucleic acid (RNA) data across labs remains a major hurdle in cancer research. Often, molecular predictors trained on one dataset cannot be applied to another due to differences in RNA library preparation and quantification, which inhibits the validation of predictors across labs. While current RNA correction algorithms reduce these differences, they require simultaneous access to patient-level data from all datasets, which necessitates the sharing of training data for predictors when sharing predictors. Here, we describe SpinAdapt, an unsupervised RNA correction algorithm that enables the transfer of molecular models without requiring access to patient-level data. It computes data corrections only via aggregate statistics of each dataset, thereby maintaining patient data privacy. Despite an inherent trade-off between privacy and performance, SpinAdapt outperforms current correction methods, like Seurat and ComBat, on publicly available cancer studies, including TCGA and ICGC. Furthermore, SpinAdapt can correct new samples, thereby enabling unbiased evaluation on validation cohorts. 
We expect this novel correction paradigm to enhance research reproducibility and to preserve patient privacy.<\/jats:p>","DOI":"10.1093\/bib\/bbac110","type":"journal-article","created":{"date-parts":[[2022,3,17]],"date-time":"2022-03-17T04:18:15Z","timestamp":1647490695000},"source":"Crossref","is-referenced-by-count":2,"title":["Privacy preserving validation for multiomic prediction models"],"prefix":"10.1093","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5383-5349","authenticated-orcid":false,"given":"Talal","family":"Ahmed","sequence":"first","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"given":"Mark A","family":"Carty","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"given":"Stephane","family":"Wenric","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1640-2616","authenticated-orcid":false,"given":"Jonathan R","family":"Dry","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"given":"Ameen A","family":"Salahudeen","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"given":"Aly A","family":"Khan","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"given":"Eric","family":"Lefkofsky","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"given":"Martin C","family":"Stumpe","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]},{"given":"Raphael","family":"Pelossof","sequence":"additional","affiliation":[{"name":"Tempus Labs Inc., Chicago, IL 60654, USA"}]}],"member":"286","published-online":{"date-parts":[[2022,4,6]]},"reference":[{"key":"2022051813463536200_ref1","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1038\/nrg2825","article-title":"Tackling the widespread and critical 
impact of batch effects in high-throughput data","volume":"11","author":"Leek","year":"2010","journal-title":"Nat Rev Genet"},{"key":"2022051813463536200_ref2","doi-asserted-by":"crossref","first-page":"e161","DOI":"10.1093\/nar\/gku864","article-title":"svaseq: removing batch effects and other unwanted noise from sequencing data","volume":"42","author":"Leek","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2022051813463536200_ref3","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/s13059-019-1850-9","article-title":"A benchmark of batch-effect correction methods for single-cell RNA sequencing data","volume":"21","author":"Tran","year":"2020","journal-title":"Genome Biol"},{"key":"2022051813463536200_ref4","doi-asserted-by":"crossref","first-page":"929","DOI":"10.1016\/j.cell.2014.06.049","article-title":"Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin","volume":"158","author":"Hoadley","year":"2014","journal-title":"Cell"},{"key":"2022051813463536200_ref5","doi-asserted-by":"crossref","first-page":"e47","DOI":"10.1093\/nar\/gkv007","article-title":"limma powers differential expression analyses for RNA-sequencing and microarray studies","volume":"43","author":"Ritchie","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2022051813463536200_ref6","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1038\/nbt.4091","article-title":"Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors","volume":"36","author":"Haghverdi","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2022051813463536200_ref7","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1038\/s41587-019-0113-3","article-title":"Efficient integration of heterogeneous single-cell transcriptomes using Scanorama","volume":"37","author":"Hie","year":"2019","journal-title":"Nat 
Biotechnol"},{"key":"2022051813463536200_ref8","doi-asserted-by":"crossref","first-page":"715","DOI":"10.1038\/s41592-019-0494-8","article-title":"scGen predicts single-cell perturbation responses","volume":"16","author":"Lotfollahi","year":"2019","journal-title":"Nat Methods"},{"key":"2022051813463536200_ref9","doi-asserted-by":"crossref","first-page":"9775","DOI":"10.1073\/pnas.1820006116","article-title":"scMerge leverages factor analysis, stable expression, and pseudoreplication to merge multiple single-cell RNA-seq datasets","volume":"116","author":"Lin","year":"2019","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2022051813463536200_ref10","doi-asserted-by":"crossref","first-page":"1289","DOI":"10.1038\/s41592-019-0619-0","article-title":"Fast, sensitive and accurate integration of single-cell data with Harmony","volume":"16","author":"Korsunsky","year":"2019","journal-title":"Nat Methods"},{"key":"2022051813463536200_ref11","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1093\/biostatistics\/kxj037","article-title":"Adjusting batch effects in microarray expression data using empirical Bayes methods","volume":"8","author":"Johnson","year":"2007","journal-title":"Biostatistics"},{"key":"2022051813463536200_ref12","doi-asserted-by":"crossref","first-page":"1888","DOI":"10.1016\/j.cell.2019.05.031","article-title":"Comprehensive integration of single-cell data","volume":"177","author":"Stuart","year":"2019","journal-title":"Cell"},{"key":"2022051813463536200_ref13","first-page":"219","article-title":"Enhancing access to microdata while protecting confidentiality: prospects for the future","volume":"6","author":"Duncan","year":"1991","journal-title":"Stat Sci"},{"key":"2022051813463536200_ref14","first-page":"1","article-title":"ComBat-seq: batch effect adjustment for RNA-seq count data","volume":"2","author":"Zhang","year":"2020","journal-title":"NAR Genom Bioinform"},{"key":"2022051813463536200_ref15","volume-title":"Bitcoin: A Peer-to-Peer 
Electronic Cash System","author":"Nakamoto","year":"2019"},{"key":"2022051813463536200_ref16","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1016\/j.eururo.2019.09.006","article-title":"A consensus molecular classification of muscle-invasive bladder cancer","volume":"77","author":"Kamoun","year":"2020","journal-title":"Eur Urol"},{"key":"2022051813463536200_ref17","doi-asserted-by":"crossref","first-page":"7642","DOI":"10.1158\/1078-0432.CCR-09-1431","article-title":"Metastasis-associated gene expression changes predict poor outcomes in patients with Dukes stage B and C colorectal cancer","volume":"15","author":"Jorissen","year":"2009","journal-title":"Clin Cancer Res"},{"key":"2022051813463536200_ref18","doi-asserted-by":"crossref","first-page":"12123","DOI":"10.1038\/s41598-020-69083-y","article-title":"ColoType: a forty gene signature for consensus molecular subtyping of colorectal cancer tumors using whole-genome assay or targeted RNA-sequencing","volume":"10","author":"Buechler","year":"2020","journal-title":"Sci Rep"},{"key":"2022051813463536200_ref19","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1016\/j.cell.2018.03.022","article-title":"Cell-of-origin patterns dominate the molecular classification of 10,000 Tumors from 33 types of cancer","volume":"173","author":"Hoadley","year":"2018","journal-title":"Cell"},{"key":"2022051813463536200_ref20","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/s13073-015-0131-9","article-title":"The Sweden Cancerome analysis network\u2014breast (SCAN-B) initiative: a large-scale multicenter infrastructure towards implementation of breast cancer genomic analyses in the clinical routine","volume":"7","author":"Saal","year":"2015","journal-title":"Genome Med"},{"key":"2022051813463536200_ref21","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1186\/s12885-018-4546-8","article-title":"Gene expression profiling of 1200 pancreatic ductal adenocarcinoma reveals novel 
subtypes","volume":"18","author":"Zhao","year":"2018","journal-title":"BMC Cancer"},{"key":"2022051813463536200_ref22","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1007\/s10549-012-2143-0","article-title":"PAM50 assay and the three-gene model for identifying the major and clinically relevant molecular subtypes of breast cancer","volume":"135","author":"Prat","year":"2012","journal-title":"Breast Cancer Res Treat"},{"key":"2022051813463536200_ref23","doi-asserted-by":"crossref","first-page":"1350","DOI":"10.1038\/nm.3967","article-title":"The consensus molecular subtypes of colorectal cancer","volume":"21","author":"Guinney","year":"2015","journal-title":"Nat Med"},{"key":"2022051813463536200_ref24","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1038\/nature16965","article-title":"Genomic analyses identify molecular subtypes of pancreatic cancer","volume":"531","author":"Australian Pancreatic Cancer Genome Initiative","year":"2016","journal-title":"Nature"},{"key":"2022051813463536200_ref25","doi-asserted-by":"crossref","first-page":"1895","DOI":"10.1162\/089976698300017197","article-title":"Approximate statistical tests for comparing supervised classification learning algorithms","volume":"10","author":"Dietterich","year":"1998","journal-title":"Neural Comput"},{"key":"2022051813463536200_ref26","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: a graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J Comput Appl Math"}],"container-title":["Briefings in 
Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/3\/bbac110\/43745549\/bbac110.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/23\/3\/bbac110\/43745549\/bbac110.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,18]],"date-time":"2022-05-18T13:57:30Z","timestamp":1652882250000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac110\/6564350"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,6]]},"references-count":26,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,5,13]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac110","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,5]]},"published":{"date-parts":[[2022,4,6]]},"article-number":"bbac110"}}