{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T11:29:49Z","timestamp":1769599789852,"version":"3.49.0"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T00:00:00Z","timestamp":1761523200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01 HL169954"],"award-info":[{"award-number":["R01 HL169954"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000738","name":"US Department of Veterans Affairs","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000738","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Put VA Data to Work for Veterans","award":["22-D4V"],"award-info":[{"award-number":["22-D4V"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Objective<\/jats:title>\n                    <jats:p>To develop a transfer-learning Bayesian sparse logistic regression model that transfers information learned from one dataset to another by using an informed prior to facilitate model fitting in small-sample clinical patient-level prediction problems that suffer from a lack of available information.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>We propose a Bayesian framework for prediction using logistic regression that aims to conduct transfer-learning on regression coefficient information from a larger dataset model (order 105-106 patients by 105 features) into a small-sample model (order 103 patients). Our approach imposes an informed, hierarchical prior on each regression coefficient defined as a discrete mixture of the Bayesian Bridge shrinkage prior and an informed normal distribution. Performance of the informed model is compared against traditional methods, primarily measured by area under the curve, calibration, bias, and sparsity using both simulations and a real-world problem.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Across all experiments, transfer-learning outperformed the traditional L1-regularized model across discrimination, calibration, bias, and sparsity. In fact, even using only a continuous shrinkage prior without the informed prior increased model performance when compared to L1-regularization.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>Transfer-learning using informed priors can help fine-tune prediction models in small datasets suffering from a lack of information. One large benefit is in that the prior is not dependent on patient-level information, such that we can conduct transfer-learning without violating privacy. In future work, the model can be applied for learning between disparate databases, or similar lack-of-information cases such as rare outcome prediction.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/jamia\/ocaf146","type":"journal-article","created":{"date-parts":[[2025,8,12]],"date-time":"2025-08-12T11:35:14Z","timestamp":1754998514000},"page":"409-423","source":"Crossref","is-referenced-by-count":0,"title":["Transfer-learning on federated observational healthcare data for prediction models using Bayesian sparse logistic regression with informed priors"],"prefix":"10.1093","volume":"33","author":[{"ORCID":"https:\/\/orcid.org\/0009-0009-3891-9235","authenticated-orcid":false,"given":"Kelly Mohe","family":"Li","sequence":"first","affiliation":[{"name":"University of California, Los Angeles Department of Biostatistics, , Los Angeles, CA 90024,","place":["United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2970-0778","authenticated-orcid":false,"given":"Jenna Marie","family":"Reps","sequence":"additional","affiliation":[{"name":"Janssen Research and Development , Raritan, NJ 08869,","place":["United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6932-2513","authenticated-orcid":false,"given":"Akihiko","family":"Nishimura","sequence":"additional","affiliation":[{"name":"Department of Biostatistics , Johns Hopkins University , Baltimore, MD 21218,","place":["United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0817-5361","authenticated-orcid":false,"given":"Martijn J","family":"Schuemie","sequence":"additional","affiliation":[{"name":"University of California, Los Angeles Department of Biostatistics, , Los Angeles, CA 90024,","place":["United States"]},{"name":"Janssen Research and Development , Raritan, NJ 08869,","place":["United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9818-479X","authenticated-orcid":false,"given":"Marc A","family":"Suchard","sequence":"additional","affiliation":[{"name":"University of California, Los Angeles Department of Biostatistics, , Los Angeles, CA 90024,","place":["United States"]},{"name":"University of California, Los Angeles Department of Biomathematics, , Los Angeles, CA 90024,","place":["United States"]},{"name":"US Department of Veterans Affairs VA Informatics and Computing Infrastructure, , Salt Lake City, UT 84113,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2025,10,27]]},"reference":[{"key":"2026012716173285300_ocaf146-B1","doi-asserted-by":"publisher","first-page":"969","DOI":"10.1093\/jamia\/ocy032","article-title":"Design and implementation of a standardized framework to generate and evaluate patient-level prediction models using observational healthcare data","volume":"25","author":"Reps","year":"2018","journal-title":"J Am Med Inform Assoc"},{"key":"2026012716173285300_ocaf146-B2","doi-asserted-by":"publisher","first-page":"404","DOI":"10.1002\/sam.11237","article-title":"Big data, big results: knowledge discovery in output from large-scale analytics","volume":"7","author":"McCormick","year":"2014","journal-title":"Stat Anal"},{"key":"2026012716173285300_ocaf146-B3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2414416.2414791","article-title":"Massive parallelization of serial inference algorithms for a complex generalized linear model","volume":"23","author":"Suchard","year":"2013","journal-title":"ACM Trans Model Comput Simul"},{"key":"2026012716173285300_ocaf146-B4","doi-asserted-by":"publisher","first-page":"668","DOI":"10.1002\/da.22774","article-title":"Finding factors that predict treatment-resistant depression: results of a cohort study","volume":"35","author":"Cepeda","year":"2018","journal-title":"Depress Anxiety"},{"key":"2026012716173285300_ocaf146-B5","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1186\/s12911-022-01879-6","article-title":"Learning patient-level prediction models across multiple healthcare databases: evaluation of ensembles for increasing model transportability","volume":"22","author":"Reps","year":"2022","journal-title":"BMC Med Inform Decis Mak"},{"key":"2026012716173285300_ocaf146-B6","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/s41512-022-00136-8","article-title":"Targeted validation: validating clinical prediction models in their intended population and setting","volume":"6","author":"Sperrin","year":"2022","journal-title":"Diagn Progn Res"},{"key":"2026012716173285300_ocaf146-B7","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1016\/j.jclinepi.2014.06.018","article-title":"A new framework to enhance the interpretation of external validation studies of clinical prediction models","volume":"68","author":"Debray","year":"2015","journal-title":"J Clin Epidemiol"},{"key":"2026012716173285300_ocaf146-B8","doi-asserted-by":"crossref","first-page":"826","DOI":"10.1016\/S0895-4356(03)00207-5","article-title":"External validation is necessary in prediction research: a clinical example","volume":"56","author":"Bleeker","year":"2003","journal-title":"J Clin Epidemiol"},{"key":"2026012716173285300_ocaf146-B9","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1016\/S0895-4356(03)00047-7","article-title":"Internal and external validation of predictive models: a simulation study of bias and precision in small samples","volume":"56","author":"Steyerberg","year":"2003","journal-title":"J Clin Epidemiol"},{"key":"2026012716173285300_ocaf146-B10","doi-asserted-by":"publisher","first-page":"265","DOI":"10.2188\/jea.JE20210089","article-title":"Bias in odds ratios from logistic regression methods with sparse data sets","volume":"33","author":"Gosho","year":"2023","journal-title":"J Epidemiol"},{"key":"2026012716173285300_ocaf146-B11","doi-asserted-by":"crossref","first-page":"2468","DOI":"10.1080\/01621459.2022.2057859","article-title":"Prior-preconditioned conjugate gradient method for accelerated gibbs sampling in \u201clarge n, large p\u201d Bayesian sparse regression","volume":"118","author":"Nishimura","year":"2022","journal-title":"J Am Stat Assoc"},{"key":"2026012716173285300_ocaf146-B12","author":"Observational Health Data Sciences and Informatics (OHDSI).","year":"2019"},{"key":"2026012716173285300_ocaf146-B13","first-page":"574","article-title":"Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers","volume":"216","author":"Hripcsak","year":"2015","journal-title":"Stud Health Technol Inform"},{"key":"2026012716173285300_ocaf146-B14","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J R Stat Soc"},{"key":"2026012716173285300_ocaf146-B15","volume-title":"Springer Series in Statistics","author":"Hastie","year":"2009","edition":"2nd ed"},{"key":"2026012716173285300_ocaf146-B16","doi-asserted-by":"publisher","first-page":"2684","DOI":"10.1080\/01621459.2022.2071278","article-title":"Transfer learning under high-dimensional generalized linear models","volume":"118","author":"Tian","year":"2023","journal-title":"J Am Stat Assoc"},{"key":"2026012716173285300_ocaf146-B17","doi-asserted-by":"crossref","DOI":"10.1201\/b16018","volume-title":"Bayesian Data Analysis","author":"Gelman","year":"2013","edition":"3rd ed"},{"key":"2026012716173285300_ocaf146-B18","doi-asserted-by":"crossref","first-page":"1339","DOI":"10.1080\/01621459.2013.829001","article-title":"Bayesian inference for logistic models using polya-gamma latent variables","volume":"108","author":"Polson","year":"2013","journal-title":"J Am Stat Assoc"},{"key":"2026012716173285300_ocaf146-B19","first-page":"1","article-title":"Scalable approximate MCMC algorithms for the horseshoe prior","volume":"21","author":"Johndrow","year":"2020","journal-title":"J Mach Learn Res"},{"key":"2026012716173285300_ocaf146-B20","author":"Schuemie","year":"2023"},{"key":"2026012716173285300_ocaf146-B21","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1111\/rssb.12042","article-title":"The Bayesian Bridge","volume":"76","author":"Polson","year":"2014","journal-title":"J R Stat Soc B"},{"key":"2026012716173285300_ocaf146-B22","doi-asserted-by":"crossref","first-page":"905","DOI":"10.1214\/07-AOS587","article-title":"The formal definition of reference priors","volume":"37","author":"Berger","year":"2009","journal-title":"Ann Statist"},{"key":"2026012716173285300_ocaf146-B23","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1016\/S0927-0507(06)13004-2","volume-title":"Handbooks in Operations Research and Management Science","author":"Devroye","year":"2006"},{"key":"2026012716173285300_ocaf146-B24","doi-asserted-by":"publisher","first-page":"881","DOI":"10.1080\/01621459.1993.10476353","article-title":"Variable selection via Gibbs sampling","volume":"88","author":"George","year":"1993","journal-title":"J Am Stat Assoc"},{"key":"2026012716173285300_ocaf146-B25","volume-title":"Data Analysis Using Regression and Multilevel\/Hierarchical Models. Analytical Methods for Social Research","author":"Gelman","year":"2007"},{"key":"2026012716173285300_ocaf146-B26","volume-title":"OMOP Common Data Model","author":"Observational Health Data Sciences and Informatics (OHDSI)","year":"2024"},{"key":"2026012716173285300_ocaf146-B27","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1145\/1143844.1143874","author":"Davis","year":"2006"},{"key":"2026012716173285300_ocaf146-B28","doi-asserted-by":"publisher","first-page":"230","DOI":"10.1186\/s12916-019-1466-7","article-title":"Calibration: the Achilles heel of predictive analytics","volume":"17","author":"Van Calster","year":"2019","journal-title":"BMC Med"},{"key":"2026012716173285300_ocaf146-B29","volume-title":"Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques","author":"Torrey","year":"2010"},{"key":"2026012716173285300_ocaf146-B30","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-6849-3","volume-title":"Applied Predictive Modeling","author":"Kuhn","year":"2013","edition":"1st ed"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/33\/2\/409\/64959710\/ocaf146.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/33\/2\/409\/64959710\/ocaf146.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T21:17:40Z","timestamp":1769548660000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/33\/2\/409\/8304361"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,27]]},"references-count":30,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2025,10,27]]},"published-print":{"date-parts":[[2026,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocaf146","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,2]]},"published":{"date-parts":[[2025,10,27]]}}}