{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T14:07:19Z","timestamp":1776348439959,"version":"3.51.2"},"reference-count":86,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2022,6,21]],"date-time":"2022-06-21T00:00:00Z","timestamp":1655769600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Research Foundation, Singapore under its AI Singapore Programme","award":["AISG-100E-2020-055 and No. AISG-GC-2019-002A"],"award-info":[{"award-number":["AISG-100E-2020-055 and No. AISG-GC-2019-002A"]}]},{"name":"NMRC HSRG","award":["MOH-000030-00"],"award-info":[{"award-number":["MOH-000030-00"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2022,10,31]]},"abstract":"<jats:p>In data-driven medical research, multi-center studies have long been preferred over single-center ones due to a single institute sometimes not having enough data to obtain sufficient statistical power for certain hypothesis testings as well as predictive and subgroup studies. The wide adoption of electronic health records (EHRs) has made multi-institutional collaboration much more feasible. However, concerns over infrastructures, regulations, privacy, and data standardization present a challenge to data sharing across healthcare institutions. Federated Learning (FL), which allows multiple sites to collaboratively train a global model without directly sharing data, has become a promising paradigm to break the data isolation. In this study, we surveyed existing works on FL applications in EHRs and evaluated the performance of current state-of-the-art FL algorithms on two EHR machine learning tasks of significant clinical importance on a real world multi-center EHR dataset.<\/jats:p>","DOI":"10.1145\/3514500","type":"journal-article","created":{"date-parts":[[2022,5,2]],"date-time":"2022-05-02T12:20:26Z","timestamp":1651494026000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":79,"title":["Federated Learning for Electronic Health Records"],"prefix":"10.1145","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7562-6495","authenticated-orcid":false,"given":"Trung Kien","family":"Dang","sequence":"first","affiliation":[{"name":"Saw Swee Hock School of Public Health, National University of Singapore, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0325-2065","authenticated-orcid":false,"given":"Xiang","family":"Lan","sequence":"additional","affiliation":[{"name":"Saw Swee Hock School of Public Health, National University of Singapore, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2540-3829","authenticated-orcid":false,"given":"Jianshu","family":"Weng","sequence":"additional","affiliation":[{"name":"AI Singapore, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5338-6248","authenticated-orcid":false,"given":"Mengling","family":"Feng","sequence":"additional","affiliation":[{"name":"Institute of Data Science &amp; Saw Swee Hock School of Public Health, National University of Singapore, Singapore"}]}],"member":"320","published-online":{"date-parts":[[2022,6,21]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3214303"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1088\/1361-6579\/abc960"},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","unstructured":"George J. Annas. 2003. HIPAA regulations\u2014A new era of medical-record privacy? N. Engl. J. Med. 348 15 (2003) 1486\u20131490.","DOI":"10.1056\/NEJMlim035027"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/2857705.2857731"},{"issue":"5","key":"e_1_3_2_6_2","first-page":"1333","article-title":"Privacy-preserving deep learning via additively homomorphic encryption","volume":"13","author":"Aono Yoshinori","year":"2017","unstructured":"Yoshinori Aono, Takuya Hayashi, Lihua Wang, Shiho Moriai, et\u00a0al. 2017. Privacy-preserving deep learning via additively homomorphic encryption. IEEE Trans. Info. Forens. Secur. 13, 5 (2017), 1333\u20131345.","journal-title":"IEEE Trans. Info. Forens. Secur."},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2017.10.002"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1186\/cc2872"},{"key":"e_1_3_2_9_2","unstructured":"Sabri Boughorbel Fethi Jarray Neethu Venugopal Shabir Moosa Haithum Elhadi and Michel Makhlouf. 2019. Federated uncertainty-aware learning for distributed hospital ehr data. Retrieved from https:\/\/arXiv:1910.12191."},{"key":"e_1_3_2_10_2","first-page":"451","volume-title":"Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases","author":"Boyd Kendrick","year":"2013","unstructured":"Kendrick Boyd, Kevin H. Eng, and C. David Page. 2013. Area under the precision-recall curve: Point estimates and confidence intervals. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 451\u2013466."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2018.01.007"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocy017"},{"key":"e_1_3_2_13_2","doi-asserted-by":"crossref","unstructured":"Min Chen Yongfeng Qian Jing Chen Kai Hwang Shiwen Mao and Long Hu. 2020. Privacy protection and intrusion avoidance for cloudlet-based medical data sharing. IEEE Trans. Cloud Comput. 8 4 (2020) 1274\u20131283.","DOI":"10.1109\/TCC.2016.2617382"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1681\/ASN.2004090740"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-63076-8_18"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.2307\/2531595"},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"Hao Du Ziyuan Pan Kee Yuan Ngiam Fei Wang Ping Shum and Mengling Feng. 2021. Self-correcting recurrent neural network for acute kidney injury prediction in critical care. Health Data Sci. 2021 Article 9808426 (2021) 10 pages.","DOI":"10.34133\/2021\/9808426"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocz199"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/1866739.1866758"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1007\/11681878_14"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1561\/0400000042"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0028071"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2014-002747"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1016\/S1364-6613(99)01294-2"},{"key":"e_1_3_2_25_2","volume-title":"The Elements of Statistical Learning","author":"Friedman Jerome","year":"2001","unstructured":"Jerome Friedman, Trevor Hastie, Robert Tibshirani, et\u00a0al. 2001. The Elements of Statistical Learning. Springer Series in Statistics, Vol. 1. Springer, New York."},{"issue":"9","key":"e_1_3_2_26_2","first-page":"e489\u2013e492","article-title":"The myth of generalisability in clinical research and machine learning in health care","volume":"2","author":"Futoma Joseph","year":"2020","unstructured":"Joseph Futoma, Morgan Simons, Trishan Panch, Finale Doshi-Velez, and Leo Anthony Celi. 2020. The myth of generalisability in clinical research and machine learning in health care. Lancet Dig. Health 2, 9 (2020), e489\u2013e492.","journal-title":"Lancet Dig. Health"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.3390\/jcm9030678"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1001\/jama.285.23.3015"},{"key":"e_1_3_2_29_2","unstructured":"Matei Grama Maria Musat Luis Mu\u00f1oz-Gonz\u00e1lez Jonathan Passerat-Palmbach Daniel Rueckert and Amir Alansary. 2020. Robust aggregation for adaptive privacy preserving federated learning in healthcare. Retrieved from https:\/\/arXiv:2009.08294."},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jnca.2018.05.003"},{"key":"e_1_3_2_31_2","unstructured":"Tzu-Ming Harry Hsu Hang Qi and Matthew Brown. 2019. Measuring the effects of non-identical data distribution for federated visual classification. Retrieved from https:\/\/arXiv:1909.06335."},{"key":"e_1_3_2_32_2","unstructured":"Tzu-Ming Harry Hsu Hang Qi and Matthew Brown. 2020. Federated visual classification with real-world data distribution. Retrieved from https:\/\/arXiv:2003.08082."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2019.103291"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0230706"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41591-020-0789-4"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.3301590"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41597-019-0322-0"},{"key":"e_1_3_2_38_2","article-title":"Generalizability of predictive models for intensive care unit patients","author":"Johnson Alistair E. W.","year":"2018","unstructured":"Alistair E. W. Johnson, Tom J. Pollard, and Tristan Naumann. 2018. Generalizability of predictive models for intensive care unit patients. In Proceedings of the Machine Learning for Health (ML4H) Workshop (NeurIPS\u201918). Retrieved from http:\/\/arxiv.org\/abs\/1812.02275.","journal-title":"Proceedings of the Machine Learning for Health (ML4H) Workshop (NeurIPS\u201918)"},{"key":"e_1_3_2_39_2","first-page":"5132","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Karimireddy Sai Praneeth","year":"2020","unstructured":"Sai Praneeth Karimireddy, Satyen Kale, Mehryar Mohri, Sashank Reddi, Sebastian Stich, and Ananda Theertha Suresh. 2020. SCAFFOLD: Stochastic controlled averaging for federated learning. In Proceedings of the International Conference on Machine Learning. PMLR, 5132\u20135143."},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1186\/s12911-016-0277-4"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.2196\/medinform.8805"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/3097983.3098118"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1611835114"},{"key":"e_1_3_2_44_2","unstructured":"Jakub Kone\u010dn\u1ef3 H. Brendan McMahan Daniel Ramage and Peter Richt\u00e1rik. 2016. Federated optimization: Distributed machine learning for on-device intelligence. Retrieved from https:\/\/arXiv:1610.02527."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.2196\/medinform.7744"},{"key":"e_1_3_2_46_2","unstructured":"Tian Li Anit Kumar Sahu Manzil Zaheer Maziar Sanjabi Ameet Talwalkar and Virginia Smith. 2018. Federated optimization in heterogeneous networks. Retrieved from https:\/\/arXiv:1812.06127."},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1378\/chest.13-1973"},{"key":"e_1_3_2_48_2","unstructured":"Dianbo Liu Dmitriy Dligach and Timothy Miller. 2019. Two-stage federated phenotyping and patient representation learning. Retrieved from https:\/\/arXiv:1908.05596."},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/CISS48834.2020.1570617414"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1377\/hlthaff.24.5.1214"},{"key":"e_1_3_2_51_2","first-page":"1273","volume-title":"Artificial Intelligence and Statistics","author":"McMahan Brendan","year":"2017","unstructured":"Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Aguera y Arcas. 2017. Communication-efficient learning of deep networks from decentralized data. In Artificial Intelligence and Statistics. PMLR, 1273\u20131282."},{"key":"e_1_3_2_52_2","first-page":"218","volume-title":"Proceedings of the 19th ACM Symposium on Theory of Computing (STOC\u201987)","author":"Micali Silvio","year":"1987","unstructured":"Silvio Micali, Oded Goldreich, and Avi Wigderson. 1987. How to play any mental game. In Proceedings of the 19th ACM Symposium on Theory of Computing (STOC\u201987). ACM, 218\u2013229."},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbx044"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1177\/2054358118776326"},{"key":"e_1_3_2_55_2","unstructured":"Luis Mu\u00f1oz-Gonz\u00e1lez Kenneth T. Co and Emil C. Lupu. 2019. Byzantine-robust federated machine learning through adaptive model averaging. Retrieved from https:\/\/arXiv:1909.05125."},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1136\/jamia.2009.000893"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.21037\/qims-20-595"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2013.30"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1177\/000313481908500731"},{"key":"e_1_3_2_60_2","unstructured":"Stephen R. Pfohl Andrew M. Dai and Katherine Heller. 2019. Federated and differentially private learning for electronic health records. Retrieved from https:\/\/arXiv:1911.05861."},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2018.178"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2018.04.007"},{"key":"e_1_3_2_63_2","unstructured":"Sashank Reddi Zachary Charles Manzil Zaheer Zachary Garrett Keith Rush Jakub Kone\u010dn\u1ef3 Sanjiv Kumar and H. Brendan McMahan. 2020. Adaptive federated optimization. Retrieved from https:\/\/arXiv:2003.00295."},{"key":"e_1_3_2_64_2","article-title":"External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: Opportunities and challenges","volume":"353","author":"Riley Richard D.","year":"2016","unstructured":"Richard D. Riley, Joie Ensor, Kym I. E. Snell, Thomas P. A. Debray, Doug G. Altman, Karel G. M. Moons, and Gary S. Collins. 2016. External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: Opportunities and challenges. bmj 353 (2016).","journal-title":"bmj"},{"issue":"11","key":"e_1_3_2_65_2","first-page":"169","article-title":"On data banks and privacy homomorphisms","volume":"4","author":"Rivest Ronald L.","year":"1978","unstructured":"Ronald L. Rivest, Len Adleman, Michael L. Dertouzos, et\u00a0al. 1978. On data banks and privacy homomorphisms. Found. Secure Comput. 4, 11 (1978), 169\u2013180.","journal-title":"Found. Secure Comput."},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.1055\/s-0041-1732301"},{"key":"e_1_3_2_67_2","unstructured":"Pulkit Sharma Farah E. Shamout and David A. Clifton. 2019. Preserving patient privacy while training a predictive model of in-hospital mortality. Retrieved from https:\/\/arXiv:1912.00354."},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-020-69250-1"},{"key":"e_1_3_2_69_2","first-page":"92","volume-title":"Proceedings of the International MICCAI Brainlesion Workshop","author":"Sheller Micah J.","year":"2018","unstructured":"Micah J. Sheller, G. Anthony Reina, Brandon Edwards, Jason Martin, and Spyridon Bakas. 2018. Multi-institutional deep learning modeling without sharing patient data: A feasibility study on brain tumor segmentation. In Proceedings of the International MICCAI Brainlesion Workshop. Springer, 92\u2013104."},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00287-015-0913-x"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcrc.2005.11.010"},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.1186\/s40697-016-0099-4"},{"key":"e_1_3_2_73_2","unstructured":"Xiaoqing Tan Chung-Chou H. Chang and Lu Tang. 2021. A tree-based federated learning approach for personalized treatment effect estimation from heterogeneous data sources. Retrieved from https:\/\/arXiv:2103.06261."},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiomed.2020.104130"},{"key":"e_1_3_2_75_2","doi-asserted-by":"crossref","unstructured":"Patrick J. Thoral Jan M. Peppink Ronald H. Driessen Eric J. G. Sijbrands Erwin J. O. Kompanje Lewis Kaplan Heatherlee Bailey Jozef Kesecioglu Maurizio Cecconi Matthew Churpek et\u00a0al. 2021. Sharing ICU patient data responsibly under the Society of Critical Care Medicine\/European Society of Intensive Care Medicine joint data science collaboration: The Amsterdam University Medical Centers Database (AmsterdamUMCdb) example. Crit. Care Med. 49 6 (2021) e563.","DOI":"10.1097\/CCM.0000000000004916"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2016.2615052"},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2020.103424"},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.2196\/24207"},{"key":"e_1_3_2_79_2","unstructured":"Praneeth Vepakomma Otkrist Gupta Tristan Swedish and Ramesh Raskar. 2018. Split learning for health: Distributed deep learning without sharing raw patient data. Retrieved from https:\/\/arXiv:1812.00564."},{"key":"e_1_3_2_80_2","first-page":"3152676","article-title":"The EU general data protection regulation (GDPR)","volume":"10","author":"Voigt Paul","year":"2017","unstructured":"Paul Voigt and Axel Von dem Bussche. 2017. The EU general data protection regulation (GDPR). A Practical Guide, vol. 10, 1st ed. Springer, Cham, 3152676.","journal-title":"A Practical Guide"},{"key":"e_1_3_2_81_2","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocu023"},{"issue":"1","key":"e_1_3_2_82_2","first-page":"1","article-title":"Untapped potential of multicenter studies: A review of cardiovascular risk prediction models revealed inappropriate analyses and wide variation in reporting","volume":"3","author":"Wynants L.","year":"2019","unstructured":"L. Wynants, D. M. Kent, D. Timmerman, C. M. Lundquist, and B. Van Calster. 2019. Untapped potential of multicenter studies: A review of cardiovascular risk prediction models revealed inappropriate analyses and wide variation in reporting. Diagnost. Prognost. Res. 3, 1 (2019), 1\u201317.","journal-title":"Diagnost. Prognost. Res."},{"key":"e_1_3_2_83_2","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocy068"},{"key":"e_1_3_2_84_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41666-020-00082-4"},{"key":"e_1_3_2_85_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.6121"},{"key":"e_1_3_2_86_2","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2021.3057653"},{"key":"e_1_3_2_87_2","doi-asserted-by":"publisher","DOI":"10.5555\/1382436.1382751"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3514500","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3514500","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:21Z","timestamp":1750188621000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3514500"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,21]]},"references-count":86,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,10,31]]}},"alternative-id":["10.1145\/3514500"],"URL":"https:\/\/doi.org\/10.1145\/3514500","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,6,21]]},"assertion":[{"value":"2021-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-06-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}