{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T04:30:09Z","timestamp":1771475409591,"version":"3.50.1"},"reference-count":52,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2025,1,14]],"date-time":"2025-01-14T00:00:00Z","timestamp":1736812800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BDCC"],"abstract":"<jats:p>Federated learning (FL) has emerged as a transformative framework for collaborative learning, offering robust model training across institutions while ensuring data privacy. In the context of making a COVID-19 diagnosis using lung imaging, FL enables institutions to collaboratively train a global model without sharing sensitive patient data. A central manager aggregates local model updates to compute global updates, ensuring secure and effective integration. The global model\u2019s generalization capability is evaluated using centralized testing data before dissemination to participating nodes, where local assessments facilitate personalized adaptations tailored to diverse datasets. Addressing data heterogeneity, a critical challenge in medical imaging, is essential for improving both global performance and local personalization in FL systems. This study emphasizes the importance of recognizing real-world data variability before proposing solutions to tackle non-independent and non-identically distributed (non-IID) data. We investigate the impact of data heterogeneity on FL performance in COVID-19 lung imaging across seven distinct heterogeneity settings. By comprehensively evaluating models using generalization and personalization metrics, we highlight challenges and opportunities for optimizing FL frameworks. 
The findings provide valuable insights that can guide future research toward achieving a balance between global generalization and local adaptation, ultimately enhancing diagnostic accuracy and patient outcomes in COVID-19 lung imaging.<\/jats:p>","DOI":"10.3390\/bdcc9010011","type":"journal-article","created":{"date-parts":[[2025,1,14]],"date-time":"2025-01-14T06:13:07Z","timestamp":1736835187000},"page":"11","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["The Data Heterogeneity Issue Regarding COVID-19 Lung Imaging in Federated Learning: An Experimental Study"],"prefix":"10.3390","volume":"9","author":[{"given":"Fatimah","family":"Alhafiz","sequence":"first","affiliation":[{"name":"Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia"}]},{"given":"Abdullah","family":"Basuhail","sequence":"additional","affiliation":[{"name":"Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia"}]}],"member":"1968","published-online":{"date-parts":[[2025,1,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"314","DOI":"10.1038\/s41569-021-00640-2","article-title":"Potential Long-Term Effects of SARS-CoV-2 Infection on the Pulmonary Vasculature: A Global Perspective","volume":"19","author":"Halawa","year":"2022","journal-title":"Nat. Rev. Cardiol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1038\/s41591-021-01506-3","article-title":"Federated Learning for Predicting Clinical Outcomes in Patients with COVID-19","volume":"27","author":"Dayan","year":"2021","journal-title":"Nat. 
Med."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"108035","DOI":"10.1016\/j.patcog.2021.108035","article-title":"Checklist for Responsible Deep Learning Modeling of Medical Images Based on COVID-19 Detection Studies","volume":"118","author":"Hryniewska","year":"2021","journal-title":"Pattern Recognit."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"691","DOI":"10.3390\/biomedinformatics3030045","article-title":"Deep Learning and Federated Learning for Screening COVID-19: A Review","volume":"3","author":"Mondal","year":"2023","journal-title":"BioMedInformatics"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1038\/s42256-020-0186-1","article-title":"Secure Privacy-Preserving and Federated Machine Learning in Medical Imaging","volume":"2","author":"Kaissis","year":"2020","journal-title":"Nat. Mach. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1007\/s11682-013-9269-5","article-title":"The ENIGMA Consortium: Large-Scale Collaborative Analyses of Neuroimaging and Genetic Data","volume":"8","author":"Thompson","year":"2014","journal-title":"Brain Imaging Behav."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Florescu, L.M., Streba, C.T., \u015eerb\u0103nescu, M.S., M\u0103muleanu, M., Florescu, D.N., Teic\u0103, R.V., Nica, R.E., and Gheonea, I.A. (2022). Federated Learning Approach with Pre-Trained Deep Learning Models for COVID-19 Detection from Unsegmented CT Images. Life, 12.","DOI":"10.3390\/life12070958"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"107330","DOI":"10.1016\/j.asoc.2021.107330","article-title":"Federated Learning for COVID-19 Screening from Chest X-Ray Images","volume":"106","author":"Feki","year":"2020","journal-title":"Appl. 
Soft Comput."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"10257","DOI":"10.1109\/JIOT.2021.3120998","article-title":"Federated Learning for COVID-19 Detection with Generative Adversarial Networks in Edge Cloud Computing","volume":"9","author":"Nguyen","year":"2020","journal-title":"IEEE Internet Things J."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1038\/s42256-021-00337-8","article-title":"End-to-End Privacy-Preserving Deep Learning on Multi-Institutional Medical Imaging","volume":"3","author":"Kaissis","year":"2021","journal-title":"Nat. Mach. Intell."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhou, J., Zhou, L., Wang, D., Xu, X., Li, H., Chu, Y., Han, W., and Gao, X. (2024). Personalized and Privacy-Preserving Federated Heterogeneous Medical Image Analysis with PPPML-HMI. Comput. Biol. Med., 169.","DOI":"10.1016\/j.compbiomed.2023.107861"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1081","DOI":"10.1038\/s42256-021-00421-z","article-title":"Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence","volume":"3","author":"Bai","year":"2021","journal-title":"Nat. Mach. Intell."},{"key":"ref_13","first-page":"3883","article-title":"COVID-19 Classification from X-Ray Images: An Approach to Implement Federated Learning on Decentralized Dataset","volume":"75","author":"Siddique","year":"2023","journal-title":"Comput. Mater. Contin."},{"key":"ref_14","unstructured":"Bhattacharya, A., Gawali, M., Seth, J., and Kulkarni, V. (2022). Application of Federated Learning in Building a Robust COVID-19 Chest X-Ray Classification Model. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1093\/jamia\/ocac188","article-title":"Evaluation of Federated Learning Variations for COVID-19 Diagnosis Using Chest Radiographs from 42 US and European Hospitals","volume":"30","author":"Peng","year":"2023","journal-title":"J. Am. Med. Inform. 
Assoc."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"102139","DOI":"10.1016\/j.compmedimag.2022.102139","article-title":"Blockchain and Homomorphic Encryption Based Privacy-Preserving Model Aggregation for Medical Images","volume":"102","author":"Kumar","year":"2022","journal-title":"Comput. Med. Imaging Graph."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Xu, Y., Ma, L., Yang, F., Chen, Y., Ma, K., Yang, J., Yang, X., Chen, Y., Shu, C., and Fan, Z. (2020). A Collaborative Online AI Engine for CT-Based COVID-19 Diagnosis. medRxiv.","DOI":"10.1101\/2020.05.10.20096073"},{"key":"ref_18","first-page":"378","article-title":"Federated Contrastive Learning for Decentralized Unlabeled Medical Images","volume":"Volume 12903","author":"Dong","year":"2021","journal-title":"Proceedings of the Medical Image Computing and Computer Assisted Intervention\u2013MICCAI 2021, Strasbourg, France, 27 September\u20131 October 2021"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"101992","DOI":"10.1016\/j.media.2021.101992","article-title":"Federated Semi-Supervised Learning for COVID Region Segmentation in Chest CT Using Multi-National Data from China, Italy, Japan R","volume":"70","author":"Yang","year":"2021","journal-title":"Med. Image Anal."},{"key":"ref_20","first-page":"41","article-title":"Experiments of Federated Learning for COVID-19 Chest X-Ray Images","volume":"1423","author":"Yan","year":"2021","journal-title":"Commun. Comput. Inf. Sci."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Duan, M., Liu, D., Ji, X., Liu, R., Liang, L., Chen, X., and Tan, Y. (October, January 30). FedGroup: Efficient Clustered Federated Learning via Decomposed Data-Driven Measure. 
Proceedings of the 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA\/BDCloud\/SocialCom\/SustainCom), New York, NY, USA.","DOI":"10.1109\/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00042"},{"key":"ref_22","unstructured":"Li, X., Jiang, M., Zhang, X., Kamp, M., and Dou, Q. (2021). FedBN: Federated Learning on Non-IID Features via Local Batch Normalization. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1403","DOI":"10.1007\/s10796-021-10144-6","article-title":"FedDPGAN: Federated Differentially Private Generative Adversarial Networks Framework for the Detection of COVID-19 Pneumonia","volume":"23","author":"Zhang","year":"2021","journal-title":"Inf. Syst. Front."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Shyu, C.R., Putra, K.T., Chen, H.C., Tsai, Y.Y., Tozammel Hossain, K.S.M., Jiang, W., and Shae, Z.Y. (2021). A Systematic Review of Federated Learning in the Healthcare Area: From the Perspective of Data Properties and Applications. Appl. Sci., 11.","DOI":"10.3390\/app112311191"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Aich, S., Sinai, N.K., Kumar, S., Ali, M., Choi, Y.R., Joo, M., and Kim, H.C. (2021, January 13\u201316). Protecting Personal Healthcare Record Using Blockchain Federated Learning Technologies. Proceedings of the International Conference on Advanced Communication Technology (ICACT), PyeongChang, Republic of Korea.","DOI":"10.23919\/ICACT53585.2022.9728772"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1038\/s41746-021-00431-6","article-title":"Federated Deep Learning for Detecting COVID-19 Lung Abnormalities in CT: A Privacy-Preserving Multinational Validation Study","volume":"4","author":"Dou","year":"2021","journal-title":"NPJ Digit. 
Med."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Ho, T.T., Tran, K.D., and Huang, Y. (2022). FedSGDCOVID: Federated SGD COVID-19 Detection under Local Differential Privacy Using Chest X-Ray Images and Symptom Information. Sensors, 22.","DOI":"10.3390\/s22103728"},{"key":"ref_28","unstructured":"Lo, S.K., Liu, Y., Lu, Q., Wang, C., Xu, X., Paik, H.-Y., and Zhu, L. (2021). Blockchain-Based Trustworthy Federated Learning Architecture. arXiv."},{"key":"ref_29","first-page":"111","article-title":"Federated Learning in the Cloud for Analysis of Medical Images\u2013Experience with Open Source Frameworks","volume":"Volume 12969","author":"Malawski","year":"2021","journal-title":"Proceedings of the Clinical Image-Based Procedures, Distributed and Collaborative Learning, Artificial Intelligence for Combating COVID-19 and Secure and Privacy-Preserving Machine Learning. (DCL 2021, PPML 2021, LL-COVID19 2021, CLIP 2021), Strasbourg, France, 27 September\u20131 October 2021"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Malik, H., Naeem, A., Naqvi, R.A., and Loh, W.K. (2023). DMFL_Net: A Federated Learning-Based Framework for the Classification of COVID-19 from Multiple Chest Diseases Using X-Rays. Sensors, 23.","DOI":"10.3390\/s23020743"},{"key":"ref_31","unstructured":"Adhikari, R., and Settles, C. (2024). Secure Federated Learning Approaches to Diagnosing COVID-19. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1109\/OJCS.2022.3206407","article-title":"Collaborative Federated Learning for Healthcare: Multi-Modal COVID-19 Diagnosis at the Edge","volume":"3","author":"Qayyum","year":"2022","journal-title":"IEEE Open J. Comput. 
Soc."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"15884","DOI":"10.1109\/JIOT.2021.3056185","article-title":"Dynamic Fusion-Based Federated Learning for COVID-19 Detection","volume":"8","author":"Zhang","year":"2021","journal-title":"IEEE Internet Things J."},{"key":"ref_34","unstructured":"McMahan, B., Moore, E., Ramage, D., Hampson, S., and y Arcas, B.A. (2017, January 20\u201322). Communication-Efficient Learning of Deep Networks from Decentralized Data. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, USA."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"12131","DOI":"10.1109\/TMC.2024.3406554","article-title":"Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data","volume":"23","author":"Zhou","year":"2024","journal-title":"IEEE Trans. Mob. Comput."},{"key":"ref_36","first-page":"289","article-title":"Federated Learning of Models Pretrained on Different Features with Consensus Graphs","volume":"213","author":"Ma","year":"2023","journal-title":"Springer Optim. Its Appl."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Loddo, A., Pili, F., and di Ruberto, C. (2021). Deep Learning for COVID-19 Diagnosis from CT Images. Appl. Sci., 11.","DOI":"10.3390\/app11178227"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"2371","DOI":"10.1002\/int.22777","article-title":"A Comprehensive Review of Federated Learning for COVID-19 Detection","volume":"37","author":"Naz","year":"2022","journal-title":"Int. J. Intell. 
Syst."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"132665","DOI":"10.1109\/ACCESS.2020.3010287","article-title":"Can AI Help in Screening Viral and COVID-19 Pneumonia?","volume":"8","author":"Chowdhury","year":"2020","journal-title":"IEEE Access"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Rahman, T., Khandakar, A., Qiblawey, Y., Tahir, A., Kiranyaz, S., Kashem, S.B.A., Islam, M.T., Al Maadeed, S., Zughaier, S.M., and Khan, M.S. (2021). Exploring the Effect of Image Enhancement Techniques on COVID-19 Detection Using Chest X-Ray Images. Comput. Biol. Med., 132.","DOI":"10.1016\/j.compbiomed.2021.104319"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Vantaggiato, E., Paladini, E., Bougourzi, F., Distante, C., Hadid, A., and Taleb-Ahmed, A. (2021). COVID-19 Recognition Using Ensemble-CNNs in Two New Chest X-Ray Databases. Sensors, 21.","DOI":"10.3390\/s21051742"},{"key":"ref_42","unstructured":"Tahir, A.M., Chowdhury, M.E.H., Khandakar, A., Qiblawey, Y., Khurshid, U., Kiranyaz, S., Ibtehaz, N., Rahman, M.S., Al-Madeed, S., and Mahmud, S. (2025, January 10). COVID-QU-Ex. Kaggle. Available online: https:\/\/www.kaggle.com\/datasets\/anasmohammedtahir\/covidqu."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Umair, M., Khan, M.S., Ahmed, F., Baothman, F., Alqahtani, F., Alian, M., and Ahmad, J. (2021). Detection of COVID-19 Using Transfer Learning and Grad-CAM Visualization on Indigenously Collected X-Ray Dataset. Sensors, 21.","DOI":"10.3390\/s21175813"},{"key":"ref_44","unstructured":"Maftouni, M., Law AC, C., Shen, B., Grado ZJ, K., Zhou, Y., and Yazdi, N.A. (2021, January 22\u201325). A Robust Ensemble-Deep Learning Model for COVID-19 Diagnosis Based on an Integrated CT Scan Images Database. Proceedings of the 2021 IISE Annual Conference, Montreal, QC, Canada."},{"key":"ref_45","unstructured":"Soares, E., Angelov, P., Biaso, S., Froes, M.H., and Abe, D.K. (2020). 
SARS-CoV-2 CT-Scan Dataset: A Large Dataset of Real Patients CT Scans for SARS-CoV-2 Identification. medRxiv."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"852","DOI":"10.21037\/qims-20-595","article-title":"Federated Learning: A Collaborative Effort to Achieve Better Medical Imaging Models for Individual Sites That Have Small Labelled Datasets","volume":"11","author":"Ng","year":"2021","journal-title":"Quant. Imaging Med. Surg."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Hernandez-Cruz, N., Saha, P., Sarker, M.K., and Noble, J.A. (2024). Review of Federated Learning and Machine Learning-Based Methods for Medical Image Analysis. Big Data Cogn. Comput., 8.","DOI":"10.3390\/bdcc8090099"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Li, Q., Diao, Y., Chen, Q., and He, B. (2022, January 9\u201312). Federated Learning on Non-IID Data Silos: An Experimental Study. Proceedings of the 2022 IEEE 38th International Conference on Data Engineering (ICDE), Kuala Lumpur, Malaysia.","DOI":"10.1109\/ICDE53745.2022.00077"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Abdul Salam, M., Taha, S., and Ramadan, M. (2021). COVID-19 Detection Using Federated Machine Learning. PLoS ONE, 16.","DOI":"10.1371\/journal.pone.0252573"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"16301","DOI":"10.1109\/JSEN.2021.3076767","article-title":"Blockchain-Federated-Learning and Deep Learning Models for COVID-19 Detection Using CT Imaging","volume":"21","author":"Kumar","year":"2021","journal-title":"IEEE Sens. J."},{"key":"ref_51","first-page":"102101","article-title":"Challenges in Medical Imaging Analysis with Heterogeneous Datasets","volume":"72","author":"Rao","year":"2021","journal-title":"Med. 
Image Anal."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"1985","DOI":"10.3390\/covid4120140","article-title":"Non-IID Medical Imaging Data on COVID-19 in the Federated Learning Framework: Impact and Directions","volume":"4","author":"Alhafiz","year":"2024","journal-title":"COVID"}],"container-title":["Big Data and Cognitive Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/1\/11\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T10:28:25Z","timestamp":1759919305000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-2289\/9\/1\/11"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,14]]},"references-count":52,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,1]]}},"alternative-id":["bdcc9010011"],"URL":"https:\/\/doi.org\/10.3390\/bdcc9010011","relation":{},"ISSN":["2504-2289"],"issn-type":[{"value":"2504-2289","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,14]]}}}