{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T04:38:45Z","timestamp":1777696725808,"version":"3.51.4"},"reference-count":26,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Intelligent Data Analysis: An International Journal"],"published-print":{"date-parts":[[2025,1]]},"abstract":"<jats:p>The emergence of the novel COVID-19 virus has had a profound impact on global healthcare systems and economies, underscoring the imperative need for the development of precise and expeditious diagnostic tools. Machine learning techniques have emerged as a promising avenue for augmenting the capabilities of medical professionals in disease diagnosis and classification. In this research, the EFS-XGBoost classifier model, a robust approach for the classification of patients afflicted with COVID-19, is proposed. The key innovation in the proposed model lies in the Ensemble-based Feature Selection (EFS) strategy, which enables the judicious selection of relevant features from the expansive COVID-19 dataset. Subsequently, the power of the eXtreme Gradient Boosting (XGBoost) classifier to make precise distinctions among COVID-19-infected patients is harnessed. The EFS methodology amalgamates five distinctive feature selection techniques, encompassing correlation-based, chi-squared, information gain, symmetric uncertainty-based, and gain ratio approaches. To evaluate the effectiveness of the model, comprehensive experiments were conducted using a COVID-19 dataset procured from Kaggle, and the implementation was executed using Python programming. 
The performance of the proposed EFS-XGBoost model was gauged by employing well-established metrics that measure classification accuracy, including accuracy, precision, recall, and the F1-Score. Furthermore, an in-depth comparative analysis was conducted by considering the performance of the XGBoost classifier under various scenarios: employing all features within the dataset without any feature selection technique, and utilizing each feature selection technique in isolation. The meticulous evaluation reveals that the proposed EFS-XGBoost model excels in performance, achieving an astounding accuracy rate of 99.8%, surpassing the efficacy of other prevailing feature selection techniques. This research not only advances the field of COVID-19 patient classification but also underscores the potency of ensemble-based feature selection in conjunction with the XGBoost classifier as a formidable tool in the realm of medical diagnosis and classification.<\/jats:p>","DOI":"10.3233\/ida-230854","type":"journal-article","created":{"date-parts":[[2024,5,17]],"date-time":"2024-05-17T13:26:49Z","timestamp":1715952409000},"page":"171-185","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":2,"title":["EFS-XGBoost: A robust framework for precision classification of COVID-19 cases"],"prefix":"10.1177","volume":"29","author":[{"given":"Mustufa Haider","family":"Abidi","sequence":"first","affiliation":[{"name":"Advanced Manufacturing Institute, King Saud University, Riyadh, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Neelu","family":"Khare","sequence":"additional","affiliation":[{"name":"School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, Tamil Nadu, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Preethi","family":"D.","sequence":"additional","affiliation":[{"name":"Department of Computer Science and 
Engineering, Faculty of Engineering and Technology, SRM Institute of Science and Technology, Ramapuram, Chennai, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hisham","family":"Alkhalefah","sequence":"additional","affiliation":[{"name":"Advanced Manufacturing Institute, King Saud University, Riyadh, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Usama","family":"Umer","sequence":"additional","affiliation":[{"name":"Advanced Manufacturing Institute, King Saud University, Riyadh, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2025,3,28]]},"reference":[{"key":"e_1_3_3_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/s13204-021-01868-7"},{"key":"e_1_3_3_3_2","doi-asserted-by":"crossref","first-page":"96088","DOI":"10.1109\/ACCESS.2020.2997663","article-title":"Optimal Scheduling of Flexible Manufacturing System Using Improved Lion-Based Hybrid Machine Learning Approach","volume":"8","author":"Abidi M.H.","year":"2020","unstructured":"Abidi M.H., et\u00a0al., Optimal Scheduling of Flexible Manufacturing System Using Improved Lion-Based Hybrid Machine Learning Approach, IEEE Access 8 (2020), 96088\u201396114.","journal-title":"IEEE Access"},{"issue":"6","key":"e_1_3_3_4_2","doi-asserted-by":"crossref","first-page":"1795","DOI":"10.1007\/s10845-021-01757-4","article-title":"Fuzzy harmony search based optimal control strategy for wireless cyber physical system with industry 4.0","volume":"33","author":"Abidi M.H.","year":"2022","unstructured":"Abidi M.H., Alkhalefah H., Umer U., Fuzzy harmony search based optimal control strategy for wireless cyber physical system with industry 4.0, Journal of Intelligent Manufacturing 33(6) (2022), 1795\u20131812.","journal-title":"Journal of Intelligent Manufacturing"},{"issue":"6","key":"e_1_3_3_5_2","doi-asserted-by":"crossref","first-page":"3387","DOI":"10.3390\/su14063387","article-title":"Predictive 
Maintenance Planning for Industry 4.0 Using Machine Learning for Sustainable Manufacturing","volume":"14","author":"Abidi M.H.","year":"2022","unstructured":"Abidi M.H., Mohammed M.K., Alkhalefah H., Predictive Maintenance Planning for Industry 4.0 Using Machine Learning for Sustainable Manufacturing, Sustainability 14(6) (2022), 3387.","journal-title":"Sustainability"},{"issue":"11","key":"e_1_3_3_6_2","doi-asserted-by":"crossref","first-page":"2008","DOI":"10.3390\/math8112008","article-title":"Automated Maintenance Data Classification Using Recurrent Neural Network: Enhancement by Spotted Hyena-Based Whale Optimization","volume":"8","author":"Abidi M.H.","year":"2020","unstructured":"Abidi M.H., et\u00a0al., Automated Maintenance Data Classification Using Recurrent Neural Network: Enhancement by Spotted Hyena-Based Whale Optimization, Mathematics 8(11) (2020), 2008.","journal-title":"Mathematics"},{"key":"e_1_3_3_7_2","first-page":"8921220","article-title":"Predicting Mortality in Hospitalized COVID-19 Patients in Zambia: An Application of Machine Learning","volume":"2023","author":"Mulenga C.","year":"2023","unstructured":"Mulenga C., et\u00a0al., Predicting Mortality in Hospitalized COVID-19 Patients in Zambia: An Application of Machine Learning, Global Health,\u00a0Epidemiology and Genomics 2023 (2023), 8921220.","journal-title":"Global Health,\u00a0Epidemiology and Genomics"},{"key":"e_1_3_3_8_2","doi-asserted-by":"crossref","first-page":"108055","DOI":"10.1016\/j.compeleceng.2022.108055","article-title":"Machine Learning Approach for Autonomous Detection and Classification of COVID-19 Virus","volume":"101","author":"Shahin O.R.","year":"2022","unstructured":"Shahin O.R., et\u00a0al., Machine Learning Approach for Autonomous Detection and Classification of COVID-19 Virus, Computers and Electrical Engineering 101 (2022), 108055.","journal-title":"Computers and Electrical 
Engineering"},{"issue":"2","key":"e_1_3_3_9_2","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1007\/s42979-022-01583-2","article-title":"Screening of COVID-19 Based on GLCM Features from CT Images Using Machine Learning Classifiers","volume":"4","author":"Godbin A.B.","year":"2023","unstructured":"Godbin A.B., Jasmine S.G., Screening of COVID-19 Based on GLCM Features from CT Images Using Machine Learning Classifiers, SN Computer Science 4(2) (2023), 133.","journal-title":"SN Computer Science"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.32604\/cmc.2021.015655"},{"issue":"4","key":"e_1_3_3_11_2","doi-asserted-by":"crossref","first-page":"72","DOI":"10.4018\/IJeC.2020100106","article-title":"EFS-LSTM (Ensemble-Based Feature Selection With LSTM) Classifier for Intrusion Detection System","volume":"16","author":"Devan P.","year":"2020","unstructured":"Devan P., Khare N., EFS-LSTM (Ensemble-Based Feature Selection With LSTM) Classifier for Intrusion Detection System, International Journal of e-Collaboration 16(4) (2020), 72\u201386.","journal-title":"International Journal of e-Collaboration"},{"issue":"16","key":"e_1_3_3_12_2","doi-asserted-by":"crossref","first-page":"12499","DOI":"10.1007\/s00521-020-04708-x","article-title":"An efficient XGBoost\u2013DNN-based classification model for network intrusion detection system","volume":"32","author":"Devan P.","year":"2020","unstructured":"Devan P., Khare N., An efficient XGBoost\u2013DNN-based classification model for network intrusion detection system, Neural Computing and Applications 32(16) (2020), 12499\u201312514.","journal-title":"Neural Computing and Applications"},{"key":"e_1_3_3_13_2","doi-asserted-by":"crossref","unstructured":"Chen Z. et\u00a0al. 
XGBoost Classifier for DDoS Attack Detection and Analysis in SDN-Based Cloud in IEEE International Conference on Big Data and Smart Computing (BigComp) (2018) Shanghai China.","DOI":"10.1109\/BigComp.2018.00044"},{"key":"e_1_3_3_14_2","doi-asserted-by":"crossref","unstructured":"Kumaran K. et\u00a0al. COVID-19 Detection and Classification using Transfer learning with XGboost in 7th International Conference on Computing Methodologies and Communication (ICCMC) (2023) Erode India.","DOI":"10.1109\/ICCMC56507.2023.10083788"},{"key":"e_1_3_3_15_2","doi-asserted-by":"crossref","first-page":"101188","DOI":"10.1016\/j.imu.2023.101188","article-title":"Machine learning models for predicting severe COVID-19 outcomes in hospitals","volume":"37","author":"Wendland P.","year":"2023","unstructured":"Wendland P., et\u00a0al., Machine learning models for predicting severe COVID-19 outcomes in hospitals, Informatics in Medicine Unlocked 37 (2023), 101188.","journal-title":"Informatics in Medicine Unlocked"},{"issue":"4","key":"e_1_3_3_16_2","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1007\/s00607-022-01057-6","article-title":"Machine learning-based IoT system for COVID-19 epidemics","volume":"105","author":"Arowolo M.O.","year":"2023","unstructured":"Arowolo M.O., et\u00a0al., Machine learning-based IoT system for COVID-19 epidemics, Computing 105(4) (2023), 831\u2013847.","journal-title":"Computing"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11277-023-10466-5"},{"issue":"10","key":"e_1_3_3_18_2","article-title":"A Novel Deep Learning-Based Classification Framework for COVID-19 Assisted with Weighted Average Ensemble Modeling","volume":"13","author":"Chakraborty G.S.","year":"2023","unstructured":"Chakraborty G.S., et\u00a0al., A Novel Deep Learning-Based Classification Framework for COVID-19 Assisted with Weighted Average Ensemble Modeling, Diagnostics (Basel) 13(10) (2023).","journal-title":"Diagnostics 
(Basel)"},{"key":"e_1_3_3_19_2","doi-asserted-by":"crossref","first-page":"4694567","DOI":"10.1155\/2022\/4694567","article-title":"A Novel Framework Based on Deep Learning and ANOVA Feature Selection Method for Diagnosis of COVID-19 Cases from Chest X-Ray Images","volume":"2022","author":"Nasiri H.","year":"2022","unstructured":"Nasiri H., Alavi S.A., A Novel Framework Based on Deep Learning and ANOVA Feature Selection Method for Diagnosis of COVID-19 Cases from Chest X-Ray Images, Computational Intelligence and Neuroscience 2022 (2022), 4694567.","journal-title":"Computational Intelligence and Neuroscience"},{"key":"e_1_3_3_20_2","doi-asserted-by":"crossref","first-page":"9107430","DOI":"10.1155\/2022\/9107430","article-title":"PCA-Based Incremental Extreme Learning Machine (PCA-IELM) for COVID-19 Patient Diagnosis Using Chest X-Ray Images","volume":"2022","author":"Kumar V.","year":"2022","unstructured":"Kumar V., et\u00a0al., PCA-Based Incremental Extreme Learning Machine (PCA-IELM) for COVID-19 Patient Diagnosis Using Chest X-Ray Images, Computational Intelligence and Neuroscience 2022 (2022), 9107430.","journal-title":"Computational Intelligence and Neuroscience"},{"key":"e_1_3_3_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3052835"},{"key":"e_1_3_3_22_2","doi-asserted-by":"crossref","first-page":"179317","DOI":"10.1109\/ACCESS.2020.3028012","article-title":"Novel Feature Selection and Voting Classifier Algorithms for COVID-19 Classification in CT Images","volume":"8","author":"El-Kenawy E.S.M.","year":"2020","unstructured":"El-Kenawy E.S.M., et\u00a0al., Novel Feature Selection and Voting Classifier Algorithms for COVID-19 Classification in CT Images, IEEE Access 8 (2020), 179317\u2013179335.","journal-title":"IEEE Access"},{"issue":"21","key":"e_1_3_3_23_2","doi-asserted-by":"crossref","first-page":"15919","DOI":"10.1109\/JIOT.2021.3050775","article-title":"Realizing an Effective COVID-19 Diagnosis System Based on Machine Learning and IoT 
in Smart Hospital Environment","volume":"8","author":"Abdulkareem K.H.","year":"2021","unstructured":"Abdulkareem K.H., et\u00a0al., Realizing an Effective COVID-19 Diagnosis System Based on Machine Learning and IoT in Smart Hospital Environment, IEEE Internet of Things Journal 8(21) (2021), 15919\u201315928.","journal-title":"IEEE Internet of Things Journal"},{"key":"e_1_3_3_24_2","unstructured":"Han J. Kamber M. Pei J. Data Mining: Concepts and Techniques. 3rd ed. The Morgan Kaufmann Series in Data Management Systems. 2012 USA: Morgan Kaufmann Elsevier. 740."},{"key":"e_1_3_3_25_2","article-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten I.H.","year":"2011","unstructured":"Witten I.H., et\u00a0al., Data Mining: Practical Machine Learning Tools and Techniques, 3rd ed. The Morgan Kaufmann Series in Data Management Systems, Morgan Kaufmann, Elsevier, 2011.","journal-title":"The Morgan Kaufmann Series in Data Management Systems"},{"key":"e_1_3_3_26_2","unstructured":"Kulkarni S. 
Dataset: COVID-19 Dataset for Year 2020 & 2021 https:\/\/data.gov.il\/dataset\/COVID-19\/resource\/d337959a-020a-4ed3-84f7-fca182292308 Accessed on 15th December 2022."},{"issue":"17","key":"e_1_3_3_27_2","doi-asserted-by":"crossref","first-page":"2714","DOI":"10.3390\/electronics11172714","article-title":"Genetic Clustered Federated Learning for COVID-19 Detection","volume":"11","author":"Kandati D.R.","year":"2022","unstructured":"Kandati D.R., Gadekallu T.R., Genetic Clustered Federated Learning for COVID-19 Detection, Electronics 11(17) (2022), 2714.","journal-title":"Electronics"}],"container-title":["Intelligent Data Analysis: An International Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/IDA-230854","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/IDA-230854","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/IDA-230854","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:20:47Z","timestamp":1777454447000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/IDA-230854"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1]]},"references-count":26,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1]]}},"alternative-id":["10.3233\/IDA-230854"],"URL":"https:\/\/doi.org\/10.3233\/ida-230854","relation":{},"ISSN":["1088-467X","1571-4128"],"issn-type":[{"value":"1088-467X","type":"print"},{"value":"1571-4128","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1]]}}}