{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,8]],"date-time":"2025-12-08T16:01:29Z","timestamp":1765209689389,"version":"3.46.0"},"reference-count":45,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2025,12,8]],"date-time":"2025-12-08T00:00:00Z","timestamp":1765152000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Coronary artery disease (CAD) is a leading cause of death worldwide. Unfortunately, due to various reasons, this disease is currently spreading rapidly. Many heart disease sufferers die due to the inaccuracy of diagnostic tools or the delay in seeing a doctor. Therefore, the correct and timely diagnosis of this disease plays an important role in preventing deaths. The method used in this research to diagnose coronary artery disease (CAD) is a combination of the following algorithms: Support Vector Machine, Na\u00efve Bayes, Logistic Regression, Random Forest, K-Nearest Neighbors, and XGBoost with ANOVA feature selection method. The dataset was evaluated using two complementary validation strategies: a hold-out test set and 10-fold cross-validation. In the proposed method, the SHAP algorithm was implemented on the output of the XGBoost model. Using this algorithm makes the output of the model interpretable and brings the model out of black box status. With the hold-out method, the K-Nearest Neighbor model with the features selected by the ANOVA method obtained an accuracy of 93.33%. Then, the Support Vector Machine and XGBoost models obtained the best results with an accuracy of 91.66%. With the 10-fold cross-validation method of the model, XGBoost achieved 86.16% accuracy and 87.41% recall value, which increased the results obtained in both methods compared to the state-of-the-art methods.<\/jats:p>","DOI":"10.3390\/a18120771","type":"journal-article","created":{"date-parts":[[2025,12,8]],"date-time":"2025-12-08T15:30:07Z","timestamp":1765207807000},"page":"771","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["From Black Box to Glass Box: SHAP-Explained XGBoost Model for Coronary Artery Disease Prediction"],"prefix":"10.3390","volume":"18","author":[{"given":"Hanieh","family":"Ahmadian","sequence":"first","affiliation":[{"name":"Department of Computer and Electrical Engineering, Semnan University, Semnan 35131-19111, Iran"}]},{"given":"Samaneh","family":"Emami","sequence":"additional","affiliation":[{"name":"Department of Computer and Electrical Engineering, Semnan University, Semnan 35131-19111, Iran"}]},{"given":"Hamid","family":"Nasiri","sequence":"additional","affiliation":[{"name":"Department of Computer and Electrical Engineering, Semnan University, Semnan 35131-19111, Iran"},{"name":"School of Computing and Communications, Lancaster University, Lancaster LA1 4YW, UK"}]}],"member":"1968","published-online":{"date-parts":[[2025,12,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"684","DOI":"10.14419\/ijet.v7i2.8.10557","article-title":"Heart disease prediction using machine learning techniques: A survey","volume":"7","author":"Ramalingam","year":"2018","journal-title":"Int. J. Eng. Technol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"100937","DOI":"10.1016\/j.cegh.2021.100937","article-title":"Cardiovascular disease (CVD) and its associated risk factors among older adults in India: Evidence from LASI Wave 1","volume":"13","author":"Kundu","year":"2022","journal-title":"Clin. Epidemiol. Glob. Health"},{"key":"ref_3","unstructured":"WHO (2024, December 06). Cardiovascular Diseases (CVDs). Available online: https:\/\/www.who.int\/news-room\/fact-sheets\/detail\/cardiovascular-diseases-(cvds)."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1016\/j.inffus.2020.06.008","article-title":"A smart healthcare monitoring system for heart disease prediction based on ensemble deep learning and feature fusion","volume":"63","author":"Ali","year":"2020","journal-title":"Inf. Fusion"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ali, M.M., Paul, B.K., Ahmed, K., Bui, F.M., Quinn, J.M., and Moni, M.A. (2021). Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison. Comput. Biol. Med., 136.","DOI":"10.1016\/j.compbiomed.2021.104672"},{"key":"ref_6","first-page":"47","article-title":"Heart disease prediction using data mining techniques","volume":"13","author":"Anitha","year":"2019","journal-title":"J. Anal. Comput."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1007\/s00521-016-2604-1","article-title":"Performance evaluation of different machine learning techniques for prediction of heart disease","volume":"29","author":"Dwivedi","year":"2018","journal-title":"Neural Comput. Appl."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"5229","DOI":"10.11591\/ijece.v11i6.pp5229-5239","article-title":"Analysis of WEKA data mining algorithms Bayes net, random forest, MLP and SMO for heart disease prediction system: A case study in Iraq","volume":"11","author":"Saleh","year":"2021","journal-title":"Int. J. Electr. Comput. Eng."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Pouriyeh, S., Vahid, S., Sannino, G., De Pietro, G., Arabnia, H., and Gutierrez, J. (2017, January 3\u20136). A comprehensive investigation and comparison of machine learning techniques in the domain of heart disease. Proceedings of the 2017 IEEE Symposium on Computers and Communications (ISCC), Heraklion, Greece.","DOI":"10.1109\/ISCC.2017.8024530"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Khan, A., Khan, A., Khan, M.M., Farid, K., Alam, M.M., and Su\u2019ud, M.B.M. (2022). Cardiovascular and Diabetes Diseases Classification Using Ensemble Stacking Classifiers with SVM as a Meta Classifier. Diagnostics, 12.","DOI":"10.3390\/diagnostics12112595"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Kavitha, M., Gnaneswar, G., Dinesh, R., Sai, Y.R., and Suraj, R.S. (2021, January 20\u201322). Heart disease prediction using hybrid machine learning model. Proceedings of the 2021 6th International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India.","DOI":"10.1109\/ICICT50816.2021.9358597"},{"key":"ref_12","first-page":"36","article-title":"Analysis and Prediction of Heart Disease Using Machine Learning and Data Mining Techniques","volume":"3","author":"Hossain","year":"2021","journal-title":"Can. J. Med."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1007\/s42979-020-00365-y","article-title":"Heart disease prediction using machine learning techniques","volume":"1","author":"Shah","year":"2020","journal-title":"SN Comput. Sci."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"39707","DOI":"10.1109\/ACCESS.2021.3064084","article-title":"Improving the prediction of heart failure patients\u2019 survival using SMOTE and effective data mining techniques","volume":"9","author":"Ishaq","year":"2021","journal-title":"IEEE Access"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1007\/s12065-019-00327-1","article-title":"Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis","volume":"13","author":"Reddy","year":"2020","journal-title":"Evol. Intell."},{"key":"ref_16","first-page":"1093","article-title":"Enhancing the Performance of Heart Disease Prediction from Collecting Cleveland Heart Dataset using Bayesian Network","volume":"26","author":"Musa","year":"2022","journal-title":"J. Appl. Sci. Environ. Manag."},{"key":"ref_17","unstructured":"(2024, October 26). UCI Repository. Available online: https:\/\/archive.ics.uci.edu\/ml\/datasets\/heart+disease."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1023\/A:1022627411411","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Suthaharan, S., and Suthaharan, S. (2016). Support vector machine. Machine Learning Models and Algorithms for Big Data Classification: Thinking with Examples for Effective Learning, Springer.","DOI":"10.1007\/978-1-4899-7641-3"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"5966","DOI":"10.1007\/s11227-019-02894-7","article-title":"Research on SVM environment performance of parallel computing based on large dataset of machine learning","volume":"75","author":"Gong","year":"2019","journal-title":"J. Supercomput."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"17403","DOI":"10.1007\/s11227-022-04568-3","article-title":"An investigation and comparison of machine learning approaches for intrusion detection in IoMT network","volume":"78","author":"Binbusayyis","year":"2022","journal-title":"J. Supercomput."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"16017","DOI":"10.1007\/s11227-023-05299-9","article-title":"An overview of machine learning methods in enabling IoMT-based epileptic seizure detection","volume":"79","year":"2023","journal-title":"J. Supercomput."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.neucom.2019.10.118","article-title":"A comprehensive survey on support vector machine classification: Applications, challenges, and trends","volume":"408","author":"Cervantes","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"2395","DOI":"10.1161\/CIRCULATIONAHA.106.682658","article-title":"Logistic regression","volume":"117","author":"LaValley","year":"2008","journal-title":"Circulation"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"51","DOI":"10.20544\/HORIZONS.B.04.1.17.P05","article-title":"An overview of the supervised machine learning methods","volume":"4","author":"Nasteski","year":"2017","journal-title":"Horizons B"},{"key":"ref_26","unstructured":"Mitchell, T.M. (1997). Machine Learning, McGraw-Hill."},{"key":"ref_27","first-page":"381","article-title":"Machine learning algorithms-a review","volume":"9","author":"Mahesh","year":"2020","journal-title":"Int. J. Sci. Res."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"5198","DOI":"10.1007\/s11227-020-03481-x","article-title":"AI-based smart prediction of clinical disease using random forest classifier and Naive Bayes","volume":"77","author":"Jackins","year":"2021","journal-title":"J. Supercomput."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"19246","DOI":"10.1007\/s11227-022-04631-z","article-title":"Explainable artificial intelligence approach in combating real-time surveillance of COVID19 pandemic from CT scan and X-ray images using ensemble model","volume":"78","author":"Ullah","year":"2022","journal-title":"J. Supercomput."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Prabha, A., Yadav, J., Rani, A., and Singh, V. (2021). Design of intelligent diabetes mellitus detection system using hybrid feature selection based XGBoost classifier. Comput. Biol. Med., 136.","DOI":"10.1016\/j.compbiomed.2021.104664"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"4802","DOI":"10.1007\/s11227-018-2641-x","article-title":"Twitter spam account detection based on clustering and classification methods","volume":"76","author":"Adewole","year":"2020","journal-title":"J. Supercomput."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Farzipour, A., Elmi, R., and Nasiri, H. (2023). Detection of Monkeypox cases based on symptoms using XGBoost and Shapley additive explanations methods. Diagnostics, 13.","DOI":"10.3390\/diagnostics13142391"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Chen, T., and Guestrin, C. (2016, January 13\u201317). Xgboost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA.","DOI":"10.1145\/2939672.2939785"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"108382","DOI":"10.1016\/j.compeleceng.2022.108382","article-title":"Classification of breast tumors based on histopathology images using deep features and ensemble of gradient boosting methods","volume":"103","author":"Abbasniya","year":"2022","journal-title":"Comput. Electr. Eng."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Deng, A., Zhang, H., Wang, W., Zhang, J., Fan, D., Chen, P., and Wang, B. (2020). Developing computational model to predict protein-protein interaction sites based on the XGBoost algorithm. Int. J. Mol. Sci., 21.","DOI":"10.3390\/ijms21072274"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Maleki, A., Raahemi, M., and Nasiri, H. (2023). Breast cancer diagnosis from histopathology images using deep neural network and XGBoost. Biomed. Signal Process. Control, 86.","DOI":"10.1016\/j.bspc.2023.105152"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"8520","DOI":"10.1016\/j.eswa.2015.07.007","article-title":"Feature selection using joint mutual information maximization","volume":"42","author":"Bennasar","year":"2015","journal-title":"Expert Syst. Appl."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1016\/j.procs.2015.06.035","article-title":"Feature selection and classification of microarray data using MapReduce based ANOVA and K-nearest neighbor","volume":"54","author":"Kumar","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1016\/S0169-7439(01)00198-8","article-title":"Pattern recognition of jet fuels: Comprehensive GC\u00d7 GC with ANOVA-based feature selection and principal component analysis","volume":"60","author":"Johnson","year":"2002","journal-title":"Chemom. Intell. Lab. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"110927","DOI":"10.1016\/j.engstruct.2020.110927","article-title":"Failure mode and effects analysis of RC members based on machine-learning-based SHapley Additive exPlanations (SHAP) approach","volume":"219","author":"Mangalathu","year":"2020","journal-title":"Eng. Struct."},{"key":"ref_42","first-page":"4768","article-title":"A unified approach to interpreting model predictions","volume":"30","author":"Lundberg","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"105405","DOI":"10.1016\/j.aap.2019.105405","article-title":"Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis","volume":"136","author":"Parsa","year":"2020","journal-title":"Accid. Anal. Prev."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"715","DOI":"10.1080\/02726351.2022.2135470","article-title":"Modeling operational cement rotary kiln variables with explainable artificial intelligence methods\u2014A \u201cconscious lab\u201d development","volume":"41","author":"Fatahi","year":"2023","journal-title":"Part. Sci. Technol."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Das, S., Sultana, M., Bhattacharya, S., Sengupta, D., and De, D. (2023). XAI\u2013reduct: Accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI. J. Supercomput., 1\u201331.","DOI":"10.1007\/s11227-023-05356-3"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/12\/771\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,8]],"date-time":"2025-12-08T15:39:24Z","timestamp":1765208364000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/12\/771"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,8]]},"references-count":45,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["a18120771"],"URL":"https:\/\/doi.org\/10.3390\/a18120771","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,12,8]]}}}