{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T16:57:21Z","timestamp":1776445041786,"version":"3.51.2"},"reference-count":36,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2023,2,6]],"date-time":"2023-02-06T00:00:00Z","timestamp":1675641600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>The diagnosis and prognosis of cardiovascular disease are crucial medical tasks to ensure correct classification, which helps cardiologists provide proper treatment to the patient. Machine learning applications in the medical niche have increased as they can recognize patterns from data. Using machine learning to classify cardiovascular disease occurrence can help diagnosticians reduce misdiagnosis. This research develops a model that can correctly predict cardiovascular diseases to reduce the fatality caused by cardiovascular diseases. This paper proposes a method of k-modes clustering with Huang starting that can improve classification accuracy. Models such as random forest (RF), decision tree classifier (DT), multilayer perceptron (MP), and XGBoost (XGB) are used. GridSearchCV was used to hypertune the parameters of the applied model to optimize the result. The proposed model is applied to a real-world dataset of 70,000 instances from Kaggle. Models were trained on data that were split in 80:20 and achieved accuracy as follows: decision tree: 86.37% (with cross-validation) and 86.53% (without cross-validation), XGBoost: 86.87% (with cross-validation) and 87.02% (without cross-validation), random forest: 87.05% (with cross-validation) and 86.92% (without cross-validation), multilayer perceptron: 87.28% (with cross-validation) and 86.94% (without cross-validation). The proposed models have AUC (area under the curve) values: decision tree: 0.94, XGBoost: 0.95, random forest: 0.95, multilayer perceptron: 0.95. The conclusion drawn from this underlying research is that multilayer perceptron with cross-validation has outperformed all other algorithms in terms of accuracy. It achieved the highest accuracy of 87.28%.<\/jats:p>","DOI":"10.3390\/a16020088","type":"journal-article","created":{"date-parts":[[2023,2,7]],"date-time":"2023-02-07T02:56:08Z","timestamp":1675738568000},"page":"88","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":459,"title":["Effective Heart Disease Prediction Using Machine Learning Techniques"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0423-0159","authenticated-orcid":false,"given":"Chintan M.","family":"Bhatt","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, School of Technology, Pandit Deendayal Energy University, Gandhinagar 382007, India"}]},{"given":"Parth","family":"Patel","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, School of Technology, Pandit Deendayal Energy University, Gandhinagar 382007, India"}]},{"given":"Tarang","family":"Ghetia","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, School of Technology, Pandit Deendayal Energy University, Gandhinagar 382007, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7552-2394","authenticated-orcid":false,"given":"Pier Luigi","family":"Mazzeo","sequence":"additional","affiliation":[{"name":"Institute of Applied Sciences and Intelligent Systems, National Research Council of Italy, 73100 Lecce, Italy"}]}],"member":"1968","published-online":{"date-parts":[[2023,2,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"896","DOI":"10.1016\/j.jhep.2018.05.036","article-title":"Modeling NAFLD disease burden in China, France, Germany, Italy, Japan, Spain, United Kingdom, and United States for the period 2016\u20132030","volume":"69","author":"Estes","year":"2018","journal-title":"J. Hepatol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1186\/s12933-022-01672-9","article-title":"Risk factors for cardiovascular disease in patients with metabolic-associated fatty liver disease: A machine learning approach","volume":"21","author":"Nabrdalik","year":"2022","journal-title":"Cardiovasc. Diabetol."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Murthy, H.S.N., and Meenakshi, M. (2014, January 21\u201322). Dimensionality reduction using neuro-genetic approach for early prediction of coronary heart disease. Proceedings of the International Conference on Circuits, Communication, Control and Computing, Bangalore, India.","DOI":"10.1109\/CIMCA.2014.7057817"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"e56","DOI":"10.1161\/CIR.0000000000000659","article-title":"Heart disease and stroke statistics\u20142019 update: A report from the American heart association","volume":"139","author":"Benjamin","year":"2019","journal-title":"Circulation"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"100655","DOI":"10.1016\/j.imu.2021.100655","article-title":"Early detection of coronary heart disease using ensemble techniques","volume":"26","author":"Shorewala","year":"2021","journal-title":"Inform. Med. Unlocked"},{"key":"ref_6","first-page":"e29","article-title":"Heart disease and stroke statistics\u20142015 update: A report from the American Heart Association","volume":"131","author":"Mozaffarian","year":"2015","journal-title":"Circulation"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Maiga, J., Hungilo, G.G. (2019, January 24\u201325). Comparison of Machine Learning Models in Prediction of Cardiovascular Disease Using Health Record Data. Proceedings of the 2019 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), Jakarta, Indonesia.","DOI":"10.1109\/ICIMCIS48181.2019.8985205"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1539\/joh.15-0326-OP","article-title":"Work stress and cardiovascular disease: A life course perspective","volume":"58","author":"Li","year":"2016","journal-title":"J. Occup. Health"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"962","DOI":"10.1016\/j.procs.2016.05.288","article-title":"Efficient Heart Disease Prediction System","volume":"85","author":"Purushottam","year":"2016","journal-title":"Procedia Comput. Sci."},{"key":"ref_10","first-page":"43","article-title":"Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction","volume":"17","author":"Soni","year":"2011","journal-title":"Int. J. Comput. Appl."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"81542","DOI":"10.1109\/ACCESS.2019.2923707","article-title":"Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques","volume":"7","author":"Mohan","year":"2019","journal-title":"IEEE Access"},{"key":"ref_12","first-page":"1638","article-title":"Predicting the risk of heart disease using advanced machine learning approach","volume":"7","author":"Waigi","year":"2020","journal-title":"Eur. J. Mol. Clin. Med."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Chen, T., and Guestrin, C. (2016, January 13\u201317). XGBoost: A Scalable Tree Boosting System. Proceedings of the KDD \u201916: 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA.","DOI":"10.1145\/2939672.2939785"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1016\/j.cmpb.2013.03.006","article-title":"Performance comparison of accelerometer calibration algorithms based on 3D-ellipsoid fitting methods","volume":"111","author":"Gietzelt","year":"2013","journal-title":"Comput. Methods Programs Biomed."},{"key":"ref_16","first-page":"6","article-title":"Decision Support System for Congenital Heart Disease Diagnosis based on Signs and Symptoms using Neural Networks","volume":"19","author":"K","year":"2011","journal-title":"Int. J. Comput. Appl."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Narin, A., Isler, Y., and Ozer, M. (2016, January 27\u201329). Early prediction of Paroxysmal Atrial Fibrillation using frequency domain measures of heart rate variability. Proceedings of the 2016 Medical Technologies National Congress (TIPTEKNO), Antalya, Turkey.","DOI":"10.1109\/TIPTEKNO.2016.7863110"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1007\/s42979-020-00365-y","article-title":"Heart Disease Prediction using Machine Learning Techniques","volume":"1","author":"Shah","year":"2020","journal-title":"SN Comput. Sci."},{"key":"ref_19","first-page":"261","article-title":"Implementation of Machine Learning Model to Predict Heart Failure Disease","volume":"10","author":"Alotaibi","year":"2019","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1007\/s12553-020-00499-2","article-title":"Comparing different feature selection algorithms for cardiovascular disease prediction","volume":"11","author":"Hasan","year":"2020","journal-title":"Health Technol."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"220","DOI":"10.35741\/issn.0258-2724.56.4.19","article-title":"A proposed paradigm for intelligent heart disease prediction system using data mining techniques","volume":"56","author":"Ouf","year":"2021","journal-title":"J. Southwest Jiaotong Univ."},{"key":"ref_22","first-page":"46","article-title":"Data-Driven Diagnosis of Heart Disease","volume":"176","author":"Khan","year":"2020","journal-title":"Int. J. Comput. Appl."},{"key":"ref_23","unstructured":"(2022, November 01). Kaggle Cardiovascular Disease Dataset. Available online: https:\/\/www.kaggle.com\/datasets\/sulianova\/cardiovascular-disease-dataset."},{"key":"ref_24","unstructured":"Han, J.A., and Kamber, M. (2011). Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers. [3rd ed.]."},{"key":"ref_25","first-page":"674","article-title":"A Comparative Study of Discretization Techniques for Naive Bayes Classifiers","volume":"21","author":"Rivero","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1001\/jamacardio.2018.0022","article-title":"Association of body mass index with lifetime risk of cardiovascular disease and compression of morbidity","volume":"3","author":"Khan","year":"2018","journal-title":"JAMA Cardiol."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1161\/HYPERTENSIONAHA.109.133041","article-title":"Blood Pressure Variables and Cardiovascular Risk","volume":"54","author":"Kengne","year":"2009","journal-title":"Hypertension"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"8714745","DOI":"10.1155\/2016\/8714745","article-title":"Interaction between Mean Arterial Pressure and HbA1c in Prediction of Cardiovascular Disease Hospitalisation: A Population-Based Case-Control Study","volume":"2016","author":"Yu","year":"2016","journal-title":"J. Diabetes Res."},{"key":"ref_29","first-page":"34","article-title":"A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining","volume":"3","author":"Huang","year":"1997","journal-title":"DMKD"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"598","DOI":"10.1007\/s12471-010-0841-y","article-title":"Gender differences in coronary heart disease","volume":"18","author":"Maas","year":"2010","journal-title":"Neth. Heart J."},{"key":"ref_31","unstructured":"Bhunia, P.K., Debnath, A., Mondal, P., D E, M., Ganguly, K., and Rakshit, P. (2021). Heart Disease Prediction using Machine Learning. Int. J. Eng. Res. Technol., 9."},{"key":"ref_32","unstructured":"Mohanty, M.D., and Mohanty, M.N. (2022). Advanced Data Mining Tools and Methods for Social Computing, Academic Press."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Menzies, T., Kocag\u00fcneli, E., Minku, L., Peters, F., and Turhan, B. (2015). Sharing Data and Models in Software Engineering, Morgan Kaufmann.","DOI":"10.1016\/B978-0-12-417295-1.00011-4"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Fayez, M., and Kurnaz, S. (2021). Novel method for diagnosis diseases using advanced high-performance machine learning system. Appl. Nanosci.","DOI":"10.1007\/s13204-021-01990-6"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Hassan, C.A.U., Iqbal, J., Irfan, R., Hussain, S., Algarni, A.D., Bukhari, S.S.H., Alturki, N., and Ullah, S.S. (2022). Effectively Predicting the Presence of Coronary Heart Disease Using Machine Learning Classifiers. Sensors, 22.","DOI":"10.3390\/s22197227"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Subahi, A.F., Khalaf, O.I., Alotaibi, Y., Natarajan, R., Mahadev, N., and Ramesh, T. (2022). Modified Self-Adaptive Bayesian Algorithm for Smart Heart Disease Prediction in IoT System. Sustainability, 14.","DOI":"10.3390\/su142114208"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/2\/88\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:25:51Z","timestamp":1760120751000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/2\/88"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,6]]},"references-count":36,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2023,2]]}},"alternative-id":["a16020088"],"URL":"https:\/\/doi.org\/10.3390\/a16020088","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,6]]}}}