{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T16:11:29Z","timestamp":1778083889427,"version":"3.51.4"},"reference-count":67,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2025,2,26]],"date-time":"2025-02-26T00:00:00Z","timestamp":1740528000000},"content-version":"vor","delay-in-days":56,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/doi.wiley.com\/10.1002\/tdm_license_1.1"}],"funder":[{"DOI":"10.13039\/100014108","name":"University of Texas Rio Grande Valley","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100014108","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Applied Computational Intelligence and Soft Computing"],"published-print":{"date-parts":[[2025,1]]},"abstract":"<jats:p>While analyzing health data is important for improving health outcomes, class imbalance in datasets poses major challenges to machine learning classification models. This work, therefore, considers the class imbalance problem in stroke prediction using models such as K\u2010nearest neighbors, support vector machine, logistic regression, random forest, and decision tree. This work balances the stroke dataset, thereby enhancing model performance, through various oversampling strategies: random oversampling (RO), ADASYN, SMOTE, and SMOTE\u2013Tomek. Compared to the results of the imbalanced dataset, all applied oversampling techniques enhanced the correct classification of stroke events by the ML model. 
Among these, RO\u2013SVM with RBF kernel was the best in terms of sensitivity, specificity, G\u2010mean, F1\u2010score, and accuracy values, offering the highest results with respective values of 89.87%, 94.91%, 92.36%, 89.64%, and 89.87%. After applying oversampling techniques, all the machine learning classifications were good enough to classify stroke status, especially for the minority class. This study has highlighted the importance of class imbalance issues in health datasets. Precise detection of instances of minority classes can be enhanced considerably by employing classification models with the implementation of hybrid strategies to effectively solve class imbalance issues, which, in turn, will help improve healthcare outcomes. Further research in integrating more advanced deep learning techniques into other health datasets with imbalances is encouraged to further validate or refine class imbalance approaches, as effective handling of imbalanced classes can substantially promote predictive model performance in the analysis of healthcare.<\/jats:p>","DOI":"10.1155\/acis\/1013769","type":"journal-article","created":{"date-parts":[[2025,2,26]],"date-time":"2025-02-26T13:52:35Z","timestamp":1740577955000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Addressing Class Imbalance Problem in Health Data Classification: Practical Application From an Oversampling Viewpoint"],"prefix":"10.1155","volume":"2025","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8124-4493","authenticated-orcid":false,"given":"Edmund Fosu","family":"Agyemang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joseph 
Agyapong","family":"Mensah","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eric","family":"Nyarko","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dennis","family":"Arku","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Benedict","family":"Mbeah-Baiden","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Enock","family":"Opoku","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0228-0483","authenticated-orcid":false,"given":"Ezekiel Nii","family":"Noye Nortey","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2025,2,26]]},"reference":[{"key":"e_1_2_13_1_2","doi-asserted-by":"publisher","DOI":"10.30574\/wjarr.2024.21.2.0246"},{"key":"e_1_2_13_2_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0295427"},{"key":"e_1_2_13_3_2","doi-asserted-by":"publisher","DOI":"10.1186\/s13040-024-00366-0"},{"key":"e_1_2_13_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2020.106432"},{"key":"e_1_2_13_5_2","first-page":"444","article-title":"A Review on Imbalanced Data Handling Using Undersampling and Oversampling Technique","volume":"3","author":"Shelke M. 
S.","year":"2017","journal-title":"International Journal of Engineering Research in Current Trends"},{"key":"e_1_2_13_6_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2023.109721"},{"key":"e_1_2_13_7_2","doi-asserted-by":"publisher","DOI":"10.3390\/info14010054"},{"key":"e_1_2_13_8_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.123149"},{"key":"e_1_2_13_9_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.123328"},{"key":"e_1_2_13_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.heliyon.2023.e16807"},{"key":"e_1_2_13_11_2","doi-asserted-by":"publisher","DOI":"10.3389\/fmed.2024.1373244"},{"key":"e_1_2_13_12_2","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-019-0192-5"},{"key":"e_1_2_13_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11764-023-01465-3"},{"key":"e_1_2_13_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.sciaf.2024.e02386"},{"key":"e_1_2_13_15_2","doi-asserted-by":"publisher","DOI":"10.16929\/ajas\/2022.1297.269"},{"key":"e_1_2_13_16_2","doi-asserted-by":"publisher","DOI":"10.3233\/mas-221418"},{"key":"e_1_2_13_17_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.heliyon.2023.e18276"},{"key":"e_1_2_13_18_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2023.110415"},{"key":"e_1_2_13_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-020-04901-z"},{"key":"e_1_2_13_20_2","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-018-0151-6"},{"key":"e_1_2_13_21_2","doi-asserted-by":"publisher","DOI":"10.3390\/jcm11185342"},{"key":"e_1_2_13_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/CSDE50874.2020.9411607"},{"key":"e_1_2_13_23_2","doi-asserted-by":"publisher","DOI":"10.3390\/diagnostics12092115"},{"key":"e_1_2_13_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2021.3102399"},{"key":"e_1_2_13_25_2","doi-asserted-by":"publisher","DOI":"10.14569\/ijacsa.2024.0150431"},{"key":"e_1_2_13_26_2","doi-asserted-by":"publisher","DOI":"10.1155\/2022\/3264367"},{"key":"e_1_2_13_27_2",
"doi-asserted-by":"publisher","DOI":"10.3390\/s22239311"},{"key":"e_1_2_13_28_2","doi-asserted-by":"publisher","DOI":"10.3390\/electronics13040686"},{"key":"e_1_2_13_29_2","doi-asserted-by":"publisher","DOI":"10.3390\/s22134670"},{"key":"e_1_2_13_30_2","article-title":"Improving Stroke Prediction Accuracy Through Machine Learning and Synthetic Minority Over-Sampling","volume":"7","author":"Abdullah Aish M.","year":"2024","journal-title":"Journal of Computing and Biomedical Informatics"},{"key":"e_1_2_13_31_2","doi-asserted-by":"publisher","DOI":"10.33640\/2405-609x.3355"},{"key":"e_1_2_13_32_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.health.2022.100032"},{"key":"e_1_2_13_33_2","doi-asserted-by":"publisher","DOI":"10.14569\/ijacsa.2021.0120662"},{"key":"e_1_2_13_34_2","doi-asserted-by":"publisher","DOI":"10.37256\/aie.4120232744"},{"key":"e_1_2_13_35_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2018.08.021"},{"key":"e_1_2_13_36_2","unstructured":"Agyemang E. F., Mensah J. A., Ampomah O.-A., et al. 
Predicting Students\u2019 Academic Performance via Machine Learning Algorithms: An Empirical Review and Practical Application 2024."},{"key":"e_1_2_13_37_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2024.03.033"},{"key":"e_1_2_13_38_2","doi-asserted-by":"publisher","DOI":"10.59170\/stattrans-2024-007"},{"key":"e_1_2_13_39_2","doi-asserted-by":"publisher","DOI":"10.30534\/ijatcse\/2020\/104932020"},{"key":"e_1_2_13_40_2","doi-asserted-by":"publisher","DOI":"10.30630\/joiv.7.1.1069"},{"key":"e_1_2_13_41_2","first-page":"1322","volume-title":"2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence)","author":"He H.","year":"2008"},{"key":"e_1_2_13_42_2","doi-asserted-by":"publisher","DOI":"10.5121\/ijdkp.2013.3402"},{"key":"e_1_2_13_43_2","article-title":"Enhancing Tumor Classification through Machine Learning Algorithms for Breast Cancer Diagnosis","volume":"15","author":"Agbota L.","year":"2024","journal-title":"Computer Engineering and Intelligent Systems"},{"key":"e_1_2_13_44_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2020.3041951"},{"key":"e_1_2_13_45_2","doi-asserted-by":"publisher","DOI":"10.3390\/en13102509"},{"key":"e_1_2_13_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICOIACT.2018.8350792"},{"key":"e_1_2_13_47_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2021.108288"},{"key":"e_1_2_13_48_2","first-page":"1017","article-title":"Oversampling Method for Imbalanced Classification","volume":"34","author":"Zheng Z.","year":"2015","journal-title":"Computing and 
Informatics"},{"key":"e_1_2_13_49_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2021.116221"},{"key":"e_1_2_13_50_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2019.101723"},{"key":"e_1_2_13_51_2","doi-asserted-by":"publisher","DOI":"10.3390\/s22124399"},{"key":"e_1_2_13_52_2","doi-asserted-by":"publisher","DOI":"10.3390\/app12052677"},{"key":"e_1_2_13_53_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.compeleceng.2020.106956"},{"key":"e_1_2_13_54_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11063-020-10364-y"},{"key":"e_1_2_13_55_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-87240-3_16"},{"key":"e_1_2_13_56_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-15-3383-9_15"},{"key":"e_1_2_13_57_2","doi-asserted-by":"publisher","DOI":"10.3390\/s22134963"},{"key":"e_1_2_13_58_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2021.3057693"},{"key":"e_1_2_13_59_2","doi-asserted-by":"publisher","DOI":"10.4038\/icter.v16i1.7260"},{"key":"e_1_2_13_60_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.953"},{"key":"e_1_2_13_61_2","doi-asserted-by":"publisher","DOI":"10.26555\/jiteki.v10i1.28107"},{"key":"e_1_2_13_62_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-01307-2_43"},{"key":"e_1_2_13_63_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-48232-8_39"},{"key":"e_1_2_13_64_2","doi-asserted-by":"publisher","DOI":"10.1145\/1007730.1007735"},{"key":"e_1_2_13_65_2","doi-asserted-by":"publisher","DOI":"10.3233\/mas-221403"},{"key":"e_1_2_13_66_2","first-page":"1","article-title":"Imbalanced-Learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning","volume":"18","author":"Lema\u00eetre G.","year":"2017","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_2_13_67_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2023.106030"}],"container-title":["Applied Computational Intelligence and Soft 
Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/acis\/1013769","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/full-xml\/10.1155\/acis\/1013769","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/acis\/1013769","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T11:43:41Z","timestamp":1772970221000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/acis\/1013769"}},"subtitle":[],"editor":[{"given":"Cheng-Jian","family":"Lin","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2025,1]]},"references-count":67,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1]]}},"alternative-id":["10.1155\/acis\/1013769"],"URL":"https:\/\/doi.org\/10.1155\/acis\/1013769","archive":["Portico"],"relation":{},"ISSN":["1687-9724","1687-9732"],"issn-type":[{"value":"1687-9724","type":"print"},{"value":"1687-9732","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1]]},"assertion":[{"value":"2024-07-11","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-01-13","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-02-26","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"1013769"}}