{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T17:46:03Z","timestamp":1778694363826,"version":"3.51.4"},"reference-count":33,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2021,7,6]],"date-time":"2021-07-06T00:00:00Z","timestamp":1625529600000},"content-version":"vor","delay-in-days":186,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100006261","name":"Taif University","doi-asserted-by":"publisher","award":["TURSP-2020\/215"],"award-info":[{"award-number":["TURSP-2020\/215"]}],"id":[{"id":"10.13039\/501100006261","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004386","name":"Universiti Malaya","doi-asserted-by":"publisher","award":["PG035-2016A"],"award-info":[{"award-number":["PG035-2016A"]}],"id":[{"id":"10.13039\/501100004386","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Complexity"],"published-print":{"date-parts":[[2021,1]]},"abstract":"<jats:p>Diabetes is one of the most common metabolic diseases that cause high blood sugar. Early diagnosis of such a condition is challenging due to its complex interdependence on various factors. There is a need to develop critical decision support systems to assist medical practitioners in the diagnosis process. This research proposes developing a predictive model that can achieve a high classification accuracy of type 2 diabetes. The study consisted of two fundamental parts. Firstly, the study investigated handling missing data adopting data imputation, namely, median value imputation, K\u2010nearest neighbor imputation, and iterative imputation. Consequently, the study validated the implications of these imputations using various classification algorithms, i.e., linear, tree\u2010based, and ensemble algorithms, to see how each method affected classification accuracy. Secondly, Artificial Neural Network was employed to model the best performing imputed data, balanced with SMOTETomek ensuring each class is represented fairly. This approach provided the best accuracy of 98% on the test data, outperforming accuracies achieved in prior studies using the same dataset. The dataset used in this study is concerned with gender and population. As a prospect, the study recommends adopting a larger population sample without geographic boundaries. Additionally, as the developed Artificial Neural Network model did not undergo any specific hyperparameter tuning, it would be interesting to explore tuning on top of normalized data to optimize accuracy further.<\/jats:p>","DOI":"10.1155\/2021\/9953314","type":"journal-article","created":{"date-parts":[[2021,7,6]],"date-time":"2021-07-06T17:20:09Z","timestamp":1625592009000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":37,"title":["An Enhanced Machine Learning Framework for Type 2 Diabetes Classification Using Imbalanced Data with Missing Values"],"prefix":"10.1155","volume":"2021","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6022-3172","authenticated-orcid":false,"given":"Kumarmangal","family":"Roy","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5047-1108","authenticated-orcid":false,"given":"Muneer","family":"Ahmad","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5754-424X","authenticated-orcid":false,"given":"Kinza","family":"Waqar","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1218-1836","authenticated-orcid":false,"given":"Kirthanaah","family":"Priyaah","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8610-3451","authenticated-orcid":false,"given":"Jamel","family":"Nebhen","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8194-9354","authenticated-orcid":false,"given":"Sultan S","family":"Alshamrani","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2299-4440","authenticated-orcid":false,"given":"Muhammad Ahsan","family":"Raza","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9549-2540","authenticated-orcid":false,"given":"Ihsan","family":"Ali","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2021,7,6]]},"reference":[{"key":"e_1_2_9_1_2","unstructured":"International Diabetes Federation 2020 IDF SEA Members. 2020 https:\/\/idf.org\/our-network\/regions-members\/south-east-asia\/members\/94-india.html."},{"key":"e_1_2_9_2_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-22389-1_47"},{"key":"e_1_2_9_3_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2010.05.078"},{"key":"e_1_2_9_4_2","doi-asserted-by":"publisher","DOI":"10.2337\/diacare.27.3.813"},{"key":"e_1_2_9_5_2","article-title":"Nursing management of diabetes mellitus: a guide to the pattern approach","volume":"23","author":"Guthrie D. W.","year":"2005","journal-title":"Home Healthcare Nurse"},{"key":"e_1_2_9_6_2","article-title":"Predicting Type 2 diabetes using an electronic nose-based artificial neural network analysis","volume":"15","author":"Mohamed E. I.","year":"2002","journal-title":"Diabetes, Nutrition & Metabolism"},{"key":"e_1_2_9_7_2","unstructured":"Mayo Clinic Staff 2020 Prediabetes\u2014symptoms and causes 2020 https:\/\/www.mayoclinic.org\/diseases-conditions\/prediabetes\/symptoms-causes\/syc-20355278."},{"key":"e_1_2_9_8_2","unstructured":"SharmaN. C. 2019 Government survey found 11.8% prevalence of diabetes in India 2019 https:\/\/www.livemint.com\/science\/health\/government-survey-found-11-8-prevalence-of-diabetes-in-india-11570702665713.html."},{"key":"e_1_2_9_9_2","doi-asserted-by":"publisher","DOI":"10.2337\/diacare.27.5.1047"},{"key":"e_1_2_9_10_2","volume-title":"Global Report on Diabetes","author":"World Health Organization","year":"2016"},{"key":"e_1_2_9_11_2","doi-asserted-by":"crossref","unstructured":"AlJarullahA. A. Decision tree discovery for the diagnosis of type II diabetes Proceedings of the 2011 International Conference on Innovations in Information Technology April 2011 Abu Dhabi UAE 303\u2013307 https:\/\/doi.org\/10.1109\/INNOVATIONS.2011.5893838 2-s2.0-79959965420.","DOI":"10.1109\/INNOVATIONS.2011.5893838"},{"key":"e_1_2_9_12_2","doi-asserted-by":"publisher","DOI":"10.1037\/e308492005-001"},{"key":"e_1_2_9_13_2","doi-asserted-by":"crossref","unstructured":"KomiM. LiJ. ZhaiY. andXianguoZ. Application of data mining methods in diabetes prediction Proceedings of the International Conference on Image Vision and Computing Application June 2017 Chengdu China IEEE 1006\u20131010 https:\/\/doi.org\/10.1109\/ICIVC.2017.7984706 2-s2.0-85029391627.","DOI":"10.1109\/ICIVC.2017.7984706"},{"key":"e_1_2_9_14_2","doi-asserted-by":"publisher","DOI":"10.14569\/ijacsa.2016.070611"},{"key":"e_1_2_9_15_2","doi-asserted-by":"crossref","unstructured":"AlThunayanL. AlSahdiN. andSyedL. Comparative analysis of different classification algorithms for prediction of diabetes disease Proceedings of the Second International Conference on Internet of Things Data and Cloud Computing 2017 New York NY USA 1\u20136 https:\/\/doi.org\/10.1145\/3018896.3036387 2-s2.0-85044674058.","DOI":"10.1145\/3018896.3036387"},{"key":"e_1_2_9_16_2","first-page":"5","article-title":"Performance evaluation of different artificial neural network models in the classification of type 2 diabetes mellitus","volume":"5","author":"Guldogan E.","year":"2020","journal-title":"The Journal of Cognitive Systems"},{"key":"e_1_2_9_17_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.imu.2019.100204"},{"key":"e_1_2_9_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/access.2019.2929866"},{"key":"e_1_2_9_19_2","doi-asserted-by":"crossref","unstructured":"SarwarM. A. KamalN. HamidW. andShahM. A. Prediction of diabetes using machine learning algorithms in healthcare Proceedings of the 24th International Conference on Automation and Computing September 2018 Newcastle Upon Tyne UK IEEE 1\u20136 https:\/\/doi.org\/10.23919\/IConAC.2018.8748992 2-s2.0-85069177343.","DOI":"10.23919\/IConAC.2018.8748992"},{"key":"e_1_2_9_20_2","doi-asserted-by":"crossref","unstructured":"WoldemichaelF. G.andMenariaS. Prediction of diabetes using data mining techniques Proceedings of the International Conference on Trends in Electronics and Informatics (ICOEI) May 2018 Tirunelveli India IEEE 414\u2013418 https:\/\/doi.org\/10.1109\/ICOEI.2018.8553959 2-s2.0-85059982299.","DOI":"10.1109\/ICOEI.2018.8553959"},{"key":"e_1_2_9_21_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2018.05.122"},{"key":"e_1_2_9_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.imu.2017.12.006"},{"key":"e_1_2_9_23_2","doi-asserted-by":"crossref","unstructured":"VaishaliR. SasikalaR. RamasubbareddyS. RemyaS. andNalluriS. Genetic algorithm based feature selection and MOE Fuzzy classification algorithm on Pima Indians Diabetes dataset Proceedings of the International Conference on Computing Networking and Informatics (ICCNI) October 2017 Lagos Nigeria IEEE 1\u20135 https:\/\/doi.org\/10.1109\/ICCNI.2017.8123815 2-s2.0-85047253593.","DOI":"10.1109\/ICCNI.2017.8123815"},{"key":"e_1_2_9_24_2","doi-asserted-by":"crossref","unstructured":"GemanO. ChiuchisanI. andTodereanR. Application of adaptive neuro-fuzzy inference system for diabetes classification and prediction Proceedings of the 2017 E-Health and Bioengineering Conference EHB June 2017 Sinaia Romania IEEE 639\u2013642 https:\/\/doi.org\/10.1109\/EHB.2017.7995505 2-s2.0-85028532475.","DOI":"10.1109\/EHB.2017.7995505"},{"key":"e_1_2_9_25_2","doi-asserted-by":"publisher","DOI":"10.14257\/astl.2017.145.09"},{"key":"e_1_2_9_26_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-21326-7_45"},{"key":"e_1_2_9_27_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2010.05.007"},{"key":"e_1_2_9_28_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2007.06.004"},{"key":"e_1_2_9_29_2","doi-asserted-by":"crossref","unstructured":"HanJ. RodriguzeJ. C. andBeheshtiM. Diabetes data analysis and prediction model discovery using RapidMiner Proceedings of the 2008 Second International Conference on Future Generation Communication and Networking December 2008 Hainan China IEEE 96\u201399 https:\/\/doi.org\/10.1109\/FGCN.2008.226 2-s2.0-62449195440.","DOI":"10.1109\/FGCN.2008.226"},{"key":"e_1_2_9_30_2","unstructured":"UCI Machine Learning 2016 Pima indians diabetes database 2016 https:\/\/www.kaggle.com\/uciml\/pima-indians-diabetes-database."},{"key":"e_1_2_9_31_2","unstructured":"LakshminarayanK. HarpS. A. GoldmanR. P. andSamadT. Imputation of missing data using machine learning techniques KDD August 1996 140\u2013145."},{"key":"e_1_2_9_32_2","first-page":"1625","article-title":"Handling missing values when applying classification models","volume":"8","author":"Saar-Tsechansky M.","year":"2007","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_2_9_33_2","doi-asserted-by":"publisher","DOI":"10.1613\/jair.953"}],"container-title":["Complexity"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2021\/9953314.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2021\/9953314.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/2021\/9953314","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,9]],"date-time":"2024-08-09T23:17:07Z","timestamp":1723245427000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/2021\/9953314"}},"subtitle":[],"editor":[{"given":"M. Irfan","family":"Uddin","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,1]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,1]]}},"alternative-id":["10.1155\/2021\/9953314"],"URL":"https:\/\/doi.org\/10.1155\/2021\/9953314","archive":["Portico"],"relation":{},"ISSN":["1076-2787","1099-0526"],"issn-type":[{"value":"1076-2787","type":"print"},{"value":"1099-0526","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1]]},"assertion":[{"value":"2021-03-19","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-04-12","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-07-06","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"9953314"}}