{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T23:02:40Z","timestamp":1781650960844,"version":"3.54.5"},"reference-count":67,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T00:00:00Z","timestamp":1644364800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T00:00:00Z","timestamp":1644364800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001794","name":"The University of Queensland","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001794","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Health Inf Sci Syst"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Type 2 Diabetes (T2D) is a chronic disease characterized by abnormally high blood glucose levels due to insulin resistance and reduced pancreatic insulin production. The challenge of this work is to identify T2D-associated features that can distinguish T2D sub-types for prognosis and treatment purposes. We thus employed machine learning (ML) techniques to categorize T2D patients using data from the Pima Indian Diabetes Dataset from the Kaggle ML repository. After data preprocessing, several feature selection techniques were used to extract feature subsets, and a range of classification techniques were used to analyze these. We then compared the derived classification results to identify the best classifiers by considering accuracy, kappa statistics, area under the receiver operating characteristic (AUROC), sensitivity, specificity, and logarithmic loss (logloss). To evaluate the performance of different classifiers, we investigated their outcomes using the summary statistics with a resampling distribution. Therefore, Generalized Boosted Regression modeling showed the highest accuracy (90.91%), followed by kappa statistics (78.77%) and specificity (85.19%). In addition, Sparse Distance Weighted Discrimination, Generalized Additive Model using LOESS and Boosted Generalized Additive Models also gave the maximum sensitivity (100%), highest AUROC (95.26%) and lowest logarithmic loss (30.98%) respectively. Notably, the Generalized Additive Model using LOESS was the top-ranked algorithm according to non-parametric Friedman testing. Of the features identified by these machine learning models, glucose levels, body mass index, diabetes pedigree function, and age were consistently identified as the best and most frequently accurate outcome predictors. These results indicate the utility of ML methods in constructing improved prediction models for T2D and successfully identified outcome predictors for this Pima Indian population.<\/jats:p>","DOI":"10.1007\/s13755-021-00168-2","type":"journal-article","created":{"date-parts":[[2022,2,9]],"date-time":"2022-02-09T13:08:53Z","timestamp":1644412133000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":63,"title":["Machine learning models for classification and identification of significant attributes to detect type 2 diabetes"],"prefix":"10.1007","volume":"10","author":[{"given":"Koushik Chandra","family":"Howlader","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Md. Shahriare","family":"Satu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Md. Abdul","family":"Awal","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Md. Rabiul","family":"Islam","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sheikh Mohammed Shariful","family":"Islam","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Julian M. W.","family":"Quinn","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0756-1006","authenticated-orcid":false,"given":"Mohammad Ali","family":"Moni","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2022,2,9]]},"reference":[{"key":"168_CR1","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-021-09825-z","author":"AA Abokhzam","year":"2021","unstructured":"Abokhzam AA, Gupta NK, Bose DK. Efficient diabetes mellitus prediction with grid based random forest classifier in association with natural language processing. Int J Speech Technol. 2021. https:\/\/doi.org\/10.1007\/s10772-021-09825-z.","journal-title":"Int J Speech Technol"},{"key":"168_CR2","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1007\/978-981-15-6048-4_20","volume-title":"Advances on Smart and Soft Computing, Advances in Intelligent Systems and Computing","author":"BA Al-Hameli","year":"2021","unstructured":"Al-Hameli BA, Alsewari AA, Alsarem MY. Prediction of diabetes using hidden na\u00efve bayes: comparative stud. In: Saeed F, Al-Hadhrami T, Mohammed F, Mohammed E, editors. Advances on Smart and Soft Computing, Advances in Intelligent Systems and Computing. New York: Springer; 2021. p. 223\u201333. https:\/\/doi.org\/10.1007\/978-981-15-6048-4_20."},{"issue":"7","key":"168_CR3","doi-asserted-by":"publisher","first-page":"8170","DOI":"10.1016\/j.eswa.2010.12.160","volume":"38","author":"A Arauzo-Azofra","year":"2011","unstructured":"Arauzo-Azofra A, Aznarte JL, Ben\u00edtez JM. Empirical study of feature selection methods based on individual feature evaluation for classification problems. Expert Syst Appl. 2011;38(7):8170\u20137.","journal-title":"Expert Syst Appl"},{"key":"168_CR4","unstructured":"Banerjee O, Satyanarayana DKVV. Prediction of diabetes mellitus using ensembled machine learning techniques. Ann Romanian Soc Cell Biol 701\u2013711."},{"issue":"1","key":"168_CR5","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1007\/s00180-009-0165-9","volume":"25","author":"S Bashir","year":"2010","unstructured":"Bashir S, Carter EM. Penalized multinomial mixture logit model. Comput Stat. 2010;25(1):121\u201341. https:\/\/doi.org\/10.1007\/s00180-009-0165-9.","journal-title":"Comput Stat"},{"issue":"4","key":"168_CR6","doi-asserted-by":"publisher","first-page":"74","DOI":"10.3390\/machines7040074","volume":"7","author":"G Battineni","year":"2019","unstructured":"Battineni G, Sagaro GG, Nalini C, Amenta F, Tayebati SK. Comparative machine-learning approach: a follow-up study on type 2 diabetes predictions by cross-validation methods. Machines. 2019;7(4):74. https:\/\/doi.org\/10.3390\/machines7040074.","journal-title":"Machines"},{"key":"168_CR7","doi-asserted-by":"publisher","unstructured":"Benbelkacem S, Atmani B. Random forests for diabetes diagnosis. In: 2019 International Conference on Computer and Information Sciences (ICCIS), pp. 1\u20134. https:\/\/doi.org\/10.1109\/ICCISci.2019.8716405.","DOI":"10.1109\/ICCISci.2019.8716405"},{"key":"168_CR8","unstructured":"Bruce P, Bruce A. Practical statistics for data scientists: 50 essential concepts. O\u2019Reilly Media, Inc.; 2017."},{"key":"168_CR9","doi-asserted-by":"crossref","unstructured":"Chowdhury UN, Hasan MAM, Ahmad S, Islam MB, Quinn JM, Moni MA. Delineating common cell pathways that influence type 2 diabetes and neurodegenerative diseases using a network-based approach. In: 2019 international conference on computer, communication, chemical, materials and electronic engineering (IC4ME2), pp. 1\u20136. IEEE; 2019.","DOI":"10.1109\/IC4ME247184.2019.9036525"},{"key":"168_CR10","doi-asserted-by":"publisher","first-page":"100309","DOI":"10.1016\/j.imu.2020.100309","volume":"19","author":"UN Chowdhury","year":"2020","unstructured":"Chowdhury UN, Islam MB, Ahmad S, Moni MA. Network-based identification of genetic factors in ageing, lifestyle and type 2 diabetes that influence to the progression of alzheimer\u2019s disease. Inform Med Unlocked. 2020;19:100309.","journal-title":"Inform Med Unlocked"},{"issue":"2","key":"168_CR11","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1177\/1932296817706375","volume":"12","author":"A Dagliati","year":"2018","unstructured":"Dagliati A, Marini S, Sacchi L, Cogni G, Teliti M, Tibollo V, De Cata P, Chiovato L, Bellazzi R. Machine learning methods to predict diabetes complications. J Diabetes Sci Technol. 2018;12(2):295\u2013302.","journal-title":"J Diabetes Sci Technol"},{"issue":"1","key":"168_CR12","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1890\/0012-9658(2007)88[243:BTFEMA]2.0.CO;2","volume":"88","author":"G De\u2019Ath","year":"2007","unstructured":"De\u2019Ath G. Boosted trees for ecological modeling and prediction. Ecology. 2007;88(1):243\u201351.","journal-title":"Ecology"},{"issue":"4","key":"168_CR13","doi-asserted-by":"publisher","first-page":"802","DOI":"10.1111\/j.1365-2656.2008.01390.x","volume":"77","author":"J Elith","year":"2008","unstructured":"Elith J, Leathwick JR, Hastie T. A working guide to boosted regression trees. J Anim Ecol. 2008;77(4):802\u201313.","journal-title":"J Anim Ecol"},{"key":"168_CR14","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511790942","volume-title":"Data analysis using regression and multilevel\/hierarchical models","author":"A Gelman","year":"2006","unstructured":"Gelman A, Hill J. Data analysis using regression and multilevel\/hierarchical models. Cambridge: Cambridge University Press; 2006."},{"key":"168_CR15","doi-asserted-by":"publisher","unstructured":"Giri B, Ghosh N.S, Majumdar R, Ghosh A. Predicting diabetes implementing hybrid approach. In: 2020 8th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), pp. 388\u2013391. https:\/\/doi.org\/10.1109\/ICRITO48877.2020.9197971.","DOI":"10.1109\/ICRITO48877.2020.9197971"},{"key":"168_CR16","doi-asserted-by":"crossref","unstructured":"Gupta H, Varshney H, Sharma T.K, Pachauri N, Verma O.P. Comparative performance analysis of quantum machine learning with deep learning for diabetes prediction. https:\/\/doi.org\/10.1007\/s40747-021-00398-7.","DOI":"10.1007\/s40747-021-00398-7"},{"key":"168_CR17","doi-asserted-by":"publisher","first-page":"76516","DOI":"10.1109\/ACCESS.2020.2989857","volume":"8","author":"MK Hasan","year":"2020","unstructured":"Hasan MK, Alam MA, Das D, Hossain E, Hasan M. Diabetes prediction using ensembling of different machine learning classifiers. IEEE Access. 2020;8:76516\u201331. https:\/\/doi.org\/10.1109\/ACCESS.2020.2989857.","journal-title":"IEEE Access"},{"key":"168_CR18","doi-asserted-by":"publisher","DOI":"10.1201\/b18401","volume-title":"Statistical learning with sparsity: the lasso and generalizations","author":"T Hastie","year":"2015","unstructured":"Hastie T, Tibshirani R, Wainwright M. Statistical learning with sparsity: the lasso and generalizations. Boca Raton: Chapman and Hall\/CRC; 2015."},{"issue":"2","key":"168_CR19","doi-asserted-by":"publisher","first-page":"596","DOI":"10.3390\/ijerph17020596","volume":"17","author":"ME Hossain","year":"2020","unstructured":"Hossain ME, Uddin S, Khan A, Moni MA. A framework to understand the progression of cardiovascular disease for type 2 diabetes mellitus patients using a network approach. Int J Environ Res Public Health. 2020;17(2):596.","journal-title":"Int J Environ Res Public Health"},{"key":"168_CR20","doi-asserted-by":"crossref","unstructured":"Islam MR, Kamal ARM, Sultana N, Islam R, Moni MA, et al. Detecting depression using k-nearest neighbors (knn) classification technique. In: 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), pp. 1\u20134. IEEE; 2018.","DOI":"10.1109\/IC4ME2.2018.8465641"},{"issue":"2","key":"168_CR21","doi-asserted-by":"publisher","first-page":"280","DOI":"10.1007\/s13410-020-00875-8","volume":"41","author":"SMS Islam","year":"2021","unstructured":"Islam SMS, Uddin R, Zaman SB, Biswas T, Tansi T, Chegini Z, Moni MA, Niessen L, Naheed A. Healthcare seeking behavior and glycemic control in patients with type 2 diabetes attending a tertiary hospital. Int J Diabetes Dev Countries. 2021;41(2):280\u20137.","journal-title":"Int J Diabetes Dev Countries"},{"key":"168_CR22","doi-asserted-by":"publisher","DOI":"10.1007\/s11831-021-09582-x","author":"L Ismail","year":"2021","unstructured":"Ismail L, Materwala H, Tayefi M, Ngo P, Karduck AP. Type 2 diabetes with artificial intelligence machine learning: methods and evaluation. Arch Comput Methods Eng. 2021. https:\/\/doi.org\/10.1007\/s11831-021-09582-x.","journal-title":"Arch Comput Methods Eng"},{"issue":"1","key":"168_CR23","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1097\/00006842-200201000-00007","volume":"64","author":"CH Johnston-Brooks","year":"2002","unstructured":"Johnston-Brooks CH, Lewis MA, Garg S. Self-efficacy impacts self-care and hba1c in young adults with type I diabetes. Psychosom Med. 2002;64(1):43\u201351.","journal-title":"Psychosom Med"},{"issue":"1","key":"168_CR24","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1016\/j.eswa.2007.06.004","volume":"35","author":"H Kahramanli","year":"2008","unstructured":"Kahramanli H, Allahverdi N. Design of a hybrid system for the diabetes and heart diseases. Expert Syst Appl. 2008;35(1):82\u20139. https:\/\/doi.org\/10.1016\/j.eswa.2007.06.004.","journal-title":"Expert Syst Appl"},{"key":"168_CR25","doi-asserted-by":"publisher","first-page":"104554","DOI":"10.1016\/j.compbiomed.2021.104554","volume":"135","author":"SK Kalagotla","year":"2021","unstructured":"Kalagotla SK, Gangashetty SV, Giridhar K. A novel stacking technique for prediction of diabetes. Comput Biol Med. 2021;135:104554. https:\/\/doi.org\/10.1016\/j.compbiomed.2021.104554.","journal-title":"Comput Biol Med"},{"key":"168_CR26","doi-asserted-by":"publisher","unstructured":"Kour H, Sabharwal M, Suvanov S, Anand D. An assessment of type-2 diabetes risk prediction using machine learning techniques. In: Tiwari S, Suryani E, Ng AK, Mishra KK, Singh N, editors. Proceedings of International Conference on Big Data, Machine Learning and their Applications, Lecture Notes in Networks and Systems, pp. 113\u2013122. Springer. https:\/\/doi.org\/10.1007\/978-981-15-8377-3_10.","DOI":"10.1007\/978-981-15-8377-3_10"},{"key":"168_CR27","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1016\/j.ijcce.2021.01.001","volume":"2","author":"S Kumari","year":"2021","unstructured":"Kumari S, Kumar D, Mittal M. An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier. International Journal of Cognitive Computing in Engineering. 2021;2:40\u20136. https:\/\/doi.org\/10.1016\/j.ijcce.2021.01.001","journal-title":"International Journal of Cognitive Computing in Engineering"},{"issue":"5","key":"168_CR28","first-page":"1267","volume":"5","author":"P Kumbhar","year":"2016","unstructured":"Kumbhar P, Mali M. A survey on feature selection techniques and classification algorithms for efficient text classification. Int J Sci Res. 2016;5(5):1267\u201375.","journal-title":"Int J Sci Res"},{"key":"168_CR29","doi-asserted-by":"crossref","unstructured":"Lu H, Uddin S, Hajati F, Moni MA, Khushi M. A patient network-based machine learning model for disease prediction: the case of type 2 diabetes mellitus. Appl Intell 2021;1\u201312","DOI":"10.1007\/s10489-021-02533-w"},{"issue":"5","key":"168_CR30","doi-asserted-by":"publisher","first-page":"92","DOI":"10.1007\/s10916-018-0940-7","volume":"42","author":"M Maniruzzaman","year":"2018","unstructured":"Maniruzzaman M, Rahman MJ, Al-MehediHasan M, Suri HS, Abedin MM, El-Baz A, Suri JS. Accurate diabetes risk stratification using machine learning: role of missing value and outliers. J Med Syst. 2018;42(5):92.","journal-title":"J Med Syst"},{"key":"168_CR31","doi-asserted-by":"publisher","first-page":"1486","DOI":"10.1109\/ACCESS.2019.2962091","volume":"8","author":"MA Moni","year":"2019","unstructured":"Moni MA, Islam MB, Rahman MR, Rashed-Al-Mahfuz M, Awal MA, Islam SMS, Mollah MNH, Quinn JM. Network-based computational approach to identify delineating common cell pathways influencing type 2 diabetes and diseases of bone and joints. IEEE Access. 2019;8:1486\u201397.","journal-title":"IEEE Access"},{"issue":"1","key":"168_CR32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/2043-9113-4-8","volume":"4","author":"MA Moni","year":"2014","unstructured":"Moni MA, Li\u00f2 P. comor: a software for disease comorbidity risk assessment. J Clin Bioinform. 2014;4(1):1\u201311.","journal-title":"J Clin Bioinform"},{"key":"168_CR33","doi-asserted-by":"publisher","first-page":"28","DOI":"10.3389\/fcell.2015.00028","volume":"3","author":"MA Moni","year":"2015","unstructured":"Moni MA, Li\u00f2 P. How to build personalized multi-omics comorbidity profiles. Front Cell Dev Biol. 2015;3:28.","journal-title":"Front Cell Dev Biol"},{"issue":"1","key":"168_CR34","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1007\/s40200-020-00520-5","volume":"19","author":"H Naz","year":"2020","unstructured":"Naz H, Ahuja S. Deep learning approach for diabetes prediction using PIMA indian dataset. J Diab Metab Disord. 2020;19(1):391\u2013403. https:\/\/doi.org\/10.1007\/s40200-020-00520-5.","journal-title":"J Diab Metab Disord"},{"issue":"12","key":"168_CR35","doi-asserted-by":"publisher","first-page":"8102","DOI":"10.1016\/j.eswa.2010.05.078","volume":"37","author":"BM Patil","year":"2010","unstructured":"Patil BM, Joshi RC, Toshniwal D. Hybrid prediction model for type-2 diabetic patients. Expert Syst Appl. 2010;37(12):8102\u20138. https:\/\/doi.org\/10.1016\/j.eswa.2010.05.078.","journal-title":"Expert Syst Appl"},{"key":"168_CR36","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1016\/j.procs.2016.04.016","volume":"82","author":"S Perveen","year":"2016","unstructured":"Perveen S, Shahbaz M, Guergachi A, Keshavjee K. Performance analysis of data mining classification techniques to predict diabetes. Procedia Comput Sci. 2016;82:115\u201321.","journal-title":"Procedia Comput Sci"},{"key":"168_CR37","unstructured":"Pima indians diabetes database. https:\/\/www.kaggle.com\/uciml\/pima-indians-diabetes-database. 2018. Accessed 12 July 2018."},{"key":"168_CR38","doi-asserted-by":"publisher","first-page":"100830","DOI":"10.1016\/j.genrep.2020.100830","volume":"21","author":"NK Podder","year":"2020","unstructured":"Podder NK, Rana HK, Azam MS, Rana MS, Akhtar MR, Rahman MR, Rahman MH, Moni MA. A system biological approach to investigate the genetic profiling and comorbidities of type 2 diabetes. Gene Rep. 2020;21:100830.","journal-title":"Gene Rep"},{"issue":"4","key":"168_CR39","doi-asserted-by":"publisher","first-page":"201","DOI":"10.21917\/ijsc.2011.0031","volume":"1","author":"RP Priyadarsini","year":"2011","unstructured":"Priyadarsini RP, Valarmathi M, Sivakumari S. Gain ratio based feature selection method for privacy preservation. ICTACT J Soft Comput. 2011;1(4):201\u20135.","journal-title":"ICTACT J Soft Comput"},{"key":"168_CR40","doi-asserted-by":"crossref","unstructured":"Rahman MA, Shoaib S, Al Amin M, Toma RN, Moni MA, Awal MA. A bayesian optimization framework for the prediction of diabetes mellitus. In: 2019 5th International Conference on Advances in Electrical Engineering (ICAEE), pp. 357\u2013362. IEEE; 2019.","DOI":"10.1109\/ICAEE48663.2019.8975480"},{"issue":"3","key":"168_CR41","doi-asserted-by":"publisher","first-page":"1035","DOI":"10.3390\/ijerph17031035","volume":"17","author":"MH Rahman","year":"2020","unstructured":"Rahman MH, Peng S, Hu X, Chen C, Rahman MR, Uddin S, Quinn JM, Moni MA. A network-based bioinformatics approach to identify molecular biomarkers for type 2 diabetes that are linked to the progression of neurological diseases. Int J Environ Res Public Health. 2020;17(3):1035.","journal-title":"Int J Environ Res Public Health"},{"key":"168_CR42","doi-asserted-by":"publisher","first-page":"431","DOI":"10.1016\/j.compbiolchem.2018.12.011","volume":"78","author":"MR Rahman","year":"2019","unstructured":"Rahman MR, Islam T, Turanli B, Zaman T, Faruquee HM, Rahman MM, Mollah MNH, Nanda RK, Arga KY, Gov E, et al. Network-based approach to identify molecular signatures and therapeutic agents in Alzheimer\u2019s disease. Comput Biol Chem. 2019;78:431\u20139.","journal-title":"Comput Biol Chem"},{"issue":"1","key":"168_CR43","doi-asserted-by":"publisher","first-page":"012135","DOI":"10.1088\/1757-899X\/1116\/1\/012135","volume":"1116","author":"A Ram","year":"2021","unstructured":"Ram A, Vishwakarma H. Diabetes prediction using machine learning and data mining methods. IOP Conf Ser. 2021;1116(1):012135. https:\/\/doi.org\/10.1088\/1757-899X\/1116\/1\/012135.","journal-title":"IOP Conf Ser"},{"issue":"3","key":"168_CR44","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1049\/htl2.12010","volume":"8","author":"J Ramesh","year":"2021","unstructured":"Ramesh J, Aburukba R, Sagahyroon A. A remote healthcare monitoring framework for diabetes prediction using machine learning. Healthc Technol Lett. 2021;8(3):45\u201357. https:\/\/doi.org\/10.1049\/htl2.12010.","journal-title":"Healthc Technol Lett"},{"key":"168_CR45","doi-asserted-by":"publisher","first-page":"100448","DOI":"10.1016\/j.imu.2020.100448","volume":"21","author":"N Sakib","year":"2020","unstructured":"Sakib N, Chowdhury UN, Islam MB, Ahmad S, Moni MA. A systems biology approach to identifying genetic factors affected by aging, lifestyle factors, and type 2 diabetes that influences parkinson\u2019s disease progression. Inform Med Unlocked. 2020;21:100448.","journal-title":"Inform Med Unlocked"},{"key":"168_CR46","doi-asserted-by":"crossref","unstructured":"Sakib N, Chowdhury UN, Islam MB, Huq F, Quinn JM, Moni MA. A systems biology approach to identifying genetic markers that link progression of parkinson\u2019s disease to risk factors related to ageing, lifestyle and type 2 diabetes. In: 2019 International Conference on computer, Communication, chemical, materials and Electronic Engineering (IC4ME2), pp. 1\u20135. IEEE; 2019.","DOI":"10.1109\/IC4ME247184.2019.9036535"},{"key":"168_CR47","doi-asserted-by":"crossref","unstructured":"Satu MS, Ahamed S, Hossain F, Akter T, Farid DM. Mining traffic accident data of n5 national highway in bangladesh employing decision trees. In: 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC), pp. 722\u2013725. IEEE; 2017.","DOI":"10.1109\/R10-HTC.2017.8289059"},{"key":"168_CR48","doi-asserted-by":"crossref","unstructured":"Satu MS, Akter T, Uddin MJ. Performance analysis of classifying localization sites of protein using data mining techniques and artificial neural networks. In: 2017 International Conference on Electrical, Computer and Communication Engineering (ECCE), pp. 860\u2013865. IEEE; 2017.","DOI":"10.1109\/ECACE.2017.7913023"},{"key":"168_CR49","unstructured":"Satu MS, Atik ST, Moni MA. A novel hybrid machine learning model to predict diabetes mellitus. In: Proceedings of International Joint Conference on Computational Intelligence: IJCCI 2019. Springer; 2019."},{"key":"168_CR50","doi-asserted-by":"crossref","unstructured":"Satu MS, Tasnim F, Akter T, Halder S. Exploring significant heart disease factors based on semi supervised learning algorithms. In: 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2), pp. 1\u20134. IEEE; 2018.","DOI":"10.1109\/IC4ME2.2018.8465642"},{"key":"168_CR51","doi-asserted-by":"publisher","unstructured":"Shahriare Satu M, Atik ST, Moni MA. A novel hybrid machine learning model to predict diabetes mellitus. In: Uddin, MS, Bansal JC, editors. Proceedings of International Joint Conference on Computational Intelligence, Algorithms for Intelligent Systems, pp. 453\u2013465. Springer. https:\/\/doi.org\/10.1007\/978-981-15-3607-6_36.","DOI":"10.1007\/978-981-15-3607-6_36"},{"key":"168_CR52","doi-asserted-by":"publisher","first-page":"1578","DOI":"10.1016\/j.procs.2018.05.122","volume":"132","author":"D Sisodia","year":"2018","unstructured":"Sisodia D, Sisodia DS. Prediction of diabetes using classification algorithms. Procedia Comput Sci. 2018;132:1578\u201385.","journal-title":"Procedia Comput Sci"},{"key":"168_CR53","doi-asserted-by":"publisher","unstructured":"Taz NH, Islam A, Mahmud I. A comparative analysis of ensemble based machine learning techniques for diabetes identification. In: 2021 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST), pp. 1\u20136. https:\/\/doi.org\/10.1109\/ICREST51555.2021.9331036.","DOI":"10.1109\/ICREST51555.2021.9331036"},{"issue":"4","key":"168_CR54","doi-asserted-by":"publisher","first-page":"8610","DOI":"10.1016\/j.eswa.2008.10.032","volume":"36","author":"H Temurtas","year":"2009","unstructured":"Temurtas H, Yumusak N, Temurtas F. A comparative study on diabetes disease diagnosis using neural networks. Expert Syst Appl. 2009;36(4):8610\u20135. https:\/\/doi.org\/10.1016\/j.eswa.2008.10.032.","journal-title":"Expert Syst Appl"},{"key":"168_CR55","doi-asserted-by":"publisher","unstructured":"Tripathi G, Kumar R. Early prediction of diabetes mellitus using machine learning. In: 2020 8th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), pp. 1009\u20131014. https:\/\/doi.org\/10.1109\/ICRITO48877.2020.9197832.","DOI":"10.1109\/ICRITO48877.2020.9197832"},{"issue":"4","key":"168_CR56","doi-asserted-by":"publisher","first-page":"961","DOI":"10.1111\/j.1541-0420.2006.00578.x","volume":"62","author":"G Tutz","year":"2006","unstructured":"Tutz G, Binder H. Generalized additive modeling with implicit variable selection by likelihood-based boosting. Biometrics. 2006;62(4):961\u201371.","journal-title":"Biometrics"},{"issue":"1","key":"168_CR57","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12911-019-1004-8","volume":"19","author":"S Uddin","year":"2019","unstructured":"Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inform Decis Making. 2019;19(1):1\u201316.","journal-title":"BMC Med Inform Decis Making"},{"key":"168_CR58","doi-asserted-by":"crossref","unstructured":"Urbanowicz RJ, Meeker M, La Cava W, Olson RS, Moore JH. Relief-based feature selection: introduction and review. J Biomed Inform 2018.","DOI":"10.1016\/j.jbi.2018.07.014"},{"key":"168_CR59","doi-asserted-by":"publisher","unstructured":"Vaishali R, Sasikala R, Ramasubbareddy S, Remya S, Nalluri S. Genetic algorithm based feature selection and MOE fuzzy classification algorithm on pima indians diabetes dataset. In: 2017 International Conference on Computing Networking and Informatics (ICCNI), pp. 1\u20135. https:\/\/doi.org\/10.1109\/ICCNI.2017.8123815.","DOI":"10.1109\/ICCNI.2017.8123815"},{"issue":"1","key":"168_CR60","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1007\/s13721-012-0006-6","volume":"1","author":"J Van Hulse","year":"2012","unstructured":"Van Hulse J, Khoshgoftaar TM, Napolitano A, Wald R. Threshold-based feature selection techniques for high-dimensional bioinformatics data. Netw Model Anal Health Inform Bioinform. 2012;1(1):47\u201361. https:\/\/doi.org\/10.1007\/s13721-012-0006-6.","journal-title":"Netw Model Anal Health Inform Bioinform"},{"issue":"5","key":"168_CR61","doi-asserted-by":"publisher","first-page":"1758","DOI":"10.1016\/j.compeleceng.2013.07.003","volume":"40","author":"KV Varma","year":"2014","unstructured":"Varma KV, Rao AA, Lakshmi TS, Rao PN. A computational intelligence approach for a better diagnosis of diabetic patients. Comput Electr Eng. 2014;40(5):1758\u201365. https:\/\/doi.org\/10.1016\/j.compeleceng.2013.07.003.","journal-title":"Comput Electr Eng"},{"issue":"3","key":"168_CR62","doi-asserted-by":"publisher","first-page":"826","DOI":"10.1080\/10618600.2015.1049700","volume":"25","author":"B Wang","year":"2016","unstructured":"Wang B, Zou H. Sparse distance weighted discrimination. J Comput Graph Stat. 2016;25(3):826\u201338.","journal-title":"J Comput Graph Stat"},{"key":"168_CR63","doi-asserted-by":"publisher","first-page":"102232","DOI":"10.1109\/ACCESS.2019.2929866","volume":"7","author":"Q Wang","year":"2019","unstructured":"Wang Q, Cao W, Guo J, Ren J, Cheng Y, Davis DN. DMP\\_mi: an effective diabetes mellitus classification algorithm on imbalanced data with missing values. IEEE Access. 2019;7:102232\u20138. https:\/\/doi.org\/10.1109\/ACCESS.2019.2929866.","journal-title":"IEEE Access"},{"key":"168_CR64","doi-asserted-by":"publisher","unstructured":"Wei S, Zhao X, Miao C. A comprehensive exploration to the machine learning techniques for diabetes identification. In: 2018 IEEE 4th World Forum on Internet of Things (WF-IoT), pp. 291\u2013295. https:\/\/doi.org\/10.1109\/WF-IoT.2018.8355130.","DOI":"10.1109\/WF-IoT.2018.8355130"},{"key":"168_CR65","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1016\/j.imu.2017.12.006","volume":"10","author":"H Wu","year":"2018","unstructured":"Wu H, Yang S, Huang Z, He J, Wang X. Type 2 diabetes mellitus prediction model based on data mining. Inform Med Unlocked. 2018;10:100\u20137.","journal-title":"Inform Med Unlocked"},{"key":"168_CR66","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1016\/j.compbiolchem.2015.08.010","volume":"59","author":"H Xu","year":"2015","unstructured":"Xu H, Moni MA, Li\u00f2 P. Network regularised cox regression and multiplex network models to predict disease comorbidities and survival of cancer. Comput Biol Chem. 2015;59:15\u201331.","journal-title":"Comput Biol Chem"},{"key":"168_CR67","unstructured":"Zhu J, Rosset S, Tibshirani R, Hastie TJ. 1-norm support vector machines. In: Advances in neural information processing systems, pp. 49\u201356; 2004."}],"container-title":["Health Information Science and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13755-021-00168-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s13755-021-00168-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13755-021-00168-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,30]],"date-time":"2022-11-30T18:27:52Z","timestamp":1669832872000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s13755-021-00168-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,9]]},"references-count":67,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["168"],"URL":"https:\/\/doi.org\/10.1007\/s13755-021-00168-2","relation":{},"ISSN":["2047-2501"],"issn-type":[{"value":"2047-2501","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,9]]},"assertion":[{"value":"27 October 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 October 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 February 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"2"}}