{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,27]],"date-time":"2026-04-27T10:27:38Z","timestamp":1777285658021,"version":"3.51.4"},"reference-count":67,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2023,11,7]],"date-time":"2023-11-07T00:00:00Z","timestamp":1699315200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>In this systematic review of the literature on using Machine Learning (ML) for credit risk prediction, we raise the need for financial institutions to use Artificial Intelligence (AI) and ML to assess credit risk, analyzing large volumes of information. We posed research questions about algorithms, metrics, results, datasets, variables, and related limitations in predicting credit risk. In addition, we searched renowned databases responding to them and identified 52 relevant studies within the credit industry of microfinance. Challenges and approaches in credit risk prediction using ML models were identified; we had difficulties with the implemented models such as the black box model, the need for explanatory artificial intelligence, the importance of selecting relevant features, addressing multicollinearity, and the problem of the imbalance in the input data. By answering the inquiries, we identified that the Boosted Category is the most researched family of ML models; the most commonly used metrics for evaluation are Area Under Curve (AUC), Accuracy (ACC), Recall, precision measure F1 (F1), and Precision. Research mainly uses public datasets to compare models, and private ones to generate new knowledge when applied to the real world. The most significant limitation identified is the representativeness of reality, and the variables primarily used in the microcredit industry are data related to the Demographic, Operation, and Payment behavior. This study aims to guide developers of credit risk management tools and software towards the existing ability of ML methods, metrics, and techniques used to forecast it, thereby minimizing possible losses due to default and guiding risk appetite.<\/jats:p>","DOI":"10.3390\/data8110169","type":"journal-article","created":{"date-parts":[[2023,11,7]],"date-time":"2023-11-07T11:23:39Z","timestamp":1699356219000},"page":"169","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":60,"title":["Machine Learning for Credit Risk Prediction: A Systematic Literature Review"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8602-2766","authenticated-orcid":false,"given":"Jomark Pablo","family":"Noriega","sequence":"first","affiliation":[{"name":"Departamento Acad\u00e9mico de Ciencia de la Computacion, Universidad Nacional Mayor de San Marcos, Decana de Am\u00e9rica, Lima 15081, Peru"},{"name":"Financiera QAPAQ, Lima 150120, Peru"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5029-2561","authenticated-orcid":false,"given":"Luis Antonio","family":"Rivera","sequence":"additional","affiliation":[{"name":"Departamento Acad\u00e9mico de Ciencia de la Computacion, Universidad Nacional Mayor de San Marcos, Decana de Am\u00e9rica, Lima 15081, Peru"},{"name":"Centro de Ci\u00eancias Exatas e Tecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes 28013-602, Brazil"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8207-9714","authenticated-orcid":false,"given":"Jos\u00e9 Alfredo","family":"Herrera","sequence":"additional","affiliation":[{"name":"Departamento Acad\u00e9mico de Ciencia de la Computacion, Universidad Nacional Mayor de San Marcos, Decana de Am\u00e9rica, Lima 15081, Peru"},{"name":"Programme in Biotechnology, Engineering and Chemical Technology, Universidad Pablo de Olavide, 41013 Sevilla, Spain"}]}],"member":"1968","published-online":{"date-parts":[[2023,11,7]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Lombardo, G., Pellegrino, M., Adosoglou, G., Cagnoni, S., Pardalos, P.M., and Poggi, A. (2022). Machine Learning for Bankruptcy Prediction in the American Stock Market: Dataset and Benchmarks. Future Internet, 14.","DOI":"10.3390\/fi14080244"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Ziemba, P., Becker, J., Becker, A., Radomska-Zalas, A., Pawluk, M., and Wierzba, D. (2021). Credit decision support based on real set of cash loans using integrated machine learning algorithms. Electronics, 10.","DOI":"10.3390\/electronics10172099"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"111293","DOI":"10.1109\/ACCESS.2021.3103510","article-title":"Finding the next interesting loan for investors on a peer-to-peer lending platform","volume":"9","author":"Liu","year":"2021","journal-title":"IEEE Access"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"113647","DOI":"10.1016\/j.dss.2021.113647","article-title":"A holistic approach to interpretability in financial lending: Models, visualizations, and summary-explanations","volume":"152","author":"Chen","year":"2022","journal-title":"Decis. Support Syst."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Shih, D.H., Wu, T.W., Shih, P.Y., Lu, N.A., and Shih, M.H. (2022). A Framework of Global Credit-Scoring Modeling Using Outlier Detection and Machine Learning in a P2P Lending Platform. Mathematics, 10.","DOI":"10.3390\/math10132282"},{"key":"ref_6","first-page":"1465394","article-title":"Dynamic Prediction of Internet Financial Market Based on Deep Learning","volume":"2022","author":"Zhang","year":"2022","journal-title":"Comput. Intell. Neurosci."},{"key":"ref_7","unstructured":"(2021, December 22). BM Panorama General. Available online: https:\/\/www.bancomundial.org\/es\/topic\/financialsector\/overview."},{"key":"ref_8","unstructured":"Hani, U., Wickramasinghe, A., Kattiyapornpong, U., and Sajib, S. (2022). Annals of Operations Research, Springer."},{"key":"ref_9","first-page":"6159459","article-title":"A Method for Financial System Analysis of Listed Companies Based on Random Forest and Time Series","volume":"2022","author":"Zhang","year":"2022","journal-title":"Mob. Inf. Syst."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Majern\u00edk, M., Daneshjo, N., Malega, P., Dr\u00e1bik, P., and Barilov\u00e1, B. (2022). Sustainable development of the intelligent industry from Industry 4.0 to Industry 5.0. Adv. Sci. Technol. Res. J., 16.","DOI":"10.12913\/22998624\/146420"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"114840","DOI":"10.1016\/j.eswa.2021.114840","article-title":"Big data analytics for default prediction using graph theory","volume":"176","author":"Okay","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_12","first-page":"5346995","article-title":"Risk Assessment of Operator\u2019s Big Data Internet of Things Credit Financial Management Based on Machine Learning","volume":"2022","author":"Bi","year":"2022","journal-title":"Mob. Inf. Syst."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1186\/s40537-019-0206-3","article-title":"Uncertainty in big data analytics: Survey, opportunities, and challenges","volume":"6","author":"Hariri","year":"2019","journal-title":"J. Big Data"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"113155","DOI":"10.1016\/j.eswa.2019.113155","article-title":"Ensemble learning with label proportions for bankruptcy prediction","volume":"146","author":"Chen","year":"2020","journal-title":"Expert Syst. Appl."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"8706285","DOI":"10.1155\/2020\/8706285","article-title":"Improved ML-based technique for credit card scoring in internet financial risk control","volume":"2020","author":"Fan","year":"2020","journal-title":"Complexity"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1016\/j.inffus.2018.07.004","article-title":"Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy prediction","volume":"47","author":"Marques","year":"2019","journal-title":"Inf. Fusion"},{"key":"ref_17","unstructured":"Wang, M., and Yang, H. (2021, January 24\u201327). Research on personal credit risk assessment model based on instance-based transfer learning. Proceedings of the Intelligence Science III: 4th IFIP TC 12 International Conference, ICIS 2020, Durgapur, India. Revised Selected Papers 4."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2492","DOI":"10.1002\/spe.2842","article-title":"Comparative study of support vector machines and random forests machine learning algorithms on credit operation","volume":"51","author":"Teles","year":"2021","journal-title":"Softw. Pract. Exp."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Orlova, E.V. (2020). Decision-making techniques for credit resource management using machine learning and optimization. Information, 11.","DOI":"10.3390\/info11030144"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"42623","DOI":"10.1109\/ACCESS.2022.3168857","article-title":"Business failure prediction based on a cost-sensitive extreme gradient boosting machine","volume":"10","author":"Zou","year":"2022","journal-title":"IEEE Access"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"779799","DOI":"10.3389\/frai.2022.779799","article-title":"Financial risk management and explainable, trustworthy, responsible AI","volume":"5","author":"Hein","year":"2022","journal-title":"Front. Artif. Intell."},{"key":"ref_22","first-page":"9007140","article-title":"Credit Risk Simulation of Enterprise Financial Management Based on Machine Learning Algorithm","volume":"2022","author":"Sun","year":"2022","journal-title":"Mob. Inf. Syst."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"113438","DOI":"10.1016\/j.eswa.2020.113438","article-title":"The application of PROMETHEE multi-criteria decision aid in financial decision making: Case of distress prediction models evaluation","volume":"159","author":"Mousavi","year":"2020","journal-title":"Expert Syst. Appl."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Zhao, L., Yang, S., Wang, S., and Shen, J. (2022). Research on PPP Enterprise Credit Dynamic Prediction Model. Appl. Sci., 12.","DOI":"10.3390\/app122010362"},{"key":"ref_25","first-page":"100037","article-title":"Optimal balancing & efficient feature ranking approach to minimize credit risk","volume":"1","author":"Pandey","year":"2021","journal-title":"Int. J. Inf. Manag. Data Insights"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"105740","DOI":"10.1016\/j.asoc.2019.105740","article-title":"Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring","volume":"84","author":"Abdar","year":"2019","journal-title":"Appl. Soft Comput."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"119390","DOI":"10.1016\/j.eswa.2022.119390","article-title":"Feature-Weighted Counterfactual-Based Explanation for Bankruptcy Prediction","volume":"216","author":"Cho","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1016\/j.eswa.2019.02.033","article-title":"Integration of unsupervised and supervised machine learning algorithms for credit risk assessment","volume":"128","author":"Bao","year":"2019","journal-title":"Expert Syst. Appl."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"118026","DOI":"10.1016\/j.eswa.2022.118026","article-title":"Financial supply chain analysis with borrower identification in smart lending platform","volume":"208","author":"Mitra","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Jemai, J., and Zarrad, A. (2023). Feature Selection Engineering for Credit Risk Assessment in Retail Banking. Information, 14.","DOI":"10.3390\/info14030200"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Chen, S.F., Chakraborty, G., and Li, L.H. (2018, January 12\u201314). Feature selection on credit risk prediction for peer-to-peer lending. Proceedings of the New Frontiers in Artificial Intelligence: JSAI-isAI 2018 Workshops, JURISIN, AI-Biz, SKL, LENLS, IDAA, Yokohama, Japan. Revised Selected Papers.","DOI":"10.1007\/978-3-030-31605-1_1"},{"key":"ref_32","unstructured":"Si, Z., Niu, H., and Wang, W. (2022). Fuzzy Systems and Data Mining VIII, IOS Press."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Mer\u0107ep, A., Mr\u010dela, L., Birov, M., and Kostanj\u010dar, Z. (2020). Deep neural networks for behavioral credit rating. Entropy, 23.","DOI":"10.3390\/e23010027"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1007\/s10614-020-10042-0","article-title":"Explainable machine learning in credit risk management","volume":"57","author":"Bussmann","year":"2021","journal-title":"Comput. Econ."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"113986","DOI":"10.1016\/j.eswa.2020.113986","article-title":"A benchmark of machine learning approaches for credit score prediction","volume":"165","author":"Moscato","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"64873","DOI":"10.1109\/ACCESS.2020.2984412","article-title":"Explainability of a machine learning granting scoring model in peer-to-peer lending","volume":"8","author":"Arroyo","year":"2020","journal-title":"IEEE Access"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"222449","DOI":"10.1109\/ACCESS.2020.3043937","article-title":"A novel GSCI-based ensemble approach for credit scoring","volume":"8","author":"Chen","year":"2020","journal-title":"IEEE Access"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1178","DOI":"10.1016\/j.ejor.2021.06.053","article-title":"Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects","volume":"297","author":"Dumitrescu","year":"2022","journal-title":"Eur. J. Oper. Res."},{"key":"ref_39","first-page":"5986295","article-title":"Research on Efficiency in Credit Risk Prediction Using Logistic-SBM Model","volume":"2022","author":"Li","year":"2022","journal-title":"Wirel. Commun. Mob. Comput."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"8359","DOI":"10.1007\/s00521-018-3963-6","article-title":"Financial credit risk prediction in internet finance driven by machine learning","volume":"31","author":"Ma","year":"2019","journal-title":"Neural Comput. Appl."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"116","DOI":"10.22452\/mjcs.sp2022no1.9","article-title":"Designing a Deep Learning-Based Financial Decision Support System for Fintech to Support Corporate Customer\u2019s Credit Extension","volume":"2022","author":"Karn","year":"2022","journal-title":"Malays. J. Comput. Sci."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1504\/IJITST.2019.102796","article-title":"Financial default payment predictions using a hybrid of simulated annealing heuristics and extreme gradient boosting machines","volume":"9","author":"Zheng","year":"2019","journal-title":"Int. J. Internet Technol. Secur. Trans."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"114020","DOI":"10.1016\/j.eswa.2020.114020","article-title":"Learning latent representations of bank customers with the variational autoencoder","volume":"164","author":"Mancisidor","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"116236","DOI":"10.1016\/j.eswa.2021.116236","article-title":"Multi-classification assessment of bank personal credit risk based on multi-source information fusion","volume":"191","author":"Wang","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"105466","DOI":"10.1016\/j.engappai.2022.105466","article-title":"Predicting and interpreting financial distress using a weighted boosted tree-based tree","volume":"116","author":"Liu","year":"2022","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"105758","DOI":"10.1016\/j.knosys.2020.105758","article-title":"Deep generative models for reject inference in credit scoring","volume":"196","author":"Kampffmeyer","year":"2020","journal-title":"Knowl.-Based Syst."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1607","DOI":"10.1007\/s10614-020-10090-6","article-title":"Using machine learning approach to evaluate the excessive financialization risks of trading enterprises","volume":"59","author":"Wu","year":"2021","journal-title":"Comput. Econ."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"116624","DOI":"10.1016\/j.eswa.2022.116624","article-title":"A two-stage hybrid credit risk prediction model based on XGBoost and graph-based deep neural network","volume":"195","author":"Liu","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_49","unstructured":"Shu, R. (2022). Deep Representations with Learned Constraints, Stanford University."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"103980","DOI":"10.1016\/j.engappai.2020.103980","article-title":"Evolutionary extreme learning machine with novel activation function for credit scoring","volume":"96","author":"Tripathi","year":"2020","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1016\/j.ijinfomgt.2018.12.001","article-title":"Financial crisis prediction model using ant colony optimization-ScienceDirect","volume":"50","author":"Uj","year":"2020","journal-title":"Int. J. Inf. Manag."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"3444317","DOI":"10.1155\/2022\/3444317","article-title":"Bank Green Credit Risk Assessment and Management by Mobile Computing and Machine Learning Neural Network under the Efficient Wireless Communication","volume":"2022","author":"Feng","year":"2022","journal-title":"Wirel. Commun. Mob. Comput."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"4060256","DOI":"10.1155\/2022\/4060256","article-title":"Digital universal financial credit risk analysis using particle swarm optimization algorithm with structure decision tree learning-based evaluation model","volume":"2022","author":"Tian","year":"2022","journal-title":"Wirel. Commun. Mob. Comput."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Chro\u015bcicki, D., and Chlebus, M. (2022). The Advantage of Case-Tailored Information Metrics for the Development of Predictive Models, Calculated Profit in Credit Scoring. Entropy, 24.","DOI":"10.3390\/e24091218"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"105640","DOI":"10.1016\/j.asoc.2019.105640","article-title":"Machine learning models for credit analysis improvements: Predicting low-income families\u2019 default","volume":"83","author":"Barboza","year":"2019","journal-title":"Appl. Soft Comput."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"106963","DOI":"10.1016\/j.knosys.2021.106963","article-title":"How to identify early defaults in online lending: A cost-sensitive multi-layer learning framework","volume":"221","author":"Li","year":"2021","journal-title":"Knowl.-Based Syst."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"119882","DOI":"10.1016\/j.eswa.2023.119882","article-title":"Credit Risk Evaluation Using Clustering Based Fuzzy Classification Method","volume":"223","author":"Kestel","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_58","first-page":"785","article-title":"A data-driven and network-aware approach for credit risk prediction in supply chain finance","volume":"121","author":"Rasouli","year":"2021","journal-title":"Ind. Manag. Data Syst."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"118809","DOI":"10.1016\/j.eswa.2022.118809","article-title":"On the combination of graph data for assessing thin-file borrowers\u2019 creditworthiness","volume":"213","author":"Bravo","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"184","DOI":"10.3390\/forecast4010011","article-title":"A hybrid XGBoost-MLP model for credit risk assessment on digital supply chain finance","volume":"4","author":"Li","year":"2022","journal-title":"Forecasting"},{"key":"ref_61","unstructured":"Haro, B., Ortiz, C., and Armas, J. (2018). Brazilian Technology Symposium, Springer."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"116202","DOI":"10.1016\/j.eswa.2021.116202","article-title":"Financial distress prediction using a corrected feature selection measure and gradient boosted decision tree","volume":"190","author":"Qian","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"201173","DOI":"10.1109\/ACCESS.2020.3033784","article-title":"An investigation of credit card default prediction in the imbalanced datasets","volume":"8","author":"Alam","year":"2020","journal-title":"IEEE Access"},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"84897","DOI":"10.1109\/ACCESS.2019.2924923","article-title":"A MCDM-based evaluation approach for imbalanced classification methods in financial risk prediction","volume":"7","author":"Song","year":"2019","journal-title":"IEEE Access"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Biswas, N., Mondal, A.S., Kusumastuti, A., Saha, S., and Mondal, K.C. (2022). Automated credit assessment framework using ETL process and machine learning. Innov. Syst. Softw. Eng., 1\u201314.","DOI":"10.1007\/s11334-022-00522-x"},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"5565980","DOI":"10.1155\/2021\/5565980","article-title":"Research on supply chain financial risk assessment based on blockchain and fuzzy neural networks","volume":"2021","author":"Wang","year":"2021","journal-title":"Wirel. Commun. Mob. Comput."},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"116889","DOI":"10.1016\/j.eswa.2022.116889","article-title":"Assessing credit risk of commercial customers using hybrid machine learning algorithms","volume":"200","author":"Machado","year":"2022","journal-title":"Expert Syst. Appl."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/8\/11\/169\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:18:49Z","timestamp":1760131129000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/8\/11\/169"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,7]]},"references-count":67,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2023,11]]}},"alternative-id":["data8110169"],"URL":"https:\/\/doi.org\/10.3390\/data8110169","relation":{},"ISSN":["2306-5729"],"issn-type":[{"value":"2306-5729","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,11,7]]}}}