{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T06:15:22Z","timestamp":1775888122476,"version":"3.50.1"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,11,14]],"date-time":"2022-11-14T00:00:00Z","timestamp":1668384000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,11,14]],"date-time":"2022-11-14T00:00:00Z","timestamp":1668384000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Commercial banks are required to explain the credit evaluation results to their customers. Therefore, banks attempt to improve the performance of their credit scoring models while ensuring the interpretability of the results. However, there is a tradeoff between the logistic regression model and machine learning-based techniques regarding interpretability and model performance because machine learning-based models are a black box. To deal with the tradeoff, in this study, we present a two-stage logistic regression method based on the Bayesian approach. In the first stage, we generate the derivative variables by linearly combining the original features with their explanatory powers based on the Bayesian inference. The second stage involves developing a credit scoring model through logistic regression using these derivative variables. Through this process, the explanatory power of a large number of original features can be utilized for default prediction, and the use of logistic regression maintains the model's interpretability. In the empirical analysis, the independent sample t-test reveals that our proposed approach significantly improves the model\u2019s performance compared to that based on the conventional single-stage approach, i.e., the baseline model. The Kolmogorov\u2013Smirnov statistics show a 3.42 percentage points (%p) increase, and the area under the receiver operating characteristic shows a 2.61%p increase. Given that our two-stage modeling approach has the advantages of interpretability and enhanced performance of the credit scoring model, our proposed method is essential for those in charge of banking who must explain credit evaluation results and find ways to improve the performance of credit scoring models.<\/jats:p>","DOI":"10.1186\/s40537-022-00665-5","type":"journal-article","created":{"date-parts":[[2022,11,14]],"date-time":"2022-11-14T20:16:09Z","timestamp":1668456969000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["Two-stage credit scoring using Bayesian approach"],"prefix":"10.1186","volume":"9","author":[{"given":"Sunghyon","family":"Kyeong","sequence":"first","affiliation":[]},{"given":"Jinho","family":"Shin","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,11,14]]},"reference":[{"key":"665_CR1","doi-asserted-by":"publisher","first-page":"411","DOI":"10.3390\/ijfs3030411","volume":"3","author":"M Khashei","year":"2015","unstructured":"Khashei M, Mirahmadi A. A soft intelligent risk evaluation model for credit scoring classification. Int J Financ Stud. 2015;3:411\u201322.","journal-title":"Int J Financ Stud"},{"key":"665_CR2","doi-asserted-by":"crossref","first-page":"571","DOI":"10.12785\/amis\/070221","volume":"7","author":"K Nurlybayeva","year":"2013","unstructured":"Nurlybayeva K, Balakayeva G. Algorithmic scoring models. Appl Math Sci. 2013;7:571\u201386.","journal-title":"Appl Math Sci"},{"key":"665_CR3","first-page":"84","volume":"27","author":"WS Walusala","year":"2017","unstructured":"Walusala WS, Rimiru DR, Otieno DC. A hybrid machine learning approach for credit scoring using PCA and logistic regression. Int J Comput. 2017;27:84\u2013102.","journal-title":"Int J Comput"},{"key":"665_CR4","doi-asserted-by":"publisher","first-page":"2463","DOI":"10.1016\/j.procs.2010.04.278","volume":"1","author":"G Dong","year":"2010","unstructured":"Dong G, Lai KK, Yen J. Credit scorecard based on logistic regression with random coefficients. Procedia Comput Sci. 2010;1:2463\u20138.","journal-title":"Procedia Comput Sci."},{"key":"665_CR5","unstructured":"Chen C, Lin K, Rudin C, Shaposhnik Y, Wang S, Wang T. An interpretable model with globally consistent explanations for credit risk. Comput Res Repos. 2018;abs\/1811.1. http:\/\/dblp.uni-trier.de\/db\/journals\/corr\/corr1811.html#abs-1811-12615"},{"key":"665_CR6","doi-asserted-by":"publisher","first-page":"1178","DOI":"10.1016\/j.ejor.2021.06.053","volume":"297","author":"E Dumitrescu","year":"2022","unstructured":"Dumitrescu E, Hu\u00e9 S, Hurlin C, Tokpavi S. Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. Eur J Oper Res. 2022;297:1178\u201392.","journal-title":"Eur J Oper Res"},{"key":"665_CR7","doi-asserted-by":"publisher","first-page":"203","DOI":"10.1007\/s10614-020-10042-0","volume":"57","author":"N Bussmann","year":"2021","unstructured":"Bussmann N, Giudici P, Marinelli D, Papenbrock J. Explainable machine learning in credit risk management. Comput Econ. 2021;57:203\u201316. https:\/\/doi.org\/10.1007\/s10614-020-10042-0.","journal-title":"Comput Econ"},{"key":"665_CR8","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1186\/s40537-021-00461-7","volume":"8","author":"M Ala\u2019raj","year":"2021","unstructured":"Ala\u2019raj M, Abbod MF, Majdalawieh M. Modelling customers credit card behaviour using bidirectional LSTM neural networks. J Big Data. 2021;8:69. https:\/\/doi.org\/10.1186\/s40537-021-00461-7.","journal-title":"J Big Data."},{"key":"665_CR9","doi-asserted-by":"publisher","first-page":"151","DOI":"10.1186\/s40537-021-00541-8","volume":"8","author":"I Benchaji","year":"2021","unstructured":"Benchaji I, Douzi S, El Ouahidi B, Jaafari J. Enhanced credit card fraud detection based on attention mechanism and LSTM deep model. J Big Data. 2021;8:151. https:\/\/doi.org\/10.1186\/s40537-021-00541-8.","journal-title":"J Big Data."},{"key":"665_CR10","volume-title":"Pattern recognition and machine learning","author":"CM Bishop","year":"2006","unstructured":"Bishop CM. Pattern recognition and machine learning. New York: Springer; 2006."},{"key":"665_CR11","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1002\/isaf.325","volume":"18","author":"HA Abdou","year":"2011","unstructured":"Abdou HA, Pointon J. Credit scoring, statistical techniques and evaluation criteria: a review of the literature. Intell Syst Acc Financ Manag. 2011;18:59\u201388. https:\/\/doi.org\/10.1002\/isaf.325.","journal-title":"Intell Syst Acc Financ Manag."},{"key":"665_CR12","doi-asserted-by":"publisher","first-page":"292","DOI":"10.1016\/j.ejor.2021.03.006","volume":"295","author":"BR Gunnarsson","year":"2021","unstructured":"Gunnarsson BR, vanden Broucke S, Baesens B, \u00d3skarsd\u00f3ttir M, Lemahieu W. Deep learning for credit scoring: do or don\u2019t? Eur J Oper Res. 2021;295:292\u2013305.","journal-title":"Eur J Oper Res"},{"key":"665_CR13","first-page":"42","volume":"22","author":"I Genriha","year":"2012","unstructured":"Genriha I, Voronova I. Methods for evaluating the creditworthiness of borrowers. RTU Publ House. 2012;22:42\u20139.","journal-title":"RTU Publ House"},{"key":"665_CR14","doi-asserted-by":"crossref","unstructured":"L\u00f6ffler G, Posch PN, Schone C. Bayesian methods for improving credit scoring models. SSRN. 2005;","DOI":"10.2139\/ssrn.742469"},{"key":"665_CR15","doi-asserted-by":"crossref","unstructured":"Chen H, Jiang M, Wang X. Bayesian ensemble assessment for credit scoring. 2017 4th Int Conf Ind Econ Syst Ind Secur Eng. 2017;1\u20135.","DOI":"10.1109\/IEIS.2017.8078596"},{"key":"665_CR16","first-page":"228","volume":"2017","author":"OJ Okesola","year":"2017","unstructured":"Okesola OJ, Okokpujie KO, Adewale AA, John SN, Omoruyi O. An improved bank credit scoring model: a na\u00efve Bayesian approach. Int Conf Comput Sci Comput Intell. 2017;2017:228\u201333.","journal-title":"Int Conf Comput Sci Comput Intell"},{"key":"665_CR17","doi-asserted-by":"publisher","first-page":"433","DOI":"10.6339\/JDS.2013.11(3).1145","volume":"11","author":"L-J Kao","year":"2013","unstructured":"Kao L-J, Lin F, Yu CY. Bayesian behavior scoring model. J Data Sci. 2013;11:433\u201350.","journal-title":"J Data Sci."},{"key":"665_CR18","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1016\/j.eswa.2004.12.031","volume":"28","author":"T-S Lee","year":"2005","unstructured":"Lee T-S, Chen IF. A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines. Expert Syst Appl. 2005;28:743\u201352.","journal-title":"Expert Syst Appl"},{"key":"665_CR19","doi-asserted-by":"publisher","DOI":"10.1002\/9781119769262.ch6","author":"D Tripathi","year":"2021","unstructured":"Tripathi D, Edla DR, Bablani A, Kuppili V. Two-stage credit scoring model based on evolutionary feature selection and ensemble neural networks. Mach Learn Algorithms Appl. 2021. https:\/\/doi.org\/10.1002\/9781119769262.ch6.","journal-title":"Mach Learn Algorithms Appl."},{"key":"665_CR20","doi-asserted-by":"crossref","unstructured":"Munkhdalai L, Lee JY, Ryu KH. A hybrid credit scoring model using neural networks and logistic regression. Adv Intell Inf Hiding Multimed Signal Process Smart Innov Syst Technol. Singapore: Springer; 2019. p. 251\u20138.","DOI":"10.1007\/978-981-13-9714-1_27"},{"key":"665_CR21","doi-asserted-by":"publisher","first-page":"2845","DOI":"10.1093\/rfs\/hhz099","volume":"33","author":"T Berg","year":"2020","unstructured":"Berg T, Burg V, Gombovi\u0107 A, Puri M. On the rise of FinTechs: credit scoring using digital footprints. Rev Financ Stud. 2020;33:2845\u201397. https:\/\/doi.org\/10.1093\/rfs\/hhz099.","journal-title":"Rev Financ Stud."},{"key":"665_CR22","doi-asserted-by":"publisher","first-page":"130","DOI":"10.3390\/su14010130","volume":"14","author":"S Kyeong","year":"2022","unstructured":"Kyeong S, Kim D, Shin J. Can system log data enhance the performance of credit scoring?\u2014Evidence from an internet bank in Korea. Sustainability. 2022;14:130.","journal-title":"Sustainability"},{"key":"665_CR23","doi-asserted-by":"crossref","unstructured":"Hsieh H, Lee T, Lee T. Data mining in building behavioral scoring models. 2010 Int Conf Comput Intell Softw Eng. 2010. p. 1\u20134.","DOI":"10.1109\/CISE.2010.5677005"},{"key":"665_CR24","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/s40537-022-00573-8","volume":"9","author":"E Ileberi","year":"2022","unstructured":"Ileberi E, Sun Y, Wang Z. A machine learning based credit card fraud detection using the GA algorithm for feature selection. J Big Data. 2022;9:24. https:\/\/doi.org\/10.1186\/s40537-022-00573-8.","journal-title":"J Big Data."},{"key":"665_CR25","volume-title":"Credit risk scorecards: developing and implementing intelligent credit scoring","author":"N Siddiqi","year":"2005","unstructured":"Siddiqi N. Credit risk scorecards: developing and implementing intelligent credit scoring. Hoboken: Wiley; 2005."},{"key":"665_CR26","doi-asserted-by":"publisher","DOI":"10.1057\/9780230298989","volume-title":"Credit scoring, response modelling and insurance rating","author":"S Finlay","year":"2010","unstructured":"Finlay S. Credit scoring, response modelling and insurance rating. London: Palgrave Macmillan; 2010."},{"key":"665_CR27","doi-asserted-by":"publisher","first-page":"168","DOI":"10.1016\/j.ejor.2012.04.009","volume":"222","author":"S Akko\u00e7","year":"2012","unstructured":"Akko\u00e7 S. An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: the case of Turkish credit card data. Eur J Oper Res. 2012;222:168\u201378.","journal-title":"Eur J Oper Res"},{"key":"665_CR28","doi-asserted-by":"publisher","first-page":"38","DOI":"10.3390\/risks6020038","volume":"6","author":"PM Addo","year":"2018","unstructured":"Addo PM, Guegan D, Hassani B. Credit risk analysis using machine and deep learning models. Risks. 2018;6:38.","journal-title":"Risks"},{"key":"665_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1504\/IJBIS.2016.078020","volume":"23","author":"M Alborzi","year":"2016","unstructured":"Alborzi M, Khanbabaei M. Using data mining and neural networks techniques to propose a new hybrid customer behaviour analysis and credit scoring model in banking services based on a developed RFM analysis method. Int J Bus Inf Syst. 2016;23:1\u201322. https:\/\/doi.org\/10.1504\/IJBIS.2016.078020.","journal-title":"Int J Bus Inf Syst."},{"key":"665_CR30","doi-asserted-by":"publisher","first-page":"234","DOI":"10.1016\/j.eswa.2018.01.037","volume":"100","author":"J Jurgovsky","year":"2018","unstructured":"Jurgovsky J, Granitzer M, Ziegler K, Calabretto S, Portier P-E, He-Guelton L, et al. Sequence classification for credit-card fraud detection. Expert Syst Appl. 2018;100:234\u201345.","journal-title":"Expert Syst Appl"},{"key":"665_CR31","first-page":"825","volume":"118","author":"N Khare","year":"2018","unstructured":"Khare N, Sait SY. Credit card fraud detection using machine learning models and collating machine learning models. Int J Pure Appl Math. 2018;118:825\u201338.","journal-title":"Int J Pure Appl Math"},{"key":"665_CR32","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1016\/j.procs.2020.01.057","volume":"165","author":"VN Dornadula","year":"2019","unstructured":"Dornadula VN, Geetha S. Credit Card fraud detection using machine learning algorithms. Procedia Comput Sci. 2019;165:631\u201341.","journal-title":"Procedia Comput Sci."},{"key":"665_CR33","doi-asserted-by":"publisher","DOI":"10.1007\/s10479-021-04149-2","author":"M Seera","year":"2021","unstructured":"Seera M, Lim CP, Kumar A, Dhamotharan L, Tan KH. An intelligent payment card fraud detection system. Ann Oper Res. 2021. https:\/\/doi.org\/10.1007\/s10479-021-04149-2.","journal-title":"Ann Oper Res"},{"key":"665_CR34","doi-asserted-by":"publisher","first-page":"99217","DOI":"10.1109\/ACCESS.2019.2930332","volume":"7","author":"S Wei","year":"2019","unstructured":"Wei S, Yang D, Zhang W, Zhang S. A novel noise-adapted two-layer ensemble model for credit scoring based on backflow learning. IEEE Access. 2019;7:99217\u201330.","journal-title":"IEEE Access"},{"key":"665_CR35","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1111\/j.1468-0394.2010.00565.x","volume":"28","author":"C-L Chuang","year":"2011","unstructured":"Chuang C-L, Huang S-T. A hybrid neural network approach for credit scoring. Expert Syst. 2011;28:185\u201396. https:\/\/doi.org\/10.1111\/j.1468-0394.2010.00565.x.","journal-title":"Expert Syst"},{"key":"665_CR36","doi-asserted-by":"publisher","first-page":"1839","DOI":"10.1111\/0022-1082.00077","volume":"53","author":"K Daniel","year":"1998","unstructured":"Daniel K, Hirshleifer D, Subrahmanyam A. Investor psychology and security market under- and overreactions. J Financ. 1998;53:1839\u201385. https:\/\/doi.org\/10.1111\/0022-1082.00077.","journal-title":"J Financ"},{"key":"665_CR37","doi-asserted-by":"crossref","unstructured":"Demajo LM, Vella V, Dingli A. Explainable AI for interpretable credit scoring. 10th Int Conf Artif Intell Soft Comput Appl. London, United Kingdom; 2020. p. 3749. https:\/\/ideas.repec.org\/p\/arx\/papers\/2012.03749.html%5C","DOI":"10.5121\/csit.2020.101516"},{"key":"665_CR38","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1007\/978-981-13-9714-1_27","volume-title":"Adv Intell Inf hiding Multimed signal Process","author":"L Munkhdalai","year":"2020","unstructured":"Munkhdalai L, Lee JY, Ryu KH. A hybrid credit scoring model using neural networks and logistic regression. In: Pan J-S, Li J, Tsai P-W, Jain LC, editors. Adv Intell Inf hiding Multimed signal Process. Singapore: Springer; 2020. p. 251\u20138."},{"key":"665_CR39","doi-asserted-by":"publisher","first-page":"2650","DOI":"10.1016\/j.eswa.2011.08.120","volume":"39","author":"B-W Chi","year":"2012","unstructured":"Chi B-W, Hsu C-C. A hybrid approach to integrate genetic algorithm into dual scoring model in enhancing the performance of credit scoring model. Expert Syst Appl. 2012;39:2650\u201361.","journal-title":"Expert Syst Appl"},{"key":"665_CR40","doi-asserted-by":"publisher","first-page":"397","DOI":"10.3390\/info10120397","volume":"10","author":"B Niu","year":"2019","unstructured":"Niu B, Ren J, Li X. Credit scoring using machine learning by combing social network information: evidence from peer-to-peer lending. Information. 2019;10:397.","journal-title":"Information"},{"key":"665_CR41","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1016\/j.asoc.2018.10.004","volume":"74","author":"M \u00d3skarsd\u00f3ttir","year":"2019","unstructured":"\u00d3skarsd\u00f3ttir M, Bravo C, Sarraute C, Vanthienen J, Baesens B. The value of big data for credit scoring: Enhancing financial inclusion using mobile phone data and social network analytics. Appl Soft Comput. 2019;74:26\u201339.","journal-title":"Appl Soft Comput"},{"key":"665_CR42","doi-asserted-by":"publisher","first-page":"1825","DOI":"10.1111\/joca.12265","volume":"53","author":"AS Modestino","year":"2019","unstructured":"Modestino AS, Sederberg R, Tuller L. Assessing the effectiveness of financial coaching: evidence from the Boston youth credit building initiative. J Consum Aff. 2019;53:1825\u201373. https:\/\/doi.org\/10.1111\/joca.12265.","journal-title":"J Consum Aff"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-022-00665-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-022-00665-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-022-00665-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,16]],"date-time":"2022-11-16T08:44:20Z","timestamp":1668588260000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-022-00665-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,14]]},"references-count":42,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["665"],"URL":"https:\/\/doi.org\/10.1186\/s40537-022-00665-5","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,11,14]]},"assertion":[{"value":"10 January 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 October 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 November 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"106"}}