{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T20:34:34Z","timestamp":1775162074439,"version":"3.50.1"},"reference-count":98,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2025,1,25]],"date-time":"2025-01-25T00:00:00Z","timestamp":1737763200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,1,25]],"date-time":"2025-01-25T00:00:00Z","timestamp":1737763200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Technical University of Kosice"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Artif Intell Rev"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Information about imminent bankruptcy is crucial for financial institutions, decision-making managers, and state agencies. Since bankruptcy prediction is a prevalent research topic, many new methods have been continuously proposed. Bankruptcy prediction is frequently approached as a binary classification task. Since bankruptcy datasets are inherently imbalanced, bankruptcy classification is usually performed using class imbalance learning methods. The nature of these methods is very diverse, but they can usually be categorized as ensemble, cost-sensitive, sampling, and hybrid methods. In this paper, we provide a comprehensive experimental comparison of 45 methods. These methods were selected because they cover the approaches and algorithms frequently employed for bankruptcy prediction and imbalanced learning. Extensive experiments on 15 publicly available datasets with different imbalance ratios showed that the methods based on a combination of ensemble learning and undersampling are able to handle data imbalance and achieve the best results for bankruptcy classification.<\/jats:p>","DOI":"10.1007\/s10462-025-11107-y","type":"journal-article","created":{"date-parts":[[2025,1,25]],"date-time":"2025-01-25T08:30:50Z","timestamp":1737793850000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["An experimental survey of imbalanced learning algorithms for bankruptcy prediction"],"prefix":"10.1007","volume":"58","author":[{"given":"Peter","family":"Gnip","sequence":"first","affiliation":[]},{"given":"R\u00f3bert","family":"Kan\u00e1sz","sequence":"additional","affiliation":[]},{"given":"Martin","family":"Zori\u010dak","sequence":"additional","affiliation":[]},{"given":"Peter","family":"Drot\u00e1r","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,1,25]]},"reference":[{"key":"11107_CR1","doi-asserted-by":"crossref","unstructured":"Ainan UH, Por LY, Chen Y-L, Yang J, Ku CS (2024) Advancing bankruptcy forecasting with hybrid machine learning techniques: insights from an unbalanced polish dataset. IEEE Access","DOI":"10.1109\/ACCESS.2024.3354173"},{"key":"11107_CR2","doi-asserted-by":"publisher","unstructured":"Al\u00a0Helal M, Haydar MS, Mostafa SAM (2016) Algorithms efficiency measurement on imbalanced data using geometric mean and cross validation. In: 2016 International workshop on computational intelligence (IWCI), pp 110\u2013114. https:\/\/doi.org\/10.1109\/IWCI.2016.7860349. IEEE","DOI":"10.1109\/IWCI.2016.7860349"},{"key":"11107_CR3","doi-asserted-by":"publisher","first-page":"164","DOI":"10.1016\/j.eswa.2017.10.040","volume":"94","author":"HA Alaka","year":"2018","unstructured":"Alaka HA, Oyedele LO, Owolabi HA, Kumar V, Ajayi SO, Akinade OO, Bilal M (2018) Systematic review of bankruptcy prediction models: towards a framework for tool selection. Expert Syst Appl 94:164\u2013184. https:\/\/doi.org\/10.1016\/j.eswa.2017.10.040","journal-title":"Expert Syst Appl"},{"issue":"4","key":"11107_CR4","doi-asserted-by":"publisher","first-page":"589","DOI":"10.1111\/j.1540-6261.1968.tb00843.x","volume":"23","author":"EI Altman","year":"1968","unstructured":"Altman EI (1968) Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. J Financ 23(4):589\u2013609","journal-title":"J Financ"},{"key":"11107_CR5","doi-asserted-by":"crossref","unstructured":"Amirshahi B, Lahmiri S (2024) Bankruptcy prediction using optimal ensemble models under balanced and imbalanced data. Expert Syst, 13599","DOI":"10.1111\/exsy.13599"},{"issue":"8","key":"11107_CR6","doi-asserted-by":"publisher","first-page":"5059","DOI":"10.1016\/j.jksuci.2022.06.005","volume":"34","author":"A Arafa","year":"2022","unstructured":"Arafa A, El-Fishawy N, Badawy M, Radad M (2022) Rn-smote: Reduced noise smote based on dbscan for enhancing imbalanced data classification. J King Saud Univ-Comput Inform Sci 34(8):5059\u20135074. https:\/\/doi.org\/10.1016\/j.jksuci.2022.06.005","journal-title":"J King Saud Univ-Comput Inform Sci"},{"key":"11107_CR7","first-page":"972","volume":"1141","author":"A Arning","year":"1996","unstructured":"Arning A, Agrawal R, Raghavan P (1996) A linear method for deviation detection in large databases. KDD 1141:972\u2013981","journal-title":"KDD"},{"issue":"5","key":"11107_CR8","first-page":"1184","volume":"29","author":"B Baik","year":"2022","unstructured":"Baik B, Han S-Y, Kim BH, Oh S (2022) Financial reporting quality of privately held firms: evidence from private corporations versus limited companies. Asia-Pac J Account Econ 29(5):1184\u20131207","journal-title":"Asia-Pac J Account Econ"},{"key":"11107_CR9","first-page":"10","volume":"3","author":"GE Batista","year":"2003","unstructured":"Batista GE, Bazzan AL, Monard MC et al (2003) Balancing training data for automated annotation of keywords: a case study. Wob 3:10\u201318","journal-title":"Wob"},{"key":"11107_CR10","doi-asserted-by":"crossref","unstructured":"Beaver WH (1966) Financial ratios as predictors of failure. J Account Res, pp 71\u2013111","DOI":"10.2307\/2490171"},{"key":"11107_CR11","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1007\/BF00058655","volume":"24","author":"L Breiman","year":"1996","unstructured":"Breiman L (1996) Bagging predictors. Mach Learn 24:123\u2013140. https:\/\/doi.org\/10.1007\/BF00058655","journal-title":"Mach Learn"},{"key":"11107_CR12","volume-title":"Classification and regression trees","author":"L Breiman","year":"1984","unstructured":"Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth and Brooks, Monterey"},{"key":"11107_CR13","doi-asserted-by":"publisher","unstructured":"Breunig MM, Kriegel H-P, Ng RT, Sander J (2000) Lof: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD international conference on management of data, pp 93\u2013104. https:\/\/doi.org\/10.1145\/342009.335388","DOI":"10.1145\/342009.335388"},{"key":"11107_CR14","doi-asserted-by":"publisher","unstructured":"Chawla NV, Lazarevic A, Hall LO, Bowyer KW (2003) Smoteboost: Improving prediction of the minority class in boosting. In: Knowledge Discovery in Databases: PKDD 2003: 7th European conference on principles and practice of knowledge discovery in databases, Cavtat-Dubrovnik, Croatia, September 22\u201326, 2003. Proceedings 7, pp 107\u2013119. https:\/\/doi.org\/10.1007\/978-3-540-39804-2_12. Springer","DOI":"10.1007\/978-3-540-39804-2_12"},{"key":"11107_CR15","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) Smote: synthetic minority over-sampling technique. J Artif Intell Res 16:321\u2013357. https:\/\/doi.org\/10.1613\/jair.953","journal-title":"J Artif Intell Res"},{"key":"11107_CR16","volume-title":"Using random forest to learn imbalanced data","author":"C Chen","year":"2004","unstructured":"Chen C, Breiman L (2004) Using random forest to learn imbalanced data. University of California, Berkeley"},{"issue":"8","key":"11107_CR17","doi-asserted-by":"publisher","first-page":"1348","DOI":"10.1016\/j.knosys.2011.06.008","volume":"24","author":"H-L Chen","year":"2011","unstructured":"Chen H-L, Yang B, Wang G, Liu J, Xu X, Wang S-J, Liu D-Y (2011) A novel bankruptcy prediction model based on an adaptive fuzzy k-nearest neighbor method. Knowl-Based Syst 24(8):1348\u20131359. https:\/\/doi.org\/10.1016\/j.knosys.2011.06.008","journal-title":"Knowl-Based Syst"},{"key":"11107_CR18","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2019.113155","volume":"146","author":"Z Chen","year":"2020","unstructured":"Chen Z, Chen W, Shi Y (2020) Ensemble learning with label proportions for bankruptcy prediction. Expert Syst Appl 146:113155. https:\/\/doi.org\/10.1016\/j.eswa.2019.113155","journal-title":"Expert Syst Appl"},{"key":"11107_CR19","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.120714","volume":"233","author":"T-K Chen","year":"2023","unstructured":"Chen T-K, Liao H-H, Chen G-D, Kang W-H, Lin Y-C (2023) Bankruptcy prediction using machine learning models with the text-based communicative value of annual reports. Expert Syst Appl 233:120714","journal-title":"Expert Syst Appl"},{"key":"11107_CR20","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd Acm Sigkdd international conference on knowledge discovery and data mining, pp 785\u2013794","DOI":"10.1145\/2939672.2939785"},{"key":"11107_CR21","doi-asserted-by":"crossref","unstructured":"Cheraghali H, Moln\u00e1r P (2023) Sme default prediction: a systematic methodology-focused review. J Small Business Manag, pp 1\u201359","DOI":"10.1080\/00472778.2024.2390500"},{"key":"11107_CR22","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2024.124761","volume":"255","author":"A Dasilas","year":"2024","unstructured":"Dasilas A, Rigani A (2024) Machine learning techniques in bankruptcy prediction: a systematic literature review. Expert Syst Appl 255:124761","journal-title":"Expert Syst Appl"},{"key":"11107_CR23","unstructured":"Deepthi AS (2014) Anomaly detection using principal component analysis"},{"key":"11107_CR24","first-page":"1","volume":"7","author":"J Dem\u0161ar","year":"2006","unstructured":"Dem\u0161ar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1\u201330","journal-title":"J Mach Learn Res"},{"issue":"4","key":"11107_CR25","doi-asserted-by":"publisher","first-page":"3081","DOI":"10.1016\/j.eswa.2007.06.037","volume":"34","author":"Y Ding","year":"2008","unstructured":"Ding Y, Song X, Zen Y (2008) Forecasting financial condition of Chinese listed companies based on support vector machine. Expert Syst Appl 34(4):3081\u20133089","journal-title":"Expert Syst Appl"},{"issue":"2","key":"11107_CR26","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2022.103235","volume":"60","author":"H Ding","year":"2023","unstructured":"Ding H, Sun Y, Wang Z, Huang N, Shen Z, Cui X (2023) Rgan-el: a gan and ensemble learning-based hybrid approach for imbalanced data classification. Inform Process Manag 60(2):103235. https:\/\/doi.org\/10.1016\/j.ipm.2022.103235","journal-title":"Inform Process Manag"},{"key":"11107_CR27","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1016\/j.ins.2023.01.147","volume":"629","author":"H Ding","year":"2023","unstructured":"Ding H, Sun Y, Huang N, Shen Z, Wang Z, Iftekhar A, Cui X (2023) Rvgan-tl: A generative adversarial networks and transfer learning-based hybrid approach for imbalanced data classification. Inf Sci 629:184\u2013203","journal-title":"Inf Sci"},{"key":"11107_CR28","doi-asserted-by":"publisher","DOI":"10.1109\/JBHI.2023.3253208","author":"R Ding","year":"2023","unstructured":"Ding R, Zhou Y, Xu J, Xie Y, Liang Q, Ren H, Wang Y, Chen Y, Wang L, Huang M (2023) Cross-hospital sepsis early detection via semi-supervised optimal transport with self-paced ensemble. IEEE J Biomed Health Inform. https:\/\/doi.org\/10.1109\/JBHI.2023.3253208","journal-title":"IEEE J Biomed Health Inform"},{"issue":"1","key":"11107_CR29","doi-asserted-by":"publisher","first-page":"2138124","DOI":"10.1080\/08839514.2022.2138124","volume":"36","author":"Kuiziniene RD Dovile","year":"2022","unstructured":"Dovile Kuiziniene RD, Tomas Krilavicius Maskeliunas R (2022) Systematic review of financial distress identification using artificial intelligence methods. Appl Artif Intell 36(1):2138124. https:\/\/doi.org\/10.1080\/08839514.2022.2138124","journal-title":"Appl Artif Intell"},{"key":"11107_CR30","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2019.104360","author":"P Drot\u00e1r","year":"2019","unstructured":"Drot\u00e1r P, Gnip P, Zori\u010dak M, Gazda V (2019) Small-and medium-enterprises bankruptcy dataset. Data Brief. https:\/\/doi.org\/10.1016\/j.dib.2019.104360","journal-title":"Data Brief"},{"issue":"2","key":"11107_CR31","doi-asserted-by":"publisher","first-page":"13485","DOI":"10.1111\/exsy.13485","volume":"41","author":"K El Madou","year":"2024","unstructured":"El Madou K, Marso S, El Kharrim M, El Merouani M (2024) Evolutions in machine learning technology for financial distress prediction: a comprehensive review and comparative analysis. Expert Syst 41(2):13485","journal-title":"Expert Syst"},{"key":"11107_CR32","first-page":"97","volume":"99","author":"W Fan","year":"1999","unstructured":"Fan W, Stolfo SJ, Zhang J, Chan PK (1999) Adacost: misclassification cost-sensitive boosting. Icml 99:97\u2013105","journal-title":"Icml"},{"key":"11107_CR33","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1016\/j.dss.2017.08.001","volume":"102","author":"S Figini","year":"2017","unstructured":"Figini S, Bonelli F, Giovannini E (2017) Solvency prediction for small and medium enterprises in banking. Decis Support Syst 102:91\u201397. https:\/\/doi.org\/10.1016\/j.dss.2017.08.001","journal-title":"Decis Support Syst"},{"key":"11107_CR34","unstructured":"Fitzpatrick PJ (1932) A comparison of the ratios of successful industrial enterprises with those of failed companies"},{"key":"11107_CR35","doi-asserted-by":"publisher","unstructured":"Freund Y, Schapire RE (1995) A desicion-theoretic generalization of on-line learning and an application to boosting. In: European conference on computational learning theory, pp 23\u201337. https:\/\/doi.org\/10.1007\/3-540-59119-2_166. Springer","DOI":"10.1007\/3-540-59119-2_166"},{"issue":"200","key":"11107_CR36","doi-asserted-by":"publisher","first-page":"675","DOI":"10.1080\/01621459.1937.10503522","volume":"32","author":"M Friedman","year":"1937","unstructured":"Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675\u2013701","journal-title":"J Am Stat Assoc"},{"key":"11107_CR37","doi-asserted-by":"publisher","unstructured":"Gurnani I, Tandian FS, Anggreainy MS et al (2021) Predicting company bankruptcy using random forest method. In: 2021 2nd international conference on artificial intelligence and data sciences (AiDAS), pp 1\u20135. https:\/\/doi.org\/10.1109\/AiDAS53897.2021.9574384. IEEE","DOI":"10.1109\/AiDAS53897.2021.9574384"},{"key":"11107_CR38","doi-asserted-by":"publisher","unstructured":"Han H, Wang W-Y, Mao B-H (2005) Borderline-smote: a new over-sampling method in imbalanced data sets learning. In: International conference on intelligent computing. Springer, pp 878\u2013887. https:\/\/doi.org\/10.1007\/11538059","DOI":"10.1007\/11538059"},{"issue":"9\u201310","key":"11107_CR39","doi-asserted-by":"publisher","first-page":"1641","DOI":"10.1016\/S0167-8655(03)00003-5","volume":"24","author":"Z He","year":"2003","unstructured":"He Z, Xu X, Deng S (2003) Discovering cluster-based local outliers. Pattern Recogn Lett 24(9\u201310):1641\u20131650. https:\/\/doi.org\/10.1016\/S0167-8655(03)00003-5","journal-title":"Pattern Recogn Lett"},{"key":"11107_CR40","doi-asserted-by":"publisher","unstructured":"He H, Bai Y, Garcia EA, Li S (2008) Adasyn: Adaptive synthetic sampling approach for imbalanced learning. In: 2008 IEEE international joint conference on neural networks (IEEE World Congress on Computational Intelligence). IEEE, pp 1322\u20131328. https:\/\/doi.org\/10.1109\/IJCNN.2008.4633969","DOI":"10.1109\/IJCNN.2008.4633969"},{"issue":"5\u20136","key":"11107_CR41","doi-asserted-by":"publisher","first-page":"412","DOI":"10.1002\/sam.10061","volume":"2","author":"S Hido","year":"2009","unstructured":"Hido S, Kashima H, Takahashi Y (2009) Roughly balanced bagging for imbalanced data. Stat Anal Data Min: ASA Data Sci J 2(5\u20136):412\u2013426. https:\/\/doi.org\/10.1002\/sam.10061","journal-title":"Stat Anal Data Min: ASA Data Sci J"},{"key":"11107_CR42","doi-asserted-by":"publisher","unstructured":"Ho TK (1995) Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition. IEEE, 1, pp 278\u2013282. https:\/\/doi.org\/10.1109\/ICDAR.1995.598994","DOI":"10.1109\/ICDAR.1995.598994"},{"issue":"3","key":"11107_CR43","doi-asserted-by":"publisher","first-page":"1421","DOI":"10.1002\/wics.1421","volume":"10","author":"M Hubert","year":"2018","unstructured":"Hubert M, Debruyne M, Rousseeuw PJ (2018) Minimum covariance determinant and extensions. Wiley Interdiscip Rev: Comput Stat 10(3):1421. https:\/\/doi.org\/10.1002\/wics.1421","journal-title":"Wiley Interdiscip Rev: Comput Stat"},{"key":"11107_CR44","doi-asserted-by":"publisher","DOI":"10.1016\/j.techfore.2023.122450","volume":"190","author":"C Jiang","year":"2023","unstructured":"Jiang C, Zhou Y, Chen B (2023) Mining semantic features in patent text for financial distress prediction. Technol Forecast Soc Chang 190:122450","journal-title":"Technol Forecast Soc Chang"},{"key":"11107_CR45","doi-asserted-by":"publisher","first-page":"432","DOI":"10.1016\/j.neucom.2021.08.086","volume":"464","author":"M Juez-Gil","year":"2021","unstructured":"Juez-Gil M, Arnaiz-Gonzalez A, Rodriguez JJ, Lopez-Nozal C, Garcia-Osorio C (2021) Approx-smote: fast smote for big data on apache spark. Neurocomputing 464:432\u2013437. https:\/\/doi.org\/10.1016\/j.neucom.2021.08.086","journal-title":"Neurocomputing"},{"key":"11107_CR46","unstructured":"Karakoulas G, Shawe-Taylor J (1998) Optimizing classifers for imbalanced training sets. Adv Neural Inform Process Syst, 11"},{"issue":"4","key":"11107_CR47","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3343440","volume":"52","author":"H Kaur","year":"2019","unstructured":"Kaur H, Pannu HS, Malhi AK (2019) A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput Surv (CSUR) 52(4):1\u201336. https:\/\/doi.org\/10.1145\/3343440","journal-title":"ACM Comput Surv (CSUR)"},{"key":"11107_CR48","doi-asserted-by":"crossref","unstructured":"Khan AA, Chaudhari O, Chandra R (2023) A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation. Expert Syst Appl, 122778","DOI":"10.1016\/j.eswa.2023.122778"},{"issue":"2","key":"11107_CR49","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1016\/j.jik.2021.01.001","volume":"6","author":"SY Kim","year":"2021","unstructured":"Kim SY, Upneja A (2021) Majority voting ensemble with a decision trees for business failure prediction during economic downturns. J Innov Knowl 6(2):112\u2013123. https:\/\/doi.org\/10.1016\/j.jik.2021.01.001","journal-title":"J Innov Knowl"},{"issue":"1","key":"11107_CR50","doi-asserted-by":"publisher","first-page":"2138124","DOI":"10.1080\/08839514.2022.2138124","volume":"36","author":"D Kuizinien\u0117","year":"2022","unstructured":"Kuizinien\u0117 D, Krilavi\u010dius T, Dama\u0161evi\u010dius R, Maskeli\u016bnas R (2022) Systematic review of financial distress identification using artificial intelligence methods. Appl Artif Intell 36(1):2138124","journal-title":"Appl Artif Intell"},{"issue":"1","key":"11107_CR51","doi-asserted-by":"publisher","first-page":"2138124","DOI":"10.1080\/08839514.2022.2138124","volume":"36","author":"D Kuizinien\u0117","year":"2022","unstructured":"Kuizinien\u0117 D, Krilavi\u010dius T, Dama\u0161evi\u010dius R, Maskeli\u016bnas R (2022) Systematic review of financial distress identification using artificial intelligence methods. Appl Artif Intell 36(1):2138124. https:\/\/doi.org\/10.1080\/08839514.2022.2138124","journal-title":"Appl Artif Intell"},{"key":"11107_CR52","doi-asserted-by":"publisher","unstructured":"Lazarevic A, Kumar V (2005) Feature bagging for outlier detection. In: Proceedings of the eleventh ACM SIGKDD international conference on knowledge discovery in data mining. KDD \u201905. Association for Computing Machinery, New York, NY, USA, pp 157\u2013166. https:\/\/doi.org\/10.1145\/1081870.1081891","DOI":"10.1145\/1081870.1081891"},{"issue":"5","key":"11107_CR53","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1049\/cmu2.12268","volume":"16","author":"T Le","year":"2022","unstructured":"Le T (2022) A comprehensive survey of imbalanced learning methods for bankruptcy prediction. IET Commun 16(5):433\u2013441","journal-title":"IET Commun"},{"key":"11107_CR54","unstructured":"Leskovec J, Shawe-Taylor J (2003) Linear programming boosting for uneven datasets. In: Proceedings of the 20th international conference on machine learning (ICML-03), pp 456\u2013463"},{"key":"11107_CR55","doi-asserted-by":"publisher","first-page":"437","DOI":"10.1007\/s10115-017-1061-1","volume":"54","author":"D Liang","year":"2018","unstructured":"Liang D, Tsai C-F, Dai A-J, Eberle W (2018) A novel classifier ensemble approach for financial distress prediction. Knowl Inf Syst 54:437\u2013462. https:\/\/doi.org\/10.1007\/s10115-017-1061-1","journal-title":"Knowl Inf Syst"},{"key":"11107_CR56","unstructured":"Liang D, Tsai C-F (2020) UCI machine learning repository - Taiwanese bankruptcy prediction data set. https:\/\/archive.ics.uci.edu\/ml\/datasets\/Taiwanese+Bankruptcy+Prediction"},{"issue":"4","key":"11107_CR57","first-page":"421","volume":"42","author":"W-Y Lin","year":"2011","unstructured":"Lin W-Y, Hu Y-H, Tsai C-F (2011) Machine learning in financial crisis prediction: a survey. IEEE Trans Syst, Man, Cybern, Part C (Appl Rev) 42(4):421\u2013436","journal-title":"IEEE Trans Syst, Man, Cybern, Part C (Appl Rev)"},{"issue":"1","key":"11107_CR58","doi-asserted-by":"publisher","first-page":"12335","DOI":"10.1111\/exsy.12335","volume":"36","author":"W-C Lin","year":"2019","unstructured":"Lin W-C, Lu Y-H, Tsai C-F (2019) Feature selection in single and ensemble learning-based bankruptcy prediction models. Expert Syst 36(1):12335. https:\/\/doi.org\/10.1111\/exsy.12335","journal-title":"Expert Syst"},{"issue":"2","key":"11107_CR59","doi-asserted-by":"publisher","first-page":"845","DOI":"10.1007\/s10462-022-10186-5","volume":"56","author":"C Lin","year":"2023","unstructured":"Lin C, Tsai C-F, Lin W-C (2023) Towards hybrid over-and under-sampling combination methods for class imbalanced datasets: an experimental study. Artif Intell Rev 56(2):845\u2013863. https:\/\/doi.org\/10.1007\/s10462-022-10186-5","journal-title":"Artif Intell Rev"},{"key":"11107_CR60","doi-asserted-by":"publisher","unstructured":"Liu FT, Ting KM, Zhou Z-H (2008) Isolation forest. In: 2008 Eighth IEEE international conference on data mining. IEEE, pp 413\u2013422. https:\/\/doi.org\/10.1109\/ICDM.2008.17","DOI":"10.1109\/ICDM.2008.17"},{"issue":"2","key":"11107_CR61","doi-asserted-by":"publisher","first-page":"539","DOI":"10.1109\/TSMCB.2008.2007853","volume":"39","author":"X-Y Liu","year":"2008","unstructured":"Liu X-Y, Wu J, Zhou Z-H (2008) Exploratory undersampling for class-imbalance learning. IEEE Trans Syst, Man, Cybern, Part B (Cybern) 39(2):539\u2013550. https:\/\/doi.org\/10.1109\/TSMCB.2008.2007853","journal-title":"IEEE Trans Syst, Man, Cybern, Part B (Cybern)"},{"key":"11107_CR62","doi-asserted-by":"publisher","unstructured":"Liu Z, Cao W, Gao Z, Bian J, Chen H, Chang Y, Liu T-Y (2020) Self-paced ensemble for highly imbalanced massive data classification. In: 2020 IEEE 36th international conference on data engineering (ICDE). IEEE, pp 841\u2013852. https:\/\/doi.org\/10.1109\/ICDE48307.2020.00078","DOI":"10.1109\/ICDE48307.2020.00078"},{"key":"11107_CR63","doi-asserted-by":"publisher","unstructured":"Li Z, Zhao Y, Botta N, Ionescu C, Hu X (2020) Copod: copula-based outlier detection. In: 2020 IEEE international conference on data mining (ICDM). IEEE, pp 1118\u20131123. https:\/\/doi.org\/10.1109\/ICDM50108.2020.00135","DOI":"10.1109\/ICDM50108.2020.00135"},{"key":"11107_CR64","unstructured":"Loukas L, Stogiannidis I, Malakasiotis P, Vassos S (2023) Breaking the bank with chatgpt: Few-shot text classification for finance. arXiv preprint arXiv:2308.14634"},{"key":"11107_CR65","unstructured":"Mani I, Zhang I (2003) knn approach to unbalanced data distributions: a case study involving information extraction. In: Proceedings of workshop on learning from imbalanced datasets. ICML, 126, pp 1\u20137"},{"issue":"1\u20132","key":"11107_CR66","doi-asserted-by":"publisher","first-page":"76","DOI":"10.1504\/IJBPM.2020.106111","volume":"21","author":"N Memic","year":"2020","unstructured":"Memic N, Memic D (2020) Financial traits of bankruptcy, empirical evidence from Bosnia and Herzegovina. Int J Bus Perform Manag 21(1\u20132):76\u201394. https:\/\/doi.org\/10.1504\/IJBPM.2020.106111","journal-title":"Int J Bus Perform Manag"},{"key":"11107_CR67","doi-asserted-by":"publisher","DOI":"10.1016\/j.iswa.2023.200204","volume":"18","author":"MA Muslim","year":"2023","unstructured":"Muslim MA, Nikmah TL, Pertiwi DAA, Dasril Y et al (2023) New model combination meta-learner to improve accuracy prediction p2p lending with stacking ensemble learning. Intell Syst Appl 18:200204. https:\/\/doi.org\/10.1016\/j.iswa.2023.200204","journal-title":"Intell Syst Appl"},{"key":"11107_CR68","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.119640","volume":"219","author":"N Nazareth","year":"2023","unstructured":"Nazareth N, Ramana Reddy YV (2023) Financial applications of machine learning: a literature review. Expert Syst Appl 219:119640. https:\/\/doi.org\/10.1016\/j.eswa.2023.119640","journal-title":"Expert Syst Appl"},{"key":"11107_CR69","doi-asserted-by":"publisher","DOI":"10.1016\/j.cose.2023.103190","volume":"129","author":"Z Niu","year":"2023","unstructured":"Niu Z, Guo W, Xue J, Wang Y, Kong Z, Huang L (2023) A novel anomaly detection approach based on ensemble semi-supervised active learning (Adessa). Comput Secur 129:103190. https:\/\/doi.org\/10.1016\/j.cose.2023.103190","journal-title":"Comput Secur"},{"key":"11107_CR70","doi-asserted-by":"publisher","unstructured":"Opitz DW, Maclin RF (1997) An empirical evaluation of bagging and boosting for artificial neural networks. In: Proceedings of international conference on neural networks (ICNN\u201997). IEEE, 3, pp 1401\u20131405. https:\/\/doi.org\/10.1109\/ICNN.1997.613999","DOI":"10.1109\/ICNN.1997.613999"},{"key":"11107_CR71","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1016\/j.techfore.2017.05.034","volume":"131","author":"J Ouenniche","year":"2018","unstructured":"Ouenniche J, P\u00e9rez-Gladish B, Bouslah K (2018) An out-of-sample framework for topsis-based classifiers with application in bankruptcy prediction. Technol Forecast Soc Change 131:111\u2013116","journal-title":"Technol Forecast Soc Change"},{"key":"11107_CR72","doi-asserted-by":"publisher","unstructured":"Pamula R, Deka JK, Nandi S (2011) An outlier detection method based on clustering. In: 2011 Second international conference on emerging applications of information technology, pp 253\u2013256. https:\/\/doi.org\/10.1109\/EAIT.2011.25","DOI":"10.1109\/EAIT.2011.25"},{"issue":"7\u20138","key":"11107_CR73","doi-asserted-by":"publisher","first-page":"871","DOI":"10.1111\/jbfa.12317","volume":"45","author":"K Park","year":"2018","unstructured":"Park K (2018) Financial reporting quality and corporate innovation. J Bus Finance Account 45(7\u20138):871\u2013894","journal-title":"J Bus Finance Account"},{"key":"11107_CR74","doi-asserted-by":"publisher","unstructured":"Pearson, K (1901) Liii. on lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin philosophical magazine and journal of science 2(11):559\u2013572 https:\/\/doi.org\/10.1080\/14786440109462720","DOI":"10.1080\/14786440109462720"},{"key":"11107_CR75","doi-asserted-by":"publisher","first-page":"275","DOI":"10.1007\/s10994-015-5521-0","volume":"102","author":"T Pevn\u00fd","year":"2016","unstructured":"Pevn\u00fd T (2016) Loda: Lightweight on-line detector of anomalies. Mach Learn 102:275\u2013304. https:\/\/doi.org\/10.1007\/s10994-015-5521-0","journal-title":"Mach Learn"},{"key":"11107_CR76","doi-asserted-by":"publisher","first-page":"895","DOI":"10.1016\/j.procs.2019.12.065","volume":"162","author":"Y Qu","year":"2019","unstructured":"Qu Y, Quan P, Lei M, Shi Y (2019) Review of bankruptcy prediction using machine learning and deep learning techniques. Procedia Comput Sci 162:895\u2013899. https:\/\/doi.org\/10.1016\/j.procs.2019.12.065","journal-title":"Procedia Comput Sci"},{"key":"11107_CR77","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1016\/j.patrec.2013.12.016","volume":"40","author":"JA Quinn","year":"2014","unstructured":"Quinn JA, Sugiyama M (2014) A least-squares approach to anomaly detection in static and sequential data. Pattern Recogn Lett 40:36\u201340. https:\/\/doi.org\/10.1016\/j.patrec.2013.12.016","journal-title":"Pattern Recogn Lett"},{"key":"11107_CR78","doi-asserted-by":"crossref","unstructured":"Rezvani S, Wang X (2023) A broad review on class imbalance learning techniques. Appl Soft Comput, 110415","DOI":"10.1016\/j.asoc.2023.110415"},{"key":"11107_CR79","unstructured":"Sch\u00f6lkopf B, Williamson RC, Smola A, Shawe-Taylor J, Platt J (1999) Support vector method for novelty detection. Adv Neural Inform Process Syst, 12"},{"key":"11107_CR80","doi-asserted-by":"publisher","DOI":"10.1007\/s11222-021-10041-7","author":"J Schreurs","year":"2021","unstructured":"Schreurs J, Vranckx I, Hubert M, Suykens J, Rousseeuw P (2021) Outlier detection in non-elliptical data by kernel mrcd. Stat Comput. https:\/\/doi.org\/10.1007\/s11222-021-10041-7","journal-title":"Stat Comput"},{"issue":"1","key":"11107_CR81","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1109\/TSMCA.2009.2029559","volume":"40","author":"C Seiffert","year":"2009","unstructured":"Seiffert C, Khoshgoftaar TM, Van Hulse J, Napolitano A (2009) Rusboost: a hybrid approach to alleviating class imbalance. IEEE Trans Syst, Man, Cybern-Part A: Syst Humans 40(1):185\u2013197. https:\/\/doi.org\/10.1109\/TSMCA.2009.2029559","journal-title":"IEEE Trans Syst, Man, Cybern-Part A: Syst Humans"},{"key":"11107_CR82","doi-asserted-by":"crossref","unstructured":"Shakeel MR, Siddiqui TA, Alam S (2023) Feature selection in corporate bankruptcy prediction using ml techniques: a systematic literature review. Adv Signal Process, Embedded Syst IoT, pp 345\u2013363","DOI":"10.1007\/978-981-19-8865-3_32"},{"key":"11107_CR83","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1016\/j.ins.2022.12.046","volume":"624","author":"S Shi","year":"2023","unstructured":"Shi S, Li J, Zhu D, Yang F, Xu Y (2023) A hybrid imbalanced classification model based on data density. Inf Sci 624:50\u201367. https:\/\/doi.org\/10.1016\/j.ins.2022.12.046","journal-title":"Inf Sci"},{"key":"11107_CR84","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.121418","volume":"237","author":"E Silva Mattos","year":"2024","unstructured":"Silva Mattos E, Shasha D (2024) Bankruptcy prediction with low-quality financial information. Expert Syst Appl 237:121418","journal-title":"Expert Syst Appl"},{"key":"11107_CR85","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1016\/j.ins.2021.01.059","volume":"559","author":"J Sun","year":"2021","unstructured":"Sun J, Fujita H, Zheng Y, Ai W (2021) Multi-class financial distress prediction based on support vector machines integrated with the decomposition and fusion methods. Inf Sci 559:153\u2013170. https:\/\/doi.org\/10.1016\/j.ins.2021.01.059","journal-title":"Inf Sci"},{"key":"11107_CR86","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1023\/B:MACH.0000008084.60811.49","volume":"54","author":"DM Tax","year":"2004","unstructured":"Tax DM, Duin RP (2004) Support vector data description. Mach Learn 54:45\u201366. https:\/\/doi.org\/10.1023\/B:MACH.0000008084.60811.49","journal-title":"Mach Learn"},{"issue":"11","key":"11107_CR87","doi-asserted-by":"publisher","first-page":"769","DOI":"10.1109\/TSMC.1976.4309452","volume":"SMC\u20136","author":"I Tomek","year":"1976","unstructured":"Tomek I (1976) Two modifications of cnn. IEEE Trans Syst Man Cybern SMC\u20136(11):769\u2013772. https:\/\/doi.org\/10.1109\/TSMC.1976.4309452","journal-title":"IEEE Trans Syst Man Cybern"},{"key":"11107_CR88","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1007\/s11187-015-9645-1","volume":"45","author":"H Vander Bauwhede","year":"2015","unstructured":"Vander Bauwhede H, De Meyere M, Van Cauwenberge P (2015) Financial reporting quality and the cost of debt of smes. Small Bus Econ 45:149\u2013164","journal-title":"Small Bus Econ"},{"issue":"5","key":"11107_CR89","doi-asserted-by":"publisher","first-page":"956","DOI":"10.1002\/for.2842","volume":"41","author":"D Veganzones","year":"2022","unstructured":"Veganzones D (2022) Corporate failure prediction using threshold-based models. J Forecast 41(5):956\u2013979. https:\/\/doi.org\/10.1002\/for.2842","journal-title":"J Forecast"},{"issue":"7","key":"11107_CR90","doi-asserted-by":"publisher","first-page":"0254030","DOI":"10.1371\/journal.pone.0254030","volume":"16","author":"H Wang","year":"2021","unstructured":"Wang H, Liu X (2021) Undersampling bankruptcy prediction: Taiwan bankruptcy data. PLoS ONE 16(7):0254030. https:\/\/doi.org\/10.1371\/journal.pone.0254030","journal-title":"PLoS ONE"},{"issue":"5","key":"11107_CR91","doi-asserted-by":"publisher","first-page":"5325","DOI":"10.1016\/j.eswa.2011.11.003","volume":"39","author":"G Wang","year":"2012","unstructured":"Wang G, Ma J (2012) A hybrid ensemble approach for enterprise credit risk assessment based on support vector machine. Expert Syst Appl 39(5):5325\u20135331","journal-title":"Expert Syst Appl"},{"issue":"3","key":"11107_CR92","doi-asserted-by":"publisher","first-page":"2421","DOI":"10.1109\/TKDE.2021.3112126","volume":"35","author":"R Wu","year":"2023","unstructured":"Wu R, Keogh EJ (2023) Current time series anomaly detection benchmarks are flawed and are creating the illusion of progress. IEEE Trans Knowl Data Eng 35(3):2421\u20132429. https:\/\/doi.org\/10.1109\/TKDE.2021.3112126","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"11107_CR93","doi-asserted-by":"publisher","unstructured":"Yang P, Huang B (2008) Knn based outlier detection algorithm in large dataset. In: 2008 International workshop on education technology and training & 2008 international workshop on geoscience and remote sensing. IEEE, 1, pp 611\u2013613. https:\/\/doi.org\/10.1109\/ETTandGRS.2008.306","DOI":"10.1109\/ETTandGRS.2008.306"},{"key":"11107_CR94","doi-asserted-by":"publisher","unstructured":"Yang X, Song Q, Cao A (2005) Weighted support vector machine for data classification. In: Proceedings. 2005 IEEE international joint conference on neural networks, 2, pp 859\u20138642. https:\/\/doi.org\/10.1109\/IJCNN.2005.1555965","DOI":"10.1109\/IJCNN.2005.1555965"},{"key":"11107_CR95","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2021.115559","volume":"185","author":"Y Zelenkov","year":"2021","unstructured":"Zelenkov Y, Volodarskiy N (2021) Bankruptcy prediction on the base of the unbalanced data using multi-objective selection of classifiers. Expert Syst Appl 185:115559. https:\/\/doi.org\/10.1016\/j.eswa.2021.115559","journal-title":"Expert Syst Appl"},{"key":"11107_CR96","doi-asserted-by":"publisher","unstructured":"Zhao Y, Hryniewicki MK (2018) Xgbod: improving supervised outlier detection with unsupervised representation learning. In: 2018 International joint conference on neural networks (IJCNN). IEEE, pp 1\u20138. https:\/\/doi.org\/10.1109\/IJCNN.2018.8489605","DOI":"10.1109\/IJCNN.2018.8489605"},{"key":"11107_CR97","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-022-10272-8","author":"Y Zheng","year":"2023","unstructured":"Zheng Y, Xu Z, Xiao A (2023) Deep learning in economics: a systematic and critical review. Artif Intell Rev. https:\/\/doi.org\/10.1007\/s10462-022-10272-8","journal-title":"Artif Intell Rev"},{"key":"11107_CR98","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1016\/j.eswa.2016.04.001","volume":"58","author":"M Zi\u0119ba","year":"2016","unstructured":"Zi\u0119ba M, Tomczak SK, Tomczak JM (2016) Ensemble boosted trees with synthetic features generation in application to bankruptcy prediction. Expert Syst Appl 58:93\u2013101. https:\/\/doi.org\/10.1016\/j.eswa.2016.04.001","journal-title":"Expert Syst Appl"}],"container-title":["Artificial Intelligence Review"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10462-025-11107-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10462-025-11107-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10462-025-11107-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T05:42:05Z","timestamp":1740116525000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10462-025-11107-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,25]]},"references-count":98,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2025,4]]}},"alternative-id":["11107"],"URL":"https:\/\/doi.org\/10.1007\/s10462-025-11107-y","relation":{},"ISSN":["1573-7462"],"issn-type":[{"value":"1573-7462","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,1,25]]},"assertion":[{"value":"7 January 2025","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 January 2025","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no Conflict of interest","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"104"}}