{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,22]],"date-time":"2025-10-22T20:42:56Z","timestamp":1761165776614,"version":"build-2065373602"},"reference-count":95,"publisher":"Springer Science and Business Media LLC","issue":"32","license":[{"start":{"date-parts":[[2025,9,24]],"date-time":"2025-09-24T00:00:00Z","timestamp":1758672000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,9,24]],"date-time":"2025-09-24T00:00:00Z","timestamp":1758672000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Institute for Energy Technology"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Comput &amp; Applic"],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Context<\/jats:title>\n                    <jats:p>Software Fault Prediction (SFP) leverages supervised Machine Learning to detect faulty software constructs using software metrics and corresponding labels. Despite recent advances in Deep Learning (DL) for tabular data, their application to SFP remains underexplored.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Objectives<\/jats:title>\n                    <jats:p>This study proposes a novel feature selection method, CorrBoost, which combines correlation analysis and XGBoost to address feature dimensionality. Additionally, we evaluate existing tabular DL architectures, super convergent deep neural networks (sDNN) and TabNet for SFP.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>Using 26 public datasets from NASA, PROMISE, and AEEEM repositories, we apply the adaptive synthetic oversampling technique to manage class imbalance. We compare DL models with five state-of-the-art techniques and two gradient-boosted tree models (XGBoost and LightGBM) using AUC-ROC, AUPRC, and Accuracy. Statistical significance is validated using the Bayesian Signed Rank Test and Scott-Knott ESD.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Gradient-boosted trees and existing state-of-the-art models outperform DL methods in AUC-ROC by 17.9% and 9.6%, respectively. CorrBoost achieves a 55% average reduction in feature dimensionality with negligible performance loss. DL methods, however, incur significantly higher processing time and perform poorly on unseen test data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>CorrBoost combined with boosted tree models offers a superior trade-off between performance and computation. While tabular DL architectures hold promise, they currently lag behind traditional methods for SFP on real-world data.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1007\/s00521-025-11648-x","type":"journal-article","created":{"date-parts":[[2025,9,24]],"date-time":"2025-09-24T18:23:00Z","timestamp":1758738180000},"page":"26845-26886","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["CorrBoost: a feature selection technique and utility of tabular deep neural networks in software fault prediction"],"prefix":"10.1007","volume":"37","author":[{"given":"Tamanna","family":"Sharma","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3556-9331","authenticated-orcid":false,"given":"Sanjay","family":"Misra","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ricardo","family":"Colomo-Palacios","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,9,24]]},"reference":[{"key":"11648_CR1","doi-asserted-by":"publisher","first-page":"450","DOI":"10.1016\/j.knosys.2018.09.004","volume":"163","author":"X Yan","year":"2019","unstructured":"Yan X, Jia M (2019) Intelligent fault diagnosis of rotating machinery using improved multiscale dispersion entropy and mRMR feature selection. Knowl-Based Syst 163:450\u2013471. https:\/\/doi.org\/10.1016\/j.knosys.2018.09.004","journal-title":"Knowl-Based Syst"},{"key":"11648_CR2","doi-asserted-by":"publisher","first-page":"216","DOI":"10.1016\/j.jss.2018.06.025","volume":"144","author":"R \u00d6zak\u0131nc\u0131","year":"2018","unstructured":"\u00d6zak\u0131nc\u0131 R, Tarhan A (2018) Early software defect prediction: a systematic map and review. J Syst Softw 144:216\u2013239. https:\/\/doi.org\/10.1016\/j.jss.2018.06.025","journal-title":"J Syst Softw"},{"key":"11648_CR3","doi-asserted-by":"publisher","first-page":"686","DOI":"10.1016\/j.jss.2017.04.016","volume":"137","author":"L Kumar","year":"2018","unstructured":"Kumar L, Sripada SK, Sureka A, Rath SKu (2018) Effective fault prediction model developed using least square support vector machine (LSSVM). J Syst Softw 137:686\u2013712. https:\/\/doi.org\/10.1016\/j.jss.2017.04.016","journal-title":"J Syst Softw"},{"doi-asserted-by":"publisher","unstructured":"Chen T, Guestrin C 2016. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, New York, NY, USA: ACM, p 785\u201394. https:\/\/doi.org\/10.1145\/2939672.2939785","key":"11648_CR4","DOI":"10.1145\/2939672.2939785"},{"unstructured":"Ellis CM. The latest Trends in data science | Kaggle 2023. https:\/\/www.kaggle.com\/code\/carlmcbrideellis\/the-latest-trends-in-data-science Accessed September 6 2023","key":"11648_CR5"},{"doi-asserted-by":"publisher","unstructured":"Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. 2017 LightGBM: A highly efficient gradient boosting decision tree. https:\/\/doi.org\/10.5555\/3294996.","key":"11648_CR6","DOI":"10.5555\/3294996"},{"key":"11648_CR7","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y Lecun","year":"2015","unstructured":"Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436\u2013444. https:\/\/doi.org\/10.1038\/nature14539","journal-title":"Nature"},{"issue":"6","key":"11648_CR8","doi-asserted-by":"publisher","first-page":"7499","DOI":"10.48550\/arxiv.2110.01889","volume":"35","author":"V Borisov","year":"2021","unstructured":"Borisov V, Leemann T, Se\u00dfler K, Haug J, Pawelczyk M, Kasneci G (2021) Deep neural networks and tabular data: a survey. IEEE trans neural networks learn sys 35(6):7499\u20137519. https:\/\/doi.org\/10.48550\/arxiv.2110.01889","journal-title":"IEEE trans neural networks learn sys"},{"key":"11648_CR9","doi-asserted-by":"publisher","first-page":"23928","DOI":"10.48550\/arxiv.2106.11189","volume":"29","author":"A Kadra","year":"2021","unstructured":"Kadra A, Lindauer M, Hutter F, Grabocka J (2021) Well-tuned simple nets excel on tabular datasets. Adv Neural Inf Process Syst 29:23928\u201323941. https:\/\/doi.org\/10.48550\/arxiv.2106.11189","journal-title":"Adv Neural Inf Process Syst"},{"key":"11648_CR10","doi-asserted-by":"publisher","first-page":"84","DOI":"10.48550\/arxiv.2106.03253","volume":"81","author":"R Shwartz-Ziv","year":"2021","unstructured":"Shwartz-Ziv R, Armon A (2021) Tabular data: deep learning is not all you need. Info Fusion 81:84\u201390. https:\/\/doi.org\/10.48550\/arxiv.2106.03253","journal-title":"Info Fusion"},{"doi-asserted-by":"publisher","unstructured":"Arik S \u00d6, Pfister T (2021) Tabnet: Attentive interpretable tabular learning. In Proceedings of the AAAI conference on artificial intelligence, vol 35, No 8, pp 6679-6687). https:\/\/doi.org\/10.48550\/arxiv.1908.07442.","key":"11648_CR11","DOI":"10.48550\/arxiv.1908.07442"},{"unstructured":"Popov S, Morozov S, Babenko A 2019 Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data.","key":"11648_CR12"},{"unstructured":"Katzir L, Elidan G, R El-Yaniv 2020 Net-DNF: Effective deep modeling of tabular data. In: International conference on learning representations.","key":"11648_CR13"},{"unstructured":"GitHub - baosenguo\/Kaggle-MoA-2nd-Place-Solution n.d. https:\/\/github.com\/baosenguo\/Kaggle-MoA-2nd-Place-Solution. Accessed September 6 2023)","key":"11648_CR14"},{"unstructured":"Sayyad JS, Menzies TJ. PROMISE Software Engineering Repository 2005. http:\/\/promise.site.uottawa.ca\/SERepository\/. Accessed September 6 2023","key":"11648_CR15"},{"doi-asserted-by":"publisher","unstructured":"D\u2019Ambros M, Lanza M, Robbes R 2010 An extensive comparison of bug prediction approaches. Proceedings\u2013international conference on software engineering, p 31\u201341 https:\/\/doi.org\/10.1109\/MSR.2010.5463279","key":"11648_CR16","DOI":"10.1109\/MSR.2010.5463279"},{"doi-asserted-by":"publisher","unstructured":"Jureczko M, Madeyski L 2010 Towards identifying software project clusters with regard to defect prediction. Proceedings of the 6th international conference on predictive models in software engineering\u2013PROMISE \u201910, New York, New York, USA: ACM Press, p 1 https:\/\/doi.org\/10.1145\/1868328.1868342.","key":"11648_CR17","DOI":"10.1145\/1868328.1868342"},{"key":"11648_CR18","doi-asserted-by":"publisher","first-page":"603","DOI":"10.1109\/TSE.2014.2322358","volume":"40","author":"M Shepperd","year":"2014","unstructured":"Shepperd M, Bowes D, Hall T (2014) Researcher bias: the use of machine learning in software defect prediction. IEEE Trans Softw Eng 40:603\u2013616. https:\/\/doi.org\/10.1109\/TSE.2014.2322358","journal-title":"IEEE Trans Softw Eng"},{"key":"11648_CR19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TSE.2016.2584050","volume":"43","author":"C Tantithamthavorn","year":"2017","unstructured":"Tantithamthavorn C, McIntosh S, Hassan AE, Matsumoto K (2017) An empirical comparison of model validation techniques for defect prediction models. IEEE Trans Software Eng 43:1\u201318. https:\/\/doi.org\/10.1109\/TSE.2016.2584050","journal-title":"IEEE Trans Software Eng"},{"key":"11648_CR20","doi-asserted-by":"publisher","first-page":"624","DOI":"10.1111\/aogs.14366","volume":"101","author":"PM Sedgwick","year":"2022","unstructured":"Sedgwick PM, Hammer A, Kesmodel US, Pedersen LH (2022) Current controversies: null hypothesis significance testing. Acta Obstet Gynecol Scand 101:624\u2013627. https:\/\/doi.org\/10.1111\/aogs.14366","journal-title":"Acta Obstet Gynecol Scand"},{"key":"11648_CR21","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1109\/TSE.1976.233837","volume":"SE-2","author":"TJ Mccabe","year":"1976","unstructured":"Mccabe TJ (1976) A complexity measure. IEEE Trans Softw Eng SE-2:308\u2013320. https:\/\/doi.org\/10.1109\/TSE.1976.233837","journal-title":"IEEE Trans Softw Eng"},{"unstructured":"Halstead M H 1977. Elements of software science (Operating and programming systems series). Elsevier science inc","key":"11648_CR22"},{"key":"11648_CR23","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1016\/j.infsof.2015.09.001","volume":"69","author":"W Ma","year":"2016","unstructured":"Ma W, Chen L, Yang Y, Zhou Y, Xu B (2016) Empirical analysis of network measures for effort-aware fault-proneness prediction. Inf Softw Technol 69:50\u201370. https:\/\/doi.org\/10.1016\/j.infsof.2015.09.001","journal-title":"Inf Softw Technol"},{"key":"11648_CR24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2629648","volume":"23","author":"T Hall","year":"2014","unstructured":"Hall T, Zhang M, Bowes D, Sun Y (2014) Some code smells have a significant but small effect on faults. ACM Trans Softw Eng Methodol 23:1\u201339. https:\/\/doi.org\/10.1145\/2629648","journal-title":"ACM Trans Softw Eng Methodol"},{"doi-asserted-by":"publisher","unstructured":"Bowes D, Hall T, Harman M, Jia Y, Sarro F, Wu F 2016 Mutation-aware fault prediction. Proceedings of the 25th international symposium on software testing and analysis, New York, NY, USA: ACM, p 330\u201341 https:\/\/doi.org\/10.1145\/2931037.2931039","key":"11648_CR25","DOI":"10.1145\/2931037.2931039"},{"key":"11648_CR26","doi-asserted-by":"publisher","first-page":"476","DOI":"10.1109\/32.295895","volume":"20","author":"SR Chidamber","year":"1994","unstructured":"Chidamber SR, Kemerer CF (1994) A metrics suite for object oriented design. IEEE Trans Softw Eng 20:476\u2013493. https:\/\/doi.org\/10.1109\/32.295895","journal-title":"IEEE Trans Softw Eng"},{"key":"11648_CR27","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1109\/TSE.2007.70773","volume":"34","author":"S Kim","year":"2008","unstructured":"Kim S, Whitehead EJ, Zhang Y (2008) Classifying software changes: clean or buggy? IEEE Trans Software Eng 34:181\u2013196. https:\/\/doi.org\/10.1109\/TSE.2007.70773","journal-title":"IEEE Trans Software Eng"},{"key":"11648_CR28","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1007\/s41870-018-0211-3","volume":"11","author":"D Sharma","year":"2019","unstructured":"Sharma D, Chandra P (2019) A comparative analysis of soft computing techniques in software fault prediction model development. Int J Inf Technol 11:37\u201346. https:\/\/doi.org\/10.1007\/s41870-018-0211-3","journal-title":"Int J Inf Technol"},{"key":"11648_CR29","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1016\/j.infsof.2015.03.001","volume":"63","author":"HB Yadav","year":"2015","unstructured":"Yadav HB, Yadav DK (2015) A fuzzy logic based approach for phase-wise software defects prediction using software metrics. Inf Softw Technol 63:44\u201357. https:\/\/doi.org\/10.1016\/j.infsof.2015.03.001","journal-title":"Inf Softw Technol"},{"key":"11648_CR30","doi-asserted-by":"publisher","first-page":"260","DOI":"10.1016\/j.ins.2013.12.031","volume":"264","author":"G Czibula","year":"2014","unstructured":"Czibula G, Marian Z, Czibula IG (2014) Software defect prediction using relational association rule mining. Inf Sci (N Y) 264:260\u2013278. https:\/\/doi.org\/10.1016\/j.ins.2013.12.031","journal-title":"Inf Sci (N Y)"},{"doi-asserted-by":"publisher","unstructured":"Sharma T, Sangwan OP 2021. Sine-Cosine Algorithm for Software Fault Prediction. 2021 In: IEEE international conference on software maintenance and evolution (ICSME), IEEE p 701\u20136 https:\/\/doi.org\/10.1109\/ICSME52107.2021.00084","key":"11648_CR31","DOI":"10.1109\/ICSME52107.2021.00084"},{"key":"11648_CR32","doi-asserted-by":"publisher","first-page":"188","DOI":"10.14445\/22315381\/IJETT-V70I4P216","volume":"70","author":"Tamanna","year":"2022","unstructured":"Tamanna, Sangwan OP (2022) Random permutation-based hybrid feature selection for software bug prediction using bayesian statistical validation. Int j Eng Trend Technol 70:188\u2013202. https:\/\/doi.org\/10.14445\/22315381\/IJETT-V70I4P216","journal-title":"Int j Eng Trend Technol"},{"key":"11648_CR33","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1016\/j.knosys.2014.10.017","volume":"74","author":"G Abaei","year":"2015","unstructured":"Abaei G, Selamat A, Fujita H (2015) An empirical study based on semi-supervised hybrid self-organizing map for software fault prediction. Knowl-Based Syst 74:28\u201339. https:\/\/doi.org\/10.1016\/j.knosys.2014.10.017","journal-title":"Knowl-Based Syst"},{"key":"11648_CR34","doi-asserted-by":"publisher","first-page":"1","DOI":"10.4018\/IJOSSP.2019100101","volume":"10","author":"O Al Qasem","year":"2019","unstructured":"Al Qasem O, Akour M (2019) Software fault prediction using deep learning algorithms. Int J Open Source Softw Process 10:1\u201319. https:\/\/doi.org\/10.4018\/IJOSSP.2019100101","journal-title":"Int J Open Source Softw Process"},{"key":"11648_CR35","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1016\/j.neucom.2019.11.067","volume":"385","author":"L Qiao","year":"2020","unstructured":"Qiao L, Li X, Umer Q, Guo P (2020) Deep learning based software defect prediction. Neurocomputing 385:100\u2013110. https:\/\/doi.org\/10.1016\/j.neucom.2019.11.067","journal-title":"Neurocomputing"},{"key":"11648_CR36","doi-asserted-by":"publisher","first-page":"106131","DOI":"10.1016\/j.knosys.2020.106131","volume":"203","author":"AI Hammouri","year":"2020","unstructured":"Hammouri AI, Mafarja M, Al-Betar MA, Awadallah MA, Abu-Doush I (2020) An improved dragonfly algorithm for feature selection. Knowl-Based Syst 203:106131. https:\/\/doi.org\/10.1016\/j.knosys.2020.106131","journal-title":"Knowl-Based Syst"},{"key":"11648_CR37","doi-asserted-by":"publisher","first-page":"109737","DOI":"10.1016\/j.asoc.2022.109737","volume":"131","author":"K Thirumoorthy","year":"2022","unstructured":"Thirumoorthy K (2022) A feature selection model for software defect prediction using binary Rao optimization algorithm. Appl Soft Comput 131:109737. https:\/\/doi.org\/10.1016\/j.asoc.2022.109737","journal-title":"Appl Soft Comput"},{"doi-asserted-by":"publisher","unstructured":"Ni C, Xu X, Yang K, Lo D 2023 Boosting Just-in-Time Defect Prediction with Specific Features of C\/C++ Programming Languages in Code Changes. 2023 In: IEEE\/ACM 20th International Conference on Mining Software Repositories (MSR), IEEE, p 472\u201384. https:\/\/doi.org\/10.1109\/MSR59073.2023.00072","key":"11648_CR38","DOI":"10.1109\/MSR59073.2023.00072"},{"doi-asserted-by":"publisher","unstructured":"Li J, He P, Zhu J, Lyu MR 2017 Software defect prediction via convolutional neural network. Proceedings - 2017 In: IEEE International conference on software quality, Reliability and Security, QRS 2017, institute of electrical and electronics engineers inc, p 318\u201328. https:\/\/doi.org\/10.1109\/QRS.2017.42","key":"11648_CR39","DOI":"10.1109\/QRS.2017.42"},{"key":"11648_CR40","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1016\/J.ESWA.2018.12.033","volume":"122","author":"H Turabieh","year":"2019","unstructured":"Turabieh H, Mafarja M, Li X (2019) Iterated feature selection algorithms with layered recurrent neural network for software fault prediction. Expert Syst Appl 122:27\u201342. https:\/\/doi.org\/10.1016\/J.ESWA.2018.12.033","journal-title":"Expert Syst Appl"},{"key":"11648_CR41","doi-asserted-by":"publisher","first-page":"94","DOI":"10.1016\/j.infsof.2017.11.008","volume":"96","author":"H Tong","year":"2018","unstructured":"Tong H, Liu B, Wang S (2018) Software defect prediction using stacked denoising autoencoders and two-stage ensemble learning. Inf Softw Technol 96:94\u2013111. https:\/\/doi.org\/10.1016\/j.infsof.2017.11.008","journal-title":"Inf Softw Technol"},{"key":"11648_CR42","doi-asserted-by":"publisher","first-page":"1355","DOI":"10.1109\/TR.2020.2996261","volume":"69","author":"PR Bal","year":"2020","unstructured":"Bal PR, Kumar S (2020) Wr-elm: weighted regularization extreme learning machine for imbalance learning in software fault prediction. IEEE Trans Reliab 69:1355\u20131375. https:\/\/doi.org\/10.1109\/TR.2020.2996261","journal-title":"IEEE Trans Reliab"},{"key":"11648_CR43","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2019.113085","author":"SK Pandey","year":"2020","unstructured":"Pandey SK, Mishra RB, Tripathi AK (2020) BPDET: an effective software bug prediction model using deep representation and ensemble learning techniques. Expert Syst Appl. https:\/\/doi.org\/10.1016\/j.eswa.2019.113085","journal-title":"Expert Syst Appl"},{"key":"11648_CR44","doi-asserted-by":"publisher","first-page":"9847","DOI":"10.1007\/s10586-018-1696-z","volume":"22","author":"C Manjula","year":"2019","unstructured":"Manjula C, Florence L (2019) Deep neural network based hybrid approach for software defect prediction using software metrics. Cluster Comput 22:9847\u20139863. https:\/\/doi.org\/10.1007\/s10586-018-1696-z","journal-title":"Cluster Comput"},{"key":"11648_CR45","doi-asserted-by":"publisher","first-page":"419","DOI":"10.1016\/j.jksuci.2019.03.006","volume":"32","author":"W Rhmann","year":"2020","unstructured":"Rhmann W, Pandey B, Ansari G, Pandey DK (2020) Software fault prediction based on change metrics using hybrid algorithms: an empirical study. J King Saud Univ Comp Info Sci 32:419\u2013424. https:\/\/doi.org\/10.1016\/j.jksuci.2019.03.006","journal-title":"J King Saud Univ Comp Info Sci"},{"doi-asserted-by":"publisher","unstructured":"Tran HD, Hanh LTM, Binh NT 2019 Combining feature selection, feature learning and ensemble learning for software fault prediction. 2019 11th international conference on knowledge and systems engineering (KSE), IEEE, p 1\u20138. https:\/\/doi.org\/10.1109\/KSE.2019.8919292.","key":"11648_CR46","DOI":"10.1109\/KSE.2019.8919292"},{"key":"11648_CR47","doi-asserted-by":"publisher","first-page":"111026","DOI":"10.1016\/j.jss.2021.111026","volume":"180","author":"K Zhu","year":"2021","unstructured":"Zhu K, Ying S, Zhang N, Zhu D (2021) Software defect prediction based on enhanced metaheuristic feature selection optimization and a hybrid deep neural network. J Syst Softw 180:111026. https:\/\/doi.org\/10.1016\/j.jss.2021.111026","journal-title":"J Syst Softw"},{"doi-asserted-by":"publisher","unstructured":"Wang H, Khoshgoftaar TM, Napolitano A 2010 A Comparative study of ensemble feature selection techniques for software defect prediction. 2010 Ninth international conference on machine learning and applications, IEEE, p 135\u201340. https:\/\/doi.org\/10.1109\/ICMLA.2010.27","key":"11648_CR48","DOI":"10.1109\/ICMLA.2010.27"},{"doi-asserted-by":"publisher","unstructured":"Liu S, Chen X, Liu W, Chen J, Gu Q, Chen D 2014. FECAR: A feature selection framework for software defect prediction. 2014 IEEE 38th annual computer software and applications conference, IEEE, p 426\u201335. https:\/\/doi.org\/10.1109\/COMPSAC.2014.66","key":"11648_CR49","DOI":"10.1109\/COMPSAC.2014.66"},{"key":"11648_CR50","doi-asserted-by":"publisher","first-page":"2764","DOI":"10.3390\/app9132764","volume":"9","author":"AO Balogun","year":"2019","unstructured":"Balogun AO, Basri S, Abdulkadir SJ, Hashim AS (2019) Performance analysis of feature selection methods in software defect prediction: a search method approach. Appl Sci 9:2764. https:\/\/doi.org\/10.3390\/app9132764","journal-title":"Appl Sci"},{"doi-asserted-by":"publisher","unstructured":"Brarda S, Yeres P, Bowman SR 2017 Sequential attention: a context-aware alignment function for machine reading 75\u201380 https:\/\/doi.org\/10.18653\/V1\/W17-2610","key":"11648_CR51","DOI":"10.18653\/V1\/W17-2610"},{"unstructured":"Vaswani A, Brain G, Shazeer N, Parmar N, Uszkoreit J, Jones L et al 2017 attention is all you need. Adv Neural Inf Process Syst 30","key":"11648_CR52"},{"doi-asserted-by":"publisher","unstructured":"Martins AFT, Astudillo RF 2016. From softmax to sparsemax: a sparse model of attention and multi-label classification. 33rd international conference on machine learning, ICML 4 2432\u201343. https:\/\/doi.org\/10.48550\/arxiv.1602.02068","key":"11648_CR53","DOI":"10.48550\/arxiv.1602.02068"},{"doi-asserted-by":"publisher","unstructured":"Dauphin YN, Fan A, Auli M, Grangier D 2016 Language modeling with gated convolutional networks. 34th international conference on machine learning, ICML 2017 2 1551\u20139. https:\/\/doi.org\/10.48550\/arxiv.1612.08083","key":"11648_CR54","DOI":"10.48550\/arxiv.1612.08083"},{"doi-asserted-by":"publisher","unstructured":"Gehring J, Auli M, Grangier D, Yarats D, Dauphin YN 2017 Convolutional sequence to sequence learning. 34th international conference on machine learning, ICML 2017. 3 2029\u201342. https:\/\/doi.org\/10.48550\/arxiv.1705.03122.","key":"11648_CR55","DOI":"10.48550\/arxiv.1705.03122"},{"doi-asserted-by":"publisher","unstructured":"Smith LN, Topin N 2017 Super-convergence: Very fast training of neural networks using large learning rates 36. https:\/\/doi.org\/10.48550\/arxiv.1708.07120","key":"11648_CR56","DOI":"10.48550\/arxiv.1708.07120"},{"doi-asserted-by":"publisher","unstructured":"Smith LN 2018 A disciplined approach to neural network hyper-parameters: Part 1\u2013learning rate, batch size, momentum, and weight decay. https:\/\/doi.org\/10.48550\/arxiv.1803.09820.","key":"11648_CR57","DOI":"10.48550\/arxiv.1803.09820"},{"doi-asserted-by":"publisher","unstructured":"Smith LN 2015. Cyclical learning rates for training neural networks. Proceedings\u20132017, IEEE Winter conference on applications of computer vision, WACV 2017. 464\u201372. https:\/\/doi.org\/10.48550\/arxiv.1506.01186","key":"11648_CR58","DOI":"10.48550\/arxiv.1506.01186"},{"key":"11648_CR59","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1103\/PhysRevE.69.066138","volume":"69","author":"A Kraskov","year":"2004","unstructured":"Kraskov A, St\u00f6gbauer H, Grassberger P (2004) Estimating mutual information. Phys Rev E 69:16. https:\/\/doi.org\/10.1103\/PhysRevE.69.066138","journal-title":"Phys Rev E"},{"key":"11648_CR60","doi-asserted-by":"publisher","first-page":"87","DOI":"10.2307\/2340521","volume":"85","author":"RA Fisher","year":"1922","unstructured":"Fisher RA (1922) On the interpretation of \u03c7 2 from contingency tables, and the calculation of P. J Roy Stat Soc 85:87. https:\/\/doi.org\/10.2307\/2340521","journal-title":"J Roy Stat Soc"},{"key":"11648_CR61","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v033.i01","volume":"33","author":"J Friedman","year":"2010","unstructured":"Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33:1\u201322","journal-title":"J Stat Softw"},{"key":"11648_CR62","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1023\/A:1012487302797","volume":"46","author":"I Guyon","year":"2002","unstructured":"Guyon I, Weston J, Barnhill S, Vapnik V (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46:389\u2013422. https:\/\/doi.org\/10.1023\/A:1012487302797","journal-title":"Mach Learn"},{"key":"11648_CR63","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1017\/S0080456800012163","volume":"52","author":"RA Fisher","year":"1919","unstructured":"Fisher RA (1919) The correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edinb 52:399\u2013433. https:\/\/doi.org\/10.1017\/S0080456800012163","journal-title":"Trans R Soc Edinb"},{"doi-asserted-by":"publisher","unstructured":"Haibo He, Yang Bai, Garcia EA, Shutao Li 2008 ADASYN: Adaptive synthetic sampling approach for imbalanced learning. 2008 IEEE international joint conference on neural networks (IEEE World congress on computational intelligence), IEEE, p 1322\u20138. https:\/\/doi.org\/10.1109\/IJCNN.2008.4633969","key":"11648_CR64","DOI":"10.1109\/IJCNN.2008.4633969"},{"key":"11648_CR65","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/1082983.1083147","volume":"30","author":"J \u015aliwerski","year":"2005","unstructured":"\u015aliwerski J, Zimmermann T, Zeller A (2005) When do changes induce fixes? ACM SIGSOFT Softw Eng Notes 30:1\u20135. https:\/\/doi.org\/10.1145\/1082983.1083147","journal-title":"ACM SIGSOFT Softw Eng Notes"},{"doi-asserted-by":"publisher","unstructured":"Williams C, Spacco J 2008 SZZ revisited. Proceedings of the 2008 workshop on Defects in large software systems, New York, NY, USA: ACM, p 32\u20136. https:\/\/doi.org\/10.1145\/1390817.1390826","key":"11648_CR66","DOI":"10.1145\/1390817.1390826"},{"key":"11648_CR67","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1007\/s10664-011-9173-9","volume":"17","author":"M D\u2019Ambros","year":"2012","unstructured":"D\u2019Ambros M, Lanza M, Robbes R (2012) Evaluating defect prediction approaches: a benchmark and an extensive comparison. Empir Softw Eng 17:531\u2013577. https:\/\/doi.org\/10.1007\/s10664-011-9173-9","journal-title":"Empir Softw Eng"},{"key":"11648_CR68","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L (2001) Random forests. Mach Learn 45:5\u201332. https:\/\/doi.org\/10.1023\/A:1010933404324","journal-title":"Mach Learn"},{"issue":"24","key":"11648_CR69","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1007\/BF00058655","volume":"2","author":"L Breiman","year":"1996","unstructured":"Breiman L (1996) Bagging predictors. Mach Learn 2(24):123\u2013140. https:\/\/doi.org\/10.1007\/BF00058655","journal-title":"Mach Learn"},{"doi-asserted-by":"crossref","unstructured":"Breiman L, Friedman J H, Olshen RA, Stone CJ 2017 Classification and regression trees 1\u2013358","key":"11648_CR70","DOI":"10.1201\/9781315139470-1"},{"key":"11648_CR71","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1142\/S0218001405003983","volume":"19","author":"H Zhang","year":"2011","unstructured":"Zhang H (2011) Exploring conditions for the optimality of Naive Bayes. Int J Pattern Recognit Artif Intell 19:183\u2013198. https:\/\/doi.org\/10.1142\/S0218001405003983","journal-title":"Int J Pattern Recognit Artif Intell"},{"key":"11648_CR72","doi-asserted-by":"publisher","DOI":"10.1145\/1961189.1961199","author":"CC Chang","year":"2011","unstructured":"Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol. https:\/\/doi.org\/10.1145\/1961189.1961199","journal-title":"ACM Trans Intell Syst Technol"},{"key":"11648_CR73","doi-asserted-by":"publisher","first-page":"175","DOI":"10.2307\/2685209","volume":"46","author":"NS Altman","year":"1992","unstructured":"Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46:175. https:\/\/doi.org\/10.2307\/2685209","journal-title":"Am Stat"},{"key":"11648_CR74","doi-asserted-by":"publisher","first-page":"1189","DOI":"10.1214\/AOS\/1013203451","volume":"29","author":"JH Friedman","year":"2001","unstructured":"Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29:1189\u20131232. https:\/\/doi.org\/10.1214\/AOS\/1013203451","journal-title":"Ann Stat"},{"unstructured":"Benavoli A, Corani G, Mangili F, Zaffalon M, Ruggeri F 2014 A Bayesian Wilcoxon signed-rank test based on the Dirichlet process. International conference on machine learning, PMLR, p 1026\u201334","key":"11648_CR75"},{"doi-asserted-by":"crossref","unstructured":"Bernardo, J M, Smith A F M 1994 Bayesian theory, vol 586. New York: Wiley","key":"11648_CR76","DOI":"10.1002\/9780470316870"},{"key":"11648_CR77","doi-asserted-by":"publisher","first-page":"1","DOI":"10.48550\/arxiv.1606.04316","volume":"18","author":"A Benavoli","year":"2016","unstructured":"Benavoli A, Corani G, Dem\u0161ar J, Zaffalon M (2016) Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis. J Mach Learn Res 18:1\u201336. https:\/\/doi.org\/10.48550\/arxiv.1606.04316","journal-title":"J Mach Learn Res"},{"key":"11648_CR78","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1093\/biomet\/20A.1-2.175","volume":"20A","author":"J Neyman","year":"1928","unstructured":"Neyman J, Pearson ES (1928) On the use and interpretation of certain test criteria for purposes of statistical inference part I. Biometrika 20A:175\u2013240. https:\/\/doi.org\/10.1093\/biomet\/20A.1-2.175","journal-title":"Biometrika"},{"key":"11648_CR79","doi-asserted-by":"publisher","first-page":"111","DOI":"10.2307\/271063","volume":"25","author":"AE Raftery","year":"1995","unstructured":"Raftery AE (1995) Bayesian model selection in social research. Sociol Methodol 25:111. https:\/\/doi.org\/10.2307\/271063","journal-title":"Sociol Methodol"},{"key":"11648_CR80","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1080\/00031305.2016.1154108","volume":"70","author":"RL Wasserstein","year":"2016","unstructured":"Wasserstein RL, Lazar NA (2016) The ASA\u2019s statement on p-values: context, process, and purpose. Am Stat 70:129\u2013133. https:\/\/doi.org\/10.1080\/00031305.2016.1154108","journal-title":"Am Stat"},{"key":"11648_CR81","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2018.00699","author":"D Trafimow","year":"2018","unstructured":"Trafimow D, Amrhein V, Areshenkoff CN, Barrera-Causil CJ, Beh EJ, Bilgi\u00e7 YK (2018) Manipulating the alpha level cannot cure significance testing. Front Psychol. https:\/\/doi.org\/10.3389\/fpsyg.2018.00699","journal-title":"Front Psychol"},{"issue":"2","key":"11648_CR82","doi-asserted-by":"publisher","first-page":"209","DOI":"10.1214\/AOS\/1176342360","volume":"1","author":"TS Ferguson","year":"1973","unstructured":"Ferguson TS (1973) A bayesian analysis of some nonparametric problems. Ann Stat 1(2):209\u2013230. https:\/\/doi.org\/10.1214\/AOS\/1176342360","journal-title":"Ann Stat"},{"key":"11648_CR83","doi-asserted-by":"publisher","first-page":"861","DOI":"10.1016\/J.PATREC.2005.10.010","volume":"27","author":"T Fawcett","year":"2006","unstructured":"Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27:861\u2013874. https:\/\/doi.org\/10.1016\/J.PATREC.2005.10.010","journal-title":"Pattern Recogn Lett"},{"unstructured":"Anaconda | The World\u2019s Most Popular Data Science Platform n.d. https:\/\/www.anaconda.com\/. Accessed September 7 2023","key":"11648_CR84"},{"key":"11648_CR85","doi-asserted-by":"publisher","first-page":"2825","DOI":"10.48550\/arxiv.1201.0490","volume":"12","author":"F Pedregosa","year":"2012","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O (2012) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825\u20132830. https:\/\/doi.org\/10.48550\/arxiv.1201.0490","journal-title":"J Mach Learn Res"},{"key":"11648_CR86","doi-asserted-by":"publisher","first-page":"108","DOI":"10.3390\/info11020108","volume":"11","author":"J Howard","year":"2020","unstructured":"Howard J, Gugger S (2020) Fastai: a layered API for deep learning. Information 11:108. https:\/\/doi.org\/10.3390\/info11020108","journal-title":"Information"},{"doi-asserted-by":"publisher","unstructured":"Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, et al 2019 PyTorch: an imperative style, high-performance deep learning library. Adv Neural Inf Process Syst 32. https:\/\/doi.org\/10.48550\/arxiv.1912.01703","key":"11648_CR87","DOI":"10.48550\/arxiv.1912.01703"},{"key":"11648_CR88","doi-asserted-by":"publisher","first-page":"1","DOI":"10.48550\/arxiv.1609.06570","volume":"18","author":"G Lema\u00eetre","year":"2016","unstructured":"Lema\u00eetre G, Nogueira F, Aridas CK (2016) Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning. J Mach Learn Res 18:1\u20135. https:\/\/doi.org\/10.48550\/arxiv.1609.06570","journal-title":"J Mach Learn Res"},{"doi-asserted-by":"publisher","unstructured":"Shapiro A 2003 Monte carlo sampling methods, p 353\u2013425. https:\/\/doi.org\/10.1016\/S0927-0507(03)10006-0.","key":"11648_CR89","DOI":"10.1016\/S0927-0507(03)10006-0"},{"key":"11648_CR90","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TSE.2016.2584050","volume":"43","author":"C Tantithamthavorn","year":"2017","unstructured":"Tantithamthavorn C, McIntosh S, Hassan AE, Matsumoto K (2017) An empirical comparison of model validation techniques for defect prediction models. IEEE Trans Softw Eng 43:1\u201318. https:\/\/doi.org\/10.1109\/TSE.2016.2584050","journal-title":"IEEE Trans Softw Eng"},{"key":"11648_CR91","doi-asserted-by":"publisher","first-page":"683","DOI":"10.1109\/TSE.2018.2794977","volume":"45","author":"C Tantithamthavorn","year":"2019","unstructured":"Tantithamthavorn C, McIntosh S, Hassan AE, Matsumoto K (2019) The impact of automated parameter optimization on defect prediction models. IEEE Trans Softw Eng 45:683\u2013711. https:\/\/doi.org\/10.1109\/TSE.2018.2794977","journal-title":"IEEE Trans Softw Eng"},{"unstructured":"torch.optim\u2014PyTorch 2.0 documentation n.d. https:\/\/pytorch.org\/docs\/stable\/optim.html. Accessed September 7 2023","key":"11648_CR92"},{"key":"11648_CR93","doi-asserted-by":"publisher","first-page":"1283","DOI":"10.1109\/TSMCA.2009.2027131","volume":"39","author":"C Seiffert","year":"2009","unstructured":"Seiffert C, Khoshgoftaar TM, Van Hulse J (2009) Improving software-quality predictions with data sampling and boosting. IEEE Trans Syst Man Cybern Part A Syst Hum 39:1283\u20131294. https:\/\/doi.org\/10.1109\/TSMCA.2009.2027131","journal-title":"IEEE Trans Syst Man Cybern Part A Syst Hum"},{"doi-asserted-by":"publisher","unstructured":"Peters F, Menzies T, Marcus A 2013 Better cross company defect prediction. In: IEEE international working conference on mining software repositories 409\u201318. https:\/\/doi.org\/10.1109\/MSR.2013.6624057","key":"11648_CR94","DOI":"10.1109\/MSR.2013.6624057"},{"key":"11648_CR95","doi-asserted-by":"publisher","first-page":"375","DOI":"10.1007\/s10515-010-0069-5","volume":"17","author":"T Menzies","year":"2010","unstructured":"Menzies T, Milton Z, Turhan B, Cukic B, Jiang Y, Bener A (2010) Defect prediction from static code features: current results, limitations, new approaches. Autom Softw Eng 17:375\u2013407. https:\/\/doi.org\/10.1007\/s10515-010-0069-5","journal-title":"Autom Softw Eng"}],"container-title":["Neural Computing and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-025-11648-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00521-025-11648-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-025-11648-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T20:15:51Z","timestamp":1761077751000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00521-025-11648-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,24]]},"references-count":95,"journal-issue":{"issue":"32","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["11648"],"URL":"https:\/\/doi.org\/10.1007\/s00521-025-11648-x","relation":{},"ISSN":["0941-0643","1433-3058"],"issn-type":[{"type":"print","value":"0941-0643"},{"type":"electronic","value":"1433-3058"}],"subject":[],"published":{"date-parts":[[2025,9,24]]},"assertion":[{"value":"12 September 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 September 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 September 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Authors do not have any competing interests with anybody. Disclaimer: The authors used \u00a0AI-assisted tool like GRAMMARLY\u00a0for improving English or enhancing language accuracy, including spelling, grammar, and punctuation. The author accepts full responsibility for the originality of the final content of this publication.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}