{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,6]],"date-time":"2026-04-06T15:24:06Z","timestamp":1775489046702,"version":"3.50.1"},"reference-count":53,"publisher":"Walter de Gruyter GmbH","issue":"2","license":[{"start":{"date-parts":[[2022,12,1]],"date-time":"2022-12-01T00:00:00Z","timestamp":1669852800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>In software development, defects influence the quality and cost in an undesirable way. Software defect prediction (SDP) is one of the techniques which improves the software quality and testing efficiency by early identification of defects(bug\/fault\/error). Thus, several experiments have been suggested for defect prediction (DP) techniques. Mainly DP method utilises historical project data for constructing prediction models. SDP performs well within projects until there is an adequate amount of data accessible to train the models. However, if the data are inadequate or limited for the same project, the researchers mainly use Cross-Project Defect Prediction (CPDP). CPDP is a possible alternative option that refers to anticipating defects using prediction models built on historical data from other projects. CPDP is challenging due to its data distribution and domain difference problem. The proposed framework is an effective two-stage approach for CPDP, i.e., model generation and prediction process. In model generation phase, the conglomeration of different pre-processing, including feature selection and class reweights technique, is used to improve the initial data quality. Finally, a fine-tuned efficient bagging and boosting based hybrid ensemble model is developed, which avoids model over -fitting\/under-fitting and helps enhance the prediction performance. In the prediction process phase, the generated model predicts the historical data from other projects, which has defects or clean. The framework is evaluated using25 software projects obtained from public repositories. The result analysis shows that the proposed model has achieved a 0.71\u00b10.03 f1-score, which significantly improves the state-of-the-art approaches by 23 % to 60 %.<\/jats:p>","DOI":"10.2478\/acss-2022-0015","type":"journal-article","created":{"date-parts":[[2023,1,24]],"date-time":"2023-01-24T11:34:22Z","timestamp":1674560062000},"page":"137-148","source":"Crossref","is-referenced-by-count":3,"title":["Cross-Project Defect Prediction with Metrics Selection and Balancing Approach"],"prefix":"10.2478","volume":"27","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8846-7518","authenticated-orcid":false,"given":"Meetesh","family":"Nevendra","sequence":"first","affiliation":[{"name":"Department of Computer Science & Engineering , National Institute of Technology , Raipur , India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2806-8432","authenticated-orcid":false,"given":"Pradeep","family":"Singh","sequence":"additional","affiliation":[{"name":"Department of Computer Science & Engineering , National Institute of Technology , Raipur , India"}]}],"member":"374","published-online":{"date-parts":[[2023,1,24]]},"reference":[{"key":"2025101108193314049_j_acss-2022-0015_ref_001","doi-asserted-by":"crossref","unstructured":"[1] T. Menzies, B. Turhan, A. Bener, G. Gay, B. Cukic, and Y. Jiang, \u201cImplications of ceiling effects in defect predictors,\u201d in Proceedings of the 4th international workshop on Predictor models in software engineering - PROMISE \u201908, 2008, pp. 47\u201354. https:\/\/doi.org\/10.1145\/1370788.1370801","DOI":"10.1145\/1370788.1370801"},{"key":"2025101108193314049_j_acss-2022-0015_ref_002","doi-asserted-by":"crossref","unstructured":"[2] M. Nevendra and P. Singh, \u201cSoftware defect prediction using deep learning,\u201d Acta Polytech. Hungarica, vol. 18, no. 10, pp. 173\u2013189, 2021. https:\/\/doi.org\/10.12700\/aph.18.10.2021.10.9","DOI":"10.12700\/APH.18.10.2021.10.9"},{"key":"2025101108193314049_j_acss-2022-0015_ref_003","doi-asserted-by":"crossref","unstructured":"[3] M. Shepperd, D. Bowes, and T. Hall, \u201cResearcher Bias: The use of machine learning in software defect prediction,\u201d IEEE Trans. Softw. Eng., vol. 40, no. 6, pp. 603\u2013616, Jun. 2014. https:\/\/doi.org\/10.1109\/TSE.2014.2322358","DOI":"10.1109\/TSE.2014.2322358"},{"key":"2025101108193314049_j_acss-2022-0015_ref_004","unstructured":"[4] T. Wang, W. Li, H. Shi, and Z. Liu, \u201cSoftware defect prediction based on classifiers ensemble,\u201d J. Inf. C. Sci., vol. 8, no. 16, pp. 4241\u20134254, 2011."},{"key":"2025101108193314049_j_acss-2022-0015_ref_005","doi-asserted-by":"crossref","unstructured":"[5] J. Nam, S. J. Pan, and S. Kim, \u201cTransfer defect learning,\u201d in 2013 35th International Conference on Software Engineering (ICSE), San Francisco, CA, USA, May 2013, pp. 382\u2013391. https:\/\/doi.org\/10.1109\/ICSE.2013.6606584","DOI":"10.1109\/ICSE.2013.6606584"},{"key":"2025101108193314049_j_acss-2022-0015_ref_006","doi-asserted-by":"crossref","unstructured":"[6] X. Xia, D. Lo, S. J. Pan, N. Nagappan, and X. Wang, \u201cHYDRA: Massively compositional model for cross-project defect prediction,\u201d IEEE Trans. Softw. Eng., vol. 42, no. 10, pp. 977\u2013998, Oct. 2016. https:\/\/doi.org\/10.1109\/TSE.2016.2543218","DOI":"10.1109\/TSE.2016.2543218"},{"key":"2025101108193314049_j_acss-2022-0015_ref_007","doi-asserted-by":"crossref","unstructured":"[7] Z. He, F. Shu, Y. Yang, M. Li, and Q. Wang, \u201cAn investigation on the feasibility of cross-project defect prediction,\u201d Autom. Softw. Eng., vol. 19, no. 2, pp. 167\u2013199, Jul. 2012.https:\/\/doi.org\/10.1007\/s10515-011-0090-3","DOI":"10.1007\/s10515-011-0090-3"},{"key":"2025101108193314049_j_acss-2022-0015_ref_008","doi-asserted-by":"crossref","unstructured":"[8] H. Chen, X. Jing, Z. Li, D. Wu, Y. Peng, and Z. Huang, \u201cAn empirical study on heterogeneous defect prediction approaches,\u201d IEEE Trans. Softw. Eng., vol. 47, no. 12, pp. 2803\u20132822, Jan. 2020. https:\/\/doi.org\/10.1109\/TSE.2020.2968520","DOI":"10.1109\/TSE.2020.2968520"},{"key":"2025101108193314049_j_acss-2022-0015_ref_009","doi-asserted-by":"crossref","unstructured":"[9] H. Turabieh, M. Mafarja, and X. Li, \u201cIterated feature selection algorithms with layered recurrent neural network for software fault prediction,\u201d Expert Syst. Appl., vol. 122, pp. 27\u201342, May 2019. https:\/\/doi.org\/10.1016\/j.eswa.2018.12.033","DOI":"10.1016\/j.eswa.2018.12.033"},{"key":"2025101108193314049_j_acss-2022-0015_ref_010","doi-asserted-by":"crossref","unstructured":"[10] T. Hall, S. Beecham, D. Bowes, D. Gray, and S. Counsell, \u201cA systematic literature review on fault prediction performance in software engineering,\u201d IEEE Trans. Softw. Eng., vol. 38, no. 6, pp. 1276\u20131304, Oct. 2012. https:\/\/doi.org\/10.1109\/TSE.2011.103","DOI":"10.1109\/TSE.2011.103"},{"key":"2025101108193314049_j_acss-2022-0015_ref_011","doi-asserted-by":"crossref","unstructured":"[11] S. Hosseini, B. Turhan, and D. Gunarathna, \u201cA systematic literature review and meta-analysis on cross project defect prediction,\u201d IEEE Trans. Softw. Eng., vol. 45, no. 2, pp. 111\u2013147, Nov. 2019. https:\/\/doi.org\/10.1109\/TSE.2017.2770124","DOI":"10.1109\/TSE.2017.2770124"},{"key":"2025101108193314049_j_acss-2022-0015_ref_012","unstructured":"[12] F.J. Provost, \u201cMachine learning from imbalanced data sets 101,\u201d inProc. AAAI\u20192000 Workshop onImbalanced Data Sets, 2000, pp. 1\u20133."},{"key":"2025101108193314049_j_acss-2022-0015_ref_013","doi-asserted-by":"crossref","unstructured":"[13] D. Ryu, J. I. Jang, and J. Baik, \u201cA transfer cost-sensitive boosting approach for cross-project defect prediction,\u201d Softw. Qual. J., vol. 25, no. 1, pp. 235\u2013272, Sep. 2017. https:\/\/doi.org\/10.1007\/s11219-015-9287-1","DOI":"10.1007\/s11219-015-9287-1"},{"key":"2025101108193314049_j_acss-2022-0015_ref_014","doi-asserted-by":"crossref","unstructured":"[14] S. Wang and X. Yao, \u201cUsing class imbalance learning for software defect prediction,\u201d IEEE Trans. Reliab., vol. 62, no. 2, pp. 434\u2013443, Apr. 2013. https:\/\/doi.org\/10.1109\/TR.2013.2259203","DOI":"10.1109\/TR.2013.2259203"},{"key":"2025101108193314049_j_acss-2022-0015_ref_015","doi-asserted-by":"crossref","unstructured":"[15] N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, \u201cSMOTE: Synthetic minority over-sampling technique,\u201d J. Artif. Intell. Res., vol. 16, pp. 321\u2013357, Jun. 2002. https:\/\/doi.org\/10.1613\/jair.953","DOI":"10.1613\/jair.953"},{"key":"2025101108193314049_j_acss-2022-0015_ref_016","doi-asserted-by":"crossref","unstructured":"[16] H. He, Y. Bai, E. A. Garcia, and S. Li, \u201cADASYN: Adaptive synthetic sampling approach for imbalanced learning,\u201d inProc. Int. Jt. Conf. Neural Networks, Hong Kong, Jun. 2008, pp. 1322\u20131328. https:\/\/doi.org\/10.1109\/IJCNN.2008.4633969","DOI":"10.1109\/IJCNN.2008.4633969"},{"key":"2025101108193314049_j_acss-2022-0015_ref_017","doi-asserted-by":"crossref","unstructured":"[17] A. Agrawal and T. Menzies, \u201cIs \u2018better data\u2019 better than \u2018better data miners\u2019?: on the benefits of tuning SMOTE for defect prediction,\u201d in2018 IEEE\/ACM 40th Int. Conf. Softw. Eng., May 2018, pp. 1050\u20131061. https:\/\/doi.org\/10.1145\/3180155.3180197","DOI":"10.1145\/3180155.3180197"},{"key":"2025101108193314049_j_acss-2022-0015_ref_018","unstructured":"[18] C. Tantithamthavorn and A. E. Hassan, \u201cThe impact of class rebalancing techniques on the performance and interpretation of defect prediction models,\u201d IEEE Trans. Softw. Eng., 2018. [Online]\/Available: chrome-extension:\/\/efaidnbmnnnibpcajpcglclefindmkaj\/https:\/\/chakkrit.com\/assets\/papers\/tantithamthavorn2018imbalance.pdf"},{"key":"2025101108193314049_j_acss-2022-0015_ref_019","doi-asserted-by":"crossref","unstructured":"[19] S. S. Rathore and S. Kumar, \u201cLinear and non-linear heterogeneous ensemble methods to predict the number of faults in software systems,\u201d Knowledge-Based Syst., vol. 119, pp. 232\u2013256, Mar. 2017. https:\/\/doi.org\/10.1016\/j.knosys.2016.12.017","DOI":"10.1016\/j.knosys.2016.12.017"},{"key":"2025101108193314049_j_acss-2022-0015_ref_020","doi-asserted-by":"crossref","unstructured":"[20] S. S. Rathore and S. Kumar, \u201cAn approach for the prediction of number of software faults based on the dynamic selection of learning techniques,\u201d IEEE Trans. Reliab., vol. 68, no. 1, pp. 216\u2013236, Aug. 2018. https:\/\/doi.org\/10.1109\/TR.2018.2864206","DOI":"10.1109\/TR.2018.2864206"},{"key":"2025101108193314049_j_acss-2022-0015_ref_021","doi-asserted-by":"crossref","unstructured":"[21] T. G. Dietterich, \u201cEnsemble methods in machine learning,\u201d in Multiple Classifier Systems, MCS 2000. Lecture Notes in Computer Science, vol 1857. Springer, Berlin, Heidelberg, Jan. 2000, pp. 1\u201315. https:\/\/doi.org\/10.1007\/3-540-45014-9_1","DOI":"10.1007\/3-540-45014-9_1"},{"key":"2025101108193314049_j_acss-2022-0015_ref_022","doi-asserted-by":"crossref","unstructured":"[22] Q. Song, Z. Jia, M. Shepperd, S. Ying, and J. Liu, \u201cA general software defect-proneness prediction framework,\u201d IEEE Trans. Softw. Eng., vol. 37, no. 3, pp. 356\u2013370, 2011. https:\/\/doi.org\/10.1109\/TSE.2010.90","DOI":"10.1109\/TSE.2010.90"},{"key":"2025101108193314049_j_acss-2022-0015_ref_023","doi-asserted-by":"crossref","unstructured":"[23] X. Rong, F. Li, and Z. Cui, \u201cA model for software defect prediction using support vector machine based on CBA,\u201d Int. J. Intell. Syst. Technol. Appl., vol. 15, no. 1, pp. 19\u201334, Apr. 2016. https:\/\/doi.org\/10.1504\/IJISTA.2016.076102","DOI":"10.1504\/IJISTA.2016.076102"},{"key":"2025101108193314049_j_acss-2022-0015_ref_024","doi-asserted-by":"crossref","unstructured":"[24] K. Dejaeger, T. Verbraken, and B. Baesens, \u201cToward comprehensible software fault prediction models using bayesian network classifiers,\u201d IEEE Trans. Softw. Eng., vol. 39, no. 2, pp. 237\u2013257, Apr. 2013. https:\/\/doi.org\/10.1109\/TSE.2012.20","DOI":"10.1109\/TSE.2012.20"},{"key":"2025101108193314049_j_acss-2022-0015_ref_025","doi-asserted-by":"crossref","unstructured":"[25] T. Zimmermann, N. Nagappan, H. Gall, E. Giger, and B. Murphy, \u201cCross-project defect prediction: a large scale experiment on data vs. domain vs. process,\u201d in Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering,Aug. 2009, pp. 91\u2013100. https:\/\/doi.org\/10.1145\/1595696.1595713","DOI":"10.1145\/1595696.1595713"},{"key":"2025101108193314049_j_acss-2022-0015_ref_026","doi-asserted-by":"crossref","unstructured":"[26] B. Turhan, T. Menzies, A. B. Bener, and J. Di Stefano, \u201cOn the relative value of cross-company and within-company data for defect prediction,\u201d Empir. Softw. Eng., vol. 14, pp. 540\u2013578, Jan. 2009. https:\/\/doi.org\/10.1007\/s10664-008-9103-7","DOI":"10.1007\/s10664-008-9103-7"},{"key":"2025101108193314049_j_acss-2022-0015_ref_027","doi-asserted-by":"crossref","unstructured":"[27] B. Turhan, A. T. Misirli, and A. Bener, \u201cEmpirical evaluation of the effects of mixed project data on learning defect predictors,\u201d Inf. Softw. Technol., vol. 55, no. 6, pp. 1101\u20131118, Jun. 2013. https:\/\/doi.org\/10.1016\/j.infsof.2012.10.003","DOI":"10.1016\/j.infsof.2012.10.003"},{"key":"2025101108193314049_j_acss-2022-0015_ref_028","unstructured":"[28] P. He, Y. Ma, and B. Li, \u201cTDSelector: a training data selection method for cross-project defect prediction,\u201d 2016.[Online]. Available: https:\/\/arxiv.org\/ftp\/arxiv\/papers\/1612\/1612.09065.pdf"},{"key":"2025101108193314049_j_acss-2022-0015_ref_029","doi-asserted-by":"crossref","unstructured":"[29] S. Herbold, A. Trautsch, and J. Grabowski, \u201cA comparative study to benchmark cross-project defect prediction approaches,\u201d IEEE Trans. Softw. Eng., vol. 44, no. 9, pp. 811\u2013833, Sep. 2018. https:\/\/doi.org\/10.1109\/TSE.2017.2724538","DOI":"10.1109\/TSE.2017.2724538"},{"key":"2025101108193314049_j_acss-2022-0015_ref_030","doi-asserted-by":"crossref","unstructured":"[30] J. Nam, W. Fu, S. Kim, T. Menzies, and L. Tan, \u201cHeterogeneous defect prediction,\u201d IEEE Trans. Softw. Eng., vol. 44, no. 9, pp. 874\u2013896, Sep. 2018. https:\/\/doi.org\/10.1109\/TSE.2017.2720603","DOI":"10.1109\/TSE.2017.2720603"},{"key":"2025101108193314049_j_acss-2022-0015_ref_031","doi-asserted-by":"crossref","unstructured":"[31] P. He, B. Li, X. Liu, J. Chen, and Y. Ma, \u201cAn empirical study on software defect prediction with a simplified metric set,\u201d Inf. Softw. Technol., vol. 59, pp. 170\u2013190, Mar. 2015. https:\/\/doi.org\/10.1016\/j.infsof.2014.11.006","DOI":"10.1016\/j.infsof.2014.11.006"},{"key":"2025101108193314049_j_acss-2022-0015_ref_032","doi-asserted-by":"crossref","unstructured":"[32] Y. Zhang, D. Lo, X. Xia, and J. Sun, \u201cAn empirical study of classifier combination for cross-project defect prediction,\u201d in 2015 IEEE 39th Annual Computer Software and Applications Conference, vol. 2, Taichung, Taiwan, Jul. 2015, pp. 264\u2013269. https:\/\/doi.org\/10.1109\/COMPSAC.2015.58","DOI":"10.1109\/COMPSAC.2015.58"},{"key":"2025101108193314049_j_acss-2022-0015_ref_033","doi-asserted-by":"crossref","unstructured":"[33] C. Ni, W. S. Liu, X. Chen, Q. Gu, D. X. Chen, and Q. G. Huang, \u201cA cluster based feature selection method for cross-project software defect prediction,\u201d J. Comput. Sci. Technol., vol. 32, no. 6, pp. 1090\u20131107, Dec. 2017. https:\/\/doi.org\/10.1007\/s11390-017-1785-0","DOI":"10.1007\/s11390-017-1785-0"},{"key":"2025101108193314049_j_acss-2022-0015_ref_034","doi-asserted-by":"crossref","unstructured":"[34] B. Turhan, \u201cOn the dataset shift problem in software engineering prediction models,\u201d Empir. Softw. Eng., vol. 17, no. 1\u20132, pp. 62\u201374, Oct. 2012. https:\/\/doi.org\/10.1007\/s10664-011-9182-8","DOI":"10.1007\/s10664-011-9182-8"},{"key":"2025101108193314049_j_acss-2022-0015_ref_035","doi-asserted-by":"crossref","unstructured":"[35] Q. Song, Y. Guo, and M. Shepperd, \u201cA comprehensive investigation of the role of imbalanced learning for software defect prediction,\u201d IEEE Trans. Softw. Eng., vol. 45, no. 12, Dec. 2018. https:\/\/doi.org\/10.1109\/TSE.2018.2836442","DOI":"10.1109\/TSE.2018.2836442"},{"key":"2025101108193314049_j_acss-2022-0015_ref_036","doi-asserted-by":"crossref","unstructured":"[36] T. Fukushima, Y. Kamei, S. McIntosh, K. Yamashita, and N. Ubayashi, \u201cAn empirical study of just-in-time defect prediction using cross-project models,\u201d in Proceedings of the 11th Working Conference on Mining Software Repositories \u2013MSR 2014, May 2014, pp. 172\u2013181. https:\/\/doi.org\/10.1145\/2597073.2597075","DOI":"10.1145\/2597073.2597075"},{"key":"2025101108193314049_j_acss-2022-0015_ref_037","doi-asserted-by":"crossref","unstructured":"[37] X.-Y. Jing, F. Wu, X. Dong, and B. Xu, \u201cAn improved SDA based defect prediction framework for both within-project and cross-project class-imbalance problems,\u201d IEEE Trans. Softw. Eng., vol. 43, no. 4, pp. 321\u2013339, Apr. 2017. https:\/\/doi.org\/10.1109\/TSE.2016.2597849","DOI":"10.1109\/TSE.2016.2597849"},{"key":"2025101108193314049_j_acss-2022-0015_ref_038","doi-asserted-by":"crossref","unstructured":"[38] A. K. Shukla, P. Singh, and M. Vardhan, \u201cA hybrid gene selection method for microarray recognition,\u201d Biocybern. Biomed. Eng., vol. 38, no. 4, pp. 975\u2013991, 2018. https:\/\/doi.org\/10.1016\/j.bbe.2018.08.004","DOI":"10.1016\/j.bbe.2018.08.004"},{"key":"2025101108193314049_j_acss-2022-0015_ref_039","doi-asserted-by":"crossref","unstructured":"[39] C. Ding and H. Peng, \u201cMinimum redundancy feature selection from microarray gene expression data,\u201d inBioinforma. Conf. 2003.CSB 2003. Proceedings of the 2003 IEEE Bioinformatics Conference, Stanford, CA, USA, Aug. 2003, pp. 523\u2013528. https:\/\/doi.org\/10.1109\/CSB.2003.1227396","DOI":"10.1109\/CSB.2003.1227396"},{"key":"2025101108193314049_j_acss-2022-0015_ref_040","doi-asserted-by":"crossref","unstructured":"[40] N. Settouti, M. E. A. Bechar, and M. A. Chikh, \u201cStatistical comparisons of the top 10 algorithms in data mining for classification task,\u201d Int. J. Interact. Multimed. Artif. Intell., vol. 4, no. 1, pp. 46\u201351, Sep. 2016. https:\/\/doi.org\/10.9781\/ijimai.2016.419","DOI":"10.9781\/ijimai.2016.419"},{"key":"2025101108193314049_j_acss-2022-0015_ref_041","doi-asserted-by":"crossref","unstructured":"[41] R. Malhotra, \u201cA systematic review of machine learning techniques for software fault prediction,\u201d Appl. Soft Comput. J., vol. 27, pp. 504\u2013518, Feb. 2015. https:\/\/doi.org\/10.1016\/j.asoc.2014.11.023","DOI":"10.1016\/j.asoc.2014.11.023"},{"key":"2025101108193314049_j_acss-2022-0015_ref_042","doi-asserted-by":"crossref","unstructured":"[42] Z. Zhang and X. Xie, \u201cResearch on AdaBoost.M1 with Random Forest,\u201d in ICCET 2010\u20132010 Int. Conf. Comput. Eng. Technol. Proc., vol. 1, Chengdu, Apr. 2010, pp. V1-647\u2013V1-652. https:\/\/doi.org\/10.1109\/ICCET.2010.5485910","DOI":"10.1109\/ICCET.2010.5485910"},{"key":"2025101108193314049_j_acss-2022-0015_ref_043","doi-asserted-by":"crossref","unstructured":"[43] C. Tantithamthavorn, S. McIntosh, A. E. Hassan, and K. Matsumoto, \u201cThe impact of automated parameter optimization on defect prediction models,\u201d IEEE Trans. Softw. Eng., vol. 45, no. 7, pp. 683\u2013711, Jul. 2019. https:\/\/doi.org\/10.1109\/TSE.2018.2794977","DOI":"10.1109\/TSE.2018.2794977"},{"key":"2025101108193314049_j_acss-2022-0015_ref_044","doi-asserted-by":"crossref","unstructured":"[44] M. Jureczko and L. Madeyski, \u201cTowards identifying software project clusters with regard to defect prediction,\u201d inProc. 6th Int. Conf. Predict. Model. Softw. Eng. \u2013PROMISE\u201910, Art. no.9, Sep. 2010, pp. 1\u201310. https:\/\/doi.org\/10.1145\/1868328.1868342","DOI":"10.1145\/1868328.1868342"},{"key":"2025101108193314049_j_acss-2022-0015_ref_045","unstructured":"[45] G. Boetticher, T. Menzies, and T. Ostrand, \u201cTera-PROMISE: Welcome to one of the largest repositories of SE research data.\u201d [Online]. Available: http:\/\/openscience.us\/repo\/.Accessed on: Nov. 30, 2017."},{"key":"2025101108193314049_j_acss-2022-0015_ref_046","doi-asserted-by":"crossref","unstructured":"[46] Z. He, F. Peters, T. Menzies, and Y. Yang, \u201cLearning from open-source projects: An empirical study on defect prediction,\u201d inInt. Symp. Empir. Softw. Eng. Meas., Baltimore, MD, USA, Oct. 2013, pp. 45\u201354. https:\/\/doi.org\/10.1109\/ESEM.2013.20","DOI":"10.1109\/ESEM.2013.20"},{"key":"2025101108193314049_j_acss-2022-0015_ref_047","doi-asserted-by":"crossref","unstructured":"[47] Y. Liu, T. M. Khoshgoftaar, and N. Seliya, \u201cEvolutionary optimization of software quality modeling with multiple repositories,\u201d IEEE Trans. Softw. Eng., vol. 36, no. 6, pp. 852\u2013864, May 2010. https:\/\/doi.org\/10.1109\/TSE.2010.51","DOI":"10.1109\/TSE.2010.51"},{"key":"2025101108193314049_j_acss-2022-0015_ref_048","doi-asserted-by":"crossref","unstructured":"[48] G. Canfora, A. De Lucia, M. Di Penta, R. Oliveto, A. Panichella, and S. Panichella, \u201cMulti -objective cross-project defect prediction,\u201d in Proceedings \u2013IEEE 6th International Conference on Software Testing, Verification and Validation, ICST 2013, Luxembourg, Mar. 2013, pp. 252\u2013261. https:\/\/doi.org\/10.1109\/ICST.2013.38","DOI":"10.1109\/ICST.2013.38"},{"key":"2025101108193314049_j_acss-2022-0015_ref_049","doi-asserted-by":"crossref","unstructured":"[49] A. Panichella, R. Oliveto, and A. De Lucia, \u201cCross-project defect prediction models: L\u2019Union fait la force,\u201d in 2014 Software Evolution Week \u2013IEEE Conference on Software Maintenance, Reengineering, and Reverse Engineering (CSMR-WCRE), Antwerpen, Belgium, Feb. 2014, pp. 164\u2013173. https:\/\/doi.org\/10.1109\/CSMR-WCRE.2014.6747166","DOI":"10.1109\/CSMR-WCRE.2014.6747166"},{"key":"2025101108193314049_j_acss-2022-0015_ref_050","unstructured":"[50] J. Dem\u0161ar, \u201cStatistical comparisons of classifiers over multiple data sets,\u201d J. Mach. Learn. Res., vol. 7, no. 4, pp. 1\u201330, Aug. 2006. https:\/\/doi.org\/10.1016\/j.jecp.2010.03.00520451214"},{"key":"2025101108193314049_j_acss-2022-0015_ref_051","unstructured":"[51] E. G. Jelihovschi, J. C. Faria, and I. B. Allaman, \u201cThe ScottKnott clustering algorithm,\u201d Univ. Estadual St. Cruz-UESC, Ilheus, Bahia, Bras., 2014."},{"key":"2025101108193314049_j_acss-2022-0015_ref_052","doi-asserted-by":"crossref","unstructured":"[52] Q. Yu, J. Qian, S. Jiang, Z. Wu, and G. Zhang, \u201cAn empirical study on the effectiveness of feature selection for cross-project defect prediction,\u201d IEEE Access, vol. 7, pp. 35710\u201335718, Jan. 2019. https:\/\/doi.org\/10.1109\/ACCESS.2019.2895614","DOI":"10.1109\/ACCESS.2019.2895614"},{"key":"2025101108193314049_j_acss-2022-0015_ref_053","doi-asserted-by":"crossref","unstructured":"[53] M.Friedman, \u201cThe use of ranks to avoid the assumption of normality implicit in the analysis of variance,\u201d J. Am. Stat. Assoc., vol. 32, no. 200, pp. 675\u2013701, 1937. https:\/\/doi.org\/10.1080\/01621459.1937.10503522","DOI":"10.1080\/01621459.1937.10503522"}],"container-title":["Applied Computer Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.sciendo.com\/pdf\/10.2478\/acss-2022-0015","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T08:19:50Z","timestamp":1760170790000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.sciendo.com\/article\/10.2478\/acss-2022-0015"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,1]]},"references-count":53,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2023,1,24]]},"published-print":{"date-parts":[[2022,12,1]]}},"alternative-id":["10.2478\/acss-2022-0015"],"URL":"https:\/\/doi.org\/10.2478\/acss-2022-0015","relation":{},"ISSN":["2255-8691"],"issn-type":[{"value":"2255-8691","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,1]]}}}