{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T03:25:08Z","timestamp":1769829908015,"version":"3.49.0"},"reference-count":46,"publisher":"World Scientific Pub Co Pte Ltd","issue":"12","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61902161"],"award-info":[{"award-number":["61902161"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62077029"],"award-info":[{"award-number":["62077029"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Natural Science Foundation of the Jiangsu Higher Education Institutions of China","award":["18KJB520016"],"award-info":[{"award-number":["18KJB520016"]}]},{"name":"Research Support Program for Doctorate Teachers of Jiangsu Normal University","award":["17XLR001"],"award-info":[{"award-number":["17XLR001"]}]},{"name":"Jiangsu Postgraduate Research and Practice Innovation Project","award":["2022XKT1525"],"award-info":[{"award-number":["2022XKT1525"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Soft. Eng. Knowl. Eng."],"published-print":{"date-parts":[[2024,12]]},"abstract":"<jats:p> The class overlap problem refers to instances from different categories heavily overlapping in the feature space. This issue is one of the challenges in improving the performance of software defect prediction (SDP). Currently, the studies on the impact of class overlap on SDP mainly focused on within-project defect prediction and cross-project defect prediction. Moreover, the existing class overlap instances cleaning methods are not suitable for cross-version defect prediction. In this paper, we propose a class overlap instances cleaning method based on the Ratio of K-nearest neighbors with the Same Label (RKSL). This method removes instances with the abnormal neighbor ratio in the training set. Based on the RKSL method, we investigate the impact of class overlap on the performance and interpretability of the cross-version defect prediction model. The experiment results show that class overlap can affect the performance of cross-version defect prediction models significantly. The RKSL method can handle the class overlap problem in defect datasets, but it may impact the interpretability of models. Through the analysis of feature changes, we consider that class overlap instances cleaning can assist models in identifying more important features. <\/jats:p>","DOI":"10.1142\/s0218194024500414","type":"journal-article","created":{"date-parts":[[2024,8,10]],"date-time":"2024-08-10T15:46:02Z","timestamp":1723304762000},"page":"1895-1918","source":"Crossref","is-referenced-by-count":3,"title":["An Empirical Study of the Impact of Class Overlap on the Performance and Interpretability of Cross-Version Defect Prediction"],"prefix":"10.1142","volume":"34","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3490-3908","authenticated-orcid":false,"given":"Hui","family":"Han","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0962-826X","authenticated-orcid":false,"given":"Qiao","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0996-0142","authenticated-orcid":false,"given":"Yi","family":"Zhu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-1121-8529","authenticated-orcid":false,"given":"Shengyi","family":"Cheng","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-0943-9996","authenticated-orcid":false,"given":"Yu","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Jiangsu Normal University, Xuzhou, Jiangsu 221116, China"}]}],"member":"219","published-online":{"date-parts":[[2024,9,21]]},"reference":[{"key":"S0218194024500414BIB001","doi-asserted-by":"publisher","DOI":"10.3724\/SP.J.1001.2008.01565"},{"key":"S0218194024500414BIB002","doi-asserted-by":"publisher","DOI":"10.3156\/jsoft.27.1_1"},{"issue":"10","key":"S0218194024500414BIB003","first-page":"3090","volume":"30","author":"Gong L.","year":"2019","journal-title":"J. Softw."},{"issue":"1","key":"S0218194024500414BIB004","first-page":"50","volume":"34","author":"Mei Y.","year":"2023","journal-title":"J. Softw."},{"key":"S0218194024500414BIB005","doi-asserted-by":"publisher","DOI":"10.1007\/s10515-024-00424-1"},{"key":"S0218194024500414BIB006","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2013.11"},{"key":"S0218194024500414BIB007","doi-asserted-by":"publisher","DOI":"10.1049\/iet-sen.2017.0148"},{"key":"S0218194024500414BIB008","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2022.104773"},{"key":"S0218194024500414BIB009","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-24694-7_32"},{"key":"S0218194024500414BIB010","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-13059-5_22"},{"key":"S0218194024500414BIB011","first-page":"397","volume-title":"CIARP 2007: Progress in Pattern Recognition, Image Analysis and Applications","author":"Garc\u00eda V.","year":"2007"},{"key":"S0218194024500414BIB012","doi-asserted-by":"publisher","DOI":"10.1109\/ICTAI.2004.93"},{"key":"S0218194024500414BIB013","doi-asserted-by":"publisher","DOI":"10.1145\/1985793.1985859"},{"key":"S0218194024500414BIB014","doi-asserted-by":"publisher","DOI":"10.1007\/s11219-016-9342-6"},{"key":"S0218194024500414BIB015","doi-asserted-by":"publisher","DOI":"10.1109\/ASE.2019.00071"},{"key":"S0218194024500414BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2022.3220740"},{"issue":"10","key":"S0218194024500414BIB017","first-page":"1621","volume":"13","author":"Wu F.","year":"2019","journal-title":"J. Front. Comput. Sci. Technol."},{"issue":"10","key":"S0218194024500414BIB018","first-page":"8675","volume":"34","author":"Bhutamapuram U. S.","year":"2022","journal-title":"J. King Saud Univ.-Comput. Inf. Sci."},{"key":"S0218194024500414BIB019","doi-asserted-by":"publisher","DOI":"10.1049\/sfw2.12052"},{"key":"S0218194024500414BIB020","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2022.111522"},{"key":"S0218194024500414BIB021","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2022.106939"},{"key":"S0218194024500414BIB022","doi-asserted-by":"publisher","DOI":"10.1002\/smr.2563"},{"key":"S0218194024500414BIB023","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3001440"},{"key":"S0218194024500414BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2017.2724538"},{"key":"S0218194024500414BIB025","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-019-09777-8"},{"key":"S0218194024500414BIB026","doi-asserted-by":"publisher","DOI":"10.1109\/SANER.2018.8330210"},{"key":"S0218194024500414BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/SANER.2018.8330212"},{"key":"S0218194024500414BIB028","doi-asserted-by":"publisher","DOI":"10.1145\/3545258.3545275"},{"key":"S0218194024500414BIB029","doi-asserted-by":"publisher","DOI":"10.1109\/TR.2018.2864206"},{"key":"S0218194024500414BIB030","doi-asserted-by":"publisher","DOI":"10.18293\/SEKE2019-113"},{"key":"S0218194024500414BIB031","doi-asserted-by":"publisher","DOI":"10.1145\/1868328.1868342"},{"key":"S0218194024500414BIB032","doi-asserted-by":"publisher","DOI":"10.3390\/electronics9081295"},{"key":"S0218194024500414BIB033","volume-title":"Decision Trees for Decision Making","author":"Magee J. F.","year":"1964"},{"key":"S0218194024500414BIB034","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1967.1053964"},{"key":"S0218194024500414BIB035","volume-title":"Logistic Regression","author":"Kleinbaum D. G.","year":"2002"},{"key":"S0218194024500414BIB036","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-56485-8"},{"key":"S0218194024500414BIB037","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-70911-1_20"},{"key":"S0218194024500414BIB038","doi-asserted-by":"publisher","DOI":"10.1038\/nbt1206-1565"},{"key":"S0218194024500414BIB039","first-page":"2825","volume":"12","author":"Pedregosa F.","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"S0218194024500414BIB040","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2005.50"},{"key":"S0218194024500414BIB041","first-page":"1","volume-title":"SAS Global Forum","volume":"12","author":"Akosa J.","year":"2017"},{"key":"S0218194024500414BIB043","volume-title":"Explaining Psychological Statistics","author":"Cohen B. H.","year":"2008"},{"key":"S0218194024500414BIB044","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2022.107016"},{"key":"S0218194024500414BIB045","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939778"},{"key":"S0218194024500414BIB046","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2018.2876537"},{"key":"S0218194024500414BIB047","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2020.106428"}],"container-title":["International Journal of Software Engineering and Knowledge Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218194024500414","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,11]],"date-time":"2024-12-11T01:36:25Z","timestamp":1733880985000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218194024500414"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,21]]},"references-count":46,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,12]]}},"alternative-id":["10.1142\/S0218194024500414"],"URL":"https:\/\/doi.org\/10.1142\/s0218194024500414","relation":{},"ISSN":["0218-1940","1793-6403"],"issn-type":[{"value":"0218-1940","type":"print"},{"value":"1793-6403","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,21]]}}}