{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T19:59:57Z","timestamp":1773777597084,"version":"3.50.1"},"reference-count":90,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T00:00:00Z","timestamp":1753401600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T00:00:00Z","timestamp":1753401600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002386","name":"Cairo University","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100002386","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Autom Softw Eng"],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>In software development, detecting code smells is a critical task for ensuring good code quality. The God class code smell in particular involves a great deal of subjectivity because of the levels of coupling and cohesion associated with it. Automated techniques for code smell detection aim to resolve this subjectivity. Machine learning techniques have shown promising results that tend to improve accuracy and reduce the bias associated with other techniques for God class identification. This is due to their pattern recognition capabilities, which make them more objective in identifying patterns that indicate code smells. However, current results need to be further improved in terms of both accuracy and generalizability. 
The challenge in using machine learning lies not only in selecting the most appropriate technique but also in effectively representing source code as input patterns fed to Machine Learning (ML) classifier(s). Code representation plays a pivotal role in encoding source code for ML algorithms. This study aims to improve the accuracy and generalizability of God class code smell detection by exploring the effect of various code representation techniques, namely tree-based, metric-based, code embedding, and token-based techniques, on the ML detection results. The study is conducted on the MLCQ dataset and applies various ML algorithms (specifically: Logistic Regression, Random Forest, SVM, Decision Tree, Naive Bayes, Gradient Boosting, XGBoost). The evaluation results show how different code representation techniques influence ML detection outcomes and the comparative performance of ML algorithms. The study findings reveal that the F1-score achieved outperforms prior studies on the MLCQ dataset, indicating the effectiveness of the proposed approach. The presented results reveal that the code representation technique used has a significant impact on the ML classifier results. 
This paves the way for developing intelligent IDE plugins for just in time God Class code smell detection among other code smells.<\/jats:p>","DOI":"10.1007\/s10515-025-00534-4","type":"journal-article","created":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T07:00:34Z","timestamp":1753426834000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["From code to insight: studying code representation techniques for ML-based God class detection to support intelligent IDEs"],"prefix":"10.1007","volume":"32","author":[{"given":"Elmohanad","family":"Haroon","sequence":"first","affiliation":[]},{"given":"Khaled Tawfik","family":"Wassif","sequence":"additional","affiliation":[]},{"given":"Lamia Abo","family":"Zaid","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,7,25]]},"reference":[{"key":"534_CR1","doi-asserted-by":"crossref","unstructured":"Abeyratne, A., Samarage, C., Dahanayake, B., Wijesiriwardana, C., Wimalaratne, P.: A security specific knowledge modelling approach for secure software engineering.\u00a0J. Natl. Sci. Found. Sri.\u00a048(1) (2020)","DOI":"10.4038\/jnsfsr.v48i1.8950"},{"key":"534_CR2","doi-asserted-by":"publisher","unstructured":"AbuHassan, A., Alshayeb, M., Ghouti, L.: Software smell detection techniques: A systematic literature review. J. Software Evol. Process, 33(3), e2320 (2021). https:\/\/doi.org\/10.1002\/smr.2320","DOI":"10.1002\/smr.2320"},{"issue":"7","key":"534_CR3","doi-asserted-by":"publisher","first-page":"1159","DOI":"10.1080\/09720510.2020.1799576","volume":"23","author":"M Agnihotri","year":"2020","unstructured":"Agnihotri, M., Chug, A.: Application of machine learning algorithms for code smell prediction using object-oriented software metrics. J. Stat. Manag. Syst. 23(7), 1159\u20131171 (2020). https:\/\/doi.org\/10.1080\/09720510.2020.1799576","journal-title":"J. Stat. Manag. 
Syst."},{"key":"534_CR4","doi-asserted-by":"publisher","first-page":"107120","DOI":"10.1016\/j.infsof.2022.107120","volume":"155","author":"SM Al Khatib","year":"2023","unstructured":"Al Khatib, S.M., Alkharabsheh, K., Alawadi, S.: Selection of human evaluators for design smell detection using dragonfly optimization algorithm: An empirical study. Inf. Softw. Technol. 155, 107120 (2023)","journal-title":"Inf. Softw. Technol."},{"key":"534_CR5","doi-asserted-by":"crossref","unstructured":"Alkharabsheh, K., Crespo, Y., Fern\u00e1ndez-Delgado, M., Cotos, J. M., Taboada, J. A.: Assessing the influence of size category of the project in god class detection, an experimental approach based on machine learning (MLA). In International Conference on Software Engineering & Knowledge Engineering (pp. 361\u2013366)  (2019a)","DOI":"10.18293\/SEKE2019-140"},{"key":"534_CR6","doi-asserted-by":"crossref","unstructured":"Alkharabsheh, K.: An empirical study on the co-occurrence of design smells in the same software module: God class case study. 2021 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT) (2021)","DOI":"10.1109\/JEEIT53412.2021.9634144"},{"key":"534_CR7","doi-asserted-by":"publisher","first-page":"145191","DOI":"10.1109\/ACCESS.2021.3123123","volume":"9","author":"K Alkharabsheh","year":"2021","unstructured":"Alkharabsheh, K., Alawadi, S., Crespo, Y., Manso, M.E., Gonz\u00e1lez, J.A.T.: Analysing agreement among different evaluators in god class and feature envy detection. IEEE Access 9, 145191\u2013145211 (2021a)","journal-title":"IEEE Access"},{"issue":"1","key":"534_CR8","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1007\/s10586-024-04724-9","volume":"28","author":"K Alkharabsheh","year":"2025","unstructured":"Alkharabsheh, K., Alawadi, S., Crespo, Y., Taboada, J.A.: Exploring the role of project status information in effective code smell detection. Clust. Comput. 28(1), 29 (2025)","journal-title":"Clust. 
Comput."},{"issue":"10","key":"534_CR9","doi-asserted-by":"publisher","first-page":"9332","DOI":"10.1016\/j.jksuci.2022.09.011","volume":"34","author":"K Alkharabsheh","year":"2022","unstructured":"Alkharabsheh, K., Alawadi, S., Ignaim, K., Zanoon, N., Crespo, Y., Manso, E., Taboada, J.A.: Prioritization of god class design smell: A multi-criteria based approach. J. King Saud Univ. Comput. Inf. Sci. 34(10), 9332\u20139342 (2022a)","journal-title":"J. King Saud Univ. Comput. Inf. Sci."},{"key":"534_CR10","doi-asserted-by":"publisher","first-page":"106736","DOI":"10.1016\/j.infsof.2021.106736","volume":"143","author":"K Alkharabsheh","year":"2022","unstructured":"Alkharabsheh, K., Alawadi, S., Kebande, V.R., Crespo, Y., Fern\u00e1ndez-Delgado, M., Taboada, J.A.: A comparison of machine learning algorithms on design smell detection using balanced and imbalanced dataset: A study of God class. Inf. Softw. Technol. 143, 106736 (2022b)","journal-title":"Inf. Softw. Technol."},{"key":"534_CR11","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1007\/s11219-021-09550-5","volume":"29","author":"K Alkharabsheh","year":"2021","unstructured":"Alkharabsheh, K., Crespo, Y., Fern\u00e1ndez-Delgado, M., Viqueira, J.R., Taboada, J.A.: Exploratory study of the impact of project domain and size category on the detection of the god class design smell. Software Qual. J. 29, 197\u2013237 (2021b)","journal-title":"Software Qual. J."},{"key":"534_CR12","doi-asserted-by":"publisher","first-page":"1069","DOI":"10.1007\/s11219-018-9424-8","volume":"27","author":"K Alkharabsheh","year":"2019","unstructured":"Alkharabsheh, K., Crespo, Y., Manso, E., Taboada, J.A.: Software design smell detection: a systematic mapping study. Software Qual. J. 27, 1069\u20131148 (2019b)","journal-title":"Software Qual. J."},{"key":"534_CR13","unstructured":"Alon, U., Brody, S., Levy, O., Yahav, E.: code2seq: Generating Sequences from Structured Representations of Code (2018). arXiv:1808.01400. 
Retrieved August 01, 2018, from https:\/\/ui.adsabs.harvard.edu\/abs\/2018arXiv180801400A"},{"key":"534_CR14","doi-asserted-by":"crossref","unstructured":"Alon, U., Zilberstein, M., Levy, O., Yahav, E.: code2vec: Learning distributed representations of code. Proc. ACM Program. Languages. 3(POPL), 1\u201329 (2019)","DOI":"10.1145\/3290353"},{"issue":"4","key":"534_CR15","doi-asserted-by":"publisher","first-page":"2341","DOI":"10.1007\/s13369-019-04311-w","volume":"45","author":"A Al-Shaaby","year":"2020","unstructured":"Al-Shaaby, A., Aljamaan, H., Alshayeb, M.: Bad smell detection using machine learning techniques: A systematic literature review. Arab. J. Sci. Eng. 45(4), 2341\u20132369 (2020). https:\/\/doi.org\/10.1007\/s13369-019-04311-w","journal-title":"Arab. J. Sci. Eng."},{"key":"534_CR16","doi-asserted-by":"publisher","first-page":"1143","DOI":"10.1007\/s10664-015-9378-4","volume":"21","author":"F Arcelli Fontana","year":"2016","unstructured":"Arcelli Fontana, F., M\u00e4ntyl\u00e4, M.V., Zanoni, M., Marino, A.: Comparing and experimenting machine learning techniques for code smell detection. Empir. Softw. Eng. 21, 1143\u20131191 (2016)","journal-title":"Empir. Softw. Eng."},{"key":"534_CR17","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1016\/j.infsof.2018.12.009","volume":"108","author":"MI Azeem","year":"2019","unstructured":"Azeem, M.I., Palomba, F., Shi, L., Wang, Q.: Machine learning techniques for code smell detection: A systematic literature review and meta-analysis. Inf. Softw. Technol. 108, 115\u2013138 (2019)","journal-title":"Inf. Softw. Technol."},{"issue":"8","key":"534_CR18","doi-asserted-by":"publisher","first-page":"e2255","DOI":"10.1002\/smr.2255","volume":"32","author":"B Bafandeh Mayvan","year":"2020","unstructured":"Bafandeh Mayvan, B., Rasoolzadegan, A., Javan Jafari, A.: Bad smell detection using quality metrics and refactoring opportunities. J. Software. Evol. Proc. 32(8), e2255 (2020). 
https:\/\/doi.org\/10.1002\/smr.2255","journal-title":"J. Software. Evol. Proc."},{"issue":"1999","key":"534_CR19","first-page":"75","volume":"1","author":"K Beck","year":"1999","unstructured":"Beck, K., Fowler, M., Beck, G.: Bad smells in code. Refact. Impr. Des. Exist. Code 1(1999), 75\u201388 (1999)","journal-title":"Refact. Impr. Des. Exist. Code"},{"issue":"SI","key":"534_CR20","doi-asserted-by":"publisher","first-page":"259","DOI":"10.1145\/223427.211856","volume":"20","author":"JM Bieman","year":"1995","unstructured":"Bieman, J.M., Kang, B.-K.: Cohesion and reuse in an object-oriented system. ACM SIGSOFT Software Eng. Notes 20(SI), 259\u2013262 (1995)","journal-title":"ACM SIGSOFT Software Eng. Notes"},{"issue":"1","key":"534_CR21","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1109\/32.748920","volume":"25","author":"LC Briand","year":"1999","unstructured":"Briand, L.C., Daly, J.W., Wust, J.K.: A unified framework for coupling measurement in object-oriented systems. IEEE Trans. Software Eng. 25(1), 91\u2013121 (1999)","journal-title":"IEEE Trans. Software Eng."},{"key":"534_CR22","volume-title":"AntiPatterns: refactoring software, architectures, and projects in crisis","author":"WH Brown","year":"1998","unstructured":"Brown, W.H., Malveau, R.C., McCormick, H.W.S., Mowbray, T.J.: AntiPatterns: refactoring software, architectures, and projects in crisis. John Wiley & Sons Inc (1998)"},{"key":"534_CR23","doi-asserted-by":"crossref","unstructured":"Bryton, S., Abreu, F. B.: Strengthening refactoring: Towards software evolution with quantitative and experimental grounds. 2009 Fourth International Conference on Software Engineering Advances (2009)","DOI":"10.1109\/ICSEA.2009.90"},{"key":"534_CR24","doi-asserted-by":"publisher","first-page":"170844","DOI":"10.1109\/ACCESS.2019.2953696","volume":"7","author":"Z Cai","year":"2019","unstructured":"Cai, Z., Lu, L., Qiu, S.: An abstract syntax tree encoding method for cross-project defect prediction. 
IEEE Access 7, 170844\u2013170853 (2019)","journal-title":"IEEE Access"},{"issue":"11","key":"534_CR25","doi-asserted-by":"publisher","first-page":"273","DOI":"10.3390\/info9110273","volume":"9","author":"AS Cairo","year":"2018","unstructured":"Cairo, A.S., Carneiro, G.D.F., Monteiro, M.P.: The impact of code smells on software bugs: A systematic literature review. Information 9(11), 273 (2018)","journal-title":"Information"},{"key":"534_CR26","unstructured":"Caldiera, V. R. B. G., & Rombach, H. D. The goal question metric approach. Encyclopedia of Software Engineering. 528\u2013532 (1994)"},{"key":"534_CR27","volume-title":"The Java Class Libraries","author":"P Chan","year":"1999","unstructured":"Chan, P., Lee, R., Kramer, D.: The Java Class Libraries. Addison Wesley (1999)"},{"issue":"6","key":"534_CR28","doi-asserted-by":"publisher","first-page":"476","DOI":"10.1109\/32.295895","volume":"20","author":"SR Chidamber","year":"1994","unstructured":"Chidamber, S.R., Kemerer, C.F.: A metrics suite for object oriented design. IEEE Trans. Software Eng. 20(6), 476\u2013493 (1994)","journal-title":"IEEE Trans. Software Eng."},{"issue":"4","key":"534_CR29","doi-asserted-by":"publisher","first-page":"915","DOI":"10.3745\/JIPS.04.0184","volume":"16","author":"MAAA Chug","year":"2020","unstructured":"Chug, M.A.A.A.: A systematic literature survey of software metrics, code smells and refactoring techniques. J. Inf. Process. Syst. 16(4), 915\u2013934 (2020). https:\/\/doi.org\/10.3745\/JIPS.04.0184","journal-title":"J. Inf. Process. Syst."},{"key":"534_CR30","unstructured":"CODEBEAT. Last Visit January 2025.\u00a0https:\/\/codebeat.co\/"},{"key":"534_CR31","doi-asserted-by":"crossref","unstructured":"Cruz, D., Santana, A., Figueiredo, E.: Detecting bad smells with machine learning algorithms: an empirical study. 
Proceedings of the 3rd International Conference on Technical Debt (2020)","DOI":"10.1145\/3387906.3388618"},{"key":"534_CR32","doi-asserted-by":"publisher","unstructured":"Devlin, J., Chang, M. W., Lee, K., & Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers) (pp. 4171\u20134186) (2018).\u200f https:\/\/doi.org\/10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"534_CR33","doi-asserted-by":"crossref","unstructured":"De Stefano, M., Gambardella, M. S., Pecorelli, F., Palomba, F., De Lucia, A.: cASpER: A plug-in for automated code smell detection and refactoring. Proceedings of the 2020 International Conference on Advanced Visual Interfaces (2020)","DOI":"10.1145\/3399715.3399955"},{"key":"534_CR34","doi-asserted-by":"publisher","unstructured":"Dewangan, S., Rao, R. S., Mishra, A., Gupta, M.: Code smell detection using ensemble machine learning algorithms.\u00a0Appl. Sci.\u00a012(20), 10321 (2022).\u200f https:\/\/doi.org\/10.3390\/app122010321","DOI":"10.3390\/app122010321"},{"key":"534_CR35","doi-asserted-by":"crossref","unstructured":"Di Nucci, D., Palomba, F., Tamburri, D. A., Serebrenik, A., De Lucia, A.: Detecting code smells using machine learning techniques: Are we there yet? 2018 ieee 25th international conference on software analysis, evolution and reengineering (saner) (2018)","DOI":"10.1109\/SANER.2018.8330266"},{"key":"534_CR36","doi-asserted-by":"crossref","unstructured":"Fontana, F. A., Zanoni, M., Marino, A., M\u00e4ntyl\u00e4, M. V.: Code smell detection: Towards a machine learning-based approach. In 2013 IEEE international conference on software maintenance (pp. 396\u2013399). 
IEEE (2013)\u200f","DOI":"10.1109\/ICSM.2013.56"},{"key":"534_CR37","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1016\/j.knosys.2017.04.014","volume":"128","author":"FA Fontana","year":"2017","unstructured":"Fontana, F.A., Zanoni, M.: Code smell severity classification using machine learning techniques. Knowl. Based Syst. 128, 43\u201358 (2017)","journal-title":"Knowl. Based Syst."},{"key":"534_CR38","unstructured":"Fowler, M. Refactoring: improving the design of existing code. Addison-Wesley Professional. (2018)\u200f"},{"key":"534_CR39","doi-asserted-by":"crossref","unstructured":"Hadj-Kacem, M., Bouassida, N.: Deep representation learning for code smells detection using variational auto-encoder. 2019 International Joint Conference on Neural Networks (IJCNN) (2019)","DOI":"10.1109\/IJCNN.2019.8851854"},{"key":"534_CR40","unstructured":"Hasantha, C.: A systematic review of code smell detection approaches. J Advance. Software Eng. Testing, 4(1) (2021)"},{"key":"534_CR41","first-page":"291","volume":"13","author":"N Hassan","year":"2020","unstructured":"Hassan, N., Gomaa, W., Khoriba, G., Haggag, M.: Credibility detection in twitter using word n-gram analysis and supervised machine learning techniques. Int. J. Intell. Eng. Syst. 13, 291\u2013300 (2020)","journal-title":"Int. J. Intell. Eng. Syst."},{"key":"534_CR42","unstructured":"Ioffe, S., Szegedy, C.: Batch normalization: Accelerating deep network training by reducing internal covariate shift. International conference on machine learning (2015)"},{"key":"534_CR43","unstructured":"Kanade, A., Maniatis, P., Balakrishnan, G., Shi, K.: Learning and evaluating contextual embedding of source code. International conference on machine learning (2020)"},{"key":"534_CR44","doi-asserted-by":"crossref","unstructured":"Kaur, A., Jain, S., Goel, S.: A support vector machine based approach for code smell detection. 
2017 international conference on machine learning and data science (MLDS) (2017)","DOI":"10.1109\/MLDS.2017.8"},{"key":"534_CR45","doi-asserted-by":"crossref","unstructured":"Khomh, F., Vaucher, S., Gu\u00e9h\u00e9neuc, Y.-G., Sahraoui, H.: A bayesian approach for the detection of code and design smells. 2009 Ninth International Conference on Quality Software (2009)","DOI":"10.1109\/QSIC.2009.47"},{"issue":"4","key":"534_CR46","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1016\/j.jss.2010.11.921","volume":"84","author":"F Khomh","year":"2011","unstructured":"Khomh, F., Vaucher, S., Gu\u00e9h\u00e9neuc, Y.-G., Sahraoui, H.: BDTEX: A GQM-based Bayesian approach for the detection of antipatterns. J. Syst. Softw. 84(4), 559\u2013572 (2011)","journal-title":"J. Syst. Softw."},{"key":"534_CR47","doi-asserted-by":"publisher","first-page":"117607","DOI":"10.1016\/j.eswa.2022.117607","volume":"204","author":"A Kova\u010devi\u0107","year":"2022","unstructured":"Kova\u010devi\u0107, A., Slivka, J., Vidakovi\u0107, D., Gruji\u0107, K.-G., Luburi\u0107, N., Proki\u0107, S., Sladi\u0107, G.: Automatic detection of long method and God Class code smells through neural source code embeddings. Expert Syst. Appl. 204, 117607 (2022)","journal-title":"Expert Syst. Appl."},{"key":"534_CR48","doi-asserted-by":"publisher","unstructured":"Kramer, O. (2016). Scikit-Learn. In O. Kramer (Ed.), Machine Learning for Evolution Strategies (pp. 45\u201353). Springer International Publishing. https:\/\/doi.org\/10.1007\/978-3-319-33383-0_5","DOI":"10.1007\/978-3-319-33383-0_5"},{"key":"534_CR49","doi-asserted-by":"crossref","unstructured":"Kurbatova, Z., Veselov, I., Golubev, Y., Bryksin, T.: Recommendation of move method refactoring using path-based representation of code. 
Proceedings of the IEEE\/ACM 42nd International Conference on Software Engineering Workshops (2020)","DOI":"10.1145\/3387940.3392191"},{"key":"534_CR50","doi-asserted-by":"publisher","first-page":"110610","DOI":"10.1016\/j.jss.2020.110610","volume":"167","author":"G Lacerda","year":"2020","unstructured":"Lacerda, G., Petrillo, F., Pimenta, M., Gu\u00e9h\u00e9neuc, Y.G.: Code smells and refactoring: A tertiary systematic review of challenges and observations. J. Syst. Softw. 167, 110610 (2020)","journal-title":"J. Syst. Softw."},{"key":"534_CR51","unstructured":"Lanza, M., Marinescu, R.: Object-oriented metrics in practice: using software metrics to characterize, evaluate, and improve the design of object-oriented systems. Springer Science & Business Media (2007)"},{"issue":"1","key":"534_CR52","first-page":"559","volume":"18","author":"G Lema\u00eetre","year":"2017","unstructured":"Lema\u00eetre, G., Nogueira, F., Aridas, C.K.: Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res. 18(1), 559\u2013563 (2017)","journal-title":"J. Mach. Learn. Res."},{"issue":"9","key":"534_CR53","first-page":"1811","volume":"47","author":"H Liu","year":"2019","unstructured":"Liu, H., Jin, J., Xu, Z., Zou, Y., Bu, Y., Zhang, L.: Deep learning based code smell detection. IEEE Trans. Software Eng. 47(9), 1811\u20131837 (2019)","journal-title":"IEEE Trans. Software Eng."},{"key":"534_CR54","doi-asserted-by":"crossref","unstructured":"Madeyski, L., Lewowski, T.: MLCQ: Industry-relevant code smell data set. Proceedings of the 24th International Conference on Evaluation and Assessment in Software Engineering (2020)","DOI":"10.1145\/3383219.3383264"},{"key":"534_CR55","doi-asserted-by":"publisher","first-page":"107112","DOI":"10.1016\/j.infsof.2022.107112","volume":"155","author":"L Madeyski","year":"2023","unstructured":"Madeyski, L., Lewowski, T.: Detecting code smells using industry-relevant data. Inf. Softw. Technol. 
155, 107112 (2023)","journal-title":"Inf. Softw. Technol."},{"key":"534_CR56","doi-asserted-by":"crossref","unstructured":"Maiga, A., Ali, N., Bhattacharya, N., Saban\u00e9, A., Gu\u00e9h\u00e9neuc, Y.-G., Aimeur, E.: Smurf: A svm-based incremental anti-pattern detection approach. 2012 19th Working conference on reverse engineering (2012)","DOI":"10.1109\/WCRE.2012.56"},{"issue":"4","key":"534_CR57","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1109\/TSE.1976.233837","volume":"2","author":"TJ McCabe","year":"1976","unstructured":"McCabe, T.J.: A complexity measure. IEEE Trans. Software Eng. 2(4), 308\u2013320 (1976)","journal-title":"IEEE Trans. Software Eng."},{"key":"534_CR58","doi-asserted-by":"publisher","first-page":"1428","DOI":"10.1007\/s11390-020-0323-7","volume":"35","author":"MY Mhawish","year":"2020","unstructured":"Mhawish, M.Y., Gupta, M.: Predicting code smells and analysis of predictions: Using machine learning techniques and software metrics. J. Comput. Sci. Technol. 35, 1428\u20131445 (2020)","journal-title":"J. Comput. Sci. Technol."},{"issue":"1","key":"534_CR59","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1109\/TSE.2009.50","volume":"36","author":"N Moha","year":"2009","unstructured":"Moha, N., Gu\u00e9h\u00e9neuc, Y.-G., Duchien, L., Le Meur, A.-F.: Decor: A method for the specification and detection of code and design smells. IEEE Trans. Software Eng. 36(1), 20\u201336 (2009)","journal-title":"IEEE Trans. Software Eng."},{"key":"534_CR60","doi-asserted-by":"crossref","unstructured":"Murphy-Hill, E., Black, A. P.: An interactive ambient visualization for code smells. 
Proceedings of the 5th international symposium on Software visualization (2010)","DOI":"10.1145\/1879211.1879216"},{"issue":"6","key":"534_CR61","doi-asserted-by":"publisher","first-page":"673","DOI":"10.1007\/s42979-024-03013-x","volume":"5","author":"A Nandini","year":"2024","unstructured":"Nandini, A., Singh, R., Rathee, A.: Improving code smell detection by reducing dimensionality using ensemble feature selection and machine learning. SN Comput. Sci. 5(6), 673 (2024)","journal-title":"SN Comput. Sci."},{"key":"534_CR62","doi-asserted-by":"crossref","unstructured":"Olbrich, S., Cruzes, D. S., Basili, V., Zazworka, N.: The evolution and impact of code smells: A case study of two open source systems. 2009 3rd international symposium on empirical software engineering and measurement (2009)","DOI":"10.1109\/ESEM.2009.5314231"},{"key":"534_CR63","doi-asserted-by":"crossref","unstructured":"Olbrich, S. M., Cruzes, D. S., Sj\u00f8berg, D. I.: Are all code smells harmful? A study of God classes and brain classes in the evolution of three open source systems. 2010 IEEE international conference on software maintenance (2010)","DOI":"10.1109\/ICSM.2010.5609564"},{"key":"534_CR64","doi-asserted-by":"crossref","unstructured":"Palomba, F., Di Nucci, D., Tufano, M., Bavota, G., Oliveto, R., Poshyvanyk, D., De Lucia, A.: Landfill: An open dataset of code smells with public evaluation. 2015 IEEE\/ACM 12th Working Conference on Mining Software Repositories (2015)","DOI":"10.1109\/MSR.2015.69"},{"key":"534_CR65","doi-asserted-by":"crossref","unstructured":"Palomba, F., Bavota, G., Di Penta, M., Fasano, F., Oliveto, R., De Lucia, A.: On the diffuseness and the impact on maintainability of code smells: a large scale empirical investigation. 
Proceedings of the 40th International Conference on Software Engineering (2018)","DOI":"10.1145\/3180155.3182532"},{"key":"534_CR66","doi-asserted-by":"crossref","unstructured":"Pecorelli, F., Palomba, F., Di Nucci, D., De Lucia, A.: Comparing heuristic and machine learning approaches for metric-based code smell detection. 2019 IEEE\/ACM 27th international conference on program comprehension (ICPC) (2019)","DOI":"10.1109\/ICPC.2019.00023"},{"key":"534_CR67","doi-asserted-by":"publisher","first-page":"110693","DOI":"10.1016\/j.jss.2020.110693","volume":"169","author":"F Pecorelli","year":"2020","unstructured":"Pecorelli, F., Di Nucci, D., De Roover, C., De Lucia, A.: A large empirical assessment of the role of data balancing in machine-learning-based code smell detection. J. Syst. Softw. 169, 110693 (2020)","journal-title":"J. Syst. Softw."},{"key":"534_CR68","unstructured":"Peng, H., Mou, L., Li, G., Liu, Y., Zhang, L., Jin, Z.: Building program vector representations for deep learning. Knowledge Science, Engineering and Management: 8th International Conference, KSEM 2015, Chongqing, China, October 28\u201330, 2015, Proceedings 8 (2015)"},{"key":"534_CR69","doi-asserted-by":"crossref","unstructured":"Phan, A. V., Le Nguyen, M.: Convolutional neural networks on assembly code for predicting software defects. 2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES) (2017)","DOI":"10.1109\/IESYS.2017.8233558"},{"key":"534_CR70","unstructured":"PMD. Last Visit January 2025.\u00a0http:\/\/pmd.github.io\/"},{"key":"534_CR71","doi-asserted-by":"publisher","first-page":"351","DOI":"10.1049\/sfw2.12064","volume":"16","author":"HP Samoaa","year":"2022","unstructured":"Samoaa, H.P., Bayram, F., Salza, P., Leitner, P.: A systematic mapping study of source code representation for deep learning in software engineering. IET Softw. 
16, 351\u2013385 (2022)","journal-title":"IET Softw."},{"key":"534_CR72","doi-asserted-by":"crossref","unstructured":"Sharma, T., Mishra, P., Tiwari, R.: Designite: A software design quality assessment tool. Proceedings of the 1st international workshop on bringing architectural design thinking into developers' daily activities (2016)","DOI":"10.1145\/2896935.2896938"},{"key":"534_CR73","unstructured":"Sharma, T., Efstathiou, V., Louridas, P., Spinellis, D.: On the feasibility of transfer-learning code smells using deep learning (2019). arXiv preprint arXiv:1904.03031"},{"key":"534_CR74","unstructured":"Sharma, T., Kechagia, M., Georgiou, S., Tiwari, R., Vats, I., Moazen, H., Sarro, F.: A survey on machine learning techniques for source code analysis (2021b). arXiv preprint arXiv:2110.09610"},{"key":"534_CR75","doi-asserted-by":"publisher","first-page":"110936","DOI":"10.1016\/j.jss.2021.110936","volume":"176","author":"T Sharma","year":"2021","unstructured":"Sharma, T., Efstathiou, V., Louridas, P., Spinellis, D.: Code smell detection by deep direct-learning and transfer-learning. J. Syst. Softw. 176, 110936 (2021a)","journal-title":"J. Syst. Softw."},{"key":"534_CR76","doi-asserted-by":"publisher","first-page":"158","DOI":"10.1016\/j.jss.2017.12.034","volume":"138","author":"T Sharma","year":"2018","unstructured":"Sharma, T., Spinellis, D.: A survey on software smells. J. Syst. Softw. 138, 158\u2013173 (2018)","journal-title":"J. Syst. Softw."},{"key":"534_CR77","unstructured":"SonarLint. Last Visit January 2025.\u00a0https:\/\/www.sonarsource.com\/products\/sonarlint\/"},{"issue":"1","key":"534_CR78","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1108\/eb026526","volume":"28","author":"K Sparck Jones","year":"1972","unstructured":"Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval. J Document 28(1), 11\u201321 (1972). 
https:\/\/doi.org\/10.1108\/eb026526","journal-title":"J Document"},{"key":"534_CR79","unstructured":"Spinellis, D.: dspinellis\/tokenizer: Version 1.1. In: Feb (2019)"},{"key":"534_CR80","unstructured":"Tabassum, A., Patil, D. R. R. (2020). A survey on text pre-processing & feature extraction techniques in natural language processing."},{"key":"534_CR81","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1016\/j.jss.2017.11.073","volume":"138","author":"R Terra","year":"2018","unstructured":"Terra, R., Valente, M.T., Miranda, S., Sales, V.: JMove: A novel heuristic and tool to detect move method refactoring opportunities. J. Syst. Softw. 138, 19\u201336 (2018)","journal-title":"J. Syst. Softw."},{"key":"534_CR82","doi-asserted-by":"crossref","unstructured":"Thongkum, P., Mekruksavanich, S.: Design Flaws Prediction for Impact on Software Maintainability using Extreme Learning Machine. 2020 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT & NCON) (2020)","DOI":"10.1109\/ECTIDAMTNCON48261.2020.9090717"},{"key":"534_CR83","first-page":"1303","volume-title":"R\u00e9pertoire des d\u00e9fauts de conception","author":"A Tiberghien","year":"2007","unstructured":"Tiberghien, A., Moha, N., Mens, T., Mens, K.: R\u00e9pertoire des d\u00e9fauts de conception, p. 1303. Univ, Montreal, Montreal, QC, Canada, Tech. Rep (2007)"},{"issue":"3","key":"534_CR84","doi-asserted-by":"publisher","first-page":"347","DOI":"10.1109\/TSE.2009.1","volume":"35","author":"N Tsantalis","year":"2009","unstructured":"Tsantalis, N., Chatzigeorgiou, A.: Identification of move method refactoring opportunities. IEEE Trans. Software Eng. 35(3), 347\u2013367 (2009)","journal-title":"IEEE Trans. Software Eng."},{"key":"534_CR85","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. 
N., Kaiser, \u0141., Polosukhin, I.: Attention is all you need Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, California, USA (2017)"},{"key":"534_CR86","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s13369-024-08826-9","volume":"49","author":"R Verma","year":"2024","unstructured":"Verma, R., Kumar, K., Verma, H.K.: Prioritizing God class code smells in object-oriented software using fuzzy inference system. Arab. J. Sci. Eng. 49, 1\u201328 (2024)","journal-title":"Arab. J. Sci. Eng."},{"key":"534_CR87","doi-asserted-by":"crossref","unstructured":"Wang, S., Liu, T., Tan, L.: Automatically learning semantic features for defect prediction. Proceedings of the 38th International Conference on Software Engineering (2016)","DOI":"10.1145\/2884781.2884804"},{"key":"534_CR88","doi-asserted-by":"publisher","first-page":"109527","DOI":"10.1016\/j.engappai.2024.109527","volume":"139","author":"PS Yadav","year":"2025","unstructured":"Yadav, P.S., Rao, R.S., Mishra, A., Gupta, M.: Ensemble methods with feature selection and data balancing for improved code smells classification performance. Eng. Appl. Artif. Intell. 139, 109527 (2025)","journal-title":"Eng. Appl. Artif. Intell."},{"key":"534_CR89","doi-asserted-by":"crossref","unstructured":"Yamashita, A., Moonen, L.: Exploring the impact of inter-smell relations on software maintainability: An empirical study. 2013 35th International Conference on Software Engineering (ICSE) (2013)","DOI":"10.1109\/ICSE.2013.6606614"},{"key":"534_CR90","doi-asserted-by":"crossref","unstructured":"Zazworka, N., Shaw, M. A., Shull, F., Seaman, C.: Investigating the impact of design debt on software quality. 
Proceedings of the 2nd workshop on managing technical debt (2011)","DOI":"10.1145\/1985362.1985366"}],"container-title":["Automated Software Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10515-025-00534-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10515-025-00534-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10515-025-00534-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,10]],"date-time":"2025-09-10T13:58:01Z","timestamp":1757512681000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10515-025-00534-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,25]]},"references-count":90,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["534"],"URL":"https:\/\/doi.org\/10.1007\/s10515-025-00534-4","relation":{},"ISSN":["0928-8910","1573-7535"],"issn-type":[{"value":"0928-8910","type":"print"},{"value":"1573-7535","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,25]]},"assertion":[{"value":"31 January 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 July 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 July 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing 
interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"66"}}