{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T16:06:36Z","timestamp":1778083596366,"version":"3.51.4"},"reference-count":71,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2023,1,14]],"date-time":"2023-01-14T00:00:00Z","timestamp":1673654400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,1,14]],"date-time":"2023-01-14T00:00:00Z","timestamp":1673654400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["402774445"],"award-info":[{"award-number":["402774445"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100016135","name":"Universit\u00e4t Passau","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100016135","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Empir Software Eng"],"published-print":{"date-parts":[[2023,3]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Many software metrics are designed to measure aspects that are believed to be related to software quality. Static software metrics, e.g., size, complexity and coupling are used in defect prediction research as well as software quality models to evaluate software quality. Static analysis tools also include boundary values for complexity and size that generate warnings for developers. While this indicates a relationship between quality and software metrics, the extent of it is not well understood. Moreover, recent studies found that complexity metrics may be unreliable indicators for understandability of the source code. To explore this relationship, we leverage the intent of developers about what constitutes a quality improvement in their own code base. We manually classify a randomized sample of 2,533 commits from 54 Java open source projects as quality improving depending on the intent of the developer by inspecting the commit message. We distinguish between perfective and corrective maintenance via predefined guidelines and use this data as ground truth for the fine-tuning of a state-of-the art deep learning model for natural language processing. The benchmark we provide with our ground truth indicates that the deep learning model can be confidently used for commit intent classification. We use the model to increase our data set to 125,482 commits. Based on the resulting data set, we investigate the differences in size and 14 static source code metrics between changes that increase quality, as indicated by the developer, and changes unrelated to quality. In addition, we investigate which files are targets of quality improvements. We find that quality improving commits are smaller than non-quality improving commits. Perfective changes have a positive impact on static source code metrics while corrective changes do tend to add complexity. Furthermore, we find that files which are the target of perfective maintenance already have a lower median complexity than files which are the target of non-pervective changes. Our study results provide empirical evidence for which static source code metrics capture quality improvement from the developers point of view. This has implications for program understanding as well as code smell detection and recommender systems.<\/jats:p>","DOI":"10.1007\/s10664-022-10257-9","type":"journal-article","created":{"date-parts":[[2023,1,14]],"date-time":"2023-01-14T09:02:44Z","timestamp":1673686964000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["What really changes when developers intend to improve their source code: a commit-level study of static metric value and static analysis warning changes"],"prefix":"10.1007","volume":"28","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5236-7953","authenticated-orcid":false,"given":"Alexander","family":"Trautsch","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Johannes","family":"Erbel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9765-2803","authenticated-orcid":false,"given":"Steffen","family":"Herbold","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jens","family":"Grabowski","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,1,14]]},"reference":[{"key":"10257_CR1","unstructured":"Abdi H (2007) Bonferroni and sidak corrections for multiple comparisons. In: Encyclopedia of measurement and statistics. Sage, Thousand Oaks, pp 103\u2013107"},{"issue":"1","key":"10257_CR2","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1109\/TSE.2017.2658573","volume":"44","author":"J Al Dallal","year":"2018","unstructured":"Al Dallal J, Abdin A (2018) Empirical evaluation of the impact of object-oriented code refactoring on quality attributes: a systematic literature review. IEEE Trans Softw Eng 44(1):44\u201369. https:\/\/doi.org\/10.1109\/TSE.2017.2658573","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR3","doi-asserted-by":"publisher","unstructured":"Alali A, Kagdi H, Maletic JI (2008) What\u2019s a typical commit? A characterization of open source software repositories. In: 2008 16th IEEE international conference on program comprehension. https:\/\/doi.org\/10.1109\/ICPC.2008.24, pp 182\u2013191","DOI":"10.1109\/ICPC.2008.24"},{"key":"10257_CR4","doi-asserted-by":"publisher","first-page":"110821","DOI":"10.1016\/j.jss.2020.110821","volume":"171","author":"EA AlOmar","year":"2021","unstructured":"AlOmar EA, Mkaouer MW, Ouni A (2021) Toward the automatic classification of self-affirmed refactoring. J Syst Softw 171:110821. https:\/\/doi.org\/10.1016\/j.jss.2020.110821. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S016412122030217X","journal-title":"J Syst Softw"},{"issue":"9","key":"10257_CR5","doi-asserted-by":"publisher","first-page":"1319","DOI":"10.1016\/j.infsof.2009.04.002","volume":"51","author":"M Alshayeb","year":"2009","unstructured":"Alshayeb M (2009) Empirical investigation of refactoring effect on software quality. Inf Softw Technol 51(9):1319\u20131326. https:\/\/doi.org\/10.1016\/j.infsof.2009.04.002. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S095058490900038X","journal-title":"Inf Softw Technol"},{"key":"10257_CR6","doi-asserted-by":"publisher","unstructured":"Bakota T, Hegedu\u030bs P, K\u00f6rtv\u00e9lyesi P, Ferenc R, Gyim\u00f3thy T (2011) A probabilistic software quality model. In: 2011 27th IEEE international conference on software maintenance (ICSM). https:\/\/doi.org\/10.1109\/ICSM.2011.6080791, pp 243\u2013252","DOI":"10.1109\/ICSM.2011.6080791"},{"key":"10257_CR7","doi-asserted-by":"publisher","unstructured":"Bakota T, Hegedu\u030bs P, Siket I, Lad\u00e1nyi G, Ferenc R (2014) Qualitygate sourceaudit: a tool for assessing the technical quality of software. In: 2014 Software evolution week\u2014IEEE conference on software maintenance, reengineering, and reverse engineering (CSMR-WCRE). https:\/\/doi.org\/10.1109\/CSMR-WCRE.2014.6747214, pp 440\u2013445","DOI":"10.1109\/CSMR-WCRE.2014.6747214"},{"key":"10257_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.jss.2015.05.024","volume":"107","author":"G Bavota","year":"2015","unstructured":"Bavota G, De Lucia A, Di Penta M, Oliveto R, Palomba F (2015) An experimental investigation on the innate relationship between quality and refactoring. J Syst Softw 107:1\u201314. https:\/\/doi.org\/10.1016\/j.jss.2015.05.024. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0164121215001053","journal-title":"J Syst Softw"},{"key":"10257_CR9","unstructured":"Boehm BW, Brown JR, Lipow M (1976) Quantitative evaluation of software quality. In: Proceedings of the 2nd international conference on software engineering, ICSE \u201976. http:\/\/dl.acm.org\/citation.cfm?id=800253.807736. IEEE Computer Society Press, Los Alamitos, pp 592\u2013605"},{"key":"10257_CR10","doi-asserted-by":"crossref","unstructured":"Chahal KK, Saini M (2018) Developer dynamics and syntactic quality of commit messages in oss projects. In: Stamelos I, Gonzalez-Baraho\u00f1a J M, Varlamis I, Anagnostopoulos D (eds) Open source systems: enterprise software and solutions. Springer International Publishing, Cham, pp 61\u201376","DOI":"10.1007\/978-3-319-92375-8_6"},{"key":"10257_CR11","doi-asserted-by":"publisher","unstructured":"Ch\u2019avez A, Ferreira I, Fernandes E, Cedrim D, Garcia A (2017) How does refactoring affect internal quality attributes? A multi-project study. In: Proceedings of the 31st Brazilian symposium on software engineering, SBES\u201917. https:\/\/doi.org\/10.1145\/3131151.3131171. Association for Computing Machinery, New York, pp 74\u201383","DOI":"10.1145\/3131151.3131171"},{"issue":"6","key":"10257_CR12","doi-asserted-by":"publisher","first-page":"476","DOI":"10.1109\/32.295895","volume":"20","author":"SR Chidamber","year":"1994","unstructured":"Chidamber SR, Kemerer CF (1994) A metrics suite for object oriented design. IEEE Trans Softw Eng 20(6):476\u2013493. https:\/\/doi.org\/10.1109\/32.295895","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR13","doi-asserted-by":"crossref","unstructured":"Cliff N (1993) Dominance statistics: ordinal analyses to answer ordinal questions. Psychol Bull","DOI":"10.1037\/0033-2909.114.3.494"},{"issue":"1","key":"10257_CR14","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1177\/001316446002000104","volume":"20","author":"J Cohen","year":"1960","unstructured":"Cohen J (1960) A coefficient of agreement for nominal scales. Educ Psychol Meas 20(1):37\u201346. https:\/\/doi.org\/10.1177\/001316446002000104","journal-title":"Educ Psychol Meas"},{"issue":"4\u20135","key":"10257_CR15","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1007\/s10664-011-9173-9","volume":"17","author":"M D\u2019Ambros","year":"2012","unstructured":"D\u2019Ambros M, Lanza M, Robbes R (2012) Evaluating defect prediction approaches: a benchmark and an extensive comparison. Empirical Softw Engg 17(4\u20135):531\u2013577. https:\/\/doi.org\/10.1007\/s10664-011-9173-9","journal-title":"Empirical Softw Engg"},{"key":"10257_CR16","doi-asserted-by":"publisher","unstructured":"Dey T, Mousavi S, Ponce E, Fry T, Vasilescu B, Filippova A, Mockus A (2020) Detecting and characterizing bots that commit code. In: Proceedings of the 17th international conference on mining software repositories. https:\/\/doi.org\/10.1145\/3379597.3387478. Association for Computing Machinery, New York, pp 209\u2013219","DOI":"10.1145\/3379597.3387478"},{"key":"10257_CR17","doi-asserted-by":"publisher","unstructured":"Fakhoury S, Roy D, Hassan A, Arnaoudova V (2019) Improving source code readability: theory and practice. In: 2019 IEEE\/ACM 27th international conference on program comprehension (ICPC). https:\/\/doi.org\/10.1109\/ICPC.2019.00014, pp 2\u201312","DOI":"10.1109\/ICPC.2019.00014"},{"key":"10257_CR18","doi-asserted-by":"publisher","DOI":"10.1201\/b17461","volume-title":"Software metrics: a rigorous and practical approach, 3rd edn","author":"N Fenton","year":"2014","unstructured":"Fenton N, Bieman J (2014) Software metrics: a rigorous and practical approach, 3rd edn. CRC Press, Inc., Boca Raton"},{"key":"10257_CR19","doi-asserted-by":"publisher","first-page":"110691","DOI":"10.1016\/j.jss.2020.110691","volume":"169","author":"R Ferenc","year":"2020","unstructured":"Ferenc R, Gyimesi P, Gyimesi G, T\u00f3th Z, Gyim\u00f3thy T (2020) An automatically created novel bug dataset and its validation in bug prediction. J Syst Softw 169:110691. https:\/\/doi.org\/10.1016\/j.jss.2020.110691. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0164121220301436","journal-title":"J Syst Softw"},{"key":"10257_CR20","doi-asserted-by":"publisher","first-page":"369","DOI":"10.1016\/j.infsof.2014.05.017","volume":"57","author":"Y Fu","year":"2015","unstructured":"Fu Y, Yan M, Zhang X, Xu L, Yang D, Kymer JD (2015) Automated classification of software change messages by semi-supervised latent dirichlet allocation. Inf Softw Technol 57:369\u2013377. https:\/\/doi.org\/10.1016\/j.infsof.2014.05.017. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0950584914001347","journal-title":"Inf Softw Technol"},{"key":"10257_CR21","doi-asserted-by":"publisher","first-page":"106566","DOI":"10.1016\/j.infsof.2021.106566","volume":"135","author":"L Ghadhab","year":"2021","unstructured":"Ghadhab L, Jenhani I, Mkaouer MW, Ben Messaoud M (2021) Augmenting commit classification by using fine-grained source code changes and a pre-trained deep neural language model. Inf Softw Technol 135:106566. https:\/\/doi.org\/10.1016\/j.infsof.2021.106566. https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0950584921000495","journal-title":"Inf Softw Technol"},{"key":"10257_CR22","doi-asserted-by":"publisher","unstructured":"Gharbi S, Mkaouer MW, Jenhani I, Messaoud MB (2019) On the classification of software change messages using multi-label active learning. In: Proceedings of the 34th ACM\/SIGAPP symposium on applied computing, SAC \u201919. https:\/\/doi.org\/10.1145\/3297280.3297452. Association for Computing Machinery, New York, pp 1760\u20131767","DOI":"10.1145\/3297280.3297452"},{"key":"10257_CR23","unstructured":"Griessom RJ, Kim JJ (2005) Effect sizes for research: a broad practical approach. Lawrence Erlbaum Associates Publishers"},{"issue":"10","key":"10257_CR24","doi-asserted-by":"publisher","first-page":"897","DOI":"10.1109\/TSE.2005.112","volume":"31","author":"T Gyimothy","year":"2005","unstructured":"Gyimothy T, Ferenc R, Siket I (2005) Empirical validation of object-oriented metrics on open source software for fault prediction. IEEE Trans Softw Eng 31(10):897\u2013910. https:\/\/doi.org\/10.1109\/TSE.2005.112","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR25","doi-asserted-by":"publisher","unstructured":"Hattori LP, Lanza M (2008) On the nature of commits. In: Proceedings of the 23rd IEEE\/ACM international conference on automated software engineering, ASE\u201908. https:\/\/doi.org\/10.1109\/ASEW.2008.4686322. IEEE Press, Piscataway, pp III\u201363\u2013III\u201371","DOI":"10.1109\/ASEW.2008.4686322"},{"key":"10257_CR26","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1007\/s10664-021-10092-4","volume":"27","author":"S Herbold","year":"2022","unstructured":"Herbold S, Trautsch A, Trautsch F, Ledel B (2022) Problems with SZZ and features: An empirical study of the state of practice of defect prediction data collection. Empir Software Eng 27:42. https:\/\/doi.org\/10.1007\/s10664-021-10092-4","journal-title":"Empir Software Eng"},{"key":"10257_CR27","doi-asserted-by":"crossref","unstructured":"Herzig K, Just S, Zeller A (2013) It\u2019s not a bug, it\u2019s a feature: how misclassification impacts bug prediction. In: Proceedings of the 2013 international conference on software engineering, ICSE \u201913. IEEE Press, pp 392\u2013401","DOI":"10.1109\/ICSE.2013.6606585"},{"key":"10257_CR28","doi-asserted-by":"publisher","unstructured":"H\u00f6nel S, Ericsson M, L\u00f6we W, Wingkvist A (2019) Importance and aptitude of source code density for commit classification into maintenance activities. In: 2019 IEEE 19th international conference on software quality, reliability and security (QRS). https:\/\/doi.org\/10.1109\/QRS.2019.00027, pp 109\u2013120","DOI":"10.1109\/QRS.2019.00027"},{"issue":"99","key":"10257_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TSE.2017.2770124","volume":"PP","author":"S Hosseini","year":"2017","unstructured":"Hosseini S, Turhan B, Gunarathna D (2017) A systematic literature review and meta-analysis on cross project defect prediction. IEEE Trans Softw Eng PP(99):1\u20131. https:\/\/doi.org\/10.1109\/TSE.2017.2770124","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR30","doi-asserted-by":"publisher","unstructured":"Huang Q, Xia X, Lo D (2017) Supervised vs unsupervised models: a holistic look at effort-aware just-in-time defect prediction. In: 2017 IEEE International conference on software maintenance and evolution (ICSME). https:\/\/doi.org\/10.1109\/ICSME.2017.51, pp 159\u2013170","DOI":"10.1109\/ICSME.2017.51"},{"key":"10257_CR31","unstructured":"ISO\/IEC (2001) Iso\/iec 9126. software engineering\u2014product quality"},{"key":"10257_CR32","unstructured":"ISO\/IEC (2011) ISO\/IEC 25010:2011, systems and software engineering\u2014systems and software quality requirements and evaluation (square)\u2014system and software quality models"},{"key":"10257_CR33","doi-asserted-by":"publisher","unstructured":"Jureczko M, Madeyski L (2010) Towards identifying software project clusters with regard to defect prediction. In: Proceedings of the 6th international conference on predictive models in software engineering, PROMISE \u201910. https:\/\/doi.org\/10.1145\/1868328.1868342. Association for Computing Machinery, New York","DOI":"10.1145\/1868328.1868342"},{"issue":"6","key":"10257_CR34","doi-asserted-by":"publisher","first-page":"757","DOI":"10.1109\/TSE.2012.70","volume":"39","author":"Y Kamei","year":"2013","unstructured":"Kamei Y, Shihab E, Adams B, Hassan AE, Mockus A, Sinha A, Ubayashi N (2013) A large-scale empirical study of just-in-time quality assurance. IEEE Trans Softw Eng 39(6):757\u2013773. https:\/\/doi.org\/10.1109\/TSE.2012.70","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR35","doi-asserted-by":"publisher","unstructured":"Kim S, Zimmermann T, Whitehead E J Jr, Zeller A (2007) Predicting faults from cached history. In: 29th International conference on software engineering (ICSE\u201907). https:\/\/doi.org\/10.1109\/ICSE.2007.66, pp 489\u2013498","DOI":"10.1109\/ICSE.2007.66"},{"issue":"1","key":"10257_CR36","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1109\/52.476281","volume":"13","author":"B Kitchenham","year":"1996","unstructured":"Kitchenham B, Pfleeger SL (1996) Software quality: the elusive target [special issues section]. IEEE Softw 13(1):12\u201321. https:\/\/doi.org\/10.1109\/52.476281","journal-title":"IEEE Softw"},{"issue":"2","key":"10257_CR37","doi-asserted-by":"publisher","first-page":"363","DOI":"10.2307\/2529786","volume":"33","author":"JR Landis","year":"1977","unstructured":"Landis JR, Koch GG (1977) An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics 33(2):363\u2013374. http:\/\/www.jstor.org\/stable\/2529786","journal-title":"Biometrics"},{"key":"10257_CR38","doi-asserted-by":"publisher","unstructured":"Levin S, Yehudai A (2017) Boosting automatic commit classification into maintenance activities by utilizing source code changes. In: Proceedings of the 13th international conference on predictive models and data analytics in software engineering, PROMISE. https:\/\/doi.org\/10.1145\/3127005.3127016. Association for Computing Machinery, New York, pp 97\u2013106","DOI":"10.1145\/3127005.3127016"},{"key":"10257_CR39","doi-asserted-by":"publisher","unstructured":"Lewis C, Lin Z, Sadowski C, Zhu X, Ou R, Whitehead EJ (2013) Does bug prediction support human developers? findings from a google case study. In: 2013 35th International conference on software engineering (ICSE). https:\/\/doi.org\/10.1109\/ICSE.2013.6606583, pp 372\u2013381","DOI":"10.1109\/ICSE.2013.6606583"},{"issue":"1","key":"10257_CR40","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1214\/aoms\/1177730491","volume":"18","author":"HB Mann","year":"1947","unstructured":"Mann HB, Whitney DR (1947) On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 18(1):50\u201360","journal-title":"Ann Math Stat"},{"key":"10257_CR41","doi-asserted-by":"publisher","unstructured":"Mauczka A, Huber M, Schanes C, Schramm W, Bernhart M, Grechenig T (2012) Tracing your maintenance work\u2014a cross-project validation of an automated classification dictionary for commit messages. In: Proceedings of the 15th international conference on fundamental approaches to software engineering, FASE\u201912. https:\/\/doi.org\/10.1007\/978-3-642-28872-2\u2216_21. Springer, Berlin, pp 301\u2013315","DOI":"10.1007\/978-3-642-28872-2\u2216_21"},{"key":"10257_CR42","doi-asserted-by":"crossref","unstructured":"Mauczka A, Brosch F, Schanes C, Grechenig T (2015) Dataset of developer-labeled commit messages. In: Proceedings of the 12th working conference on mining software repositories, MSR \u201915. http:\/\/dl.acm.org\/citation.cfm?id=2820518.2820595. IEEE Press, Piscataway, pp 490\u2013493","DOI":"10.1109\/MSR.2015.71"},{"issue":"4","key":"10257_CR43","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1109\/TSE.1976.233837","volume":"2","author":"TJ McCabe","year":"1976","unstructured":"McCabe TJ (1976) A complexity measure. IEEE Trans Softw Eng 2(4):308\u2013320. https:\/\/doi.org\/10.1109\/TSE.1976.233837","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR44","unstructured":"McCall JA, Richards PK, Walters GF (1977) Factors in software quality: concept and definitions of software quality, vol 1(3). Rome Air Development Center, Air Force Systems Command, Griffiss Air Force Base, New York"},{"key":"10257_CR45","doi-asserted-by":"publisher","unstructured":"Menzies T, Turhan B, Bener A, Gay G, Cukic B, Jiang Y (2008) Implications of ceiling effects in defect predictors. In: Proceedings of the 4th international workshop on predictor models in software engineering, PROMISE \u201908. https:\/\/doi.org\/10.1145\/1370788.1370801. Association for Computing Machinery, New York, pp 47\u201354","DOI":"10.1145\/1370788.1370801"},{"key":"10257_CR46","doi-asserted-by":"publisher","unstructured":"Mockus Votta (2000) Identifying reasons for software changes using historic databases. In: Proceedings 2000 international conference on software maintenance. https:\/\/doi.org\/10.1109\/ICSM.2000.883028, pp 120\u2013130","DOI":"10.1109\/ICSM.2000.883028"},{"key":"10257_CR47","doi-asserted-by":"publisher","unstructured":"Mordal-Manet K, Balmas F, Denier S, Ducasse S, Wertz H, Laval J, Bellingard F, Vaillergues P (2009) The squale model\u2014a practice-based industrial quality model. In: 2009 IEEE International conference on software maintenance. https:\/\/doi.org\/10.1109\/ICSM.2009.5306381, pp 531\u2013534","DOI":"10.1109\/ICSM.2009.5306381"},{"key":"10257_CR48","unstructured":"NASA (2004) Nasa IV & V facility metrics data program. http:\/\/mdp.ivv.nasa.gov\/repository.html"},{"key":"10257_CR49","doi-asserted-by":"publisher","unstructured":"Pantiuchina J, Lanza M, Bavota G (2018) Improving code: the (mis) perception of quality metrics. In: 2018 IEEE International conference on software maintenance and evolution (ICSME). https:\/\/doi.org\/10.1109\/ICSME.2018.00017, pp 80\u201391","DOI":"10.1109\/ICSME.2018.00017"},{"key":"10257_CR50","doi-asserted-by":"publisher","unstructured":"Pantiuchina J, Zampetti F, Scalabrino S, Piantadosi V, Oliveto R, Bavota G, Penta MD (2020) Why developers refactor source code: a mining-based study. ACM Trans Softw Eng Methodol 29(4). https:\/\/doi.org\/10.1145\/3408302","DOI":"10.1145\/3408302"},{"key":"10257_CR51","unstructured":"Parnas DL (2001) Software aging. Addison-Wesley Longman Publishing Co., Inc, pp 551\u2013567"},{"key":"10257_CR52","doi-asserted-by":"publisher","unstructured":"Peitek N, Apel S, Parnin C, Brechmann A, Siegmund J (2021) Program comprehension and code complexity metrics: an fmri study. In: 2021 IEEE\/ACM 43rd international conference on software engineering (ICSE). https:\/\/doi.org\/10.1109\/ICSE43902.2021.00056, pp 524\u2013536","DOI":"10.1109\/ICSE43902.2021.00056"},{"issue":"6","key":"10257_CR53","doi-asserted-by":"publisher","first-page":"511","DOI":"10.1109\/TSE.2005.74","volume":"31","author":"R Purushothaman","year":"2005","unstructured":"Purushothaman R, Perry DE (2005) Toward understanding the rhetoric of small source code changes. IEEE Trans Softw Eng 31(6):511\u2013526. https:\/\/doi.org\/10.1109\/TSE.2005.74","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR54","doi-asserted-by":"publisher","unstructured":"Rahman F, Posnett D, Hindle A, Barr E, Devanbu P (2011) Bugcache for inspections: hit or miss?. In: Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on foundations of software engineering, ESEC\/FSE \u201911. https:\/\/doi.org\/10.1145\/2025113.2025157. Association for Computing Machinery, New York, pp 322\u2013331","DOI":"10.1145\/2025113.2025157"},{"key":"10257_CR55","doi-asserted-by":"publisher","unstructured":"Santos EA, Hindle A (2016) Judging a commit by its cover: correlating commit message entropy with build status on travis-ci. In: Proceedings of the 13th international conference on mining software repositories, MSR \u201916. https:\/\/doi.org\/10.1145\/2901739.2903493. Association for Computing Machinery, New York, pp 504\u2013507","DOI":"10.1145\/2901739.2903493"},{"issue":"3","key":"10257_CR56","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1109\/TSE.2019.2901468","volume":"47","author":"S Scalabrino","year":"2021","unstructured":"Scalabrino S, Bavota G, Vendome C, Linares-V\u00e1squez M, Poshyvanyk D, Oliveto R (2021) Automatically assessing code understandability. IEEE Trans Softw Eng 47(3):595\u2013613. https:\/\/doi.org\/10.1109\/TSE.2019.2901468","journal-title":"IEEE Trans Softw Eng"},{"key":"10257_CR57","doi-asserted-by":"publisher","unstructured":"Stroggylos K, Spinellis D (2007) Refactoring\u2013does it improve software quality?. In: Fifth international workshop on software quality (woSQ\u201907: ICSE workshops 2007). https:\/\/doi.org\/10.1109\/WOSQ.2007.11, pp 10\u201310","DOI":"10.1109\/WOSQ.2007.11"},{"key":"10257_CR58","unstructured":"Swanson EB (1976) The dimensions of maintenance. In: Proceedings of the 2nd international conference on software engineering. ICSE \u201976. IEEE Computer Society Press, Washington, DC, pp 492\u2013497"},{"key":"10257_CR59","doi-asserted-by":"publisher","unstructured":"Tian Y, Zhang Y, Stol KJ, Jiang L, Liu H (2022) What makes a good commit message?. In: Proceedings of the 44th international conference on software engineering, ICSE \u201922. https:\/\/doi.org\/10.1145\/3510003.3510205. Association for Computing Machinery, New York, pp 2389\u20132401","DOI":"10.1145\/3510003.3510205"},{"key":"10257_CR60","doi-asserted-by":"publisher","unstructured":"Trautsch A, Herbold S, Grabowski J (2020a) A longitudinal study of static analysis warning evolution and the effects of PMD on software quality in apache open source projects. Empir Softw Eng. https:\/\/doi.org\/10.1007\/s10664-020-09880-1","DOI":"10.1007\/s10664-020-09880-1"},{"key":"10257_CR61","doi-asserted-by":"crossref","unstructured":"Trautsch A, Trautsch F, Herbold S, Ledel B, Grabowski J (2020b) The smartshark ecosystem for software repository mining. In: Proceedings of the 42st international conference on software engineering - demonstrations. ACM","DOI":"10.1145\/3377812.3382139"},{"key":"10257_CR62","unstructured":"Trautsch A, Erbel J, Herbold S, Grabowski J (2021) Replication kit. https:\/\/github.com\/atrautsch\/emse2021_replication"},{"key":"10257_CR63","doi-asserted-by":"publisher","unstructured":"Trautsch F, Herbold S, Makedonski P, Grabowski J (2017) Addressing problems with replicability and validity of repository mining studies through a smart data platform. Empir Softw Eng. https:\/\/doi.org\/10.1007\/s10664-017-9537-x","DOI":"10.1007\/s10664-017-9537-x"},{"key":"10257_CR64","doi-asserted-by":"publisher","unstructured":"von der Mosel J, Trautsch A, Herbold S (2022) On the validity of pre-trained transformers for natural language processing in the software engineering domain. IEEE Transactions on Software Engineering, 1\u20131. https:\/\/doi.org\/10.1109\/TSE.2022.3178469","DOI":"10.1109\/TSE.2022.3178469"},{"key":"10257_CR65","doi-asserted-by":"crossref","unstructured":"Wagner S, Lochmann K, Heinemann L, Kl\u00e4s M, Trendowicz A, Pl\u00f6sch R, Seidl A, Goeb A, Streit J (2012) The quamoco product quality modelling and assessment approach. In: Proceedings of the 34th International conference on software engineering, ICSE \u201912. http:\/\/dl.acm.org\/citation.cfm?id=2337223.2337372. IEEE Press, Piscataway, pp 1133\u20131142","DOI":"10.1109\/ICSE.2012.6227106"},{"key":"10257_CR66","doi-asserted-by":"publisher","first-page":"106408","DOI":"10.1016\/j.infsof.2020.106408","volume":"130","author":"S Wang","year":"2021","unstructured":"Wang S, Bansal C, Nagappan N (2021) Large-scale intent analysis for identifying large-review-effort code changes. Inf Softw Technol 130:106408. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0950584920300033","journal-title":"Inf Softw Technol"},{"issue":"3-4","key":"10257_CR67","doi-asserted-by":"publisher","first-page":"591","DOI":"10.1093\/biomet\/52.3-4.591","volume":"52","author":"MB Wilk","year":"1965","unstructured":"Wilk MB, Shapiro SS (1965) An analysis of variance test for normality (complete samples). Biometrika 52(3-4):591\u2013611. https:\/\/doi.org\/10.1093\/biomet\/52.3-4.591","journal-title":"Biometrika"},{"key":"10257_CR68","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-4625-2","volume-title":"Experimentation in software engineering: an introduction","author":"C Wohlin","year":"2000","unstructured":"Wohlin C, Runeson P, H\u00f6st M, Ohlsson MC, Regnell B, Wessl\u00e9n A (2000) Experimentation in software engineering: an introduction. Kluwer Academic Publishers, Norwell"},{"key":"10257_CR69","doi-asserted-by":"publisher","first-page":"296","DOI":"10.1016\/j.jss.2015.12.019","volume":"113","author":"M Yan","year":"2016","unstructured":"Yan M, Fu Y, Zhang X, Yang D, Xu L, Kymer JD (2016) Automatically classifying software changes via discriminative topic model: supporting multi-category and cross-project. J Syst Softw 113:296\u2013308. https:\/\/doi.org\/10.1016\/j.jss.2015.12.019. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S016412121500285X","journal-title":"J Syst Softw"},{"key":"10257_CR70","doi-asserted-by":"publisher","unstructured":"Yatish S, Jiarpakdee J, Thongtanunam P, Tantithamthavorn C (2019) Mining software defects: should we consider affected releases?. In: 2019 IEEE\/ACM 41st international conference on software engineering (ICSE). https:\/\/doi.org\/10.1109\/ICSE.2019.00075, pp 654\u2013665","DOI":"10.1109\/ICSE.2019.00075"},{"key":"10257_CR71","doi-asserted-by":"publisher","unstructured":"Zhou Y, Yang Y, Lu H, Chen L, Li Y, Zhao Y, Qian J, Xu B (2018) How far we have progressed in the journey? An examination of cross-project defect prediction. ACM Trans Softw Eng Methodol 27(1). https:\/\/doi.org\/10.1145\/3183339","DOI":"10.1145\/3183339"}],"container-title":["Empirical Software Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10664-022-10257-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10664-022-10257-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10664-022-10257-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,3]],"date-time":"2023-04-03T06:55:53Z","timestamp":1680504953000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10664-022-10257-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,14]]},"references-count":71,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,3]]}},"alternative-id":["10257"],"URL":"https:\/\/doi.org\/10.1007\/s10664-022-10257-9","relation":{},"ISSN":["1382-3256","1573-7616"],"issn-type":[{"value":"1382-3256","type":"print"},{"value":"1573-7616","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,14]]},"assertion":[{"value":"2 November 2022","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 January 2023","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no competing interests to declare that are relevant to the content of this article.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"<!--Emphasis Type='Bold' removed-->Conflict of Interests"}}],"article-number":"30"}}