{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T23:09:53Z","timestamp":1772492993153,"version":"3.50.1"},"reference-count":27,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2023,5,2]],"date-time":"2023-05-02T00:00:00Z","timestamp":1682985600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"5G-based research and application demonstration of key technologies for a new generation of automatic intelligent terminals","award":["2021YFQ0054"],"award-info":[{"award-number":["2021YFQ0054"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Requirement traceability links are an essential part of requirement management software and are a basic prerequisite for software artifact changes. The manual establishment of requirement traceability links is time-consuming. When faced with large projects, requirement managers spend a lot of time in establishing relationships from numerous requirements and codes. However, existing techniques for automatic requirement traceability link recovery are limited by the semantic disparity between natural language and programming language, resulting in many methods being less accurate. In this paper, we propose a fine-grained requirement-code traceability link recovery approach based on query expansion, which analyzes the semantic similarity between requirements and codes from a fine-grained perspective, and uses a query expansion technique to establish valid links that deviate from the query, so as to further improve the accuracy of traceability link recovery. Experiments showed that the approach proposed in this paper outperforms state-of-the-art unsupervised traceability link recovery methods, not only specifying the obvious advantages of fine-grained structure analysis for word embedding-based traceability link recovery, but also improving the accuracy of establishing requirement traceability links. The experimental results demonstrate the superiority of our approach.<\/jats:p>","DOI":"10.3390\/info14050270","type":"journal-article","created":{"date-parts":[[2023,5,3]],"date-time":"2023-05-03T01:36:38Z","timestamp":1683077798000},"page":"270","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Enhancing Traceability Link Recovery with Fine-Grained Query Expansion Analysis"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-9239-3082","authenticated-orcid":false,"given":"Tao","family":"Peng","sequence":"first","affiliation":[{"name":"School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China"}]},{"given":"Kun","family":"She","sequence":"additional","affiliation":[{"name":"School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China"}]},{"given":"Yimin","family":"Shen","sequence":"additional","affiliation":[{"name":"School of Big Data and Artificial Intelligence, Chengdu Technological University, Chengdu 610031, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7675-113X","authenticated-orcid":false,"given":"Xiangliang","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3452-8391","authenticated-orcid":false,"given":"Yue","family":"Yu","sequence":"additional","affiliation":[{"name":"School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,5,2]]},"reference":[{"key":"ref_1","unstructured":"Gotel, O., and Finkelstein, C. (1994, January 18\u201322). An analysis of the requirements traceability problem. Proceedings of the IEEE International Conference on Requirements Engineering, Colorado Springs, CO, USA."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Florez, J.M. (2019, January 25\u201331). Automated Fine-Grained Requirements-to-Code Traceability Link Recovery. Proceedings of the 2019 IEEE\/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion), Montreal, QC, Canada.","DOI":"10.1109\/ICSE-Companion.2019.00087"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Falessi, D., Di Penta, M., Canfora, G., and Cantone, G. (2018, January 3\u20137). Estimating the Number of Remaining Links in Traceability Recovery (Journal-First Abstract). Proceedings of the 33rd ACM\/IEEE International Conference on Automated Software Engineering, Montpellier, France. ASE \u201918.","DOI":"10.1145\/3238147.3241982"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1145\/361219.361220","article-title":"A Vector Space Model for Automatic Indexing","volume":"18","author":"Salton","year":"1975","journal-title":"Commun. ACM"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Marcus, A., and Maletic, J. (2003, January 3\u201310). Recovering documentation-to-source-code traceability links using latent semantic indexing. Proceedings of the 25th International Conference on Software Engineering, Portland, OR, USA.","DOI":"10.1109\/ICSE.2003.1201194"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Asuncion, H.U., Asuncion, A.U., and Taylor, R.N. (2010, January 1\u20138). Software Traceability with Topic Modeling. Proceedings of the 32nd ACM\/IEEE International Conference on Software Engineering-Volume 1, Cape Town, South Africa. ICSE \u201910.","DOI":"10.1145\/1806799.1806817"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Guo, J., Cheng, J., and Cleland-Huang, J. (2017, January 20\u201328). Semantically Enhanced Software Traceability Using Deep Learning Techniques. Proceedings of the 2017 IEEE\/ACM 39th International Conference on Software Engineering (ICSE), Buenos Aires, Argentina.","DOI":"10.1109\/ICSE.2017.9"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1103","DOI":"10.1007\/s10664-016-9479-8","article-title":"Tackling the term-mismatch problem in automated trace retrieval","volume":"22","author":"Guo","year":"2017","journal-title":"Empir. Softw. Eng."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1007\/s10664-015-9421-5","article-title":"Learning to rank code examples for code search engines","volume":"22","author":"Niu","year":"2017","journal-title":"Empir. Softw. Eng."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Aung, T.W.W., Huo, H., and Sui, Y. (2020, January 13\u201315). A Literature Review of Automatic Traceability Links Recovery for Software Change Impact Analysis. Proceedings of the 28th International Conference on Program Comprehension, Seoul, Republic of Korea. ICPC \u201920.","DOI":"10.1145\/3387904.3389251"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Mahmoud, A., Niu, N., and Xu, S. (2012, January 11\u201313). A semantic relatedness approach for traceability link recovery. Proceedings of the 2012 20th IEEE International Conference on Program Comprehension (ICPC), Passau, Germany.","DOI":"10.1109\/ICPC.2012.6240487"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Ye, X., Bunescu, R., and Liu, C. (2014, January 16\u201321). Learning to Rank Relevant Files for Bug Reports Using Domain Knowledge. Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, Hong Kong, China. FSE 2014.","DOI":"10.1145\/2635868.2635874"},{"key":"ref_13","unstructured":"Moran, K., Palacio, D.N., Bernal-C\u00e1rdenas, C., McCrystal, D., Poshyvanyk, D., Shenefiel, C., and Johnson, J. (July, January 27). Improving the Effectiveness of Traceability Link Recovery Using Hierarchical Bayesian Networks. Proceedings of the ACM\/IEEE 42nd International Conference on Software Engineering, Seoul, Republic of Korea. ICSE \u201920."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Hey, T., Chen, F., Weigelt, S., and Tichy, W.F. (October, January 27). Improving Traceability Link Recovery Using Fine-grained Requirements-to-Code Relations. Proceedings of the 2021 IEEE International Conference on Software Maintenance and Evolution (ICSME), Luxembourg.","DOI":"10.1109\/ICSME52107.2021.00008"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Pennington, J., Socher, R., and Manning, C.D. (2014, January 25\u201329). Glove: Global vectors for word representation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar.","DOI":"10.3115\/v1\/D14-1162"},{"key":"ref_16","unstructured":"Grave, E., Bojanowski, P., Gupta, P., Joulin, A., and Mikolov, T. (2018). Learning Word Vectors for 157 Languages. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhao, T., Cao, Q., and Sun, Q. (2017, January 4\u20138). An Improved Approach to Traceability Recovery Based on Word Embeddings. Proceedings of the 2017 24th Asia-Pacific Software Engineering Conference (APSEC), Nanjing, China.","DOI":"10.1109\/APSEC.2017.14"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Lohar, S., Amornborvornwong, S., Zisman, A., and Cleland-Huang, J. (2013, January 18\u201326). Improving Trace Accuracy through Data-Driven Configuration and Composition of Tracing Features. Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering, Saint Petersburg, Russia. ESEC\/FSE 2013.","DOI":"10.1145\/2491411.2491432"},{"key":"ref_19","unstructured":"(2022, October 06). Center of Excellence for Software & Systems Traceability (CoEST)-Datasets. Available online: http:\/\/sarec.nd.edu\/coest\/datasets.html."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Panichella, A., McMillan, C., Moritz, E., Palmieri, D., Oliveto, R., Poshyvanyk, D., and De Lucia, A. (2013, January 5\u20138). When and How Using Structural Information to Improve IR-Based Traceability Recovery. Proceedings of the 2013 17th European Conference on Software Maintenance and Reengineering, Genova, Italy.","DOI":"10.1109\/CSMR.2013.29"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"838","DOI":"10.1002\/smr.1736","article-title":"Can method data dependencies support the assessment of traceability between requirements and source code?","volume":"27","author":"Kuang","year":"2015","journal-title":"J. Softw. Evol. Process"},{"key":"ref_22","first-page":"1","article-title":"Predictive Models in Software Engineering: Challenges and Opportunities","volume":"31","author":"Yang","year":"2022","journal-title":"ACM Trans. Softw. Eng. Methodol."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"110406","DOI":"10.1016\/j.jss.2019.110406","article-title":"DeepLink: Recovering issue-commit links based on deep learning","volume":"158","author":"Ruan","year":"2019","journal-title":"J. Syst. Softw."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Chen, L., Wang, D., Wang, J., and Wang, Q. (2019, January 2\u20135). Enhancing Unsupervised Requirements Traceability with Sequential Semantics. Proceedings of the 2019 26th Asia-Pacific Software Engineering Conference (APSEC), Putrajaya, Malaysia.","DOI":"10.1109\/APSEC48747.2019.00013"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Mills, C., Escobar-Avila, J., Bhattacharya, A., Kondyukov, G., Chakraborty, S., and Haiduc, S. (October, January 29). Tracing with Less Data: Active Learning for Classification-Based Traceability Link Recovery. Proceedings of the 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME), Cleveland, OH, USA.","DOI":"10.1109\/ICSME.2019.00020"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Mills, C., Escobar-Avila, J., and Haiduc, S. (2018, January 23\u201329). Automatic Traceability Maintenance via Machine Learning Classification. Proceedings of the 2018 IEEE International Conference on Software Maintenance and Evolution (ICSME), Madrid, Spain.","DOI":"10.1109\/ICSME.2018.00045"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Prause, C.R. (2009, January 20\u201321). Maintaining Fine-Grained Code Metadata Regardless of Moving, Copying and Merging. Proceedings of the 2009 Ninth IEEE International Working Conference on Source Code Analysis and Manipulation, Edmonton, AB, Canada.","DOI":"10.1109\/SCAM.2009.20"}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/5\/270\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:28:07Z","timestamp":1760124487000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/5\/270"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,2]]},"references-count":27,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2023,5]]}},"alternative-id":["info14050270"],"URL":"https:\/\/doi.org\/10.3390\/info14050270","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,2]]}}}