{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,4]],"date-time":"2026-06-04T01:25:15Z","timestamp":1780536315065,"version":"3.54.1"},"reference-count":139,"publisher":"Association for Computing Machinery (ACM)","issue":"11","license":[{"start":{"date-parts":[[2023,2,9]],"date-time":"2023-02-09T00:00:00Z","timestamp":1675900800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"European Research Council","award":["851895"],"award-info":[{"award-number":["851895"]}]},{"name":"German Research Foundation within the ConcSys and DeMoCo"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2023,11,30]]},"abstract":"<jats:p>The immense amounts of source code provide ample challenges and opportunities during software development. To handle the size of code bases, developers commonly search for code, e.g., when trying to find where a particular feature is implemented or when looking for code examples to reuse. To support developers in finding relevant code, various code search engines have been proposed. This article surveys 30 years of research on code search, giving a comprehensive overview of challenges and techniques that address them. We discuss the kinds of queries that code search engines support, how to preprocess and expand queries, different techniques for indexing and retrieving code, and ways to rank and prune search results. Moreover, we describe empirical studies of code search in practice. 
Based on the discussion of prior work, we conclude the article with an outline of challenges and opportunities to be addressed in the future.<\/jats:p>","DOI":"10.1145\/3565971","type":"journal-article","created":{"date-parts":[[2022,10,12]],"date-time":"2022-10-12T13:10:20Z","timestamp":1665580220000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":50,"title":["Code Search: A Survey of Techniques for Finding Code"],"prefix":"10.1145","volume":"55","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5306-8645","authenticated-orcid":false,"given":"Luca","family":"Di Grazia","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of Stuttgart, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1623-498X","authenticated-orcid":false,"given":"Michael","family":"Pradel","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Stuttgart, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2023,2,9]]},"reference":[{"key":"e_1_3_2_2_2","volume-title":"ISO\/IEC 14882 International Standard - First Edition 1998-09-01: Programming Languages C++","year":"1998","unstructured":"1998. ISO\/IEC 14882 International Standard - First Edition 1998-09-01: Programming Languages C++. ISO."},{"key":"e_1_3_2_3_2","volume-title":"Compilers: Principles, Techniques, & Tools","author":"Aho Alfred V.","year":"2007","unstructured":"Alfred V. Aho, Monica S. Lam, Ravi Sethi, and Jeffrey D. Ullman. 2007. Compilers: Principles, Techniques, & Tools. Pearson Education India."},{"key":"e_1_3_2_4_2","first-page":"1","volume-title":"Proceedings of the IEEE\/ACM 16th International Conference on Mining Software Repositories (MSR\u201919)","author":"Akbar S.","year":"2019","unstructured":"S. Akbar and A. Kak. 2019. 
SCOR: Source code retrieval with semantics and order. In Proceedings of the IEEE\/ACM 16th International Conference on Mining Software Repositories (MSR\u201919). 1\u201312."},{"key":"e_1_3_2_5_2","first-page":"2055","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Babenko Artem","year":"2016","unstructured":"Artem Babenko and Victor Lempitsky. 2016. Efficient indexing of billion-scale datasets of deep descriptors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2055\u20132063."},{"key":"e_1_3_2_6_2","doi-asserted-by":"crossref","first-page":"681","DOI":"10.1145\/1176617.1176671","volume-title":"Companion to the 21st ACM SIGPLAN Symposium on Object-oriented Programming Systems, Languages, and Applications","author":"Bajracharya Sushil","year":"2006","unstructured":"Sushil Bajracharya, Trung Ngo, Erik Linstead, Yimeng Dou, Paul Rigor, Pierre Baldi, and Cristina Lopes. 2006. Sourcerer: A search engine for open source code supporting structure-based search. In Companion to the 21st ACM SIGPLAN Symposium on Object-oriented Programming Systems, Languages, and Applications. 681\u2013682."},{"key":"e_1_3_2_7_2","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1109\/MSR.2009.5069489","volume-title":"Proceedings of the 6th International Working Conference on Mining Software Repositories (MSR 2009) (Co-located with ICSE)","author":"Bajracharya Sushil Krishna","year":"2009","unstructured":"Sushil Krishna Bajracharya and Cristina Videira Lopes. 2009. Mining search topics from a code search engine usage log. In Proceedings of the 6th International Working Conference on Mining Software Repositories (MSR 2009) (Co-located with ICSE), Michael W. Godfrey and Jim Whitehead (Eds.). IEEE Computer Society, 111\u2013120. 
DOI:10.1109\/MSR.2009.5069489"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-010-9144-6"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/1882291.1882316"},{"key":"e_1_3_2_10_2","first-page":"467","volume-title":"Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201915)","author":"Balachandran V.","year":"2015","unstructured":"V. Balachandran. 2015. Query by example in large-scale code repositories. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201915). 467\u2013476."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1145\/93597.98741"},{"key":"e_1_3_2_12_2","first-page":"513","volume-title":"Proceedings of the 28th International Conference on Human Factors in Computing Systems (CHI\u201910)","author":"Brandt Joel","year":"2010","unstructured":"Joel Brandt, Mira Dontcheva, Marcos Weskamp, and Scott R. Klemmer. 2010. Example-centric programming: Integrating web search into the development environment. In Proceedings of the 28th International Conference on Human Factors in Computing Systems (CHI\u201910), Elizabeth D. Mynatt, Don Schoner, Geraldine Fitzpatrick, Scott E. Hudson, W. Keith Edwards, and Tom Rodden (Eds.). ACM, 513\u2013522. DOI:10.1145\/1753326.1753402"},{"issue":"1","key":"e_1_3_2_13_2","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/S0169-7552(98)00110-X","article-title":"The anatomy of a large-scale hypertextual web search engine","volume":"30","author":"Brin Sergey","year":"1998","unstructured":"Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30, 1\u20137 (1998), 107\u2013117.","journal-title":"Comput. Netw. 
ISDN Syst."},{"key":"e_1_3_2_14_2","volume-title":"Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems (NeurIPS\u201920)","author":"Brown Tom B.","year":"2020","unstructured":"Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language models are few-shot learners. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems (NeurIPS\u201920), Hugo Larochelle, Marc\u2019Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin (Eds.)."},{"key":"e_1_3_2_15_2","first-page":"213","volume-title":"Proceedings of the European Software Engineering Conference and International Symposium on Foundations of Software Engineering (ESEC\/FSE\u201909)","author":"Bruch Marcel","year":"2009","unstructured":"Marcel Bruch, Martin Monperrus, and Mira Mezini. 2009. Learning from examples to improve code completion systems. In Proceedings of the European Software Engineering Conference and International Symposium on Foundations of Software Engineering (ESEC\/FSE\u201909). ACM, 213\u2013222."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/3338906.3340458"},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1145\/3510003.3510125","volume-title":"Proceedings of the 44th International Conference on Software Engineering (ICSE\u201922)","author":"Chai Yitian","year":"2022","unstructured":"Yitian Chai, Hongyu Zhang, Beijun Shen, and Xiaodong Gu. 2022. 
Cross-domain deep code search with meta learning. In Proceedings of the 44th International Conference on Software Engineering (ICSE\u201922). Association for Computing Machinery, New York, NY, 487\u2013498. DOI:10.1145\/3510003.3510125"},{"key":"e_1_3_2_18_2","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1007\/978-3-642-00593-0_26","volume-title":"International Conference on Fundamental Approaches to Software Engineering","author":"Chatterjee Shaunak","year":"2009","unstructured":"Shaunak Chatterjee, Sudeep Juvekar, and Koushik Sen. 2009. Sniff: A search engine for java using free-form queries. In International Conference on Fundamental Approaches to Software Engineering. Springer, 385\u2013400."},{"key":"e_1_3_2_19_2","first-page":"364","volume-title":"Proceedings of the IEEE International Conference on Software Maintenance (ICSM\u201901)","author":"Chen Annie","year":"2001","unstructured":"Annie Chen, Eric Chou, Joshua Wong, Andrew Y. Yao, Qing Zhang, Shao Zhang, and Amir Michail. 2001. CVSSearch: Searching through source code using CVS comments. In Proceedings of the IEEE International Conference on Software Maintenance (ICSM\u201901). IEEE, 364\u2013373."},{"key":"e_1_3_2_20_2","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique Ponde de Oliveira Pinto Jared Kaplan Harrison Edwards Yuri Burda Nicholas Joseph Greg Brockman Alex Ray Raul Puri Gretchen Krueger Michael Petrov Heidy Khlaaf Girish Sastry Pamela Mishkin Brooke Chan Scott Gray Nick Ryder Mikhail Pavlov Alethea Power Lukasz Kaiser Mohammad Bavarian Clemens Winter Philippe Tillet Felipe Petroski Such Dave Cummings Matthias Plappert Fotios Chantzis Elizabeth Barnes Ariel Herbert-Voss William Hebgen Guss Alex Nichol Alex Paino Nikolas Tezak Jie Tang Igor Babuschkin Suchir Balaji Shantanu Jain William Saunders Christopher Hesse Andrew N. 
Carr Jan Leike Joshua Achiam Vedant Misra Evan Morikawa Alec Radford Matthew Knight Miles Brundage Mira Murati Katie Mayer Peter Welinder Bob McGrew Dario Amodei Sam McCandlish Ilya Sutskever and Wojciech Zaremba. 2021. Evaluating large language models trained on code. CoRR abs\/2107.03374 (2021). arXiv:2107.03374. Retrieved from https:\/\/arxiv.org\/abs\/2107.03374."},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","first-page":"826","DOI":"10.1145\/3238147.3240471","volume-title":"Proceedings of the 33rd ACM\/IEEE International Conference on Automated Software Engineering (ASE\u201918)","author":"Chen Qingying","year":"2018","unstructured":"Qingying Chen and Minghui Zhou. 2018. A neural framework for retrieval and summarization of source code. In Proceedings of the 33rd ACM\/IEEE International Conference on Automated Software Engineering (ASE\u201918). Association for Computing Machinery, New York, NY, 826\u2013831. DOI:10.1145\/3238147.3240471"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/2666356.2594343"},{"key":"e_1_3_2_23_2","first-page":"21","volume-title":"Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering (RAISE\u201918)","author":"Diamantopoulos Themistoklis","year":"2018","unstructured":"Themistoklis Diamantopoulos, Georgios Karagiannopoulos, and Andreas L. Symeonidis. 2018. Codecatch: Extracting source code snippets from online sources. In Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering (RAISE\u201918). Association for Computing Machinery, New York, NY, 21\u201327. 
DOI:10.1145\/3194104.3194107"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1002\/smr.567"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482127"},{"key":"e_1_3_2_26_2","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1145\/1363686.1363952","volume-title":"Proceedings of the ACM Symposium on Applied Computing (SAC\u201908)","author":"Dur\u00e3o Frederico A.","year":"2008","unstructured":"Frederico A. Dur\u00e3o, Taciana A. Vanderlei, Eduardo S. Almeida, and Silvio R. de L. Meira. 2008. Applying a semantic layer in a source code search tool. In Proceedings of the ACM Symposium on Applied Computing (SAC\u201908). Association for Computing Machinery, New York, NY, 1151\u20131157. DOI:10.1145\/1363686.1363952"},{"key":"e_1_3_2_27_2","unstructured":"ECMA. 2011. Standard ECMA-262 ECMAScript Language Specification 5.1 Edition."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.139"},{"key":"e_1_3_2_29_2","series-title":"Proceedings of the 34th International Conference on Machine Learning","first-page":"1126","volume":"70","author":"Finn Chelsea","year":"2017","unstructured":"Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-agnostic meta-learning for fast adaptation of deep networks. In Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 70), Doina Precup and Yee Whye Teh (Eds.). PMLR, 1126\u20131135."},{"key":"e_1_3_2_30_2","first-page":"1","volume-title":"Proceedings of the IEEE 13th International Workshop on Software Clones (IWSC\u201919)","author":"Fujiwara Y.","year":"2019","unstructured":"Y. Fujiwara, N. Yoshida, E. Choi, and K. Inoue. 2019. Code-to-code search based on deep neural network and code mutation. In Proceedings of the IEEE 13th International Workshop on Software Clones (IWSC\u201919). 
1\u20137."},{"key":"e_1_3_2_31_2","article-title":"Software reuse and software component technology [J]","volume":"27","author":"Fuqing Yang","year":"1999","unstructured":"Yang Fuqing, Mei Hong, and Li Keqin. 1999. Software reuse and software component technology [J]. Acta Electr. Sin. 27 (1999), 68\u201375.","journal-title":"Acta Electr. Sin."},{"key":"e_1_3_2_32_2","first-page":"61","volume-title":"Proceedings of the 13th Asia Pacific Software Engineering Conference (APSEC\u201906)","author":"Garcia V. C.","year":"2006","unstructured":"V. C. Garcia, E. S. de Almeida, L. B. Lisboa, A. C. Martins, S. R. L. Meira, D. Lucredio, and R. P. d. M. Fortes. 2006. Toward a code search engine based on the state-of-art and practice. In Proceedings of the 13th Asia Pacific Software Engineering Conference (APSEC\u201906). 61\u201370."},{"key":"e_1_3_2_33_2","unstructured":"GitHub. 2021. The 2020 State of the Octoverse. Retrieved from https:\/\/octoverse.github.com\/."},{"key":"e_1_3_2_34_2","volume-title":"The Java Language Specification","author":"Gosling James","year":"1996","unstructured":"James Gosling, William N. Joy, and Guy L. Steele Jr. 1996. The Java Language Specification. Addison-Wesley."},{"key":"e_1_3_2_35_2","doi-asserted-by":"crossref","unstructured":"Luca Di Grazia, Paul Bredl, and Michael Pradel. 2022. DiffSearch: A scalable and precise search engine for code changes. arXiv:2204.02787. Retrieved from https:\/\/arxiv.org\/abs\/2204.02787.","DOI":"10.1109\/TSE.2022.3218859"},{"key":"e_1_3_2_36_2","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1145\/1806799.1806868","volume-title":"Proceedings of the 32nd ACM\/IEEE International Conference on Software Engineering, Volume 1 (ICSE\u201910)","author":"Grechanik Mark","year":"2010","unstructured":"Mark Grechanik, Chen Fu, Qing Xie, Collin McMillan, Denys Poshyvanyk, and Chad M. Cumby. 2010. A search engine for finding highly relevant applications. 
In Proceedings of the 32nd ACM\/IEEE International Conference on Software Engineering, Volume 1 (ICSE\u201910), Jeff Kramer, Judith Bishop, Premkumar T. Devanbu, and Sebasti\u00e1n Uchitel (Eds.). ACM, 475\u2013484. DOI:10.1145\/1806799.1806868"},{"key":"e_1_3_2_37_2","first-page":"483","volume-title":"Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201921)","author":"Gu Jian","year":"2021","unstructured":"Jian Gu, Zimin Chen, and Martin Monperrus. 2021. Multimodal representation for neural code search. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201921). IEEE, 483\u2013494. DOI:10.1109\/ICSME52107.2021.00049"},{"key":"e_1_3_2_38_2","volume-title":"Proceedings of the ACM\/IEEE International Conference on Software Engineering (ICSE\u201918)","author":"Gu Xiaodong","year":"2018","unstructured":"Xiaodong Gu, Hongyu Zhang, and Sunghun Kim. 2018. Deep code search. In Proceedings of the ACM\/IEEE International Conference on Software Engineering (ICSE\u201918)."},{"key":"e_1_3_2_39_2","volume-title":"Proceedings of the 9th International Conference on Learning Representations (ICLR\u201921)","author":"Guo Daya","year":"2021","unstructured":"Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin B. Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, and Ming Zhou. 2021. GraphCodeBERT: Pre-training code representations with data flow. 
In Proceedings of the 9th International Conference on Learning Representations (ICLR\u201921)."},{"key":"e_1_3_2_40_2","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1145\/2814270.2814295","volume-title":"Proceedings of the ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA\u201915)","author":"Gvero Tihomir","year":"2015","unstructured":"Tihomir Gvero and Viktor Kuncak. 2015. Synthesizing Java expressions from free-form queries. In Proceedings of the ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA\u201915), Jonathan Aldrich and Patrick Eugster (Eds.). ACM, 416\u2013432. DOI:10.1145\/2814270.2814295"},{"key":"e_1_3_2_41_2","series-title":"Proceedings of the 20th European Conference on Object-Oriented Programming (ECOOP\u201906)","first-page":"2","volume":"4067","author":"Hajiyev Elnar","year":"2006","unstructured":"Elnar Hajiyev, Mathieu Verbaere, and Oege de Moor. 2006. codeQuest: Scalable source code queries with Datalog. In Proceedings of the 20th European Conference on Object-Oriented Programming (ECOOP\u201906), Lecture Notes in Computer Science, Vol. 4067, Dave Thomas (Ed.). Springer, 2\u201327. DOI:10.1007\/11785477_2"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE.2009.5070524"},{"key":"e_1_3_2_43_2","first-page":"524","volume-title":"Proceedings of the 26th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201911)","author":"Hill Emily","year":"2011","unstructured":"Emily Hill, Lori Pollock, and K. Vijay-Shanker. 2011. Improving source code search with natural language phrasal representations of method signatures. In Proceedings of the 26th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201911). 
IEEE, 524\u2013527."},{"key":"e_1_3_2_44_2","first-page":"117","volume-title":"Proceedings of the 27th International Conference on Software Engineering (ICSE\u201905)","author":"Holmes Reid","year":"2005","unstructured":"Reid Holmes and Gail C. Murphy. 2005. Using structural context to recommend source code examples. In Proceedings of the 27th International Conference on Software Engineering (ICSE\u201905). Association for Computing Machinery, New York, NY, 117\u2013125. DOI:10.1145\/1062455.1062491"},{"key":"e_1_3_2_45_2","first-page":"5690","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, (ACL\/IJCNLP\u201921), Volume 1: Long Papers","author":"Huang Junjie","year":"2021","unstructured":"Junjie Huang, Duyu Tang, Linjun Shou, Ming Gong, Ke Xu, Daxin Jiang, Ming Zhou, and Nan Duan. 2021. CoSQA: 20,000+ web queries for code search and question answering. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, (ACL\/IJCNLP\u201921), Volume 1: Long Papers, Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (Eds.). Association for Computational Linguistics, 5690\u20135700. DOI:10.18653\/v1\/2021.acl-long.442"},{"key":"e_1_3_2_46_2","unstructured":"Hamel Husain Ho-Hsiang Wu Tiferet Gazit Miltiadis Allamanis and Marc Brockschmidt. 2019. CodeSearchNet challenge: Evaluating the state of semantic code search. arxiv:1909.09436. Retrieved from http:\/\/arxiv.org\/abs\/1909.09436."},{"key":"e_1_3_2_47_2","unstructured":"Hamel Husain and Ho-Hsiang Wu. 2018. How to create natural language semantic search for arbitrary objects with deep learning. Retrieved November 5 (2018) 2019."},{"key":"e_1_3_2_48_2","unstructured":"Katsuro Inoue Yuya Miyamoto Daniel M. Germ\u00e1n and Takashi Ishio. 2020. 
Code clone matching: A practical and effective approach to find code snippets. arXiv:2003.05615. Retrieved from https:\/\/arxiv.org\/abs\/2003.05615."},{"key":"e_1_3_2_49_2","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1145\/643603.643622","volume-title":"Proceedings of the 2nd International Conference on Aspect-Oriented Software Development (AOSD\u201903)","author":"Janzen Doug","year":"2003","unstructured":"Doug Janzen and Kris De Volder. 2003. Navigating and querying code without getting lost. In Proceedings of the 2nd International Conference on Aspect-Oriented Software Development (AOSD\u201903), William G. Griswold and Mehmet Aksit (Eds.). ACM, 178\u2013187. DOI:10.1145\/643603.643622"},{"key":"e_1_3_2_50_2","first-page":"92","volume-title":"Proceedings of the 18th IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM\u201918)","author":"Jiang Renhe","year":"2018","unstructured":"Renhe Jiang, Zhengzhao Chen, Zejun Zhang, Yu Pei, Minxue Pan, and Tian Zhang. 2018. [Research Paper] semantics-based code search using input\/output examples. In Proceedings of the 18th IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM\u201918). IEEE Computer Society, 92\u2013102. DOI:10.1109\/SCAM.2018.00018"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1002\/smr.344"},{"key":"e_1_3_2_52_2","first-page":"295","volume-title":"Proceedings of the 30th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201915)","author":"Ke Yalin","year":"2015","unstructured":"Yalin Ke, Kathryn T. Stolee, Claire Le Goues, and Yuriy Brun. 2015. Repairing programs with semantic code search (t). In Proceedings of the 30th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201915). IEEE, 295\u2013306."},{"key":"e_1_3_2_53_2","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. 
BERT: Pre-training of deep bidirectional transformers for language understanding. 4171\u20134186."},{"key":"e_1_3_2_54_2","unstructured":"Muhammad Khalifa. 2019. Semantic source code search: A study of the past and a glimpse at the future. arXiv:1908.06738. Retrieved from http:\/\/arxiv.org\/abs\/1908.06738."},{"key":"e_1_3_2_55_2","first-page":"329","volume-title":"Proceedings of the 10th Working Conference on Mining Software Repositories (MSR\u201913)","author":"Khoo W. M.","year":"2013","unstructured":"W. M. Khoo, A. Mycroft, and R. Anderson. 2013. Rendezvous: A search engine for binary code. In Proceedings of the 10th Working Conference on Mining Software Repositories (MSR\u201913). 329\u2013338."},{"key":"e_1_3_2_56_2","first-page":"946","volume-title":"Proceedings of the 40th International Conference on Software Engineering","author":"Kim Kisub","year":"2018","unstructured":"Kisub Kim, Dongsun Kim, Tegawend\u00e9 F. Bissyand\u00e9, Eunjong Choi, Li Li, Jacques Klein, and Yves Le Traon. 2018. FaCoY: A code-to-code search engine. In Proceedings of the 40th International Conference on Software Engineering. 946\u2013957."},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2006.116"},{"key":"e_1_3_2_58_2","unstructured":"Julia Lawall, Derek Palinski, Lukas Gnirke, and Gilles Muller. 2017. Fast and precise retrieval of forward and back porting information for Linux device drivers. In Proceedings of the 2017 USENIX Annual Technical Conference (USENIX ATC\u201917). 15\u201326."},{"key":"e_1_3_2_59_2","first-page":"601","volume-title":"Proceedings of the USENIX Annual Technical Conference (USENIX ATC\u201918)","author":"Lawall Julia","year":"2018","unstructured":"Julia Lawall and Gilles Muller. 2018. Coccinelle: 10 years of automated evolution in the Linux kernel. In Proceedings of the USENIX Annual Technical Conference (USENIX ATC\u201918), Haryadi S. Gunawi and Benjamin Reed (Eds.). 
USENIX Association, 601\u2013614."},{"issue":"2","key":"e_1_3_2_60_2","doi-asserted-by":"crossref","first-page":"265","DOI":"10.7551\/mitpress\/7287.003.0018","article-title":"Combining local context and WordNet similarity for word sense identification","volume":"49","author":"Leacock Claudia","year":"1998","unstructured":"Claudia Leacock and Martin Chodorow. 1998. Combining local context and WordNet similarity for word sense identification. WordNet Electr. Lexic. Datab. 49, 2 (1998), 265\u2013283.","journal-title":"WordNet Electr. Lexic. Datab."},{"key":"e_1_3_2_61_2","unstructured":"Kenton Lee, Ming-Wei Chang, and Kristina Toutanova. 2019. Latent retrieval for weakly supervised open domain question answering. In Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL\u201919), Volume 1: Long Papers, Anna Korhonen, David R. Traum, and Llu\u00eds M\u00e0rquez (Eds.). Association for Computational Linguistics, Florence, Italy, July 28-August 2, 2019, 6086\u20136096. DOI:10.18653\/v1\/p19-1612"},{"key":"e_1_3_2_62_2","first-page":"1336","volume-title":"Proceedings of the IEEE 27th International Conference on Data Engineering","author":"Lee M.","year":"2011","unstructured":"M. Lee, S. Hwang, and S. Kim. 2011. Integrating code search into the development session. In Proceedings of the IEEE 27th International Conference on Data Engineering. 1336\u20131339."},{"key":"e_1_3_2_63_2","first-page":"167","volume-title":"Proceedings of the 18th ACM SIGSOFT International Symposium on Foundations of Software Engineering (FSE\u201910)","author":"Lee Mu-Woong","year":"2010","unstructured":"Mu-Woong Lee, Jong-Won Roh, Seung-won Hwang, and Sunghun Kim. 2010. Instant code clone search. In Proceedings of the 18th ACM SIGSOFT International Symposium on Foundations of Software Engineering (FSE\u201910). Association for Computing Machinery, New York, NY, 167\u2013176. 
DOI:10.1145\/1882291.1882317"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.infsof.2010.11.009"},{"key":"e_1_3_2_65_2","first-page":"525","volume-title":"Proceedings of the 22nd IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201907)","author":"Lemos Ot\u00e1vio Augusto Lazzarini","year":"2007","unstructured":"Ot\u00e1vio Augusto Lazzarini Lemos, Sushil Krishna Bajracharya, Joel Ossher, Ricardo Santos Morla, Paulo Cesar Masiero, Pierre Baldi, and Cristina Videira Lopes. 2007. CodeGenie: Using test-cases to search and reuse source code. In Proceedings of the 22nd IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201907), R. E. Kurt Stirewalt, Alexander Egyed, and Bernd Fischer (Eds.). ACM, 525\u2013526. DOI:10.1145\/1321631.1321726"},{"key":"e_1_3_2_66_2","first-page":"458","volume-title":"Proceedings of the 26th Asia-Pacific Software Engineering Conference (APSEC\u201919)","author":"Li Wei","year":"2019","unstructured":"Wei Li, Shuhan Yan, Beijun Shen, and Yuting Chen. 2019. Reinforcement learning of code search sessions. In Proceedings of the 26th Asia-Pacific Software Engineering Conference (APSEC\u201919). IEEE, 458\u2013465. DOI:10.1109\/APSEC48747.2019.00068"},{"key":"e_1_3_2_67_2","first-page":"690","volume-title":"Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering (FSE\u201916)","author":"Li Xuan","year":"2016","unstructured":"Xuan Li, Zerui Wang, Qianxiang Wang, Shoumeng Yan, Tao Xie, and Hong Mei. 2016. Relationship-aware code search for JavaScript frameworks. In Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering (FSE\u201916). Association for Computing Machinery, New York, NY, 690\u2013701. 
DOI:10.1145\/2950290.2950341"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1145\/3447571"},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-008-0118-x"},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1145\/3480027"},{"key":"e_1_3_2_71_2","first-page":"29","volume-title":"Proceedings of the 3rd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages (MAPL\u201919)","author":"Liu Jason","year":"2019","unstructured":"Jason Liu, Seohyun Kim, Vijayaraghavan Murali, Swarat Chaudhuri, and Satish Chandra. 2019. Neural query expansion for code search. In Proceedings of the 3rd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages (MAPL\u201919). Association for Computing Machinery, New York, NY, 29\u201337. DOI:10.1145\/3315508.3329975"},{"key":"e_1_3_2_72_2","first-page":"300","volume-title":"Proceedings of the 25th International Conference on Software Analysis, Evolution and Reengineering (SANER\u201918)","author":"Liu Wenjian","year":"2018","unstructured":"Wenjian Liu, Xin Peng, Zhenchang Xing, Junyi Li, Bing Xie, and Wenyun Zhao. 2018. Supporting exploratory code search with differencing and visualization. In Proceedings of the 25th International Conference on Software Analysis, Evolution and Reengineering (SANER\u201918), Rocco Oliveto, Massimiliano Di Penta, and David C. Shepherd (Eds.). IEEE Computer Society, 300\u2013310. DOI:10.1109\/SANER.2018.8330218"},{"key":"e_1_3_2_73_2","doi-asserted-by":"crossref","first-page":"75660","DOI":"10.1109\/ACCESS.2018.2883963","article-title":"Interactive query reformulation for source-code search with word relations","volume":"6","author":"Lu J.","year":"2018","unstructured":"J. Lu, Y. Wei, X. Sun, B. Li, W. Wen, and C. Zhou. 2018. Interactive query reformulation for source-code search with word relations. 
IEEE Access 6 (2018), 75660\u201375668.","journal-title":"IEEE Access"},{"key":"e_1_3_2_74_2","first-page":"545","volume-title":"Proceedings of the IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER\u201915)","author":"Lu Meili","year":"2015","unstructured":"Meili Lu, Xiaobing Sun, Shaowei Wang, David Lo, and Yucong Duan. 2015. Query expansion via wordnet for effective code search. In Proceedings of the IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER\u201915). IEEE, 545\u2013549."},{"key":"e_1_3_2_75_2","article-title":"CodeXGLUE: A machine learning benchmark dataset for code understanding and generation","volume":"2102","author":"Lu Shuai","year":"2021","unstructured":"Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin B. Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, and Shujie Liu. 2021. CodeXGLUE: A machine learning benchmark dataset for code understanding and generation. CoRR abs\/2102.04664.","journal-title":"CoRR"},{"key":"e_1_3_2_76_2","first-page":"152","article-title":"Aroma: Code recommendation via structural code search","volume":"3","author":"Luan Sifei","year":"2019","unstructured":"Sifei Luan, Di Yang, Celeste Barnaby, Koushik Sen, and Satish Chandra. 2019. Aroma: Code recommendation via structural code search. Proc. ACM Program. Lang. 3 (2019), 152.","journal-title":"Proc. ACM Program. Lang."},{"key":"e_1_3_2_77_2","first-page":"260","volume-title":"Proceedings of the 30th IEEE\/ACM International Conference on Automated Software Engineering","author":"Lv Fei","year":"2015","unstructured":"Fei Lv, Hongyu Zhang, Jian guang Lou, Shaowei Wang, Dongmei Zhang, and Jianjun Zhao. 2015. CodeHow: Effective code search based on API understanding and extended boolean model (E). 
In Proceedings of the 30th IEEE\/ACM International Conference on Automated Software Engineering. 260\u2013270."},{"key":"e_1_3_2_78_2","doi-asserted-by":"crossref","first-page":"774","DOI":"10.1145\/3106237.3106293","volume-title":"Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering (ESEC\/FSE\u201917)","author":"Martie Lee","year":"2017","unstructured":"Lee Martie, Andr\u00e9 van der Hoek, and Thomas Kwak. 2017. Understanding the impact of support for iteration on code search. In Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering (ESEC\/FSE\u201917). Association for Computing Machinery, New York, NY, 774\u2013785. DOI:10.1145\/3106237.3106293"},{"key":"e_1_3_2_79_2","first-page":"24","volume-title":"Proceedings of the 30th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201915)","author":"Martie Lee","year":"2015","unstructured":"Lee Martie, Thomas D. LaToza, and Andre van der Hoek. 2015. Codeexchange: Supporting reformulation of internet-scale code queries in context (t). In Proceedings of the 30th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201915). IEEE, 24\u201335."},{"key":"e_1_3_2_80_2","first-page":"76","volume-title":"Proceedings of the IEEE\/ACM 12th Working Conference on Mining Software Repositories","author":"Martie L.","year":"2015","unstructured":"L. Martie and A. v. d. Hoek. 2015. Sameness: An experiment in code search. In Proceedings of the IEEE\/ACM 12th Working Conference on Mining Software Repositories. 76\u201387."},{"key":"e_1_3_2_81_2","first-page":"210","volume-title":"Proceedings of the 42nd International Conference on Software Engineering (ICSE\u201920)","author":"Mathew George","year":"2020","unstructured":"George Mathew, Chris Parnin, and Kathryn T. Stolee. 2020. SLACC: Simion-based language agnostic code clones. 
In Proceedings of the 42nd International Conference on Software Engineering (ICSE\u201920), Gregg Rothermel and Doo-Hwan Bae (Eds.). ACM, 210\u2013221. DOI:10.1145\/3377811.3380407"},{"key":"e_1_3_2_82_2","first-page":"205","volume-title":"Proceedings of the 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC\/FSE\u201921)","author":"Mathew George","year":"2021","unstructured":"George Mathew and Kathryn T. Stolee. 2021. Cross-language code search using static and dynamic analyses. In Proceedings of the 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC\/FSE\u201921), Diomidis Spinellis, Georgios Gousios, Marsha Chechik, and Massimiliano Di Penta (Eds.). ACM, 205\u2013217. DOI:10.1145\/3468264.3468538"},{"issue":"5","key":"e_1_3_2_83_2","doi-asserted-by":"crossref","first-page":"1069","DOI":"10.1109\/TSE.2011.84","article-title":"Exemplar: A source code search engine for finding highly relevant applications","volume":"38","author":"McMillan C.","year":"2012","unstructured":"C. McMillan, M. Grechanik, D. Poshyvanyk, C. Fu, and Q. Xie. 2012. Exemplar: A source code search engine for finding highly relevant applications. IEEE Trans. Softw. Eng. 38, 5 (2012), 1069\u20131087.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_84_2","doi-asserted-by":"publisher","DOI":"10.1145\/2522920.2522930"},{"key":"e_1_3_2_85_2","first-page":"327","volume-title":"Proceedings of the 24th International Conference on Software Engineering (ICSE\u201902)","author":"Michail Amir","year":"2002","unstructured":"Amir Michail. 2002. Browsing and searching source code of applications written using a GUI framework. In Proceedings of the 24th International Conference on Software Engineering (ICSE\u201902). Association for Computing Machinery, New York, NY, 327\u2013337. 
DOI:10.1145\/581339.581381"},{"key":"e_1_3_2_86_2","first-page":"3111","volume-title":"Advances in Neural Information Processing Systems","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems. 3111\u20133119."},{"key":"e_1_3_2_87_2","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1145\/2384616.2384689","volume-title":"Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications (OOPSLA\u201912)","author":"Mishne Alon","year":"2012","unstructured":"Alon Mishne, Sharon Shoham, and Eran Yahav. 2012. Typestate-based semantic code search over partial programs. In Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications (OOPSLA\u201912). Association for Computing Machinery, New York, NY, 997\u20131016. DOI:10.1145\/2384616.2384689"},{"key":"e_1_3_2_88_2","doi-asserted-by":"crossref","DOI":"10.1561\/9781680835335","volume-title":"An Introduction to Neural Information Retrieval","author":"Mitra Bhaskar","year":"2018","unstructured":"Bhaskar Mitra, Nick Craswell, et\u00a0al. 2018. An Introduction to Neural Information Retrieval. Now Foundations and Trends."},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.14778\/3401960.3401972"},{"key":"e_1_3_2_90_2","doi-asserted-by":"publisher","DOI":"10.1145\/2950290.2950333"},{"key":"e_1_3_2_91_2","first-page":"183","volume-title":"Proceedings of the IEEE\/ACM 39th International Conference on Software Engineering Companion (ICSE-C\u201917)","author":"Nguyen T. V.","year":"2017","unstructured":"T. V. Nguyen, A. T. Nguyen, H. D. Phan, T. D. Nguyen, and T. N. Nguyen. 2017. Combining Word2Vec with revised vector space model for better code retrieval. 
In Proceedings of the IEEE\/ACM 39th International Conference on Software Engineering Companion (ICSE-C\u201917). 183\u2013185."},{"issue":"5","key":"e_1_3_2_92_2","doi-asserted-by":"crossref","first-page":"771","DOI":"10.1109\/TSC.2016.2560165","article-title":"Query expansion based on crowd knowledge for code search","volume":"9","author":"Nie L.","year":"2016","unstructured":"L. Nie, H. Jiang, Z. Ren, Z. Sun, and X. Li. 2016. Query expansion based on crowd knowledge for code search. IEEE Trans. Serv. Comput. 9, 5 (2016), 771\u2013783.","journal-title":"IEEE Trans. Serv. Comput."},{"key":"e_1_3_2_93_2","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1109\/MSR.2009.5069501","volume-title":"Proceedings of the 6th International Working Conference on Mining Software Repositories (MSR\u201909)","author":"Ossher Joel","year":"2009","unstructured":"Joel Ossher, Sushil Krishna Bajracharya, Erik Linstead, Pierre Baldi, and Cristina Videira Lopes. 2009. SourcererDB: An aggregated repository of statically analyzed and cross-linked open source Java projects. In Proceedings of the 6th International Working Conference on Mining Software Repositories (MSR\u201909), Michael W. Godfrey and Jim Whitehead (Eds.). IEEE Computer Society, 183\u2013186. DOI:10.1109\/MSR.2009.5069501"},{"key":"e_1_3_2_94_2","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1145\/1985429.1985438","volume-title":"Proceedings of the 3rd International Workshop on Search-Driven Development: Users, Infrastructure, Tools, and Evaluation","author":"Panchenko Oleksandr","year":"2011","unstructured":"Oleksandr Panchenko, Hasso Plattner, and Alexander Zeier. 2011. What do developers search for in source code and why. In Proceedings of the 3rd International Workshop on Search-Driven Development: Users, Infrastructure, Tools, and Evaluation. 
33\u201336."},{"key":"e_1_3_2_95_2","first-page":"329","volume-title":"Proceedings of the Conference of the Centre for Advanced Studies on Collaborative Research, Volume 1 (CASCON\u201992)","author":"Paul Santanu","year":"1992","unstructured":"Santanu Paul. 1992. SCRUPLE: A reengineer\u2019s tool for source code search. In Proceedings of the Conference of the Centre for Advanced Studies on Collaborative Research, Volume 1 (CASCON\u201992). IBM Press, 329\u2013346."},{"issue":"6","key":"e_1_3_2_96_2","doi-asserted-by":"crossref","first-page":"463","DOI":"10.1109\/32.295894","article-title":"A framework for source code search using program patterns","volume":"20","author":"Paul Santanu","year":"1994","unstructured":"Santanu Paul and Atul Prakash. 1994. A framework for source code search using program patterns. IEEE Trans. Softw. Eng. 20, 6 (1994), 463\u2013475.","journal-title":"IEEE Trans. Softw. Eng."},{"key":"e_1_3_2_97_2","first-page":"1532","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201914)","author":"Pennington Jeffrey","year":"2014","unstructured":"Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201914). 1532\u20131543."},{"key":"e_1_3_2_98_2","doi-asserted-by":"publisher","DOI":"10.1145\/152388.152392"},{"key":"e_1_3_2_99_2","first-page":"283","volume-title":"Proceedings of the 31st International Conference on Software Engineering\u2013Companion Volume","author":"Poshyvanyk D.","year":"2009","unstructured":"D. Poshyvanyk and M. Grechanik. 2009. Creating and evolving software by searching, selecting and synthesizing relevant source code. In Proceedings of the 31st International Conference on Software Engineering\u2013Companion Volume. 
283\u2013286."},{"key":"e_1_3_2_100_2","doi-asserted-by":"publisher","DOI":"10.1145\/3460348"},{"key":"e_1_3_2_101_2","volume-title":"ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI\u201920)","author":"Premtoon Varot","year":"2020","unstructured":"Varot Premtoon, James Koppel, and Armando Solar-Lezama. 2020. Semantic code search via equational reasoning. In ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI\u201920)."},{"key":"e_1_3_2_102_2","doi-asserted-by":"publisher","DOI":"10.1145\/2884781.2884808"},{"key":"e_1_3_2_103_2","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1145\/3196398.3196425","volume-title":"Proceedings of the 15th International Conference on Mining Software Repositories (MSR\u201918)","author":"Rahman Md. Masudur","year":"2018","unstructured":"Md. Masudur Rahman, Jed Barson, Sydney Paul, Joshua Kayan, Federico Andres Lois, Sebastian Fernandez Quezada, Christopher Parnin, Kathryn T. Stolee, and Baishakhi Ray. 2018. Evaluating how developers use general-purpose web-search for code retrieval. In Proceedings of the 15th International Conference on Mining Software Repositories (MSR\u201918), Andy Zaidman, Yasutaka Kamei, and Emily Hill (Eds.). ACM, 465\u2013475. DOI:10.1145\/3196398.3196425"},{"key":"e_1_3_2_104_2","first-page":"473","volume-title":"Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201918)","author":"Rahman M. M.","year":"2018","unstructured":"M. M. Rahman and C. Roy. 2018. Effective reformulation of query for code search using crowdsourced knowledge and extra-large data analytics. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201918). 
473\u2013484."},{"key":"e_1_3_2_105_2","first-page":"575","volume-title":"Proceedings of the 18th IEEE\/ACM International Conference on Mining Software Repositories (MSR\u201921)","author":"Rao Nikitha","year":"2021","unstructured":"Nikitha Rao, Chetan Bansal, and Joe Guan. 2021. Search4Code: Code search intent classification using weak supervision. In Proceedings of the 18th IEEE\/ACM International Conference on Mining Software Repositories (MSR\u201921). IEEE, 575\u2013579. DOI:10.1109\/MSR52588.2021.00077"},{"key":"e_1_3_2_106_2","first-page":"44","volume-title":"Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI\u201914)","author":"Raychev Veselin","year":"2014","unstructured":"Veselin Raychev, Martin T. Vechev, and Eran Yahav. 2014. Code completion with statistical language models. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI\u201914). 44."},{"key":"e_1_3_2_107_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE.2009.5070525"},{"issue":"115","key":"e_1_3_2_108_2","first-page":"64","article-title":"A survey on software clone detection research","volume":"541","author":"Roy Chanchal Kumar","year":"2007","unstructured":"Chanchal Kumar Roy and James R. Cordy. 2007. A survey on software clone detection research. Queen\u2019s Sch. Comput. TR 541, 115 (2007), 64\u201368.","journal-title":"Queen\u2019s Sch. Comput. TR"},{"key":"e_1_3_2_109_2","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1145\/3211346.3211353","volume-title":"Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages","author":"Sachdev Saksham","year":"2018","unstructured":"Saksham Sachdev, Hongyu Li, Sifei Luan, Seohyun Kim, Koushik Sen, and Satish Chandra. 2018. Retrieval on source code: A neural code search. In Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages. 
ACM, 31\u201341."},{"key":"e_1_3_2_110_2","first-page":"191","volume-title":"Proceedings of the 10th Joint Meeting on Foundations of Software Engineering","author":"Sadowski Caitlin","year":"2015","unstructured":"Caitlin Sadowski, Kathryn T. Stolee, and Sebastian Elbaum. 2015. How developers search for code: A case study. In Proceedings of the 10th Joint Meeting on Foundations of Software Engineering. 191\u2013201."},{"issue":"11","key":"e_1_3_2_111_2","doi-asserted-by":"crossref","first-page":"1022","DOI":"10.1145\/182.358466","article-title":"Extended Boolean information retrieval","volume":"26","author":"Salton Gerard","year":"1983","unstructured":"Gerard Salton, Edward A. Fox, and Harry Wu. 1983. Extended Boolean information retrieval. Commun. ACM 26, 11 (1983), 1022\u20131036.","journal-title":"Commun. ACM"},{"key":"e_1_3_2_112_2","article-title":"On the effectiveness of transfer learning for code search","author":"Salza Pasquale","year":"2022","unstructured":"Pasquale Salza, Christoph Schwizer, Jian Gu, and Harald C. Gall. 2022. On the effectiveness of transfer learning for code search. IEEE Trans. Softw. Eng. (2022).","journal-title":"IEEE Trans. Softw. Eng."},{"issue":"4","key":"e_1_3_2_113_2","first-page":"47","article-title":"History of search engines","volume":"15","author":"Seymour Tom","year":"2011","unstructured":"Tom Seymour, Dean Frantsvog, Satheesh Kumar, et\u00a0al. 2011. History of search engines. Int. J. Manage. Inf. Syst. 15, 4 (2011), 47\u201358.","journal-title":"Int. J. Manage. Inf. Syst."},{"key":"e_1_3_2_114_2","volume-title":"Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering (FSE\u201912)","author":"Shepherd David","year":"2012","unstructured":"David Shepherd, Kostadin Damevski, Bartosz Ropski, and Thomas Fritz. 2012. Sando: An extensible local code search framework. In Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering (FSE\u201912). 
Association for Computing Machinery, New York, NY, Article 15, 2 pages. DOI:10.1145\/2393596.2393612"},{"key":"e_1_3_2_115_2","doi-asserted-by":"publisher","DOI":"10.1109\/WPC.1998.693351"},{"key":"e_1_3_2_116_2","doi-asserted-by":"publisher","DOI":"10.1145\/2063239.2063243"},{"key":"e_1_3_2_117_2","first-page":"905","volume-title":"Proceedings of the 28th International Conference on Software Engineering (ICSE\u201906)","author":"Sindhgatta Renuka","year":"2006","unstructured":"Renuka Sindhgatta. 2006. Using an information retrieval system to retrieve source code samples. In Proceedings of the 28th International Conference on Software Engineering (ICSE\u201906). Association for Computing Machinery, New York, NY, 905\u2013908. DOI:10.1145\/1134285.1134448"},{"key":"e_1_3_2_118_2","first-page":"21","volume-title":"Proceedings of the Conference of the Centre for Advanced Studies on Collaborative Research","author":"Singer Janice","year":"1997","unstructured":"Janice Singer, Timothy C. Lethbridge, Norman G. Vinson, and Nicolas Anquetil. 1997. An examination of software engineering work practices. In Proceedings of the Conference of the Centre for Advanced Studies on Collaborative Research, J. Howard Johnson (Ed.). IBM, 21."},{"key":"e_1_3_2_119_2","doi-asserted-by":"crossref","first-page":"945","DOI":"10.1145\/3180155.3182513","volume-title":"Proceedings of the 40th International Conference on Software Engineering (ICSE\u201918)","author":"Sirres Raphael","year":"2018","unstructured":"Raphael Sirres, Tegawend\u00e9 F. Bissyand\u00e9, Dongsun Kim, David Lo, Jacques Klein, Kisub Kim, and Yves Le Traon. 2018. Augmenting and structuring user queries to support efficient free-form code search. In Proceedings of the 40th International Conference on Software Engineering (ICSE\u201918), Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman (Eds.). ACM, 945. 
DOI:10.1145\/3180155.3182513"},{"key":"e_1_3_2_120_2","first-page":"309","volume-title":"Proceedings of the 10th Working Conference on Mining Software Repositories (MSR\u201913)","author":"Sisman B.","year":"2013","unstructured":"B. Sisman and A. C. Kak. 2013. Assisting code search with automatic Query Reformulation for bug localization. In Proceedings of the 10th Working Conference on Mining Software Repositories (MSR\u201913). 309\u2013318."},{"key":"e_1_3_2_121_2","first-page":"292","volume-title":"Proceedings of the 41st International Conference on Software Engineering (ICSE\u201919)","author":"Sivaraman Aishwarya","year":"2019","unstructured":"Aishwarya Sivaraman, Tianyi Zhang, Guy Van den Broeck, and Miryung Kim. 2019. Active inductive logic programming for code search. In Proceedings of the 41st International Conference on Software Engineering (ICSE\u201919), Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (Eds.). IEEE\/ACM, 292\u2013303. DOI:10.1109\/ICSE.2019.00044"},{"key":"e_1_3_2_122_2","doi-asserted-by":"publisher","DOI":"10.1145\/2581377"},{"key":"e_1_3_2_123_2","first-page":"25","volume-title":"Proceedings of the 20th ACM SIGSOFT Symposium on the Foundations of Software Engineering (SIGSOFT\/FSE\u201912)","author":"Stolee Kathryn T.","year":"2012","unstructured":"Kathryn T. Stolee and Sebastian G. Elbaum. 2012. Toward semantic search via SMT solver. In Proceedings of the 20th ACM SIGSOFT Symposium on the Foundations of Software Engineering (SIGSOFT\/FSE\u201912), Will Tracz, Martin P. Robillard, and Tevfik Bultan (Eds.). ACM, 25. DOI:10.1145\/2393596.2393625"},{"key":"e_1_3_2_124_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2015.04.081"},{"key":"e_1_3_2_125_2","first-page":"388","volume-title":"Proceedings of the 44th International Conference on Software Engineering (ICSE\u201922)","author":"Sun Weisong","year":"2022","unstructured":"Weisong Sun, Chunrong Fang, Yuchen Chen, Guanhong Tao, Tingxu Han, and Quanjun Zhang. 2022. 
Code search based on context-aware code translation. In Proceedings of the 44th International Conference on Software Engineering (ICSE\u201922). Association for Computing Machinery, New York, NY, 388\u2013400. DOI:10.1145\/3510003.3510140"},{"key":"e_1_3_2_126_2","first-page":"476","volume-title":"Proceedings of the 30th IEEE International Conference on Software Maintenance and Evolution","author":"Svajlenko Jeffrey","year":"2014","unstructured":"Jeffrey Svajlenko, Judith F. Islam, Iman Keivanloo, Chanchal Kumar Roy, and Mohammad Mamun Mia. 2014. Towards a big data curated benchmark of inter-project code clones. In Proceedings of the 30th IEEE International Conference on Software Maintenance and Evolution. IEEE Computer Society, 476\u2013480. DOI:10.1109\/ICSME.2014.77"},{"key":"e_1_3_2_127_2","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1145\/1985429.1985434","volume-title":"Proceedings of the 3rd International Workshop on Search-Driven Development: Users, Infrastructure, Tools, and Evaluation (SUITE\u201911)","author":"Takuya Watanabe","year":"2011","unstructured":"Watanabe Takuya and Hidehiko Masuhara. 2011. A spontaneous code recommendation tool based on associative search. In Proceedings of the 3rd International Workshop on Search-Driven Development: Users, Infrastructure, Tools, and Evaluation (SUITE\u201911). Association for Computing Machinery, New York, NY, 17\u201320. DOI:10.1145\/1985429.1985434"},{"key":"e_1_3_2_128_2","first-page":"264","volume-title":"Proceedings of the 36th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (POPL\u201909)","author":"Tate Ross","year":"2009","unstructured":"Ross Tate, Michael Stepp, Zachary Tatlock, and Sorin Lerner. 2009. Equality saturation: A new approach to optimization. In Proceedings of the 36th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (POPL\u201909), Zhong Shao and Benjamin C. Pierce (Eds.). ACM, 264\u2013276. 
DOI:10.1145\/1480881.1480915"},{"key":"e_1_3_2_129_2","doi-asserted-by":"publisher","DOI":"10.1145\/3018661.3018691"},{"key":"e_1_3_2_130_2","first-page":"119","volume-title":"Proceedings of the 18th Working Conference on Reverse Engineering","author":"Wang S.","year":"2011","unstructured":"S. Wang, D. Lo, and L. Jiang. 2011. Code search via topic-enriched dependence graph matching. In Proceedings of the 18th Working Conference on Reverse Engineering. 119\u2013123."},{"key":"e_1_3_2_131_2","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1145\/2642937.2642947","volume-title":"Proceedings of the 29th ACM\/IEEE International Conference on Automated Software Engineering (ASE\u201914)","author":"Wang Shaowei","year":"2014","unstructured":"Shaowei Wang, David Lo, and Lingxiao Jiang. 2014. Active code search: Incorporating user feedback to improve code search relevance. In Proceedings of the 29th ACM\/IEEE International Conference on Automated Software Engineering (ASE\u201914). Association for Computing Machinery, New York, NY, 677\u2013682. DOI:10.1145\/2642937.2642947"},{"key":"e_1_3_2_132_2","unstructured":"Wenhua Wang, Yuqun Zhang, Zhengran Zeng, and Guandong Xu. 2020. TranS^3: A transformer-based framework for unifying code summarization and code search. arxiv:2003.03238. Retrieved from https:\/\/arxiv.org\/abs\/2003.03238."},{"key":"e_1_3_2_133_2","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1145\/1858996.1859091","volume-title":"Proceedings of the IEEE\/ACM International Conference on Automated Software Engineering","author":"Wang Xiaoyin","year":"2010","unstructured":"Xiaoyin Wang, David Lo, Jiefeng Cheng, Lu Zhang, Hong Mei, and Jeffrey Xu Yu. 2010. Matching dependence-related queries in the system dependence graph. In Proceedings of the IEEE\/ACM International Conference on Automated Software Engineering. 
457\u2013466."},{"key":"e_1_3_2_134_2","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1145\/2380116.2380145","volume-title":"Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology (UIST\u201912)","author":"Wightman Doug","year":"2012","unstructured":"Doug Wightman, Zi Ye, Joel Brandt, and Roel Vertegaal. 2012. SnipMatch: Using source code context to enhance snippet retrieval and parameterization. In Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology (UIST\u201912). Association for Computing Machinery, New York, NY, 219\u2013228. DOI:10.1145\/2380116.2380145"},{"key":"e_1_3_2_135_2","doi-asserted-by":"crossref","first-page":"56796","DOI":"10.1109\/ACCESS.2019.2913560","article-title":"Code search based on alteration intent","volume":"7","author":"Wu H.","year":"2019","unstructured":"H. Wu and Y. Yang. 2019. Code search based on alteration intent. IEEE Access 7 (2019), 56796\u201356802.","journal-title":"IEEE Access"},{"key":"e_1_3_2_136_2","unstructured":"Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, \u0141ukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, and Jeffrey Dean. 2016. Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. DOI:arxiv.1609.08144. 
Retrieved from https:\/\/arxiv.org\/abs\/1609.08144."},{"key":"e_1_3_2_137_2","doi-asserted-by":"publisher","DOI":"10.1109\/SANER50967.2021.00039"},{"key":"e_1_3_2_138_2","doi-asserted-by":"crossref","first-page":"2309","DOI":"10.1145\/3366423.3380295","volume-title":"Proceedings of the Web Conference 2020","author":"Ye Wei","year":"2020","unstructured":"Wei Ye, Rui Xie, Jinglei Zhang, Tianxiang Hu, Xiaoyin Wang, and Shikun Zhang. 2020. Leveraging code generation to improve code retrieval and summarization via dual learning. In Proceedings of the Web Conference 2020. 2309\u20132319."},{"key":"e_1_3_2_139_2","first-page":"1202","volume-title":"Proceedings of the 34th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201919)","author":"Zhou Shufan","year":"2019","unstructured":"Shufan Zhou, Beijun Shen, and Hao Zhong. 2019. Lancer: Your code tell me what you need. In Proceedings of the 34th IEEE\/ACM International Conference on Automated Software Engineering (ASE\u201919). IEEE, 1202\u20131205. DOI:10.1109\/ASE.2019.00137"},{"key":"e_1_3_2_140_2","first-page":"79","volume-title":"Proceedings of the 25th Asia-Pacific Software Engineering Conference (APSEC\u201918)","author":"Zhou S.","year":"2018","unstructured":"S. Zhou, H. Zhong, and B. Shen. 2018. SLAMPA: Recommending code snippets with statistical language model. In Proceedings of the 25th Asia-Pacific Software Engineering Conference (APSEC\u201918). 
79\u201388."}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3565971","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3565971","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:08:32Z","timestamp":1750183712000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3565971"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,9]]},"references-count":139,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2023,11,30]]}},"alternative-id":["10.1145\/3565971"],"URL":"https:\/\/doi.org\/10.1145\/3565971","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,9]]},"assertion":[{"value":"2022-02-17","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-09-29","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-02-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}