{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,22]],"date-time":"2026-06-22T15:54:02Z","timestamp":1782143642403,"version":"3.54.5"},"reference-count":133,"publisher":"Association for Computing Machinery (ACM)","issue":"3","funder":[{"name":"Australian Government through the Australian Research Council\u2019s Discovery Early Career Researcher Award","award":["DE220101057"],"award-info":[{"award-number":["DE220101057"]}]},{"name":"European Research Council under the European Union\u2019s Horizon 2020 research and innovation program","award":["949014"],"award-info":[{"award-number":["949014"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Softw. Eng. Methodol."],"published-print":{"date-parts":[[2026,3,31]]},"abstract":"<jats:p>Software systems have been evolving rapidly and inevitably introducing bugs at an increasing rate, leading to significant maintenance costs. While large language models (LLMs) have demonstrated remarkable potential in enhancing software development and maintenance practices, particularly in automated program repair (APR), they rely heavily on high-quality code repositories. Most code repositories are proprietary assets that capture the diversity and nuances of real-world industry software practices, which public datasets cannot fully represent. However, obtaining such data from various industries is hindered by data privacy concerns, as companies are reluctant to share their proprietary codebases. There has also been no in-depth investigation of collaborative software development by learning from private and decentralized data while preserving data privacy for program repair.<\/jats:p>\n                  <jats:p>To address the gap, we investigate federated learning as a privacy-preserving method for fine-tuning LLMs on proprietary and decentralized data to boost collaborative software development and maintenance. We use the private industrial dataset TutorCode for fine-tuning and the EvalRepair-Java benchmark for evaluation, and assess whether federated fine-tuning enhances program repair. We then further explore how code heterogeneity (i.e., variations in coding style, complexity, and embedding) and different federated learning algorithms affect bug fixing to provide practical implications for real-world software development collaboration. Our evaluation reveals that federated fine-tuning can significantly enhance program repair, achieving increases of up to 16.67% for Top@10 and 18.44% for Pass@10, even comparable to the bug-fixing capabilities of centralized learning. Moreover, the negligible impact of code heterogeneity implies that industries can effectively collaborate despite diverse data distributions. Different federated algorithms also demonstrate unique strengths across LLMs, suggesting that tailoring the optimization process to specific LLM characteristics can further improve program repair.<\/jats:p>","DOI":"10.1145\/3733599","type":"journal-article","created":{"date-parts":[[2025,5,1]],"date-time":"2025-05-01T04:38:07Z","timestamp":1746074287000},"page":"1-46","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["When Fine-Tuning LLMs Meets Data Privacy: An Empirical Study of Federated Learning in LLM-Based Program Repair"],"prefix":"10.1145","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-4171-2025","authenticated-orcid":false,"given":"Wenqiang","family":"Luo","sequence":"first","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong, Hong Kong, Hong Kong"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3803-9600","authenticated-orcid":false,"given":"Jacky Wai","family":"Keung","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong, Hong Kong, Hong Kong"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9270-730X","authenticated-orcid":false,"given":"Boyang","family":"Yang","sequence":"additional","affiliation":[{"name":"Jisuan Institute of Technology, Beijing JudaoYouda Network Technology Co. Ltd., Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4807-2110","authenticated-orcid":false,"given":"He","family":"Ye","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3931-060X","authenticated-orcid":false,"given":"Claire Le","family":"Goues","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7270-9869","authenticated-orcid":false,"given":"Tegawend\u00e9 F.","family":"Bissyand\u00e9","sequence":"additional","affiliation":[{"name":"SnT, University of Luxembourg, Esch-sur-Alzette, Luxembourg"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8049-3997","authenticated-orcid":false,"given":"Haoye","family":"Tian","sequence":"additional","affiliation":[{"name":"School of Computing and Information Systems, University of Melbourne, Melbourne, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5044-1582","authenticated-orcid":false,"given":"Bach","family":"Le","sequence":"additional","affiliation":[{"name":"School of Computing and Information Systems, The University of Melbourne, Melbourne, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2026,2,13]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3395363.3397386"},{"key":"e_1_3_2_3_2","unstructured":"Josh Achiam Steven Adler Sandhini Agarwal Lama Ahmad Ilge Akkaya Florencia Leoni Aleman Diogo Almeida Janko Altenschmidt Sam Altman Shyamal Anadkat et al. 2023. Gpt-4 technical report. arXiv:2303.08774. Retrieved from https:\/\/arxiv.org\/abs\/2303.08774"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3510003.3510049"},{"issue":"6","key":"e_1_3_2_5_2","doi-asserted-by":"crossref","first-page":"3057","DOI":"10.1007\/s10664-017-9508-2","article-title":"Evaluating code complexity triggers, use of complexity measures and the influence of code complexity on maintenance time","volume":"22","author":"Antinyan Vard","year":"2017","unstructured":"Vard Antinyan, Miroslaw Staron, and Anna Sandberg. 2017. Evaluating code complexity triggers, use of complexity measures and the influence of code complexity on maintenance time. Empirical Software Engineering 22, 6 (2017), 3057\u20133087.","journal-title":"Empirical Software Engineering"},{"key":"e_1_3_2_6_2","unstructured":"Jacob Austin Augustus Odena Maxwell Nye Maarten Bosma Henryk Michalewski David Dohan Ellen Jiang Carrie Cai Michael Terry Quoc Le et al. 2021. Program synthesis with large language models. arXiv:2108.07732. Retrieved from https:\/\/arxiv.org\/abs\/2108.07732"},{"key":"e_1_3_2_7_2","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1145\/3303772.3303813","volume-title":"Proceedings of the 9th International Conference on Learning Analytics & Knowledge","author":"Azcona David","year":"2019","unstructured":"David Azcona, Piyush Arora, I-Han Hsiao, and Alan Smeaton. 2019. user2code2vec: Embeddings for profiling students based on distributional representations of source code. In Proceedings of the 9th International Conference on Learning Analytics & Knowledge, 86\u201395."},{"key":"e_1_3_2_8_2","unstructured":"Jinze Bai Shuai Bai Yunfei Chu Zeyu Cui Kai Dang Xiaodong Deng Yang Fan Wenbin Ge Yu Han Fei Huang et al. 2023. Qwen technical report. arXiv:2309.16609. Retrieved from https:\/\/arxiv.org\/abs\/2309.16609"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i5.25685"},{"key":"e_1_3_2_10_2","unstructured":"CO Boulder. 2013. University of Cambridge study: Failure to adopt reverse debugging costs global economy $41 billion annually. Retrieved from https:\/\/totalview.io\/press-releases\/university-cambridge-study-failure-adopt-reverse-debugging-costs-global-economy-41"},{"key":"e_1_3_2_11_2","article-title":"Reversible Debugging Software","volume":"229","author":"Britton Tom","year":"2013","unstructured":"Tom Britton, Lisa Jeng, Graham Carver, Paul Cheak, and Tomer Katzenellenbogen. 2013. Reversible Debugging Software. Technical Report 229. University of Cambridge Judge Business School, Cambridge, UK.","journal-title":"Technical Report"},{"key":"e_1_3_2_12_2","first-page":"654","volume-title":"Proceedings of the European Conference on Computer Vision","author":"Caldarola Debora","year":"2022","unstructured":"Debora Caldarola, Barbara Caputo, and Marco Ciccone. 2022. Improving generalization in federated learning by seeking flat minima. In Proceedings of the European Conference on Computer Vision. Springer, 654\u2013672."},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.5555\/2831143.2831160"},{"key":"e_1_3_2_14_2","unstructured":"Jialun Cao Meiziniu Li Ming Wen and Shing-Chi Cheung. 2023. A study on prompt design advantages and limitations of ChatGPT for deep learning program repair. arXiv:2304.08191. Retrieved from https:\/\/arxiv.org\/abs\/2304.08191"},{"key":"e_1_3_2_15_2","unstructured":"Tianshi Che Ji Liu Yang Zhou Jiaxiang Ren Jiwen Zhou Victor S. Sheng Huaiyu Dai and Dejing Dou. 2023. Federated learning of large language models with parameter-efficient prompt tuning and adaptive optimization. arXiv:2310.15080. Retrieved from https:\/\/arxiv.org\/abs\/2310.15080"},{"key":"e_1_3_2_16_2","unstructured":"Chaochao Chen Xiaohua Feng Jun Zhou Jianwei Yin and Xiaolin Zheng. 2023. Federated large language model: A position paper. arXiv:2307.08925. Retrieved from https:\/\/arxiv.org\/abs\/2307.08925"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00447"},{"key":"e_1_3_2_18_2","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique Ponde De Oliveira Pinto Jared Kaplan Harri Edwards Yuri Burda Nicholas Joseph Greg Brockman et al. 2021. Evaluating large language models trained on code. arXiv:2107.03374. Retrieved from https:\/\/arxiv.org\/abs\/2107.03374"},{"issue":"3","key":"e_1_3_2_19_2","doi-asserted-by":"crossref","first-page":"494","DOI":"10.1037\/0033-2909.114.3.494","article-title":"Dominance statistics: Ordinal analyses to answer ordinal questions","volume":"114","author":"Cliff Norman","year":"1993","unstructured":"Norman Cliff. 1993. Dominance statistics: Ordinal analyses to answer ordinal questions. Psychological Bulletin 114, 3 (1993), 494\u2013509.","journal-title":"Psychological Bulletin"},{"key":"e_1_3_2_20_2","article-title":"Qlora: Efficient finetuning of quantized LLMs","author":"Dettmers Tim","year":"2024","unstructured":"Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, and Luke Zettlemoyer. 2024. Qlora: Efficient finetuning of quantized LLMs. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 36.","journal-title":"Proceedings of the Advances in Neural Information Processing Systems"},{"issue":"3","key":"e_1_3_2_21_2","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/s10664-022-10118-5","article-title":"Can pre-trained code embeddings improve model performance? Revisiting the use of code embeddings in software engineering tasks","volume":"27","author":"Ding Zishuo","year":"2022","unstructured":"Zishuo Ding, Heng Li, Weiyi Shang, and Tse-Hsun Peter Chen. 2022. Can pre-trained code embeddings improve model performance? Revisiting the use of code embeddings in software engineering tasks. Empirical Software Engineering 27, 3 (2022), 63.","journal-title":"Empirical Software Engineering"},{"key":"e_1_3_2_22_2","doi-asserted-by":"crossref","unstructured":"Zhangyin Feng Daya Guo Duyu Tang Nan Duan Xiaocheng Feng Ming Gong Linjun Shou Bing Qin Ting Liu Daxin Jiang et al. 2020. Codebert: A pre-trained model for programming and natural languages. arXiv:2002.08155. Retrieved from https:\/\/arxiv.org\/abs\/2002.08155","DOI":"10.18653\/v1\/2020.findings-emnlp.139"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/3540250.3549098"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.295"},{"key":"e_1_3_2_25_2","first-page":"512","volume-title":"Proceedings of the 2023 IEEE\/ACM 45th International Conference on Software Engineering (ICSE)","author":"Gill Waris","year":"2023","unstructured":"Waris Gill, Ali Anwar, and Muhammad Ali Gulzar. 2023. Feddebug: Systematic debugging for federated learning applications. In Proceedings of the 2023 IEEE\/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 512\u2013523."},{"issue":"4","key":"e_1_3_2_26_2","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1145\/3296979.3192387","article-title":"Automated clustering and program repair for introductory programming assignments","volume":"53","author":"Gulwani Sumit","year":"2018","unstructured":"Sumit Gulwani, Ivan Radi\u010dek, and Florian Zuleger. 2018. Automated clustering and program repair for introductory programming assignments. ACM SIGPLAN Notices 53, 4 (2018), 465\u2013480.","journal-title":"ACM SIGPLAN Notices"},{"key":"e_1_3_2_27_2","unstructured":"Daya Guo Qihao Zhu Dejian Yang Zhenda Xie Kai Dong Wentao Zhang Guanting Chen Xiao Bi Yu Wu Y. K. Li et al. 2024. DeepSeek-Coder: When the large language model meets programming\u2013The rise of code intelligence. arXiv:2401.14196. Retrieved from https:\/\/arxiv.org\/abs\/2401.14196"},{"key":"e_1_3_2_28_2","first-page":"136","volume-title":"Proceedings of the 2023 IEEE International Conference on Software Maintenance and Evolution (ICSME)","author":"Hao Sichong","year":"2023","unstructured":"Sichong Hao, Xianjun Shi, Hongwei Liu, and Yanjun Shu. 2023. Enhancing code language models for program repair by curricular fine-tuning framework. In Proceedings of the 2023 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 136\u2013146."},{"key":"e_1_3_2_29_2","first-page":"4387","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Hsieh Kevin","year":"2020","unstructured":"Kevin Hsieh, Amar Phanishayee, Onur Mutlu, and Phillip Gibbons. 2020. The non-IID data quagmire of decentralized machine learning. In Proceedings of the International Conference on Machine Learning. PMLR, 4387\u20134398."},{"key":"e_1_3_2_30_2","unstructured":"Edward J. Hu Yelong Shen Phillip Wallis Zeyuan Allen-Zhu Yuanzhi Li Shean Wang Lu Wang and Weizhu Chen. 2021. Lora: Low-rank adaptation of large language models. arXiv:2106.09685. Retrieved from https:\/\/arxiv.org\/abs\/2106.09685"},{"key":"e_1_3_2_31_2","first-page":"388","volume-title":"Proceedings of the 2019 34th IEEE\/ACM International Conference on Automated Software Engineering (ASE)","author":"Hu Yang","year":"2019","unstructured":"Yang Hu, Umair Z. Ahmed, Sergey Mechtaev, Ben Leong, and Abhik Roychoudhury. 2019. Re-factoring based program repair applied to programming assignments. In Proceedings of the 2019 34th IEEE\/ACM International Conference on Automated Software Engineering (ASE). IEEE, 388\u2013398."},{"key":"e_1_3_2_32_2","article-title":"Maximum Likelihood Estimation of Dirichlet Distribution Parameters","volume":"76","author":"Huang Jonathan","year":"2005","unstructured":"Jonathan Huang. 2005. Maximum Likelihood Estimation of Dirichlet Distribution Parameters. CMU Technique Report 76.","journal-title":"CMU Technique Report"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/ASE56229.2023.00181"},{"key":"e_1_3_2_34_2","unstructured":"Albert Q. Jiang Alexandre Sablayrolles Arthur Mensch Chris Bamford Devendra Singh Chaplot Diego de las Casas Florian Bressand Gianna Lengyel Guillaume Lample Lucile Saulnier et al. 2023. Mistral 7B. arXiv:2310.06825. Retrieved from https:\/\/arxiv.org\/abs\/2310.06825"},{"key":"e_1_3_2_35_2","unstructured":"Jingang Jiang Xiangyang Liu and Chenyou Fan. 2023. Low-parameter federated learning with large language models. arXiv:2307.13896. Retrieved from https:\/\/arxiv.org\/abs\/2307.13896"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE48619.2023.00125"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE43902.2021.00107"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/3611643.3613892"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1145\/2610384.2628055"},{"key":"e_1_3_2_40_2","first-page":"5201","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Kairouz Peter","year":"2021","unstructured":"Peter Kairouz, Ziyu Liu, and Thomas Steinke. 2021. The distributed discrete Gaussian mechanism for federated learning with secure aggregation. In Proceedings of the International Conference on Machine Learning. PMLR, 5201\u20135212."},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1561\/2200000083"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.nlp.2023.100048"},{"key":"e_1_3_2_43_2","first-page":"11058","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Kim Jinkyu","year":"2022","unstructured":"Jinkyu Kim, Geeho Kim, and Bohyung Han. 2022. Multi-level branched regularization for federated learning. In Proceedings of the International Conference on Machine Learning. PMLR, 11058\u201311073."},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/2931037.2931051"},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1145\/3387940.3391494"},{"issue":"3","key":"e_1_3_2_46_2","doi-asserted-by":"crossref","first-page":"1980","DOI":"10.1007\/s10664-019-09780-z","article-title":"Fixminer: Mining relevant fix patterns for automated program repair","volume":"25","author":"Koyuncu Anil","year":"2020","unstructured":"Anil Koyuncu, Kui Liu, Tegawend\u00e9 F. Bissyand\u00e9, Dongsun Kim, Jacques Klein, Martin Monperrus, and Yves Le Traon. 2020. Fixminer: Mining relevant fix patterns for automated program repair. Empirical Software Engineering 25, 3 (2020), 1980\u20132024.","journal-title":"Empirical Software Engineering"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/2901739.2901749"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/3637528.3671573"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3661167.3661210"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE.2019.00064"},{"issue":"12","key":"e_1_3_2_51_2","doi-asserted-by":"crossref","first-page":"1236","DOI":"10.1109\/TSE.2015.2454513","article-title":"The ManyBugs and IntroClass benchmarks for automated repair of C programs","volume":"41","author":"Le Goues Claire","year":"2015","unstructured":"Claire Le Goues, Neal Holtschulte, Edward K. Smith, Yuriy Brun, Premkumar Devanbu, Stephanie Forrest, and Westley Weimer. 2015. The ManyBugs and IntroClass benchmarks for automated repair of C programs. IEEE Transactions on Software Engineering 41, 12 (2015), 1236\u20131256.","journal-title":"IEEE Transactions on Software Engineering"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE53745.2022.00077"},{"key":"e_1_3_2_53_2","first-page":"6357","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Li Tian","year":"2021","unstructured":"Tian Li, Shengyuan Hu, Ahmad Beirami, and Virginia Smith. 2021. Ditto: Fair and robust federated learning through personalization. In Proceedings of the International Conference on Machine Learning. PMLR, 6357\u20136368."},{"key":"e_1_3_2_54_2","first-page":"429","article-title":"Ameet Talwalkar, and Virginia Smith. 2020. Federated optimization in heterogeneous networks","volume":"2","author":"Li Tian","year":"2020","unstructured":"Tian Li, Anit Kumar Sahu, Manzil Zaheer, and Maziar Sanjabi, Ameet Talwalkar, and Virginia Smith. 2020. Federated optimization in heterogeneous networks. Proceedings of Machine Learning and Systems 2 (2020), 429\u2013450.","journal-title":"Proceedings of Machine Learning and Systems"},{"key":"e_1_3_2_55_2","unstructured":"Xiaoxiao Li Meirui Jiang Xiaofei Zhang Michael Kamp and Qi Dou. 2021. Fedbn: Federated learning on non-IID features via local batch normalization. arXiv:2102.07623. Retrieved from https:\/\/arxiv.org\/abs\/2102.07623"},{"key":"e_1_3_2_56_2","first-page":"1918","volume-title":"Proceedings of the 44th International Conference on Software Engineering","author":"Li Zhen","year":"2022","unstructured":"Zhen Li, Guenevere Chen, Chen Chen, Yayi Zou, and Shouhuai Xu. 2022. Ropgen: Towards robust code authorship attribution via automatic coding style transformation. In Proceedings of the 44th International Conference on Software Engineering, 1918."},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2024.3376697"},{"key":"e_1_3_2_58_2","first-page":"307","volume-title":"Proceedings of the 2023 38th IEEE\/ACM International Conference on Automated Software Engineering (ASE)","author":"Liang Wentao","year":"2023","unstructured":"Wentao Liang, Xiang Ling, Jingzheng Wu, Tianyue Luo, and Yanjun Wu. 2023. A needle is an outlier in a haystack: Hunting malicious PyPI packages with code clustering. In Proceedings of the 2023 38th IEEE\/ACM International Conference on Automated Software Engineering (ASE). IEEE, 307\u2013318."},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1145\/3505247"},{"key":"e_1_3_2_60_2","unstructured":"Bill Yuchen Lin Chaoyang He Zihang Zeng Hulin Wang Yufen Huang Christophe Dupuy Rahul Gupta Mahdi Soltanolkotabi Xiang Ren and Salman Avestimehr. 2021. FedNLP: Benchmarking federated learning methods for natural language processing tasks. arXiv:2104.08815. Retrieved from https:\/\/arxiv.org\/abs\/2104.08815"},{"key":"e_1_3_2_61_2","article-title":"Is your code generated by ChatGPT really correct? Rigorous evaluation of large language models for code generation","volume":"36","author":"Liu Jiawei","year":"2024","unstructured":"Jiawei Liu, Chunqiu Steven Xia, Yuyao Wang, and Lingming Zhang. 2024. Is your code generated by ChatGPT really correct? Rigorous evaluation of large language models for code generation. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 36.","journal-title":"Proceedings of the Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1145\/3682068"},{"key":"e_1_3_2_63_2","unstructured":"Ilya Loshchilov and Frank Hutter. 2019. Decoupled weight decay regularization. arXiv:1711.05101. Retrieved from https:\/\/arxiv.org\/abs\/1711.05101"},{"key":"e_1_3_2_64_2","first-page":"5972","article-title":"No fear of heterogeneity: Classifier calibration for federated learning with non-IID data","volume":"34","author":"Luo Mi","year":"2021","unstructured":"Mi Luo, Fei Chen, Dapeng Hu, Yifan Zhang, Jian Liang, and Jiashi Feng. 2021. No fear of heterogeneity: Classifier calibration for federated learning with non-IID data. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 34, 5972\u20135984.","journal-title":"Proceedings of the Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_65_2","unstructured":"Wenqiang Luo Jacky Wai Keung Boyang Yang Tegawende F. Bissyande Haoye Tian and Bach Le. 2025. Unlocking LLM repair capabilities in low-resource programming languages through cross-language translation and multi-agent refinement. arXiv:2503.22512. Retrieved from https:\/\/arxiv.org\/abs\/2503.22512"},{"key":"e_1_3_2_66_2","unstructured":"Ziyang Luo Can Xu Pu Zhao Qingfeng Sun Xiubo Geng Wenxiang Hu Chongyang Tao Jing Ma Qingwei Lin and Daxin Jiang. 2023. Wizardcoder: Empowering code large language models with Evol-Instruct. arXiv:2306.08568. Retrieved from https:\/\/arxiv.org\/abs\/2306.08568"},{"key":"e_1_3_2_67_2","doi-asserted-by":"crossref","first-page":"106368","DOI":"10.1016\/j.infsof.2020.106368","article-title":"Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions","volume":"127","author":"Lwakatare Lucy Ellen","year":"2020","unstructured":"Lucy Ellen Lwakatare, Aiswarya Raj, Ivica Crnkovic, Jan Bosch, and Helena Holmstr\u00f6m Olsson. 2020. Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions. Information and Software Technology 127 (2020), 106368.","journal-title":"Information and Software Technology"},{"key":"e_1_3_2_68_2","first-page":"8566","volume-title":"Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing","author":"Ma Xinge","year":"2023","unstructured":"Xinge Ma, Jiangming Liu, Jin Wang, and Xuejie Zhang. 2023. FedID: Federated interactive distillation for large-scale pretraining language models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 8566\u20138577."},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE-SEIP.2019.00039"},{"key":"e_1_3_2_70_2","first-page":"505","volume-title":"Proceedings of the 2021 IEEE\/ACM 18th International Conference on Mining Software Repositories (MSR)","author":"Mashhadi Ehsan","year":"2021","unstructured":"Ehsan Mashhadi and Hadi Hemmati. 2021. Applying CodeBERT for automated program repair of java simple bugs. In Proceedings of the 2021 IEEE\/ACM 18th International Conference on Mining Software Repositories (MSR). IEEE, 505\u2013509."},{"key":"e_1_3_2_71_2","first-page":"1273","volume-title":"Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 54).","author":"McMahan Brendan","year":"2017","unstructured":"Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Aguera y Arcas. 2017. Communication-efficient learning of deep networks from decentralized data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 54). Aarti Singh, and Jerry Zhu (Eds.), PMLR, 1273\u20131282."},{"key":"e_1_3_2_72_2","volume-title":"Proceedings of the KDD Workshop on Text Mining","author":"Michael Steinbach","year":"2000","unstructured":"Steinbach Michael. 2000. A comparison of document clustering techniques. In Proceedings of the KDD Workshop on Text Mining."},{"key":"e_1_3_2_73_2","first-page":"2875","volume-title":"Proceedings of the 2022 IEEE International Conference on Big Data (Big Data)","author":"Nguyen Do-Van","year":"2022","unstructured":"Do-Van Nguyen, Anh-Khoa Tran, and Koji Zettsu. 2022. FedProb: An aggregation method based on feature probability distribution for federated learning on non-IID data. In Proceedings of the 2022 IEEE International Conference on Big Data (Big Data). IEEE, 2875\u20132881."},{"key":"e_1_3_2_74_2","first-page":"2228","volume-title":"Proceedings of the 44th International Conference on Software Engineering","author":"Noller Yannic","unstructured":"Yannic Noller, Ridwan Shariffdeen, Xiang Gao, and Abhik Roychoudhury. 2022. Trust enhancement issues in program repair. In Proceedings of the 44th International Conference on Software Engineering, 2228\u20132240."},{"key":"e_1_3_2_75_2","doi-asserted-by":"crossref","first-page":"440","DOI":"10.1145\/3650212.3652140","volume-title":"Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis","author":"Ouyang Yicheng","year":"2024","unstructured":"Yicheng Ouyang, Jun Yang, and Lingming Zhang. 2024. Benchmarking automated program repair: An extensive study on both real-world and artificial bugs. In Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 440\u2013452."},{"key":"e_1_3_2_76_2","unstructured":"Rishov Paul Md Mohib Hossain Mohammed Latif Siddiq Masum Hasan Anindya Iqbal and Joanna Santos. 2023. Enhancing automated program repair through fine-tuning and prompt engineering. arXiv:2304.07840. Retrieved from https:\/\/arxiv.org\/abs\/2304.07840"},{"key":"e_1_3_2_77_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE43902.2021.00056"},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.1145\/3712187"},{"key":"e_1_3_2_79_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v37i4.25654"},{"key":"e_1_3_2_80_2","first-page":"17716","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Pillutla Krishna","year":"2022","unstructured":"Krishna Pillutla, Kshitiz Malik, Abdel-Rahman Mohamed, Mike Rabbat, Maziar Sanjabi, and Lin Xiao. 2022. Federated learning with partial model personalization. In Proceedings of the International Conference on Machine Learning. PMLR, 17716\u201317758."},{"key":"e_1_3_2_81_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Reddi Sashank J.","year":"2021","unstructured":"Sashank J. Reddi, Zachary Charles, Manzil Zaheer, Zachary Garrett, Keith Rush, Jakub Kone\u010dn\u00fd, Sanjiv Kumar, and Hugh Brendan McMahan. 2021. Adaptive federated optimization. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=LkFG3lB13U5"},{"key":"e_1_3_2_82_2","first-page":"9830","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"33","author":"J\u00f6rg Rothe","year":"2019","unstructured":"J\u00f6rg Rothe. 2019. Borda count in collective decision making: A summary of recent results. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33, 9830\u20139836."},{"key":"e_1_3_2_83_2","unstructured":"Baptiste Roziere Jonas Gehring Fabian Gloeckle Sten Sootla Itai Gat Xiaoqing Ellen Tan Yossi Adi Jingyu Liu Romain Sauvestre Tal Remez et al. 2023. Code Llama: Open foundation models for code. arXiv:2308.12950. Retrieved from https:\/\/arxiv.org\/abs\/2308.12950"},{"key":"e_1_3_2_84_2","first-page":"598","volume-title":"Proceedings of the 2015 IEEE\/ACM 37th IEEE International Conference on Software Engineering","volume":"1","author":"Sadowski Caitlin","year":"2015","unstructured":"Caitlin Sadowski, Jeffrey Van Gogh, Ciera Jaspan, Emma Soderberg, and Collin Winter. 2015. Tricorder: Building a program analysis ecosystem. In Proceedings of the 2015 IEEE\/ACM 37th IEEE International Conference on Software Engineering, Vol. 1. IEEE, 598\u2013608."},{"key":"e_1_3_2_85_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2023.3334955"},{"key":"e_1_3_2_86_2","doi-asserted-by":"publisher","DOI":"10.1145\/3540250.3560883"},{"key":"e_1_3_2_87_2","unstructured":"Andr\u00e9 Silva Sen Fang and Martin Monperrus. 2023. RepairLlama: Efficient representations and fine-tuned adapters for program repair. arXiv:2312.15698. Retrieved from https:\/\/arxiv.org\/abs\/2312.15698"},{"key":"e_1_3_2_88_2","unstructured":"Jingwei Sun Ziyue Xu Hongxu Yin Dong Yang Daguang Xu Yiran Chen and Holger R. Roth. 2023. Fedbpt: Efficient federated black-box prompt tuning for large language models. arXiv:2310.01467. Retrieved from https:\/\/arxiv.org\/abs\/2310.01467"},{"key":"e_1_3_2_89_2","first-page":"21394","article-title":"Personalized federated learning with Moreau envelopes","volume":"33","author":"Dinh Canh T.","year":"2020","unstructured":"Canh T. Dinh, Nguyen Tran, and Josh Nguyen. 2020. Personalized federated learning with Moreau envelopes. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 33, 21394\u201321405.","journal-title":"Proceedings of the Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_90_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2023.3340109"},{"key":"e_1_3_2_91_2","unstructured":"Xunzhu Tang Zhenghan Chen Kisub Kim Haoye Tian Saad Ezzini and Jacques Klein. 2023. Just-in-time security patch detection\u2013LLM at the rescue for data augmentation. arXiv:2312.01241. Retrieved from https:\/\/arxiv.org\/abs\/2312.01241"},{"key":"e_1_3_2_92_2","unstructured":"Xueyang Tang Song Guo and Jingcai Guo. 2021. Personalized federated learning with contextualized generalization. arXiv:2106.13044. Retrieved from https:\/\/arxiv.org\/abs\/2106.13044"},{"key":"e_1_3_2_93_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.emnlp-main.632"},{"key":"e_1_3_2_94_2","unstructured":"Haoye Tian Weiqi Lu Tsz On Li Xunzhu Tang Shing-Chi Cheung Jacques Klein and Tegawend\u00e9 F. Bissyand\u00e9. 2023. Is ChatGPT the ultimate programming assistant\u2013How far is it? arXiv:2304.11938. Retrieved from https:\/\/arxiv.org\/abs\/2304.11938"},{"key":"e_1_3_2_95_2","first-page":"1","volume-title":"Proceedings of the 37th IEEE\/ACM International Conference on Automated Software Engineering","author":"Tian Haoye","year":"2022","unstructured":"Haoye Tian, Xunzhu Tang, Andrew Habib, Shangwen Wang, Kui Liu, Xin Xia, Jacques Klein, and Tegawend\u00e9 F. Bissyand\u00e9. 2022. Is this change the answer to that problem? Correlating descriptions of bug and code changes for evaluating patch correctness. In Proceedings of the 37th IEEE\/ACM International Conference on Automated Software Engineering, 1\u201313."},{"key":"e_1_3_2_96_2","first-page":"16485","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"37","author":"Ting Chih-Kai","year":"2023","unstructured":"Chih-Kai Ting, Karl Munson, Serenity Wade, Anish Savla, Kiran Kate, and Kavitha Srinivas. 2023. CodeStylist: A system for performing code style transfer using neural networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37, 16485\u201316487."},{"key":"e_1_3_2_97_2","unstructured":"Hugo Touvron Louis Martin Kevin Stone Peter Albert Amjad Almahairi Yasmine Babaei Nikolay Bashlykov Soumya Batra Prajjwal Bhargava Shruti Bhosale et al. 2023. Llama 2: Open foundation and fine-tuned chat models. arXiv:2307.09288. Retrieved from https:\/\/arxiv.org\/abs\/2307.09288"},{"issue":"4","key":"e_1_3_2_98_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3340544","article-title":"An empirical study on learning bug-fixing patches in the wild via neural machine translation","volume":"28","author":"Tufano Michele","year":"2019","unstructured":"Michele Tufano, Cody Watson, Gabriele Bavota, Massimiliano Di Penta, Martin White, and Denys Poshyvanyk. 2019. An empirical study on learning bug-fixing patches in the wild via neural machine translation. ACM Transactions on Software Engineering and Methodology 28, 4 (2019), 1\u201329.","journal-title":"ACM Transactions on Software Engineering and Methodology"},{"key":"e_1_3_2_99_2","volume-title":"Proceedings of the 41st International Conference on Machine Learning","author":"Villalobos Pablo","year":"2024","unstructured":"Pablo Villalobos, Anson Ho, Jaime Sevilla, Tamay Besiroglu, Lennart Heim, and Marius Hobbhahn. 2024. Position: Will we run out of data? Limits of LLM scaling based on human-generated data. In Proceedings of the 41st International Conference on Machine Learning. Retrieved from https:\/\/openreview.net\/forum?id=ViZcgDQjyG"},{"key":"e_1_3_2_100_2","unstructured":"Boxin Wang Yibo Jacky Zhang Yuan Cao Bo Li H. Brendan McMahan Sewoong Oh Zheng Xu and Manzil Zaheer. 2023. Can public large language models help private cross-device federated learning? arXiv:2305.12132. Retrieved from https:\/\/arxiv.org\/abs\/2305.12132"},{"key":"e_1_3_2_101_2","doi-asserted-by":"crossref","first-page":"968","DOI":"10.1145\/3324884.3416590","volume-title":"Proceedings of the 35th IEEE\/ACM International Conference on Automated Software Engineering","author":"Wang Shangwen","year":"2020","unstructured":"Shangwen Wang, Ming Wen, Bo Lin, Hongjun Wu, Yihao Qin, Deqing Zou, Xiaoguang Mao, and Hai Jin. 2020. Automated patch correctness assessment: How far are we? In Proceedings of the 35th IEEE\/ACM International Conference on Automated Software Engineering, 968\u2013980."},{"key":"e_1_3_2_102_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2023.emnlp-main.68"},{"key":"e_1_3_2_103_2","doi-asserted-by":"publisher","DOI":"10.1145\/3611643.3616271"},{"key":"e_1_3_2_104_2","first-page":"1","volume-title":"Proceedings of the 4th International Workshop on Mining Software Repositories (MSR \u201907: ICSE Workshops \u201907)","author":"Weiss Cathrin","year":"2007","unstructured":"Cathrin Weiss, Rahul Premraj, Thomas Zimmermann, and Andreas Zeller. 2007. How long will it take to fix this bug? In Proceedings of the 4th International Workshop on Mining Software Repositories (MSR \u201907: ICSE Workshops \u201907). IEEE, 1\u20131."},{"key":"e_1_3_2_105_2","first-page":"13","volume-title":"Proceedings of the 2019 IEEE\/ACM 27th International Conference on Program Comprehension (ICPC)","author":"Wiese Eliane S.","year":"2019","unstructured":"Eliane S. Wiese, Anna N. Rafferty, Daniel M. Kopta, and Jacqulyn M. Anderson. 2019. Replicating novices\u2019 struggles with coding style. In Proceedings of the 2019 IEEE\/ACM 27th International Conference on Program Comprehension (ICPC). IEEE, 13\u201318."},{"key":"e_1_3_2_106_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-4380-9_16"},{"key":"e_1_3_2_107_2","first-page":"39","volume-title":"Proceedings of the 8th Workshop on Data Management for End-to-End Machine Learning","author":"Woisetschl\u00e4ger Herbert","year":"2024","unstructured":"Herbert Woisetschl\u00e4ger, Alexander Erben, Shiqiang Wang, Ruben Mayer, and Hans-Arno Jacobsen. 2024. Federated fine-tuning of LLMs on the very edge: The good, the bad, the ugly. In Proceedings of the 8th Workshop on Data Management for End-to-End Machine Learning, 39\u201350."},{"key":"e_1_3_2_108_2","unstructured":"Fangzhou Wu Qingzhao Zhang Ati Priya Bajaj Tiffany Bao Ning Zhang Ruoyu \u201cFish\u201d Wang and Chaowei Xiao. 2023. Exploring the limits of ChatGPT in software security applications. arXiv:2312.05275. Retrieved from https:\/\/arxiv.org\/abs\/2312.05275"},{"key":"e_1_3_2_109_2","doi-asserted-by":"publisher","DOI":"10.1145\/3663529.3663815"},{"key":"e_1_3_2_110_2","first-page":"37860","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Wu Yue","year":"2023","unstructured":"Yue Wu, Shuaicheng Zhang, Wenchao Yu, Yanchi Liu, Quanquan Gu, Dawei Zhou, Haifeng Chen, and Wei Cheng. 2023. Personalized federated learning under mixture of distributions. In Proceedings of the International Conference on Machine Learning. PMLR, 37860\u201337879."},{"key":"e_1_3_2_111_2","unstructured":"Chunqiu Steven Xia Yuxiang Wei and Lingming Zhang. 2022. Practical program repair in the era of large pre-trained language models. arXiv:2210.14179. Retrieved from https:\/\/arxiv.org\/abs\/2210.14179"},{"key":"e_1_3_2_112_2","first-page":"1482","volume-title":"Proceedings of the 2023 IEEE\/ACM 45th International Conference on Software Engineering (ICSE)","author":"Xia Chunqiu Steven","year":"2023","unstructured":"Chunqiu Steven Xia, Yuxiang Wei, and Lingming Zhang. 2023. Automated program repair in the era of large pre-trained language models. In Proceedings of the 2023 IEEE\/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 1482\u20131494."},{"key":"e_1_3_2_113_2","first-page":"959","volume-title":"Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering","author":"Steven Xia Chunqiu","year":"2022","unstructured":"Chunqiu Steven Xia and Lingming Zhang. 2022. Less training, more repairing please: Revisiting automated program repair via zero-shot learning. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 959\u2013971."},{"key":"e_1_3_2_114_2","unstructured":"Can Xu Qingfeng Sun Kai Zheng Xiubo Geng Pu Zhao Jiazhan Feng Chongyang Tao and Daxin Jiang. 2023. WizardLM: Empowering large language models to follow complex instructions. arXiv:2304.12244. Retrieved from https:\/\/arxiv.org\/abs\/2304.12244"},{"key":"e_1_3_2_115_2","first-page":"13","volume-title":"Proceedings of the 12th ACM Workshop on Artificial Intelligence and Security","author":"Xu Runhua","year":"2019","unstructured":"Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, and Heiko Ludwig. 2019. Hybridalpha: An efficient approach for privacy-preserving federated learning. In Proceedings of the 12th ACM Workshop on Artificial Intelligence and Security, 13\u201323."},{"key":"e_1_3_2_116_2","doi-asserted-by":"publisher","DOI":"10.1145\/3597503.3623342"},{"key":"e_1_3_2_117_2","doi-asserted-by":"crossref","first-page":"882","DOI":"10.1145\/3650212.3680328","volume-title":"Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis","author":"Yang Boyang","year":"2024","unstructured":"Boyang Yang, Haoye Tian, Weiguo Pian, Haoran Yu, Haitao Wang, Jacques Klein, Tegawend\u00e9, F. Bissyand\u00e9, and Shunfu Jin. 2024. CREF: An LLM-based conversational software repair framework for programming tutors. In Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 882\u2013894."},{"key":"e_1_3_2_118_2","unstructured":"Boyang Yang Haoye Tian Jiadong Ren Shunfu Jin Yang Liu Feng Liu and Bach Le. 2025. Enhancing repository-level software repair via repository-aware knowledge graphs. arXiv:2503.21710. Retrieved from https:\/\/arxiv.org\/abs\/2503.21710"},{"key":"e_1_3_2_119_2","unstructured":"Boyang Yang Haoye Tian Jiadong Ren Hongyu Zhang Jacques Klein Tegawend\u00e9 F. Bissyand\u00e9 Claire Le Goues and Shunfu Jin. 2024. Multi-objective fine-tuning for enhanced program repair with LLMs. arXiv:2404.12636. Retrieved from https:\/\/arxiv.org\/abs\/2404.12636"},{"key":"e_1_3_2_120_2","doi-asserted-by":"publisher","DOI":"10.1145\/3106237.3106274"},{"key":"e_1_3_2_121_2","first-page":"72181","article-title":"Dynamic personalized federated learning with adaptive differential privacy","volume":"36","author":"Yang Xiyuan","year":"2023","unstructured":"Xiyuan Yang, Wenke Huang, and Mang Ye. 2023. Dynamic personalized federated learning with adaptive differential privacy. In Proceedings of the Advances in Neural Information Processing Systems, Vol. 36, 72181\u201372192.","journal-title":"Proceedings of the Advances in Neural Information Processing Systems"},{"issue":"2","key":"e_1_3_2_122_2","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1109\/TSE.2023.3347898","article-title":"Federated learning for software engineering: A case study of code clone detection and defect prediction","volume":"50","author":"Yang Yanming","year":"2024","unstructured":"Yanming Yang, Xing Hu, Zhipeng Gao, Jinfu Chen, Chao Ni, Xin Xia, and David Lo. 2024. Federated learning for software engineering: A case study of code clone detection and defect prediction. IEEE Transactions on Software Engineering 50, 2 (2024), 296\u2013321.","journal-title":"IEEE Transactions on Software Engineering"},{"issue":"8","key":"e_1_3_2_123_2","doi-asserted-by":"crossref","first-page":"2920","DOI":"10.1109\/TSE.2021.3071750","article-title":"Automated classification of overfitting patches with statically extracted code features","volume":"48","author":"Ye He","year":"2021","unstructured":"He Ye, Jian Gu, Matias Martinez, Thomas Durieux, and Martin Monperrus. 2021. Automated classification of overfitting patches with statically extracted code features. IEEE Transactions on Software Engineering 48, 8 (2021), 2920\u20132938.","journal-title":"IEEE Transactions on Software Engineering"},{"key":"e_1_3_2_124_2","doi-asserted-by":"publisher","DOI":"10.1145\/3533767.3534219"},{"key":"e_1_3_2_125_2","unstructured":"Linan Yue Qi Liu Yichao Du Weibo Gao Ye Liu and Fangzhou Yao. 2023. Fedjudge: Federated legal large language model. arXiv:2309.08173. Retrieved from https:\/\/arxiv.org\/abs\/2309.08173"},{"key":"e_1_3_2_126_2","volume-title":"Proceedings of the 12th International Conference on Learning Representations","author":"Zhang Biao","year":"2024","unstructured":"Biao Zhang, Zhongtao Liu, Colin Cherry, and Orhan Firat. 2024. When scaling meets LLM finetuning: The effect of data, model and finetuning method. In Proceedings of the 12th International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=5HCnKDeTws"},{"issue":"2","key":"e_1_3_2_127_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3631974","article-title":"A survey of learning-based automated program repair","volume":"33","author":"Zhang Quanjun","year":"2023","unstructured":"Quanjun Zhang, Chunrong Fang, Yuxiang Ma, Weisong Sun, and Zhenyu Chen. 2023. A survey of learning-based automated program repair. ACM Transactions on Software Engineering and Methodology 33, 2 (2023), 1\u201369.","journal-title":"ACM Transactions on Software Engineering and Methodology"},{"issue":"3","key":"e_1_3_2_128_2","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1109\/TSE.2024.3354969","article-title":"Appt: Boosting automated patch correctness prediction via fine-tuning pre-trained models","volume":"50","author":"Zhang Quanjun","year":"2024","unstructured":"Quanjun Zhang, Chunrong Fang, Weisong Sun, Yan Liu, Tieke He, Xiaodong Hao, and Zhenyu Chen. 2024. Appt: Boosting automated patch correctness prediction via fine-tuning pre-trained models. IEEE Transactions on Software Engineering 50, 3 (2024), 474\u2013494.","journal-title":"IEEE Transactions on Software Engineering"},{"key":"e_1_3_2_129_2","unstructured":"Quanjun Zhang Chunrong Fang Yang Xie YuXiang Ma Weisong Sun and Yun Yang Zhenyu Chen. 2024. A systematic literature review on large language models for automated program repair. arXiv:2405.01466. Retrieved from https:\/\/arxiv.org\/abs\/2405.01466"},{"key":"e_1_3_2_130_2","first-page":"460","volume-title":"Proceedings of the 2023 IEEE\/ACM 45th International Conference on Software Engineering (ICSE)","author":"Zhang Ziqi","year":"2023","unstructured":"Ziqi Zhang, Yuanchun Li, Bingyan Liu, Yifeng Cai, Ding Li, Yao Guo, and Xiangqun Chen. 2023. FedSlice: Protecting federated learning models from malicious participants with model slicing. In Proceedings of the 2023 IEEE\/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 460\u2013472."},{"key":"e_1_3_2_131_2","unstructured":"Jujia Zhao Wenjie Wang Chen Xu Zhaochun Ren See-Kiong Ng and Tat-Seng Chua. 2024. LLM-based federated recommendation. arXiv:2402.09959. Retrieved from https:\/\/arxiv.org\/abs\/2402.09959"},{"key":"e_1_3_2_132_2","unstructured":"Fei Zheng. 2023. Input reconstruction attack against vertical federated large language models. arXiv:2311.07585. Retrieved from https:\/\/arxiv.org\/abs\/2311.07585"},{"key":"e_1_3_2_133_2","doi-asserted-by":"publisher","DOI":"10.1145\/3580305.3599790"},{"issue":"3","key":"e_1_3_2_134_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3631972","article-title":"Improving automated program repair with domain adaptation","volume":"33","author":"Zirak Armin","year":"2024","unstructured":"Armin Zirak and Hadi Hemmati. 2024. Improving automated program repair with domain adaptation. ACM Transactions on Software Engineering and Methodology 33, 3 (2024), 1\u201343.","journal-title":"ACM Transactions on Software Engineering and Methodology"}],"container-title":["ACM Transactions on Software Engineering and Methodology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3733599","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T14:36:50Z","timestamp":1770993410000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3733599"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,13]]},"references-count":133,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2026,3,31]]}},"alternative-id":["10.1145\/3733599"],"URL":"https:\/\/doi.org\/10.1145\/3733599","relation":{},"ISSN":["1049-331X","1557-7392"],"issn-type":[{"value":"1049-331X","type":"print"},{"value":"1557-7392","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,13]]},"assertion":[{"value":"2024-10-23","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-04-13","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2026-02-13","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}