{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T07:50:01Z","timestamp":1777017001294,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":26,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,12,17]]},"DOI":"10.1145\/3799830.3799841","type":"proceedings-article","created":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T06:45:08Z","timestamp":1777013108000},"page":"97-105","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["TranSCoT: Towards Improving Quality of Code Translations"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-5252-7734","authenticated-orcid":false,"given":"Lalit","family":"Meena","sequence":"first","affiliation":[{"name":"IIT Delhi, New Delhi, India"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-8125-0671","authenticated-orcid":false,"given":"Shashank","family":"Govindappa","sequence":"additional","affiliation":[{"name":"IIT Delhi, New Delhi, India"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-4847-0405","authenticated-orcid":false,"given":"Monika","family":"Gupta","sequence":"additional","affiliation":[{"name":"IIT Delhi, New Delhi, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9831-7895","authenticated-orcid":false,"given":"Anamitra","family":"Choudhury","sequence":"additional","affiliation":[{"name":"IBM Research, New Delhi, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8892-6761","authenticated-orcid":false,"given":"Vijay","family":"Arya","sequence":"additional","affiliation":[{"name":"IBM Research, New Delhi, India"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-5053-2059","authenticated-orcid":false,"given":"Yogish","family":"Sabharwal","sequence":"additional","affiliation":[{"name":"IBM Research, New Delhi, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3949-2175","authenticated-orcid":false,"given":"Srikanta","family":"Bedathur","sequence":"additional","affiliation":[{"name":"IIT Delhi, New Delhi, India"}]}],"member":"320","published-online":{"date-parts":[[2026,4,23]]},"reference":[{"key":"e_1_3_3_1_2_2","unstructured":"2023. GPT-4 Technical Report. arXiv:https:\/\/arXiv.org\/abs\/2303.08774\u00a0[cs.CL]"},{"key":"e_1_3_3_1_3_2","unstructured":"2023. TSS. The Most Accurate and Reliable Source Code Converters 2023.(Tangible Software Solutions). https:\/\/www.tangiblesoftwaresolutions.com."},{"key":"e_1_3_3_1_4_2","unstructured":"2024. Granite Code Models: A Family of Open Foundation Models for Code Intelligence. arXiv:https:\/\/arXiv.org\/abs\/2405.04324\u00a0[cs.AI]"},{"key":"e_1_3_3_1_5_2","unstructured":"Wasi\u00a0Uddin Ahmad Md\u00a0Golam\u00a0Rahman Tushar Saikat Chakraborty and Kai-Wei Chang. 2021. Avatar: A parallel corpus for java-python program translation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2108.11590 (2021)."},{"key":"e_1_3_3_1_6_2","unstructured":"Tom Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared\u00a0D Kaplan Prafulla Dhariwal Arvind Neelakantan Pranav Shyam Girish Sastry Amanda Askell et\u00a0al. 2020. Language models are few-shot learners. Advances in neural information processing systems 33 (2020) 1877\u20131901."},{"key":"e_1_3_3_1_7_2","unstructured":"Mark Chen Jerry Tworek Heewoo Jun Qiming Yuan Henrique\u00a0Ponde de Oliveira\u00a0Pinto Jared Kaplan Harri Edwards Yuri Burda Nicholas Joseph Greg Brockman Alex Ray Raul Puri Gretchen Krueger Michael Petrov Heidy Khlaaf Girish Sastry Pamela Mishkin Brooke Chan Scott Gray Nick Ryder Mikhail Pavlov Alethea Power Lukasz Kaiser Mohammad Bavarian Clemens Winter Philippe Tillet Felipe\u00a0Petroski Such Dave Cummings Matthias Plappert Fotios Chantzis Elizabeth Barnes Ariel Herbert-Voss William\u00a0Hebgen Guss Alex Nichol Alex Paino Nikolas Tezak Jie Tang Igor Babuschkin Suchir Balaji Shantanu Jain William Saunders Christopher Hesse Andrew\u00a0N. Carr Jan Leike Josh Achiam Vedant Misra Evan Morikawa Alec Radford Matthew Knight Miles Brundage Mira Murati Katie Mayer Peter Welinder Bob McGrew Dario Amodei Sam McCandlish Ilya Sutskever and Wojciech Zaremba. 2021. Evaluating Large Language Models Trained on Code. arXiv:https:\/\/arXiv.org\/abs\/2107.03374\u00a0[cs.LG] https:\/\/arxiv.org\/abs\/2107.03374"},{"key":"e_1_3_3_1_8_2","unstructured":"Nikita Fomin. 2019. py2java: Python to Java Language Translator 2019. https:\/\/pypi.org\/project\/py2java\/."},{"key":"e_1_3_3_1_9_2","unstructured":"Dong Huang Qingwen Bu Yuhao Qing and Heming Cui. 2024. CodeCoT: Tackling Code Syntax Errors in CoT Reasoning for Code Generation. arXiv:https:\/\/arXiv.org\/abs\/2308.08784\u00a0[cs.SE]"},{"key":"e_1_3_3_1_10_2","unstructured":"Ali\u00a0Reza Ibrahimzada Kaiyao Ke Mrigank Pawagi Muhammad\u00a0Salman Abid Rangeet Pan Saurabh Sinha and Reyhaneh Jabbarvand. 2024. Repository-Level Compositional Code Translation and Validation. arXiv:https:\/\/arXiv.org\/abs\/2410.24117\u00a0[cs.SE] https:\/\/arxiv.org\/abs\/2410.24117"},{"key":"e_1_3_3_1_11_2","doi-asserted-by":"publisher","unstructured":"Ali\u00a0Reza Ibrahimzada Kaiyao Ke Mrigank Pawagi Muhammad\u00a0Salman Abid Rangeet Pan Saurabh Sinha and Reyhaneh Jabbarvand. 2025. AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and Validation. Proceedings of the ACM on Software Engineering 2 FSE (June 2025) 2454\u20132476. 10.1145\/3729379","DOI":"10.1145\/3729379"},{"key":"e_1_3_3_1_12_2","unstructured":"Prithwish Jana Piyush Jha Haoyang Ju Gautham Kishore Aryan Mahajan and Vijay Ganesh. 2024. CoTran: An LLM-based Code Translator using Reinforcement Learning with Feedback from Compiler and Symbolic Execution. arXiv:https:\/\/arXiv.org\/abs\/2306.06755\u00a0[cs.PL]"},{"key":"e_1_3_3_1_13_2","doi-asserted-by":"crossref","unstructured":"Takeshi Kojima Shixiang\u00a0Shane Gu Machel Reid Yutaka Matsuo and Yusuke Iwasawa. 2022. Large language models are zero-shot reasoners. Advances in neural information processing systems 35 (2022) 22199\u201322213.","DOI":"10.52202\/068431-1613"},{"key":"e_1_3_3_1_14_2","unstructured":"Jia Li Ge Li Yongmin Li and Zhi Jin. 2023. Structured chain-of-thought prompting for code generation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2305.06599 (2023)."},{"key":"e_1_3_3_1_15_2","unstructured":"Raymond Li Loubna\u00a0Ben Allal Yangtian Zi Niklas Muennighoff Denis Kocetkov Chenghao Mou Marc Marone Christopher Akiki Jia Li Jenny Chim et\u00a0al. 2023. StarCoder: may the source be with you! arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2305.06161 (2023)."},{"key":"e_1_3_3_1_16_2","unstructured":"Troy Melhase Brian Kearns Ling Li Iulius Curt and Shyam Saladi. 2016. java2python: Simple but Effective Tool to Translate Java Source Code into Python. https:\/\/github.com\/natural\/java2python."},{"key":"e_1_3_3_1_17_2","doi-asserted-by":"crossref","unstructured":"Sewon Min Xinxi Lyu Ari Holtzman Mikel Artetxe Mike Lewis Hannaneh Hajishirzi and Luke Zettlemoyer. 2022. Rethinking the role of demonstrations: What makes in-context learning work? arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2202.12837 (2022).","DOI":"10.18653\/v1\/2022.emnlp-main.759"},{"key":"e_1_3_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/3597503.3639226"},{"key":"e_1_3_3_1_19_2","volume-title":"Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2)","author":"Puri Ruchir","year":"2021","unstructured":"Ruchir Puri, David\u00a0S Kung, Geert Janssen, Wei Zhang, Giacomo Domeniconi, Vladimir Zolotov, Julian Dolby, Jie Chen, Mihir Choudhury, Lindsey Decker, et\u00a0al. 2021. CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2)."},{"key":"e_1_3_3_1_20_2","unstructured":"Baptiste Roziere Marie-Anne Lachaux Lowik Chanussot and Guillaume Lample. 2020. Unsupervised translation of programming languages. Advances in Neural Information Processing Systems 33 (2020) 20601\u201320611."},{"key":"e_1_3_3_1_21_2","unstructured":"Baptiste Rozi\u00e8re Jonas Gehring Fabian Gloeckle Sten Sootla Itai Gat Xiaoqing\u00a0Ellen Tan Yossi Adi Jingyu Liu Romain Sauvestre Tal Remez J\u00e9r\u00e9my Rapin Artyom Kozhevnikov Ivan Evtimov Joanna Bitton Manish Bhatt Cristian\u00a0Canton Ferrer Aaron Grattafiori Wenhan Xiong Alexandre D\u00e9fossez Jade Copet Faisal Azhar Hugo Touvron Louis Martin Nicolas Usunier Thomas Scialom and Gabriel Synnaeve. 2024. Code Llama: Open Foundation Models for Code. arXiv:https:\/\/arXiv.org\/abs\/2308.12950\u00a0[cs.CL]"},{"key":"e_1_3_3_1_22_2","unstructured":"Parshin Shojaee Aneesh Jain Sindhu Tipirneni and Chandan\u00a0K Reddy. 2023. Execution-based Code Generation using Deep Reinforcement Learning. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2301.13816 2023."},{"key":"e_1_3_3_1_23_2","unstructured":"Yanli Wang Yanlin Wang Suiquan Wang Daya Guo Jiachi Chen John Grundy Xilin Liu Yuchi Ma Mingzhi Mao Hongyu Zhang and Zibin Zheng. 2024. RepoTransBench: A Real-World Benchmark for Repository-Level Code Translation. arXiv:https:\/\/arXiv.org\/abs\/2412.17744\u00a0[cs.SE] https:\/\/arxiv.org\/abs\/2412.17744"},{"key":"e_1_3_3_1_24_2","doi-asserted-by":"crossref","unstructured":"Jason Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Fei Xia Ed Chi Quoc\u00a0V Le Denny Zhou et\u00a0al. 2022. Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems 35 (2022) 24824\u201324837.","DOI":"10.52202\/068431-1800"},{"key":"e_1_3_3_1_25_2","unstructured":"Yiqing Xie Atharva Naik Daniel Fried and Carolyn Rose. 2023. Data Augmentation for Code Translation with Comparable Corpora and Multiple References. arXiv:https:\/\/arXiv.org\/abs\/2311.00317\u00a0[cs.CL] https:\/\/arxiv.org\/abs\/2311.00317"},{"key":"e_1_3_3_1_26_2","unstructured":"Guang Yang Yu Zhou Xiang Chen Xiangyu Zhang Terry\u00a0Yue Zhuo and Taolue Chen. 2023. Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2312.05562 (2023)."},{"key":"e_1_3_3_1_27_2","unstructured":"Zhuosheng Zhang Aston Zhang Mu Li and Alex Smola. 2022. Automatic chain of thought prompting in large language models. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2210.03493 (2022)."}],"event":{"name":"CODS 2025: 13th ACM IKDD International Conference on Data Science","location":"Pune India","acronym":"CODS 2025"},"container-title":["Proceedings of the 13th ACM IKDD International Conference on Data Science"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3799830.3799841","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T07:20:39Z","timestamp":1777015239000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3799830.3799841"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,17]]},"references-count":26,"alternative-id":["10.1145\/3799830.3799841","10.1145\/3799830"],"URL":"https:\/\/doi.org\/10.1145\/3799830.3799841","relation":{},"subject":[],"published":{"date-parts":[[2025,12,17]]},"assertion":[{"value":"2026-04-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}