{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T11:56:22Z","timestamp":1775822182321,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,10,23]],"date-time":"2019-10-23T00:00:00Z","timestamp":1571788800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,10,23]]},"DOI":"10.1145\/3359591.3359735","type":"proceedings-article","created":{"date-parts":[[2019,10,10]],"date-time":"2019-10-10T18:52:21Z","timestamp":1570733541000},"page":"143-153","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":189,"title":["The adverse effects of code duplication in machine learning models of code"],"prefix":"10.1145","author":[{"given":"Miltiadis","family":"Allamanis","sequence":"first","affiliation":[{"name":"Microsoft Research, UK"}]}],"member":"320","published-online":{"date-parts":[[2019,10,23]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3212695"},{"key":"e_1_3_2_2_2_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR).","author":"Allamanis Miltiadis","year":"2018","unstructured":"Miltiadis Allamanis , Marc Brockschmidt , and Mahmoud Khademi . 2018 . Learning to Represent Programs with Graphs . In Proceedings of the International Conference on Learning Representations (ICLR). Miltiadis Allamanis, Marc Brockschmidt, and Mahmoud Khademi. 2018. Learning to Represent Programs with Graphs. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_2_3_1","volume-title":"Proceedings of the International Conference on Machine Learning (ICML). 2091\u20132100","author":"Allamanis Miltiadis","year":"2016","unstructured":"Miltiadis Allamanis , Hao Peng , and Charles Sutton . 2016 . A convolutional attention network for extreme summarization of source code . In Proceedings of the International Conference on Machine Learning (ICML). 2091\u20132100 . Miltiadis Allamanis, Hao Peng, and Charles Sutton. 2016. A convolutional attention network for extreme summarization of source code. In Proceedings of the International Conference on Machine Learning (ICML). 2091\u20132100."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSR.2013.6624029"},{"key":"e_1_3_2_2_5_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR).","author":"Alon Uri","year":"2010","unstructured":"Uri Alon , Omer Levy , and Eran Yahav . 2010 . code2seq: Generating Sequences from Structured Representations of Code . In Proceedings of the International Conference on Learning Representations (ICLR). Uri Alon, Omer Levy, and Eran Yahav. 2010. code2seq: Generating Sequences from Structured Representations of Code. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290353"},{"key":"e_1_3_2_2_7_1","volume-title":"Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers)","volume":"2","author":"Miceli Barone Antonio Valerio","year":"2017","unstructured":"Antonio Valerio Miceli Barone and Rico Sennrich . 2017 . A Parallel Corpus of Python Functions and Documentation Strings for Automated Code Documentation and Code Generation . In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers) , Vol. 2 . 314\u2013319. Antonio Valerio Miceli Barone and Rico Sennrich. 2017. A Parallel Corpus of Python Functions and Documentation Strings for Automated Code Documentation and Code Generation. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Vol. 2. 314\u2013319."},{"key":"e_1_3_2_2_8_1","first-page":"1137","article-title":"A neural probabilistic language model","author":"Bengio Yoshua","year":"2003","unstructured":"Yoshua Bengio , R\u00e9jean Ducharme , Pascal Vincent , and Christian Jauvin . 2003 . A neural probabilistic language model . Journal of Machine Learning Research (JMLR) 3 , Feb (2003), 1137 \u2013 1155 . Yoshua Bengio, R\u00e9jean Ducharme, Pascal Vincent, and Christian Jauvin. 2003. A neural probabilistic language model. Journal of Machine Learning Research (JMLR) 3, Feb (2003), 1137\u20131155.","journal-title":"Journal of Machine Learning Research (JMLR) 3"},{"key":"e_1_3_2_2_9_1","volume-title":"Proceedings of the International Conference on Machine Learning (ICML). 2933\u20132942","author":"Bielik Pavol","year":"2016","unstructured":"Pavol Bielik , Veselin Raychev , and Martin Vechev . 2016 . PHOG: Probabilistic Model for Code . In Proceedings of the International Conference on Machine Learning (ICML). 2933\u20132942 . Pavol Bielik, Veselin Raychev, and Martin Vechev. 2016. PHOG: Probabilistic Model for Code. In Proceedings of the International Conference on Machine Learning (ICML). 2933\u20132942."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2597073.2597102"},{"key":"e_1_3_2_2_11_1","volume-title":"Abbas Heydarnoori, and Vladimir Filkov.","author":"Gharehyazie Mohammad","year":"2018","unstructured":"Mohammad Gharehyazie , Baishakhi Ray , Mehdi Keshani , Masoumeh Soleimani Zavosht , Abbas Heydarnoori, and Vladimir Filkov. 2018 . Cross-project code clones in GitHub. Empirical Software Engineering ( 2018), 1\u201336. Mohammad Gharehyazie, Baishakhi Ray, Mehdi Keshani, Masoumeh Soleimani Zavosht, Abbas Heydarnoori, and Vladimir Filkov. 2018. Cross-project code clones in GitHub. Empirical Software Engineering (2018), 1\u201336."},{"key":"e_1_3_2_2_12_1","volume-title":"Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS).","author":"Hashimoto Tatsunori B","year":"2018","unstructured":"Tatsunori B Hashimoto , Kelvin Guu , Yonatan Oren , and Percy Liang . 2018 . A Retrieve-and-Edit Framework for Predicting Structured Outputs . In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS). Tatsunori B Hashimoto, Kelvin Guu, Yonatan Oren, and Percy Liang. 2018. A Retrieve-and-Edit Framework for Predicting Structured Outputs. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3236024.3236051"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3106237.3106290"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/2337223.2337322"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1195"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1192"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-008-9076-6"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133908"},{"key":"e_1_3_2_2_20_1","volume-title":"Proceedings of the International Conference on Machine Learning (ICML). 649\u2013657","author":"Maddison Chris","year":"2014","unstructured":"Chris Maddison and Daniel Tarlow . 2014 . Structured generative models of natural source code . In Proceedings of the International Conference on Machine Learning (ICML). 649\u2013657 . Chris Maddison and Daniel Tarlow. 2014. Structured generative models of natural source code. In Proceedings of the International Conference on Machine Learning (ICML). 649\u2013657."},{"key":"e_1_3_2_2_21_1","volume-title":"Machine Learning: A Probabilistic Perspective","author":"Murphy Kevin P","year":"2012","unstructured":"Kevin P Murphy . 2012 . Machine Learning: A Probabilistic Perspective . MIT Press . Kevin P Murphy. 2012. Machine Learning: A Probabilistic Perspective. MIT Press."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/2837614.2837671"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2676726.2677009"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2666356.2594321"},{"key":"e_1_3_2_2_25_1","volume-title":"A survey on software clone detection research. Queen\u2019s School of Computing TR 541, 115","author":"Roy Chanchal Kumar","year":"2007","unstructured":"Chanchal Kumar Roy and James R Cordy . 2007. A survey on software clone detection research. Queen\u2019s School of Computing TR 541, 115 ( 2007 ), 64\u201368. Chanchal Kumar Roy and James R Cordy. 2007. A survey on software clone detection research. Queen\u2019s School of Computing TR 541, 115 (2007), 64\u201368."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2884781.2884877"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/APSEC.2010.46"},{"key":"e_1_3_2_2_28_1","volume-title":"Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning 4, 2","author":"Tieleman Tijmen","year":"2012","unstructured":"Tijmen Tieleman and Geoffrey Hinton . 2012. Lecture 6.5-RMS Prop : Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning 4, 2 ( 2012 ), 26\u201331. Tijmen Tieleman and Geoffrey Hinton. 2012. Lecture 6.5-RMSProp: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning 4, 2 (2012), 26\u201331."}],"event":{"name":"SPLASH '19: 2019 ACM SIGPLAN International Conference on Systems, Programming, Languages, and Applications: Software for Humanity","location":"Athens Greece","acronym":"SPLASH '19","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages"]},"container-title":["Proceedings of the 2019 ACM SIGPLAN International Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3359591.3359735","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3359591.3359735","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:23:06Z","timestamp":1750202586000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3359591.3359735"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,10,23]]},"references-count":28,"alternative-id":["10.1145\/3359591.3359735","10.1145\/3359591"],"URL":"https:\/\/doi.org\/10.1145\/3359591.3359735","relation":{},"subject":[],"published":{"date-parts":[[2019,10,23]]},"assertion":[{"value":"2019-10-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}