{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,24]],"date-time":"2025-06-24T11:44:38Z","timestamp":1750765478695,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":171,"publisher":"ACM","license":[{"start":{"date-parts":[[2024,6,3]],"date-time":"2024-06-03T00:00:00Z","timestamp":1717372800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2024,6,3]]},"DOI":"10.1145\/3630106.3659019","type":"proceedings-article","created":{"date-parts":[[2024,6,5]],"date-time":"2024-06-05T13:14:21Z","timestamp":1717593261000},"page":"1971-1983","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["The Emerging Artifacts of Centralized Open-Code"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0008-4752-7164","authenticated-orcid":false,"given":"Madiha Zahrah","family":"Choksi","sequence":"first","affiliation":[{"name":"Department of Computing and Information Science, Cornell Tech, United States of America"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8569-5176","authenticated-orcid":false,"given":"Ilan","family":"Mandel","sequence":"additional","affiliation":[{"name":"Department of Computing and Information Science, Cornell Tech, United States of America"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6912-8067","authenticated-orcid":false,"given":"David","family":"Widder","sequence":"additional","affiliation":[{"name":"Department of Computing and Information Science, Cornell Tech, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5954-916X","authenticated-orcid":false,"given":"Yan","family":"Shvartzshnaider","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer 
Science, York University, Canada"}]}],"member":"320","published-online":{"date-parts":[[2024,6,5]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2022. Dependabot README.MD. https:\/\/github.com\/dependabot\/dependabot-core"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3106237.3121278"},{"key":"e_1_3_2_1_3_1","unstructured":"Darren Abramson and Ali Emami. 2022. Interpreting docstrings without using common sense. (2022). https:\/\/www.fsf.org\/licensing\/copilot\/interpreting-docstrings-without-using-common-sense"},{"key":"e_1_3_2_1_4_1","unstructured":"Pietro Albini. 2019. Shipping a compiler every six weeks. https:\/\/www.pietroalbini.org\/blog\/shipping-a-compiler-every-six-weeks\/"},{"key":"e_1_3_2_1_5_1","volume-title":"SantaCoder: don\u2019t reach for the stars! arXiv preprint arXiv:2301.03988","author":"Allal Loubna\u00a0Ben","year":"2023","unstructured":"Loubna\u00a0Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos\u00a0Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, 2023. SantaCoder: don\u2019t reach for the stars! arXiv preprint arXiv:2301.03988 (2023)."},{"key":"e_1_3_2_1_6_1","unstructured":"Brian Anderson. 2017. How Rust is Tested. https:\/\/brson.github.io\/2017\/07\/10\/how-rust-is-tested"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3568199.3568228"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3545945.3569759"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3442188.3445922"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/MAHC.2006.76"},{"key":"e_1_3_2_1_11_1","volume-title":"Julia: A fast dynamic language for technical computing. arXiv preprint arXiv:1209.5145","author":"Bezanson Jeff","year":"2012","unstructured":"Jeff Bezanson, Stefan Karpinski, Viral\u00a0B Shah, and Alan Edelman. 2012. Julia: A fast dynamic language for technical computing. 
arXiv preprint arXiv:1209.5145 (2012)."},{"volume-title":"Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency. 948\u2013958","author":"Birhane Abeba","key":"e_1_3_2_1_12_1","unstructured":"Abeba Birhane, Elayne Ruane, Thomas Laurent, Matthew S.\u00a0Brown, Johnathan Flowers, Anthony Ventresque, and Christopher L.\u00a0Dancy. 2022. The forgotten margins of AI ethics. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency. 948\u2013958."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533111"},{"key":"e_1_3_2_1_14_1","unstructured":"Valeriia Boldosova. 2015. Looking beyond traditional network relationships: Online Subcontracting Platform as an unconventional tool for connecting and benefiting actors in the network. (2015)."},{"key":"e_1_3_2_1_15_1","unstructured":"Mara Bos. 2022. Do we need a \"Rust Standard\"? https:\/\/blog.m-ou.se\/rust-standard\/"},{"key":"e_1_3_2_1_16_1","unstructured":"Paul Brown. 2017. State of the Union: npm - Linux.com. https:\/\/www.linux.com\/news\/state-union-npm\/"},{"key":"e_1_3_2_1_17_1","unstructured":"Matthew Butterick. 2022. This CoPilot is stupid and wants to kill me."},{"key":"e_1_3_2_1_18_1","unstructured":"Kate Catlin. 2022. GitHub Advisory Database now open to community contributions. https:\/\/github.blog\/2022-02-22-github-advisory-database-now-open-to-community-contributions\/"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3500868.3561406"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/SANER50967.2021.00043"},{"key":"e_1_3_2_1_21_1","volume-title":"Jared Kaplan, Harri Edwards, Yuri Burda","author":"Chen Mark","year":"2021","unstructured":"Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de\u00a0Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, 2021. 
Evaluating large language models trained on code. arXiv preprint arXiv:2107.03374 (2021)."},{"key":"e_1_3_2_1_22_1","volume-title":"Intellectual Property, and Ethics. arXiv preprint arXiv:2304.02839","author":"Choksi Madiha\u00a0Zahrah","year":"2023","unstructured":"Madiha\u00a0Zahrah Choksi and David Goedicke. 2023. Whose Text Is It Anyway? Exploring BigCode, Intellectual Property, and Ethics. arXiv preprint arXiv:2304.02839 (2023)."},{"key":"e_1_3_2_1_23_1","article-title":"How Licenses Learn","volume":"28","author":"Choksi Madiha\u00a0Zahrah","year":"2024","unstructured":"Madiha\u00a0Zahrah Choksi and James Grimmelmann. 2024. How Licenses Learn. Forthcoming, Lewis & Clark Law Review 28, 2 (2024).","journal-title":"Forthcoming, Lewis & Clark Law Review"},{"key":"e_1_3_2_1_24_1","volume-title":"RIAA blitz takes down 18 GitHub projects used for downloading YouTube videos. ZDNET (23","author":"Cimpanu Catalin","year":"2024","unstructured":"Catalin Cimpanu. 2024. RIAA blitz takes down 18 GitHub projects used for downloading YouTube videos. ZDNET (23 Oct 2024). https:\/\/www.zdnet.com\/article\/riaa-blitz-takes-down-18-github-projects-used-for-downloading-youtube-videos\/"},{"key":"e_1_3_2_1_25_1","unstructured":"Thomas Claburn. 2020. Microsoft\u2019s GitHub absorbs NPM into its code-hosting empire: JavaScript library vault used by 12 million devs now under Redmond\u2019s roof. https:\/\/www.theregister.com\/2020\/03\/16\/microsofts_github_npm\/"},{"key":"e_1_3_2_1_26_1","unstructured":"Thomas Claburn. 2022. FauxPilot: Like GitHub Copilot without Microsoft telemetry. https:\/\/www.theregister.com\/2022\/08\/06\/fauxpilot_github_copilot\/"},{"key":"e_1_3_2_1_27_1","volume-title":"2015 IEEE\/ACM 12th Working Conference on Mining Software Repositories. IEEE, 212\u2013223","author":"Claes Maelick","year":"2015","unstructured":"Maelick Claes, Tom Mens, Roberto Di\u00a0Cosmo, and J\u00e9r\u00f4me Vouillon. 2015. A historical analysis of Debian package incompatibilities. 
In 2015 IEEE\/ACM 12th Working Conference on Mining Software Repositories. IEEE, 212\u2013223."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3593013.3594073"},{"key":"e_1_3_2_1_29_1","volume-title":"Three ethical moments in Debian. Available at SSRN 805287","author":"Coleman E\u00a0Gabriella","year":"2005","unstructured":"E\u00a0Gabriella Coleman. 2005. Three ethical moments in Debian. Available at SSRN 805287 (2005)."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1515\/9781400845293"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533143"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533150"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533137"},{"key":"e_1_3_2_1_34_1","volume-title":"The sustainability of open source commons. European Journal of Information Systems","author":"Curto-Millet Daniel","year":"2022","unstructured":"Daniel Curto-Millet and Alberto Cors\u00edn\u00a0Jim\u00e9nez. 2022. The sustainability of open source commons. European Journal of Information Systems (2022), 1\u201319."},{"key":"e_1_3_2_1_35_1","unstructured":"Ryan Dahl. 2018. 10 Things I Regret About Node.js. https:\/\/www.youtube.com\/watch?v=m3bm9tb-8ya"},{"key":"e_1_3_2_1_36_1","unstructured":"Ryan Dahl. 2023. Why We Added package.json Support to Deno. https:\/\/deno.com\/blog\/package-json-support"},{"key":"e_1_3_2_1_37_1","unstructured":"Ryan Dahl Bert Belder and Bartek Iwa\u0144czuk. 2020. Deno 1.0. https:\/\/deno.com\/blog\/v1"},{"key":"e_1_3_2_1_38_1","unstructured":"Ryan Dahl and Alon Bonder. 2022. Big Changes Ahead for Deno. 
https:\/\/deno.com\/blog\/changes"},{"key":"e_1_3_2_1_39_1","volume-title":"Github copilot ai pair programmer: Asset or liability?Journal of Systems and Software 203","author":"Dakhel Arghavan\u00a0Moradi","year":"2023","unstructured":"Arghavan\u00a0Moradi Dakhel, Vahid Majdinasab, Amin Nikanjam, Foutse Khomh, Michel\u00a0C Desmarais, and Zhen Ming\u00a0Jack Jiang. 2023. Github copilot ai pair programmer: Asset or liability?Journal of Systems and Software 203 (2023), 111734."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-017-9589-y"},{"key":"e_1_3_2_1_41_1","unstructured":"[41] Ankur Desai and Atul Deo. 2022. https:\/\/aws.amazon.com\/blogs\/machine-learning\/introducing-amazon-codewhisperer-the-ml-powered-coding-companion\/"},{"key":"e_1_3_2_1_42_1","unstructured":"Drew DeVault. 2022. GitHub Copilot and open source laundering. https:\/\/drewdevault.com\/2022\/06\/23\/Copilot-GPL-washing.html"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE43902.2021.00093"},{"key":"e_1_3_2_1_44_1","unstructured":"Thomas Dohmke. 2023. 100 million developers and counting. https:\/\/github.blog\/2023-01-25-100-million-developers-and-counting\/"},{"volume-title":"The Go programming language","author":"Donovan AA","key":"e_1_3_2_1_45_1","unstructured":"Alan\u00a0AA Donovan and Brian\u00a0W Kernighan. 2015. The Go programming language. Addison-Wesley Professional."},{"key":"e_1_3_2_1_46_1","volume-title":"Big Tech Struggles to Turn AI Hype Into Profits. WSJ (Oct","author":"Dotan Tom","year":"2023","unstructured":"Tom Dotan and Deepa Seetharaman. 2023. Big Tech Struggles to Turn AI Hype Into Profits. WSJ (Oct 2023). https:\/\/www.wsj.com\/tech\/ai\/ais-costly-buildup-could-make-early-products-a-hard-sell-bdd29b9f"},{"key":"e_1_3_2_1_47_1","first-page":"671","article-title":"Microsoft as an Antitrust Target: IBM in Software","volume":"25","author":"Dratler\u00a0Jr Jay","year":"1995","unstructured":"Jay Dratler\u00a0Jr. 1995. 
Microsoft as an Antitrust Target: IBM in Software. Sw. UL REv. 25 (1995), 671.","journal-title":"Sw. UL REv."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10606-005-9000-1"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.2307\/j.ctvj4sxc6"},{"volume-title":"Working in public: the making and maintenance of open source software","author":"Eghbal Nadia","key":"e_1_3_2_1_50_1","unstructured":"Nadia Eghbal. 2020. Working in public: the making and maintenance of open source software. Stripe Press San Francisco."},{"key":"e_1_3_2_1_51_1","first-page":"62","article-title":"First nation in cyberspace","volume":"6","author":"Elmer-Dewitt Philip","year":"1993","unstructured":"Philip Elmer-Dewitt and D Jackson. 1993. First nation in cyberspace. Time 6 (1993), 62\u201364.","journal-title":"Time"},{"key":"e_1_3_2_1_52_1","volume-title":"Artificial Intelligence and Intellectual Property Law. In IMEC Workshop, Date: 2022\/06\/17-2022\/06\/17","author":"Emanuilov Ivo","year":"2022","unstructured":"Ivo Emanuilov. 2022. Artificial Intelligence and Intellectual Property Law. In IMEC Workshop, Date: 2022\/06\/17-2022\/06\/17, Location: Antwerp."},{"key":"e_1_3_2_1_53_1","unstructured":"Nat Friedman. 2022. Introducing GitHub Copilot: your AI pair programmer. https:\/\/github.blog\/2021-06-29-introducing-github-copilot-ai-pair-programmer\/"},{"key":"e_1_3_2_1_54_1","volume-title":"Frischmann","author":"Frischmann M","year":"2012","unstructured":"Brett\u00a0M Frischmann. 2012. Intellectual Infrastructure. B. Frischmann, Infrastructure: The Social Value of Shared Resources (2012), 253."},{"key":"e_1_3_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533241"},{"key":"e_1_3_2_1_56_1","volume-title":"The pile: An 800gb dataset of diverse text for language modeling. 
arXiv preprint arXiv:2101.00027","author":"Gao Leo","year":"2020","unstructured":"Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, 2020. The pile: An 800gb dataset of diverse text for language modeling. arXiv preprint arXiv:2101.00027 (2020)."},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3458723"},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3449249"},{"volume-title":"Case study research: Principles and practices","author":"Gerring John","key":"e_1_3_2_1_59_1","unstructured":"John Gerring. 2006. Case study research: Principles and practices. Cambridge University Press."},{"key":"e_1_3_2_1_60_1","unstructured":"GitHub. 2021. Telemetry terms - GitHub Docs. https:\/\/web.archive.org\/web\/20210704072124\/https:\/\/docs.github.com\/en\/github\/copilot\/telemetry-terms"},{"key":"e_1_3_2_1_61_1","unstructured":"GitHub. 2023. Code Security: Dependabot. https:\/\/docs.github.com\/en\/code-security\/dependabot\/dependabot-alerts\/about-dependabot-alerts"},{"key":"e_1_3_2_1_62_1","unstructured":"GitHub. 2024. Standing up for developers: youtube-dl is back. https:\/\/github.blog\/2020-11-16-standing-up-for-developers-youtube-dl-is-back\/."},{"key":"e_1_3_2_1_63_1","volume-title":"2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 662\u2013672","author":"Golzadeh Mehdi","year":"2022","unstructured":"Mehdi Golzadeh, Alexandre Decan, and Tom Mens. 2022. On the rise and fall of CI services in GitHub. In 2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER). IEEE, 662\u2013672."},{"key":"e_1_3_2_1_64_1","volume-title":"Evolution or Revolution. 
Notices of the AMS 56, 5","author":"Gr\u00e4tzer G","year":"2009","unstructured":"G Gr\u00e4tzer. 2009. What Is New in LATEX? II. TEX implementations, Evolution or Revolution. Notices of the AMS 56, 5 (2009)."},{"key":"e_1_3_2_1_65_1","first-page":"2799","article-title":"The Internet is a semicommons","volume":"78","author":"Grimmelmann James","year":"2009","unstructured":"James Grimmelmann. 2009. The Internet is a semicommons. Fordham L. Rev. 78 (2009), 2799.","journal-title":"Fordham L. Rev."},{"key":"e_1_3_2_1_66_1","first-page":"657","article-title":"Copyright for literate robots","volume":"101","author":"Grimmelmann James","year":"2015","unstructured":"James Grimmelmann. 2015. Copyright for literate robots. Iowa L. Rev. 101 (2015), 657.","journal-title":"Iowa L. Rev."},{"key":"e_1_3_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1108\/14779960380000235"},{"key":"e_1_3_2_1_68_1","unstructured":"The\u00a0VAR guy. 2016. Torvalds Talks about Early Linux History GPL License and Money. https:\/\/web.archive.org\/web\/20170324170531http:\/\/thevarguy.com\/open-source-application-software-companies\/torvalds-talks-about-early-linux-history-gpl-license-and-"},{"key":"e_1_3_2_1_69_1","volume-title":"Protocols of Control: Collaboration in Free and Open Source Software. Technologies of Labour and the Politics of Contradiction","author":"Handler Reinhard\u00a0Anton","year":"2018","unstructured":"Reinhard\u00a0Anton Handler. 2018. Protocols of Control: Collaboration in Free and Open Source Software. Technologies of Labour and the Politics of Contradiction (2018), 175\u2013192."},{"key":"e_1_3_2_1_70_1","volume-title":"Automating Dependency Updates in Practice: An Exploratory Study on GitHub Dependabot. arXiv preprint arXiv:2206.07230","author":"He Runzhi","year":"2022","unstructured":"Runzhi He, Hao He, Yuxia Zhang, and Minghui Zhou. 2022. Automating Dependency Updates in Practice: An Exploratory Study on GitHub Dependabot. 
arXiv preprint arXiv:2206.07230 (2022)."},{"key":"e_1_3_2_1_71_1","volume-title":"Lisa\u00a0Anne Hendricks","author":"Hoffmann Jordan","year":"2022","unstructured":"Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de\u00a0Las Casas, Lisa\u00a0Anne Hendricks, Johannes Welbl, Aidan Clark, 2022. Training compute-optimal large language models. arXiv preprint arXiv:2203.15556 (2022)."},{"key":"e_1_3_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/3287560.3287600"},{"key":"e_1_3_2_1_73_1","unstructured":"Github Inc.2021. Introducing GitHub Copilot: your AI pair programmer. https:\/\/github.blog\/2021-06-29-introducing-github-copilot-ai-pair-programmer\/"},{"key":"e_1_3_2_1_74_1","unstructured":"Github Inc.2022. GitHub Copilot is generally available to all developers. https:\/\/github.blog\/2022-06-21-github-copilot-is-generally-available-to-all-developers\/"},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3534637"},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491102.3501870"},{"key":"e_1_3_2_1_77_1","volume-title":"An empirical study of pre-trained model reuse in the hugging face deep learning model registry. arXiv preprint arXiv:2303.02552","author":"Jiang Wenxin","year":"2023","unstructured":"Wenxin Jiang, Nicholas Synovic, Matt Hyatt, Taylor\u00a0R Schorlemmer, Rohan Sethi, Yung-Hsiang Lu, George\u00a0K Thiruvathukal, and James\u00a0C Davis. 2023. An empirical study of pre-trained model reuse in the hugging face deep learning model registry. arXiv preprint arXiv:2303.02552 (2023)."},{"key":"e_1_3_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1525\/can.2005.20.2.185"},{"key":"e_1_3_2_1_79_1","volume-title":"Free software and open source: The freedom debate and its consequences. First Monday","author":"Klang Mathias","year":"2005","unstructured":"Mathias Klang. 2005. Free software and open source: The freedom debate and its consequences. 
First Monday (2005)."},{"key":"e_1_3_2_1_80_1","unstructured":"Azer Ko\u00e7ulu. 2016. I\u2019ve Just Liberated My Modules. web.archive.org\/web\/20180217094442 http:\/\/azer.bike\/journal\/i-ve-just-liberated-my-modules"},{"key":"e_1_3_2_1_81_1","volume-title":"Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 505\u2013517","author":"Lamba Hemank","year":"2020","unstructured":"Hemank Lamba, Asher Trockman, Daniel Armanios, Christian K\u00e4stner, Heather Miller, and Bogdan Vasilescu. 2020. Heard it through the Gitvine: an empirical study of tool diffusion across the npm ecosystem. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 505\u2013517."},{"volume-title":"Computer Systems: Theory, Technology, and Applications","author":"Lampson W","key":"e_1_3_2_1_82_1","unstructured":"Butler\u00a0W Lampson. 2004. Software components: Only the giants survive. In Computer Systems: Theory, Technology, and Applications. Springer, 137\u2013145."},{"key":"e_1_3_2_1_83_1","volume-title":"Teven Le\u00a0Scao, Leandro Von\u00a0Werra, Chenghao Mou","author":"Lauren\u00e7on Hugo","year":"2022","unstructured":"Hugo Lauren\u00e7on, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova\u00a0del Moral, Teven Le\u00a0Scao, Leandro Von\u00a0Werra, Chenghao Mou, Eduardo Gonz\u00e1lez\u00a0Ponferrada, Huu Nguyen, 2022. The bigscience roots corpus: A 1.6 tb composite multilingual dataset. Advances in Neural Information Processing Systems 35 (2022), 31809\u201331826."},{"key":"e_1_3_2_1_84_1","first-page":"21314","article-title":"Coderl: Mastering code generation through pretrained models and deep reinforcement learning","volume":"35","author":"Le Hung","year":"2022","unstructured":"Hung Le, Yue Wang, Akhilesh\u00a0Deepak Gotmare, Silvio Savarese, and Steven Chu\u00a0Hong Hoi. 2022. 
Coderl: Mastering code generation through pretrained models and deep reinforcement learning. Advances in Neural Information Processing Systems 35 (2022), 21314\u201321328.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_85_1","first-page":"459","article-title":"Terms of use","volume":"91","author":"Lemley A","year":"2006","unstructured":"Mark\u00a0A Lemley. 2006. Terms of use. Minn. L. Rev. 91 (2006), 459.","journal-title":"Minn. L. Rev."},{"key":"e_1_3_2_1_86_1","volume-title":"StarCoder: may the source be with you!arXiv preprint arXiv:2305.06161","author":"Li Raymond","year":"2023","unstructured":"Raymond Li, Loubna\u00a0Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, 2023. StarCoder: may the source be with you!arXiv preprint arXiv:2305.06161 (2023)."},{"key":"e_1_3_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1145\/3449093"},{"key":"e_1_3_2_1_88_1","unstructured":"Yichen Li Yintong Huo Zhihan Jiang Renyi Zhong Pinjia He Yuxin Su and Michael\u00a0R. Lyu. 2023. Exploring the Effectiveness of LLMs in Automated Logging Generation: An Empirical Study. arxiv:2307.05950\u00a0[cs.SE]"},{"key":"e_1_3_2_1_89_1","unstructured":"Rita Liao and Manish Singh. 2019. GitHub confirms it has blocked developers in Iran Syria and Crimea. https:\/\/techcrunch.com\/2019\/07\/29\/github-ban-sanctioned-countries\/"},{"key":"e_1_3_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1145\/3555051.3555067"},{"key":"e_1_3_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1145\/3555051.3555066"},{"key":"e_1_3_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533086"},{"key":"e_1_3_2_1_93_1","doi-asserted-by":"publisher","DOI":"10.1177\/20539517211047734"},{"key":"e_1_3_2_1_94_1","unstructured":"Bradley M.\u00a0Kuhn. 2022. 
If Software is My Copilot Who Programmed My Software? https:\/\/sfconservancy.org\/blog\/2022\/feb\/03\/github-copilot-copyleft-gpl\/"},{"key":"e_1_3_2_1_95_1","unstructured":"James Maguire. 2007. The SourceForge Story - Datamation. web.archive.org\/web\/20110804024950 http:\/\/itmanagement.earthweb.com\/cnews\/article.php\/3705731"},{"key":"e_1_3_2_1_96_1","volume-title":"2nd Workshop on Research with Security Vulnerability Databases","author":"Mann E","year":"1999","unstructured":"David\u00a0E Mann and Steven\u00a0M Christey. 1999. Towards a common enumeration of vulnerabilities. In 2nd Workshop on Research with Security Vulnerability Databases, Purdue University, West Lafayette, Indiana."},{"key":"e_1_3_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1145\/2441776.2441794"},{"key":"e_1_3_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.1145\/2441776.2441792"},{"key":"e_1_3_2_1_99_1","doi-asserted-by":"crossref","unstructured":"Niko Matsakis. 2014. Semantic versioning for the language. RFC 1122. https:\/\/rust-lang.github.io\/rfcs\/1122-language-semver.html","DOI":"10.1145\/2663171.2663188"},{"key":"e_1_3_2_1_100_1","unstructured":"Mike McDonald. 2021. Goodbye Dependabot Preview hello Dependabot. https:\/\/github.blog\/2021-04-29-goodbye-dependabot-preview-hello-dependabot\/"},{"key":"e_1_3_2_1_101_1","unstructured":"Rachel Metz. [n. d.]. AI Startup Hugging Face Valued at 4.5 Billion After Raising Funding From Google Nvidia. Bloomberg ([n. d.]). https:\/\/www.bloomberg.com\/news\/articles\/2023-08-24\/ai-startup-hugging-face-valued-at-4-5-billion-after-fundraising"},{"key":"e_1_3_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1109\/MITP.2010.147"},{"key":"e_1_3_2_1_103_1","doi-asserted-by":"publisher","DOI":"10.5210\/fm.v4i8.684"},{"key":"e_1_3_2_1_104_1","unstructured":"Glyn Moody. 2009. Rebel code: Linux and the open source revolution. 
Hachette UK."},{"key":"e_1_3_2_1_105_1","volume-title":"Essence of distributed work. Online communication and collaboration: A reader 125","author":"Moon J","year":"2010","unstructured":"J Moon and Lee Sproull. 2010. Essence of distributed work. Online communication and collaboration: A reader 125 (2010)."},{"key":"e_1_3_2_1_106_1","unstructured":"Alex Mullans. 2021. Keep all your packages up to date with Dependabot. https:\/\/github.blog\/2020-06-01-keep-all-your-packages-up-to-date-with-dependabot\/"},{"key":"e_1_3_2_1_107_1","unstructured":"Ian Murdock. 2007. How package management changed everything. https:\/\/web.archive.org\/web\/20090223072201http:\/\/ianmurdock.com\/2007\/07\/21\/how-package-management-changed-everything\/"},{"key":"e_1_3_2_1_108_1","unstructured":"National Institute\u00a0of Standards and Technology. 2023. National Vulnerability Database. https:\/\/nvd.nist.gov\/"},{"key":"e_1_3_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1093\/oxfordhb"},{"key":"e_1_3_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1145\/3548606.3560596"},{"key":"e_1_3_2_1_111_1","volume-title":"CodeGen2: Lessons for Training LLMs on Programming and Natural Languages. ICLR","author":"Nijkamp Erik","year":"2023","unstructured":"Erik Nijkamp, Hiroaki Hayashi, Caiming Xiong, Silvio Savarese, and Yingbo Zhou. 2023. CodeGen2: Lessons for Training LLMs on Programming and Natural Languages. ICLR (2023)."},{"key":"e_1_3_2_1_112_1","volume-title":"CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis. ICLR","author":"Nijkamp Erik","year":"2023","unstructured":"Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, and Caiming Xiong. 2023. CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis. 
ICLR (2023)."},{"key":"e_1_3_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02639315"},{"key":"e_1_3_2_1_115_1","volume-title":"Accelerating package expansion in Rust through development of a semantic versioning tool. arXiv preprint arXiv:2308.14623","author":"Nowak Tomasz","year":"2023","unstructured":"Tomasz Nowak, Micha\u0142 Staniewski, Mieszko Grodzicki, and Bartosz Smolarczyk. 2023. Accelerating package expansion in Rust through development of a semantic versioning tool. arXiv preprint arXiv:2308.14623 (2023)."},{"key":"e_1_3_2_1_117_1","volume-title":"Tragedy of the commons. The new palgrave dictionary of economics 2","author":"Ostrom Elinor","year":"2008","unstructured":"Elinor Ostrom. 2008. Tragedy of the commons. The new palgrave dictionary of economics 2 (2008), 1\u20134."},{"key":"e_1_3_2_1_118_1","volume-title":"American Affairs","author":"Pasquale A","year":"2018","unstructured":"Frank\u00a0A Pasquale. 2018. Tech platforms and the knowledge problem. American Affairs, Summer (2018)."},{"key":"e_1_3_2_1_119_1","volume-title":"Christian Fufezan, Tobias Ternent, Stephen\u00a0J Eglen","author":"Perez-Riverol Yasset","year":"2016","unstructured":"Yasset Perez-Riverol, Laurent Gatto, Rui Wang, Timo Sachsenberg, Julian Uszkoreit, Felipe da\u00a0Veiga Leprevost, Christian Fufezan, Tobias Ternent, Stephen\u00a0J Eglen, Daniel\u00a0S Katz, 2016. Ten simple rules for taking advantage of Git and GitHub., e1004947\u00a0pages."},{"key":"e_1_3_2_1_120_1","unstructured":"Billy Perrigo. 2023. OpenAI Used Kenyan Workers on Less Than $2 Per Hour: Exclusive | Time. 
https:\/\/time.com\/6247678\/openai-chatgpt-kenya-workers\/."},{"key":"e_1_3_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.5555\/3575618.3575622"},{"key":"e_1_3_2_1_122_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377816.3381732"},{"key":"e_1_3_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12130-999-1026-0"},{"key":"e_1_3_2_1_124_1","doi-asserted-by":"publisher","DOI":"10.1093\/oxfordjournals.jpart.a024296"},{"key":"e_1_3_2_1_125_1","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress"},{"key":"e_1_3_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1145\/1806596.1806598"},{"key":"e_1_3_2_1_127_1","unstructured":"Emma Roth. 2023. Microsoft GitHub and OpenAI ask court to throw out AI copyright lawsuit. https:\/\/www.theverge.com\/2023\/1\/28\/23575919\/microsoft-openai-github-dismiss-copilot-ai-copyright-lawsuit"},{"key":"e_1_3_2_1_128_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411764.3445518"},{"key":"e_1_3_2_1_129_1","doi-asserted-by":"publisher","DOI":"10.1145\/3597151"},{"key":"e_1_3_2_1_130_1","volume-title":"Intermediate Perl: Beyond The Basics of Learning Perl. \" O\u2019Reilly Media","author":"Schwartz L","year":"2012","unstructured":"Randal\u00a0L Schwartz, Tom Phoenix, 2012. Intermediate Perl: Beyond The Basics of Learning Perl. \" O\u2019Reilly Media, Inc.\"."},{"volume-title":"Internet success: a study of open-source software commons","author":"Schweik M","key":"e_1_3_2_1_131_1","unstructured":"Charles\u00a0M Schweik and Robert\u00a0C English. 2012. Internet success: a study of open-source software commons. MIT Press."},{"key":"e_1_3_2_1_132_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2012.57"},{"key":"e_1_3_2_1_133_1","volume-title":"Dev corrupts NPM libs \u2019colors","author":"Sharma Ax","year":"2022","unstructured":"Ax Sharma. 2022. Dev corrupts NPM libs \u2019colors\u2019 and \u2019faker\u2019 breaking thousands of apps. BleepingComputer (Jan 2022). 
https:\/\/www.bleepingcomputer.com\/news\/security\/dev-corrupts-npm-libs-colors-and-faker-breaking-thousands-of-apps\/"},{"key":"e_1_3_2_1_134_1","volume-title":"The Curse of Recursion: Training on Generated Data Makes Models Forget. arXiv preprint arxiv:2305.17493","author":"Shumailov Ilia","year":"2023","unstructured":"Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Yarin Gal, Nicolas Papernot, and Ross Anderson. 2023. The Curse of Recursion: Training on Generated Data Makes Models Forget. arXiv preprint arxiv:2305.17493 (2023)."},{"key":"e_1_3_2_1_135_1","volume-title":"Will Rust Solve Software Security?","author":"Sible Joseph","year":"2023","unstructured":"Joseph Sible, David Svoboda, and Garret Wassermann. 2023. Will Rust Solve Software Security? (2023)."},{"key":"e_1_3_2_1_136_1","unstructured":"Sid Sijbrandij. 2020. Upcoming changes to CI\/CD Minutes for free tier users on GitLab.com. about.gitlab.com\/blog\/2020\/09\/01\/ci-minutes-update-free-users"},{"key":"e_1_3_2_1_137_1","volume-title":"Property and property rules. NYUL rev. 79","author":"Smith E","year":"2004","unstructured":"Henry\u00a0E Smith. 2004. Property and property rules. NYUL rev. 79 (2004), 1719."},{"key":"e_1_3_2_1_138_1","doi-asserted-by":"publisher","DOI":"10.1145\/3593013.3593981"},{"key":"e_1_3_2_1_139_1","doi-asserted-by":"publisher","DOI":"10.1145\/3468264.3468589"},{"key":"e_1_3_2_1_140_1","unstructured":"Nick Srnicek. 2016. Platform Capitalism. John Wiley & Sons."},{"key":"e_1_3_2_1_141_1","doi-asserted-by":"publisher","DOI":"10.1145\/1516046.1516058"},{"key":"e_1_3_2_1_142_1","unstructured":"Rust Team. 2015. Crater. https:\/\/rustc-dev-guide.rust-lang.org\/tests\/crater.html Additional reference: https:\/\/github.com\/rust-lang\/crater."},{"key":"e_1_3_2_1_143_1","unstructured":"Rust Team. 2015. Crater. 
https:\/\/rustc-dev-guide.rust-lang.org\/licenses.html"},{"key":"e_1_3_2_1_144_1","doi-asserted-by":"publisher","DOI":"10.1177\/0263775816633195"},{"key":"e_1_3_2_1_145_1","unstructured":"Michael Tiemann. 2006. History of the OSI. web.archive.org\/web\/20090116020539\/https:\/\/opensource.org\/history Accessed: 2023-01-21."},{"key":"e_1_3_2_1_146_1","unstructured":"Linus Torvalds. 1991. LINUX\u2013a free unix-386 kernel."},{"key":"e_1_3_2_1_147_1","volume-title":"Tech Talk: Linus Torvalds on git. https:\/\/www.youtube.com\/watch?v=4xpnkhjaok8","author":"Torvalds Linus","year":"2007","unstructured":"Linus Torvalds. 2007. Tech Talk: Linus Torvalds on git. https:\/\/www.youtube.com\/watch?v=4xpnkhjaok8"},{"volume-title":"CPAN","author":"Tregar Sam","key":"e_1_3_2_1_148_1","unstructured":"Sam Tregar. 2002. CPAN. In Writing Perl Modules for CPAN. Springer, 1\u201320."},{"key":"e_1_3_2_1_149_1","unstructured":"Aaron Turon and Niko Matsakis. 2014. Stability as a Deliverable. https:\/\/blog.rust-lang.org\/2014\/10\/30\/Stability.html"},{"key":"e_1_3_2_1_150_1","doi-asserted-by":"publisher","DOI":"10.1145\/3491101.3519665"},{"key":"e_1_3_2_1_151_1","doi-asserted-by":"publisher","DOI":"10.1145\/3236024.3236062"},{"key":"e_1_3_2_1_152_1","unstructured":"James Vincent. 2022. The lawsuit that could rewrite the rules of AI copyright. https:\/\/www.theverge.com\/2022\/11\/8\/23446821\/microsoft-openai-github-copilot-class-action-lawsuit-ai-copyright-violation-training-data"},{"key":"e_1_3_2_1_153_1","unstructured":"Laurie Voss. 2014. npm Blog Archive: npm and front-end packaging. https:\/\/blog.npmjs.org\/post\/101775448305\/npm-and-front-end-packaging."},{"volume-title":"Typosquatting and combosquatting attacks on the python ecosystem. 
In 2020 ieee european symposium on security and privacy workshops (euros&pw)","author":"Vu Duc-Ly","key":"e_1_3_2_1_154_1","unstructured":"Duc-Ly Vu, Ivan Pashchenko, Fabio Massacci, Henrik Plate, and Antonino Sabetta. 2020. Typosquatting and combosquatting attacks on the python ecosystem. In 2020 ieee european symposium on security and privacy workshops (euros&pw). IEEE, 509\u2013514."},{"key":"e_1_3_2_1_155_1","unstructured":"Jason Warner. 2019. Thank you for 100 million repositories."},{"key":"e_1_3_2_1_156_1","volume-title":"Why Microsoft is willing to pay so much for GitHub. Harvard Business Review 6","author":"Weinstein V","year":"2018","unstructured":"Paul\u00a0V Weinstein. 2018. Why Microsoft is willing to pay so much for GitHub. Harvard Business Review 6 (2018)."},{"key":"e_1_3_2_1_157_1","doi-asserted-by":"publisher","DOI":"10.1145\/3545945.3569830"},{"key":"e_1_3_2_1_158_1","doi-asserted-by":"publisher","DOI":"10.1145\/3274451"},{"key":"e_1_3_2_1_159_1","doi-asserted-by":"publisher","DOI":"10.1145\/3488666"},{"key":"e_1_3_2_1_160_1","doi-asserted-by":"publisher","DOI":"10.1177\/20539517231177620"},{"key":"e_1_3_2_1_161_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533779"},{"key":"e_1_3_2_1_162_1","doi-asserted-by":"publisher","unstructured":"David\u00a0Gray Widder Sarah West and Meredith Whittaker. 2023. Open (For Business): Big Tech Concentrated Power and the Political Economy of Open AI. https:\/\/doi.org\/10.2139\/ssrn.4543807","DOI":"10.2139\/ssrn.4543807"},{"key":"e_1_3_2_1_163_1","doi-asserted-by":"publisher","DOI":"10.1145\/3593013.3594012"},{"key":"e_1_3_2_1_164_1","volume-title":"How one developer just broke Node, Babel and thousands of projects in 11 lines of JavaScript. The Register 172 (23","author":"Williams Chris","year":"2016","unstructured":"Chris Williams. 2016. How one developer just broke Node, Babel and thousands of projects in 11 lines of JavaScript. 
The Register 172 (23 Mar 2016)."},{"volume-title":"Do artifacts have politics? In Computer Ethics","author":"Winner Langdon","key":"e_1_3_2_1_165_1","unstructured":"Langdon Winner. 2017. Do artifacts have politics? In Computer Ethics. Routledge, 177\u2013192."},{"key":"e_1_3_2_1_166_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.348001"},{"key":"e_1_3_2_1_167_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.928619"},{"key":"e_1_3_2_1_168_1","doi-asserted-by":"publisher","DOI":"10.1145\/2556420.2556483"},{"key":"e_1_3_2_1_169_1","volume-title":"2021 IEEE\/ACM Third International Workshop on Bots in Software Engineering (BotSE). IEEE, 6\u201310","author":"Wyrich Marvin","year":"2021","unstructured":"Marvin Wyrich, Raoul Ghit, Tobias Haller, and Christian M\u00fcller. 2021. Bots don\u2019t mind waiting, do they? Comparing the interaction with automatically and manually created pull requests. In 2021 IEEE\/ACM Third International Workshop on Bots in Software Engineering (BotSE). IEEE, 6\u201310."},{"key":"e_1_3_2_1_170_1","unstructured":"Robert\u00a0K Yin. 2011. Applications of case study research. sage."},{"key":"e_1_3_2_1_171_1","unstructured":"Andrew Zhou and Nicolas Ouporov. 2023. Building a RAG Pipeline for the Entire Python Ecosystem. https:\/\/fleet.so\/blog\/library-rag."},{"key":"e_1_3_2_1_172_1","unstructured":"Albert Ziegler. 2022. GitHub Copilot research recitation. 
https:\/\/github.blog\/2021-06-30-github-copilot-research-recitation\/"},{"key":"e_1_3_2_1_173_1","doi-asserted-by":"publisher","DOI":"10.1145\/3520312.3534864"}],"event":{"name":"FAccT '24: The 2024 ACM Conference on Fairness, Accountability, and Transparency","acronym":"FAccT '24","location":"Rio de Janeiro Brazil"},"container-title":["The 2024 ACM Conference on Fairness, Accountability, and Transparency"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3630106.3659019","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3630106.3659019","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T23:57:07Z","timestamp":1750291027000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3630106.3659019"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,3]]},"references-count":171,"alternative-id":["10.1145\/3630106.3659019","10.1145\/3630106"],"URL":"https:\/\/doi.org\/10.1145\/3630106.3659019","relation":{},"subject":[],"published":{"date-parts":[[2024,6,3]]},"assertion":[{"value":"2024-06-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}