{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T01:54:25Z","timestamp":1772762065841,"version":"3.50.1"},"reference-count":27,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2022,1,1]],"date-time":"2022-01-01T00:00:00Z","timestamp":1640995200000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"The Center for Equity, Gender, and Leadership at Berkeley Haas School of Business"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Big Data &amp; Society"],"published-print":{"date-parts":[[2022,1]]},"abstract":"<jats:p> As natural language processing tools powered by big data become increasingly ubiquitous, questions of how to design, develop, and manage these tools and their impacts on diverse populations are pressing. We propose utilizing the concept of linguistic justice\u2014the realization of equitable access to social and political life regardless of language\u2014to provide a framework for examining natural language processing tools that learn from and use human language data. To support linguistic justice, we argue that natural language processing tools (along with the datasets that are used to train and evaluate them) must be examined not only from the perspective of a privileged, majority language user, but also from the perspectives of minoritized language users. Considering such perspectives can help to surface areas in which the data used within natural language processing tools may be (often inadvertently) working against linguistic justice by failing to provide access to information, services, or opportunities in users\u2019 language of choice, underperforming for certain linguistic groups, or advancing harmful stereotypes that can lead to negative life outcomes for members of marginalized groups. At the same time, this framework can help to illuminate ways that these shortcomings can be addressed and allow us to use inclusive language data and approaches to leverage natural language processing technologies that advance linguistic justice. <\/jats:p>","DOI":"10.1177\/20539517221090930","type":"journal-article","created":{"date-parts":[[2022,4,26]],"date-time":"2022-04-26T07:22:19Z","timestamp":1650957739000},"update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":21,"title":["Linguistic justice as a framework for designing, developing, and managing natural language processing tools"],"prefix":"10.1177","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0177-3155","authenticated-orcid":false,"given":"Julia","family":"Nee","sequence":"first","affiliation":[{"name":"University of California, Berkeley, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0615-6378","authenticated-orcid":false,"given":"Genevieve Macfarlane","family":"Smith","sequence":"additional","affiliation":[{"name":"University of California, Berkeley, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9630-7143","authenticated-orcid":false,"given":"Alicia","family":"Sheares","sequence":"additional","affiliation":[{"name":"University of California, Berkeley, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7130-0321","authenticated-orcid":false,"given":"Ishita","family":"Rustagi","sequence":"additional","affiliation":[{"name":"University of California, Berkeley, USA"}]}],"member":"179","published-online":{"date-parts":[[2022,4,26]]},"reference":[{"key":"bibr1-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1080\/15348450701341378"},{"key":"bibr2-20539517221090930","doi-asserted-by":"publisher","DOI":"10.4324\/9781315147383"},{"key":"bibr3-20539517221090930","doi-asserted-by":"publisher","DOI":"10.17265\/2159-5313\/2016.09.003"},{"key":"bibr4-20539517221090930","first-page":"155","volume-title":"Black Linguistics: Language, Society, and Politics in Africa and the Americas","author":"Baugh J","year":"2003"},{"key":"bibr5-20539517221090930","doi-asserted-by":"crossref","unstructured":"Bender E, Gebru T, McMillan-Major A, et al. (2021) On the dangers of Stochastic Parrots: can language models be too big? In: FAccT \u201921, Virtual Event, 2021, pp. 610\u2013623.","DOI":"10.1145\/3442188.3445922"},{"key":"bibr6-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1215\/02705346-3592499"},{"key":"bibr600-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1080\/08838151.2012.732147"},{"key":"bibr7-20539517221090930","volume-title":"Critical Race Theory: The Key Writings That Formed the Movement","author":"Crenshaw K","year":"1995"},{"key":"bibr8-20539517221090930","unstructured":"Dastin J (2018) Amazon scraps secret AI recruiting tool that showed bias against women. Reuters, 10 October. Available at: https:\/\/www.reuters.com\/article\/us-amazon-com-jobs-automation-insight\/amazon-scraps-secret-ai-recruiting-tool-that-showed-bias-against-women-idUSKCN1MK08G."},{"key":"bibr9-20539517221090930","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-3504"},{"key":"bibr10-20539517221090930","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/11805.001.0001"},{"key":"bibr11-20539517221090930","unstructured":"Gazzola M, Wickstr\u00f6m B-A, Fettes M (2021) Towards an index of linguistic justice. Working Paper. Ulster University. Available at: https:\/\/www.ulster.ac.uk\/__data\/assets\/pdf_file\/0011\/677306\/REAL20-1.pdf (accessed 23 July 2021)."},{"key":"bibr12-20539517221090930","doi-asserted-by":"publisher","DOI":"10.3998\/mpub.10257"},{"key":"bibr13-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1002\/9781444304732"},{"key":"bibr14-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1108\/02683941011019339"},{"key":"bibr15-20539517221090930","first-page":"249","volume-title":"Diversity in the Workforce: Current Issues and Emerging Trends","author":"Hughes C","year":"2013"},{"key":"bibr16-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1353\/lan.2019.0042"},{"key":"bibr17-20539517221090930","doi-asserted-by":"crossref","unstructured":"Joshi P, Santy S, Budhiraja A, et al. (2021) The state and fate of linguistic diversity and inclusion in the NLP world. In: Proceedings of the 58th Annual Meeting of the ACL., Online, 2021, pp. 6282\u20136293.","DOI":"10.18653\/v1\/2020.acl-main.560"},{"key":"bibr18-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1146\/annurev-linguistics-011619-030556"},{"key":"bibr19-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1915768117"},{"key":"bibr20-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1177\/0261927X99018001002"},{"key":"bibr21-20539517221090930","doi-asserted-by":"publisher","DOI":"10.1145\/3351095.3372828"},{"key":"bibr22-20539517221090930","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-1606"},{"issue":"1","key":"bibr23-20539517221090930","first-page":"257","volume":"28","author":"Tatsch S","year":"2004","journal-title":"Collegium Antropologicum"},{"key":"bibr24-20539517221090930","unstructured":"Toxic Twitter - The Psychological Harms of Violence and Abuse Against Women Online (n.d.) Amnesty International. Available at: https:\/\/www.amnesty.org\/en\/latest\/research\/2018\/03\/online-violence-against-women-chapter-6\/#topanchor."},{"key":"bibr25-20539517221090930","unstructured":"W3Techs Web Technology Service (n.d.) Usage statistics of content languages for websites. Available at: https:\/\/w3techs.com\/technologies\/overview\/content_language."},{"key":"bibr26-20539517221090930","unstructured":"Zielinski D (2020) Addressing artificial intelligence-based hiring concerns. SHRM, 22 May. Available at: https:\/\/www.shrm.org\/hr-today\/news\/hr-magazine\/summer2020\/pages\/artificial-intelligence-based-hiring-concerns.aspx."}],"container-title":["Big Data &amp; Society"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/20539517221090930","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/20539517221090930","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/20539517221090930","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,28]],"date-time":"2025-02-28T22:19:21Z","timestamp":1740781161000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/20539517221090930"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,1]]},"references-count":27,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,1]]}},"alternative-id":["10.1177\/20539517221090930"],"URL":"https:\/\/doi.org\/10.1177\/20539517221090930","relation":{},"ISSN":["2053-9517","2053-9517"],"issn-type":[{"value":"2053-9517","type":"print"},{"value":"2053-9517","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1]]},"article-number":"20539517221090930"}}