{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,26]],"date-time":"2026-06-26T02:36:18Z","timestamp":1782441378920,"version":"3.54.5"},"reference-count":196,"publisher":"Springer Science and Business Media LLC","issue":"11","license":[{"start":{"date-parts":[[2025,4,3]],"date-time":"2025-04-03T00:00:00Z","timestamp":1743638400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,4,3]],"date-time":"2025-04-03T00:00:00Z","timestamp":1743638400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Front. Comput. Sci."],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Based on the foundation of Large Language Models (LLMs), Multilingual LLMs (MLLMs) have been developed to address the challenges faced in multilingual natural language processing, hoping to achieve knowledge transfer from high-resource languages to low-resource languages. However, significant limitations and challenges still exist, such as language imbalance, multilingual alignment, and inherent bias. In this paper, we aim to provide a comprehensive analysis of MLLMs, delving deeply into discussions surrounding these critical issues. First of all, we start by presenting an overview of MLLMs, covering their evolutions, key techniques, and multilingual capacities. Secondly, we explore the multilingual training corpora of MLLMs and the multilingual datasets oriented for downstream tasks that are crucial to enhance the cross-lingual capability of MLLMs. Thirdly, we survey the state-of-the-art studies of multilingual representations and investigate whether the current MLLMs can learn a universal language representation. 
Fourthly, we discuss bias on MLLMs, including its categories, evaluation metrics, and debiasing techniques. Finally, we discuss existing challenges and point out promising research directions of MLLMs.<\/jats:p>","DOI":"10.1007\/s11704-024-40579-4","type":"journal-article","created":{"date-parts":[[2025,4,5]],"date-time":"2025-04-05T04:16:24Z","timestamp":1743826584000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":49,"title":["A survey on multilingual large language models: corpora, alignment, and bias"],"prefix":"10.1007","volume":"19","author":[{"given":"Yuemei","family":"Xu","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ling","family":"Hu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jiayi","family":"Zhao","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zihan","family":"Qiu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kexin","family":"Xu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuqi","family":"Ye","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hanwen","family":"Gu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,4,3]]},"reference":[{"key":"40579_CR1","first-page":"6000","volume-title":"Proceedings of the 31st International Conference on Neural Information Processing Systems","author":"A Vaswani","year":"2017","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez A N, Kaiser \u0141, Polosukhin I. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 
2017, 6000\u20136010"},{"key":"40579_CR2","first-page":"4171","volume-title":"Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics","author":"J Devlin","year":"2019","unstructured":"Devlin J, Chang M W, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics. 2019, 4171\u20134186"},{"key":"40579_CR3","first-page":"634","volume-title":"Proceedings of the 33rd International Conference on Neural Information Processing Systems","author":"A Conneau","year":"2019","unstructured":"Conneau A, Lample G. Cross-lingual language model pretraining. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems. 2019, 634"},{"key":"40579_CR4","first-page":"483","volume-title":"Proceedings of 2021 Conference of the North American Chapter of the Association for Computational Linguistics","author":"L Xue","year":"2021","unstructured":"Xue L, Constant N, Roberts A, Kale M, Al-Rfou R, Siddhant A, Barua A, Raffel C. mT5: A massively multilingual pre-trained text-to-text transformer. In: Proceedings of 2021 Conference of the North American Chapter of the Association for Computational Linguistics. 2021, 483\u2013498"},{"key":"40579_CR5","unstructured":"Le Scao T, Fan A, Akiki C, Pavlick E, Ili\u0107 S et al. BLOOM: A 176B-parameter open-access multilingual language model. 2022, arXiv preprint arXiv: 2211.05100"},{"key":"40579_CR6","unstructured":"Touvron H, Lavril T, Izacard G, Martinet X, Lachaux M A, Lacroix T, Rozi\u00e8re B, Goyal N, Hambro E, Azhar F, Rodriguez A, Joulin A, Grave E, Lample G. LLaMA: open and efficient foundation language models. 
2023, arXiv preprint arXiv: 2302.13971"},{"key":"40579_CR7","doi-asserted-by":"publisher","first-page":"8440","DOI":"10.18653\/v1\/2020.acl-main.747","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"A Conneau","year":"2020","unstructured":"Conneau A, Khandelwal K, Goyal N, Chaudhary V, Wenzek G, Guzm\u00e1n F, Grave \u00c9, Ott M, Zettlemoyer L, Stoyanov V. Unsupervised cross-lingual representation learning at scale. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 8440\u20138451"},{"key":"40579_CR8","volume-title":"Proceedings of the 8th International Conference on Learning Representations","author":"S Cao","year":"2020","unstructured":"Cao S, Kitaev N, Klein D. Multilingual alignment of contextual word representations. In: Proceedings of the 8th International Conference on Learning Representations. 2020"},{"key":"40579_CR9","first-page":"3111","volume-title":"Proceedings of the 26th International Conference on Neural Information Processing Systems","author":"T Mikolov","year":"2013","unstructured":"Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013, 3111\u20133119"},{"key":"40579_CR10","first-page":"1532","volume-title":"Proceedings of 2014 Conference on Empirical Methods in Natural Language Processing","author":"J Pennington","year":"2014","unstructured":"Pennington J, Socher R, Manning C. GloVe: Global vectors for word representation. In: Proceedings of 2014 Conference on Empirical Methods in Natural Language Processing. 
2014, 1532\u20131543"},{"key":"40579_CR11","doi-asserted-by":"publisher","first-page":"610","DOI":"10.1145\/3442188.3445922","volume-title":"Proceedings of 2021 ACM Conference on Fairness, Accountability, and Transparency","author":"E M Bender","year":"2021","unstructured":"Bender E M, Gebru T, McMillan-Major A, Shmitchell S. On the dangers of stochastic parrots: Can language models be too big? In: Proceedings of 2021 ACM Conference on Fairness, Accountability, and Transparency. 2021, 610\u2013623"},{"key":"40579_CR12","doi-asserted-by":"publisher","first-page":"26","DOI":"10.18653\/v1\/2022.bigscience-1.3","volume-title":"Proceedings of BigScience Episode #5\u2013Workshop on Challenges & Perspectives in Creating Large Language Models","author":"Z Talat","year":"2022","unstructured":"Talat Z, N\u00e9v\u00e9ol A, Biderman S, Clinciu M, Dey M, Longpre S, Luccioni S, Masoud M, Mitchell M, Radev D, Sharma S, Subramonian A, Tae J, Tan S, Tunuguntla D, Van Der Wal O. You reap what you sow: On the challenges of bias evaluation under multilingual settings. In: Proceedings of BigScience Episode #5\u2013Workshop on Challenges & Perspectives in Creating Large Language Models. 2022, 26\u201341"},{"key":"40579_CR13","doi-asserted-by":"publisher","first-page":"5491","DOI":"10.18653\/v1\/2020.acl-main.487","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"B Hutchinson","year":"2020","unstructured":"Hutchinson B, Prabhakaran V, Denton E, Webster K, Zhong Y, Denuyl S. Social biases in NLP models as barriers for persons with disabilities. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 
2020, 5491\u20135501"},{"key":"40579_CR14","first-page":"5356","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing","author":"M Nadeem","year":"2021","unstructured":"Nadeem M, Bethke A, Reddy S. StereoSet: measuring stereotypical bias in pretrained language models. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. 2021, 5356\u20135371"},{"key":"40579_CR15","first-page":"2479","volume-title":"Proceedings of the 12th Language Resources and Evaluation Conference","author":"H Le","year":"2020","unstructured":"Le H, Vial L, Frej J, Segonne V, Coavoux M, Lecouteux B, Allauzen A, Crabb\u00e9 B, Besacier L, Schwab D. FlauBERT: unsupervised language model pre-training for French. In: Proceedings of the 12th Language Resources and Evaluation Conference. 2020, 2479\u20132490"},{"key":"40579_CR16","unstructured":"De Vries W, Van Cranenburgh A, Bisazza A, Caselli T, Van Noord G, Nissim M. BERTje: A Dutch BERT model. 2019, arXiv preprint arXiv: 1912.09582"},{"key":"40579_CR17","first-page":"9","volume-title":"Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection","author":"W Antoun","year":"2020","unstructured":"Antoun W, Baly F, Hajj H. AraBERT: Transformer-based model for Arabic language understanding. In: Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection. 2020, 9\u201315"},{"key":"40579_CR18","volume-title":"OpenAI Blog","author":"A Radford","year":"2018","unstructured":"Radford A, Narasimhan K, Salimans T, Sutskever I. Improving language understanding by generative pre-training. 
OpenAI Blog, 2018"},{"issue":"8","key":"40579_CR19","first-page":"9","volume":"1","author":"A Radford","year":"2019","unstructured":"Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I. Language models are unsupervised multitask learners. OpenAI Blog, 2019, 1(8): 9","journal-title":"OpenAI Blog"},{"key":"40579_CR20","first-page":"159","volume-title":"Proceedings of the 34th International Conference on Neural Information Processing Systems","author":"T B Brown","year":"2020","unstructured":"Brown T B, Mann B, Ryder N, Subbiah M, Kaplan J. et al. Language models are few-shot learners. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 159"},{"key":"40579_CR21","first-page":"2011","volume-title":"Proceedings of the 36th International Conference on Neural Information Processing Systems","author":"L Ouyang","year":"2022","unstructured":"Ouyang L, Wu J, Jiang X, Almeida D, Wainwright C L et al. Training language models to follow instructions with human feedback. In: Proceedings of the 36th International Conference on Neural Information Processing Systems. 2022, 2011"},{"key":"40579_CR22","unstructured":"Achiam J, Adler S, Agarwal S, Ahmad L, Akkaya I et al. Gpt-4 technical report. 2023, arXiv preprint arXiv: 2303.08774"},{"issue":"1","key":"40579_CR23","first-page":"140","volume":"21","author":"C Raffel","year":"2020","unstructured":"Raffel C, Shazeer N, Roberts A, Lee K, Narang S, Matena M, Zhou Y, Li W, Liu P J. Exploring the limits of transfer learning with a unified text-to-text transformer. 
The Journal of Machine Learning Research, 2020, 21(1): 140","journal-title":"The Journal of Machine Learning Research"},{"key":"40579_CR24","doi-asserted-by":"publisher","first-page":"7871","DOI":"10.18653\/v1\/2020.acl-main.703","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"M Lewis","year":"2020","unstructured":"Lewis M, Liu Y, Goyal N, Ghazvininejad M, Mohamed A, Levy O, Stoyanov V, Zettlemoyer L. BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 7871\u20137880"},{"key":"40579_CR25","first-page":"296","volume-title":"Proceedings of the 8th International Joint Conference on Natural Language Processing","author":"T Q Nguyen","year":"2017","unstructured":"Nguyen T Q, Chiang D. Transfer learning across low-resource, related languages for neural machine translation. In: Proceedings of the 8th International Joint Conference on Natural Language Processing. 2017, 296\u2013301"},{"key":"40579_CR26","first-page":"726","volume-title":"Proceedings of Transactions of the Association for Computational Linguistics","author":"Y Liu","year":"2020","unstructured":"Liu Y, Gu J, Goyal N, Li X, Edunov S, Ghazvininejad M, Lewis M, Zettlemoyer L. Multilingual denoising pre-training for neural machine translation. In: Proceedings of Transactions of the Association for Computational Linguistics. 2020, 726\u2013742"},{"key":"40579_CR27","doi-asserted-by":"publisher","first-page":"4996","DOI":"10.18653\/v1\/P19-1493","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics","author":"T Pires","year":"2019","unstructured":"Pires T, Schlinger E, Garrette D. How multilingual is multilingual BERT? In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 
2019, 4996\u20135001"},{"key":"40579_CR28","doi-asserted-by":"publisher","first-page":"4623","DOI":"10.18653\/v1\/2020.acl-main.421","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"M Artetxe","year":"2020","unstructured":"Artetxe M, Ruder S, Yogatama D. On the cross-lingual transferability of monolingual representations. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 4623\u20134637"},{"issue":"1","key":"40579_CR29","first-page":"240","volume":"24","author":"A Chowdhery","year":"2023","unstructured":"Chowdhery A, Narang S, Devlin J, Bosma M, Mishra G et al. PaLM: Scaling language modeling with pathways. The Journal of Machine Learning Research, 2023, 24(1): 240","journal-title":"The Journal of Machine Learning Research"},{"key":"40579_CR30","unstructured":"Thoppilan R, De Freitas D, Hall J, Shazeer N, Kulshreshtha A. et al. LaMDA: language models for dialog applications. 2022, arXiv preprint arXiv: 2201.08239"},{"key":"40579_CR31","unstructured":"Zhang S, Roller S, Goyal N, Artetxe M, Chen M et al. OPT: open pre-trained transformer language models. 2022, arXiv preprint arXiv: 2205.01068"},{"key":"40579_CR32","first-page":"320","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"Z Du","year":"2022","unstructured":"Du Z, Qian Y, Liu X, Ding M, Qiu J, Yang Z, Tang J. GLM: general language model pretraining with autoregressive blank infilling. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 320\u2013335"},{"key":"40579_CR33","volume-title":"Proceedings of the 11th International Conference on Learning Representations","author":"A Zeng","year":"2023","unstructured":"Zeng A, Liu X, Du Z, Wang Z, Lai H, Ding M, Yang Z, Xu Y, Zheng W, Xia X, Tam W L, Ma Z, Xue Y, Zhai J, Chen W, Liu Z, Zhang P, Dong Y, Tang J. 
GLM-130B: an open bilingual pre-trained model. In: Proceedings of the 11th International Conference on Learning Representations. 2023"},{"key":"40579_CR34","volume-title":"See vicuna. lmsys. org website","author":"W L Chiang","year":"2023","unstructured":"Chiang W L, Li Z, Lin Z, Sheng Y, Wu Z, Zhang H, Zheng L, Zhuang S, Zhuang Y, Gonzalez J E et al. Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality. See vicuna. lmsys. org website, 2023"},{"key":"40579_CR35","unstructured":"Anil R, Borgeaud S, Alayrac J B, Yu J, Soricut R. et al. Gemini: a family of highly capable multimodal models. 2023, arXiv preprint arXiv: 2312.11805"},{"key":"40579_CR36","first-page":"3118","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing","author":"P Rust","year":"2021","unstructured":"Rust P, Pfeiffer J, Vuli\u0107 I, Ruder S, Gurevych I. How good is your tokenizer? On the monolingual performance of multilingual language models. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. 2021, 3118\u20133135"},{"key":"40579_CR37","doi-asserted-by":"publisher","first-page":"12401","DOI":"10.18653\/v1\/2024.findings-acl.738","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024","author":"D Zhang","year":"2024","unstructured":"Zhang D, Yu Y, Dong J, Li C, Su D, Chu C, Yu D. MM-LLMs: recent advances in MultiModal large language models. In: Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024. 2024, 12401\u201312430"},{"key":"40579_CR38","unstructured":"Rae J W, Borgeaud S, Cai T, Millican K, Hoffmann J. et al. Scaling language models: Methods, analysis & insights from training gopher. 
2021, arXiv preprint arXiv: 2112.11446"},{"issue":"70","key":"40579_CR39","first-page":"1","volume":"25","author":"H W Chung","year":"2024","unstructured":"Chung H W, Hou L, Longpre S, Zoph B, Tay Y. et al. Scaling instruction-finetuned language models. Journal of Machine Learning Research, 2024, 25(70): 1\u201353","journal-title":"Journal of Machine Learning Research"},{"key":"40579_CR40","unstructured":"OpenAI. Introducing ChatGPT. See openai.com\/index\/chatgpt\/ website, 2022"},{"key":"40579_CR41","first-page":"8469","volume-title":"Proceedings of the 40th International Conference on Machine Learning","author":"D Driess","year":"2023","unstructured":"Driess D, Xia F, Sajjadi M S M, Lynch C, Chowdhery A. et al. PaLM-E: An embodied multimodal language model. In: Proceedings of the 40th International Conference on Machine Learning. 2023, 8469\u20138488"},{"key":"40579_CR42","volume-title":"See github. com\/tatsulab\/stanford_alpaca website","author":"R Taori","year":"2023","unstructured":"Taori R, Gulrajani I, Zhang T, Dubois Y, Li X, Guestrin C, Liang P, Hashimoto T B. Stanford alpaca: An instruction-following llama model. See github. com\/tatsulab\/stanford_alpaca website, 2023"},{"key":"40579_CR43","unstructured":"Ren X, Zhou P, Meng X, Huang X, Wang Y, Wang W, Li P, Zhang X, Podolskiy A, Arshinov G, Bout A, Piontkovskaya I, Wei J, Jiang X, Su T, Liu Q, Yao J. PanGu-\u03a3: Towards trillion parameter language model with sparse heterogeneous computing. 2023, arXiv preprint arXiv: 2303.10845"},{"key":"40579_CR44","first-page":"2397","volume-title":"Proceedings of the 40th International Conference on Machine Learning","author":"S Biderman","year":"2023","unstructured":"Biderman S, Schoelkopf H, Anthony Q G, Bradley H, O\u2019Brien K, Hallahan E, Khan M A, Purohit S, Prashanth U S, Raff E, Skowron A, Sutawika L, Van Der Wal O. Pythia: a suite for analyzing large language models across training and scaling. 
In: Proceedings of the 40th International Conference on Machine Learning. 2023, 2397\u20132430"},{"key":"40579_CR45","unstructured":"Anil R, Dai A M, Firat O, Johnson M, Lepikhin D. et al. PaLM 2 technical report. 2023, arXiv preprint arXiv: 2305.10403"},{"key":"40579_CR46","unstructured":"Touvron H, Martin L, Stone K, Albert P, Almahairi A. et al. Llama 2: open foundation and fine-tuned chat models. 2023, arXiv preprint arXiv: 2307.09288"},{"key":"40579_CR47","volume-title":"See ai.google\/static\/documents\/google-about-bard.pdf Google Static Documents","author":"J Manyika","year":"2023","unstructured":"Manyika J, Hsiao S. An overview of bard: an early experiment with generative AI. See ai.google\/static\/documents\/google-about-bard.pdf Google Static Documents, 2023"},{"key":"40579_CR48","unstructured":"Yang A, Xiao B, Wang B, Zhang B, Bian C. et al. Baichuan 2: Open large-scale language models. 2023, arXiv preprint arXiv: 2309.10305"},{"key":"40579_CR49","unstructured":"MICROSOFT. Phi-2: the surprising power of small language models. See microsoft.com\/en-us\/research\/blog\/phi-2-the-surprising-power-of-small-language-models\/ website, 2023"},{"key":"40579_CR50","unstructured":"Zeng A, Xu B, Wang B, Zhang C, Yin D. et al. ChatGLM: a family of large language models from GLM-130B to GLM-4 all tools. 2024, arXiv preprint arXiv: 2406.12793"},{"key":"40579_CR51","unstructured":"Anthropic. The Claude 3 model family: Opus, sonnet, haiku. See anthropic.com\/news\/claude-3-family\/ website, 2024"},{"key":"40579_CR52","unstructured":"Dubey A, Jauhri A, Pandey A, Kadian A, Al-Dahle A. et al. The llama 3 herd of models. 2024, arXiv preprint arXiv: 2407.21783"},{"key":"40579_CR53","unstructured":"Zhao W X, Zhou K, Li J, Tang T, Wang X. et al. A survey of large language models. 2023, arXiv preprint arXiv: 2303.18223"},{"key":"40579_CR54","unstructured":"Doddapaneni S, Ramesh G, Kunchukuttan A, Kumar P, Khapra M M. A primer on pretrained multilingual language models. 
2021, arXiv preprint arXiv: 2107.00676"},{"issue":"10","key":"40579_CR55","doi-asserted-by":"publisher","first-page":"1872","DOI":"10.1007\/s11431-020-1647-3","volume":"63","author":"X Qiu","year":"2020","unstructured":"Qiu X, Sun T, Xu Y, Shao Y, Dai N, Huang X. Pre-trained models for natural language processing: a survey. Science China Technological Sciences, 2020, 63(10): 1872\u20131897","journal-title":"Science China Technological Sciences"},{"key":"40579_CR56","unstructured":"Shen T, Jin R, Huang Y, Liu C, Dong W, Guo Z, Wu X, Liu Y, Xiong D. Large language model alignment: A survey. 2023, arXiv preprint arXiv: 2309.15025"},{"key":"40579_CR57","unstructured":"Glaese A, McAleese N, Tr\u0119bacz M, Aslanides J, Firoiu V. et al. Improving alignment of dialogue agents via targeted human judgements. 2022, arXiv preprint arXiv: 2209.14375"},{"key":"40579_CR58","unstructured":"Bai Y, Jones A, Ndousse K, Askell A, Chen A. et al. Training a helpful and harmless assistant with reinforcement learning from human feedback. 2022, arXiv preprint arXiv: 2204.05862"},{"key":"40579_CR59","first-page":"241","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"R Liu","year":"2022","unstructured":"Liu R, Zhang G, Feng X, Vosoughi S. Aligning generative language models with human values. In: Proceedings of the Findings of the Association for Computational Linguistics. 2022, 241\u2013252"},{"key":"40579_CR60","unstructured":"Baheti A, Lu X, Brahman F, Le Bras R, Sap M, Riedl M O. Improving language models with advantage-based offline policy gradients. 2023, arXiv preprint arXiv: 2305.14718"},{"key":"40579_CR61","first-page":"463","volume-title":"Proceedings of the 40th International Conference on Machine Learning","author":"D Go","year":"2023","unstructured":"Go D, Korbak T, Kruszewski G, Rozen J, Ryu N, Dymetman M. Aligning language models with preferences through f-divergence minimization. 
In: Proceedings of the 40th International Conference on Machine Learning. 2023, 463"},{"key":"40579_CR62","unstructured":"Askell A, Bai Y, Chen A, Drain D, Ganguli D. et al. A general language assistant as a laboratory for alignment. 2021, arXiv preprint arXiv: 2112.00861"},{"key":"40579_CR63","volume-title":"See huggingface.co\/blog\/rlhf website","author":"N Lambert","year":"2022","unstructured":"Lambert N, Castricato L, Werra V L, Havrilla A. Illustrating reinforcement learning from human feedback (RLHF). See huggingface.co\/blog\/rlhf website, 2022"},{"key":"40579_CR64","first-page":"253","volume-title":"Proceedings of the 34th International Conference on Neural Information Processing Systems","author":"N Stiennon","year":"2020","unstructured":"Stiennon N, Ouyang L, Wu J, Ziegler D M, Lowe R, Voss C, Radford A, Amodei D, Christiano P. Learning to summarize from human feedback. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 253"},{"key":"40579_CR65","unstructured":"Schulman J, Wolski F, Dhariwal P, Radford A, Klimov O. Proximal policy optimization algorithms. 2017, arXiv preprint arXiv: 1707.06347"},{"key":"40579_CR66","first-page":"1928","volume-title":"Proceedings of the 33rd International Conference on Machine Learning","author":"V Mnih","year":"2016","unstructured":"Mnih V, Badia A P, Mirza M, Graves A, Lillicrap T P, Harley T, Silver D, Kavukcuoglu K. Asynchronous methods for deep reinforcement learning. In: Proceedings of the 33rd International Conference on Machine Learning. 2016, 1928\u20131937"},{"issue":"4","key":"40579_CR67","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1016\/S1364-6613(99)01294-2","volume":"3","author":"R M French","year":"1999","unstructured":"French R M. Catastrophic forgetting in connectionist networks. 
Trends in Cognitive Sciences, 1999, 3(4): 128\u2013135","journal-title":"Trends in Cognitive Sciences"},{"key":"40579_CR68","first-page":"2545","volume-title":"Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics","author":"M A Hedderich","year":"2021","unstructured":"Hedderich M A, Lange L, Adel H, Str\u00f6tgen J, Klakow D. A survey on recent approaches for natural language processing in low-resource scenarios. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics. 2021, 2545\u20132568"},{"key":"40579_CR69","first-page":"4336","volume-title":"Proceedings of the 29th International Conference on Computational Linguistics","author":"J O Alabi","year":"2022","unstructured":"Alabi J O, Adelani D I, Mosbach M, Klakow D. Adapting pre-trained language models to african languages via multilingual adaptive fine-tuning. In: Proceedings of the 29th International Conference on Computational Linguistics. 2022, 4336\u20134349"},{"issue":"1","key":"40579_CR70","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1186\/s40537-022-00590-7","volume":"9","author":"W Wongso","year":"2022","unstructured":"Wongso W, Lucky H, Suhartono D. Pre-trained transformer-based language models for sundanese. Journal of Big Data, 2022, 9(1): 39","journal-title":"Journal of Big Data"},{"key":"40579_CR71","first-page":"1","volume-title":"Proceedings of the 9th Workshop on Slavic Natural Language Processing","author":"S Torge","year":"2023","unstructured":"Torge S, Politov A, Lehmann C, Saffar B, Tao Z. Named entity recognition for low-resource languages-profiting from language families. In: Proceedings of the 9th Workshop on Slavic Natural Language Processing. 
2023, 1\u201310"},{"key":"40579_CR72","first-page":"29","volume-title":"Proceedings of the 1st NLPL Workshop on Deep Learning for Natural Language Processing","author":"S R\u00f6nnqvist","year":"2019","unstructured":"R\u00f6nnqvist S, Kanerva J, Salakoski T, Ginter F. Is multilingual BERT fluent in language generation? In: Proceedings of the 1st NLPL Workshop on Deep Learning for Natural Language Processing. 2019, 29\u201336"},{"key":"40579_CR73","first-page":"2649","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics: EMNLP","author":"Z Wang","year":"2020","unstructured":"Wang Z, Karthikeyan K, Mayhew S, Roth D. Extending multilingual BERT to low-resource languages. In: Proceedings of the Findings of the Association for Computational Linguistics: EMNLP. 2020, 2649\u20132656"},{"key":"40579_CR74","doi-asserted-by":"publisher","first-page":"13244","DOI":"10.18653\/v1\/2023.emnlp-main.818","volume-title":"Proceedings of 2023 Conference on Empirical Methods in Natural Language Processing","author":"R Choenni","year":"2023","unstructured":"Choenni R, Garrette D, Shutova E. How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning. In: Proceedings of 2023 Conference on Empirical Methods in Natural Language Processing. 2023, 13244\u201313257"},{"issue":"1","key":"40579_CR75","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1007\/s11263-023-01868-w","volume":"132","author":"Y Wang","year":"2024","unstructured":"Wang Y, Yu Z, Wang J, Heng Q, Chen H, Ye W, Xie R, Xie X, Zhang S. Exploring vision-language models for imbalanced learning. 
International Journal of Computer Vision, 2024, 132(1): 224\u2013237","journal-title":"International Journal of Computer Vision"},{"key":"40579_CR76","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1007\/978-3-031-47843-7_6","volume-title":"Proceedings of the 34th Australasian Database Conference on Databases Theory and Applications","author":"Y Jiang","year":"2024","unstructured":"Jiang Y, Qiu R, Zhang Y, Zhang P F. Balanced and explainable social media analysis for public health with large language models. In: Proceedings of the 34th Australasian Database Conference on Databases Theory and Applications. 2024, 73\u201386"},{"key":"40579_CR77","doi-asserted-by":"publisher","first-page":"9019","DOI":"10.18653\/v1\/2022.emnlp-main.616","volume-title":"Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing","author":"X V Lin","year":"2022","unstructured":"Lin X V, Mihaylov T, Artetxe M, Wang T, Chen S et al. Few-shot learning with multilingual generative language models. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022, 9019\u20139052"},{"key":"40579_CR78","first-page":"603","volume-title":"Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases","author":"L Tian","year":"2021","unstructured":"Tian L, Zhang X, Lau J H. Rumour detection via zero-shot cross-lingual transfer learning. In: Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases. 2021, 603\u2013618"},{"key":"40579_CR79","volume-title":"Proceedings of the 11th International Conference on Learning Representations","author":"F Shi","year":"2023","unstructured":"Shi F, Suzgun M, Freitag M, Wang X, Srivats S, Vosoughi S, Chung H W, Tay Y, Ruder S, Zhou D, Das D, Wei J. Language models are multilingual chain-of-thought reasoners. In: Proceedings of the 11th International Conference on Learning Representations. 
2023"},{"key":"40579_CR80","first-page":"1251","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"T Ogunremi","year":"2023","unstructured":"Ogunremi T, Jurafsky D, Manning C D. Mini but mighty: Efficient multilingual pretraining with linguistically-informed data selection. In: Proceedings of the Findings of the Association for Computational Linguistics. 2023, 1251\u20131266"},{"key":"40579_CR81","doi-asserted-by":"publisher","first-page":"116","DOI":"10.18653\/v1\/2021.mrl-1.11","volume-title":"Proceedings of the 1st Workshop on Multilingual Representation Learning","author":"K Ogueji","year":"2021","unstructured":"Ogueji K, Zhu Y, Lin J. Small data? No problem! Exploring the viability of pretrained multilingual language models for low-resourced languages. In: Proceedings of the 1st Workshop on Multilingual Representation Learning. 2021, 116\u2013126"},{"key":"40579_CR82","doi-asserted-by":"publisher","first-page":"113765","DOI":"10.1016\/j.eswa.2020.113765","volume":"165","author":"M Pikuliak","year":"2021","unstructured":"Pikuliak M, \u0160imko M, Bielikov\u00e1 M. Cross-lingual learning for text processing: a survey. Expert Systems with Applications, 2021, 165: 113765","journal-title":"Expert Systems with Applications"},{"key":"40579_CR83","first-page":"5877","volume-title":"Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics","author":"F Philippy","year":"2023","unstructured":"Philippy F, Guo S, Haddadan S. Towards a common understanding of contributing factors for cross-lingual transfer in multilingual language models: a review. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. 
2023, 5877\u20135891"},{"key":"40579_CR84","first-page":"3464","volume-title":"Proceedings of the 37th International Conference on Neural Information Processing Systems","author":"G Penedo","year":"2023","unstructured":"Penedo G, Malartic Q, Hesslow D, Cojocaru R, Alobeidli H, Cappelli A, Pannier B, Almazrouei E, Launay J. The RefinedWeb dataset for Falcon LLM: outperforming curated corpora with web data only. In: Proceedings of the 37th International Conference on Neural Information Processing Systems. 2023, 3464"},{"key":"40579_CR85","unstructured":"Luo Y, Kong Q, Xu N, Cao J, Hao B et al. YAYI 2: multilingual open-source large language models. 2023, arXiv preprint arXiv: 2312.14862"},{"key":"40579_CR86","first-page":"1499","volume-title":"Proceedings of 2024 Conference on Empirical Methods in Natural Language Processing","author":"H Sun","year":"2024","unstructured":"Sun H, Jin R, Xu S, Pan L, Supryadi, Cui M, Du J, Lei Y, Yang L, Shi L, Xiao J, Zhu S, Xiong D. FuxiTranyu: a multilingual large language model trained with balanced data. In: Proceedings of 2024 Conference on Empirical Methods in Natural Language Processing. 2024, 1499\u20131522"},{"key":"40579_CR87","doi-asserted-by":"publisher","first-page":"4488","DOI":"10.18653\/v1\/2022.emnlp-main.298","volume-title":"Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing","author":"D Adelani","year":"2022","unstructured":"Adelani D, Neubig G, Ruder S, Rijhwani S, Beukman M et al. MasakhaNER 2.0: Africa-centric transfer learning for named entity recognition. In: Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing. 2022, 4488\u20134508"},{"key":"40579_CR88","first-page":"3798","volume-title":"Proceedings of the 29th International Conference on Computational Linguistics","author":"S Malmasi","year":"2022","unstructured":"Malmasi S, Fang A, Fetahu B, Kar S, Rokhlenko O. MultiCoNER: a large-scale multilingual dataset for complex named entity recognition. 
In: Proceedings of the 29th International Conference on Computational Linguistics. 2022, 3798\u20133809"},{"key":"40579_CR89","doi-asserted-by":"publisher","first-page":"6542","DOI":"10.18653\/v1\/2020.coling-main.575","volume-title":"Proceedings of the 28th International Conference on Computational Linguistics","author":"E \u00d6hman","year":"2020","unstructured":"\u00d6hman E, P\u00e0mies M, Kajava K, Tiedemann J. XED: a multilingual dataset for sentiment analysis and emotion detection. In: Proceedings of the 28th International Conference on Computational Linguistics. 2020, 6542\u20136552"},{"key":"40579_CR90","first-page":"986","volume-title":"Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics","author":"I Shode","year":"2023","unstructured":"Shode I, Adelani D I, Peng J, Feldman A. NollySenti: Leveraging transfer learning and machine translation for Nigerian movie sentiment classification. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. 2023, 986\u2013998"},{"key":"40579_CR91","first-page":"590","volume-title":"Proceedings of the 13th Language Resources and Evaluation Conference","author":"S H Muhammad","year":"2022","unstructured":"Muhammad S H, Adelani D I, Ruder S, Ahmad I S, Abdulmumin I, Bello B S, Choudhury M, Emezue C C, Abdullahi S S, Aremu A, Jorge A, Brazdil P. NaijaSenti: a Nigerian twitter sentiment corpus for multilingual sentiment analysis. In: Proceedings of the 13th Language Resources and Evaluation Conference. 2022, 590\u2013602"},{"key":"40579_CR92","doi-asserted-by":"publisher","first-page":"8721","DOI":"10.18653\/v1\/2022.emnlp-main.597","volume-title":"Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing","author":"O Ogundepo","year":"2022","unstructured":"Ogundepo O, Zhang X, Sun S, Duh K, Lin J. AfriCLIRMatrix: enabling cross-lingual information retrieval for African languages. 
In: Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing. 2022, 8721\u20138728"},{"key":"40579_CR93","first-page":"4160","volume-title":"Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing","author":"S Sun","year":"2020","unstructured":"Sun S, Duh K. CLIRMatrix: a massively large collection of bilingual and multilingual datasets for cross-lingual information retrieval. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 4160\u20134170"},{"key":"40579_CR94","doi-asserted-by":"crossref","unstructured":"Ma C, Imani A, Ye H, Asgari E, Sch\u00fctze H. Taxi1500: a multilingual dataset for text classification in 1500 languages. 2023, arXiv preprint arXiv: 2305.08487","DOI":"10.21203\/rs.3.rs-3235946\/v1"},{"key":"40579_CR95","first-page":"4563","volume-title":"Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing","author":"P Keung","year":"2020","unstructured":"Keung P, Lu Y, Szarvas G, Smith N A. The multilingual Amazon reviews corpus. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 4563\u20134568"},{"key":"40579_CR96","volume-title":"Proceedings of the 6th International Conference on Learning Representations","author":"G Lample","year":"2018","unstructured":"Lample G, Conneau A, Ranzato M, Denoyer L, J\u00e9gou H. Word translation without parallel data. In: Proceedings of the 6th International Conference on Learning Representations. 2018"},{"key":"40579_CR97","unstructured":"Linguatools.org. Wikipedia monolingual corpora. See linguatools\/tools\/corpora\/wikipedia-monolingual-corpora\/website, 2018"},{"key":"40579_CR98","first-page":"2080","volume-title":"Proceedings of the 13th Language Resources and Evaluation Conference","author":"C Palen-Michel","year":"2022","unstructured":"Palen-Michel C, Kim J, Lignos C. Multilingual open text release 1: Public domain news in 44 languages. 
In: Proceedings of the 13th Language Resources and Evaluation Conference. 2022, 2080\u20132089"},{"key":"40579_CR99","first-page":"923","volume-title":"Proceedings of the 10th International Conference on Language Resources and Evaluation","author":"P Lison","year":"2016","unstructured":"Lison P, Tiedemann J. OpenSubtitles2016: Extracting large parallel corpora from movie and TV subtitles. In: Proceedings of the 10th International Conference on Language Resources and Evaluation. 2016, 923\u2013929"},{"key":"40579_CR100","first-page":"2765","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"W Zhu","year":"2024","unstructured":"Zhu W, Liu H, Dong Q, Xu J, Huang S, Kong L, Chen J, Li L. Multilingual machine translation with large language models: empirical results and analysis. In: Proceedings of the Findings of the Association for Computational Linguistics. 2024, 2765\u20132781"},{"key":"40579_CR101","first-page":"29","volume-title":"Proceedings of the 6th Workshop on Representation Learning for NLP","author":"N Goyal","year":"2021","unstructured":"Goyal N, Du J, Ott M, Anantharaman G, Conneau A. Larger-scale transformers for multilingual masked language modeling. In: Proceedings of the 6th Workshop on Representation Learning for NLP. 2021, 29\u201333"},{"key":"40579_CR102","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski P, Grave E, Joulin A, Mikolov T. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 2017, 5: 135\u2013146","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"40579_CR103","first-page":"789","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics","author":"M Artetxe","year":"2018","unstructured":"Artetxe M, Labaka G, Agirre E. 
A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018, 789\u2013798"},{"key":"40579_CR104","first-page":"778","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics","author":"A S\u00f8gaard","year":"2018","unstructured":"S\u00f8gaard A, Ruder S, Vuli\u0107 I. On the limitations of unsupervised bilingual dictionary induction. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018, 778\u2013788"},{"key":"40579_CR105","doi-asserted-by":"publisher","first-page":"512","DOI":"10.18653\/v1\/D18-1047","volume-title":"Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing","author":"N Nakashole","year":"2018","unstructured":"Nakashole N. NORMA: Neighborhood sensitive maps for multilingual word embeddings. In: Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing. 2018, 512\u2013522"},{"key":"40579_CR106","first-page":"463","volume-title":"Proceedings of 2021 Conference of the North American Chapter of the Association for Computational Linguistics","author":"H Wang","year":"2021","unstructured":"Wang H, Henderson J, Merlo P. Multi-adversarial learning for cross-lingual word embeddings. In: Proceedings of 2021 Conference of the North American Chapter of the Association for Computational Linguistics. 2021, 463\u2013472"},{"key":"40579_CR107","doi-asserted-by":"publisher","first-page":"114135","DOI":"10.1016\/j.psychres.2021.114135","volume":"304","author":"J Sarzynska-Wawer","year":"2021","unstructured":"Sarzynska-Wawer J, Wawer A, Pawlak A, Szymanowska J, Stefaniak I, Jarkiewicz M, Okruszek L. Detecting formal thought disorder by deep contextualized word representations. 
Psychiatry Research, 2021, 304: 114135","journal-title":"Psychiatry Research"},{"key":"40579_CR108","first-page":"1599","volume-title":"Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics","author":"T Schuster","year":"2019","unstructured":"Schuster T, Ram O, Barzilay R, Globerson A. Cross-lingual alignment of contextual word embeddings, with applications to zero-shot dependency parsing. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics. 2019, 1599\u20131613"},{"issue":"2","key":"40579_CR109","first-page":"23","volume":"12","author":"P Gage","year":"1994","unstructured":"Gage P. A new algorithm for data compression. The C Users Journal, 1994, 12(2): 23\u201338","journal-title":"The C Users Journal"},{"key":"40579_CR110","first-page":"5149","volume-title":"Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing","author":"M Schuster","year":"2012","unstructured":"Schuster M, Nakajima K. Japanese and Korean voice search. In: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. 2012, 5149\u20135152"},{"key":"40579_CR111","first-page":"7222","volume-title":"Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing","author":"I Vuli\u0107","year":"2020","unstructured":"Vuli\u0107 I, Ponti E M, Litschko R, Glava\u0161 G, Korhonen A. Probing pretrained language models for lexical semantics. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 7222\u20137240"},{"key":"40579_CR112","first-page":"2943","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"J Zhang","year":"2021","unstructured":"Zhang J, Ji B, Xiao N, Duan X, Zhang M, Shi Y, Luo W. Combining static word embeddings and contextual representations for bilingual lexicon induction. 
In: Proceedings of the Findings of the Association for Computational Linguistics. 2021, 2943\u20132955"},{"key":"40579_CR113","first-page":"2316","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"K H\u00e4mmerl","year":"2022","unstructured":"H\u00e4mmerl K, Libovick\u00fd J, Fraser A. Combining static and contextualised multilingual embeddings. In: Proceedings of the Findings of the Association for Computational Linguistics. 2022, 2316\u20132329"},{"key":"40579_CR114","first-page":"8154","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"J Zheng","year":"2022","unstructured":"Zheng J, Wang Y, Wang G, Xia J, Huang Y, Zhao G, Zhang Y, Li S. Using context-to-vector with graph retrofitting to improve word embeddings. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 8154\u20138163"},{"key":"40579_CR115","first-page":"4353","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"Y Li","year":"2022","unstructured":"Li Y, Liu F, Collier N, Korhonen A, Vuli\u0107 I. Improving word translation via two-stage contrastive learning. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 4353\u20134374"},{"key":"40579_CR116","doi-asserted-by":"publisher","first-page":"1881","DOI":"10.18653\/v1\/D18-1214","volume-title":"Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing","author":"D Alvarez-Melis","year":"2018","unstructured":"Alvarez-Melis D, Jaakkola T. Gromov-Wasserstein alignment of word embedding spaces. In: Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing. 
2018, 1881\u20131890"},{"key":"40579_CR117","doi-asserted-by":"publisher","first-page":"3476","DOI":"10.18653\/v1\/2020.acl-main.318","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"S Ren","year":"2020","unstructured":"Ren S, Liu S, Zhou M, Ma S. A graph-based coarse-to-fine method for unsupervised bilingual lexicon induction. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 3476\u20133485"},{"key":"40579_CR118","first-page":"3857","volume-title":"Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics","author":"T Mohiuddin","year":"2019","unstructured":"Mohiuddin T, Joty S. Revisiting adversarial autoencoder for unsupervised word translation with cycle consistency and improved training. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics. 2019, 3857\u20133867"},{"key":"40579_CR119","first-page":"2712","volume-title":"Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing","author":"T Mohiuddin","year":"2020","unstructured":"Mohiuddin T, Bari M S, Joty S. LNMap: Departures from isomorphic assumption in bilingual lexicon induction through non-linear mapping in latent space. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 2712\u20132723"},{"key":"40579_CR120","doi-asserted-by":"publisher","first-page":"7548","DOI":"10.18653\/v1\/2020.acl-main.675","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"G Glava\u0161","year":"2020","unstructured":"Glava\u0161 G, Vuli\u0107 I. Non-linear instance-based cross-lingual mapping for non-isomorphic embedding spaces. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 
2020, 7548\u20137555"},{"key":"40579_CR121","doi-asserted-by":"publisher","first-page":"6019","DOI":"10.18653\/v1\/2022.emnlp-main.404","volume-title":"Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing","author":"K Marchisio","year":"2022","unstructured":"Marchisio K, Verma N, Duh K, Koehn P. IsoVec: controlling the relative isomorphism of word embedding spaces. In: Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing. 2022, 6019\u20136033"},{"key":"40579_CR122","doi-asserted-by":"publisher","first-page":"47","DOI":"10.18653\/v1\/D19-6106","volume-title":"Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP","author":"J Singh","year":"2019","unstructured":"Singh J, McCann B, Socher R, Xiong C. BERT is not an interlingua and the bias of tokenization. In: Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP. 2019, 47\u201355"},{"key":"40579_CR123","first-page":"1330","volume-title":"Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing","author":"H Taitelbaum","year":"2019","unstructured":"Taitelbaum H, Chechik G, Goldberger J. Multilingual word translation using auxiliary languages. In: Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. 2019, 1330\u20131335"},{"key":"40579_CR124","volume-title":"Proceedings of the 8th International Conference on Learning Representations","author":"K Karthikeyan","year":"2020","unstructured":"Karthikeyan K, Wang Z, Mayhew S, Roth D. Cross-lingual ability of multilingual BERT: an empirical study. In: Proceedings of the 8th International Conference on Learning Representations. 2020"},{"key":"40579_CR125","unstructured":"Liu C L, Hsu T Y, Chuang Y S, Lee H Y. 
A study of cross-lingual ability and language-specific information in multilingual BERT. 2020, arXiv preprint arXiv: 2004.09205"},{"key":"40579_CR126","unstructured":"Ranjan R, Gupta S, Singh S N. A comprehensive survey of bias in LLMs: current landscape and future directions. 2024, arXiv preprint arXiv: 2409.16430"},{"key":"40579_CR127","unstructured":"Cao S, Cheng R, Wang Z. AGR: age group fairness reward for bias mitigation in LLMs. 2024, arXiv preprint arXiv: 2409.04340"},{"key":"40579_CR128","doi-asserted-by":"publisher","first-page":"533","DOI":"10.18653\/v1\/2021.emnlp-main.42","volume-title":"Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing","author":"J Ahn","year":"2021","unstructured":"Ahn J, Oh A. Mitigating language-dependent ethnic bias in BERT. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 533\u2013549"},{"key":"40579_CR129","first-page":"1878","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"N Meade","year":"2022","unstructured":"Meade N, Poole-Dayan E, Reddy S. An empirical survey of the effectiveness of debiasing techniques for pre-trained language models. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 1878\u20131898"},{"key":"40579_CR130","doi-asserted-by":"publisher","first-page":"2896","DOI":"10.18653\/v1\/2020.acl-main.260","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"J Zhao","year":"2020","unstructured":"Zhao J, Mukherjee S, Hosseini S, Chang K W, Awadallah A H. Gender bias in multilingual embeddings and cross-lingual transfer. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 2896\u20132907"},{"key":"40579_CR131","doi-asserted-by":"crossref","unstructured":"Ferrara E. Should ChatGPT be biased? 
Challenges and risks of bias in large language models. 2023, arXiv preprint arXiv: 2304.03738","DOI":"10.2139\/ssrn.4627814"},{"key":"40579_CR132","doi-asserted-by":"publisher","first-page":"120","DOI":"10.18653\/v1\/2020.repl4nlp-1.16","volume-title":"Proceedings of the 5th Workshop on Representation Learning for NLP","author":"S Wu","year":"2020","unstructured":"Wu S, Dredze M. Are all languages created equal in multilingual BERT? In: Proceedings of the 5th Workshop on Representation Learning for NLP. 2020, 120\u2013130"},{"key":"40579_CR133","first-page":"2681","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"J Wang","year":"2022","unstructured":"Wang J, Liu Y, Wang X. Assessing multilingual fairness in pre-trained multimodal representations. In: Proceedings of the Findings of the Association for Computational Linguistics. 2022, 2681\u20132695"},{"key":"40579_CR134","first-page":"3250","volume-title":"Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics","author":"N Kassner","year":"2021","unstructured":"Kassner N, Dufter P, Sch\u00fctze H. Multilingual LAMA: investigating knowledge in multilingual pretrained language models. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics. 2021, 3250\u20133258"},{"key":"40579_CR135","doi-asserted-by":"publisher","first-page":"10260","DOI":"10.18653\/v1\/2023.emnlp-main.634","volume-title":"Proceedings of 2023 Conference on Empirical Methods in Natural Language Processing","author":"S Levy","year":"2023","unstructured":"Levy S, John N A, Liu L, Vyas Y, Ma J, Fujinuma Y, Ballesteros M, Castelli V, Roth D. Comparing biases and the impact of multilingual training across multiple languages. In: Proceedings of 2023 Conference on Empirical Methods in Natural Language Processing. 
2023, 10260\u201310280"},{"key":"40579_CR136","first-page":"3597","volume-title":"Proceedings of the 29th International Conference on Computational Linguistics","author":"L C Piqueras","year":"2022","unstructured":"Piqueras L C, S\u00f8gaard A. Are pretrained multilingual models equally fair across languages? In: Proceedings of the 29th International Conference on Computational Linguistics. 2022, 3597\u20133605"},{"key":"40579_CR137","first-page":"200","volume-title":"Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing","author":"S Touileb","year":"2022","unstructured":"Touileb S, \u00d8vrelid L, Velldal E. Occupational biases in Norwegian and multilingual language models. In: Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing. 2022, 200\u2013211"},{"key":"40579_CR138","first-page":"16366","volume-title":"Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics","author":"T Naous","year":"2024","unstructured":"Naous T, Ryan M J, Ritter A, Xu W. Having beer after prayer? Measuring cultural bias in large language models. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. 2024, 16366\u201316393"},{"issue":"6","key":"40579_CR139","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1038\/s42256-021-00359-2","volume":"3","author":"A Abid","year":"2021","unstructured":"Abid A, Farooqi M, Zou J. Large language models associate Muslims with violence. Nature Machine Intelligence, 2021, 3(6): 461\u2013463","journal-title":"Nature Machine Intelligence"},{"key":"40579_CR140","first-page":"561","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"Y T Cao","year":"2022","unstructured":"Cao Y T, Pruksachatkun Y, Chang K W, Gupta R, Kumar V, Dhamala J, Galstyan A. On the intrinsic and extrinsic fairness evaluation metrics for contextualized language representations. 
In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 561\u2013570"},{"issue":"75","key":"40579_CR141","first-page":"1","volume":"25","author":"C Leiter","year":"2024","unstructured":"Leiter C, Lertvittayakumjorn P, Fomicheva M, Zhao W, Gao Y, Eger S. Towards explainable evaluation metrics for machine translation. Journal of Machine Learning Research, 2024, 25(75): 1\u201349","journal-title":"Journal of Machine Learning Research"},{"key":"40579_CR142","doi-asserted-by":"publisher","first-page":"3726","DOI":"10.18653\/v1\/2022.emnlp-main.245","volume-title":"Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing","author":"T Sun","year":"2022","unstructured":"Sun T, He J, Qiu X, Huang X. BERTScore is unfair: on social bias in language model-based metrics for text generation. In: Proceedings of 2022 Conference on Empirical Methods in Natural Language Processing. 2022, 3726\u20133739"},{"key":"40579_CR143","volume-title":"Proceedings of the 8th International Conference on Learning Representations","author":"T Zhang","year":"2020","unstructured":"Zhang T, Kishore V, Wu F, Weinberger K Q, Artzi Y. BERTScore: evaluating text generation with BERT. In: Proceedings of the 8th International Conference on Learning Representations. 2020"},{"key":"40579_CR144","doi-asserted-by":"publisher","first-page":"7881","DOI":"10.18653\/v1\/2020.acl-main.704","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"T Sellam","year":"2020","unstructured":"Sellam T, Das D, Parikh A. BLEURT: learning robust metrics for text generation. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 
2020, 7881\u20137892"},{"key":"40579_CR145","first-page":"2088","volume-title":"Proceedings of the 35th International Conference on Neural Information Processing Systems","author":"W Yuan","year":"2021","unstructured":"Yuan W, Neubig G, Liu P. BARTSCORE: evaluating generated text as text generation. In: Proceedings of the 35th International Conference on Neural Information Processing Systems. 2021, 2088"},{"key":"40579_CR146","first-page":"517","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"R Koo","year":"2024","unstructured":"Koo R, Lee M, Raheja V, Park J I, Kim Z M, Kang D. Benchmarking cognitive biases in large language models as evaluators. In: Proceedings of the Findings of the Association for Computational Linguistics. 2024, 517\u2013545"},{"key":"40579_CR147","first-page":"1693","volume-title":"Proceedings of 2022 Conference of the North American Chapter of the Association for Computational Linguistics","author":"P Delobelle","year":"2022","unstructured":"Delobelle P, Tokpo E, Calders T, Berendt B. Measuring fairness with biased rulers: A comparative study on bias metrics for pre-trained language models. In: Proceedings of 2022 Conference of the North American Chapter of the Association for Computational Linguistics. 2022, 1693\u20131706"},{"issue":"6334","key":"40579_CR148","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1126\/science.aal4230","volume":"356","author":"A Caliskan","year":"2017","unstructured":"Caliskan A, Bryson J J, Narayanan A. Semantics derived automatically from language corpora contain human-like biases. 
Science, 2017, 356(6334): 183\u2013186","journal-title":"Science"},{"key":"40579_CR149","doi-asserted-by":"publisher","first-page":"353","DOI":"10.18653\/v1\/W18-5446","volume-title":"Proceedings of 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP","author":"A Wang","year":"2018","unstructured":"Wang A, Singh A, Michael J, Hill F, Levy O, Bowman S. GLUE: a multi-task benchmark and analysis platform for natural language understanding. In: Proceedings of 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP. 2018, 353\u2013355"},{"key":"40579_CR150","first-page":"622","volume-title":"Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics","author":"C May","year":"2019","unstructured":"May C, Wang A, Bordia S, Bowman S R, Rudinger R. On measuring social biases in sentence encoders. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics. 2019, 622\u2013628"},{"key":"40579_CR151","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1145\/3461702.3462536","volume-title":"Proceedings of 2021 AAAI\/ACM Conference on AI, Ethics, and Society","author":"W Guo","year":"2021","unstructured":"Guo W, Caliskan A. Detecting emergent intersectional biases: contextualized word embeddings contain a distribution of human-like biases. In: Proceedings of 2021 AAAI\/ACM Conference on AI, Ethics, and Society. 2021, 122\u2013133"},{"key":"40579_CR152","first-page":"27","volume-title":"Proceedings of the 32nd ACM Conference on Hypertext and Social Media","author":"S Bansal","year":"2021","unstructured":"Bansal S, Garimella V, Suhane A, Mukherjee A. Debiasing multilingual word embeddings: a case study of three Indian languages. In: Proceedings of the 32nd ACM Conference on Hypertext and Social Media. 
2021, 27\u201334"},{"key":"40579_CR153","first-page":"8","volume-title":"Proceedings of 2018 Conference of the North American Chapter of the Association for Computational Linguistics","author":"R Rudinger","year":"2018","unstructured":"Rudinger R, Naradowsky J, Leonard B, Van Durme B. Gender bias in coreference resolution. In: Proceedings of 2018 Conference of the North American Chapter of the Association for Computational Linguistics. 2018, 8\u201314"},{"key":"40579_CR154","first-page":"15","volume-title":"Proceedings of 2018 Conference of the North American Chapter of the Association for Computational Linguistics","author":"J Zhao","year":"2018","unstructured":"Zhao J, Wang T, Yatskar M, Ordonez V, Chang K W. Gender bias in coreference resolution: Evaluation and debiasing methods. In: Proceedings of 2018 Conference of the North American Chapter of the Association for Computational Linguistics. 2018, 15\u201320"},{"key":"40579_CR155","doi-asserted-by":"publisher","first-page":"43","DOI":"10.18653\/v1\/S18-2005","volume-title":"Proceedings of the 7th Joint Conference on Lexical and Computational Semantics","author":"S Kiritchenko","year":"2018","unstructured":"Kiritchenko S, Mohammad S. Examining gender and race bias in two hundred sentiment analysis systems. In: Proceedings of the 7th Joint Conference on Lexical and Computational Semantics. 2018, 43\u201353"},{"key":"40579_CR156","first-page":"1953","volume-title":"Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing","author":"N Nangia","year":"2020","unstructured":"Nangia N, Vania C, Bhalerao R, Bowman S R. CrowS-Pairs: a challenge dataset for measuring social biases in masked language models. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 
2020, 1953\u20131967"},{"key":"40579_CR157","doi-asserted-by":"publisher","first-page":"1679","DOI":"10.18653\/v1\/P19-1164","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics","author":"G Stanovsky","year":"2019","unstructured":"Stanovsky G, Smith N A, Zettlemoyer L. Evaluating gender bias in machine translation. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019, 1679\u20131684"},{"key":"40579_CR158","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1145\/3287560.3287572","volume-title":"Proceedings of the Conference on Fairness, Accountability, and Transparency","author":"M De-Arteaga","year":"2019","unstructured":"De-Arteaga M, Romanov A, Wallach H, Chayes J, Borgs C, Chouldechova A, Geyik S, Kenthapadi K, Kalai A T. Bias in bios: a case study of semantic representation bias in a high-stakes setting. In: Proceedings of the Conference on Fairness, Accountability, and Transparency. 2019, 120\u2013128"},{"key":"40579_CR159","first-page":"1547","volume-title":"Proceedings of 2021 IEEE Winter Conference on Applications of Computer Vision","author":"K Karkkainen","year":"2021","unstructured":"Karkkainen K, Joo J. FairFace: face attribute dataset for balanced race, gender, and age for bias measurement and mitigation. In: Proceedings of 2021 IEEE Winter Conference on Applications of Computer Vision. 2021, 1547\u20131557"},{"key":"40579_CR160","first-page":"85","volume-title":"Proceedings of the 8th Joint Conference on Lexical and Computational Semantics","author":"A Lauscher","year":"2019","unstructured":"Lauscher A, Glava\u0161 G. Are we consistently biased? Multidimensional analysis of biases in distributional word vectors. In: Proceedings of the 8th Joint Conference on Lexical and Computational Semantics. 
2019, 85\u201391"},{"key":"40579_CR161","first-page":"8521","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"A N\u00e9v\u00e9ol","year":"2022","unstructured":"N\u00e9v\u00e9ol A, Dupont Y, Bezan\u00e7on J, Fort K. French CrowS-pairs: extending a challenge dataset for measuring social bias in masked language models to a language other than English. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 8521\u20138531"},{"key":"40579_CR162","first-page":"3730","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"Y Wan","year":"2023","unstructured":"Wan Y, Pu G, Sun J, Garimella A, Chang K W, Peng N. \u201cKelly is a warm person, Joseph is a role model\u201d: Gender biases in LLM-generated reference letters. In: Proceedings of the Findings of the Association for Computational Linguistics. 2023, 3730\u20133748"},{"key":"40579_CR163","doi-asserted-by":"publisher","first-page":"5502","DOI":"10.18653\/v1\/2020.acl-main.488","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"P P Liang","year":"2020","unstructured":"Liang P P, Li I M, Zheng E, Lim Y C, Salakhutdinov R, Morency L P. Towards debiasing sentence representations. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020, 5502\u20135515"},{"key":"40579_CR164","doi-asserted-by":"publisher","first-page":"7237","DOI":"10.18653\/v1\/2020.acl-main.647","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"S Ravfogel","year":"2020","unstructured":"Ravfogel S, Elazar Y, Gonen H, Twiton M, Goldberg Y. Null it out: Guarding protected attributes by iterative nullspace projection. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 
2020, 7237\u20137256"},{"key":"40579_CR165","doi-asserted-by":"publisher","first-page":"5825","DOI":"10.18653\/v1\/2021.emnlp-main.470","volume-title":"Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing","author":"Z Yang","year":"2021","unstructured":"Yang Z, Yang Y, Cer D, Darve E. A simple and effective method to eliminate the self language bias in multilingual representations. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 5825\u20135832"},{"key":"40579_CR166","unstructured":"Webster K, Wang X, Tenney I, Beutel A, Pitler E, Pavlick E, Chen J, Chi E, Petrov S. Measuring and reducing gendered correlations in pre-trained models. 2020, arXiv preprint arXiv: 2010.06032"},{"issue":"1","key":"40579_CR167","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 2014, 15(1): 1929\u20131958","journal-title":"The Journal of Machine Learning Research"},{"key":"40579_CR168","first-page":"4227","volume-title":"Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics","author":"F Zhou","year":"2023","unstructured":"Zhou F, Mao Y, Yu L, Yang Y, Zhong T. Causal-debias: Unifying debiasing in pretrained language models and fine-tuning via causal invariant learning. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics. 2023, 4227\u20134241"},{"key":"40579_CR169","first-page":"372","volume-title":"Proceedings of the 13th Joint Conference on Lexical and Computational Semantics","author":"L Ranaldi","year":"2024","unstructured":"Ranaldi L, Ruzzetti E S, Venditti D, Onorati D, Zanzotto F M. A trip towards fairness: Bias and de-biasing in large language models. 
In: Proceedings of the 13th Joint Conference on Lexical and Computational Semantics. 2024, 372\u2013384"},{"key":"40579_CR170","volume-title":"Proceedings of the 10th International Conference on Learning Representations","author":"E J Hu","year":"2022","unstructured":"Hu E J, Shen Y, Wallis P, Allen-Zhu Z, Li Y, Wang S, Wang L, Chen W. LoRA: Low-rank adaptation of large language models. In: Proceedings of the 10th International Conference on Learning Representations. 2022"},{"key":"40579_CR171","first-page":"3934","volume-title":"Proceedings of IEEE\/CVF International Conference on Computer Vision","author":"A Wang","year":"2023","unstructured":"Wang A, Russakovsky O. Overwriting pretrained bias with finetuning data. In: Proceedings of IEEE\/CVF International Conference on Computer Vision. 2023, 3934\u20133945"},{"key":"40579_CR172","first-page":"1012","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"Y Guo","year":"2022","unstructured":"Guo Y, Yang Y, Abbasi A. Auto-debias: debiasing masked language models with automated biased prompts. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 1012\u20131023"},{"key":"40579_CR173","unstructured":"Mattern J, Jin Z, Sachan M, Mihalcea R, Sch\u00f6lkopf B. Understanding stereotypes in language models: Towards robust measurement and zero-shot debiasing. 2022, arXiv preprint arXiv: 2212.10678"},{"key":"40579_CR174","unstructured":"Dhingra H, Jayashanker P, Moghe S, Strubell E. Queer people are people first: Deconstructing sexual identity stereotypes in large language models. 2023, arXiv preprint arXiv: 2307.00101"},{"key":"40579_CR175","doi-asserted-by":"publisher","first-page":"1408","DOI":"10.1162\/tacl_a_00434","volume":"9","author":"T Schick","year":"2021","unstructured":"Schick T, Udupa S, Sch\u00fctze H. Self-diagnosis and self-debiasing: a proposal for reducing corpus-based bias in NLP. 
Transactions of the Association for Computational Linguistics, 2021, 9: 1408\u20131424","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"40579_CR176","doi-asserted-by":"publisher","first-page":"2475","DOI":"10.18653\/v1\/D18-1269","volume-title":"Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing","author":"A Conneau","year":"2018","unstructured":"Conneau A, Rinott R, Lample G, Williams A, Bowman S, Schwenk H, Stoyanov V. XNLI: Evaluating cross-lingual sentence representations. In: Proceedings of 2018 Conference on Empirical Methods in Natural Language Processing. 2018, 2475\u20132485"},{"key":"40579_CR177","first-page":"4226","volume-title":"Proceedings of 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation","author":"T Nguyen","year":"2024","unstructured":"Nguyen T, Van Nguyen C, Lai V D, Man H, Ngo N T, Dernoncourt F, Rossi R A, Nguyen T H. CulturaX: a cleaned, enormous, and multilingual dataset for large language models in 167 languages. In: Proceedings of 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. 2024, 4226\u20134237"},{"key":"40579_CR178","first-page":"2306","volume-title":"Proceedings of the 36th International Conference on Neural Information Processing Systems","author":"H Lauren\u00e7on","year":"2022","unstructured":"Lauren\u00e7on H, Saulnier L, Wang T, Akiki C, Del Moral A V. et al. The BigScience roots corpus: a 1.6TB composite multilingual dataset. In: Proceedings of the 36th International Conference on Neural Information Processing Systems. 2022, 2306"},{"key":"40579_CR179","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1162\/tacl_a_00447","volume":"10","author":"J Kreutzer","year":"2022","unstructured":"Kreutzer J, Caswell I, Wang L, Wahab A, Van Esch D. et al. Quality at a glance: An audit of web-crawled multilingual datasets. 
Transactions of the Association for Computational Linguistics, 2022, 10: 50\u201372","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"40579_CR180","doi-asserted-by":"publisher","first-page":"10480","DOI":"10.18653\/v1\/2023.emnlp-main.649","volume-title":"Proceedings of 2023 Conference on Empirical Methods in Natural Language Processing","author":"I Sen","year":"2023","unstructured":"Sen I, Assenmacher D, Samory M, Augenstein I, Aalst W, Wagner C. People make better edits: measuring the efficacy of LLM-generated counterfactually augmented data for harmful language detection. In: Proceedings of 2023 Conference on Empirical Methods in Natural Language Processing. 2023, 10480\u201310504"},{"key":"40579_CR181","first-page":"629","volume-title":"Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics","author":"J Zhao","year":"2019","unstructured":"Zhao J, Wang T, Yatskar M, Cotterell R, Ordonez V, Chang K W. Gender bias in contextualized word embeddings. In: Proceedings of 2019 Conference of the North American Chapter of the Association for Computational Linguistics. 2019, 629\u2013634"},{"key":"40579_CR182","first-page":"306","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing","author":"L Yang","year":"2021","unstructured":"Yang L, Li J, Cunningham P, Zhang Y, Smyth B, Dong R. Exploring the efficacy of automatically generated counterfactuals for sentiment analysis. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. 
2021, 306\u2013316"},{"key":"40579_CR183","doi-asserted-by":"publisher","first-page":"325","DOI":"10.18653\/v1\/2021.emnlp-main.28","volume-title":"Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing","author":"I Sen","year":"2021","unstructured":"Sen I, Samory M, Fl\u00f6ck F, Wagner C, Augenstein I. How does counterfactually augmented data impact models for social computing constructs? In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 325\u2013344"},{"key":"40579_CR184","first-page":"4458","volume-title":"Proceedings of the Findings of the Association for Computational Linguistics","author":"S Goldfarb-Tarrant","year":"2023","unstructured":"Goldfarb-Tarrant S, Lopez A, Blanco R, Marcheggiani D. Bias beyond English: counterfactual tests for bias in sentiment analysis in four languages. In: Proceedings of the Findings of the Association for Computational Linguistics. 2023, 4458\u20134468"},{"key":"40579_CR185","first-page":"4716","volume-title":"Proceedings of 2022 Conference of the North American Chapter of the Association for Computational Linguistics","author":"I Sen","year":"2022","unstructured":"Sen I, Samory M, Wagner C, Augenstein I. Counterfactually augmented data and unintended bias: The case of sexism and hate speech detection. In: Proceedings of 2022 Conference of the North American Chapter of the Association for Computational Linguistics. 2022, 4716\u20134726"},{"key":"40579_CR186","first-page":"3668","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"N Joshi","year":"2022","unstructured":"Joshi N, He H. An investigation of the (in)effectiveness of counterfactually augmented data. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 3668\u20133681"},{"key":"40579_CR187","unstructured":"Zhang Q, Duan Q, Yuan B, Shi Y, Liu J. 
Exploring accuracy-fairness trade-off in large language models. 2024, arXiv preprint arXiv: 2411.14500"},{"issue":"9","key":"40579_CR188","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1007\/s10462-024-10896-y","volume":"57","author":"Z Lin","year":"2024","unstructured":"Lin Z, Guan S, Zhang W, Zhang H, Li Y, Zhang H. Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models. Artificial Intelligence Review, 2024, 57(9): 243","journal-title":"Artificial Intelligence Review"},{"key":"40579_CR189","first-page":"9061","volume-title":"Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics","author":"N Yang","year":"2024","unstructured":"Yang N, Kang T, Choi S J, Lee H, Jung K. Mitigating biases for instruction-following language models via bias neurons elimination. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. 2024, 9061\u20139073"},{"key":"40579_CR190","first-page":"5071","volume-title":"Proceedings of the 13th Language Resources and Evaluation Conference","author":"H Yadav","year":"2022","unstructured":"Yadav H, Sitaram S. A survey of multilingual models for automatic speech recognition. In: Proceedings of the 13th Language Resources and Evaluation Conference. 2022, 5071\u20135079"},{"key":"40579_CR191","first-page":"410","volume-title":"Proceedings of the 37th International Conference on Machine Learning","author":"J Hu","year":"2020","unstructured":"Hu J, Ruder S, Siddhant A, Neubig G, Firat O, Johnson M. XTREME: a massively multilingual multi-task benchmark for evaluating cross-lingual generalization. In: Proceedings of the 37th International Conference on Machine Learning. 2020, 410"},{"key":"40579_CR192","first-page":"4423","volume-title":"Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing","author":"P Dufter","year":"2020","unstructured":"Dufter P, Sch\u00fctze H. 
Identifying elements essential for BERT\u2019s multilinguality. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing. 2020, 4423\u20134437"},{"key":"40579_CR193","first-page":"5347","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"A Nzeyimana","year":"2022","unstructured":"Nzeyimana A, Niyongabo Rubungo A. KinyaBERT: a morphology-aware Kinyarwanda language model. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics. 2022, 5347\u20135363"},{"key":"40579_CR194","unstructured":"Naveed H, Khan A U, Qiu S, Saqib M, Anwar S, Usman M, Akhtar N, Barnes N, Mian A. A comprehensive overview of large language models. 2023, arXiv preprint arXiv: 2307.06435"},{"key":"40579_CR195","first-page":"1946","volume-title":"Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics","author":"X Pan","year":"2017","unstructured":"Pan X, Zhang B, May J, Nothman J, Knight K, Ji H. Cross-lingual name tagging and linking for 282 languages. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017, 1946\u20131958"},{"key":"40579_CR196","doi-asserted-by":"publisher","first-page":"10467","DOI":"10.18653\/v1\/2021.emnlp-main.818","volume-title":"Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing","author":"F Liu","year":"2021","unstructured":"Liu F, Bugliarello E, Ponti E M, Reddy S, Collier N, Elliott D. Visually grounded reasoning across languages and cultures. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 
2021, 10467\u201310485"}],"container-title":["Frontiers of Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11704-024-40579-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11704-024-40579-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11704-024-40579-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,6]],"date-time":"2025-04-06T02:34:16Z","timestamp":1743906856000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11704-024-40579-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4,3]]},"references-count":196,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["40579"],"URL":"https:\/\/doi.org\/10.1007\/s11704-024-40579-4","relation":{},"ISSN":["2095-2228","2095-2236"],"issn-type":[{"value":"2095-2228","type":"print"},{"value":"2095-2236","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,4,3]]},"assertion":[{"value":"7 June 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 December 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 April 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Competing interests The authors declare that they have no competing interests or financial conflicts to disclose.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics"}}],"article-number":"1911362"}}