{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,16]],"date-time":"2026-05-16T16:19:39Z","timestamp":1778948379938,"version":"3.51.4"},"reference-count":105,"publisher":"Association for Computing Machinery (ACM)","issue":"5","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2026,4,30]]},"abstract":"<jats:p>Cross-lingual word representations allow us to analyse word meanings across diverse language settings. It is crucial in aiding cross-lingual knowledge transfer when constructing natural language processing (NLP) models for languages with limited resources. This survey presents a comprehensive classification of cross-lingual contextual embedding models. We assess their data requirements and objective functions, and we introduce a taxonomy for categorising these approaches. Then, we present a comprehensive table containing a set of hierarchical criteria to compare them better, along with information regarding the availability of code and data to enable replication of the research. Furthermore, we delve into the evaluation methodologies employed for cross-lingual embeddings, exploring their practical applications and addressing their current associated challenges.<\/jats:p>","DOI":"10.1145\/3764112","type":"journal-article","created":{"date-parts":[[2025,8,26]],"date-time":"2025-08-26T11:37:41Z","timestamp":1756208261000},"page":"1-34","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Lost in Alignment: A Survey on Cross-Lingual Alignment Methods for Contextualized Representation"],"prefix":"10.1145","volume":"58","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-0190-2833","authenticated-orcid":false,"given":"Filippo","family":"Pallucchini","sequence":"first","affiliation":[{"name":"Statistics and Quantitative Method, University of Milan-Bicocca","place":["Milano, Italy"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0222-9365","authenticated-orcid":false,"given":"Lorenzo","family":"Malandri","sequence":"additional","affiliation":[{"name":"Statistics and Quantitative Method, University of Milan-Bicocca","place":["Milano, Italy"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6864-2702","authenticated-orcid":false,"given":"Fabio","family":"Mercorio","sequence":"additional","affiliation":[{"name":"Statistics and Quantitative Method, University of Milan-Bicocca","place":["Milano, Italy"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0399-2810","authenticated-orcid":false,"given":"Mario","family":"Mezzanzanica","sequence":"additional","affiliation":[{"name":"Statistics and Quantitative Method, University of Milan-Bicocca","place":["Milano, Italy"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,11,20]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1145\/3539618.3592006"},{"key":"e_1_3_2_3_2","first-page":"3906","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Aldarmaki Hanan","year":"2019","unstructured":"Hanan Aldarmaki and Mona Diab. 2019. Context-aware cross-lingual mapping. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 3906\u20133911."},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","first-page":"3904","DOI":"10.18653\/v1\/2021.findings-emnlp.329","volume-title":"Findings of the Association for Computational Linguistics: EMNLP 2021","author":"Alqahtani Sawsan","year":"2021","unstructured":"Sawsan Alqahtani, Garima Lalwani, Yi Zhang, Salvatore Romeo, and Saab Mansour. 2021. Using optimal transport as alignment objective for fine-tuning multilingual contextualized embeddings. In Findings of the Association for Computational Linguistics: EMNLP 2021. 3904\u20133919."},{"key":"e_1_3_2_5_2","unstructured":"Waleed Ammar George Mulcaire Yulia Tsvetkov Guillaume Lample Chris Dyer and Noah A. Smith. 2016. Massively multilingual word embeddings. arXiv:1602.01925. Retrieved from https:\/\/arxiv.org\/abs\/1602.01925"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.5555\/3504035.3504649"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1073"},{"key":"e_1_3_2_8_2","doi-asserted-by":"crossref","unstructured":"Mikel Artetxe and Holger Schwenk. 2019. Massively multilingual sentence embeddings for zero-shot cross-lingual transfer and beyond. Transactions of the Association for Computational Linguistics 7 (2019) 597\u2013610.","DOI":"10.1162\/tacl_a_00288"},{"key":"e_1_3_2_9_2","doi-asserted-by":"crossref","first-page":"562","DOI":"10.18653\/v1\/2025.naacl-short.48","volume-title":"Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)","author":"Bakos Steve","year":"2025","unstructured":"Steve Bakos, David Guzm\u00e1n, Riddhi More, Kelly Chutong Li, F\u00e9lix Gaschi, and En-Shiun Annie Lee. 2025. AlignFreeze: Navigating the impact of realignment on the layers of multilingual models across diverse languages. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers). 562\u2013586."},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.emnlp-main.233"},{"key":"e_1_3_2_11_2","volume-title":"Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Briakou Eleftheria","year":"2023","unstructured":"Eleftheria Briakou, Colin Cherry, and George Foster. 2023. Searching for needles in a haystack: On the role of incidental bilingualism in palm\u2019s translation capability. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics."},{"key":"e_1_3_2_12_2","unstructured":"Peter F. Brown Stephen A. Della Pietra Vincent J. Della Pietra Robert L. Mercer et\u00a0al. 1993. The mathematics of statistical machine translation: Parameter estimation. Computational linguistics 19 2 (1993) 263\u2013311."},{"key":"e_1_3_2_13_2","volume-title":"International Conference on Learning Representations","author":"Cao Steven","year":"2020","unstructured":"Steven Cao, Nikita Kitaev, and Dan Klein. 2020. Multilingual alignment of contextual word representations. In International Conference on Learning Representations."},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6256"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.280"},{"key":"e_1_3_2_16_2","first-page":"3418","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Chi Zewen","year":"2021","unstructured":"Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, He-Yan Huang, and Furu Wei. 2021. Improving pretrained cross-lingual language models via self-labeled word alignment. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 3418\u20133430."},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"Seongkuk Cho Jihoon Moon Junhyeok Bae Jiwon Kang and Sangwook Lee. 2023. A framework for understanding unstructured financial documents using RPA and multimodal approach. Electronics 12 4 (2023) 939.","DOI":"10.3390\/electronics12040939"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"e_1_3_2_19_2","unstructured":"Alexis Conneau and Guillaume Lample. 2019. Cross-lingual language model pretraining. Proceedings of the 33rd International Conference on Neural Information Processing Systems. 7059\u20137069."},{"key":"e_1_3_2_20_2","volume-title":"International Conference on Learning Representations","author":"Conneau Alexis","year":"2018","unstructured":"Alexis Conneau, Guillaume Lample, Marc\u2019Aurelio Ranzato, Ludovic Denoyer, and Herv\u00e9 J\u00e9gou. 2018. Word translation without parallel data. In International Conference on Learning Representations."},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","first-page":"2183","DOI":"10.18653\/v1\/2020.semeval-1.290","volume-title":"Proceedings of the Fourteenth Workshop on Semantic Evaluation","author":"Dadu Tanvi","year":"2020","unstructured":"Tanvi Dadu and Kartikey Pant. 2020. Team rouges at SemEval-2020 task 12: Cross-lingual inductive transfer to detect offensive language. In Proceedings of the Fourteenth Workshop on Semantic Evaluation. 2183\u20132189."},{"key":"e_1_3_2_22_2","first-page":"1","volume-title":"2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA)","author":"D\u2019Amico Simone","year":"2024","unstructured":"Simone D\u2019Amico, Lorenzo Malandri, Fabio Mercorio, Mario Mezzanzanica, and Filippo Pallucchini. 2024. Alignment of multilingual embeddings to estimate job similarities in online labour market. In 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA). IEEE, 1\u201310."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","unstructured":"Jingcheng Deng Zhongtao Jiang Liang Pang Liwei Chen Kun Xu Zihao Wei Huawei Shen and Xueqi Cheng. 2025. Following the autoregressive nature of LLM embeddings via compression and alignment. CoRR abs\/2502.11401 (February 2025). Retrieved from 10.48550\/arXiv.2502.11401","DOI":"10.48550\/arXiv.2502.11401"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N19-1423"},{"key":"e_1_3_2_25_2","first-page":"4372","volume-title":"Proceedings of the 29th International Conference on Computational Linguistics","author":"Ding Kunbo","year":"2022","unstructured":"Kunbo Ding, Weijie Liu, Yuejian Fang, Weiquan Mao, Zhe Zhao, Tao Zhu, Haoyan Liu, Rong Tian, and Yiren Chen. 2022. A simple and effective method to improve zero-shot cross-lingual transfer learning. In Proceedings of the 29th International Conference on Computational Linguistics. 4372\u20134380."},{"key":"e_1_3_2_26_2","first-page":"644","volume-title":"Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Dyer Chris","year":"2013","unstructured":"Chris Dyer, Victor Chahuneau, and Noah A. Smith. 2013. A simple, fast, and effective reparameterization of IBM model 2. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 644\u2013648."},{"key":"e_1_3_2_27_2","first-page":"51","volume-title":"European Conference on Information Retrieval","author":"Efimov Pavel","year":"2023","unstructured":"Pavel Efimov, Leonid Boytsov, Elena Arslanova, and Pavel Braslavski. 2023. The impact of cross-lingual adjustment of contextual word representations on zero-shot transfer. In European Conference on Information Retrieval. Springer, 51\u201367."},{"key":"e_1_3_2_28_2","unstructured":"Akiko Eriguchi Melvin Johnson Orhan Firat Hideto Kazawa and Wolfgang Macherey. 2018. Zero-shot cross-lingual classification using multilingual neural machine translation. arXiv:1809.04686. Retrieved from https:\/\/arxiv.org\/abs\/1809.04686"},{"key":"e_1_3_2_29_2","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Ethayarajh Kawin","year":"2019","unstructured":"Kawin Ethayarajh. 2019. How contextual are contextualized word representations? comparing the geometry of BERT, ELMo, and GPT-2 embeddings. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics."},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.acl-long.62"},{"key":"e_1_3_2_31_2","first-page":"2026","volume-title":"Proceedings of the 31st International Conference on Computational Linguistics","author":"Feng Zihao","year":"2025","unstructured":"Zihao Feng, Hailong Cao, Wang Xu, and Tiejun Zhao. 2025. Word-level cross-lingual structure in large language models. In Proceedings of the 31st International Conference on Computational Linguistics. 2026\u20132037."},{"key":"e_1_3_2_32_2","volume-title":"International Conference on Learning Representations","author":"Gao Jun","unstructured":"Jun Gao, Di He, Xu Tan, Tao Qin, Liwei Wang, and Tieyan Liu. Representation degeneration problem in training natural language generation models. In International Conference on Learning Representations."},{"key":"e_1_3_2_33_2","first-page":"30","volume-title":"Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Ghader Hamidreza","year":"2017","unstructured":"Hamidreza Ghader and Christof Monz. 2017. What does attention in neural machine translation pay attention to?. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 30\u201339."},{"key":"e_1_3_2_34_2","doi-asserted-by":"crossref","first-page":"568","DOI":"10.1007\/978-3-030-62466-8_35","volume-title":"The Semantic Web\u2013ISWC 2020: 19th International Semantic Web Conference, Athens, Greece, November 2\u20136, 2020, Proceedings, Part II 19","author":"Giabelli Anna","year":"2020","unstructured":"Anna Giabelli, Lorenzo Malandri, Fabio Mercorio, Mario Mezzanzanica, and Andrea Seveso. 2020. NEO: A tool for taxonomy enrichment with new emerging occupations. In The Semantic Web\u2013ISWC 2020: 19th International Semantic Web Conference, Athens, Greece, November 2\u20136, 2020, Proceedings, Part II 19. Springer, 568\u2013584."},{"key":"e_1_3_2_35_2","first-page":"35","volume-title":"EACL 2024-18th Conference of the European Chapter of the Association for Computational Linguistics","author":"Godey Nathan","year":"2024","unstructured":"Nathan Godey, Eric Villemonte de La Clergerie, and Beno\u00eet Sagot. 2024. Anisotropy is inherent to self-attention in transformers. In EACL 2024-18th Conference of the European Chapter of the Association for Computational Linguistics. 35\u201348."},{"key":"e_1_3_2_36_2","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014. 2672\u20132680."},{"key":"e_1_3_2_37_2","doi-asserted-by":"crossref","first-page":"9099","DOI":"10.18653\/v1\/2021.emnlp-main.716","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing","author":"Goswami Koustava","year":"2021","unstructured":"Koustava Goswami, Sourav Dutta, Haytham Assem, Theodorus Fransen, and John Philip McCrae. 2021. Cross-lingual sentence embedding using multi-task learning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 9099\u20139113."},{"key":"e_1_3_2_38_2","doi-asserted-by":"crossref","first-page":"371","DOI":"10.18653\/v1\/2021.findings-acl.32","volume-title":"Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021","author":"Gritta Milan","year":"2021","unstructured":"Milan Gritta and Ignacio Iacobacci. 2021. XeroAlign: Zero-shot cross-lingual transformer alignment. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. 371\u2013381."},{"key":"e_1_3_2_39_2","unstructured":"Jiatao Gu Hany Hassan Jacob Devlin and Victor OK Li. 2018. Universal neural machine translation for extremely low resource languages. arXiv:1802.05368. Retrieved from https:\/\/arxiv.org\/abs\/1802.05368"},{"key":"e_1_3_2_40_2","doi-asserted-by":"crossref","first-page":"2316","DOI":"10.18653\/v1\/2022.findings-acl.182","volume-title":"Findings of the Association for Computational Linguistics: ACL 2022","author":"H\u00e4mmerl Katharina","year":"2022","unstructured":"Katharina H\u00e4mmerl, Jind\u0159ich Libovick\u1ef3, and Alexander Fraser. 2022. Combining static and contextualised multilingual embeddings. In Findings of the Association for Computational Linguistics: ACL 2022. 2316\u20132329."},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2024.findings-acl.649"},{"key":"e_1_3_2_42_2","doi-asserted-by":"crossref","first-page":"2101","DOI":"10.18653\/v1\/2022.findings-emnlp.154","volume-title":"Findings of the Association for Computational Linguistics: EMNLP 2022","author":"Heffernan Kevin","year":"2022","unstructured":"Kevin Heffernan, Onur \u00c7elebi, and Holger Schwenk. 2022. Bitext mining using distilled sentence representations for low-resource languages. In Findings of the Association for Computational Linguistics: EMNLP 2022. 2101\u20132112."},{"key":"e_1_3_2_43_2","first-page":"3633","volume-title":"Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies","author":"Hu Junjie","year":"2021","unstructured":"Junjie Hu, Melvin Johnson, Orhan Firat, Aditya Siddhant, and Graham Neubig. 2021. Explicit alignment objectives for multilingual bidirectional encoders. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 3633\u20133643."},{"key":"e_1_3_2_44_2","first-page":"4411","volume-title":"International Conference on Machine Learning","author":"Hu Junjie","year":"2020","unstructured":"Junjie Hu, Sebastian Ruder, Aditya Siddhant, Graham Neubig, Orhan Firat, and Melvin Johnson. 2020. Xtreme: A massively multilingual multi-task benchmark for evaluating cross-lingual generalisation. In International Conference on Machine Learning. PMLR, 4411\u20134421."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1252"},{"key":"e_1_3_2_46_2","unstructured":"Minyoung Huh Brian Cheung Tongzhou Wang and Phillip Isola. 2024. The platonic representation hypothesis. arXiv preprint arXiv:2405.07987 (2024)."},{"key":"e_1_3_2_47_2","first-page":"1623","volume-title":"Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Ils Alexandra","year":"2021","unstructured":"Alexandra Ils, Dan Liu, Daniela Grunow, and Steffen Eger. 2021. Changes in european solidarity before and during covid-19: Evidence from a large crowd-and expert-annotated twitter dataset. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 1623\u20131637."},{"key":"e_1_3_2_48_2","unstructured":"Rishi Jha Collin Zhang Vitaly Shmatikov and John X. Morris. 2025. Harnessing the universal geometry of embeddings. arXiv:2405.07987. Retrieved from https:\/\/arxiv.org\/abs\/2405.07987"},{"key":"e_1_3_2_49_2","doi-asserted-by":"crossref","first-page":"13906","DOI":"10.18653\/v1\/2024.emnlp-main.770","volume-title":"Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing","author":"Jiang Fan","year":"2024","unstructured":"Fan Jiang, Tom Drummond, and Trevor Cohn. 2024. Pre-training cross-lingual open domain question answering with large-scale synthetic supervision. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 13906\u201313933."},{"key":"e_1_3_2_50_2","unstructured":"Zhuolin Jiang Amro El-Jaroudi William Hartmann Damianos Karakos and Lingjun Zhao. 2020. Cross-lingual information retrieval with BERT. arXiv:2004.13005. Retrieved from https:\/\/arxiv.org\/abs\/2004.13005"},{"key":"e_1_3_2_51_2","unstructured":"Staffs Keele et\u00a0al. 2007. Guidelines for performing systematic literature reviews in software engineering. (2007)."},{"key":"e_1_3_2_52_2","first-page":"1459","volume-title":"Proceedings of COLING 2012","author":"Klementiev Alexandre","year":"2012","unstructured":"Alexandre Klementiev, Ivan Titov, and Binod Bhattarai. 2012. Inducing crosslingual distributed representations of words. In Proceedings of COLING 2012. 1459\u20131474."},{"key":"e_1_3_2_53_2","doi-asserted-by":"crossref","unstructured":"Philipp Koehn and Rebecca Knowles. 2017. Six challenges for neural machine translation. Proceedings of the First Workshop on Neural Machine Translation NMT@ACL 2017 Vancouver Canada August 4 2017. Association for Computational Linguistics 28\u201339.","DOI":"10.18653\/v1\/W17-3204"},{"key":"e_1_3_2_54_2","volume-title":"International Conference on Learning Representations","author":"Lample Guillaume","year":"2018","unstructured":"Guillaume Lample, Alexis Conneau, Marc\u2019Aurelio Ranzato, Ludovic Denoyer, and Herv\u00e9 J\u00e9gou. 2018. Word translation without parallel data. In International Conference on Learning Representations."},{"key":"e_1_3_2_55_2","first-page":"8051","volume-title":"Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)","author":"Li Chong","year":"2024","unstructured":"Chong Li, Shaonan Wang, Jiajun Zhang, and Chengqing Zong. 2024. Improving in-context learning of multilingual generative language models with cross-lingual alignment. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). 8051\u20138069."},{"key":"e_1_3_2_56_2","first-page":"8212","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Li Juntao","year":"2020","unstructured":"Juntao Li, Chang Liu, Jian Wang, Lidong Bing, Hongsong Li, Xiaozhong Liu, Dongyan Zhao, and Rui Yan. 2020. Cross-lingual low-resource set-to-description retrieval for global e-commerce. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 8212\u20138219."},{"key":"e_1_3_2_57_2","first-page":"12461","volume-title":"Findings of the Association for Computational Linguistics: ACL 2023","author":"Li Tianjian","year":"2023","unstructured":"Tianjian Li and Kenton Murray. 2023. Why does zero-shot cross-lingual generation fail? an explanation and a solution. In Findings of the Association for Computational Linguistics: ACL 2023. 12461\u201312476."},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1124"},{"key":"e_1_3_2_59_2","doi-asserted-by":"crossref","first-page":"1663","DOI":"10.18653\/v1\/2020.findings-emnlp.150","volume-title":"Findings of the Association for Computational Linguistics: EMNLP 2020","author":"Libovick\u1ef3 Jind\u0159ich","year":"2020","unstructured":"Jind\u0159ich Libovick\u1ef3, Rudolf Rosa, and Alexander Fraser. 2020. On the language neutrality of pre-trained multilingual representations. In Findings of the Association for Computational Linguistics: EMNLP 2020. 1663\u20131674."},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2025.acl-long.778"},{"key":"e_1_3_2_61_2","first-page":"4381","volume-title":"Proceedings of the 29th International Conference on Computational Linguistics","author":"Liu Linlin","year":"2022","unstructured":"Linlin Liu, Thien Hai Nguyen, Shafiq Joty, Lidong Bing, and Luo Si. 2022. Towards multi-sense cross-lingual alignment of contextual embeddings. In Proceedings of the 29th International Conference on Computational Linguistics. 4381\u20134396."},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","unstructured":"Lorenzo Malandri Fabio Mercorio Mario Mezzanzanica and Navid Nobani. 2021. MEET-LM: A method for embeddings evaluation for taxonomic data in the labour market. Computers in Industry 124 (2021) 103341. 10.1016\/j.compind.2020.103341","DOI":"10.1016\/j.compind.2020.103341"},{"key":"e_1_3_2_63_2","doi-asserted-by":"crossref","unstructured":"Lorenzo Malandri Fabio Mercorio Mario Mezzanzanica and Filippo Pallucchini. 2025. SeNSe: Embedding alignment via semantic anchors selection. International Journal of Data Science and Analytics 20 1 (2025) 167\u2013181.","DOI":"10.1007\/s41060-024-00522-z"},{"key":"e_1_3_2_64_2","unstructured":"Bryan McCann James Bradbury Caiming Xiong and Richard Socher. 2017. Learned in translation: Contextualized word vectors. Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017. 6294\u20136305."},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.5555\/2002736.2002775"},{"key":"e_1_3_2_66_2","unstructured":"Tomas Mikolov Quoc V. Le and Ilya Sutskever. 2013. Exploiting similarities among languages for machine translation. arXiv e-prints (2013) arXiv-1309."},{"key":"e_1_3_2_67_2","unstructured":"Tomas Mikolov Ilya Sutskever Kai Chen Greg S. Corrado and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8 2013 Lake Tahoe Nevada United States. 3111\u20133119."},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1308"},{"key":"e_1_3_2_69_2","doi-asserted-by":"crossref","first-page":"555","DOI":"10.18653\/v1\/2020.emnlp-main.41","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Nagata Masaaki","year":"2020","unstructured":"Masaaki Nagata, Katsuki Chousa, and Masaaki Nishino. 2020. A supervised word alignment method based on cross-language span prediction using multilingual BERT. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 555\u2013565."},{"key":"e_1_3_2_70_2","doi-asserted-by":"crossref","unstructured":"Roberto Navigli. 2009. Word sense disambiguation: A survey. ACM Computing Surveys (CSUR) 41 2 (2009) 1\u201369.","DOI":"10.1145\/1459352.1459355"},{"key":"e_1_3_2_71_2","doi-asserted-by":"crossref","unstructured":"Franz Josef Och and Hermann Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics 29 1 (2003) 19\u201351.","DOI":"10.1162\/089120103321337421"},{"key":"e_1_3_2_72_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1492"},{"key":"e_1_3_2_73_2","doi-asserted-by":"crossref","unstructured":"Robert \u00d6stling and J\u00f6rg Tiedemann. 2016. Efficient word alignment with markov chain monte carlo. The Prague Bulletin of Mathematical Linguistics 106 (2016) 125\u2013146.","DOI":"10.1515\/pralin-2016-0013"},{"key":"e_1_3_2_74_2","volume-title":"Annual Conference of the North American Chapter of the Association for Computational Linguistics","author":"Pan Lin","year":"2021","unstructured":"Lin Pan, Chung-Wei Hang, Haode Qi, Abhishek Shah, Saloni Potdar, and Mo Yu. 2021. Multilingual BERT post-pretraining alignment. In Annual Conference of the North American Chapter of the Association for Computational Linguistics."},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1202"},{"key":"e_1_3_2_77_2","unstructured":"Matthew E. Peters Mark Neumann Mohit Iyyer Matt Gardner Christopher Clark Kenton Lee and Luke Zettlemoyer. 2018. Deep contextualized word representations. arXiv:1802.05365. Retrieved from https:\/\/arxiv.org\/abs\/1802.05365"},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1493"},{"key":"e_1_3_2_79_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2022.findings-emnlp.93"},{"key":"e_1_3_2_80_2","doi-asserted-by":"crossref","first-page":"1309","DOI":"10.18653\/v1\/2022.findings-acl.103","volume-title":"Findings of the Association for Computational Linguistics: ACL 2022","author":"Rajaee Sara","year":"2022","unstructured":"Sara Rajaee and Mohammad Taher Pilehvar. 2022. An isotropy analysis in the multilingual BERT embedding space. In Findings of the Association for Computational Linguistics: ACL 2022. 1309\u20131316."},{"key":"e_1_3_2_81_2","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Reimers Nils","year":"2020","unstructured":"Nils Reimers and Iryna Gurevych. 2020. Making monolingual sentence embeddings multilingual using knowledge distillation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics."},{"key":"e_1_3_2_82_2","first-page":"29","volume-title":"Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing","author":"R\u00f6nnqvist Samuel","year":"2019","unstructured":"Samuel R\u00f6nnqvist, Jenna Kanerva, Tapio Salakoski, and Filip Ginter. 2019. Is multilingual BERT fluent in language generation?. In Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing. 29\u201336."},{"key":"e_1_3_2_83_2","doi-asserted-by":"crossref","unstructured":"Sebastian Ruder Ivan Vuli\u0107 and Anders S\u00f8gaard. 2019. A survey of cross-lingual word embedding models. Journal of Artificial Intelligence Research 65 (2019) 569\u2013631.","DOI":"10.1613\/jair.1.11640"},{"key":"e_1_3_2_84_2","first-page":"1627","volume-title":"EMNLP 2020","author":"Sabet Masoud Jalili","year":"2020","unstructured":"Masoud Jalili Sabet, Philipp Dufter, Fran\u00e7ois Yvon, and Hinrich Sch\u00fctze. 2020. SimAlign: High quality word alignments without parallel training data using static and contextualized embeddings. In EMNLP 2020. 1627\u20131643."},{"key":"e_1_3_2_85_2","doi-asserted-by":"crossref","unstructured":"Dominik Schlechtweg Anna H\u00e4tty Marco Del Tredici and Sabine Schulte im Walde. 2019. A wind of change: Detecting and evaluating lexical semantic change across times and domains. arXiv:1906.02979. Retrieved from https:\/\/arxiv.org\/abs\/1906.02979","DOI":"10.18653\/v1\/P19-1072"},{"key":"e_1_3_2_86_2","first-page":"1599","volume-title":"Proceedings of NAACL-HLT","author":"Schuster Tal","year":"2019","unstructured":"Tal Schuster, Ori Ram, Regina Barzilay, and Amir Globerson. 2019. Cross-lingual alignment of contextual word embeddings, with applications to zero-shot dependency parsing. In Proceedings of NAACL-HLT. 1599\u20131613."},{"key":"e_1_3_2_87_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6414"},{"key":"e_1_3_2_88_2","doi-asserted-by":"crossref","first-page":"778","DOI":"10.18653\/v1\/P18-1072","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"S\u00f8gaard Anders","year":"2018","unstructured":"Anders S\u00f8gaard, Sebastian Ruder, and Ivan Vuli\u0107. 2018. On the limitations of unsupervised bilingual dictionary induction. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 778\u2013788."},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2025.acl-long.118"},{"key":"e_1_3_2_90_2","first-page":"1477","volume-title":"Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics","author":"Tan Weiting","year":"2023","unstructured":"Weiting Tan, Kevin Heffernan, Holger Schwenk, and Philipp Koehn. 2023. Multilingual representation distillation with contrastive learning. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. 1477\u20131490."},{"key":"e_1_3_2_91_2","doi-asserted-by":"crossref","first-page":"8696","DOI":"10.18653\/v1\/2022.acl-long.595","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Tien Chih-chan","year":"2022","unstructured":"Chih-chan Tien and Shane Steinert-Threlkeld. 2022. Bilingual alignment transfers to multilingual alignment for unsupervised parallel text mining. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 8696\u20138706."},{"key":"e_1_3_2_92_2","doi-asserted-by":"crossref","unstructured":"Matej Ul\u010dar and Marko Robnik-\u0160ikonja. 2022. Cross-lingual alignments of ELMo contextual embeddings. Neural Computing and Applications 34 15 (2022) 13043\u201313061.","DOI":"10.1007\/s00521-022-07164-x"},{"key":"e_1_3_2_93_2","doi-asserted-by":"crossref","first-page":"8163","DOI":"10.18653\/v1\/2024.findings-acl.486","volume-title":"Findings of the Association for Computational Linguistics: ACL 2024","author":"Vasilyev Oleg","year":"2024","unstructured":"Oleg Vasilyev, Fumika Isono, and John Bohannon. 2024. Linear cross-lingual mapping of sentence embeddings. In Findings of the Association for Computational Linguistics: ACL 2024. 8163\u20138171."},{"key":"e_1_3_2_94_2","doi-asserted-by":"crossref","unstructured":"Chao Wang Hengshu Zhu Peng Wang Chen Zhu Xi Zhang Enhong Chen and Hui Xiong. 2021. Personalized and explainable employee training course recommendations: A bayesian variational approach. ACM Transactions on Information Systems (TOIS) 40 4 (2021) 1\u201332.","DOI":"10.1145\/3490476"},{"key":"e_1_3_2_95_2","first-page":"5721","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Wang Yuxuan","year":"2019","unstructured":"Yuxuan Wang, Wanxiang Che, Jiang Guo, Yijia Liu, and Ting Liu. 2019. Cross-lingual BERT transformation for zero-shot dependency parsing. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 5721\u20135727."},{"key":"e_1_3_2_96_2","volume-title":"International Conference on Learning Representations","author":"Wang Zirui","year":"2020","unstructured":"Zirui Wang, Jiateng Xie, Ruochen Xu, Yiming Yang, Graham Neubig, and Jaime G. Carbonell. 2020. Cross-lingual alignment vs joint training: A comparative study and a simple unified framework. In International Conference on Learning Representations."},{"key":"e_1_3_2_97_2","doi-asserted-by":"crossref","first-page":"4602","DOI":"10.18653\/v1\/P19-1453","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics","author":"Wieting John","year":"2019","unstructured":"John Wieting, Kevin Gimpel, Graham Neubig, and Taylor Berg-Kirkpatrick. 2019. Simple and effective paraphrastic similarity from parallel translations. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 4602\u20134608."},{"key":"e_1_3_2_98_2","unstructured":"Shijie Wu Alexis Conneau Haoran Li Luke Zettlemoyer and Veselin Stoyanov. 2019. Emerging cross-lingual structure in pretrained language models. arXiv:1911.01464. Retrieved from https:\/\/arxiv.org\/abs\/1911.01464"},{"key":"e_1_3_2_99_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1077"},{"key":"e_1_3_2_100_2","first-page":"4471","volume-title":"Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)","author":"Wu Shijie","year":"2020","unstructured":"Shijie Wu and Mark Dredze. 2020. Do explicit alignments robustly improve multilingual encoders?. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 4471\u20134482."},{"key":"e_1_3_2_101_2","unstructured":"Haoran Xu and Philipp Koehn. 2021. Cross-lingual bert contextual embedding space mapping with isotropic and isometric conditions. arXiv:2107.09186. Retrieved from https:\/\/arxiv.org\/abs\/2107.09186"},{"key":"e_1_3_2_102_2","first-page":"3035","volume-title":"Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing","author":"Yi Jingwei","year":"2022","unstructured":"Jingwei Yi, Fangzhao Wu, Chuhan Wu, Xiaolong Huang, Binxing Jiao, Guangzhong Sun, and Xing Xie. 2022. Effective and efficient query-aware snippet extraction for web search. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 3035\u20133046."},{"key":"e_1_3_2_103_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1179"},{"key":"e_1_3_2_104_2","unstructured":"Zheng Zhang Ruiqing Yin Jun Zhu and Pierre Zweigenbaum. 2019. Cross-lingual contextual word embeddings mapping with multi-sense words in mind. arXiv:1909.08681. Retrieved from https:\/\/arxiv.org\/abs\/1909.08681"},{"key":"e_1_3_2_105_2","doi-asserted-by":"crossref","first-page":"229","DOI":"10.18653\/v1\/2021.starsem-1.22","volume-title":"Proceedings of* SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics","author":"Zhao Wei","year":"2021","unstructured":"Wei Zhao, Steffen Eger, Johannes Bjerva, and Isabelle Augenstein. 2021. Inducing language-agnostic multilingual representations. In Proceedings of* SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics. 229\u2013240."},{"key":"e_1_3_2_106_2","first-page":"430","volume-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Zhou Huiwei","year":"2015","unstructured":"Huiwei Zhou, Long Chen, Fulin Shi, and Degen Huang. 2015. Learning bilingual sentiment word embeddings for cross-language sentiment classification. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 430\u2013440."}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3764112","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T13:35:12Z","timestamp":1763645712000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3764112"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,20]]},"references-count":105,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2026,4,30]]}},"alternative-id":["10.1145\/3764112"],"URL":"https:\/\/doi.org\/10.1145\/3764112","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,20]]},"assertion":[{"value":"2023-10-31","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-08-11","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-11-20","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}