{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,20]],"date-time":"2025-09-20T09:14:47Z","timestamp":1758359687661,"version":"3.44.0"},"reference-count":68,"publisher":"Association for Natural Language Processing","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Journal of Natural Language Processing"],"published-print":{"date-parts":[[2025]]},"DOI":"10.5715\/jnlp.32.886","type":"journal-article","created":{"date-parts":[[2025,9,14]],"date-time":"2025-09-14T22:08:09Z","timestamp":1757887689000},"page":"886-917","source":"Crossref","is-referenced-by-count":0,"title":["Cross-lingual Contextualized Phrase Retrieval"],"prefix":"10.5715","volume":"32","author":[{"given":"Huayang","family":"Li","sequence":"first","affiliation":[{"name":"Nara Institute of Science and Technology"}]},{"given":"Deng","family":"Cai","sequence":"additional","affiliation":[{"name":"Tencent AI Lab"}]},{"given":"Zhi","family":"Qu","sequence":"additional","affiliation":[{"name":"Nara Institute of Science and Technology"}]},{"given":"Qu","family":"Cui","sequence":"additional","affiliation":[{"name":"Tencent AI Lab"}]},{"given":"Hidetaka","family":"Kamigaito","sequence":"additional","affiliation":[{"name":"Nara Institute of Science and Technology"}]},{"given":"Lemao","family":"Liu","sequence":"additional","affiliation":[{"name":"Tencent AI Lab"}]},{"given":"Taro","family":"Watanabe","sequence":"additional","affiliation":[{"name":"Nara Institute of Science and Technology"}]}],"member":"3685","reference":[{"key":"1","unstructured":"Asai, A., Wu, Z., Wang, Y., Sil, A., and Hajishirzi, H. (2024). \u201cSelf-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection.\u201d In <i>The 12th International Conference on Learning Representations<\/i>."},{"key":"2","unstructured":"Bapna, A. and Firat, O. (2019). 
\u201cNon-Parametric Adaptation for Neural Machine Translation.\u201d In Burstein, J., Doran, C., and Solorio, T. (Eds.), <i>Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)<\/i>, pp. 1921\u20131931, Minneapolis, Minnesota. Association for Computational Linguistics."},{"key":"3","unstructured":"Borgeaud, S., Mensch, A., Hoffmann, J., Cai, T., Rutherford, E., Millican, K., Van Den Driessche, G. B., Lespiau, J.-B., Damoc, B., Clark, A., De Las Casas, D., Guy, A., Menick, J., Ring, R., Hennigan, T., Huang, S., Maggiore, L., Jones, C., Cassirer, A., Brock, A., Paganini, M., Irving, G., Vinyals, O., Osindero, S., Simonyan, K., Rae, J., Elsen, E., and Sifre, L. (2022). \u201cImproving Language Models by Retrieving from Trillions of Tokens.\u201d In Chaudhuri, K., Jegelka, S., Song, L., Szepesvari, C., Niu, G., and Sabato, S. (Eds.), <i>Proceedings of the 39th International Conference on Machine Learning<\/i>, Vol. 162 of <i>Proceedings of Machine Learning Research<\/i>, pp. 2206\u20132240. PMLR."},{"key":"4","unstructured":"Brown, P. F., Della Pietra, S. A., Della Pietra, V. J., and Mercer, R. L. (1993). \u201cThe Mathematics of Statistical Machine Translation: Parameter Estimation.\u201d <i>Computational Linguistics<\/i>, 19 (2), pp. 263\u2013311."},{"key":"5","doi-asserted-by":"crossref","unstructured":"Cai, D., Li, X., Ho, J. C.-S., Bing, L., and Lam, W. (2022). \u201cRetrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation.\u201d In Goldberg, Y., Kozareva, Z., and Zhang, Y. (Eds.), <i>Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing<\/i>, pp. 6456\u20136472, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.emnlp-main.433"},{"key":"6","doi-asserted-by":"crossref","unstructured":"Cai, D., Wang, Y., Li, H., Lam, W., and Liu, L. 
(2021). \u201cNeural Machine Translation with Monolingual Translation Memory.\u201d In Zong, C., Xia, F., Li, W., and Navigli, R. (Eds.), <i>Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)<\/i>, pp. 7307\u20137318, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2021.acl-long.567"},{"key":"7","unstructured":"Cao, B., Cai, D., Cui, L., Cheng, X., Bi, W., Zou, Y., and Shi, S. (2024). \u201cRetrieval is Accurate Generation.\u201d In <i>The 12th International Conference on Learning Representations<\/i>."},{"key":"8","doi-asserted-by":"crossref","unstructured":"Chiang, D. (2005). \u201cA Hierarchical Phrase-Based Model for Statistical Machine Translation.\u201d In Knight, K., Ng, H. T., and Oflazer, K. (Eds.), <i>Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL\u201905)<\/i>, pp. 263\u2013270, Ann Arbor, Michigan. Association for Computational Linguistics.","DOI":"10.3115\/1219840.1219873"},{"key":"9","doi-asserted-by":"crossref","unstructured":"Chidambaram, M., Yang, Y., Cer, D., Yuan, S., Sung, Y., Strope, B., and Kurzweil, R. (2019). \u201cLearning Cross-Lingual Sentence Representations via a Multi-task Dual-Encoder Model.\u201d In Augenstein, I., Gella, S., Ruder, S., Kann, K., Can, B., Welbl, J., Conneau, A., Ren, X., and Rei, M. (Eds.), <i>Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)<\/i>, pp. 250\u2013259, Florence, Italy. Association for Computational Linguistics.","DOI":"10.18653\/v1\/W19-4330"},{"key":"10","doi-asserted-by":"crossref","unstructured":"Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzm\u00e1n, F., Grave, E., Ott, M., Zettlemoyer, L., and Stoyanov, V. (2020). 
\u201cUnsupervised Cross-lingual Representation Learning at Scale.\u201d In Jurafsky, D., Chai, J., Schluter, N., and Tetreault, J. (Eds.), <i>Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics<\/i>, pp. 8440\u20138451, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"11","unstructured":"Cruse, D. A. (1986). <i>Lexical Semantics<\/i>. Cambridge University Press."},{"key":"12","doi-asserted-by":"crossref","unstructured":"Deguchi, H., Watanabe, T., Matsui, Y., Utiyama, M., Tanaka, H., and Sumita, E. (2023). \u201cSubset Retrieval Nearest Neighbor Machine Translation.\u201d In Rogers, A., Boyd-Graber, J., and Okazaki, N. (Eds.), <i>Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<\/i>, pp. 174\u2013189, Toronto, Canada. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2023.acl-long.10"},{"key":"13","unstructured":"Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2019). \u201cBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.\u201d In Burstein, J., Doran, C., and Solorio, T. (Eds.), <i>Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)<\/i>, pp. 4171\u20134186, Minneapolis, Minnesota. Association for Computational Linguistics."},{"key":"14","doi-asserted-by":"crossref","unstructured":"Dou, Z.-Y. and Neubig, G. (2021). \u201cWord Alignment by Fine-tuning Embeddings on Parallel Corpora.\u201d In Merlo, P., Tiedemann, J., and Tsarfaty, R. (Eds.), <i>Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume<\/i>, pp. 2112\u20132128, Online. 
Association for Computational Linguistics.","DOI":"10.18653\/v1\/2021.eacl-main.181"},{"key":"15","unstructured":"Douze, M., Guzhva, A., Deng, C., Johnson, J., Szilvasy, G., Mazar\u00e9, P.-E., Lomeli, M., Hosseini, L., and J\u00e9gou, H. (2024). \u201cThe Faiss Library.\u201d <i>arXiv preprint arXiv:2401.08281<\/i>."},{"key":"16","unstructured":"Dyer, C., Chahuneau, V., and Smith, N. A. (2013). \u201cA Simple, Fast, and Effective Reparameterization of IBM Model 2.\u201d In Vanderwende, L., Daum\u00e9 III, H., and Kirchhoff, K. (Eds.), <i>Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<\/i>, pp. 644\u2013648, Atlanta, Georgia. Association for Computational Linguistics."},{"key":"17","doi-asserted-by":"crossref","unstructured":"Federmann, C., Kocmi, T., and Xin, Y. (2022). \u201cNTREX-128 \u2013 News Test References for MT Evaluation of 128 Languages.\u201d In Ahuja, K., Anastasopoulos, A., Patra, B., Neubig, G., Choudhury, M., Dandapat, S., Sitaram, S., and Chaudhary, V. (Eds.), <i>Proceedings of the 1st Workshop on Scaling Up Multilingual Evaluation<\/i>, pp. 21\u201324, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.sumeval-1.4"},{"key":"18","doi-asserted-by":"crossref","unstructured":"Feng, F., Yang, Y., Cer, D., Arivazhagan, N., and Wang, W. (2022). \u201cLanguage-agnostic BERT Sentence Embedding.\u201d In Muresan, S., Nakov, P., and Villavicencio, A. (Eds.), <i>Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<\/i>, pp. 878\u2013891, Dublin, Ireland. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.acl-long.62"},{"key":"19","doi-asserted-by":"crossref","unstructured":"Gao, T., Yao, X., and Chen, D. (2021). \u201cSimCSE: Simple Contrastive Learning of Sentence Embeddings.\u201d In Moens, M.-F., Huang, X., Specia, L., and Yih, S. W.-t. 
(Eds.), <i>Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing<\/i>, pp. 6894\u20136910, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2021.emnlp-main.552"},{"key":"20","unstructured":"Ghader, H. and Monz, C. (2017). \u201cWhat does Attention in Neural Machine Translation Pay Attention to?\u201d In Kondrak, G. and Watanabe, T. (Eds.), <i>Proceedings of the 8th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)<\/i>, pp. 30\u201339, Taipei, Taiwan. Asian Federation of Natural Language Processing."},{"key":"21","doi-asserted-by":"crossref","unstructured":"Gillick, D., Kulkarni, S., Lansing, L., Presta, A., Baldridge, J., Ie, E., and Garcia-Olano, D. (2019). \u201cLearning Dense Representations for Entity Retrieval.\u201d In Bansal, M. and Villavicencio, A. (Eds.), <i>Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)<\/i>, pp. 528\u2013537, Hong Kong, China. Association for Computational Linguistics.","DOI":"10.18653\/v1\/K19-1049"},{"key":"22","doi-asserted-by":"crossref","unstructured":"Gu, J., Wang, Y., Cho, K., and Li, V. O. K. (2018). \u201cSearch Engine Guided Neural Machine Translation.\u201d In <i>AAAI Conference on Artificial Intelligence<\/i>.","DOI":"10.1609\/aaai.v32i1.12013"},{"key":"23","unstructured":"Guu, K., Lee, K., Tung, Z., Pasupat, P., and Chang, M. (2020). \u201cRetrieval Augmented Language Model Pre-training.\u201d In <i>International Conference on Machine Learning<\/i>, pp. 3929\u20133938. PMLR."},{"key":"24","doi-asserted-by":"crossref","unstructured":"He, Q., Huang, G., Cui, Q., Li, L., and Liu, L. (2021). \u201cFast and Accurate Neural Machine Translation with Translation Memory.\u201d In Zong, C., Xia, F., Li, W., and Navigli, R. 
(Eds.), <i>Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)<\/i>, pp. 3170\u20133180, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2021.acl-long.246"},{"key":"25","doi-asserted-by":"crossref","unstructured":"Heffernan, K., \u00c7elebi, O., and Schwenk, H. (2022). \u201cBitext Mining Using Distilled Sentence Representations for Low-Resource Languages.\u201d In Goldberg, Y., Kozareva, Z., and Zhang, Y. (Eds.), <i>Findings of the Association for Computational Linguistics: EMNLP 2022<\/i>, pp. 2101\u20132112, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.findings-emnlp.154"},{"key":"26","unstructured":"Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., and Chen, W. (2022). \u201cLoRA: Low-Rank Adaptation of Large Language Models.\u201d In <i>International Conference on Learning Representations<\/i>."},{"key":"27","unstructured":"Izacard, G., Caron, M., Hosseini, L., Riedel, S., Bojanowski, P., Joulin, A., and Grave, E. (2021). \u201cUnsupervised Dense Information Retrieval with Contrastive Learning.\u201d <i>arXiv preprint arXiv:2112.09118<\/i>."},{"key":"28","doi-asserted-by":"crossref","unstructured":"Jalili Sabet, M., Dufter, P., Yvon, F., and Sch\u00fctze, H. (2020). \u201cSimAlign: High Quality Word Alignments Without Parallel Training Data Using Static and Contextualized Embeddings.\u201d In Cohn, T., He, Y., and Liu, Y. (Eds.), <i>Findings of the Association for Computational Linguistics: EMNLP 2020<\/i>, pp. 1627\u20131643, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.findings-emnlp.147"},{"key":"29","doi-asserted-by":"crossref","unstructured":"Johnson, C. (2022). \u201cBinary Encoded Word Mover\u2019s Distance.\u201d In Gella, S., He, H., Majumder, B. 
P., Can, B., Giunchiglia, E., Cahyawijaya, S., Min, S., Mozes, M., Li, X. L., Augenstein, I., Rogers, A., Cho, K., Grefenstette, E., Rimell, L., and Dyer, C. (Eds.), <i>Proceedings of the 7th Workshop on Representation Learning for NLP<\/i>, pp. 167\u2013172, Dublin, Ireland. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.repl4nlp-1.17"},{"key":"30","doi-asserted-by":"crossref","unstructured":"Karpukhin, V., Oguz, B., Min, S., Lewis, P., Wu, L., Edunov, S., Chen, D., and Yih, W.-t. (2020). \u201cDense Passage Retrieval for Open-Domain Question Answering.\u201d In Webber, B., Cohn, T., He, Y., and Liu, Y. (Eds.), <i>Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)<\/i>, pp. 6769\u20136781, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"31","unstructured":"Khandelwal, U., Fan, A., Jurafsky, D., Zettlemoyer, L., and Lewis, M. (2021). \u201cNearest Neighbor Machine Translation.\u201d In <i>International Conference on Learning Representations<\/i>."},{"key":"32","doi-asserted-by":"crossref","unstructured":"Koehn, P., Och, F. J., and Marcu, D. (2003). \u201cStatistical Phrase-Based Translation.\u201d In <i>Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics<\/i>, pp. 127\u2013133.","DOI":"10.3115\/1073445.1073462"},{"key":"33","doi-asserted-by":"crossref","unstructured":"Kudo, T. (2018). \u201cSubword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates.\u201d In Gurevych, I. and Miyao, Y. (Eds.), <i>Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<\/i>, pp. 66\u201375, Melbourne, Australia. Association for Computational Linguistics.","DOI":"10.18653\/v1\/P18-1007"},{"key":"34","unstructured":"Lan, T., Cai, D., Wang, Y., Huang, H., and Mao, X.-L. (2023). 
\u201cCopy is All You Need.\u201d In <i>The 11th International Conference on Learning Representations<\/i>."},{"key":"35","unstructured":"Lee, A. N., Hunter, C. J., and Ruiz, N. (2023). \u201cPlatypus: Quick, Cheap, and Powerful Refinement of LLMs.\u201d <i>arXiv preprint arXiv:2308.07317<\/i>."},{"key":"36","doi-asserted-by":"crossref","unstructured":"Lee, J., Sung, M., Kang, J., and Chen, D. (2021a). \u201cLearning Dense Representations of Phrases at Scale.\u201d In Zong, C., Xia, F., Li, W., and Navigli, R. (Eds.), <i>Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)<\/i>, pp. 6634\u20136647, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2021.acl-long.518"},{"key":"37","doi-asserted-by":"crossref","unstructured":"Lee, J., Wettig, A., and Chen, D. (2021b). \u201cPhrase Retrieval Learns Passage Retrieval, Too.\u201d In Moens, M.-F., Huang, X., Specia, L., and Yih, S. W.-t. (Eds.), <i>Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing<\/i>, pp. 3661\u20133672, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2021.emnlp-main.297"},{"key":"38","unstructured":"Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., K\u00fcttler, H., Lewis, M., Yih, W.-t., Rockt\u00e4schel, T., Riedel, S., and Kiela, D. (2020a). \u201cRetrieval-augmented Generation for Knowledge-intensive NLP Tasks.\u201d <i>Advances in Neural Information Processing Systems<\/i>, 33, pp. 9459\u20139474."},{"key":"39","unstructured":"Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., K\u00fcttler, H., Lewis, M., Yih, W.-t., Rockt\u00e4schel, T., Riedel, S., and Kiela, D. (2020b). 
\u201cRetrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.\u201d In Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M., and Lin, H. (Eds.), <i>Advances in Neural Information Processing Systems<\/i>, Vol. 33, pp. 9459\u20139474. Curran Associates, Inc."},{"key":"40","doi-asserted-by":"crossref","unstructured":"Li, H., Cai, D., Qu, Z., Cui, Q., Kamigaito, H., Liu, L., and Watanabe, T. (2024). \u201cCross-lingual Contextualized Phrase Retrieval.\u201d In Al-Onaizan, Y., Bansal, M., and Chen, Y.-N. (Eds.), <i>Findings of the Association for Computational Linguistics: EMNLP 2024<\/i>, pp. 6562\u20136576, Miami, Florida, USA. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2024.findings-emnlp.383"},{"key":"41","doi-asserted-by":"crossref","unstructured":"Li, X., Li, G., Liu, L., Meng, M., and Shi, S. (2019). \u201cOn the Word Alignment from Neural Machine Translation.\u201d In Korhonen, A., Traum, D., and M\u00e0rquez, L. (Eds.), <i>Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics<\/i>, pp. 1293\u20131303, Florence, Italy. Association for Computational Linguistics.","DOI":"10.18653\/v1\/P19-1124"},{"key":"42","unstructured":"Li, Y., Liu, L., and Shi, S. (2020). \u201cEmpirical Analysis of Unlabeled Entity Problem in Named Entity Recognition.\u201d In <i>International Conference on Learning Representations<\/i>."},{"key":"43","doi-asserted-by":"crossref","unstructured":"Liu, Y., Gu, J., Goyal, N., Li, X., Edunov, S., Ghazvininejad, M., Lewis, M., and Zettlemoyer, L. (2020). \u201cMultilingual Denoising Pre-training for Neural Machine Translation.\u201d <i>Transactions of the Association for Computational Linguistics<\/i>, 8, pp. 726\u2013742.","DOI":"10.1162\/tacl_a_00343"},{"key":"44","doi-asserted-by":"crossref","unstructured":"Malkov, Y. A. and Yashunin, D. A. (2018). 
\u201cEfficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs.\u201d <i>IEEE Transactions on Pattern Analysis and Machine Intelligence<\/i>, 42 (4), pp. 824\u2013836.","DOI":"10.1109\/TPAMI.2018.2889473"},{"key":"45","unstructured":"Mare\u010dek, D. (2011). \u201cAutomatic Alignment of Tectogrammatical Trees from Czech-English Parallel Corpus.\u201d Univerzita Karlova, Matematicko-fyzik\u00e1ln\u00ed fakulta."},{"key":"46","doi-asserted-by":"crossref","unstructured":"Meng, Y., Li, X., Zheng, X., Wu, F., Sun, X., Zhang, T., and Li, J. (2022). \u201cFast Nearest Neighbor Machine Translation.\u201d In Muresan, S., Nakov, P., and Villavicencio, A. (Eds.), <i>Findings of the Association for Computational Linguistics: ACL 2022<\/i>, pp. 555\u2013565, Dublin, Ireland. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.findings-acl.47"},{"key":"47","doi-asserted-by":"crossref","unstructured":"Mihalcea, R. and Pedersen, T. (2003). \u201cAn Evaluation Exercise for Word Alignment.\u201d In <i>Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond<\/i>, pp. 1\u201310.","DOI":"10.3115\/1118905.1118906"},{"key":"48","doi-asserted-by":"crossref","unstructured":"Min, S., Chen, D., Hajishirzi, H., and Zettlemoyer, L. (2019). \u201cA Discrete Hard EM Approach for Weakly Supervised Question Answering.\u201d In Inui, K., Jiang, J., Ng, V., and Wan, X. (Eds.), <i>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)<\/i>, pp. 2851\u20132864, Hong Kong, China. Association for Computational Linguistics.","DOI":"10.18653\/v1\/D19-1284"},{"key":"49","doi-asserted-by":"crossref","unstructured":"Ng, N., Yee, K., Baevski, A., Ott, M., Auli, M., and Edunov, S. (2019). 
\u201cFacebook FAIR\u2019s WMT19 News Translation Task Submission.\u201d In Bojar, O., Chatterjee, R., Federmann, C., Fishel, M., Graham, Y., Haddow, B., Huck, M., Yepes, A. J., Koehn, P., Martins, A., Monz, C., Negri, M., N\u00e9v\u00e9ol, A., Neves, M., Post, M., Turchi, M., and Verspoor, K. (Eds.), <i>Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)<\/i>, pp. 314\u2013319, Florence, Italy. Association for Computational Linguistics.","DOI":"10.18653\/v1\/W19-5333"},{"key":"50","doi-asserted-by":"crossref","unstructured":"Ni, J., Hernandez Abrego, G., Constant, N., Ma, J., Hall, K., Cer, D., and Yang, Y. (2022). \u201cSentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models.\u201d In Muresan, S., Nakov, P., and Villavicencio, A. (Eds.), <i>Findings of the Association for Computational Linguistics: ACL 2022<\/i>, pp. 1864\u20131874, Dublin, Ireland. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.findings-acl.146"},{"key":"51","doi-asserted-by":"crossref","unstructured":"Och, F. J. and Ney, H. (2003). \u201cA Systematic Comparison of Various Statistical Alignment Models.\u201d <i>Computational Linguistics<\/i>, 29 (1), pp. 19\u201351.","DOI":"10.1162\/089120103321337421"},{"key":"52","unstructured":"Och, F. J., Tillmann, C., and Ney, H. (1999). \u201cImproved Alignment Models for Statistical Machine Translation.\u201d In <i>1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora<\/i>."},{"key":"53","unstructured":"Rei, R., C. de Souza, J. G., Alves, D., Zerva, C., Farinha, A. C., Glushkova, T., Lavie, A., Coheur, L., and Martins, A. F. T. (2022). \u201cCOMET-22: Unbabel-IST 2022 Submission for the Metrics Shared Task.\u201d In Koehn, P., Barrault, L., Bojar, O., Bougares, F., Chatterjee, R., Costa-juss\u00e0, M. 
R., Federmann, C., Fishel, M., Fraser, A., Freitag, M., Graham, Y., Grundkiewicz, R., Guzman, P., Haddow, B., Huck, M., Jimeno Yepes, A., Kocmi, T., Martins, A., Morishita, M., Monz, C., Nagata, M., Nakazawa, T., Negri, M., N\u00e9v\u00e9ol, A., Neves, M., Popel, M., Turchi, M., and Zampieri, M. (Eds.), <i>Proceedings of the 7th Conference on Machine Translation (WMT)<\/i>, pp. 578\u2013585, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics."},{"key":"54","doi-asserted-by":"crossref","unstructured":"Rei, R., Stewart, C., Farinha, A. C., and Lavie, A. (2020). \u201cCOMET: A Neural Framework for MT Evaluation.\u201d In Webber, B., Cohn, T., He, Y., and Liu, Y. (Eds.), <i>Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)<\/i>, pp. 2685\u20132702, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.emnlp-main.213"},{"key":"55","doi-asserted-by":"crossref","unstructured":"Reimers, N. and Gurevych, I. (2020). \u201cMaking Monolingual Sentence Embeddings Multilingual using Knowledge Distillation.\u201d In Webber, B., Cohn, T., He, Y., and Liu, Y. (Eds.), <i>Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)<\/i>, pp. 4512\u20134525, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2020.emnlp-main.365"},{"key":"56","doi-asserted-by":"crossref","unstructured":"Sennrich, R., Haddow, B., and Birch, A. (2016). \u201cNeural Machine Translation of Rare Words with Subword Units.\u201d In Erk, K. and Smith, N. A. (Eds.), <i>Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<\/i>, pp. 1715\u20131725, Berlin, Germany. Association for Computational Linguistics.","DOI":"10.18653\/v1\/P16-1162"},{"key":"57","doi-asserted-by":"crossref","unstructured":"Seo, M., Kwiatkowski, T., Parikh, A., Farhadi, A., and Hajishirzi, H. (2018). 
\u201cPhrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension.\u201d In Riloff, E., Chiang, D., Hockenmaier, J., and Tsujii, J. (Eds.), <i>Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing<\/i>, pp. 559\u2013564, Brussels, Belgium. Association for Computational Linguistics.","DOI":"10.18653\/v1\/D18-1052"},{"key":"58","doi-asserted-by":"crossref","unstructured":"S\u00f8gaard, A., Ruder, S., and Vuli\u0107, I. (2018). \u201cOn the Limitations of Unsupervised Bilingual Dictionary Induction.\u201d In Gurevych, I. and Miyao, Y. (Eds.), <i>Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<\/i>, pp. 778\u2013788, Melbourne, Australia. Association for Computational Linguistics.","DOI":"10.18653\/v1\/P18-1072"},{"key":"59","unstructured":"Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei, Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., et al. (2023). \u201cLlama 2: Open Foundation and Fine-Tuned Chat Models.\u201d <i>arXiv preprint arXiv:2307.09288<\/i>."},{"key":"60","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, \u0141., and Polosukhin, I. (2017). \u201cAttention is All You Need.\u201d <i>Advances in Neural Information Processing Systems<\/i>, 30."},{"key":"61","unstructured":"Wei, J., Wang, X., Schuurmans, D., Bosma, M., Xia, F., Chi, E., Le, Q. V., and Zhou, D. (2022). \u201cChain-of-Thought Prompting Elicits Reasoning in Large Language Models.\u201d <i>Advances in Neural Information Processing Systems<\/i>, 35, pp. 24824\u201324837."},{"key":"62","doi-asserted-by":"crossref","unstructured":"Wu, Q., Nagata, M., and Tsuruoka, Y. (2023). \u201cWSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction.\u201d In Rogers, A., Boyd-Graber, J., and Okazaki, N. 
(Eds.), <i>Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<\/i>, pp. 11084\u201311099, Toronto, Canada. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2023.acl-long.621"},{"key":"63","doi-asserted-by":"crossref","unstructured":"Xia, M., Huang, G., Liu, L., and Shi, S. (2019). \u201cGraph Based Translation Memory for Neural Machine Translation.\u201d In <i>Proceedings of the AAAI Conference on Artificial Intelligence<\/i>, Vol. 33, pp. 7297\u20137304.","DOI":"10.1609\/aaai.v33i01.33017297"},{"key":"64","unstructured":"Zhang, J. and Zong, C. (2016). \u201cBridging Neural Machine Translation and Bilingual Dictionaries.\u201d <i>arXiv preprint arXiv:1610.07272<\/i>."},{"key":"65","doi-asserted-by":"crossref","unstructured":"Zhang, J., Utiyama, M., Sumita, E., Neubig, G., and Nakamura, S. (2018). \u201cGuiding Neural Machine Translation with Retrieved Translation Pieces.\u201d In Walker, M., Ji, H., and Stent, A. (Eds.), <i>Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)<\/i>, pp. 1325\u20131335, New Orleans, Louisiana. Association for Computational Linguistics.","DOI":"10.18653\/v1\/N18-1120"},{"key":"66","unstructured":"Zhang, T., Kishore, V., Wu, F., Weinberger, K. Q., and Artzi, Y. (2019). \u201cBERTScore: Evaluating Text Generation with BERT.\u201d In <i>International Conference on Learning Representations<\/i>."},{"key":"67","doi-asserted-by":"crossref","unstructured":"Zheng, H., Zhang, X., Chi, Z., Huang, H., Tan, Y., Lan, T., Wei, W., and Mao, X.-L. (2022). \u201cCross-Lingual Phrase Retrieval.\u201d In Muresan, S., Nakov, P., and Villavicencio, A. (Eds.), <i>Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)<\/i>, pp. 4193\u20134204, Dublin, Ireland. 
Association for Computational Linguistics.","DOI":"10.18653\/v1\/2022.acl-long.288"},{"key":"68","doi-asserted-by":"crossref","unstructured":"Zheng, X., Zhang, Z., Guo, J., Huang, S., Chen, B., Luo, W., and Chen, J. (2021). \u201cAdaptive Nearest Neighbor Machine Translation.\u201d In Zong, C., Xia, F., Li, W., and Navigli, R. (Eds.), <i>Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)<\/i>, pp. 368\u2013374, Online. Association for Computational Linguistics.","DOI":"10.18653\/v1\/2021.acl-short.47"}],"container-title":["Journal of Natural Language Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/jnlp\/32\/3\/32_886\/_pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,20]],"date-time":"2025-09-20T03:35:33Z","timestamp":1758339333000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.jstage.jst.go.jp\/article\/jnlp\/32\/3\/32_886\/_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025]]},"references-count":68,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025]]}},"URL":"https:\/\/doi.org\/10.5715\/jnlp.32.886","relation":{},"ISSN":["1340-7619","2185-8314"],"issn-type":[{"type":"print","value":"1340-7619"},{"type":"electronic","value":"2185-8314"}],"subject":[],"published":{"date-parts":[[2025]]}}}