{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:33:34Z","timestamp":1760236414441,"version":"build-2065373602"},"reference-count":97,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2021,11,22]],"date-time":"2021-11-22T00:00:00Z","timestamp":1637539200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Recent years have seen a surge of interest in dialogue translation, which is a significant application task for machine translation (MT) technology. However, this has so far not been extensively explored due to its inherent characteristics including data limitation, discourse properties and personality traits. In this article, we give the first comprehensive review of dialogue MT, including well-defined problems (e.g., 4 perspectives), collected resources (e.g., 5 language pairs and 4 sub-domains), representative approaches (e.g., architecture, discourse phenomena and personality) and useful applications (e.g., hotel-booking chat system). After systematical investigation, we also build a state-of-the-art dialogue NMT system by leveraging a breadth of established approaches such as novel architectures, popular pre-training and advanced techniques. Encouragingly, we push the state-of-the-art performance up to 62.7 BLEU points on a commonly-used benchmark by using mBART pre-training. We hope that this survey paper could significantly promote the research in dialogue MT.<\/jats:p>","DOI":"10.3390\/info12110484","type":"journal-article","created":{"date-parts":[[2021,11,30]],"date-time":"2021-11-30T23:22:28Z","timestamp":1638314548000},"page":"484","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Recent Advances in Dialogue Machine Translation"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4524-4976","authenticated-orcid":false,"given":"Siyou","family":"Liu","sequence":"first","affiliation":[{"name":"Macao Polytechnic Institute, School of Languages and Translation, Macao, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7310-1385","authenticated-orcid":false,"given":"Yuqi","family":"Sun","sequence":"additional","affiliation":[{"name":"Department of Portuguese, Faculty of Arts and Humanities, University of Macau, Macao, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9062-6183","authenticated-orcid":false,"given":"Longyue","family":"Wang","sequence":"additional","affiliation":[{"name":"NLP Centre, AI Lab, Tencent, Shenzhen 518000, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,11,22]]},"reference":[{"key":"ref_1","unstructured":"Simpson, J.A., and Weiner, E.S.C. (1989). Oxford English Dictionary, Oxford University Press."},{"key":"ref_2","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2015, January 7\u20139). Neural machine translation by jointly learning to align and translate. Proceedings of the 3rd International Conference on Learning Representations, San Diego, CA, USA."},{"key":"ref_3","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_4","unstructured":"Gehring, J., Auli, M., Grangier, D., Yarats, D., and Dauphin, Y.N. (2017, January 6\u201311). Convolutional sequence to sequence learning. Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia."},{"key":"ref_5","unstructured":"Danescu-Niculescu-Mizil, C., and Lee, L. (2011, January 23). Chameleons in Imagined Conversations: A New Approach to Understanding Coordination of Linguistic Style in Dialogs. Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics, Portland, OR, USA."},{"key":"ref_6","unstructured":"Banchs, R.E. (2017, January 8\u201314). Movie-DiC: A Movie Dialogue Corpus for Research and Development. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju Island, Korea."},{"key":"ref_7","unstructured":"Walker, M.A., Lin, G.I., and Sawyer, J. (2012, January 23\u201325). An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style. Proceedings of the 8th International Conference on Language Resources and Evaluation, Istanbul, Turkey."},{"key":"ref_8","unstructured":"Schmitt, A., Ultes, S., and Minker, W. (2012, January 23\u201325). A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let\u2019s Go Bus Information System. Proceedings of the 8th International Conference on Language Resources and Evaluation, Istanbul, Turkey."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Byrne, B., Krishnamoorthi, K., Sankar, C., Neelakantan, A., Goodrich, B., Duckworth, D., Yavuz, S., Dubey, A., Kim, K.Y., and Cedilnik, A. (2019, January 3\u20137). Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Hong Kong, China.","DOI":"10.18653\/v1\/D19-1459"},{"key":"ref_10","unstructured":"Tiedemann, J. (2012, January 23\u201325). Parallel Data, Tools and Interfaces in OPUS. Proceedings of the 8th International Conference on Language Resources and Evaluation, Istanbul, Turkey."},{"key":"ref_11","unstructured":"Wang, L., Zhang, X., Tu, Z., Liu, Q., and Way, A. (2016, January 23\u201328). Automatic Construction of Discourse Corpora for Dialogue Translation. Proceedings of the 10th International Conference on Language Resources and Evaluation, Portoro\u017e, Slovenia."},{"key":"ref_12","unstructured":"Farajian, M.A., Lopes, A.V., Martins, A.F., Maruf, S., and Haffari, G. (2020, January 19\u201320). Findings of the WMT 2020 shared task on chat translation. Proceedings of the 5th Conference on Machine Translation, Online."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Wang, L., Tu, Z., Way, A., and Liu, Q. (2017, January 7\u201311). Exploiting Cross-Sentence Context for Neural Machine Translation. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark.","DOI":"10.18653\/v1\/D17-1301"},{"key":"ref_14","unstructured":"Maruf, S., Martins, A.F., and Haffari, G. (November, January 31). Contextual Neural Model for Translating Bilingual Multi-Speaker Conversations. Proceedings of the 3rd Conference on Machine Translation, Belgium, Brussels."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Wang, L., Tu, Z., Shi, S., Zhang, T., Graham, Y., and Liu, Q. (2018, January 2\u20137). Translating Pro-Drop Languages with Reconstruction Models. Proceedings of the 32nd AAAI Conference on Artificial Intelligence, New Orleans, LA, USA.","DOI":"10.1609\/aaai.v32i1.11913"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Wang, L., Tu, Z., Wang, X., and Shi, S. (2019, January 3\u20137). One Model to Learn Both: Zero Pronoun Prediction and Translation. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Hong Kong, China.","DOI":"10.18653\/v1\/D19-1085"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Yang, J., Tong, J., Li, S., Gao, S., Guo, J., and Xue, N. (2019, January 3\u20135). Recovering dropped pronouns in Chinese conversations via modeling their referents. Proceedings of the the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, Minneapolis, MN, USA.","DOI":"10.18653\/v1\/N19-1095"},{"key":"ref_18","unstructured":"Meyer, T., and Pol\u00e1kov\u00e1, L. (2013, January 9). Machine translation with many manually labeled discourse connectives. Proceedings of the Workshop on Discourse in Machine Translation, Sofia, Bulgaria."},{"key":"ref_19","unstructured":"Meyer, T., and Webber, B. (2013, January 9). Implicitation of discourse connectives in (machine) translation. Proceedings of the Workshop on Discourse in Machine Translation, Sofia, Bulgaria."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Liang, Y., Meng, F., Chen, Y., Xu, J., and Zhou, J. (2021, January 1\u20136). Modeling bilingual conversational characteristics for neural chat translation. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Online.","DOI":"10.18653\/v1\/2021.acl-long.444"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Liang, Y., Zhou, C., Meng, F., Xu, J., Chen, Y., Su, J., and Zhou, J. (2021, January 12). Towards Making the Most of Dialogue Characteristics for Neural Chat Translation. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Punta Cana, Dominican Republic.","DOI":"10.18653\/v1\/2021.emnlp-main.6"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Nirenburg, S., Raskin, V., and Tucker, A. (1986, January 25\u201329). On knowledge-based machine translation. Proceedings of the 11th Conference on Computational Linguistics, Bonn, Germany.","DOI":"10.3115\/991365.991549"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Koehn, P. (2009). Statistical Machine Translation, Cambridge University Press.","DOI":"10.1017\/CBO9780511815829"},{"key":"ref_24","unstructured":"Kalchbrenner, N., and Blunsom, P. (2013, January 18\u201321). Recurrent Continuous Translation Models. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, WA, USA."},{"key":"ref_25","unstructured":"Sutskever, I., Vinyals, O., and Le, Q.V. (2014, January 8\u201313). Sequence to sequence learning with neural networks. Proceedings of the 28th Conference on Neural Information Processing Systems, Montr\u00e9al, QC, Canada."},{"key":"ref_26","first-page":"263","article-title":"The mathematics of statistical machine translation: Parameter estimation","volume":"19","author":"Brown","year":"1993","journal-title":"Comput. Linguist."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Och, F.J., and Ney, H. (2002, January 7\u201312). Discriminative training and maximum entropy models for statistical machine translation. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, Philadelphia, PA, USA.","DOI":"10.3115\/1073083.1073133"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1038\/scientificamerican0749-11","article-title":"The mathematics of communication","volume":"181","author":"Weaver","year":"1949","journal-title":"Sci. Am."},{"key":"ref_29","unstructured":"Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., and Zens, R. (2007, January 23\u201330). Moses: Open Source Toolkit for Statistical Machine Translation. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1162\/089120103321337421","article-title":"A Systematic Comparison of Various Statistical Alignment Models","volume":"29","author":"Och","year":"2003","journal-title":"Comput. Linguist."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Stolcke, A. (2002, January 16\u201320). Srilm\u2014An extensible language modeling toolkit. Proceedings of the 7th International Conference on Spoken Language Processing, Denver, CO, USA.","DOI":"10.21437\/ICSLP.2002-303"},{"key":"ref_32","unstructured":"Och, F.J. (, January 7\u201312). Minimum Error Rate Training in Statistical Machine Translation. Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, Sapporo, Japan."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Cho, K., van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., and Bengio, Y. (2014, January 25\u201329). Learning Phrase Representations using RNN Encoder\u2013Decoder for Statistical Machine Translation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Doha, Qatar.","DOI":"10.3115\/v1\/D14-1179"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Papineni, K., Roukos, S., Ward, T., and Zhu, W.J. (2002, January 7\u201312). BLEU: A Method for Automatic Evaluation of Machine Translation. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, Philadelphia, PA, USA.","DOI":"10.3115\/1073083.1073135"},{"key":"ref_35","first-page":"440","article-title":"Cohesion and coherence: Linguistic approaches","volume":"99","author":"Sanders","year":"2006","journal-title":"Reading"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1515\/text.1.1988.8.3.243","article-title":"Rhetorical structure theory: Toward a functional theory of text organization","volume":"8","author":"Mann","year":"1988","journal-title":"Text-Interdiscip. J. Study Discourse"},{"key":"ref_37","unstructured":"Foster, G., Isabelle, P., and Kuhn, R. (November, January 31). Translating Structured Documents. Proceedings of the 9th Conference of the Association for Machine Translation in the Americas, Denver, CO, USA."},{"key":"ref_38","unstructured":"Marcu, D., Carlson, L., and Watanabe, M. (May, January 29). The automatic translation of discourse structures. Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics, Seattle, WA, USA."},{"key":"ref_39","unstructured":"Tu, M., Zhou, Y., and Zong, C. (2013, January 4\u20139). A novel translation framework based on rhetorical structure theory. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Guzm\u00e1n, F., Joty, S., M\u00e0rquez, L., and Nakov, P. (2014, January 22\u201327). Using discourse structure improves machine translation evaluation. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, MD, USA.","DOI":"10.3115\/v1\/P14-1065"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Chen, J., Li, X., Zhang, J., Zhou, C., Cui, J., Wang, B., and Su, J. (2020, January 9). Modeling Discourse Structure for Document-level Neural Machine Translation. Proceedings of the 1st Workshop on Automatic Simultaneous Translation, Seattle, WA, USA.","DOI":"10.18653\/v1\/2020.autosimtrans-1.5"},{"key":"ref_42","unstructured":"Smith, K.S., and Specia, L. (2018). Assessing crosslingual discourse relations in machine translation. arXiv."},{"key":"ref_43","unstructured":"Xiao, T., Zhu, J., Yao, S., and Zhang, H. (2011, January 19\u201323). Document-level consistency verification in machine translation. Proceedings of the 13th Machine Translation Summit, Xiamen, China."},{"key":"ref_44","unstructured":"Gong, Z., Zhang, M., Tan, C.L., and Zhou, G. (2012, January 8\u201315). Classifier-based tense model for SMT. Proceedings of the 24th International Conference on Computational Linguistics, Mumbai, India."},{"key":"ref_45","unstructured":"Gong, Z., Zhang, M., Tan, C.L., and Zhou, G. (2012, January 12\u201314). N-gram-based tense models for statistical machine translation. Proceedings of the the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, Korea."},{"key":"ref_46","unstructured":"Sun, Z., Wang, M., Zhou, H., Zhao, C., Huang, S., Chen, J., and Li, L. (2020). Capturing longer context for document-level neural machine translation: A multi-resolutional approach. arXiv."},{"key":"ref_47","unstructured":"Guillou, L. (2013, January 9). Analysing lexical consistency in translation. Proceedings of the Workshop on Discourse in Machine Translation, Sofia, Bulgaria."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Chen, B., and Zhu, X. (2014, January 26\u201330). Bilingual sentiment consistency for statistical machine translation. Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden.","DOI":"10.3115\/v1\/E14-1064"},{"key":"ref_49","unstructured":"Tiedemann, J. (2010, January 15). Context adaptation in statistical machine translation using models with exponentially decaying cache. Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing, Uppsala, Sweden."},{"key":"ref_50","unstructured":"Gong, Z., Zhang, M., and Zhou, G. (2011, January 27\u201331). Cache-based document-level statistical machine translation. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK."},{"key":"ref_51","unstructured":"Hardmeier, C., Nivre, J., and Tiedemann, J. (2012, January 12\u201314). Document-wide decoding for phrase-based statistical machine translation. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, Korea."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1162\/tacl_a_00319","article-title":"Better Document-Level Machine Translation with Bayes\u2019 Rule","volume":"8","author":"Yu","year":"2020","journal-title":"Trans. Assoc. Comput. Linguist."},{"key":"ref_53","unstructured":"Jean, S., Lauly, S., Firat, O., and Cho, K. (2017). Does Neural Machine Translation Benefit from Larger Context?. arXiv."},{"key":"ref_54","unstructured":"Halliday, M.A.K., and Hasan, R. (1976). Cohesion in English, Longman."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Voita, E., Serdyukov, P., Sennrich, R., and Titov, I. (2018, January 15\u201320). Context-Aware Neural Machine Translation Learns Anaphora Resolution. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Melbourne, Australia.","DOI":"10.18653\/v1\/P18-1117"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Li, C.N., and Thompson, S.A. (1979). Third-person pronouns and zero-anaphora in Chinese discourse. Discourse and Syntax, Brill.","DOI":"10.1163\/9789004368897_014"},{"key":"ref_57","unstructured":"Chung, T., and Gildea, D. (2010, January 9\u201311). Effects of empty categories on machine translation. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, USA."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Wang, L., Tu, Z., Zhang, X., Li, H., Way, A., and Liu, Q. (2016, January 12\u201317). A Novel Approach for Dropped Pronoun Translation. Proceedings of the The 2016 Conference of the North American Chapter of the Association for Computational Linguistics, San Diego, CA, USA.","DOI":"10.18653\/v1\/N16-1113"},{"key":"ref_59","unstructured":"Xiong, D., Ben, G., Zhang, M., Lv, Y., and Liu, Q. (2013, January 3\u20139). Modeling lexical cohesion for document-level machine translation. Proceedings of the 23rd International Joint Conference on Artificial Intelligence, Beijing, China."},{"key":"ref_60","unstructured":"Wong, B.T., and Kit, C. (2012, January 12\u201314). Extending machine translation evaluation metrics with lexical cohesion to document level. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, Korea."},{"key":"ref_61","unstructured":"Zheng, Y., Chen, G., Huang, M., Liu, S., and Zhu, X. (2019). Personalized dialogue generation with diversified traits. arXiv."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Vanmassenhove, E., Hardmeier, C., and Way, A. (November, January 31). Getting Gender Right in Neural Machine Translation. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1334"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Mirkin, S., Nowson, S., Brun, C., and Perez, J. (2015, January 17\u201321). Motivating personality-aware machine translation. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal.","DOI":"10.18653\/v1\/D15-1130"},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"1970","DOI":"10.1109\/TASLP.2019.2937190","article-title":"Neural Machine Translation With Sentence-Level Topic Context","volume":"27","author":"Chen","year":"2019","journal-title":"IEEE\/ACM Trans. Audio, Speech, Lang. Process."},{"key":"ref_65","unstructured":"Lavecchia, C., Smaili, K., and Langlois, D. (2007, January 12\u201313). Building Parallel Corpora from Movies. Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science, Madeira, Portugal."},{"key":"ref_66","unstructured":"Tiedemann, J. (2007, January 1\u20133). Improved sentence alignment for movie subtitles. Proceedings of the 3rd Conference on Recent Advances in Natural Language Processing, Varna, Bulgaria."},{"key":"ref_67","unstructured":"Itamar, E., and Itai, A. (June, January 26). Using Movie Subtitles for Creating a Large-Scale Bilingual Corpora. Proceedings of the 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco."},{"key":"ref_68","unstructured":"Tiedemann, J. (June, January 26). Synchronizing Translated Movie Subtitles. Proceedings of the 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Xiao, H., and Wang, X. (2009, January 26\u201327). Constructing Parallel Corpus from Movie Subtitles. Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, Hong Kong, China.","DOI":"10.1007\/978-3-642-00831-3_32"},{"key":"ref_70","unstructured":"Zhang, S., Ling, W., and Dyer, C. (2016, January 23\u201328). Dual Subtitles as Parallel Corpora. Proceedings of the 10th International Conference on Language Resources and Evaluation, Portoro\u017e, Slovenia."},{"key":"ref_71","unstructured":"Lison, P., and Tiedemann, J. (2016, January 23\u201328). OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. Proceedings of the 10th International Conference on Language Resources and Evaluation, Portoro\u017e, Slovenia."},{"key":"ref_72","unstructured":"Paul, M., Federico, M., and St\u00fcker, S. (2010, January 2\u20133). Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 2010 International Workshop on Spoken Language Translation, Paris, France."},{"key":"ref_73","unstructured":"Koehn, P., and Knowles, R. (August, January 30). Six Challenges for Neural Machine Translation. Proceedings of the 1st Workshop on Neural Machine Translation, Vancouver, BC, Canada."},{"key":"ref_74","unstructured":"Poria, S., Hazarika, D., Majumder, N., Naik, G., Cambria, E., and Mihalcea, R. (August, January 28). MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy."},{"key":"ref_75","unstructured":"Koehn, P. Europarl: A Parallel Corpus for Statistical Machine Translation. Proceedings of 10th Machine Translation Summit Proceedings of Conference, Phuket, Thailand, 13\u201315 September 2005."},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Yang, Y., Liu, Y., and Xue, N. (2015, January 26\u201331). Recovering dropped pronouns from Chinese text messages. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Beijing, China.","DOI":"10.3115\/v1\/P15-2051"},{"key":"ref_77","unstructured":"Wang, L., Du, J., Li, L., Tu, Z., Way, A., and Liu, Q. (2017, January 28\u201330). Semantics-Enhanced Task-Oriented Dialogue Translation: A Case Study on Hotel Booking. Proceedings of the 8th International Joint Conference on Natural Language Processing: System Demonstrations, Taiwan, China."},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"Ghazvininejad, M., Levy, O., Liu, Y., and Zettlemoyer, L. (2019, January 3\u20137). Mask-predict: Parallel decoding of conditional masked language models. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Hong Kong, China.","DOI":"10.18653\/v1\/D19-1633"},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Sennrich, R., Haddow, B., and Birch, A. (2016, January 7\u201312). Improving Neural Machine Translation Models with Monolingual Data. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany.","DOI":"10.18653\/v1\/P16-1009"},{"key":"ref_80","first-page":"745485","article-title":"A systematic comparison of data selection criteria for smt domain adaptation","volume":"2014","author":"Wang","year":"2014","journal-title":"Sci. World J."},{"key":"ref_81","unstructured":"Wang, L., Li, M., Liu, F., Shi, S., Tu, Z., Wang, X., Wu, S., Zeng, J., and Zhang, W. (2021, January 7\u201311). Tencent Translation System for the WMT21 News Translation Task. Proceedings of the 6th Conference on Machine Translation, Punta Cana, Dominican Republic."},{"key":"ref_82","unstructured":"Ott, M., Edunov, S., Grangier, D., and Auli, M. (November, January 31). Scaling Neural Machine Translation. Proceedings of the 3rd Conference on Machine Translation, Brussels, Belgium."},{"key":"ref_83","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019, January 2\u20137). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, MN, USA."},{"key":"ref_84","unstructured":"Conneau, A., and Lample, G. (2019, January 8\u201314). Cross-lingual language model pretraining. Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada."},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"Liu, Y., Gu, J., Goyal, N., Li, X., Edunov, S., Ghazvininejad, M., Lewis, M., and Zettlemoyer, L. (2020). Multilingual denoising pre-training for neural machine translation. arXiv.","DOI":"10.1162\/tacl_a_00343"},{"key":"ref_86","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (2016, January 27\u201330). Rethinking the inception architecture for computer vision. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.308"},{"key":"ref_88","doi-asserted-by":"crossref","unstructured":"Zhang, J., Luan, H., Sun, M., Zhai, F., Xu, J., Zhang, M., and Liu, Y. (November, January 31). Improving the Transformer Translation Model with Document-Level Context. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.","DOI":"10.18653\/v1\/D18-1049"},{"key":"ref_89","unstructured":"Gu, J., Bradbury, J., Xiong, C., Li, V.O., and Socher, R. (May, January 30). Non-Autoregressive Neural Machine Translation. Proceedings of the 6th International Conference on Learning Representations, Vancouver, BC, Canada."},{"key":"ref_90","unstructured":"Ding, L., Wang, L., Liu, X., Wong, D.F., Tao, D., and Tu, Z. (2021, January 4\u20138). Understanding and Improving Lexical Choice in Non-Autoregressive Translation. Proceedings of the 9th International Conference on Learning Representations, Vienna, Austria."},{"key":"ref_91","doi-asserted-by":"crossref","unstructured":"Kim, Y., and Rush, A.M. (2016, January 1\u20135). Sequence-Level Knowledge Distillation. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA.","DOI":"10.18653\/v1\/D16-1139"},{"key":"ref_92","unstructured":"Kasai, J., Cross, J., Ghazvininejad, M., and Gu, J. (2020). Parallel Machine Translation with Disentangled Context Transformer. arXiv."},{"key":"ref_93","unstructured":"Li, L., Jiang, X., and Liu, Q. (2019). Pretrained language models for document-level neural machine translation. arXiv."},{"key":"ref_94","doi-asserted-by":"crossref","unstructured":"Ott, M., Edunov, S., Baevski, A., Fan, A., Gross, S., Ng, N., Grangier, D., and Auli, M. (2019, January 2\u20137). FAIRSEQ: A Fast, Extensible Toolkit for Sequence Modeling. Proceedings of the the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, Minneapolis, MN, USA.","DOI":"10.18653\/v1\/N19-4009"},{"key":"ref_95","doi-asserted-by":"crossref","unstructured":"Sennrich, R., Haddow, B., and Birch, A. (2016, January 7\u201312). Neural Machine Translation of Rare Words with Subword Units. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany.","DOI":"10.18653\/v1\/P16-1162"},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"Kim, Y., Tran, D.T., and Ney, H. (2019, January 3). When and Why is Document-level Context Useful in Neural Machine Translation?. Proceedings of the 4th Workshop on Discourse in Machine Translation, Hong Kong, China.","DOI":"10.18653\/v1\/D19-6503"},{"key":"ref_97","doi-asserted-by":"crossref","unstructured":"Li, B., Liu, H., Wang, Z., Jiang, Y., Xiao, T., Zhu, J., Liu, T., and Li, C. (2020, January 5\u201310). Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online.","DOI":"10.18653\/v1\/2020.acl-main.322"}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/12\/11\/484\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:33:49Z","timestamp":1760168029000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/12\/11\/484"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,22]]},"references-count":97,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2021,11]]}},"alternative-id":["info12110484"],"URL":"https:\/\/doi.org\/10.3390\/info12110484","relation":{},"ISSN":["2078-2489"],"issn-type":[{"type":"electronic","value":"2078-2489"}],"subject":[],"published":{"date-parts":[[2021,11,22]]}}}