{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T09:45:34Z","timestamp":1762508734878,"version":"3.41.0"},"reference-count":25,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2022,3,25]],"date-time":"2022-03-25T00:00:00Z","timestamp":1648166400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National key research and development plan","award":["2019QY1800"],"award-info":[{"award-number":["2019QY1800"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61732005, 61672271, 61761026, 61762056, and 61866020"],"award-info":[{"award-number":["61732005, 61672271, 61761026, 61762056, and 61866020"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Yunnan provincial major science and technology special plan projects","award":["202002AD080001"],"award-info":[{"award-number":["202002AD080001"]}]},{"name":"Yunnan high-tech industry development project","award":["201606"],"award-info":[{"award-number":["201606"]}]},{"DOI":"10.13039\/501100005273","name":"Natural Science Foundation of Yunnan Province","doi-asserted-by":"crossref","award":["2018FB104"],"award-info":[{"award-number":["2018FB104"]}],"id":[{"id":"10.13039\/501100005273","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2022,3,31]]},"abstract":"<jats:p>\n            We present a simple, efficient data augmentation approach for boosting Chinese-Vietnamese neural machine translation performance by leveraging the linguistic difference between the two languages. We first define the formalized representation of modifier symmetry, which is one of the most representative linguistic differences between Chinese and Vietnamese. We then propose and test two data augmentation strategies for leveraging the linguistic difference, which can be integrated naturally with different translation models. Results indicate that both strategies can introduce linguistic rules to boost translation accuracy. Tests on Chinese-Vietnamese benchmarks show significant accuracy improvements. To facilitate studies in this domain, we also release an open-source toolkit\n            <jats:xref ref-type=\"fn\">\n              <jats:sup>1<\/jats:sup>\n            <\/jats:xref>\n            with flexible implementation for Chinese-Vietnamese linguistic difference tagging.\n          <\/jats:p>","DOI":"10.1145\/3477536","type":"journal-article","created":{"date-parts":[[2022,3,25]],"date-time":"2022-03-25T10:05:17Z","timestamp":1648202717000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Improving Chinese-Vietnamese Neural Machine Translation with Linguistic Differences"],"prefix":"10.1145","volume":"21","author":[{"given":"Zhiqiang","family":"Yu","sequence":"first","affiliation":[{"name":"Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Yunnan Minzu University, Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming, Yunnan, China"}]},{"given":"Zhengtao","family":"Yu","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming, Yunnan, China"}]},{"given":"Yantuan","family":"Xian","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming, Yunnan, China"}]},{"given":"Yuxin","family":"Huang","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming, Yunnan, China"}]},{"given":"Junjun","family":"Guo","sequence":"additional","affiliation":[{"name":"Faculty of Information Engineering and Automation, Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming, Yunnan, China"}]}],"member":"320","published-online":{"date-parts":[[2022,3,25]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.5555\/2969033.2969173"},{"key":"e_1_3_2_3_2","unstructured":"D. Bahdanau K. Cho and Y. Bengio. 2015. Neural machine translation by jointly learning to align and translate. arXiv.2014. 1409.0473."},{"key":"e_1_3_2_4_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems","volume":"30","author":"Vaswani A.","year":"2017","unstructured":"A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, \u0141. Kaiser, and I. Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30, 5998\u20136008."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-3204"},{"key":"e_1_3_2_6_2","doi-asserted-by":"crossref","unstructured":"Z. Tan S. Wang Z. Yang G. Chen X. Huang M. Sun and Y. Liu. 2021. Neural machine translation: A review of methods resources and tools. arXiv:2012.15515.","DOI":"10.1016\/j.aiopen.2020.11.001"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3365244"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1579"},{"key":"e_1_3_2_9_2","doi-asserted-by":"crossref","unstructured":"F. Burlot and F. Yvon. 2019. Using monolingual data in neural machine translation: A systematic study. arxiv:cs.CL\/1903.11437.","DOI":"10.18653\/v1\/W18-6315"},{"key":"e_1_3_2_10_2","first-page":"449","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota","author":"Song K.","year":"2019","unstructured":"K. Song, Y. Zhang, H. Yu, W. Luo, K. Wang, and M. Zhang. 2019. Code-Switching for Enhancing NMT with Pre-Specified Translation. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota. Association for Computational Linguistics, 449\u2013459."},{"key":"e_1_3_2_11_2","unstructured":"D. Georgiana M. Prashant F. Marcello and A. Yaser. 2019. Training neural machine translation to apply terminology constraints. arxiv:cs.CL\/1906.01105."},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/3314938"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/3373267"},{"key":"e_1_3_2_14_2","first-page":"715","volume-title":"Tsinghua Science and Technology","volume":"23","author":"Huu A. T.","year":"2018","unstructured":"A. T. Huu, H. Huang, and S. Shi. 2018. Integrating pronunciation into Chinese-Vietnamese statistical machine translation. Tsinghua Science and Technology 23, 6 (2018), 715\u2013723."},{"key":"e_1_3_2_15_2","first-page":"375","volume-title":"IEICE Transactions on Information and Systems","volume":"2","author":"Huu A. T.","year":"2019","unstructured":"A. T. Huu, H. Huang, and S. Shi. 2019. Preordering for Chinese-Vietnam statistical machine translation. IEICE Transactions on Information and Systems E102-D, 2, 375\u2013382."},{"key":"e_1_3_2_16_2","first-page":"265","volume-title":"ICIC Express Letters, Part B: Applications","volume":"9","author":"Huu A. T.","year":"2018","unstructured":"A. T. Huu, P. Tran, D. Dinh, V. V. Vu, and T. Le. 2018. Dependency-based pre-ordering of preposition phrases in Chinese-Vietnamese machine translation. ICIC Express Letters, Part B: Applications 9 (2018), 265\u2013272."},{"key":"e_1_3_2_17_2","article-title":"Language post positioned characteristic based Chinese-Vietnamese statistical machine translation method","author":"He J.","year":"2017","unstructured":"J. He, Z. Yu, C. Lv, H. Lai, S. Gao, and Y. Zhang. 2017. Language post positioned characteristic based Chinese-Vietnamese statistical machine translation method. In Proceedings of the International Conference on Asian Language Processing (IALP\u201917), Singapore. IEEE, 2017.","journal-title":"Proceedings of the International Conference on Asian Language Processing (IALP\u201917)"},{"key":"e_1_3_2_18_2","article-title":"Code-switching for enhancing NMT with pre-specified translation[C]","author":"Song K.","year":"2019","unstructured":"K. Song, Y. Zhang, and H. Yu. 2019. Code-switching for enhancing NMT with pre-specified translation[C]. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL\u201919). Minneapolis, Minnesota, 2019.","journal-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL\u201919)"},{"key":"e_1_3_2_19_2","first-page":"3063","article-title":"Training neural machine translation to apply terminology constraints","author":"Dinu G.","unstructured":"G. Dinu, P. Mathur, M. Federico, and Y. Al-Onaizan. Training neural machine translation to apply terminology constraints. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy. Association for Computational Linguistics, 3063\u20133068.","journal-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1021"},{"key":"e_1_3_2_21_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems","author":"Vaswani A.","year":"2017","unstructured":"A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). 30. 5998\u20136008."},{"key":"e_1_3_2_22_2","unstructured":"D. P. Kingma and J. Ba. 2015. Adam: A method for stochastic optimization. arXiv:1412.6980V5."},{"key":"e_1_3_2_23_2","article-title":"Bleu: A method for automatic evaluation of machine translation","author":"Papineni K.","year":"2002","unstructured":"K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2002. Bleu: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, Pennsylvania, USA. Association for Computational Linguistics, 311--318.","journal-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, Pennsylvania, USA"},{"key":"e_1_3_2_24_2","article-title":"Statistical significance tests for machine translation evaluation","author":"Philipp K.","year":"2004","unstructured":"K. Philipp. 2004. Statistical significance tests for machine translation evaluation. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing.","journal-title":"Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing"},{"key":"e_1_3_2_25_2","article-title":"Automatic evaluation of translation quality for distant language pairs","author":"Isozaki H.","year":"2010","unstructured":"H. Isozaki, T. Hirao, and K. Duh. 2010. Automatic evaluation of translation quality for distant language pairs. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing Cambridge, MA. Association for Computational Linguistics, 944--952.","journal-title":"Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing Cambridge, MA"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.3115\/1626431.1626480"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477536","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3477536","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:10:37Z","timestamp":1750183837000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477536"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,25]]},"references-count":25,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,3,31]]}},"alternative-id":["10.1145\/3477536"],"URL":"https:\/\/doi.org\/10.1145\/3477536","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2022,3,25]]},"assertion":[{"value":"2021-04-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-03-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}