{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T01:56:18Z","timestamp":1769824578056,"version":"3.49.0"},"reference-count":56,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2022,4,29]],"date-time":"2022-04-29T00:00:00Z","timestamp":1651190400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61732005 and 61671064"],"award-info":[{"award-number":["61732005 and 61671064"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"National Key Research and Development Program of China","award":["2017YFB1002103"],"award-info":[{"award-number":["2017YFB1002103"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2022,9,30]]},"abstract":"<jats:p>Statistical machine translation (SMT) models rely on word-, phrase-, and syntax-level alignments. But neural machine translation (NMT) models rarely explicitly learn the phrase- and syntax-level alignments. In this article, we propose to improve NMT by explicitly learning the bilingual syntactic constituent alignments. Specifically, we first utilize syntactic parsers to induce syntactic structures of sentences, and then we propose two ways to utilize the syntactic constituents in a perceptual (not adversarial) generator-discriminator training framework. One way is to use them to measure the alignment score of sentence-level training examples, and the other is to directly score the alignments of constituent-level examples generated with an algorithm based on word-level alignments from SMT. In our generator-discriminator framework, the discriminator is pre-trained to learn constituent alignments and distinguish the ground-truth translation from the fake ones, while the generative translation model is fine-tuned to receive the alignment knowledge and to generate translations that best approximate the true ones. Experiments and analysis show that the learned constituent alignments can help improve the translation results.<\/jats:p>","DOI":"10.1145\/3510580","type":"journal-article","created":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T14:43:37Z","timestamp":1648046617000},"page":"1-15","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Improving Neural Machine Translation by Transferring Knowledge from Syntactic Constituent Alignment Learning"],"prefix":"10.1145","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6771-329X","authenticated-orcid":false,"given":"Chao","family":"Su","sequence":"first","affiliation":[{"name":"Beijing Institute of Technology, and Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0320-7520","authenticated-orcid":false,"given":"Heyan","family":"Huang","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, and Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3436-7575","authenticated-orcid":false,"given":"Shumin","family":"Shi","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, and Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7236-2922","authenticated-orcid":false,"given":"Ping","family":"Jian","sequence":"additional","affiliation":[{"name":"Beijing Institute of Technology, and Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,4,29]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-2021"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1122"},{"key":"e_1_3_2_4_2","volume-title":"Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915)","author":"Bahdanau Dzmitry","year":"2015","unstructured":"Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations (ICLR\u201915). Retrieved from http:\/\/arxiv.org\/abs\/1409.0473."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.5555\/972470.972474"},{"key":"e_1_3_2_6_2","first-page":"13","volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations","author":"Che Wanxiang","year":"2010","unstructured":"Wanxiang Che, Zhenghua Li, and Ting Liu. 2010. Ltp: A chinese language technology platform. In Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations. Association for Computational Linguistics, 13\u201316."},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1177"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219873"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1179"},{"key":"e_1_3_2_10_2","first-page":"5094","volume-title":"Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI\u201918)","author":"Choi Jihun","year":"2018","unstructured":"Jihun Choi, Kang Min Yoo, and Sang-goo Lee. 2018. Learning to compose task-specific tree structures. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI\u201918). 5094\u20135101."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1515\/9783110218329"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1148"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-3348"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-1024"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1078"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-2012"},{"key":"e_1_3_2_17_2","unstructured":"Jiangtao Feng Lingpeng Kong Po-Sen Huang Chong Wang Da Huang Jiayuan Mao Kan Qiao and Dengyong Zhou. 2018. Neural Phrase-to-Phrase Machine Translation. Retrieved from https:\/\/arxiv:cs.CL\/1811.02172."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33013723"},{"key":"e_1_3_2_19_2","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Huang Po-Sen","year":"2018","unstructured":"Po-Sen Huang, Chong Wang, Sitao Huang, Dengyong Zhou, and Li Deng. 2018. Towards neural phrase-based machine translation. In Proceedings of the International Conference on Learning Representations. Retrieved from https:\/\/openreview.net\/forum?id=HktJec1RZ."},{"key":"e_1_3_2_20_2","first-page":"944","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201910)","author":"Isozaki Hideki","year":"2010","unstructured":"Hideki Isozaki, Tsutomu Hirao, Kevin Duh, Katsuhito Sudoh, and Hajime Tsukada. 2010. Automatic evaluation of translation quality for distant language pairs. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201910). 944\u2013952. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id=1870658.1870750."},{"key":"e_1_3_2_21_2","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR\u201917)","author":"Jang Eric","year":"2017","unstructured":"Eric Jang, Shixiang Gu, and Ben Poole. 2017. Categorical reparameterization with gumbel-softmax. In Proceedings of the International Conference on Learning Representations (ICLR\u201917)."},{"key":"e_1_3_2_22_2","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR\u201915)","volume":"5","author":"Kingma D.","year":"2015","unstructured":"D. Kingma and J. Ba Adam. 2015. A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations (ICLR\u201915), Vol. 5."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1085"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.3115\/1075096.1075150"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.3115\/1557769.1557821"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.21236\/ADA461156"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.3115\/1075096.1075152"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.3115\/1220175.1220252"},{"key":"e_1_3_2_29_2","volume-title":"Proceedings of the 4th International Conference on Learning Representations (ICLR\u201916)","author":"Luong Minh-Thang","year":"2016","unstructured":"Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, and Lukasz Kaiser. 2016. Multi-task sequence to sequence learning. In Proceedings of the 4th International Conference on Learning Representations (ICLR\u201916), Yoshua Bengio and Yann LeCun (Eds.). Retrieved from http:\/\/arxiv.org\/abs\/1511.06114."},{"key":"e_1_3_2_30_2","first-page":"489","volume-title":"Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC\u201906).","author":"Ma Xiaoyi","year":"2006","unstructured":"Xiaoyi Ma. 2006. Champollion: A robust parallel text sentence aligner. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC\u201906).489\u2013492. Retrieved from http:\/\/www.lrec-conf.org\/proceedings\/lrec2006\/pdf\/746_pdf.pdf."},{"key":"e_1_3_2_31_2","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR\u201917)","author":"Maddison Chris J.","year":"2017","unstructured":"Chris J. Maddison, Andriy Mnih, and Yee Whye Teh. 2017. The concrete distribution: A continuous relaxation of discrete random variables. In Proceedings of the International Conference on Learning Representations (ICLR\u201917)."},{"key":"e_1_3_2_32_2","unstructured":"Haitao Mi Zhiguo Wang and Abe Ittycheriah. 2016. Vocabulary Manipulation for Neural Machine Translation. Retrieved from https:\/\/arxiv:cs.CL\/1605.03209."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2008.34.1.35"},{"key":"e_1_3_2_34_2","unstructured":"Phi Xuan Nguyen and Shafiq Joty. 2018. Phrase-Based Attentions. Retrieved from https:\/\/arxiv:cs.CL\/1810.03444."},{"key":"e_1_3_2_35_2","first-page":"311","volume-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics","author":"Papineni K.","year":"2002","unstructured":"K. Papineni, S. Roukos, T. Ward, and W. J. Zhu. 2002. Bleu: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics311\u2013318. Retrieved from http:\/\/www.aclweb.org\/anthology\/P02-1040.pdf."},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-5626"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.5555\/1621401.1621407"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1162"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1180"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K19-1025"},{"key":"e_1_3_2_41_2","first-page":"1857\u20131865","volume-title":"Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS\u201916)","author":"Sohn Kihyuk","year":"2016","unstructured":"Kihyuk Sohn. 2016. Improved deep metric learning with multi-class n-pair loss objective. In Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS\u201916). Curran Associates, 1857\u20131865."},{"key":"e_1_3_2_42_2","first-page":"3104","volume-title":"Proceedings of the Annual Conference on Neural Information Processing Systems","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Proceedings of the Annual Conference on Neural Information Processing Systems. 3104\u20133112. Retrieved from http:\/\/papers.nips.cc\/paper\/5346-sequence-to-sequence-learning-with-neural-networks."},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-1150"},{"key":"e_1_3_2_44_2","unstructured":"John Tinsley Ventsislav Zhechev Mary Hearne and Andy Way. 2007. Robust language pair-independent sub-tree alignment. In Proceedings of Machine Translation Summit XI: Papers . https:\/\/aclanthology.org\/2007.mtsummit-papers.62."},{"key":"e_1_3_2_45_2","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems 30","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30. MIT Press, 5998\u20136008. Retrieved from http:\/\/papers.nips.cc\/paper\/7181-attention-is-all-you-need.pdf."},{"key":"e_1_3_2_46_2","first-page":"3674","volume-title":"Proceedings of the 34th International Conference on Machine Learning (ICML\u201917)","author":"Wang Chong","year":"2017","unstructured":"Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, and Li Deng. 2017. Sequence modeling via segmentations. In Proceedings of the 34th International Conference on Machine Learning (ICML\u201917). 3674\u20133683."},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1509"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1149"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1356"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1065"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2018.2855968"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2017\/584"},{"key":"e_1_3_2_53_2","unstructured":"Pengcheng Yang Boxing Chen Pei Zhang and Xu Sun. 2019. Visual Agreement Regularized Training for Multi-Modal Machine Translation. Retrieved from https:\/\/arxiv:cs.CL\/1912.12014."},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1138"},{"key":"e_1_3_2_55_2","first-page":"1421","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics","author":"Zaremoodi Poorya","year":"2018","unstructured":"Poorya Zaremoodi and Gholamreza Haffari. 2018. Incorporating syntactic uncertainty in neural machine translation with a forest-to-sequence model. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, 1421\u20131429. Retrieved from https:\/\/www.aclweb.org\/anthology\/C18-1120."},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/641"},{"key":"e_1_3_2_57_2","first-page":"1604","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Zhu Xiaodan","year":"2015","unstructured":"Xiaodan Zhu, Parinaz Sobhani, and Hongyu Guo. 2015. Long short-term memory over recursive structures. In Proceedings of the International Conference on Machine Learning. 1604\u20131612."}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3510580","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3510580","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:12:19Z","timestamp":1750191139000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3510580"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,29]]},"references-count":56,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,9,30]]}},"alternative-id":["10.1145\/3510580"],"URL":"https:\/\/doi.org\/10.1145\/3510580","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,29]]},"assertion":[{"value":"2020-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-04-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}