{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T15:53:49Z","timestamp":1776786829452,"version":"3.51.2"},"reference-count":32,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2025,4,23]],"date-time":"2025-04-23T00:00:00Z","timestamp":1745366400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2025,5,31]]},"abstract":"<jats:p>\n            The Dongba script, a logographic writing system used by the Naxi people in religious activities, faces challenges in translation due to the advanced age of Dongba script experts and the time-consuming nature of manual deciphering. This study focuses on translating the resource-scarce Dongba script into Modern Chinese using a novel approach based on cross-lingual transfer learning from Ancient Chinese. By examining translation patterns from Ancient Chinese to Modern Chinese, we determine the feasibility of transferring knowledge from Ancient Chinese to Dongba script translation. We propose the Dongba Machine Translation Model (DMTM), a pre-trained, low-resource machine translation model that utilizes the linguistic similarities between Ancient Chinese and Dongba script to improve translation quality. The model undergoes pre-training on a large-scale Ancient Chinese corpus and fine-tuning on a small-scale Dongba script corpus, enabling effective knowledge transfer. To address the scarcity of Dongba script translation resources, we present DongBa Corpus 1.0, a fine-grained parallel dataset of Dongba script and Modern Chinese. Experimental results demonstrate that our proposed DMTM achieves a translation score of 50.01% BLEU on the test set. As no prior methods exist for Dongba script translation, we compared various architectures commonly used in low-resource translation tasks, and DMTM exhibited the best performance with a 5.39% improvement over alternative architectures tested. The implementation codes and dataset for our approach are available at\n            <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/github.com\/Chloe-mxxxxc\/DMTM\">https:\/\/github.com\/Chloe-mxxxxc\/DMTM<\/jats:ext-link>\n            .\n          <\/jats:p>","DOI":"10.1145\/3721980","type":"journal-article","created":{"date-parts":[[2025,3,5]],"date-time":"2025-03-05T10:10:26Z","timestamp":1741169426000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Dongba Machine Translation with Transfer Learning: Leveraging Pre-trained Ancient Chinese Models"],"prefix":"10.1145","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-9416-0634","authenticated-orcid":false,"given":"Xinchen","family":"Ma","sequence":"first","affiliation":[{"name":"School of Communication and Electronic Engineering, East China Normal University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9114-4629","authenticated-orcid":false,"given":"Man","family":"Lan","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, East China Normal University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4197-8676","authenticated-orcid":false,"given":"Wenbo","family":"Hu","sequence":"additional","affiliation":[{"name":"School of Communication and Electronic Engineering, East China Normal University, Shanghai, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8784-4657","authenticated-orcid":false,"given":"Yue","family":"Lu","sequence":"additional","affiliation":[{"name":"School of Communication and Electronic Engineering, East China Normal University, Shanghai, China"}]}],"member":"320","published-online":{"date-parts":[[2025,4,23]]},"reference":[{"key":"e_1_3_2_2_2","first-page":"52","article-title":"Denomination of Dongba script","volume":"12","author":"Deng Z.","year":"2010","unstructured":"Z. Deng. 2010. Denomination of Dongba script. China Terminology 12, 4 (2010), 52.","journal-title":"China Terminology"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.118865"},{"key":"e_1_3_2_4_2","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence 38","author":"Hu W.","year":"2024","unstructured":"W. Hu, H. Zhan, X. Ma, Y. Lu, and C. Y. Suen. 2024. Spotting the unseen: Reciprocal consensus network guided by visual archetypes. Proceedings of the AAAI Conference on Artificial Intelligence 38, 11 (2024), 12556\u201312564."},{"key":"e_1_3_2_5_2","unstructured":"D. Bahdanau K. Cho and Y. Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473."},{"key":"e_1_3_2_6_2","first-page":"1","article-title":"Attention is all you need","volume":"30","author":"Vaswani A.","year":"2017","unstructured":"A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, \u0141. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30 (2017), 1\u201311.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/3531535"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/3579164"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10590-021-09281-1"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10590-017-9203-5"},{"key":"e_1_3_2_11_2","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. 1568\u20131575","author":"Zoph B.","unstructured":"B. Zoph, D. Yuret, J. May, and K. Knight. 2016. Transfer learning for low-resource neural machine translation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. 1568\u20131575."},{"key":"e_1_3_2_12_2","volume-title":"Natural Language Processing and Chinese Computing. Lecture Notes in Computer Science","volume":"13028","author":"Yang Z.","unstructured":"Z. Yang, K. Chen, and J. Chen. 2021. Guwen-UNILM: Machine translation between ancient and modern Chinese based on pre-trained models. In Natural Language Processing and Chinese Computing. Lecture Notes in Computer Science, Vol. 13028. Springer, 116\u2013128."},{"key":"e_1_3_2_13_2","volume-title":"BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.","author":"Devlin J.","year":"2018","unstructured":"J. Devlin, M. W. Chang, K. Lee, and K. Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805."},{"key":"e_1_3_2_14_2","unstructured":"Y. Liu M. Ott N. Goyal J. Du M. Joshi D. Chen O. Levy M. Lewis L. Zettlemoyer and V. Stoyanov. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692."},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2021.3124365"},{"key":"e_1_3_2_16_2","first-page":"13063","article-title":"Unified language model pre-training for natural language understanding and generation","volume":"32","author":"Dong L.","year":"2019","unstructured":"L. Dong, N. Yang, W. Wang, F. Wei, X. Liu, Y. Wang, J. Gao, M. Zhou, and H. W. Hon. 2019. Unified language model pre-training for natural language understanding and generation. Advances in Neural Information Processing Systems 32 (2019), 13063\u201313075.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"M. Freitag and Y. Al-Onaizan. 2017. Beam search strategies for neural machine translation. arXiv preprint arXiv:1702.01806.","DOI":"10.18653\/v1\/W17-3207"},{"key":"e_1_3_2_18_2","doi-asserted-by":"crossref","unstructured":"S. Edunov M. Ott M. Auli and D. Grangier. 2018. Understanding back-translation at scale. arXiv preprint arXiv:1808.09381.","DOI":"10.18653\/v1\/D18-1045"},{"key":"e_1_3_2_19_2","volume-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 311\u2013318","author":"Papineni K.","unstructured":"K. Papineni, S. Roukos, T. Ward, and W. J. Zhu. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 311\u2013318."},{"key":"e_1_3_2_20_2","volume-title":"ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out","author":"Lin C. Y.","year":"2004","unstructured":"C. Y. Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out. Association for Computational Linguistics, 74\u201381."},{"key":"e_1_3_2_21_2","volume-title":"Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization. 65\u201372","author":"Banerjee S.","unstructured":"S. Banerjee and A. Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization. 65\u201372."},{"key":"e_1_3_2_22_2","doi-asserted-by":"crossref","unstructured":"C. Fellbaum (Ed.). 1998. WordNet: An Electronic Lexical Database. Library Quarterly Information Community Policy. MIT Press.","DOI":"10.7551\/mitpress\/7287.001.0001"},{"key":"e_1_3_2_23_2","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv :1412.6980.","author":"Kingma D. P.","year":"2014","unstructured":"D. P. Kingma and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv :1412.6980."},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","unstructured":"J. Gu H. Hassan J. Devlin and V. O. K. Li. 2018. Universal neural machine translation for extremely low resource languages. arXiv preprint arXiv:1802.05368.","DOI":"10.18653\/v1\/N18-1032"},{"key":"e_1_3_2_25_2","volume-title":"Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing. 848\u2013856","author":"Galley M.","unstructured":"M. Galley and C. D. Manning. 2008. A simple and effective hierarchical phrase reordering model. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing. 848\u2013856."},{"key":"e_1_3_2_26_2","volume-title":"Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics. 127\u2013133","author":"Koehn P.","unstructured":"P. Koehn, F. J. Och, and D. Marcu. 2003. Statistical phrase-based translation. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics. 127\u2013133."},{"key":"e_1_3_2_27_2","doi-asserted-by":"crossref","unstructured":"W. Yang Y. Xie A. Lin X. Li L. Tan K. Xiong M. Li and J. Lin. 2019. End-to-end open-domain question answering with BERTserini. arXiv preprint arXiv:1902.01718.","DOI":"10.18653\/v1\/N19-4013"},{"key":"e_1_3_2_28_2","unstructured":"F. Souza R. Nogueira and R. Lotufo. 2019. Portuguese named entity recognition using BERT-CRF. arXiv preprint arXiv:1909.10649."},{"key":"e_1_3_2_29_2","unstructured":"R. Nogueira and K. Cho. 2019. Passage re-ranking with BERT. arXiv preprint arXiv:1901.04085."},{"key":"e_1_3_2_30_2","unstructured":"G. Lample and A. Conneau. 2019. Cross-lingual language model pretraining. arXiv preprint arXiv:1901.07291."},{"key":"e_1_3_2_31_2","volume-title":"MASS: Masked sequence to sequence pre-training for language generation. arXiv preprint arXiv:1905.02450.","author":"Song K.","year":"2019","unstructured":"K. Song, X. Tan, T. Qin, J. Lu, and T. Y. Liu. 2019. MASS: Masked sequence to sequence pre-training for language generation. arXiv preprint arXiv:1905.02450."},{"key":"e_1_3_2_32_2","volume-title":"Proceedings of the International Conference of the Italian Association for Artificial Intelligence. 580\u2013590","author":"Moukafih Y.","unstructured":"Y. Moukafih, N. Sbihi, M. Ghogho, and K. Sma\u00efli. 2021. Improving machine translation of Arabic dialects through multi-task learning. In Proceedings of the International Conference of the Italian Association for Artificial Intelligence. 580\u2013590."},{"key":"e_1_3_2_33_2","doi-asserted-by":"crossref","unstructured":"X. Garcia A. Siddhant O. Firat and A. P. Parikh. 2020. Harnessing multilinguality in unsupervised machine translation for rare languages. arXiv preprint arXiv:2009.11201.","DOI":"10.18653\/v1\/2021.naacl-main.89"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3721980","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3721980","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T01:09:48Z","timestamp":1750295388000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3721980"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4,23]]},"references-count":32,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2025,5,31]]}},"alternative-id":["10.1145\/3721980"],"URL":"https:\/\/doi.org\/10.1145\/3721980","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,4,23]]},"assertion":[{"value":"2023-11-17","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-12-30","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-04-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}