{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T01:34:01Z","timestamp":1772069641093,"version":"3.50.1"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2016,5,16]],"date-time":"2016-05-16T00:00:00Z","timestamp":1463356800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2016,6,2]]},"abstract":"<jats:p>The lack of computational support has significantly slowed down automatic understanding of endangered languages. In this paper, we take Nyushu (simplified Chinese: \u5973\u4e66; literally: \u201cwomen\u2019s writing\u201d) as a case study to present the first computational approach that combines Computer Vision and Natural Language Processing techniques to deeply understand an endangered language. We developed an end-to-end system to read a scanned hand-written Nyushu article, segment it into characters, link them to standard characters, and then translate the article into Mandarin Chinese. We propose several novel methods to address the new challenges introduced by noisy input and low resources, including Nyushu-specific feature selection for character segmentation and linking, and character linking lattice based Machine Translation. The end-to-end system performance indicates that the system is a promising approach and can serve as a standard benchmark.<\/jats:p>","DOI":"10.1145\/2857052","type":"journal-article","created":{"date-parts":[[2016,5,17]],"date-time":"2016-05-17T15:58:00Z","timestamp":1463500680000},"page":"1-16","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["From Image to Translation"],"prefix":"10.1145","volume":"15","author":[{"given":"Tongtao","family":"Zhang","sequence":"first","affiliation":[{"name":"Rensselaer Polytechnic Institute"}]},{"given":"Aritra","family":"Chowdhury","sequence":"additional","affiliation":[{"name":"Rensselaer Polytechnic Institute"}]},{"given":"Nimit","family":"Dhulekar","sequence":"additional","affiliation":[{"name":"Rensselaer Polytechnic Institute"}]},{"given":"Jinjing","family":"Xia","sequence":"additional","affiliation":[{"name":"Tsinghua University"}]},{"given":"Kevin","family":"Knight","sequence":"additional","affiliation":[{"name":"Information Sciences Institute, University of Southern California"}]},{"given":"Heng","family":"Ji","sequence":"additional","affiliation":[{"name":"Rensselaer Polytechnic Institute"}]},{"given":"B\u00fclent","family":"Yener","sequence":"additional","affiliation":[{"name":"Rensselaer Polytechnic Institute"}]},{"given":"Liming","family":"Zhao","sequence":"additional","affiliation":[{"name":"Tsinghua University"}]}],"member":"320","published-online":{"date-parts":[[2016,5,16]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2005.34"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-2214"},{"key":"e_1_2_1_3_1","volume-title":"the Proc. of ACL","author":"Bender Emily","year":"2008","unstructured":"Emily Bender . 2008 . Evaluating a crosslinguistic grammar resource: A case study of wambaya . In the Proc. of ACL 2008. Emily Bender. 2008. Evaluating a crosslinguistic grammar resource: A case study of wambaya. In the Proc. of ACL 2008."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-2206"},{"key":"e_1_2_1_5_1","volume-title":"Proc. of LaTeCH2013","author":"Bender Emily","year":"2013","unstructured":"Emily Bender , Michael Wayne Goodman , Joshua Crowgey , and Fei Xia . 2013 . Towards creating precision grammars from interlinear glossed text: Inferring large-scale typological properties . In Proc. of LaTeCH2013 . Emily Bender, Michael Wayne Goodman, Joshua Crowgey, and Fei Xia. 2013. Towards creating precision grammars from interlinear glossed text: Inferring large-scale typological properties. In Proc. of LaTeCH2013."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-2203"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1162\/coli.35.3.469"},{"key":"e_1_2_1_8_1","volume-title":"Proc. of ICCL2012","author":"Bird Steven","year":"2012","unstructured":"Steven Bird and David Chiang . 2012 . Machine translation for language preservation . In Proc. of ICCL2012 . Steven Bird and David Chiang. 2012. Machine translation for language preservation. In Proc. of ICCL2012."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-2201"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1515\/ijsl.2005.2005.173.1"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2001.1034664"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022627411411"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2006.142"},{"key":"e_1_2_1_15_1","unstructured":"Jonathan Graehl. 1997. Carmel finite-state toolkit. http:\/\/www.isi.edu\/licensed-sw\/carmel\/. (1997).  Jonathan Graehl. 1997. Carmel finite-state toolkit. http:\/\/www.isi.edu\/licensed-sw\/carmel\/. (1997)."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2005.38"},{"key":"e_1_2_1_17_1","volume-title":"Naxi Language Briefing","author":"He Jiren","unstructured":"Jiren He and Zuyi Jiang . 1985. Naxi Language Briefing . Minzu Press . Jiren He and Zuyi Jiang. 1985. Naxi Language Briefing. Minzu Press."},{"key":"e_1_2_1_18_1","volume-title":"An overview of muya language. National Languages 8","author":"Huang Bufan","year":"1985","unstructured":"Bufan Huang . 1985. An overview of muya language. National Languages 8 ( 1985 ). Bufan Huang. 1985. An overview of muya language. National Languages 8 (1985)."},{"key":"e_1_2_1_19_1","volume-title":"Jiangyong dialect research","author":"Huang Xuezhen","year":"1993","unstructured":"Xuezhen Huang . 1993. Jiangyong dialect research . Social Science Press ( 1993 ). Xuezhen Huang. 1993. Jiangyong dialect research. Social Science Press (1993)."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/0010-4809(71)90034-6"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/648179.749225"},{"key":"e_1_2_1_22_1","first-page":"4","article-title":"The world\u2019s languages in crisis","volume":"68","author":"Krauss Michael","year":"1992","unstructured":"Michael Krauss . 1992 . The world\u2019s languages in crisis . Languages 68 , 1 (1992), 4 -- 10 . Michael Krauss. 1992. The world\u2019s languages in crisis. Languages 68, 1 (1992), 4--10.","journal-title":"Languages"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073445.1073464"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-2207"},{"key":"e_1_2_1_25_1","volume-title":"Po-ai dialect","author":"Li.","year":"2005","unstructured":"Fang-kuei Li. 2005. Po-ai dialect . Tsinghua University Press ( 2005 ). Fang-kuei Li. 2005. Po-ai dialect. Tsinghua University Press (2005)."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2004.1262182"},{"key":"e_1_2_1_27_1","volume-title":"Advances in Multimodal Interfaces\u2014ICMI","author":"Liu Cheng-Lin","year":"2000","unstructured":"Cheng-Lin Liu , Masashi Koga , Hiroshi Sako , and Hiromichi Fujisawa . 2000. Aspect ratio adaptive normalization for handwritten character recognition . In Advances in Multimodal Interfaces\u2014ICMI 2000 , Tieniu Tan, Yuanchun Shi , and Wen Gao (Eds.). Lecture Notes in Computer Science, Vol. 1948 . Springer Berlin Heidelberg , 418--425. Cheng-Lin Liu, Masashi Koga, Hiroshi Sako, and Hiromichi Fujisawa. 2000. Aspect ratio adaptive normalization for handwritten character recognition. In Advances in Multimodal Interfaces\u2014ICMI 2000, Tieniu Tan, Yuanchun Shi, and Wen Gao (Eds.). Lecture Notes in Computer Science, Vol. 1948. Springer Berlin Heidelberg, 418--425."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2012.06.021"},{"key":"e_1_2_1_29_1","volume-title":"Proc. of ACL2014","author":"Edward O.","year":"2014","unstructured":"Edward O. Ombui1, Peter W. Wagacha , and Wanjiku Nganga . 2014 . InterlinguaPlus machine translation approach for under-resourced languages: Ekegusii and swahili . In Proc. of ACL2014 , Workshop on ComputEL. Edward O. Ombui1, Peter W. Wagacha, and Wanjiku Nganga. 2014. InterlinguaPlus machine translation approach for under-resourced languages: Ekegusii and swahili. In Proc. of ACL2014, Workshop on ComputEL."},{"key":"e_1_2_1_30_1","first-page":"1","article-title":"A threshold selection method from gray-level histograms. Systems, Man and Cybernetics","volume":"9","author":"Otsu Nobuyuki","year":"1979","unstructured":"Nobuyuki Otsu . 1979 . A threshold selection method from gray-level histograms. Systems, Man and Cybernetics , IEEE Transactions on 9 , 1 (Jan 1979), 62--66. DOI:http:\/\/dx.doi.org\/10.1109\/TSMC.1979.4310076 10.1109\/TSMC.1979.4310076 Nobuyuki Otsu. 1979. A threshold selection method from gray-level histograms. Systems, Man and Cybernetics, IEEE Transactions on 9, 1 (Jan 1979), 62--66. DOI:http:\/\/dx.doi.org\/10.1109\/TSMC.1979.4310076","journal-title":"IEEE Transactions on"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073083.1073135"},{"key":"e_1_2_1_32_1","volume-title":"Proc. of the Royal Society of London. 240--242","author":"Pearson Karl","year":"1895","unstructured":"Karl Pearson . 1895 . Notes on regression and inheritance in the case of two parents . In Proc. of the Royal Society of London. 240--242 . Karl Pearson. 1895. Notes on regression and inheritance in the case of two parents. In Proc. of the Royal Society of London. 240--242."},{"key":"e_1_2_1_33_1","volume-title":"Proc. of IJCNLP2008","author":"Riza Hammam","year":"2008","unstructured":"Hammam Riza . 2008 . Indigenous languages of Indonesia: Creating language resources for language preservation . In Proc. of IJCNLP2008 , Workshop on NLP for Less Privileged Languages. Hammam Riza. 2008. Indigenous languages of Indonesia: Creating language resources for language preservation. In Proc. of IJCNLP2008, Workshop on NLP for Less Privileged Languages."},{"key":"e_1_2_1_34_1","volume-title":"Overview of ersu language. Language Research","author":"Sun Hongkai","year":"1983","unstructured":"Hongkai Sun . 1983. Overview of ersu language. Language Research ( 1983 ). Hongkai Sun. 1983. Overview of ersu language. Language Research (1983)."},{"key":"e_1_2_1_35_1","volume-title":"Chinese Languages","author":"Sun Hongkai","unstructured":"Hongkai Sun , Zengyi Hu , and Xing Huang . 2007. Chinese Languages . Commercial Press . Hongkai Sun, Zengyi Hu, and Xing Huang. 2007. Chinese Languages. Commercial Press."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/0031-3203(95)00118-2"},{"key":"e_1_2_1_37_1","volume-title":"Kernel Methods in Computational Biology","author":"Tsuda Koji","unstructured":"Koji Tsuda and Bernhard Sch\u00f6lkopf . 2004. A primer on kernel methods . In Kernel Methods in Computational Biology . MIT Press , 35--70. Koji Tsuda and Bernhard Sch\u00f6lkopf. 2004. A primer on kernel methods. In Kernel Methods in Computational Biology. MIT Press, 35--70."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-2202"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2011.264"},{"key":"e_1_2_1_40_1","unstructured":"Junru Zhang. 1980. Shuiyu Briefing. Minzu Press.  Junru Zhang. 1980. Shuiyu Briefing. Minzu Press."},{"key":"e_1_2_1_41_1","volume-title":"Nyushu and Nyushu Culture","author":"Zhao Liming","unstructured":"Liming Zhao . 1995. Nyushu and Nyushu Culture . Xinhua Press . Liming Zhao. 1995. Nyushu and Nyushu Culture. Xinhua Press."},{"key":"e_1_2_1_42_1","volume-title":"The Comparison of Nyushu Characters","author":"Zhao Liming","unstructured":"Liming Zhao . 2004a. The Comparison of Nyushu Characters . Intellectual Property Press . Liming Zhao. 2004a. The Comparison of Nyushu Characters. Intellectual Property Press."},{"key":"e_1_2_1_43_1","volume-title":"Research on the Characters in the Nyushu Script by the one Hundred Years Old Lady Yang Huanyi","author":"Zhao Liming","unstructured":"Liming Zhao . 2004b. Research on the Characters in the Nyushu Script by the one Hundred Years Old Lady Yang Huanyi . International Culture Publishing House . Liming Zhao. 2004b. Research on the Characters in the Nyushu Script by the one Hundred Years Old Lady Yang Huanyi. International Culture Publishing House."},{"key":"e_1_2_1_44_1","volume-title":"Chinese Nyushu Script Collection","author":"Zhao Liming","unstructured":"Liming Zhao . 2005. Chinese Nyushu Script Collection . Zhonghua Book Company . Liming Zhao. 2005. Chinese Nyushu Script Collection. Zhonghua Book Company."},{"key":"e_1_2_1_45_1","unstructured":"Liming Zhao. 2008. Nyushu Booklet. Hunan People\u2019s Press.  Liming Zhao. 2008. Nyushu Booklet. Hunan People\u2019s Press."},{"key":"e_1_2_1_46_1","volume-title":"A Map Record of the Endangered Languages in Southwestern China","author":"Zhao Liming","unstructured":"Liming Zhao and Zhaolin Song . 2011. A Map Record of the Endangered Languages in Southwestern China . Xueyuan Press . Liming Zhao and Zhaolin Song. 2011. A Map Record of the Endangered Languages in Southwestern China. Xueyuan Press."},{"key":"e_1_2_1_47_1","volume-title":"The collection of endangered literature from the minority groups in southwestern china -- namuyi-tibetan bozi literature","author":"Zhao Liming","year":"2014","unstructured":"Liming Zhao and Yan Zhang . 2014. The collection of endangered literature from the minority groups in southwestern china -- namuyi-tibetan bozi literature . Guangxi Normal University Press ( 2014 ). Liming Zhao and Yan Zhang. 2014. The collection of endangered literature from the minority groups in southwestern china -- namuyi-tibetan bozi literature. Guangxi Normal University Press (2014)."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0031-3203(02)00041-9"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273641"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2857052","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2857052","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:39:08Z","timestamp":1750221548000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2857052"}},"subtitle":["Processing the Endangered Nyushu Script"],"short-title":[],"issued":{"date-parts":[[2016,5,16]]},"references-count":49,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2016,6,2]]}},"alternative-id":["10.1145\/2857052"],"URL":"https:\/\/doi.org\/10.1145\/2857052","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"value":"2375-4699","type":"print"},{"value":"2375-4702","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,5,16]]},"assertion":[{"value":"2015-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2016-05-16","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}