{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T11:23:44Z","timestamp":1772969024547,"version":"3.50.1"},"reference-count":178,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2021,6,7]],"date-time":"2021-06-07T00:00:00Z","timestamp":1623024000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,6,7]],"date-time":"2021-06-07T00:00:00Z","timestamp":1623024000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001602","name":"Science Foundation Ireland","doi-asserted-by":"publisher","award":["SFI\/12\/RC\/2289 (Insight)"],"award-info":[{"award-number":["SFI\/12\/RC\/2289 (Insight)"]}],"id":[{"id":"10.13039\/501100001602","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010663","name":"H2020 European Research Council","doi-asserted-by":"publisher","award":["731015 (ELEXIS-European Lexical Infrastructure)"],"award-info":[{"award-number":["731015 (ELEXIS-European Lexical Infrastructure)"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010663","name":"H2020 European Research Council","doi-asserted-by":"publisher","award":["825182 (Pr\u00eat-\u00e0-LLOD)"],"award-info":[{"award-number":["825182 (Pr\u00eat-\u00e0-LLOD)"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001602","name":"Science Foundation Ireland","doi-asserted-by":"crossref","award":["SFI\/12\/RC\/2289_P2 (Insight_2)"],"award-info":[{"award-number":["SFI\/12\/RC\/2289_P2 (Insight_2)"]}],"id":[{"id":"10.13039\/501100001602","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001602","name":"Science Foundation Ireland","doi-asserted-by":"crossref","award":["SFI\/18\/CRT\/6223 (CRT-Centre for Research Training in Artificial Intelligence)"],"award-info":[{"award-number":["SFI\/18\/CRT\/6223 (CRT-Centre for Research Training in Artificial Intelligence)"]}],"id":[{"id":"10.13039\/501100001602","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100002081","name":"Irish Research Council","doi-asserted-by":"publisher","award":["IRCLA\/2017\/129 (CARDAMOM-Comparative Deep Models of Language for Minority and Historical Languages)"],"award-info":[{"award-number":["IRCLA\/2017\/129 (CARDAMOM-Comparative Deep Models of Language for Minority and Historical Languages)"]}],"id":[{"id":"10.13039\/501100002081","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National University Ireland, Galway"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["SN COMPUT. SCI."],"published-print":{"date-parts":[[2021,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Machine translation is one of the applications of natural language processing which has been explored in different languages. Recently researchers started paying attention towards machine translation for resource-poor languages and closely related languages. A widespread and underlying problem for these machine translation systems is the linguistic difference and variation in orthographic conventions which causes many issues to traditional approaches. Two languages written in two different orthographies are not easily comparable but orthographic information can also be used to improve the machine translation system. This article offers a survey of research regarding orthography\u2019s influence on machine translation of under-resourced languages. It introduces under-resourced languages in terms of machine translation and how orthographic information can be utilised to improve machine translation. We describe previous work in this area, discussing what underlying assumptions were made, and showing how orthographic knowledge improves the performance of machine translation of under-resourced languages. We discuss different types of machine translation and demonstrate a recent trend that seeks to link orthographic information with well-established machine translation methods. Considerable attention is given to current efforts using cognate information at different levels of machine translation and the lessons that can be drawn from this. Additionally, multilingual neural machine translation of closely related languages is given a particular focus in this survey. This article ends with a discussion of the way forward in machine translation with orthographic information, focusing on multilingual settings and bilingual lexicon induction.<\/jats:p>","DOI":"10.1007\/s42979-021-00723-4","type":"journal-article","created":{"date-parts":[[2021,6,7]],"date-time":"2021-06-07T19:03:08Z","timestamp":1623092588000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["A Survey of Orthographic Information in Machine Translation"],"prefix":"10.1007","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4575-7934","authenticated-orcid":false,"given":"Bharathi Raja","family":"Chakravarthi","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Priya","family":"Rani","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mihael","family":"Arcan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"John P.","family":"McCrae","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,6,7]]},"reference":[{"issue":"1","key":"723_CR1","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1007\/s10590-017-9203-5","volume":"32","author":"A Karakanta","year":"2018","unstructured":"Karakanta A, Dehdari J, van Genabith J. Neural machine translation for low-resource languages without parallel corpora. Machans. 2018;32(1):167\u201389. https:\/\/doi.org\/10.1007\/s10590-017-9203-5.","journal-title":"Machans."},{"key":"723_CR2","unstructured":"Lewis W, Munro R, Vogel S. Crisis MT: Developing a cookbook for MT in crisis situations. In: Proceedings of the sixth workshop on statistical machine translation. Association for computational linguistics, Edinburgh, Scotland; 2011. p. 501\u2013511. https:\/\/www.aclweb.org\/anthology\/W11-2164."},{"key":"723_CR3","doi-asserted-by":"publisher","unstructured":"Neubig G, Hu J. Rapid adaptation of neural machine translation to new languages. In: Proceedings of the 2018 conference on empirical methods in natural language processing. Association for computational linguistics, Brussels, Belgium; 2018;. p. 75\u2013880. https:\/\/doi.org\/10.18653\/v1\/D18-1103. https:\/\/www.aclweb.org\/anthology\/D18-1103","DOI":"10.18653\/v1\/D18-1103"},{"key":"723_CR4","unstructured":"Abercrombie G. A rule-based shallow-transfer machine translation system for Scots and English. In: Proceedings of the tenth international conference on language resources and evaluation (LREC\u201916), European Language Resources Association (ELRA), Portoro\u017e, Slovenia, 2016. p. 578\u2013584. https:\/\/www.aclweb.org\/anthology\/L16-1092"},{"key":"723_CR5","doi-asserted-by":"publisher","unstructured":"Allauzen C, Byrne B, Gispert A, Iglesias G, Riley M. Pushdown automata in statistical machine translation. Comput Linguistics. 2014;40(3):687\u2013723. https:\/\/doi.org\/10.1162\/COLI_a_00197. https:\/\/www.aclweb.org\/anthology\/J14-3008","DOI":"10.1162\/COLI_a_00197"},{"key":"723_CR6","doi-asserted-by":"publisher","unstructured":"Centelles J, Costa-juss\u00e0 MR. Chinese-to-Spanish rule-based machine translation system. In: Proceedings of the 3rd workshop on hybrid approaches to machine translation (HyTra), association for computational linguistics, Gothenburg, Sweden; 2014. p. 82\u201386. https:\/\/doi.org\/10.3115\/v1\/W14-1015. https:\/\/www.aclweb.org\/anthology\/W14-1015","DOI":"10.3115\/v1\/W14-1015"},{"key":"723_CR7","doi-asserted-by":"crossref","unstructured":"Charoenpornsawat P, Sornlertlamvanich V, Charoenporn T. Improving translation quality of rule-based machine translation. In: COLING-02: machine translation in Asia; 2002. https:\/\/www.aclweb.org\/anthology\/W02-1605","DOI":"10.3115\/1118794.1118799"},{"key":"723_CR8","doi-asserted-by":"publisher","unstructured":"Hurskainen A, Tiedemann J. Rule-based machine translation from English to Finnish. In: Proceedings of the second conference on machine translation. Association for computational linguistics, Copenhagen, Denmark; 2017. p. 323\u2013329. https:\/\/doi.org\/10.18653\/v1\/W17-4731. https:\/\/www.aclweb.org\/anthology\/W17-4731","DOI":"10.18653\/v1\/W17-4731"},{"key":"723_CR9","doi-asserted-by":"crossref","unstructured":"Kaji H. An efficient execution method for rule-based machine translation. In: Coling Budapest 1988 volume 2: international conference on computational linguistics; 1988. https:\/\/www.aclweb.org\/anthology\/C88-2167.","DOI":"10.3115\/991719.991803"},{"key":"723_CR10","unstructured":"Susanto RH, Larasati SD, Tyers FM. Rule-based machine translation between Indonesian and Malaysian. In: Proceedings of the 3rd workshop on South and Southeast Asian natural language processing. The COLING 2012 Organizing Committee, Mumbai, India; 2012. p. 191\u2013200. https:\/\/www.aclweb.org\/anthology\/W12-5017"},{"key":"723_CR11","doi-asserted-by":"crossref","unstructured":"Carl M. A model of competence for corpus-based machine translation. In: COLING 2000 volume 2: the 18th international conference on computational linguistics; 2000. https:\/\/www.aclweb.org\/anthology\/C00-2145","DOI":"10.3115\/992730.992792"},{"key":"723_CR12","doi-asserted-by":"crossref","unstructured":"Dauphin E, Lux V. Corpus-based annotated test set for machine translation evaluation by an industrial user. In: COLING 1996 volume 2: the 16th international conference on computational linguistics; 1996. https:\/\/www.aclweb.org\/anthology\/C96-2188","DOI":"10.3115\/993268.993366"},{"key":"723_CR13","doi-asserted-by":"publisher","unstructured":"Green S, Cer D, Manning C. An empirical comparison of features and tuning for phrase-based machine translation. In: Proceedings of the ninth workshop on statistical machine translation, association for computational linguistics, Baltimore, Maryland, USA; 2014. p. 466\u2013476, https:\/\/doi.org\/10.3115\/v1\/W14-3360. https:\/\/www.aclweb.org\/anthology\/W14-3360","DOI":"10.3115\/v1\/W14-3360"},{"key":"723_CR14","doi-asserted-by":"publisher","unstructured":"Junczys-Dowmunt M, Grundkiewicz R. Phrase-based machine translation is state-of-the-art for automatic grammatical error correction. In: Proceedings of the 2016 conference on empirical methods in natural language processing. Association for computational linguistics, Austin, Texas; 2016. p. 1546\u20131556,.https:\/\/doi.org\/10.18653\/v1\/D16-1161. https:\/\/www.aclweb.org\/anthology\/D16-1161","DOI":"10.18653\/v1\/D16-1161"},{"key":"723_CR15","unstructured":"Koehn P. Europarl: a parallel corpus for statistical machine translation. In: Conference proceedings: the tenth machine translation summit, AAMT; 2005."},{"key":"723_CR16","doi-asserted-by":"crossref","unstructured":"Koehn P, Hoang H, Birch A, Callison-Burch C, Federico M, Bertoldi N, Cowan B, Shen W, Moran C, Zens R, et\u00a0al. Moses: Open source toolkit for statistical machine translation. In: Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions. Association for computational linguistics; 2007. p. 177\u2013180.","DOI":"10.3115\/1557769.1557821"},{"key":"723_CR17","doi-asserted-by":"crossref","unstructured":"Kondrak G, Marcu D, Knight K. Cognates can improve statistical translation models. In: Companion volume of the proceedings of HLT-NAACL 2003\u2014short papers; 2003. p. 46\u201348. https:\/\/www.aclweb.org\/anthology\/N03-2016","DOI":"10.3115\/1073483.1073499"},{"key":"723_CR18","doi-asserted-by":"publisher","first-page":"576","DOI":"10.1007\/11562214_51","volume-title":"Natural language processing-IJCNLP 2005","author":"H Setiawan","year":"2005","unstructured":"Setiawan H, Li H, Zhang M, Ooi BC. Phrase-based statistical machine translation: a level of detail approach. In: Dale R, Wong KF, Su J, Kwong OY, editors. Natural language processing-IJCNLP 2005. Berlin Heidelberg: Springer; 2005. p. 576\u201387."},{"key":"723_CR19","unstructured":"Bahdanau D, Cho KH, Bengio Y. Neural machine translation by jointly learning to align and translate. In: 3rd international conference on learning representations, ICLR; 2015."},{"key":"723_CR20","doi-asserted-by":"publisher","unstructured":"Cho K, van Merri\u00ebnboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. Learning phrase representations using RNN encoder\u2013decoder for statistical machine translation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), Doha, Qatar; 2014. p. 1724\u20131734. https:\/\/doi.org\/10.3115\/v1\/D14-1179. https:\/\/www.aclweb.org\/anthology\/D14-1179","DOI":"10.3115\/v1\/D14-1179"},{"key":"723_CR21","unstructured":"Sutskever I, Vinyals O, Le QV. Sequence to sequence learning with neural networks. In: Proceedings of the 27th international conference on neural information processing systems - volume 2. MIT Press, Cambridge, MA, USA, NIPS\u201914; 2014. p. 3104\u20133112. http:\/\/dl.acm.org\/citation.cfm?id=2969033.2969173"},{"key":"723_CR22","doi-asserted-by":"publisher","unstructured":"Zhang J, Wang M, Liu Q, Zhou J. Incorporating word reordering knowledge into attention-based neural machine translation. In: Proceedings of the 55th annual meeting of the association for computational linguistics (volume 1: long papers). Association for computational linguistics, Vancouver, Canada; 2017. p. 1524\u20131534. https:\/\/doi.org\/10.18653\/v1\/P17-1140. https:\/\/www.aclweb.org\/anthology\/P17-1140","DOI":"10.18653\/v1\/P17-1140"},{"key":"723_CR23","doi-asserted-by":"publisher","unstructured":"Kim Y, Petrov P, Petrushkov P, Khadivi S, Ney H. Pivot-based transfer learning for neural machine translation between non-English languages. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for computational linguistics, Hong Kong, China; 2019. p. 866\u2013876. https:\/\/doi.org\/10.18653\/v1\/D19-1080. https:\/\/www.aclweb.org\/anthology\/D19-1080","DOI":"10.18653\/v1\/D19-1080"},{"key":"723_CR24","unstructured":"Wu H, Wang H. Pivot language approach for phrase-based statistical machine translation. In: Proceedings of the 45th annual meeting of the association of computational linguistics, Prague, Czech Republic; 2007. p. 856\u2013863. https:\/\/www.aclweb.org\/anthology\/P07-1108"},{"key":"723_CR25","doi-asserted-by":"crossref","unstructured":"Wu H, Wang H. Revisiting pivot language approach for machine translation. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP. Association for computational linguistics, Suntec, Singapore; 2009. p. 154\u2013162. https:\/\/www.aclweb.org\/anthology\/P09-1018","DOI":"10.3115\/1687878.1687902"},{"key":"723_CR26","doi-asserted-by":"publisher","unstructured":"Currey A, Heafield K. Zero-resource neural machine translation with monolingual pivot data. In: Proceedings of the 3rd workshop on neural generation and translation, Association for computational linguistics, Hong Kong; 2019. p. 99\u2013107. https:\/\/doi.org\/10.18653\/v1\/D19-5610. https:\/\/www.aclweb.org\/anthology\/D19-5610","DOI":"10.18653\/v1\/D19-5610"},{"key":"723_CR27","doi-asserted-by":"publisher","unstructured":"Gu J, Wang Y, Cho K, Li VO. Improved zero-shot neural machine translation via ignoring spurious correlations. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy; 2019. p. 1258\u20131268. https:\/\/doi.org\/10.18653\/v1\/P19-1121. https:\/\/www.aclweb.org\/anthology\/P19-1121","DOI":"10.18653\/v1\/P19-1121"},{"key":"723_CR28","doi-asserted-by":"publisher","unstructured":"Johnson M, Schuster M, Le QV, Krikun M, Wu Y, Chen Z, Thorat N, Vi\u00e9gas F, Wattenberg M, Corrado G, Hughes M, Dean J. Google\u2019s multilingual neural machine translation system: enabling zero-shot translation. Trans Assoc Comput Linguistics. 2017;5:339\u201351. https:\/\/doi.org\/10.1162\/tacl_a_00065. https:\/\/www.aclweb.org\/anthology\/Q17-1024","DOI":"10.1162\/tacl_a_00065"},{"key":"723_CR29","doi-asserted-by":"publisher","unstructured":"Pham NQ, Niehues J, Ha TL, Waibel A. Improving zero-shot translation with language-independent constraints. In: Proceedings of the fourth conference on machine translation (volume 1: research papers). Association for computational linguistics, Florence, Italy; 2019. p. 13\u201323. https:\/\/doi.org\/10.18653\/v1\/W19-5202. https:\/\/www.aclweb.org\/anthology\/W19-5202","DOI":"10.18653\/v1\/W19-5202"},{"key":"723_CR30","doi-asserted-by":"publisher","unstructured":"Tan X, Chen J, He D, Xia Y, Qin T, Liu TY. Multilingual neural machine translation with language clustering. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, Hong Kong, China; 2019. p. 963\u2013973. https:\/\/doi.org\/10.18653\/v1\/D19-1089. https:\/\/www.aclweb.org\/anthology\/D19-1089.","DOI":"10.18653\/v1\/D19-1089"},{"key":"723_CR31","doi-asserted-by":"publisher","unstructured":"Artetxe M, Labaka G, Agirre E. Bilingual lexicon induction through unsupervised machine translation. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy; 2019. p. 5002\u20135007. https:\/\/doi.org\/10.18653\/v1\/P19-1494. https:\/\/www.aclweb.org\/anthology\/P19-1494","DOI":"10.18653\/v1\/P19-1494"},{"key":"723_CR32","doi-asserted-by":"publisher","unstructured":"Artetxe M, Labaka G, Agirre E. An effective approach to unsupervised machine translation. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy; 2019. p. 194\u2013203. https:\/\/doi.org\/10.18653\/v1\/P19-1019. https:\/\/www.aclweb.org\/anthology\/P19-1019","DOI":"10.18653\/v1\/P19-1019"},{"key":"723_CR33","doi-asserted-by":"publisher","unstructured":"Pourdamghani N, Aldarrab N, Ghazvininejad M, Knight K, May J. Translating translationese: a two-step approach to unsupervised machine translation. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy; 2019. p. 3057\u20133062, https:\/\/doi.org\/10.18653\/v1\/P19-1293. https:\/\/www.aclweb.org\/anthology\/P19-1293","DOI":"10.18653\/v1\/P19-1293"},{"key":"723_CR34","unstructured":"Abney S, Bird S. The Human Language Project: building a universal corpus of the world\u2019s languages. In: Proceedings of the 48th annual meeting of the association for computational linguistics; 2010. p. 88\u201397. http:\/\/www.aclweb.org\/anthology\/P10-1010"},{"key":"723_CR35","unstructured":"Hauksd\u00f3ttir A. An innovative world language centre : challenges for the use of language technology. In: Proceedings of the ninth international conference on language resources and evaluation (LREC-2014). European Language Resources Association (ELRA); 2014. http:\/\/www.aclweb.org\/anthology\/L14-1618"},{"key":"723_CR36","unstructured":"Alegria I, Artola X, De\u00a0Ilarraza AD, Sarasola K. Strategies to develop language technologies for less-resourced languages based on the case of Basque; 2011."},{"key":"723_CR37","first-page":"8","volume":"2003","author":"S Krauwer","year":"2003","unstructured":"Krauwer S. The basic language resource kit (BLARK) as the first milestone for the language resources roadmap. Proc SPECOM. 2003;2003:8\u201315.","journal-title":"Proc SPECOM."},{"key":"723_CR38","doi-asserted-by":"crossref","unstructured":"Maxwell M, Hughes B. Frontiers in linguistic annotation for lower-density languages. In: Proceedings of the workshop on frontiers in linguistically annotated Corpora 2006. Association for computational linguistics; 2006. p. 29\u201337. http:\/\/www.aclweb.org\/anthology\/W06-0605","DOI":"10.3115\/1641991.1641996"},{"key":"723_CR39","unstructured":"Jimerson R, Prud\u2019hommeaux E (2018) ASR for documenting acutely under-resourced indigenous languages. In: Chair NCC, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T, editors. Proceedings of the eleventh international conference on language resources and evaluation (LREC). European Language Resources Association (ELRA), Japan, Miyazaki; 2018."},{"key":"723_CR40","volume-title":"An introduction to language","author":"V Fromkin","year":"2018","unstructured":"Fromkin V, Rodman R, Hyams N. An introduction to language. Boston: Cengage Learning; 2018."},{"key":"723_CR41","unstructured":"Fischer A, J\u00e1grov\u00e1 K, Stenger I, Avgustinova T, Klakow D, Marti R. Orthographic and morphological correspondences between related slavic languages as a base for modeling of mutual intelligibility. In: Proceedings of the tenth international conference on language resources and evaluation (LREC\u201916); 2016. p. 4202\u20134209."},{"key":"723_CR42","doi-asserted-by":"crossref","unstructured":"Min Z, Haizhou L, Jian S. Direct orthographical mapping for machine transliteration. In: Proceedings of the 20th international conference on computational linguistics. Association for computational linguistics; 2004. p. 716.","DOI":"10.3115\/1220355.1220458"},{"key":"723_CR43","doi-asserted-by":"publisher","unstructured":"Kunchukuttan A, Khapra M, Singh G, Bhattacharyya P. Leveraging orthographic similarity for multilingual neural transliteration. Trans Assoc Comput Linguistics 2018;6:303\u201316. https:\/\/doi.org\/10.1162\/tacl_a_00022. https:\/\/www.aclweb.org\/anthology\/Q18-1022","DOI":"10.1162\/tacl_a_00022"},{"issue":"2","key":"723_CR44","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1007\/s10579-011-9137-0","volume":"45","author":"M Farr\u00fas","year":"2011","unstructured":"Farr\u00fas M, Costa-Jussa MR, Marino JB, Poch M, Hern\u00e1ndez A, Henr\u00edquez C, Fonollosa JA. Overcoming statistical machine translation limitations: error analysis and proposed solutions for the catalan-spanish language pair. Language Resour Eval. 2011;45(2):181\u2013208.","journal-title":"Language Resour Eval."},{"key":"723_CR45","doi-asserted-by":"crossref","unstructured":"Lita LV, Ittycheriah A, Roukos S, Kambhatla N. Truecasing. In: Proceedings of the 41st annual meeting on association for computational linguistics-volume 1. Association for computational linguistics; 2003. p. 152\u2013159.","DOI":"10.3115\/1075096.1075116"},{"key":"723_CR46","doi-asserted-by":"crossref","unstructured":"Schlippe T, Zhu C, Gebhardt J, Schultz T. Text normalization based on statistical machine translation and internet user support. In: Eleventh annual conference of the international speech communication association; 2010.","DOI":"10.21437\/Interspeech.2010-518"},{"key":"723_CR47","unstructured":"Leusch G, Ueffing N, Vilar D, Ney H. Preprocessing and normalization for automatic evaluation of machine translation. In: Proceedings of the ACL workshop on intrinsic and extrinsic evaluation measures for machine translation and\/or summarization, Association for Computational Linguistics, Ann Arbor, Michigan; 2005. p. 17\u201324. https:\/\/www.aclweb.org\/anthology\/W05-0903"},{"key":"723_CR48","unstructured":"Guzm\u00e1n F, Bouamor H, Baly R, Habash N. Machine translation evaluation for Arabic using morphologically-enriched embeddings. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics: technical papers. The COLING 2016 Organizing Committee, Osaka, Japan; 2016. p. 1398\u20131408. https:\/\/www.aclweb.org\/anthology\/C16-1132"},{"key":"723_CR49","doi-asserted-by":"crossref","unstructured":"Kumaran A, Kellner T. A generic framework for machine transliteration. In: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM; 2007. p. 721\u2013722.","DOI":"10.1145\/1277741.1277876"},{"issue":"1","key":"723_CR50","first-page":"90","volume":"15","author":"MO Ayeomoni","year":"2006","unstructured":"Ayeomoni MO. Code-switching and code-mixing: Style of language use in childhood in Yoruba speech community. Nordic J Afr Stud. 2006;15(1):90\u20139.","journal-title":"Nordic J Afr Stud."},{"key":"723_CR51","doi-asserted-by":"publisher","unstructured":"Parshad RD, Bhowmick S, Chand V, Kumari N, Sinha N. What is India speaking? Exploring the \u201cHinglish\u201d invasion. Phys A Stat Mech Appl. 2016;449:375\u201389. https:\/\/doi.org\/10.1016\/j.physa.2016.01.015. http:\/\/www.sciencedirect.com\/science\/article\/pii\/S0378437116000236","DOI":"10.1016\/j.physa.2016.01.015"},{"key":"723_CR52","doi-asserted-by":"crossref","unstructured":"Ranjan P, Raja B, Priyadharshini R, Balabantaray RC. A comparative study on code-mixed data of Indian social media vs formal text. In: 2nd international conference on contemporary computing and informatics (IC3I), IEEE; 2016. p. 608\u2013611. https:\/\/ieeexplore.ieee.org\/document\/7918035","DOI":"10.1109\/IC3I.2016.7918035"},{"key":"723_CR53","first-page":"73","volume":"2017","author":"MM Yoder","year":"2017","unstructured":"Yoder MM, Rijhwani S, Ros\u00e9 CP, Levin L. Code-switching as a social act: the case of Arabic Wikipedia talk pages. ACL. 2017;2017:73.","journal-title":"ACL"},{"key":"723_CR54","first-page":"112","volume":"2016","author":"A Chanda","year":"2016","unstructured":"Chanda A, Das D, Mazumdar C. Columbia-Jadavpur submission for emnlp 2016 code-switching workshop shared task: system description. EMNLP. 2016;2016:112.","journal-title":"EMNLP"},{"key":"723_CR55","unstructured":"Chan JYC, Cao H, Ching PC, Lee T. Automatic recognition of Cantonese-English code-mixing speech. Int J Comput Linguistics Chin Language Process. 2009;14(3). https:\/\/www.aclweb.org\/anthology\/O09-5003"},{"key":"723_CR56","doi-asserted-by":"crossref","unstructured":"Lagarda AL, Alabau V, Casacuberta F, Silva R, D\u00edaz-de Lia\u00f1o E. Statistical post-editing of a rule-based machine translation system. In: Proceedings of human language technologies: the 2009 annual conference of the North American Chapter of the Association for Computational Linguistics, companion volume: short papers. Association for Computational Linguistics, Stroudsburg, PA, USA, NAACL-Short \u201909; 2009. p. 217\u2013220. http:\/\/dl.acm.org\/citation.cfm?id=1620853.1620913","DOI":"10.3115\/1620853.1620913"},{"key":"723_CR57","doi-asserted-by":"crossref","unstructured":"Slocum J, Bennett WS, Whiffin L, Norcross E. An evaluation of metal: the lrc machine translation system. In: Proceedings of the second conference on European chapter of the association for computational linguistics; 1985. p. 62\u201369.","DOI":"10.3115\/976931.976940"},{"key":"723_CR58","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1007\/11751984_6","volume-title":"Computational processing of the Portuguese language","author":"C Armentano-Oller","year":"2006","unstructured":"Armentano-Oller C, Carrasco RC, Corb\u00ed-Bellot AM, Forcada ML, Ginest\u00ed-Rosell M, Ortiz-Rojas S, P\u00e9rez-Ortiz JA, Ram\u00edrez-S\u00e1nchez G, S\u00e1nchez-Mart\u00ednez F, Scalco MA. Open-source portuguese-spanish machine translation. In: Mamede NJ, Oliveira C, Dias MC, Vieira R, Quaresma P, Nunes MGV, editors. Computational processing of the Portuguese language. Berlin, Heidelberg: Springer; 2006. p. 50\u20139."},{"issue":"2","key":"723_CR59","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1007\/s10590-011-9090-0","volume":"25","author":"ML Forcada","year":"2011","unstructured":"Forcada ML, Ginest\u00ed-Rosell M, Nordfalk J, O\u2019Regan J, Ortiz-Rojas S, P\u00e9rez-Ortiz JA, S\u00e1nchez-Mart\u00ednez F, Ram\u00edrez-S\u00e1nchez G, Tyers FM. Apertium: a free\/open-source platform for rule-based machine translation. Mach Transl. 2011;25(2):127\u201344.","journal-title":"Mach Transl."},{"key":"723_CR60","unstructured":"Garrido-Alenda A, Gilabert-Zarco P, P\u00e9rez-Ortiz JA, Pertusa-Ib\u00e1\u00f1ez A, Ram\u00edrez-S\u00e1nchez G, S\u00e1nchez-Mart\u00ednez F, Scalco MA, Forcada ML. Shallow parsing for portuguese\u2013spanish machine translation. In: Tagging and shallow processing of Portuguese: workshop notes of TASHA\u20192003, Citeseer; 2003. p. 21."},{"key":"723_CR61","unstructured":"Xu Q, Chen A, Li C. Detecting English-French cognates using orthographic edit distance. In: Proceedings of the Australasian Language Technology Association Workshop 2015, Parramatta, Australia, 2015. p. 145\u2013149. https:\/\/www.aclweb.org\/anthology\/U15-1020."},{"key":"723_CR62","unstructured":"Scannell KP. Machine translation for closely related language pairs. In: Proceedings of the workshop strategies for developing machine translation for minority languages, Citeseer; 2006. p. 103\u2013109."},{"key":"723_CR63","unstructured":"Ruth J, O\u2019Regan J. Shallow-transfer rule-based machine translation for Czech to Polish. In: Proceedings of the second international workshop on free\/open-source rule-based machine translation, Universitat Oberta de Catalunya; 2011. p. 69\u201376."},{"key":"723_CR64","unstructured":"Tyers FM, Nordfalk J, et\u00a0al. Shallow-transfer rule-based machine translation for swedish to danish. In: Proceedings of the first international workshop on free\/open-source rule-based machine translation. Universidad de Alicante. Departamento de Lenguajes y Sistemas Inform\u00e1ticos; 2009. p. 27\u201333."},{"key":"723_CR65","doi-asserted-by":"crossref","unstructured":"Tantu\u011f AC, Adal\u0131 E. Machine translation between Turkic languages. In: Sara\u00e7lar M, Oflazer K. editors. Turkish natural language processing. Springer; 2018. p. 237\u2013254.","DOI":"10.1007\/978-3-319-90165-7_11"},{"key":"723_CR66","unstructured":"Tantu\u011f AC, Adal\u0131 E, Oflazer K.A MT system from Turkmen to Turkish employing finite state and statistical methods. In: Machine translation summit XI, European Association for Machine Translation (EAMT); 2007. p. 459\u2013465."},{"key":"723_CR67","unstructured":"Brown PF, Della\u00a0Pietra SA, Della\u00a0Pietra VJ, Mercer RL. The mathematics of statistical machine translation: Parameter estimation. Comput Linguistics 1993;19(2):263\u2013311. https:\/\/www.aclweb.org\/anthology\/J93-2003"},{"key":"723_CR68","volume-title":"Statistical machine translation","author":"P Koehn","year":"2010","unstructured":"Koehn P. Statistical machine translation. 1st ed. New York, NY: Cambridge University Press; 2010.","edition":"1"},{"key":"723_CR69","doi-asserted-by":"publisher","unstructured":"Waite A, Byrne B. The geometry of statistical machine translation. In: Proceedings of the 2015 conference of the North American Chapter of the Association for Computational Linguistics: human language technologies. Association for computational linguistics, Denver, Colorado; 2015. p. 376\u2013386. https:\/\/doi.org\/10.3115\/v1\/N15-1041. https:\/\/www.aclweb.org\/anthology\/N15-1041","DOI":"10.3115\/v1\/N15-1041"},{"key":"723_CR70","doi-asserted-by":"publisher","unstructured":"Wang YY, Waibel A. Decoding algorithm in statistical machine translation. In: 35th annual meeting of the association for computational linguistics and 8th conference of the European Chapter of the Association for Computational Linguistics. Association for computational linguistics, Madrid, Spain; 1997. p. 366\u2013372. https:\/\/doi.org\/10.3115\/976909.979664. https:\/\/www.aclweb.org\/anthology\/P97-1047","DOI":"10.3115\/976909.979664"},{"issue":"1\u20132","key":"723_CR71","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1007\/s10590-011-9110-0","volume":"26","author":"A El Kholy","year":"2012","unstructured":"El Kholy A, Habash N. Orthographic and morphological processing for english\u2013arabic statistical machine translation. Mach Trans. 2012;26(1\u20132):25\u201345. https:\/\/doi.org\/10.1007\/s10590-011-9110-0.","journal-title":"Mach Trans"},{"issue":"2","key":"723_CR72","first-page":"245","volume":"31","author":"MR Costa-Jussa","year":"2012","unstructured":"Costa-Jussa MR, Farr\u00fas M, Marino JB, Fonollosa JA. Study and comparison of rule-based and statistical catalan-spanish machine translation systems. Comput Inf. 2012;31(2):245\u201370.","journal-title":"Comput Inf."},{"issue":"8","key":"723_CR73","doi-asserted-by":"publisher","first-page":"1696","DOI":"10.1109\/TASL.2008.2002054","volume":"16","author":"N Bertoldi","year":"2008","unstructured":"Bertoldi N, Zens R, Federico M, Shen W. Efficient speech translation through confusion network decoding. IEEE Trans Audio Speech Language Process. 2008;16(8):1696\u2013705. https:\/\/doi.org\/10.1109\/TASL.2008.2002054.","journal-title":"IEEE Trans Audio Speech Language Process"},{"key":"723_CR74","unstructured":"Bertoldi N, Cettolo M, Federico M. Statistical machine translation of texts with misspelled words. In: Human language technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics, Los Angeles, California; 2010. p. 412\u2013419. https:\/\/www.aclweb.org\/anthology\/N10-1064"},{"key":"723_CR75","unstructured":"Formiga L, Fonollosa JAR. Dealing with input noise in statistical machine translation. In: Proceedings of COLING 2012: posters, the COLING 2012 organizing committee, Mumbai, India; 2012. p. 319\u2013328. https:\/\/www.aclweb.org\/anthology\/C12-2032"},{"key":"723_CR76","doi-asserted-by":"publisher","unstructured":"Brill E, Moore RC. An improved error model for noisy channel spelling correction. In: Proceedings of the 38th annual meeting of the association for computational linguistics, Hong Kong; 2000. p. 286\u2013293. https:\/\/doi.org\/10.3115\/1075218.1075255. https:\/\/www.aclweb.org\/anthology\/P00-1037","DOI":"10.3115\/1075218.1075255"},{"key":"723_CR77","doi-asserted-by":"publisher","unstructured":"Toutanova K, Moore R. Pronunciation modeling for improved spelling correction. In: Proceedings of the 40th annual meeting of the association for computational linguistics. Association for computational linguistics, Philadelphia, Pennsylvania, USA; 2002. p. 144\u2013151. https:\/\/doi.org\/10.3115\/1073083.1073109. https:\/\/www.aclweb.org\/anthology\/P02-1019","DOI":"10.3115\/1073083.1073109"},{"key":"723_CR78","doi-asserted-by":"crossref","unstructured":"Nakov P. Improving English-Spanish statistical machine translation: experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. In: Proceedings of the third workshop on statistical machine translation. Association for computational linguistics, Columbus, Ohio; 2008. p. 147\u2013150. https:\/\/www.aclweb.org\/anthology\/W08-0320","DOI":"10.3115\/1626394.1626414"},{"key":"723_CR79","unstructured":"Oudah M, Almahairi A, Habash N. The impact of preprocessing on Arabic-English statistical and neural machine translation. In: Proceedings of machine translation summit XVII volume 1: research track. European Association for Machine Translation, Dublin, Ireland; 2019. p. 214\u2013221. https:\/\/www.aclweb.org\/anthology\/W19-6621"},{"key":"723_CR80","doi-asserted-by":"publisher","unstructured":"Sennrich R, Haddow B, Birch A. Improving neural machine translation models with monolingual data. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 1: long papers). Association for computational linguistics, Berlin, Germany; 2016. p. 86\u201396. https:\/\/doi.org\/10.18653\/v1\/P16-1009. https:\/\/www.aclweb.org\/anthology\/P16-1009","DOI":"10.18653\/v1\/P16-1009"},{"key":"723_CR81","doi-asserted-by":"publisher","unstructured":"Chen Y, Avgustinova T. Machine translation from an intercomprehension perspective. In: Proceedings of the fourth conference on machine translation (volume 3: shared task papers, day 2). Association for computational linguistics, Florence, Italy; 2019. p. 192\u2013196. https:\/\/doi.org\/10.18653\/v1\/W19-5425. https:\/\/www.aclweb.org\/anthology\/W19-5425","DOI":"10.18653\/v1\/W19-5425"},{"key":"723_CR82","doi-asserted-by":"publisher","unstructured":"Scannell K. Statistical models for text normalization and machine translation. In: Proceedings of the first Celtic language technology workshop. Association for computational linguistics and Dublin City University, Dublin, Ireland; 2014. p. 33\u201340. https:\/\/doi.org\/10.3115\/v1\/W14-4605. https:\/\/www.aclweb.org\/anthology\/W14-4605","DOI":"10.3115\/v1\/W14-4605"},{"key":"723_CR83","unstructured":"Schneider G, Pettersson E, Percillier M. Comparing rule-based and SMT-based spelling normalisation for English historical texts. In: Proceedings of the NoDaLiDa 2017 workshop on processing historical language, Link\u00f6ping University Electronic Press, Gothenburg; 2017. p. 40\u201346. https:\/\/www.aclweb.org\/anthology\/W17-0508"},{"key":"723_CR84","unstructured":"H\u00e4m\u00e4l\u00e4inen M, S\u00e4ily T, Rueter J, Tiedemann J, M\u00e4kel\u00e4 E. Normalizing early English letters to present-day English spelling. In: Proceedings of the second joint SIGHUM workshop on computational linguistics for cultural heritage, social sciences, humanities and literature. Association for computational linguistics, Santa Fe, New Mexico; 2018. p. 87\u201396. https:\/\/www.aclweb.org\/anthology\/W18-4510"},{"key":"723_CR85","unstructured":"Honnet PE, Popescu-Belis A, Musat C, Baeriswyl M. Machine translation of low-resource spoken dialects: strategies for normalizing swiss German. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018). European Language Resources Association (ELRA), Miyazaki, Japan; 2018. https:\/\/www.aclweb.org\/anthology\/L18-1597"},{"key":"723_CR86","doi-asserted-by":"publisher","unstructured":"Napoles C, Callison-Burch C. Systematically adapting machine translation for grammatical error correction. In: Proceedings of the 12th workshop on innovative use of NLP for building educational applications. Association for computational linguistics, Copenhagen, Denmark; 2017. p. 345\u2013356, https:\/\/doi.org\/10.18653\/v1\/W17-5039. https:\/\/www.aclweb.org\/anthology\/W17-5039","DOI":"10.18653\/v1\/W17-5039"},{"key":"723_CR87","unstructured":"Nakov P, Tiedemann J. Combining word-level and character-level models for machine translation between closely-related languages. In: Proceedings of the 50th annual meeting of the association for computational linguistics: short papers-volume 2. Association for computational linguistics; 2012. p. 301\u2013305."},{"key":"723_CR88","unstructured":"Levenshtein VI. Binary codes capable of correcting deletions, insertions and reversals. Sov Phys Doklady 1966;10(8):707\u2013710, Doklady Akad Nauk SSSR 1965;163(4):845\u2013848."},{"key":"723_CR89","unstructured":"Melamed ID. Bitext maps and alignment via pattern recognition. Comput Linguistics 1999;25(1):107\u2013130. https:\/\/www.aclweb.org\/anthology\/J99-1003"},{"key":"723_CR90","doi-asserted-by":"publisher","unstructured":"Ciobanu AM, Dinu LP. Automatic detection of cognates using orthographic alignment. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (volume 2: short papers). Association for computational linguistics, Baltimore, Maryland; 2014. p. 99\u2013105, https:\/\/doi.org\/10.3115\/v1\/P14-2017. https:\/\/www.aclweb.org\/anthology\/P14-2017","DOI":"10.3115\/v1\/P14-2017"},{"key":"723_CR91","unstructured":"Mulloni A, Pekar V. Automatic detection of orthographics cues for cognate recognition. In: Proceedings of the fifth international conference on language resources and evaluation (LREC\u201906). European Language Resources Association (ELRA), Genoa, Italy, 2006. http:\/\/www.lrec-conf.org\/proceedings\/lrec2006\/pdf\/676_pdf.pdf"},{"key":"723_CR92","unstructured":"Simard M, Foster GF, Isabelle P. Using cognates to align sentences in bilingual corpora. In: Proceedings of the 1993 conference of the Centre for Advanced Studies on Collaborative research: distributed computing-volume 2. IBM Press; 1993. p. 1071\u20131082."},{"key":"723_CR93","unstructured":"Simard M, Foster GF, Isabelle P. Using cognates to align sentences in bilingual corpora. In: Proceedings of the 1993 conference of the Centre for Advanced Studies on collaborative research: distributed computing - volume 2. IBM Press, CASCON \u201993; 1993. p. 1071\u20131082."},{"key":"723_CR94","doi-asserted-by":"publisher","unstructured":"Church KW. Char\\_align: a program for aligning parallel texts at the character level. In: 31st annual meeting of the association for computational linguistics, Columbus, Ohio, USA; 1993. p. 1\u20138. https:\/\/doi.org\/10.3115\/981574.981575. https:\/\/www.aclweb.org\/anthology\/P93-1001","DOI":"10.3115\/981574.981575"},{"key":"723_CR95","doi-asserted-by":"crossref","unstructured":"Bemova A, Oliva K, Panevova J. Some problems of machine translation between closely related languages. In: Coling Budapest 1988 Volume 1: international conference on computational linguistics; 1988. http:\/\/www.aclweb.org\/anthology\/C88-1010","DOI":"10.3115\/991635.991645"},{"key":"723_CR96","doi-asserted-by":"publisher","unstructured":"Hajic J. Machine translation of very close languages. In: Sixth applied natural language processing conference. Association for computational linguistics, Seattle, Washington, USA; 2000. p. 7\u201312. https:\/\/doi.org\/10.3115\/974147.974149. https:\/\/www.aclweb.org\/anthology\/A00-1002","DOI":"10.3115\/974147.974149"},{"key":"723_CR97","doi-asserted-by":"crossref","unstructured":"Nakov P, Ng HT. Improved statistical machine translation for resource-poor languages using related resource-rich languages. In: Proceedings of the 2009 conference on empirical methods in natural language processing, Association for computational linguistics; 2009. p. 1358\u20131367. http:\/\/www.aclweb.org\/anthology\/D09-1141","DOI":"10.3115\/1699648.1699682"},{"key":"723_CR98","doi-asserted-by":"publisher","unstructured":"Popovi\u0107 M, Ljube\u0161i\u0107 N. Exploring cross-language statistical machine translation for closely related South Slavic languages. In: Proceedings of the EMNLP\u20192014 workshop on language technology for closely related languages and language variants. Association for computational linguistics; 2014. p. 76\u201384. https:\/\/doi.org\/10.3115\/v1\/W14-4210. http:\/\/www.aclweb.org\/anthology\/W14-4210","DOI":"10.3115\/v1\/W14-4210"},{"key":"723_CR99","unstructured":"Popovi\u0107 M, Arcan M, Klubi\u010dka F. Language related issues for machine translation between closely related South Slavic languages. In: Proceedings of the third workshop on NLP for similar languages, varieties and dialects (VarDial3). The COLING 2016 Organizing Committee; 2016. p. 43\u201352. http:\/\/www.aclweb.org\/anthology\/W16-4806"},{"key":"723_CR100","unstructured":"Beinborn L, Zesch T, Gurevych I. Cognate production using character-based machine translation. In: Proceedings of the sixth international joint conference on natural language processing; 2013. p. 883\u2013891."},{"key":"723_CR101","doi-asserted-by":"crossref","unstructured":"Menacer MA, Langlois D, Jouvet D, Fohr D, Mella O, Sma\u00efli K. Machine translation on a parallel code-switched corpus. In: Canadian conference on artificial intelligence, Springer; 2019. p. 426\u2013432.","DOI":"10.1007\/978-3-030-18305-9_40"},{"key":"723_CR102","doi-asserted-by":"publisher","unstructured":"Fadaee M, Monz C. Back-translation sampling by targeting difficult words in neural machine translation. In: Proceedings of the 2018 conference on empirical methods in natural language processing. Association for computational linguistics, Brussels, Belgium; 2018. p. 436\u2013446. https:\/\/doi.org\/10.18653\/v1\/D18-1040. https:\/\/www.aclweb.org\/anthology\/D18-1040","DOI":"10.18653\/v1\/D18-1040"},{"key":"723_CR103","unstructured":"Chakravarthi BR, Arcan M, McCrae JP. Improving wordnets for under-resourced languages using machine translation. In: Proceedings of the 9th global WordNet conference, The Global WordNet Conference 2018 Committee; 2018. http:\/\/compling.hss.ntu.edu.sg\/events\/2018-gwc\/pdfs\/GWC2018_paper_16"},{"key":"723_CR104","unstructured":"Dhar M, Kumar V, Shrivastava M. Enabling code-mixed translation: parallel corpus creation and MT augmentation approach. In: Proceedings of the first workshop on linguistic resources for natural language processing. Association for computational linguistics, Santa Fe, New Mexico, USA; 2018. p. 131\u2013140. https:\/\/www.aclweb.org\/anthology\/W18-3817"},{"key":"723_CR105","unstructured":"Rijhwani S, Sequiera R, Choudhury MC, Bali K. Translating codemixed tweets: a language detection based system. In: 3rd workshop on Indian language data resource and evaluation-WILDRE-3; 2016. p. 81\u201382."},{"key":"723_CR106","doi-asserted-by":"publisher","unstructured":"Niu X, Denkowski M, Carpuat M. Bi-directional neural machine translation with synthetic parallel data. In: Proceedings of the 2nd workshop on neural machine translation and generation. Association for computational linguistics, Melbourne, Australia; 2018. p. 84\u201391. https:\/\/doi.org\/10.18653\/v1\/W18-2710.https:\/\/www.aclweb.org\/anthology\/W18-2710","DOI":"10.18653\/v1\/W18-2710"},{"key":"723_CR107","doi-asserted-by":"crossref","unstructured":"Riyadh RR, Kondrak G. Joint approach to deromanization of code-mixed texts. In: Proceedings of the sixth workshop on NLP for similar languages, varieties and dialects; 2019. p. 26\u201334.","DOI":"10.18653\/v1\/W19-1403"},{"key":"723_CR108","unstructured":"Cohn T, Lapata M. Machine translation by triangulation: making effective use of multi-parallel corpora. In: Proceedings of the 45th annual meeting of the association of computational linguistics, Prague, Czech Republic; 2007. p. 728\u2013735. https:\/\/www.aclweb.org\/anthology\/P07-1092"},{"key":"723_CR109","unstructured":"Utiyama M, Isahara H. A comparison of pivot methods for phrase-based statistical machine translation. In: Human Language Technologies 2007: the conference of the North American Chapter of the Association for computational linguistics; proceedings of the main conference. Association for computational linguistics, Rochester, New York; 2007. p. 484\u2013491. https:\/\/www.aclweb.org\/anthology\/N07-1061"},{"key":"723_CR110","doi-asserted-by":"publisher","unstructured":"Edunov S, Ott M, Auli M, Grangier D. Understanding back-translation at scale. In: Proceedings of the 2018 conference on empirical methods in natural language processing, Association for computational linguistics, Brussels, Belgium; 2018. p. 489\u2013500. https:\/\/doi.org\/10.18653\/v1\/D18-1045. https:\/\/www.aclweb.org\/anthology\/D18-1045","DOI":"10.18653\/v1\/D18-1045"},{"key":"723_CR111","doi-asserted-by":"publisher","unstructured":"Ahmadnia B, Serrano J, Haffari G. Persian-Spanish low-resource statistical machine translation through English as pivot language. In: Proceedings of the international conference recent advances in natural language processing, RANLP 2017, INCOMA Ltd., Varna, Bulgaria; 2017. p. 24\u201330. https:\/\/doi.org\/10.26615\/978-954-452-049-6_004","DOI":"10.26615\/978-954-452-049-6_004"},{"key":"723_CR112","doi-asserted-by":"publisher","unstructured":"Poncelas A, Popovi\u0107 M, Shterionov D, Maillette\u00a0de Buy\u00a0Wenniger G, Way A. Combining PBSMT and NMT back-translated data for efficient NMT. In: Natural language processing in a deep learning world, INCOMA Ltd., Varna, Bulgaria; 2019. p. 922\u2013931, https:\/\/doi.org\/10.26615\/978-954-452-056-4_107. https:\/\/www.aclweb.org\/anthology\/R19-1107","DOI":"10.26615\/978-954-452-056-4_107"},{"key":"723_CR113","doi-asserted-by":"publisher","unstructured":"Tiedemann J, Cap F, Kanerva J, Ginter F, Stymne S, \u00d6stling R, Weller-Di\u00a0Marco M. Phrase-based SMT for Finnish with more data, better models and alternative alignment and translation tools. In: Proceedings of the first conference on machine translation: volume 2, shared task papers. Association for computational linguistics, Berlin, Germany; 2016. p. 391\u2013398. https:\/\/doi.org\/10.18653\/v1\/W16-2326. https:\/\/www.aclweb.org\/anthology\/W16-2326","DOI":"10.18653\/v1\/W16-2326"},{"key":"723_CR114","doi-asserted-by":"publisher","unstructured":"Gra\u00e7a M, Kim Y, Schamper J, Khadivi S, Ney H. Generalizing back-translation in neural machine translation. In: Proceedings of the fourth conference on machine translation (volume 1: research papers). Association for computational linguistics, Florence, Italy; 2019. p. 45\u201352,.https:\/\/doi.org\/10.18653\/v1\/W19-5205. https:\/\/www.aclweb.org\/anthology\/W19-5205","DOI":"10.18653\/v1\/W19-5205"},{"key":"723_CR115","doi-asserted-by":"publisher","unstructured":"Hoang VCD, Koehn P, Haffari G, Cohn T. Iterative back-translation for neural machine translation. In: Proceedings of the 2nd workshop on neural machine translation and generation, Association for computational linguistics, Melbourne, Australia; 2018. p. 18\u201324. https:\/\/doi.org\/10.18653\/v1\/W18-2703. https:\/\/www.aclweb.org\/anthology\/W18-2703.","DOI":"10.18653\/v1\/W18-2703"},{"key":"723_CR116","doi-asserted-by":"publisher","unstructured":"Prabhumoye S, Tsvetkov Y, Salakhutdinov R, Black AW. Style transfer through back-translation. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 1: long papers). Association for computational linguistics, Melbourne, Australia; 2018. p. 866\u2013876. https:\/\/doi.org\/10.18653\/v1\/P18-1080. https:\/\/www.aclweb.org\/anthology\/P18-1080","DOI":"10.18653\/v1\/P18-1080"},{"key":"723_CR117","unstructured":"Kunchukuttan A, Shah M, Prakash P, Bhattacharyya P. Utilizing lexical similarity between related, low-resource languages for pivot-based smt. arXiv preprint arXiv:170207203; 2017."},{"key":"723_CR118","doi-asserted-by":"publisher","unstructured":"Saunders D, Stahlberg F, de\u00a0Gispert A, Byrne B. Domain adaptive inference for neural machine translation. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy; 2019. p. 222\u2013228. https:\/\/doi.org\/10.18653\/v1\/P19-1022. https:\/\/www.aclweb.org\/anthology\/P19-1022","DOI":"10.18653\/v1\/P19-1022"},{"key":"723_CR119","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I. Attention is all you need. In: Proceedings of the 31st international conference on neural information processing systems. Curran Associates Inc., USA, NIPS\u201917; 2017. p. 6000\u20136010. http:\/\/dl.acm.org\/citation.cfm?id=3295222.3295349"},{"key":"723_CR120","doi-asserted-by":"publisher","unstructured":"Wang Q, Li B, Xiao T, Zhu J, Li C, Wong DF, Chao LS. Learning deep transformer models for machine translation. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy, 2019. p. 1810\u20131822. https:\/\/doi.org\/10.18653\/v1\/P19-1176. https:\/\/www.aclweb.org\/anthology\/P19-1176","DOI":"10.18653\/v1\/P19-1176"},{"key":"723_CR121","doi-asserted-by":"publisher","unstructured":"Cho K, van Merri\u00ebnboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. Learning phrase representations using RNN encoder\u2013decoder for statistical machine translation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), Doha, Qatar; 2014. p. 1724\u20131734. https:\/\/doi.org\/10.3115\/v1\/D14-1179. https:\/\/www.aclweb.org\/anthology\/D14-1179","DOI":"10.3115\/v1\/D14-1179"},{"key":"723_CR122","doi-asserted-by":"publisher","unstructured":"Luong T, Pham H, Manning CD. Effective approaches to attention-based neural machine translation. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for computational linguistics, Lisbon, Portugal; 2015. p. 1412\u20131421. https:\/\/doi.org\/10.18653\/v1\/D15-1166. https:\/\/www.aclweb.org\/anthology\/D15-1166","DOI":"10.18653\/v1\/D15-1166"},{"key":"723_CR123","doi-asserted-by":"publisher","unstructured":"Sen S, Gupta KK, Ekbal A, Bhattacharyya P. Multilingual unsupervised NMT using shared encoder and language-specific decoders. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy; 2019. p. 3083\u20133089. https:\/\/doi.org\/10.18653\/v1\/P19-1297. https:\/\/www.aclweb.org\/anthology\/P19-1297","DOI":"10.18653\/v1\/P19-1297"},{"key":"723_CR124","doi-asserted-by":"publisher","unstructured":"Wang Y, Zhou L, Zhang J, Zhai F, Xu J, Zong C. A compact and language-sensitive multilingual translation method. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy; 2019. p. 1213\u20131223. https:\/\/doi.org\/10.18653\/v1\/P19-1117. https:\/\/www.aclweb.org\/anthology\/P19-1117","DOI":"10.18653\/v1\/P19-1117"},{"key":"723_CR125","unstructured":"Ha T, Niehues J, Waibel AH. Toward multilingual neural machine translation with universal encoder and decoder. In: Proceedings of the international workshop on spoken language translation; 2016. http:\/\/workshop2016.iwslt.org\/downloads\/IWSLT_2016_paper_5.pdf"},{"key":"723_CR126","doi-asserted-by":"publisher","unstructured":"Chakravarthi BR, Arcan M, McCrae JP. Comparison of different orthographies for machine translation of under-resourced Dravidian languages. In: 2nd conference on language, data and knowledge (LDK 2019), Schloss Dagstuhl\u2013Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany. Open access series in informatics (OASIcs); 2019;70. p. 6:1\u20136:14, https:\/\/doi.org\/10.4230\/OASIcs.LDK.2019.6. http:\/\/drops.dagstuhl.de\/opus\/volltexte\/2019\/10370","DOI":"10.4230\/OASIcs.LDK.2019.6"},{"key":"723_CR127","unstructured":"Chakravarthi BR, Arcan M, McCrae JP. Wordnet gloss translation for under-resourced languages using multilingual neural machine translation. In: Proceedings of the second workshop on multilingualism at the intersection of knowledge bases and machine translation; 2019. p. 1\u20137."},{"key":"723_CR128","unstructured":"Chakravarthi BR, Priyadharshini R, Stearns B, Jayapal A, S S, Arcan M, Zarrouk M, McCrae JP. Multilingual multimodal machine translation for Dravidian languages utilizing phonetic transcription. In: Proceedings of the 2nd workshop on technologies for MT of low resource languages. European Association for Machine Translation, Dublin, Ireland; 2019. p. 56\u201363. https:\/\/www.aclweb.org\/anthology\/W19-6809"},{"key":"723_CR129","doi-asserted-by":"publisher","unstructured":"Li X, Michel P, Anastasopoulos A, Belinkov Y, Durrani N, Firat O, Koehn P, Neubig G, Pino J, Sajjad H. Findings of the first shared task on machine translation robustness. In: Proceedings of the fourth conference on machine translation (volume 2: shared task papers, day 1), Association for computational linguistics, Florence, Italy; 2019. p. 91\u2013102, https:\/\/doi.org\/10.18653\/v1\/W19-5303. https:\/\/www.aclweb.org\/anthology\/W19-5303","DOI":"10.18653\/v1\/W19-5303"},{"key":"723_CR130","unstructured":"Belinkov Y, Bisk Y. Synthetic and natural noise both break neural machine translation. In: International conference on learning representations; 2018. https:\/\/openreview.net\/forum?id=BJ8vJebC-"},{"key":"723_CR131","doi-asserted-by":"crossref","unstructured":"Kim Y, Jernite Y, Sontag D, Rush AM. Character-aware neural language models. In: Proceedings of the thirteenth AAAI conference on artificial intelligence. AAAI Press, AAAI; 2016:16. p. 2741\u20139.","DOI":"10.1609\/aaai.v30i1.10362"},{"key":"723_CR132","doi-asserted-by":"publisher","unstructured":"Cherry C, Foster G, Bapna A, Firat O, Macherey W. Revisiting character-based neural machine translation with capacity and compression. In: Proceedings of the 2018 conference on empirical methods in natural language processing, association for computational linguistics, Brussels, Belgium; 2018. p. 4295\u20134305. https:\/\/doi.org\/10.18653\/v1\/D18-1461. https:\/\/www.aclweb.org\/anthology\/D18-1461","DOI":"10.18653\/v1\/D18-1461"},{"key":"723_CR133","doi-asserted-by":"publisher","unstructured":"Costa-juss\u00e0 MR, Fonollosa JAR. Character-based neural machine translation. In: Proceedings of the 54th annual meeting of the association for computational linguistics (volume 2: short papers). Association for computational linguistics, Berlin, Germany; 2016. p. 357\u2013361. https:\/\/doi.org\/10.18653\/v1\/P16-2058. https:\/\/www.aclweb.org\/anthology\/P16-2058.","DOI":"10.18653\/v1\/P16-2058"},{"key":"723_CR134","doi-asserted-by":"publisher","unstructured":"Lee J, Cho K, Hofmann T. Fully character-level neural machine translation without explicit segmentation. Trans Assoc Comput Linguistics. 2017;5:365\u201378. https:\/\/doi.org\/10.1162\/tacl_a_00067. https:\/\/www.aclweb.org\/anthology\/Q17-1026","DOI":"10.1162\/tacl_a_00067"},{"key":"723_CR135","unstructured":"Yang Z, Chen W, Wang F, Xu B. A character-aware encoder for neural machine translation. In: Proceedings of COLING 2016, the 26th international conference on computational linguistics: technical papers, The COLING 2016 Organizing Committee, Osaka, Japan; 2016. p. 3063\u20133070. https:\/\/www.aclweb.org\/anthology\/C16-1288"},{"key":"723_CR136","doi-asserted-by":"publisher","unstructured":"Chitnis R, DeNero J. Variable-length word encodings for neural translation models. In: Proceedings of the 2015 conference on empirical methods in natural language processing, association for computational linguistics, Lisbon, Portugal; 2015. p. 2088\u20132093. https:\/\/doi.org\/10.18653\/v1\/D15-1249. https:\/\/www.aclweb.org\/anthology\/D15-1249","DOI":"10.18653\/v1\/D15-1249"},{"key":"723_CR137","unstructured":"Ding S, Renduchintala A, Duh K. A call for prudent choice of subword merge operations in neural machine translation. In: Proceedings of machine translation summit XVII volume 1: research track, European Association for Machine Translation, Dublin, Ireland; 2019. p. 204\u2013213. URL https:\/\/www.aclweb.org\/anthology\/W19-6620"},{"key":"723_CR138","doi-asserted-by":"publisher","unstructured":"Schuster M, Nakajima K. Japanese and Korean voice search. In: 2012 IEEE international conference on acoustics, speech and signal processing (ICASSP); 2012. p. 5149\u20135152. https:\/\/doi.org\/10.1109\/ICASSP.2012.6289079","DOI":"10.1109\/ICASSP.2012.6289079"},{"key":"723_CR139","unstructured":"Wu Y, Schuster M, Chen Z, Le QV, Norouzi M, Macherey W, Krikun M, Cao Y, Gao Q, Macherey K, et\u00a0al. Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:160908144; 2016."},{"key":"723_CR140","doi-asserted-by":"publisher","unstructured":"Kudo T, Richardson J. Sentence piece: a simple and language independent subword tokenizer and detokenizer for neural text processing. In: Proceedings of the 2018 conference on empirical methods in natural language processing: system demonstrations. Association for Computational Linguistics, Brussels, Belgium; 2018. p. 66\u201371. https:\/\/doi.org\/10.18653\/v1\/D18-2012. https:\/\/www.aclweb.org\/anthology\/D18-2012","DOI":"10.18653\/v1\/D18-2012"},{"key":"723_CR141","doi-asserted-by":"publisher","unstructured":"Kudo T. Subword regularization: Improving neural network translation models with multiple subword candidates. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 1: long papers), Melbourne, Australia; 2018. p. 66\u201375. https:\/\/doi.org\/10.18653\/v1\/P18-1007. https:\/\/www.aclweb.org\/anthology\/P18-1007","DOI":"10.18653\/v1\/P18-1007"},{"key":"723_CR142","doi-asserted-by":"crossref","unstructured":"Klein G, Kim Y, Deng Y, Senellart J, Rush AM. OpenNMT: open-source toolkit for neural machine translation. CoRR arXiv:abs\/1701.02810; 2017.","DOI":"10.18653\/v1\/P17-4012"},{"issue":"2","key":"723_CR143","first-page":"101","volume":"7","author":"S Jha","year":"2019","unstructured":"Jha S, Sudhakar A, Singh AK. Learning cross-lingual phonological and orthagraphic adaptations: a case study in improving neural machine translation between low-resource languages. J Language Model. 2019;7(2):101\u201342.","journal-title":"J Language Model."},{"key":"723_CR144","doi-asserted-by":"publisher","unstructured":"Bhattacharyya P, Khapra MM, Kunchukuttan A. Statistical machine translation between related languages. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: tutorial abstracts, association for computational linguistics, San Diego, California; 2016. p. 17\u201320, https:\/\/doi.org\/10.18653\/v1\/N16-4006. URL https:\/\/www.aclweb.org\/anthology\/N16-4006","DOI":"10.18653\/v1\/N16-4006"},{"key":"723_CR145","doi-asserted-by":"publisher","unstructured":"Gr\u00f6nroos SA, Virpioja S, Kurimo M. Cognate-aware morphological segmentation for multilingual neural translation. In: Proceedings of the third conference on machine translation: shared task papers, association for computational linguistics, Belgium, Brussels; 2018. p. 386\u2013393. https:\/\/doi.org\/10.18653\/v1\/W18-6410. https:\/\/www.aclweb.org\/anthology\/W18-6410","DOI":"10.18653\/v1\/W18-6410"},{"key":"723_CR146","doi-asserted-by":"crossref","unstructured":"Cherry C, Suzuki H. Discriminative substring decoding for transliteration. In: Proceedings of the 2009 conference on empirical methods in natural language processing, association for computational linguistics; 2009. p. 1066\u20131075. http:\/\/www.aclweb.org\/anthology\/D09-1111","DOI":"10.3115\/1699648.1699652"},{"key":"723_CR147","unstructured":"Bhat RA, Bhat IA, Jain N, Sharma DM. A house united: bridging the script and lexical barrier between Hindi and Urdu. In: COLING 2016, 26th international conference on computational linguistics. Proceedings of the conference: technical papers, December 11\u201316, 2016, Osaka, Japan; 2016. p. 397\u2013408. http:\/\/aclweb.org\/anthology\/C\/C16\/C16-1039.pdf"},{"key":"723_CR148","doi-asserted-by":"publisher","unstructured":"Papineni K, Roukos S, Ward T, Zhu WJ. BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting of the association for computational linguistics. Association for computational linguistics, Philadelphia, Pennsylvania, USA; 2002. p. 311\u2013318. https:\/\/doi.org\/10.3115\/1073083.1073135. https:\/\/www.aclweb.org\/anthology\/P02-1040","DOI":"10.3115\/1073083.1073135"},{"key":"723_CR149","doi-asserted-by":"crossref","unstructured":"Kunchukuttan A, Khapra M, Singh G, Bhattacharyya P. Leveraging orthographic similarity for multilingual neural transliteration. Trans Assoc Comput Linguistics 2018;6:303\u2013316. http:\/\/aclweb.org\/anthology\/Q18-1022","DOI":"10.1162\/tacl_a_00022"},{"key":"723_CR150","doi-asserted-by":"crossref","unstructured":"Baniata LH, Park S. Park SB. A neural machine translation model for Arabic dialects that utilizes multitask learning (MTL). Comput Intell Neurosci. 2018.","DOI":"10.1155\/2018\/7534712"},{"key":"723_CR151","unstructured":"Halpern J. Very large-scale lexical resources to enhance Chinese and Japanese machine translation. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018). European Language Resources Association (ELRA), Miyazaki, Japan; 2018. https:\/\/www.aclweb.org\/anthology\/L18-1137"},{"key":"723_CR152","unstructured":"Ugawa A, Tamura A, Ninomiya T, Takamura H, Okumura M. Neural machine translation incorporating named entity. In: Proceedings of the 27th international conference on computational linguistics. Association for computational linguistics, Santa Fe, New Mexico, USA; 2018. p. 3240\u20133250. https:\/\/www.aclweb.org\/anthology\/C18-1274"},{"key":"723_CR153","unstructured":"Birch A, Haddow B, Tito I, Barone AVM, Bawden R, S\u00e1nchez-Mart\u00ednez F, Forcada ML, Espl\u00e0-Gomis M, S\u00e1nchez-Cartagena V, P\u00e9rez-Ortiz JA, Aziz W, Secker A, van\u00a0der Kreeft P. Global under-resourced media translation (GoURMET). In: Proceedings of machine translation summit XVII volume 2: translator, project and user tracks. European Association for Machine Translation, Dublin, Ireland; 2019. p. 122\u2013122. https:\/\/www.aclweb.org\/anthology\/W19-6723"},{"key":"723_CR154","unstructured":"Chakravarthi BR, Jose N, Suryawanshi S, Sherly E, McCrae JP (2020) A sentiment analysis dataset for code-mixed Malayalam-English. In: Proceedings of the 1st joint workshop of SLTU (Spoken Language Technologies for Under-resourced languages) and CCURL (Collaboration and Computing for Under-Resourced Languages) (SLTU-CCURL). European Language Resources Association (ELRA). France: Marseille; 2020."},{"key":"723_CR155","unstructured":"Chakravarthi BR, Muralidaran V, Priyadharshini R, McCrae JP (2020b) Corpus creation for sentiment analysis in code-mixed Tamil-English text. In: Proceedings of the 1st joint workshop of SLTU (Spoken Language Technologies for Under-resourced languages) and CCURL (Collaboration and Computing for Under-Resourced Languages) (SLTU-CCURL). European Language Resources Association (ELRA). France: Marseille; 2020."},{"key":"723_CR156","doi-asserted-by":"crossref","unstructured":"Jose N, Chakravarthi BR, Suryawanshi S, Sherly E, McCrae JP. A survey of current datasets for code-switching research. In: 2020 6th international conference on advanced computing and communication systems (ICACCS); 2020.","DOI":"10.1109\/ICACCS48705.2020.9074205"},{"key":"723_CR157","doi-asserted-by":"crossref","unstructured":"Priyadharshini R, Chakravarthi BR, Vegupatti M, McCrae JP. Named entity recognition for code-mixed Indian corpus using meta embedding. In: 2020 6th international conference on advanced computing and communication systems (ICACCS); 2020.","DOI":"10.1109\/ICACCS48705.2020.9074379"},{"key":"723_CR158","doi-asserted-by":"publisher","unstructured":"Ranjan P, Raja B, Priyadharshini R, Balabantaray RC. A comparative study on code-mixed data of Indian social media vs formal text. In: 2016 2nd international conference on contemporary computing and informatics (IC3I); 2016. p. 608\u2013611. https:\/\/doi.org\/10.1109\/IC3I.2016.7918035","DOI":"10.1109\/IC3I.2016.7918035"},{"key":"723_CR159","unstructured":"Tiedemann J. Synchronizing translated movie subtitles. In: Proceedings of the sixth international conference on language resources and evaluation (LREC\u201908). European Language Resources Association (ELRA), Marrakech, Morocco; 2008. http:\/\/www.lrec-conf.org\/proceedings\/lrec2008\/pdf\/484_paper.pdf"},{"key":"723_CR160","doi-asserted-by":"publisher","unstructured":"Fadaee M, Bisazza A, Monz C. Data augmentation for low-resource neural machine translation. In: Proceedings of the 55th annual meeting of the association for computational linguistics (volume 2: short papers), Vancouver, Canada; 2017. p. 567\u2013573. https:\/\/doi.org\/10.18653\/v1\/P17-2090. https:\/\/www.aclweb.org\/anthology\/P17-2090","DOI":"10.18653\/v1\/P17-2090"},{"key":"723_CR161","doi-asserted-by":"publisher","unstructured":"Li Z, Specia L. Improving neural machine translation robustness via data augmentation: Beyond back-translation. In: Proceedings of the 5th workshop on noisy user-generated text (W-NUT 2019). Association for computational linguistics, Hong Kong, China; 2019. p. 328\u2013336, https:\/\/doi.org\/10.18653\/v1\/D19-5543. https:\/\/www.aclweb.org\/anthology\/D19-5543","DOI":"10.18653\/v1\/D19-5543"},{"key":"723_CR162","doi-asserted-by":"publisher","unstructured":"Song K, Zhang Y, Yu H, Luo W, Wang K, Zhang M. Code-switching for enhancing NMT with pre-specified translation. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). Association for computational linguistics, Minneapolis, Minnesota; 2019. p. 449\u2013459. https:\/\/doi.org\/10.18653\/v1\/N19-1044. https:\/\/www.aclweb.org\/anthology\/N19-1044","DOI":"10.18653\/v1\/N19-1044"},{"key":"723_CR163","doi-asserted-by":"publisher","unstructured":"Dou Q, Vaswani A, Knight K. Beyond parallel data: Joint word alignment and decipherment improves machine translation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Association for computational linguistics, Doha, Qatar; 2014. p. 557\u2013565. https:\/\/doi.org\/10.3115\/v1\/D14-1061. https:\/\/www.aclweb.org\/anthology\/D14-1061","DOI":"10.3115\/v1\/D14-1061"},{"key":"723_CR164","unstructured":"Koehn P, Knight K. Estimating word translation probabilities from unrelated monolingual corpora using the em algorithm. In: Proceedings of the seventeenth national conference on artificial intelligence and twelfth conference on innovative applications of artificial intelligence. AAAI Press; 2000. p. 711\u2013715."},{"key":"723_CR165","unstructured":"Ravi S, Knight K. Deciphering foreign language. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies. Association for computational linguistics, Portland, Oregon, USA; 2011. p. 12\u201321. https:\/\/www.aclweb.org\/anthology\/P11-1002"},{"key":"723_CR166","doi-asserted-by":"publisher","unstructured":"Artetxe M, Labaka G, Agirre E. Unsupervised statistical machine translation. In: Proceedings of the 2018 conference on empirical methods in natural language processing. Association for computational linguistics, Brussels, Belgium; 2018. p. 3632\u20133642, https:\/\/doi.org\/10.18653\/v1\/D18-1399. https:\/\/www.aclweb.org\/anthology\/D18-1399","DOI":"10.18653\/v1\/D18-1399"},{"key":"723_CR167","unstructured":"Klementiev A, Irvine A, Callison-Burch C, Yarowsky D. Toward statistical machine translation without parallel corpora. In: Proceedings of the 13th conference of the European chapter of the association for computational linguistics. Association for computational linguistics, Avignon, France; 2012. p. 130\u2013140. https:\/\/www.aclweb.org\/anthology\/E12-1014"},{"key":"723_CR168","doi-asserted-by":"crossref","unstructured":"Artetxe M, Labaka G, Agirre E, Cho K. Unsupervised neural machine translation. In: Proceedings of the sixth international conference on learning representations; 2018.","DOI":"10.18653\/v1\/D18-1399"},{"key":"723_CR169","unstructured":"Rosner M, Sultana K. Automatic methods for the extension of a bilingual dictionary using comparable corpora. In: Proceedings of the ninth international conference on language resources and evaluation (LREC\u201914). European Language Resources Association (ELRA), Reykjavik, Iceland; 2014. p. 3790\u20133797. http:\/\/www.lrec-conf.org\/proceedings\/lrec2014\/pdf\/1169_Paper.pdf"},{"key":"723_CR170","doi-asserted-by":"publisher","unstructured":"Turcato D. Automatically creating bilingual lexicons for machine translation from bilingual text. In: 36th annual meeting of the association for computational linguistics and 17th international conference on computational linguistics, volume 2, association for computational linguistics, Montreal, Quebec, Canada; 1998. p. 1299\u20131306, https:\/\/doi.org\/10.3115\/980691.980781. https:\/\/www.aclweb.org\/anthology\/P98-2212","DOI":"10.3115\/980691.980781"},{"key":"723_CR171","unstructured":"Haghighi A, Liang P, Berg-Kirkpatrick T, Klein D. Learning bilingual lexicons from monolingual corpora. In: Proceedings of ACL-08: HLT, association for computational linguistics, Columbus, Ohio; 2008. p. 771\u2013779. https:\/\/www.aclweb.org\/anthology\/P08-1088"},{"key":"723_CR172","unstructured":"Berg-Kirkpatrick T, Bouchard-C\u00f4t\u00e9 A, DeNero J, Klein D. Painless unsupervised learning with features. In: Human language technologies: the 2010 annual conference of the North American Chapter of the Association for computational linguistics, Los Angeles, California; 2010. p. 582\u2013590. https:\/\/www.aclweb.org\/anthology\/N10-1083"},{"key":"723_CR173","unstructured":"Dyer C, Clark JH, Lavie A, Smith NA. Unsupervised word alignment with arbitrary features. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies. Association for computational linguistics, Portland, Oregon, USA; 2011. p. 409\u2013419. https:\/\/www.aclweb.org\/anthology\/P11-1042"},{"key":"723_CR174","doi-asserted-by":"crossref","unstructured":"Hauer B, Nicolai G, Kondrak G. Bootstrapping unsupervised bilingual lexicon induction. In: Proceedings of the 15th conference of the European Chapter of the Association for Computational Linguistics: volume 2, short papers, Valencia, Spain; 2017. p. 619\u2013624. https:\/\/www.aclweb.org\/anthology\/E17-2098","DOI":"10.18653\/v1\/E17-2098"},{"key":"723_CR175","doi-asserted-by":"publisher","unstructured":"Riley P, Gildea D. Orthographic features for bilingual lexicon induction. In: Proceedings of the 56th annual meeting of the association for computational linguistics (volume 2: short papers). Association for computational linguistics, Melbourne, Australia; 2018. p. 390\u2013394. https:\/\/doi.org\/10.18653\/v1\/P18-2062. https:\/\/www.aclweb.org\/anthology\/P18-2062","DOI":"10.18653\/v1\/P18-2062"},{"key":"723_CR176","unstructured":"Chu C, Nakazawa T, Kurohashi S. Improving statistical machine translation accuracy using bilingual lexicon extraction with paraphrases. In: Proceedings of the 28th Pacific Asia conference on language, information and computing, Department of Linguistics, Chulalongkorn University, Phuket, Thailand; 2014. p. 262\u2013271. URL https:\/\/www.aclweb.org\/anthology\/Y14-1032"},{"key":"723_CR177","unstructured":"Dou Q, Knight K. Dependency-based decipherment for resource-limited machine translation. In: Proceedings of the 2013 conference on empirical methods in natural language processing. Association for computational linguistics, Seattle, Washington, USA; 2013. p. 1668\u20131676. https:\/\/www.aclweb.org\/anthology\/D13-1173"},{"key":"723_CR178","doi-asserted-by":"publisher","unstructured":"Bloodgood M, Strauss B. Acquisition of translation lexicons for historically unwritten languages via bridging loanwords. In: Proceedings of the 10th workshop on building and using comparable Corpora, association for computational linguistics; 2017. p. 21\u201325. https:\/\/doi.org\/10.18653\/v1\/W17-2504. http:\/\/aclweb.org\/anthology\/W17-2504","DOI":"10.18653\/v1\/W17-2504"}],"container-title":["SN Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-021-00723-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s42979-021-00723-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-021-00723-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,1]],"date-time":"2024-09-01T11:05:08Z","timestamp":1725188708000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s42979-021-00723-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,7]]},"references-count":178,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,7]]}},"alternative-id":["723"],"URL":"https:\/\/doi.org\/10.1007\/s42979-021-00723-4","relation":{},"ISSN":["2662-995X","2661-8907"],"issn-type":[{"value":"2662-995X","type":"print"},{"value":"2661-8907","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,6,7]]},"assertion":[{"value":"13 August 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 May 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 June 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"On behalf of all authors, the corresponding author states that there is no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest:"}}],"article-number":"330"}}