{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:22:03Z","timestamp":1750220523479,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":55,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,1,2]],"date-time":"2021-01-02T00:00:00Z","timestamp":1609545600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,1,2]]},"DOI":"10.1145\/3430984.3431026","type":"proceedings-article","created":{"date-parts":[[2020,12,28]],"date-time":"2020-12-28T05:34:44Z","timestamp":1609133684000},"page":"178-187","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Revisiting Low Resource Status of Indian Languages in Machine Translation"],"prefix":"10.1145","author":[{"given":"Jerin","family":"Philip","sequence":"first","affiliation":[{"name":"IIIT Hyderabad"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shashank","family":"Siripragada","sequence":"additional","affiliation":[{"name":"IIIT Hyderabad"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vinay P.","family":"Namboodiri","sequence":"additional","affiliation":[{"name":"IIT Kanpur"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"C. V.","family":"Jawahar","sequence":"additional","affiliation":[{"name":"IIIT Hyderabad"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,1,2]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"crossref","unstructured":"Roee Aharoni Melvin Johnson and Orhan Firat. 2019. Massively Multilingual Neural Machine Translation. arXiv preprint arXiv:1903.00089(2019).  Roee Aharoni Melvin Johnson and Orhan Firat. 2019. Massively Multilingual Neural Machine Translation. arXiv preprint arXiv:1903.00089(2019).","DOI":"10.18653\/v1\/N19-1388"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1629175.1629184"},{"key":"e_1_3_2_2_3_1","volume-title":"5th International Conference on Learning Representations, ICLR","author":"Arora Sanjeev","year":"2019","unstructured":"Sanjeev Arora , Yingyu Liang , and Tengyu Ma . 2019 . A simple but tough-to-beat baseline for sentence embeddings . In 5th International Conference on Learning Representations, ICLR 2017. Sanjeev Arora, Yingyu Liang, and Tengyu Ma. 2019. A simple but tough-to-beat baseline for sentence embeddings. In 5th International Conference on Learning Representations, ICLR 2017."},{"key":"e_1_3_2_2_4_1","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 4555\u20134567","author":"Ba\u00f1\u00f3n Marta","year":"2020","unstructured":"Marta Ba\u00f1\u00f3n , Pinzhen Chen , Barry Haddow , Kenneth Heafield , Hieu Hoang , Miquel Espl\u00e0-Gomis , Mikel\u00a0 L. Forcada , Amir Kamran , Faheem Kirefu , Philipp Koehn , Sergio Ortiz\u00a0Rojas , Leopoldo Pla\u00a0Sempere , Gema Ram\u00edrez-S\u00e1nchez , Elsa Sarr\u00edas , Marek Strelec , Brian Thompson , William Waites , Dion Wiggins , and Jaume Zaragoza . 2020 . ParaCrawl: Web-Scale Acquisition of Parallel Corpora . In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 4555\u20134567 . https:\/\/www.aclweb.org\/anthology\/2020.acl-main.417 Marta Ba\u00f1\u00f3n, Pinzhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Espl\u00e0-Gomis, Mikel\u00a0L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz\u00a0Rojas, Leopoldo Pla\u00a0Sempere, Gema Ram\u00edrez-S\u00e1nchez, Elsa Sarr\u00edas, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins, and Jaume Zaragoza. 2020. ParaCrawl: Web-Scale Acquisition of Parallel Corpora. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 4555\u20134567. https:\/\/www.aclweb.org\/anthology\/2020.acl-main.417"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1165"},{"key":"e_1_3_2_2_6_1","volume-title":"Findings of the 2019 conference on machine translation (wmt19). In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1). 1\u201361","author":"Barrault Lo\u00efc","year":"2019","unstructured":"Lo\u00efc Barrault , Ond\u0159ej Bojar , Marta\u00a0 R Costa-Juss\u00e0 , Christian Federmann , Mark Fishel , Yvette Graham , Barry Haddow , Matthias Huck , Philipp Koehn , Shervin Malmasi , 2019 . Findings of the 2019 conference on machine translation (wmt19). In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1). 1\u201361 . Lo\u00efc Barrault, Ond\u0159ej Bojar, Marta\u00a0R Costa-Juss\u00e0, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, 2019. Findings of the 2019 conference on machine translation (wmt19). In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1). 1\u201361."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-4006"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-2365"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"crossref","unstructured":"Raj Dabre Chenhui Chu and Anoop Kunchukuttan. 2020. A Comprehensive Survey of Multilingual Neural Machine Translation. arXiv preprint arXiv:2001.01115(2020).  Raj Dabre Chenhui Chu and Anoop Kunchukuttan. 2020. A Comprehensive Survey of Multilingual Neural Machine Translation. arXiv preprint arXiv:2001.01115(2020).","DOI":"10.18653\/v1\/2020.coling-tutorials.3"},{"volume-title":"Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation","author":"Dabre Raj","key":"e_1_3_2_2_10_1","unstructured":"Raj Dabre , Anoop Kunchukuttan , Atsushi Fujita , and Eiichiro Sumita . 2018. NICT\u2019s Participation in WAT 2018: Approaches Using Multilingualism and Recurrently Stacked Layers . In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation . Association for Computational Linguistics , Hong Kong . https:\/\/www.aclweb.org\/anthology\/Y18-3003 Raj Dabre, Anoop Kunchukuttan, Atsushi Fujita, and Eiichiro Sumita. 2018. NICT\u2019s Participation in WAT 2018: Approaches Using Multilingualism and Recurrently Stacked Layers. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation. Association for Computational Linguistics, Hong Kong. https:\/\/www.aclweb.org\/anthology\/Y18-3003"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1045"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/972450.972455"},{"key":"e_1_3_2_2_13_1","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, Online, 162\u2013168","author":"Goyal Vikrant","year":"2020","unstructured":"Vikrant Goyal , Sourav Kumar , and Dipti\u00a0Misra Sharma . 2020 . Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages . In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, Online, 162\u2013168 . https:\/\/www.aclweb.org\/anthology\/2020.acl-srw.22 Vikrant Goyal, Sourav Kumar, and Dipti\u00a0Misra Sharma. 2020. Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, Online, 162\u2013168. https:\/\/www.aclweb.org\/anthology\/2020.acl-srw.22"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-5216"},{"key":"e_1_3_2_2_15_1","unstructured":"Barry Haddow and Faheem Kirefu. 2020. PMIndia\u2013A Collection of Parallel Corpora of Languages of India. arXiv preprint arXiv:2001.09907(2020).  Barry Haddow and Faheem Kirefu. 2020. PMIndia\u2013A Collection of Parallel Corpora of Languages of India. arXiv preprint arXiv:2001.09907(2020)."},{"key":"e_1_3_2_2_16_1","volume-title":"Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC\u201910)","author":"Jha Girish\u00a0Nath","year":"2010","unstructured":"Girish\u00a0Nath Jha . 2010 . The TDIL Program and the Indian Langauge Corpora Intitiative (ILCI) . In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC\u201910) . Girish\u00a0Nath Jha. 2010. The TDIL Program and the Indian Langauge Corpora Intitiative (ILCI). In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC\u201910)."},{"key":"e_1_3_2_2_17_1","volume-title":"Google\u2019s multilingual neural machine translation system: Enabling zero-shot translation. Transactions of ACL","author":"Johnson Melvin","year":"2017","unstructured":"Melvin Johnson , Mike Schuster , Quoc\u00a0 V Le , Maxim Krikun , Yonghui Wu , Zhifeng Chen , Nikhil Thorat , Fernanda Vi\u00e9gas , Martin Wattenberg , Greg Corrado , 2017. Google\u2019s multilingual neural machine translation system: Enabling zero-shot translation. Transactions of ACL ( 2017 ). Melvin Johnson, Mike Schuster, Quoc\u00a0V Le, Maxim Krikun, Yonghui Wu, Zhifeng Chen, Nikhil Thorat, Fernanda Vi\u00e9gas, Martin Wattenberg, Greg Corrado, 2017. Google\u2019s multilingual neural machine translation system: Enabling zero-shot translation. Transactions of ACL (2017)."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"crossref","unstructured":"Pratik Joshi Sebastin Santy Amar Budhiraja Kalika Bali and Monojit Choudhury. 2020. The state and fate of linguistic diversity and inclusion in the NLP world. arXiv preprint arXiv:2004.09095(2020).  Pratik Joshi Sebastin Santy Amar Budhiraja Kalika Bali and Monojit Choudhury. 2020. The state and fate of linguistic diversity and inclusion in the NLP world. arXiv preprint arXiv:2004.09095(2020).","DOI":"10.18653\/v1\/2020.acl-main.560"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"crossref","unstructured":"Divyanshu Kakwani Anoop Kunchukuttan Satish Golla Gokul N.C. Avik Bhattacharyya Mitesh\u00a0M. Khapra and Pratyush Kumar. 2020. IndicNLPSuite: Monolingual Corpora Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages. In Findings of EMNLP.  Divyanshu Kakwani Anoop Kunchukuttan Satish Golla Gokul N.C. Avik Bhattacharyya Mitesh\u00a0M. Khapra and Pratyush Kumar. 2020. IndicNLPSuite: Monolingual Corpora Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages. In Findings of EMNLP.","DOI":"10.18653\/v1\/2020.findings-emnlp.445"},{"key":"e_1_3_2_2_20_1","unstructured":"Yunsu Kim Miguel Gra\u00e7a and Hermann Ney. 2020. When and Why is Unsupervised Neural Machine Translation Useless?arXiv preprint arXiv:2004.10581(2020).  Yunsu Kim Miguel Gra\u00e7a and Hermann Ney. 2020. When and Why is Unsupervised Neural Machine Translation Useless?arXiv preprint arXiv:2004.10581(2020)."},{"key":"e_1_3_2_2_21_1","volume-title":"Europarl: A parallel corpus for statistical machine translation. Citeseer.","author":"Koehn Philipp","year":"2005","unstructured":"Philipp Koehn . 2005 . Europarl: A parallel corpus for statistical machine translation. Citeseer. Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. Citeseer."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"crossref","unstructured":"Philipp Koehn and Rebecca Knowles. 2017. Six challenges for neural machine translation. arXiv preprint arXiv:1706.03872(2017).  Philipp Koehn and Rebecca Knowles. 2017. Six challenges for neural machine translation. arXiv preprint arXiv:1706.03872(2017).","DOI":"10.18653\/v1\/W17-3204"},{"key":"e_1_3_2_2_23_1","volume-title":"Proceedings of the 27th ACM International Conference on Multimedia. 1428\u20131436","author":"Rudrabha Mukhopadhyay Prajwal KR","year":"2019","unstructured":"Prajwal KR , Rudrabha Mukhopadhyay , Jerin Philip , Abhishek Jha , Vinay Namboodiri , and CV Jawahar . 2019 . Towards automatic face-to-face translation . In Proceedings of the 27th ACM International Conference on Multimedia. 1428\u20131436 . Prajwal KR, Rudrabha Mukhopadhyay, Jerin Philip, Abhishek Jha, Vinay Namboodiri, and CV Jawahar. 2019. Towards automatic face-to-face translation. In Proceedings of the 27th ACM International Conference on Multimedia. 1428\u20131436."},{"key":"e_1_3_2_2_24_1","volume-title":"Sentencepiece: A simple and language independent subword tokenizer and detokenizer for neural text processing. arXiv preprint arXiv:1808.06226(2018).","author":"Kudo Taku","year":"2018","unstructured":"Taku Kudo and John Richardson . 2018 . Sentencepiece: A simple and language independent subword tokenizer and detokenizer for neural text processing. arXiv preprint arXiv:1808.06226(2018). Taku Kudo and John Richardson. 2018. Sentencepiece: A simple and language independent subword tokenizer and detokenizer for neural text processing. arXiv preprint arXiv:1808.06226(2018)."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"crossref","unstructured":"Anoop Kunchukuttan and Pushpak Bhattacharyya. 2016. Learning variable length units for SMT between related languages via Byte Pair Encoding. arXiv preprint arXiv:1610.06510(2016).  Anoop Kunchukuttan and Pushpak Bhattacharyya. 2016. Learning variable length units for SMT between related languages via Byte Pair Encoding. arXiv preprint arXiv:1610.06510(2016).","DOI":"10.18653\/v1\/W17-4102"},{"key":"e_1_3_2_2_26_1","unstructured":"Anoop Kunchukuttan Divyanshu Kakwani Satish Golla Avik Bhattacharyya Mitesh\u00a0M Khapra Pratyush Kumar 2020. AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages. arXiv preprint arXiv:2005.00085(2020).  Anoop Kunchukuttan Divyanshu Kakwani Satish Golla Avik Bhattacharyya Mitesh\u00a0M Khapra Pratyush Kumar 2020. AI4Bharat-IndicNLP Corpus: Monolingual Corpora and Word Embeddings for Indic Languages. arXiv preprint arXiv:2005.00085(2020)."},{"key":"e_1_3_2_2_27_1","unstructured":"Anoop Kunchukuttan Pratik Mehta and Pushpak Bhattacharyya. 2017. The iit bombay english-hindi parallel corpus. arXiv preprint arXiv:1710.02855(2017).  Anoop Kunchukuttan Pratik Mehta and Pushpak Bhattacharyya. 2017. The iit bombay english-hindi parallel corpus. arXiv preprint arXiv:1710.02855(2017)."},{"key":"e_1_3_2_2_28_1","volume-title":"Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914)","author":"Kunchukuttan Anoop","year":"2014","unstructured":"Anoop Kunchukuttan , Abhijit Mishra , Rajen Chatterjee , Ritesh Shah , and Pushpak Bhattacharyya . 2014 . Shata-Anuvadak: Tackling Multiway Translation of Indian Languages . In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914) . 1781\u20131787. Anoop Kunchukuttan, Abhijit Mishra, Rajen Chatterjee, Ritesh Shah, and Pushpak Bhattacharyya. 2014. Shata-Anuvadak: Tackling Multiway Translation of Indian Languages. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914). 1781\u20131787."},{"key":"e_1_3_2_2_29_1","unstructured":"Dmitry Lepikhin HyoukJoong Lee Yuanzhong Xu Dehao Chen Orhan Firat Yanping Huang Maxim Krikun Noam Shazeer and Zhifeng Chen. 2020. GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding. arXiv preprint arXiv:2006.16668(2020).  Dmitry Lepikhin HyoukJoong Lee Yuanzhong Xu Dehao Chen Orhan Firat Yanping Huang Maxim Krikun Noam Shazeer and Zhifeng Chen. 2020. GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding. arXiv preprint arXiv:2006.16668(2020)."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-5325"},{"key":"e_1_3_2_2_31_1","volume-title":"Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages","author":"Murthy Rudra","year":"1865","unstructured":"Rudra Murthy , Anoop Kunchukuttan , and Pushpak Bhattacharyya . 2019. Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages . Association for Computational Linguistics , Minneapolis, Minnesota , 3868\u20133873. https:\/\/doi.org\/10. 1865 3\/v1\/N19-1387 10.18653\/v1 Rudra Murthy, Anoop Kunchukuttan, and Pushpak Bhattacharyya. 2019. Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages. Association for Computational Linguistics, Minneapolis, Minnesota, 3868\u20133873. https:\/\/doi.org\/10.18653\/v1\/N19-1387"},{"key":"e_1_3_2_2_32_1","volume-title":"Overview of the 6th workshop on Asian translation. In Proceedings of the 6th Workshop on Asian Translation. 1\u201335","author":"Nakazawa Toshiaki","year":"2019","unstructured":"Toshiaki Nakazawa , Nobushige Doi , Shohei Higashiyama , Chenchen Ding , Raj Dabre , Hideya Mino , Isao Goto , Win\u00a0Pa Pa , Anoop Kunchukuttan , Shantipriya Parida , 2019 . Overview of the 6th workshop on Asian translation. In Proceedings of the 6th Workshop on Asian Translation. 1\u201335 . Toshiaki Nakazawa, Nobushige Doi, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win\u00a0Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, 2019. Overview of the 6th workshop on Asian translation. In Proceedings of the 6th Workshop on Asian Translation. 1\u201335."},{"key":"e_1_3_2_2_33_1","volume-title":"Proceedings of the 4th Workshop on Asian Translation (WAT2017)","author":"Nakazawa Toshiaki","year":"2017","unstructured":"Toshiaki Nakazawa , Shohei Higashiyama , Chenchen Ding , Hideya Mino , Isao Goto , Hideto Kazawa , Yusuke Oda , Graham Neubig , and Sadao Kurohashi . 2017 . Overview of the 4th Workshop on Asian Translation . In Proceedings of the 4th Workshop on Asian Translation (WAT2017) . 1\u201354. Toshiaki Nakazawa, Shohei Higashiyama, Chenchen Ding, Hideya Mino, Isao Goto, Hideto Kazawa, Yusuke Oda, Graham Neubig, and Sadao Kurohashi. 2017. Overview of the 4th Workshop on Asian Translation. In Proceedings of the 4th Workshop on Asian Translation (WAT2017). 1\u201354."},{"volume-title":"Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation","author":"Nakazawa Toshiaki","key":"e_1_3_2_2_34_1","unstructured":"Toshiaki Nakazawa , Katsuhito Sudoh , Shohei Higashiyama , Chenchen Ding , Raj Dabre , Hideya Mino , Isao Goto , Win\u00a0Pa Pa , Anoop Kunchukuttan , and Sadao Kurohashi . 2018. Overview of the 5th Workshop on Asian Translation . In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation . Association for Computational Linguistics , Hong Kong . https:\/\/www.aclweb.org\/anthology\/Y18-3001 Toshiaki Nakazawa, Katsuhito Sudoh, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win\u00a0Pa Pa, Anoop Kunchukuttan, and Sadao Kurohashi. 2018. Overview of the 5th Workshop on Asian Translation. In Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation. Association for Computational Linguistics, Hong Kong. https:\/\/www.aclweb.org\/anthology\/Y18-3001"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-5333"},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"crossref","unstructured":"Myle Ott Sergey Edunov Alexei Baevski Angela Fan Sam Gross Nathan Ng David Grangier and Michael Auli. 2019. fairseq: A Fast Extensible Toolkit for Sequence Modeling. In NAACL (Demonstrations).  Myle Ott Sergey Edunov Alexei Baevski Angela Fan Sam Gross Nathan Ng David Grangier and Michael Auli. 2019. fairseq: A Fast Extensible Toolkit for Sequence Modeling. In NAACL (Demonstrations).","DOI":"10.18653\/v1\/N19-4009"},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W18-6301"},{"key":"e_1_3_2_2_38_1","volume-title":"Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311\u2013318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni , Salim Roukos , Todd Ward , and Wei-Jing Zhu . 2002 . BLEU: a method for automatic evaluation of machine translation . In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311\u2013318 . Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311\u2013318."},{"volume-title":"Smart Intelligent Computing and Applications","author":"Parida Shantipriya","key":"e_1_3_2_2_39_1","unstructured":"Shantipriya Parida , Ond\u0159ej Bojar , and Satya\u00a0Ranjan Dash . 2020. OdiEnCorp: Odia\u2013English and Odia-Only Corpus for Machine Translation . In Smart Intelligent Computing and Applications . Springer , 495\u2013504. Shantipriya Parida, Ond\u0159ej Bojar, and Satya\u00a0Ranjan Dash. 2020. OdiEnCorp: Odia\u2013English and Odia-Only Corpus for Machine Translation. In Smart Intelligent Computing and Applications. Springer, 495\u2013504."},{"key":"e_1_3_2_2_40_1","unstructured":"Jerin Philip Vinay\u00a0P. Namboodiri and C.V. Jawahar. 2018. CVIT-MT Systems for WAT-2018. In Proceedings of the 32nd Pacific Asia Conference on Language Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation. Association for Computational Linguistics Hong Kong. https:\/\/www.aclweb.org\/anthology\/Y18-3010  Jerin Philip Vinay\u00a0P. Namboodiri and C.V. Jawahar. 2018. CVIT-MT Systems for WAT-2018. In Proceedings of the 32nd Pacific Asia Conference on Language Information and Computation: 5th Workshop on Asian Translation: 5th Workshop on Asian Translation. Association for Computational Linguistics Hong Kong. https:\/\/www.aclweb.org\/anthology\/Y18-3010"},{"key":"e_1_3_2_2_41_1","unstructured":"Jerin Philip Vinay\u00a0P Namboodiri and CV Jawahar. 2019. A baseline neural machine translation system for indian languages. arXiv preprint arXiv:1907.12437(2019).  Jerin Philip Vinay\u00a0P Namboodiri and CV Jawahar. 2019. A baseline neural machine translation system for indian languages. arXiv preprint arXiv:1907.12437(2019)."},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"crossref","unstructured":"Matt Post. 2018. A call for clarity in reporting BLEU scores. arXiv preprint arXiv:1804.08771(2018).  Matt Post. 2018. A call for clarity in reporting BLEU scores. arXiv preprint arXiv:1804.08771(2018).","DOI":"10.18653\/v1\/W18-6319"},{"key":"e_1_3_2_2_43_1","volume-title":"Proceedings of the Seventh Workshop on Statistical Machine Translation. Association for Computational Linguistics, 401\u2013409","author":"Post Matt","year":"2012","unstructured":"Matt Post , Chris Callison-Burch , and Miles Osborne . 2012 . Constructing parallel corpora for six indian languages via crowdsourcing . In Proceedings of the Seventh Workshop on Statistical Machine Translation. Association for Computational Linguistics, 401\u2013409 . Matt Post, Chris Callison-Burch, and Miles Osborne. 2012. Constructing parallel corpora for six indian languages via crowdsourcing. In Proceedings of the Seventh Workshop on Statistical Machine Translation. Association for Computational Linguistics, 401\u2013409."},{"key":"e_1_3_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-2025"},{"key":"e_1_3_2_2_45_1","volume-title":"Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012)","author":"Ramasamy Loganathan","year":"2012","unstructured":"Loganathan Ramasamy , Ond\u0159ej Bojar , and Zden\u011bk \u017dabokrtsk\u00fd . 2012 . Morphological Processing for English-Tamil Statistical Machine Translation . In Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012) . 113\u2013122. Loganathan Ramasamy, Ond\u0159ej Bojar, and Zden\u011bk \u017dabokrtsk\u00fd. 2012. Morphological Processing for English-Tamil Statistical Machine Translation. In Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012). 113\u2013122."},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFRKM.2018.8464781"},{"key":"e_1_3_2_2_47_1","volume-title":"Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia. arXiv preprint arXiv:1907.05791(2019).","author":"Schwenk Holger","year":"2019","unstructured":"Holger Schwenk , Vishrav Chaudhary , Shuo Sun , Hongyu Gong , and Francisco Guzm\u00e1n . 2019 . Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia. arXiv preprint arXiv:1907.05791(2019). Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, and Francisco Guzm\u00e1n. 2019. Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia. arXiv preprint arXiv:1907.05791(2019)."},{"key":"e_1_3_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1009"},{"key":"e_1_3_2_2_49_1","volume-title":"The Ninth Conference of the Association for Machine Translation in the Americas (AMTA","author":"Sennrich Rico","year":"2010","unstructured":"Rico Sennrich and Martin Volk . 2010 . MT-based sentence alignment for OCR-generated parallel texts . In The Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010). Rico Sennrich and Martin Volk. 2010. MT-based sentence alignment for OCR-generated parallel texts. In The Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010)."},{"key":"e_1_3_2_2_50_1","volume-title":"Proceedings of the 18th Nordic Conference of Computational Linguistics (NODALIDA","author":"Sennrich Rico","year":"2011","unstructured":"Rico Sennrich and Martin Volk . 2011 . Iterative, MT-based sentence alignment of parallel texts . In Proceedings of the 18th Nordic Conference of Computational Linguistics (NODALIDA 2011). 175\u2013182. Rico Sennrich and Martin Volk. 2011. Iterative, MT-based sentence alignment of parallel texts. In Proceedings of the 18th Nordic Conference of Computational Linguistics (NODALIDA 2011). 175\u2013182."},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1021"},{"key":"e_1_3_2_2_52_1","unstructured":"Karanveer Singh and Pushpak Bhattacharyya. 2019. NMT in Low Resource Scenario : A Case Study in Indian Languages.  Karanveer Singh and Pushpak Bhattacharyya. 2019. NMT in Low Resource Scenario : A Case Study in Indian Languages."},{"key":"e_1_3_2_2_53_1","volume-title":"Proceedings of The 12th Language Resources and Evaluation Conference. European Language Resources Association","author":"Siripragada Shashank","year":"2020","unstructured":"Shashank Siripragada , Jerin Philip , Vinay\u00a0 P. Namboodiri , and C\u00a0V Jawahar . 2020 . A Multilingual Parallel Corpora Collection Effort for Indian Languages . In Proceedings of The 12th Language Resources and Evaluation Conference. European Language Resources Association , Marseille, France, 3743\u20133751. https:\/\/www.aclweb.org\/anthology\/ 2020.lrec-1.462 Shashank Siripragada, Jerin Philip, Vinay\u00a0P. Namboodiri, and C\u00a0V Jawahar. 2020. A Multilingual Parallel Corpora Collection Effort for Indian Languages. In Proceedings of The 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 3743\u20133751. https:\/\/www.aclweb.org\/anthology\/2020.lrec-1.462"},{"key":"e_1_3_2_2_54_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In NIPS.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In NIPS."},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01231-1_29"}],"event":{"name":"CODS COMAD 2021: 8th ACM IKDD CODS and 26th COMAD","acronym":"CODS COMAD 2021","location":"Bangalore India"},"container-title":["Proceedings of the 3rd ACM India Joint International Conference on Data Science &amp; Management of Data (8th ACM IKDD CODS &amp; 26th COMAD)"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3430984.3431026","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3430984.3431026","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:44Z","timestamp":1750195484000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3430984.3431026"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,2]]},"references-count":55,"alternative-id":["10.1145\/3430984.3431026","10.1145\/3430984"],"URL":"https:\/\/doi.org\/10.1145\/3430984.3431026","relation":{},"subject":[],"published":{"date-parts":[[2021,1,2]]},"assertion":[{"value":"2021-01-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}