{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,8]],"date-time":"2025-09-08T06:00:05Z","timestamp":1757311205153,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,4,25]],"date-time":"2022-04-25T00:00:00Z","timestamp":1650844800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100008628","name":"Ministry of Electronics and Information technology","doi-asserted-by":"publisher","award":["11(6)\/2019-HCC (TDIL)"],"award-info":[{"award-number":["11(6)\/2019-HCC (TDIL)"]}],"id":[{"id":"10.13039\/501100008628","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,4,25]]},"DOI":"10.1145\/3487553.3524265","type":"proceedings-article","created":{"date-parts":[[2022,8,16]],"date-time":"2022-08-16T22:41:30Z","timestamp":1660689690000},"page":"171-175","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages"],"prefix":"10.1145","author":[{"given":"Tushar","family":"Abhishek","sequence":"first","affiliation":[{"name":"Information Retrieval and Extraction Lab, IIIT Hyderabad, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shivprasad","family":"Sagare","sequence":"additional","affiliation":[{"name":"Information Retrieval and Extraction Lab, IIIT Hyderabad, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bhavyajeet","family":"Singh","sequence":"additional","affiliation":[{"name":"Information Retrieval and Extraction Lab, IIIT Hyderabad, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anubhav","family":"Sharma","sequence":"additional","affiliation":[{"name":"Information Retrieval and Extraction Lab, IIIT Hyderabad, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Manish","family":"Gupta","sequence":"additional","affiliation":[{"name":"Information Retrieval and Extraction Lab, IIIT Hyderabad, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vasudeva","family":"Varma","sequence":"additional","affiliation":[{"name":"Information Retrieval and Extraction Lab, IIIT Hyderabad, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,8,16]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"O Agarwal H Ge S Shakeri and R Al-Rfou. 2021. Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training. In NAACL-HLT. 3554\u20133565.  O Agarwal H Ge S Shakeri and R Al-Rfou. 2021. Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training. In NAACL-HLT. 3554\u20133565.","DOI":"10.18653\/v1\/2021.naacl-main.278"},{"key":"e_1_3_2_1_2_1","unstructured":"G. Attardi. 2015. WikiExtractor. https:\/\/github.com\/attardi\/wikiextractor.  G. Attardi. 2015. WikiExtractor. https:\/\/github.com\/attardi\/wikiextractor."},{"key":"e_1_3_2_1_3_1","volume-title":"LDC2010T16(2010)","author":"Bali K","year":"2010","unstructured":"K Bali , M Choudhury , and P Biswas . 2010 . Indian Language POS Tagset: Bengali. Linguistic Data Consortium , LDC2010T16(2010) . K Bali, M Choudhury, and P Biswas. 2010. Indian Language POS Tagset: Bengali. Linguistic Data Consortium, LDC2010T16(2010)."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"J\u00a0A Botha Z Shan and D Gillick. 2020. Entity Linking in 100 Languages. In EMNLP. 7833\u20137845.  J\u00a0A Botha Z Shan and D Gillick. 2020. Entity Linking in 100 Languages. In EMNLP. 7833\u20137845.","DOI":"10.18653\/v1\/2020.emnlp-main.630"},{"key":"e_1_3_2_1_5_1","volume-title":"WIKITABLET: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections. In ACL-IJCNLP Findings. 193\u2013209.","author":"Chen M","year":"2021","unstructured":"M Chen , S Wiseman , and K Gimpel . 2021 . WIKITABLET: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections. In ACL-IJCNLP Findings. 193\u2013209. M Chen, S Wiseman, and K Gimpel. 2021. WIKITABLET: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections. In ACL-IJCNLP Findings. 193\u2013209."},{"key":"e_1_3_2_1_6_1","volume-title":"KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation. arXiv:2010.02307","author":"Chen W","year":"2020","unstructured":"W Chen , Y Su , X Yan , and W\u00a0Y Wang . 2020 . KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation. arXiv:2010.02307 (2020). W Chen, Y Su, X Yan, and W\u00a0Y Wang. 2020. KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation. arXiv:2010.02307 (2020)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"A Conneau K Khandelwal N Goyal V Chaudhary G Wenzek F Guzm\u00e1n \u00c9 Grave M Ott L Zettlemoyer and V Stoyanov. 2020. Unsupervised Cross-lingual Representation Learning at Scale. In ACL. 8440\u20138451.  A Conneau K Khandelwal N Goyal V Chaudhary G Wenzek F Guzm\u00e1n \u00c9 Grave M Ott L Zettlemoyer and V Stoyanov. 2020. Unsupervised Cross-lingual Representation Learning at Scale. In ACL. 8440\u20138451.","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"e_1_3_2_1_8_1","unstructured":"D Duma and E Klein. 2013. Generating natural language from linked data: Unsupervised template extraction. In IWCS. 83\u201394.  D Duma and E Klein. 2013. Generating natural language from linked data: Unsupervised template extraction. In IWCS. 83\u201394."},{"key":"e_1_3_2_1_9_1","volume-title":"T-rex: A large scale alignment of natural language with knowledge base triples. In LREC.","author":"Elsahar H","year":"2018","unstructured":"H Elsahar , P Vougiouklis , A Remaci , C Gravier , J Hare , F Laforest , and E Simperl . 2018 . T-rex: A large scale alignment of natural language with knowledge base triples. In LREC. H Elsahar, P Vougiouklis, A Remaci, C Gravier, J Hare, F Laforest, and E Simperl. 2018. T-rex: A large scale alignment of natural language with knowledge base triples. In LREC."},{"key":"e_1_3_2_1_10_1","volume-title":"S Mille, D Moussallem, and A Shimorina.","author":"Ferreira T","year":"2020","unstructured":"T Ferreira , C Gardent , N Ilinykh , C van\u00a0der Lee , S Mille, D Moussallem, and A Shimorina. 2020 . The 2020 Bilingual, Bi-Directional WebNLG+ Shared Task : Overview and Evaluation Results. In WebNLG +. 55\u201376. T Ferreira, C Gardent, N Ilinykh, C van\u00a0der Lee, S Mille, D Moussallem, and A Shimorina. 2020. The 2020 Bilingual, Bi-Directional WebNLG+ Shared Task: Overview and Evaluation Results. In WebNLG+. 55\u201376."},{"key":"e_1_3_2_1_11_1","volume-title":"Partially-aligned data-to-text generation with distant supervision. arXiv:2010.01268","author":"Fu Z","year":"2020","unstructured":"Z Fu , B Shi , W Lam , L Bing , and Z Liu . 2020. Partially-aligned data-to-text generation with distant supervision. arXiv:2010.01268 ( 2020 ). Z Fu, B Shi, W Lam, L Bing, and Z Liu. 2020. Partially-aligned data-to-text generation with distant supervision. arXiv:2010.01268 (2020)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"C Gardent A Shimorina S Narayan and L Perez-Beltrachini. 2017. The WebNLG challenge: Generating text from RDF data. In INLG. 124\u2013133.  C Gardent A Shimorina S Narayan and L Perez-Beltrachini. 2017. The WebNLG challenge: Generating text from RDF data. In INLG. 124\u2013133.","DOI":"10.18653\/v1\/W17-3518"},{"key":"e_1_3_2_1_13_1","volume-title":"Genwiki: A dataset of 1.3 million content-sharing text and graphs for unsupervised graph-to-text generation. In COLING. 2398\u20132409.","author":"Jin Z","year":"2020","unstructured":"Z Jin , Q Guo , X Qiu , and Z Zhang . 2020 . Genwiki: A dataset of 1.3 million content-sharing text and graphs for unsupervised graph-to-text generation. In COLING. 2398\u20132409. Z Jin, Q Guo, X Qiu, and Z Zhang. 2020. Genwiki: A dataset of 1.3 million content-sharing text and graphs for unsupervised graph-to-text generation. In COLING. 2398\u20132409."},{"key":"e_1_3_2_1_14_1","volume-title":"Muril: Multilingual representations for indian languages. arXiv:2103.10730","author":"Khanuja S","year":"2021","unstructured":"S Khanuja , D Bansal , S Mehtani , S Khosla , A Dey , B Gopalan , D\u00a0K Margam , P Aggarwal , R\u00a0T Nagipogu , S Dave , 2021 . Muril: Multilingual representations for indian languages. arXiv:2103.10730 (2021). S Khanuja, D Bansal, S Mehtani, S Khosla, A Dey, B Gopalan, D\u00a0K Margam, P Aggarwal, R\u00a0T Nagipogu, S Dave, 2021. Muril: Multilingual representations for indian languages. arXiv:2103.10730 (2021)."},{"key":"e_1_3_2_1_15_1","unstructured":"K Kolluru M Rezk P Verga W\u00a0W Cohen and P Talukdar. 2021. Multilingual Fact Linking. In AKBC.  K Kolluru M Rezk P Verga W\u00a0W Cohen and P Talukdar. 2021. Multilingual Fact Linking. In AKBC."},{"key":"e_1_3_2_1_16_1","unstructured":"A Kunchukuttan. 2020. The IndicNLP Library. https:\/\/github.com\/anoopkunchukuttan\/indic_nlp_library\/blob\/master\/docs\/indicnlp.pdf.  A Kunchukuttan. 2020. The IndicNLP Library. https:\/\/github.com\/anoopkunchukuttan\/indic_nlp_library\/blob\/master\/docs\/indicnlp.pdf."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"R Lebret D Grangier and M Auli. 2016. Neural Text Generation from Structured Data with Application to the Biography Domain. In EMNLP. 1203\u20131213.  R Lebret D Grangier and M Auli. 2016. Neural Text Generation from Structured Data with Application to the Biography Domain. In EMNLP. 1203\u20131213.","DOI":"10.18653\/v1\/D16-1128"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"P Nema S Shetty P Jain A Laha K Sankaranarayanan and M\u00a0M Khapra. 2018. Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization. In NAACL-HLT. 1539\u20131550.  P Nema S Shetty P Jain A Laha K Sankaranarayanan and M\u00a0M Khapra. 2018. Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization. In NAACL-HLT. 1539\u20131550.","DOI":"10.18653\/v1\/N18-1139"},{"key":"e_1_3_2_1_19_1","volume-title":"The E2E dataset: New challenges for end-to-end generation. arXiv:1706.09254","author":"Novikova J","year":"2017","unstructured":"J Novikova , O Du\u0161ek , and V Rieser . 2017. The E2E dataset: New challenges for end-to-end generation. arXiv:1706.09254 ( 2017 ). J Novikova, O Du\u0161ek, and V Rieser. 2017. The E2E dataset: New challenges for end-to-end generation. arXiv:1706.09254 (2017)."},{"key":"e_1_3_2_1_20_1","volume-title":"IJCNLP Workshop on NLP for Less Privileged Languages.","author":"Patel C","year":"2008","unstructured":"C Patel and K Gali . 2008 . Part-of-speech tagging for Gujarati using conditional random fields . In IJCNLP Workshop on NLP for Less Privileged Languages. C Patel and K Gali. 2008. Part-of-speech tagging for Gujarati using conditional random fields. In IJCNLP Workshop on NLP for Less Privileged Languages."},{"key":"e_1_3_2_1_21_1","volume-title":"Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. In ACL Demos. https:\/\/nlp.stanford.edu\/pubs\/qi2020stanza.pdf","author":"Qi P","year":"2020","unstructured":"P Qi , Y Zhang , Y Zhang , J Bolton , and C\u00a0D Manning . 2020 . Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. In ACL Demos. https:\/\/nlp.stanford.edu\/pubs\/qi2020stanza.pdf P Qi, Y Zhang, Y Zhang, J Bolton, and C\u00a0D Manning. 2020. Stanza: A Python Natural Language Processing Toolkit for Many Human Languages. In ACL Demos. https:\/\/nlp.stanford.edu\/pubs\/qi2020stanza.pdf"},{"key":"e_1_3_2_1_22_1","volume-title":"Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages. arXiv:2104.05596","author":"Ramesh G","year":"2021","unstructured":"G Ramesh , S Doddapaneni , A Bheemaraj , M Jobanputra , Raghavan AK, A Sharma , 2021 . Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages. arXiv:2104.05596 (2021). G Ramesh, S Doddapaneni, A Bheemaraj, M Jobanputra, Raghavan AK, A Sharma, 2021. Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages. arXiv:2104.05596 (2021)."},{"key":"e_1_3_2_1_23_1","first-page":"57","article-title":"Building applied natural language generation systems","volume":"3","author":"Reiter E","year":"1997","unstructured":"E Reiter and R Dale . 1997 . Building applied natural language generation systems . NL Engineering 3 , 1 (1997), 57 \u2013 87 . E Reiter and R Dale. 1997. Building applied natural language generation systems. NL Engineering 3, 1 (1997), 57\u201387.","journal-title":"NL Engineering"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"crossref","unstructured":"L Xue N Constant A Roberts M Kale R Al-Rfou A Siddhant A Barua and C Raffel. 2021. mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer. In NAACL-HLT. 483\u2013498.  L Xue N Constant A Roberts M Kale R Al-Rfou A Siddhant A Barua and C Raffel. 2021. mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer. In NAACL-HLT. 483\u2013498.","DOI":"10.18653\/v1\/2021.naacl-main.41"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","unstructured":"C Zhao M Walker and S Chaturvedi. 2020. Bridging the structural gap between encoding and decoding for data-to-text generation. In ACL. 2481\u20132491.  C Zhao M Walker and S Chaturvedi. 2020. Bridging the structural gap between encoding and decoding for data-to-text generation. In ACL. 2481\u20132491.","DOI":"10.18653\/v1\/2020.acl-main.224"}],"event":{"name":"WWW '22: The ACM Web Conference 2022","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"],"location":"Virtual Event, Lyon France","acronym":"WWW '22"},"container-title":["Companion Proceedings of the Web Conference 2022"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3487553.3524265","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3487553.3524265","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:34Z","timestamp":1750188634000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3487553.3524265"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,25]]},"references-count":25,"alternative-id":["10.1145\/3487553.3524265","10.1145\/3487553"],"URL":"https:\/\/doi.org\/10.1145\/3487553.3524265","relation":{},"subject":[],"published":{"date-parts":[[2022,4,25]]},"assertion":[{"value":"2022-08-16","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}