{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T03:38:23Z","timestamp":1777001903369,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":39,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,6,28]],"date-time":"2022-06-28T00:00:00Z","timestamp":1656374400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,6,28]]},"DOI":"10.1145\/3511095.3531277","type":"proceedings-article","created":{"date-parts":[[2022,6,20]],"date-time":"2022-06-20T17:32:06Z","timestamp":1655746326000},"page":"32-42","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":17,"title":["Data Bootstrapping Approaches to Improve Low Resource Abusive Language Detection for Indic Languages"],"prefix":"10.1145","author":[{"given":"Mithun","family":"Das","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, India, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Somnath","family":"Banerjee","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Animesh","family":"Mukherjee","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,6,28]]},"reference":[{"key":"e_1_3_2_2_1_1","unstructured":"2017. Moderators who had to view Child abuse content sue microsoft claiming PTSD. https:\/\/www.theguardian.com\/technology\/2017\/jan\/11\/microsoft-employees-child-abuse-lawsuit-ptsd  2017. Moderators who had to view Child abuse content sue microsoft claiming PTSD. https:\/\/www.theguardian.com\/technology\/2017\/jan\/11\/microsoft-employees-child-abuse-lawsuit-ptsd"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2994950"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3503162.3505241"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3041021.3054223"},{"key":"e_1_3_2_2_5_1","unstructured":"Somnath Banerjee Maulindu Sarkar Nancy Agrawal Punyajoy Saha and Mithun Das. 2021. Exploring Transformer Based Models to Identify Hate Speech and Offensive Content in English and Indo-Aryan Languages. arXiv preprint arXiv:2111.13974(2021).  Somnath Banerjee Maulindu Sarkar Nancy Agrawal Punyajoy Saha and Mithun Das. 2021. Exploring Transformer Based Models to Identify Hate Speech and Offensive Content in English and Indo-Aryan Languages. arXiv preprint arXiv:2111.13974(2021)."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W18-1105"},{"key":"e_1_3_2_2_7_1","volume-title":"Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages. 133\u2013145","author":"Chakravarthi Bharathi\u00a0Raja","year":"2021","unstructured":"Bharathi\u00a0Raja Chakravarthi , Ruba Priyadharshini , Navya Jose , Thomas Mandl , Prasanna\u00a0Kumar Kumaresan , Rahul Ponnusamy , RL Hariharan , John\u00a0Philip McCrae , Elizabeth Sherly , 2021 . Findings of the shared task on offensive language identification in Tamil, Malayalam, and Kannada . In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages. 133\u2013145 . Bharathi\u00a0Raja Chakravarthi, Ruba Priyadharshini, Navya Jose, Thomas Mandl, Prasanna\u00a0Kumar Kumaresan, Rahul Ponnusamy, RL Hariharan, John\u00a0Philip McCrae, Elizabeth Sherly, 2021. Findings of the shared task on offensive language identification in Tamil, Malayalam, and Kannada. In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages. 133\u2013145."},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/SocialCom-PASSAT.2012.55"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-3908"},{"key":"e_1_3_2_2_10_1","unstructured":"Mithun Das Somnath Banerjee and Punyajoy Saha. 2021. Abusive and Threatening Language Detection in Urdu using Boosting based and BERT based models: A Comparative Approach. arXiv preprint arXiv:2111.14830(2021).  Mithun Das Somnath Banerjee and Punyajoy Saha. 2021. Abusive and Threatening Language Detection in Urdu using Boosting based and BERT based models: A Comparative Approach. arXiv preprint arXiv:2111.14830(2021)."},{"key":"e_1_3_2_2_11_1","volume-title":"Hate speech in online social media. ACM SIGWEB NewsletterAutumn","author":"Das Mithun","year":"2020","unstructured":"Mithun Das , Binny Mathew , Punyajoy Saha , Pawan Goyal , and Animesh Mukherjee . 2020. Hate speech in online social media. ACM SIGWEB NewsletterAutumn ( 2020 ), 1\u20138. Mithun Das, Binny Mathew, Punyajoy Saha, Pawan Goyal, and Animesh Mukherjee. 2020. Hate speech in online social media. ACM SIGWEB NewsletterAutumn (2020), 1\u20138."},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3465336.3475106"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v11i1.14955"},{"key":"e_1_3_2_2_14_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL.","author":"Devlin J.","year":"2019","unstructured":"J. Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL. J. Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v12i1.14991"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.26615\/978-954-452-072-4_050"},{"key":"e_1_3_2_2_17_1","volume-title":"Hidden resilience and adaptive dynamics of the global online hate ecology. Nature 573, 7773","author":"Johnson F","year":"2019","unstructured":"Nicola\u00a0 F Johnson , R Leahy , N\u00a0Johnson Restrepo , Nicolas Velasquez , Ming Zheng , P Manrique , P Devkota , and Stefan Wuchty . 2019. Hidden resilience and adaptive dynamics of the global online hate ecology. Nature 573, 7773 ( 2019 ), 261\u2013265. Nicola\u00a0F Johnson, R Leahy, N\u00a0Johnson Restrepo, Nicolas Velasquez, Ming Zheng, P Manrique, P Devkota, and Stefan Wuchty. 2019. Hidden resilience and adaptive dynamics of the global online hate ecology. Nature 573, 7773 (2019), 261\u2013265."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3414524"},{"key":"e_1_3_2_2_19_1","volume-title":"Muril: Multilingual representations for indian languages. arXiv preprint arXiv:2103.10730(2021).","author":"Khanuja Simran","year":"2021","unstructured":"Simran Khanuja , Diksha Bansal , Sarvesh Mehtani , Savya Khosla , Atreyee Dey , Balaji Gopalan , Dilip\u00a0Kumar Margam , Pooja Aggarwal , Rajiv\u00a0Teja Nagipogu , Shachi Dave , 2021 . Muril: Multilingual representations for indian languages. arXiv preprint arXiv:2103.10730(2021). Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip\u00a0Kumar Margam, Pooja Aggarwal, Rajiv\u00a0Teja Nagipogu, Shachi Dave, 2021. Muril: Multilingual representations for indian languages. arXiv preprint arXiv:2103.10730(2021)."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-2095"},{"key":"e_1_3_2_2_21_1","volume-title":"Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018)","author":"Kumar Ritesh","year":"2018","unstructured":"Ritesh Kumar , Atul\u00a0Kr Ojha , Shervin Malmasi , and Marcos Zampieri . 2018 . Benchmarking aggression identification in social media . In Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018) . 1\u201311. Ritesh Kumar, Atul\u00a0Kr Ojha, Shervin Malmasi, and Marcos Zampieri. 2018. Benchmarking aggression identification in social media. In Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018). 1\u201311."},{"key":"e_1_3_2_2_22_1","volume-title":"Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018)","author":"Kumar Ritesh","year":"2018","unstructured":"Ritesh Kumar , Atul\u00a0Kr Ojha , Marcos Zampieri , and Shervin Malmasi . 2018 . Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018) . In Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC- 2018). Ritesh Kumar, Atul\u00a0Kr Ojha, Marcos Zampieri, and Shervin Malmasi. 2018. Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018). In Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018)."},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"crossref","unstructured":"Thomas Mandl Sandip Modha Anand Kumar\u00a0M and Bharathi\u00a0Raja Chakravarthi. 2020. Overview of the hasoc track at fire 2020: Hate speech and offensive language identification in tamil malayalam hindi english and german. In Forum for Information Retrieval Evaluation. 29\u201332.  Thomas Mandl Sandip Modha Anand Kumar\u00a0M and Bharathi\u00a0Raja Chakravarthi. 2020. Overview of the hasoc track at fire 2020: Hate speech and offensive language identification in tamil malayalam hindi english and german. In Forum for Information Retrieval Evaluation. 29\u201332.","DOI":"10.1145\/3441501.3441517"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3368567.3368584"},{"key":"e_1_3_2_2_25_1","unstructured":"Thomas Mandl Sandip Modha Gautam\u00a0Kishore Shahi Hiren Madhu Shrey Satapara Prasenjit Majumder Johannes Schaefer Tharindu Ranasinghe Marcos Zampieri Durgesh Nandini 2021. Overview of the HASOC subtrack at FIRE 2021: Hate speech and offensive content identification in English and Indo-Aryan languages. arXiv preprint arXiv:2112.09301(2021).  Thomas Mandl Sandip Modha Gautam\u00a0Kishore Shahi Hiren Madhu Shrey Satapara Prasenjit Majumder Johannes Schaefer Tharindu Ranasinghe Marcos Zampieri Durgesh Nandini 2021. Overview of the HASOC subtrack at FIRE 2021: Hate speech and offensive content identification in English and Indo-Aryan languages. arXiv preprint arXiv:2112.09301(2021)."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i17.17745"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.conll-1.45"},{"key":"e_1_3_2_2_28_1","unstructured":"Casey Newton. 2019. The terror queue. https:\/\/www.theverge.com\/2019\/12\/16\/21021005\/google-youtube-moderators-ptsd-accenture-violent-disturbing-content-interviews-video  Casey Newton. 2019. The terror queue. https:\/\/www.theverge.com\/2019\/12\/16\/21021005\/google-youtube-moderators-ptsd-accenture-violent-disturbing-content-interviews-video"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v12i1.15040"},{"key":"e_1_3_2_2_30_1","unstructured":"Georgios\u00a0K. Pitsilis H. Ramampiaro and H. Langseth. 2018. Detecting Offensive Language in Tweets Using Deep Learning. ArXiv abs\/1801.04433(2018).  Georgios\u00a0K. Pitsilis H. Ramampiaro and H. Langseth. 2018. Detecting Offensive Language in Tweets Using Deep Learning. ArXiv abs\/1801.04433(2018)."},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"crossref","unstructured":"Tharindu Ranasinghe and Marcos Zampieri. 2020. Multilingual offensive language identification with cross-lingual embeddings. arXiv preprint arXiv:2010.05324(2020).  Tharindu Ranasinghe and Marcos Zampieri. 2020. Multilingual offensive language identification with cross-lingual embeddings. arXiv preprint arXiv:2010.05324(2020).","DOI":"10.18653\/v1\/2020.emnlp-main.470"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.3390\/info12080306"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.197"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-16-0586-4_37"},{"key":"e_1_3_2_2_35_1","volume-title":"Exposure to hate speech increases prejudice through desensitization. Aggressive behavior 44, 2","author":"Soral Wiktor","year":"2018","unstructured":"Wiktor Soral , Micha\u0142 Bilewicz , and Miko\u0142aj Winiewski . 2018. Exposure to hate speech increases prejudice through desensitization. Aggressive behavior 44, 2 ( 2018 ), 136\u2013146. Wiktor Soral, Micha\u0142 Bilewicz, and Miko\u0142aj Winiewski. 2018. Exposure to hate speech increases prejudice through desensitization. Aggressive behavior 44, 2 (2018), 136\u2013146."},{"key":"e_1_3_2_2_36_1","volume-title":"YouTube is facing a full-scale advertising boycott over hate speech. The Verge","author":"Statt N","year":"2017","unstructured":"N Statt . 2017. YouTube is facing a full-scale advertising boycott over hate speech. The Verge ( 2017 ). N Statt. 2017. YouTube is facing a full-scale advertising boycott over hate speech. The Verge (2017)."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1080\/09687599.2018.1515723"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-2013"},{"key":"e_1_3_2_2_39_1","volume-title":"European semantic web conference","author":"Zhang Ziqi","unstructured":"Ziqi Zhang , David Robinson , and Jonathan Tepper . 2018. Detecting hate speech on twitter using a convolution-gru based deep neural network . In European semantic web conference . Springer , 745\u2013760. Ziqi Zhang, David Robinson, and Jonathan Tepper. 2018. Detecting hate speech on twitter using a convolution-gru based deep neural network. In European semantic web conference. Springer, 745\u2013760."}],"event":{"name":"HT '22: 33rd ACM Conference on Hypertext and Social Media","location":"Barcelona Spain","acronym":"HT '22","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGCHI ACM Special Interest Group on Computer-Human Interaction"]},"container-title":["Proceedings of the 33rd ACM Conference on Hypertext and Social Media"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511095.3531277","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3511095.3531277","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:11:58Z","timestamp":1750191118000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3511095.3531277"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,28]]},"references-count":39,"alternative-id":["10.1145\/3511095.3531277","10.1145\/3511095"],"URL":"https:\/\/doi.org\/10.1145\/3511095.3531277","relation":{},"subject":[],"published":{"date-parts":[[2022,6,28]]},"assertion":[{"value":"2022-06-28","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}