{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T05:39:59Z","timestamp":1775885999606,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":42,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,4,19]],"date-time":"2021-04-19T00:00:00Z","timestamp":1618790400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,19]]},"DOI":"10.1145\/3442442.3452313","type":"proceedings-article","created":{"date-parts":[[2021,6,3]],"date-time":"2021-06-03T15:59:34Z","timestamp":1622735974000},"page":"500-507","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":39,"title":["A Comparative Study of Using Pre-trained Language Models for Toxic Comment Classification"],"prefix":"10.1145","author":[{"given":"Zhixue","family":"Zhao","sequence":"first","affiliation":[{"name":"University of Sheffield"}]},{"given":"Ziqi","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Sheffield"}]},{"given":"Frank","family":"Hopfgartner","sequence":"additional","affiliation":[{"name":"University of Sheffield"}]}],"member":"320","published-online":{"date-parts":[[2021,6,3]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3041021.3054223"},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 76\u201382","author":"Baruah Arup","year":"2020","unstructured":"Arup Baruah , Kaushik Das , Ferdous Barbhuiya , and Kuntal Dey . 2020 . Aggression identification in english, hindi and bangla text using bert, roberta and svm . In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 76\u201382 . Arup Baruah, Kaushik Das, Ferdous Barbhuiya, and Kuntal Dey. 2020. Aggression identification in english, hindi and bangla text using bert, roberta and svm. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 76\u201382."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Iz Beltagy Kyle Lo and Arman Cohan. 2019. SciBERT: A pretrained language model for scientific text. arXiv preprint arXiv:1903.10676(2019). Iz Beltagy Kyle Lo and Arman Cohan. 2019. SciBERT: A pretrained language model for scientific text. arXiv preprint arXiv:1903.10676(2019).","DOI":"10.18653\/v1\/D19-1371"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.techfore.2013.04.013"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICACCI.2015.7275970"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00104"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Alexandra Chronopoulou Christos Baziotis and Alexandros Potamianos. 2019. An embarrassingly simple approach for transfer learning from pretrained language models. arXiv preprint arXiv:1902.10547(2019). Alexandra Chronopoulou Christos Baziotis and Alexandros Potamianos. 2019. An embarrassingly simple approach for transfer learning from pretrained language models. arXiv preprint arXiv:1902.10547(2019).","DOI":"10.18653\/v1\/N19-1213"},{"key":"e_1_3_2_1_8_1","unstructured":"ConversationAI. 2017. Toxic Comment Classification Challenge: Identify and classify toxic online comments. https:\/\/www.kaggle.com\/c\/jigsaw-toxic-comment-classification-challenge ConversationAI. 2017. Toxic Comment Classification Challenge: Identify and classify toxic online comments. https:\/\/www.kaggle.com\/c\/jigsaw-toxic-comment-classification-challenge"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Thomas Davidson Dana Warmsley Michael Macy and Ingmar Weber. 2017. Automated hate speech detection and the problem of offensive language. In Eleventh international aaai conference on web and social media. Thomas Davidson Dana Warmsley Michael Macy and Ingmar Weber. 2017. Automated hate speech detection and the problem of offensive language. In Eleventh international aaai conference on web and social media.","DOI":"10.1609\/icwsm.v11i1.14955"},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of the First Italian Conference on Cybersecurity (ITASEC17)","author":"Fabio","year":"2017","unstructured":"Fabio Del\u00a0Vigna12, Andrea Cimino23, Felice Dell\u2019Orletta , Marinella Petrocchi , and Maurizio Tesconi . 2017 . Hate me, hate me not: Hate speech detection on facebook . In Proceedings of the First Italian Conference on Cybersecurity (ITASEC17) . 86\u201395. Fabio Del\u00a0Vigna12, Andrea Cimino23, Felice Dell\u2019Orletta, Marinella Petrocchi, and Maurizio Tesconi. 2017. Hate me, hate me not: Hate speech detection on facebook. In Proceedings of the First Italian Conference on Cybersecurity (ITASEC17). 86\u201395."},{"key":"e_1_3_2_1_11_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).","author":"Devlin Jacob","year":"2018","unstructured":"Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Yuchun Fang Zhengyan Ma Zhaoxiang Zhang Xu-Yao Zhang Xiang Bai 2017. Dynamic Multi-Task Learning with Convolutional Neural Network.. In IJCAI. 1668\u20131674. Yuchun Fang Zhengyan Ma Zhaoxiang Zhang Xu-Yao Zhang Xiang Bai 2017. Dynamic Multi-Task Learning with Convolutional Neural Network.. In IJCAI. 1668\u20131674.","DOI":"10.24963\/ijcai.2017\/231"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3232676"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v12i1.14991"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2946594"},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 93\u201398","author":"Gordeev Denis","year":"2020","unstructured":"Denis Gordeev and Olga Lykova . 2020 . BERT of all trades, master of some . In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 93\u201398 . Denis Gordeev and Olga Lykova. 2020. BERT of all trades, master of some. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 93\u201398."},{"key":"e_1_3_2_1_17_1","volume-title":"Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural networks 18, 5-6","author":"Graves Alex","year":"2005","unstructured":"Alex Graves and J\u00fcrgen Schmidhuber . 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural networks 18, 5-6 ( 2005 ), 602\u2013610. Alex Graves and J\u00fcrgen Schmidhuber. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural networks 18, 5-6 (2005), 602\u2013610."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Suchin Gururangan Ana Marasovi\u0107 Swabha Swayamdipta Kyle Lo Iz Beltagy Doug Downey and Noah\u00a0A Smith. 2020. Don\u2019t Stop Pretraining: Adapt Language Models to Domains and Tasks. arXiv preprint arXiv:2004.10964(2020). Suchin Gururangan Ana Marasovi\u0107 Swabha Swayamdipta Kyle Lo Iz Beltagy Doug Downey and Noah\u00a0A Smith. 2020. Don\u2019t Stop Pretraining: Adapt Language Models to Domains and Tasks. arXiv preprint arXiv:2004.10964(2020).","DOI":"10.18653\/v1\/2020.acl-main.740"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Yoon Kim. 2014. Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882(2014). Yoon Kim. 2014. Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882(2014).","DOI":"10.3115\/v1\/D14-1181"},{"key":"e_1_3_2_1_20_1","volume-title":"Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying.","author":"Kumar Ritesh","year":"2020","unstructured":"Ritesh Kumar , Atul\u00a0Kr Ojha , Bornini Lahiri , Marcos Zampieri , Shervin Malmasi , Vanessa Murdock , and Daniel Kadar . 2020 . Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. Ritesh Kumar, Atul\u00a0Kr Ojha, Bornini Lahiri, Marcos Zampieri, Shervin Malmasi, Vanessa Murdock, and Daniel Kadar. 2020. Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying."},{"key":"e_1_3_2_1_21_1","volume-title":"Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC","author":"Kumar Ritesh","year":"2018","unstructured":"Ritesh Kumar , Aishwarya\u00a0 N. Reganti , Akshit Bhatia , and Tushar Maheshwari . 2018 . Aggression-annotated Corpus of Hindi-English Code-mixed Data . In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Nicoletta Calzolari\u00a0(Conference chair), Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, H\u00e9l\u00e8ne Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga (Eds.). European Language Resources Association (ELRA), Miyazaki, Japan. Ritesh Kumar, Aishwarya\u00a0N. Reganti, Akshit Bhatia, and Tushar Maheshwari. 2018. Aggression-annotated Corpus of Hindi-English Code-mixed Data. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Nicoletta Calzolari\u00a0(Conference chair), Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, H\u00e9l\u00e8ne Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga (Eds.). European Language Resources Association (ELRA), Miyazaki, Japan."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v27i1.8539"},{"key":"e_1_3_2_1_23_1","unstructured":"Guillaume Lample and Alexis Conneau. 2019. Cross-lingual language model pretraining. arXiv preprint arXiv:1901.07291(2019). Guillaume Lample and Alexis Conneau. 2019. Cross-lingual language model pretraining. arXiv preprint arXiv:1901.07291(2019)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: a pre-trained biomedical language representation model for biomedical text mining","volume":"36","author":"Lee Jinhyuk","year":"2020","unstructured":"Jinhyuk Lee , Wonjin Yoon , Sungdong Kim , Donghyeon Kim , Sunkyu Kim , Chan\u00a0Ho So , and Jaewoo Kang . 2020 . BioBERT: a pre-trained biomedical language representation model for biomedical text mining . Bioinformatics 36 , 4 (2020), 1234 \u2013 1240 . Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan\u00a0Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234\u20131240.","journal-title":"Bioinformatics"},{"key":"e_1_3_2_1_25_1","unstructured":"Han Liu Peter Burnap Wafa Alorainy and Matthew Williams. 2020. Scmhl5 at trac-2 shared task on aggression identification: Bert based ensemble learning approach. (2020). Han Liu Peter Burnap Wafa Alorainy and Matthew Williams. 2020. Scmhl5 at trac-2 shared task on aggression identification: Bert based ensemble learning approach. (2020)."},{"key":"e_1_3_2_1_26_1","volume-title":"Fuzzy Multi-task Learning for Hate Speech Type Identification. In The World Wide Web Conference. ACM, 3006\u20133012","author":"Liu Han","year":"2019","unstructured":"Han Liu , Pete Burnap , Wafa Alorainy , and Matthew\u00a0 L Williams . 2019 . Fuzzy Multi-task Learning for Hate Speech Type Identification. In The World Wide Web Conference. ACM, 3006\u20133012 . Han Liu, Pete Burnap, Wafa Alorainy, and Matthew\u00a0L Williams. 2019. Fuzzy Multi-task Learning for Hate Speech Type Identification. In The World Wide Web Conference. ACM, 3006\u20133012."},{"key":"e_1_3_2_1_27_1","volume-title":"Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692(2019).","author":"Liu Yinhan","year":"2019","unstructured":"Yinhan Liu , Myle Ott , Naman Goyal , Jingfei Du , Mandar Joshi , Danqi Chen , Omer Levy , Mike Lewis , Luke Zettlemoyer , and Veselin Stoyanov . 2019 . Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692(2019). Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692(2019)."},{"key":"e_1_3_2_1_28_1","volume-title":"International Conference on Complex Networks and Their Applications. Springer, 928\u2013940","author":"Mozafari Marzieh","year":"2019","unstructured":"Marzieh Mozafari , Reza Farahbakhsh , and No\u00ebl Crespi . 2019 . A BERT-based transfer learning approach for hate speech detection in online social media . In International Conference on Complex Networks and Their Applications. Springer, 928\u2013940 . Marzieh Mozafari, Reza Farahbakhsh, and No\u00ebl Crespi. 2019. A BERT-based transfer learning approach for hate speech detection in online social media. In International Conference on Complex Networks and Their Applications. Springer, 928\u2013940."},{"key":"e_1_3_2_1_29_1","volume-title":"Vol.\u00a01","author":"Munikar Manish","unstructured":"Manish Munikar , Sushil Shakya , and Aakash Shrestha . 2019. Fine-grained sentiment classification using bert. In 2019 Artificial Intelligence for Transforming Business and Society (AITB) , Vol.\u00a01 . IEEE , 1\u20135. Manish Munikar, Sushil Shakya, and Aakash Shrestha. 2019. Fine-grained sentiment classification using bert. In 2019 Artificial Intelligence for Transforming Business and Society (AITB), Vol.\u00a01. IEEE, 1\u20135."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-08608-8_14"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2872427.2883062"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Ji\u00a0Ho Park and Pascale Fung. 2017. One-step and two-step classification for abusive language detection on twitter. arXiv preprint arXiv:1706.01206(2017). Ji\u00a0Ho Park and Pascale Fung. 2017. One-step and two-step classification for abusive language detection on twitter. arXiv preprint arXiv:1706.01206(2017).","DOI":"10.18653\/v1\/W17-3006"},{"key":"e_1_3_2_1_34_1","volume-title":"Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 126\u2013131","author":"Samghabadi Niloofar\u00a0Safi","year":"2020","unstructured":"Niloofar\u00a0Safi Samghabadi , Parth Patwa , PYKL Srinivas , Prerana Mukherjee , Amitava Das , and Thamar Solorio . 2020 . Aggression and misogyny detection using BERT: A multi-task approach . In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 126\u2013131 . Niloofar\u00a0Safi Samghabadi, Parth Patwa, PYKL Srinivas, Prerana Mukherjee, Amitava Das, and Thamar Solorio. 2020. Aggression and misogyny detection using BERT: A multi-task approach. In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying. 126\u2013131."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-1101"},{"key":"e_1_3_2_1_36_1","unstructured":"Matthew Tang Priyanka Gandhi Md\u00a0Ahsanul Kabir Christopher Zou Jordyn Blakey and Xiao Luo. 2019. Progress notes classification and keyword extraction using attention-based deep learning models with BERT. arXiv preprint arXiv:1910.05786(2019). Matthew Tang Priyanka Gandhi Md\u00a0Ahsanul Kabir Christopher Zou Jordyn Blakey and Xiao Luo. 2019. Progress notes classification and keyword extraction using attention-based deep learning models with BERT. arXiv preprint arXiv:1910.05786(2019)."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","unstructured":"Betty van Aken Julian Risch Ralf Krestel and Alexander L\u00f6ser. 2018. Challenges for toxic comment classification: An in-depth error analysis. arXiv preprint arXiv:1809.07572(2018). Betty van Aken Julian Risch Ralf Krestel and Alexander L\u00f6ser. 2018. Challenges for toxic comment classification: An in-depth error analysis. arXiv preprint arXiv:1809.07572(2018).","DOI":"10.18653\/v1\/W18-5105"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-5618"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Zeerak Waseem Thomas Davidson Dana Warmsley and Ingmar Weber. 2017. Understanding abuse: A typology of abusive language detection subtasks. arXiv preprint arXiv:1705.09899(2017). Zeerak Waseem Thomas Davidson Dana Warmsley and Ingmar Weber. 2017. Understanding abuse: A typology of abusive language detection subtasks. arXiv preprint arXiv:1705.09899(2017).","DOI":"10.18653\/v1\/W17-3012"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-2013"},{"key":"e_1_3_2_1_41_1","unstructured":"Thomas Wolf Lysandre Debut Victor Sanh Julien Chaumond Clement Delangue Anthony Moi Pierric Cistac Tim Rault R\u2019emi Louf Morgan Funtowicz and Jamie Brew. 2019. HuggingFace\u2019s Transformers: State-of-the-art Natural Language Processing. ArXiv abs\/1910.03771(2019). Thomas Wolf Lysandre Debut Victor Sanh Julien Chaumond Clement Delangue Anthony Moi Pierric Cistac Tim Rault R\u2019emi Louf Morgan Funtowicz and Jamie Brew. 2019. HuggingFace\u2019s Transformers: State-of-the-art Natural Language Processing. ArXiv abs\/1910.03771(2019)."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.3233\/SW-180338"},{"key":"e_1_3_2_1_43_1","volume-title":"European semantic web conference","author":"Zhang Ziqi","unstructured":"Ziqi Zhang , David Robinson , and Jonathan Tepper . 2018. Detecting hate speech on twitter using a convolution-gru based deep neural network . In European semantic web conference . Springer , 745\u2013760. Ziqi Zhang, David Robinson, and Jonathan Tepper. 2018. Detecting hate speech on twitter using a convolution-gru based deep neural network. In European semantic web conference. Springer, 745\u2013760."}],"event":{"name":"WWW '21: The Web Conference 2021","location":"Ljubljana Slovenia","acronym":"WWW '21","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Companion Proceedings of the Web Conference 2021"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442442.3452313","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3442442.3452313","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:02:20Z","timestamp":1750197740000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3442442.3452313"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,19]]},"references-count":42,"alternative-id":["10.1145\/3442442.3452313","10.1145\/3442442"],"URL":"https:\/\/doi.org\/10.1145\/3442442.3452313","relation":{},"subject":[],"published":{"date-parts":[[2021,4,19]]},"assertion":[{"value":"2021-06-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}