{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,13]],"date-time":"2025-05-13T22:00:28Z","timestamp":1747173628461,"version":"3.40.5"},"reference-count":56,"publisher":"Cambridge University Press (CUP)","issue":"6","license":[{"start":{"date-parts":[[2023,8,30]],"date-time":"2023-08-30T00:00:00Z","timestamp":1693353600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":["cambridge.org"],"crossmark-restriction":true},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2023,11]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Identifying and annotating toxic online content on social media platforms is an extremely challenging problem. Work that studies toxicity in online content has predominantly focused on comments as independent entities. However, comments on social media are inherently conversational, and therefore, understanding and judging the comments fundamentally requires access to the context in which they are made. We introduce a study and resulting annotated dataset where we devise a number of controlled experiments on the importance of context and other observable confounders \u2013 namely gender, age and political orientation \u2013 towards the perception of toxicity in online content. Our analysis clearly shows the significance of context and the effect of observable confounders on annotations. Namely, we observe that the ratio of toxic to non-toxic judgements can be very different for each control group, and a higher proportion of samples are judged toxic in the presence of contextual information.<\/jats:p>","DOI":"10.1017\/s1351324923000414","type":"journal-article","created":{"date-parts":[[2023,8,30]],"date-time":"2023-08-30T09:56:43Z","timestamp":1693389403000},"page":"1538-1560","update-policy":"https:\/\/doi.org\/10.1017\/policypage","source":"Crossref","is-referenced-by-count":1,"title":["A study towards contextual understanding of toxicity in online conversations"],"prefix":"10.1017","volume":"29","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4438-8161","authenticated-orcid":false,"given":"Pranava","family":"Madhyastha","sequence":"first","affiliation":[]},{"given":"Antigoni","family":"Founta","sequence":"additional","affiliation":[]},{"given":"Lucia","family":"Specia","sequence":"additional","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2023,8,30]]},"reference":[{"key":"S1351324923000414_ref13","doi-asserted-by":"publisher","DOI":"10.1111\/j.0092-5853.2004.00054.x"},{"key":"S1351324923000414_ref25","doi-asserted-by":"publisher","DOI":"10.1093\/poq\/nfm009"},{"key":"S1351324923000414_ref56","doi-asserted-by":"publisher","DOI":"10.1145\/1292491.1292514"},{"key":"S1351324923000414_ref55","first-page":"3","article-title":"10 points versus 11 points? effects of left-right scale design in a cross-national perspective","volume":"25","author":"Zuell","year":"2016","journal-title":"ASK. Research and Methods"},{"key":"S1351324923000414_ref26","unstructured":"Kumar, R. , Ojha, A.K. , Malmasi, S. and Zampieri, M. (2018). Benchmarking aggression identification in social media. In Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), pp. 1\u201311."},{"key":"S1351324923000414_ref37","doi-asserted-by":"crossref","unstructured":"Qian, J. , Bethke, A. , Liu, Y. , Belding, E. and Wang, W.Y. (2019). A benchmark dataset for learning to intervene in online hate speech. arXiv preprint arXiv:1909.","DOI":"10.18653\/v1\/D19-1482"},{"key":"S1351324923000414_ref30","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-27947-9_9"},{"key":"S1351324923000414_ref44","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-3509"},{"key":"S1351324923000414_ref16","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.2882824"},{"key":"S1351324923000414_ref4","doi-asserted-by":"publisher","DOI":"10.1609\/icwsm.v14i1.7347"},{"key":"S1351324923000414_ref3","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S19-2007"},{"key":"S1351324923000414_ref19","doi-asserted-by":"publisher","DOI":"10.26615\/978-954-452-049-6_036"},{"key":"S1351324923000414_ref23","doi-asserted-by":"crossref","unstructured":"Jurgens, D. , Chandrasekharan, E. and Hemphill, L. (2019). A just and comprehensive strategy for using nlp to address online abuse. arXiv preprint arXiv:1906.","DOI":"10.18653\/v1\/P19-1357"},{"key":"S1351324923000414_ref5","doi-asserted-by":"publisher","DOI":"10.1140\/epjds\/s13688-016-0072-6"},{"key":"S1351324923000414_ref2","unstructured":"Barker, K. and Jurasz, O. (2019). Online harms white paper consultation response."},{"key":"S1351324923000414_ref9","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1271"},{"key":"S1351324923000414_ref1","article-title":"Censorship and content deletion in chinese social media","volume":"17","author":"Bamman","year":"2012","journal-title":"First Monday"},{"key":"S1351324923000414_ref20","doi-asserted-by":"publisher","DOI":"10.1109\/WACV45572.2020.9093414"},{"key":"S1351324923000414_ref24","unstructured":"Kiela, D. , Firooz, H. , Mohan, A. , Goswami, V. , Singh, A. , Ringshia, P. and Testuggine, D. (2020). The hateful memes challenge: detecting hate speech in multimodal memes."},{"volume-title":"Eight Ways to Run the Country: A New and Revealing Look at Left and Right","year":"2007","author":"Mitchell","key":"S1351324923000414_ref33"},{"key":"S1351324923000414_ref11","doi-asserted-by":"crossref","unstructured":"Davidson, T. , Bhattacharya, D. and Weber, I. (2019). Racial bias in hate speech and abusive language detection datasets. arXiv preprint arXiv: 1905.","DOI":"10.18653\/v1\/W19-3504"},{"key":"S1351324923000414_ref7","doi-asserted-by":"publisher","DOI":"10.1145\/3274301"},{"key":"S1351324923000414_ref39","doi-asserted-by":"publisher","DOI":"10.4324\/9781351051989-9"},{"key":"S1351324923000414_ref10","doi-asserted-by":"publisher","DOI":"10.1145\/2488388.2488416"},{"key":"S1351324923000414_ref42","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.eacl-main.152"},{"key":"S1351324923000414_ref48","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-5618"},{"key":"S1351324923000414_ref52","unstructured":"Yin, D. , Xue, Z. , Hong, L. , Davison, B.D. , Kontostathis, A. and Edwards, L. (2009). Detection of harassment on web 2.0. In Proceedings of the Content Analysis in the WEB, vol. 2, pp. 1\u20137."},{"key":"S1351324923000414_ref12","unstructured":"Davidson, T. , Warmsley, D. , Macy, M. and Weber, I. (2017). Automated hate speech detection and the problem of offensive language. arXiv preprint arXiv: 1703.04009."},{"key":"S1351324923000414_ref21","unstructured":"Ipsos, M. (2016). Ofcom: attitudes to potentially offensive language and gestures on TV and radio."},{"key":"S1351324923000414_ref22","doi-asserted-by":"publisher","DOI":"10.1145\/3351095.3375671"},{"key":"S1351324923000414_ref50","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-4380-9_16"},{"key":"S1351324923000414_ref14","doi-asserted-by":"publisher","DOI":"10.1145\/2740908.2742760"},{"key":"S1351324923000414_ref27","doi-asserted-by":"publisher","DOI":"10.1145\/3041021.3051106"},{"key":"S1351324923000414_ref28","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-9221.2008.00681.x"},{"key":"S1351324923000414_ref29","doi-asserted-by":"crossref","unstructured":"Liu, P. , Guberman, J. , Hemphill, L. and Culotta, A. (2018). Forecasting the presence and intensity of hostility on instagram using linguistic and social features. arXiv preprint arXiv:1804.06759.","DOI":"10.1609\/icwsm.v12i1.15022"},{"key":"S1351324923000414_ref46","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.2019.1686987"},{"key":"S1351324923000414_ref43","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0243300"},{"key":"S1351324923000414_ref6","doi-asserted-by":"publisher","DOI":"10.1177\/0886260517741215"},{"key":"S1351324923000414_ref47","unstructured":"Warner, W. and Hirschberg, J. (2012). Detecting hate speech on the world wide web. In Proceedings of the Second Workshop on Language in Social Media, pp. 19\u201326."},{"key":"S1351324923000414_ref8","doi-asserted-by":"publisher","DOI":"10.1145\/2998181.2998213"},{"key":"S1351324923000414_ref15","volume-title":"The Psychology of Politics","volume":"2","author":"Eysenck","year":"1968"},{"key":"S1351324923000414_ref18","doi-asserted-by":"crossref","unstructured":"Founta, A.-M. , Djouvas, C. , Chatzakou, D. , Leontiadis, I. , Blackburn, J. , Stringhini, G. , Vakali, A. , Sirivianos, M. and Kourtellis, N. (2018). Large scale crowdsourcing and characterization of twitter abusive behavior. arXiv preprint arXiv:1802.00393.","DOI":"10.1609\/icwsm.v12i1.14991"},{"key":"S1351324923000414_ref35","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.396"},{"key":"S1351324923000414_ref17","doi-asserted-by":"publisher","DOI":"10.1037\/h0031619"},{"key":"S1351324923000414_ref41","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1163"},{"key":"S1351324923000414_ref49","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-2013"},{"key":"S1351324923000414_ref51","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052591"},{"key":"S1351324923000414_ref53","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S19-2010"},{"key":"S1351324923000414_ref32","doi-asserted-by":"publisher","DOI":"10.1080\/0952813X.2017.1409284"},{"key":"S1351324923000414_ref36","article-title":"Resources and benchmark corpora for hate speech detection: a systematic review","volume":"55","author":"Poletto","year":"2020","journal-title":"Language Resources and Evaluation"},{"key":"S1351324923000414_ref40","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1163"},{"key":"S1351324923000414_ref34","doi-asserted-by":"publisher","DOI":"10.1145\/2872427.2883062"},{"key":"S1351324923000414_ref31","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.alw-1.18"},{"key":"S1351324923000414_ref45","unstructured":"Voigt, R. , Jurgens, D. , Prabhakaran, V. , Jurafsky, D. and Tsvetkov, Y. (2018). Rtgender: a corpus for studying differential responses to gender. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)."},{"key":"S1351324923000414_ref38","unstructured":"Ranganath, R. and Perotte, A. (2018). Multiple causal inference with latent confounding. arXiv preprint arXiv:1805.08273."},{"key":"S1351324923000414_ref54","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.semeval-1.188"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324923000414","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,6]],"date-time":"2023-12-06T10:48:32Z","timestamp":1701859712000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324923000414\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,30]]},"references-count":56,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2023,11]]}},"alternative-id":["S1351324923000414"],"URL":"https:\/\/doi.org\/10.1017\/s1351324923000414","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"type":"print","value":"1351-3249"},{"type":"electronic","value":"1469-8110"}],"subject":[],"published":{"date-parts":[[2023,8,30]]},"assertion":[{"value":"\u00a9 The Author(s), 2023. Published by Cambridge University Press","name":"copyright","label":"Copyright","group":{"name":"copyright_and_licensing","label":"Copyright and Licensing"}},{"value":"This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and\/or adaptation of the article.","name":"license","label":"License","group":{"name":"copyright_and_licensing","label":"Copyright and Licensing"}},{"value":"This content has been made available to all.","name":"free","label":"Free to read"}]}}