{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,14]],"date-time":"2026-03-14T09:41:31Z","timestamp":1773481291381,"version":"3.50.1"},"reference-count":59,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2023,2,18]],"date-time":"2023-02-18T00:00:00Z","timestamp":1676678400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,2,18]],"date-time":"2023-02-18T00:00:00Z","timestamp":1676678400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000781","name":"European Research Council","doi-asserted-by":"publisher","award":["640550"],"award-info":[{"award-number":["640550"]}],"id":[{"id":"10.13039\/501100000781","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["FR 2829\/4-1"],"award-info":[{"award-number":["FR 2829\/4-1"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005714","name":"Technische Universit\u00e4t Darmstadt","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100005714","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2023,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The goal of hate speech detection is to filter negative online content aiming at certain groups of people. Due to the easy accessibility and multilinguality of social media platforms, it is crucial to protect everyone which requires building hate speech detection systems for a wide range of languages. However, the available labeled hate speech datasets are limited, making it difficult to build systems for many languages. In this paper we focus on cross-lingual transfer learning to support hate speech detection in low-resource languages, while highlighting label issues across application scenarios, such as inconsistent label sets of corpora or differing hate speech definitions, which hinder the application of such methods. We leverage cross-lingual word embeddings to train our neural network systems on the source language and apply them to the target language, which lacks labeled examples, and show that good performance can be achieved. We then incorporate unlabeled target language data for further model improvements by bootstrapping labels using an ensemble of different model architectures. Furthermore, we investigate the issue of label imbalance in hate speech datasets, since the high ratio of non-hate examples compared to hate examples often leads to low model performance. We test simple data undersampling and oversampling techniques and show their effectiveness.<\/jats:p>","DOI":"10.1007\/s10579-023-09637-4","type":"journal-article","created":{"date-parts":[[2023,2,19]],"date-time":"2023-02-19T19:41:01Z","timestamp":1676835661000},"page":"1515-1546","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Label modification and bootstrapping for zero-shot cross-lingual hate speech detection"],"prefix":"10.1007","volume":"57","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6955-981X","authenticated-orcid":false,"given":"Irina","family":"Bigoulaeva","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5144-3069","authenticated-orcid":false,"given":"Viktor","family":"Hangya","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2187-7621","authenticated-orcid":false,"given":"Iryna","family":"Gurevych","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4891-682X","authenticated-orcid":false,"given":"Alexander","family":"Fraser","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,2,18]]},"reference":[{"key":"9637_CR1","first-page":"789","volume-title":"Proceedings of the 56th Annual meeting of the association for computational linguistics","author":"M Artetxe","year":"2018","unstructured":"Artetxe, M., Labaka, G., & Agirre, E. (2018). A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings. Proceedings of the 56th Annual meeting of the association for computational linguistics (pp. 789\u2013798). Association for Computational Linguistics."},{"key":"9637_CR2","first-page":"54","volume-title":"13th international workshop on semantic evaluation","author":"V Basile","year":"2019","unstructured":"Basile, V., Bosco, C., Fersini, E., Debora, N., Patti, V., Pardo, F. M. R., et al. (2019). Semeval-2019 task 5: multilingual detection of hate speech against immigrants and women in twitter. 13th international workshop on semantic evaluation (pp. 54\u201363). Association for Computational Linguistics."},{"key":"9637_CR3","doi-asserted-by":"crossref","first-page":"906","DOI":"10.7717\/peerj-cs.906","volume":"8","author":"JA Ben\u00edtez-Andrades","year":"2022","unstructured":"Ben\u00edtez-Andrades, J. A., Gonz\u00e1lez-Jim\u00e9nez, \u00c1., L\u00f3pez-Brea, \u00c1., Aveleira-Mata, J., Alija-P\u00e9rez, J.-M., & Garc\u00eda-Ord\u00e1s, M. T. (2022). Detecting racism and xenophobia using deep learning models on twitter data: Cnn, lstm and bert. PeerJ Comput Sci, 8, 906.","journal-title":"PeerJ Comput Sci"},{"key":"9637_CR4","first-page":"15","volume-title":"Proceedings of the first workshop on language technology for equality, diversity and inclusion","author":"I Bigoulaeva","year":"2021","unstructured":"Bigoulaeva, I., Hangya, V., & Fraser, A. (2021). Cross-lingual transfer learning for hate speech detection. Proceedings of the first workshop on language technology for equality, diversity and inclusion (pp. 15\u201325). Association for Computational Linguistics."},{"key":"9637_CR6","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching word vectors with subword information. Trans Assoc Comput Linguist, 5, 135\u2013146.","journal-title":"Trans Assoc Comput Linguist"},{"key":"9637_CR7","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/W15-30","volume-title":"Proceedings of the tenth workshop on statistical machine translation","author":"O Bojar","year":"2015","unstructured":"Bojar, O., Chatterjee, R., Federmann, C., Haddow, B., Hokamp, C., Huck, M., et al. (2015). Proceedings of the tenth workshop on statistical machine translation. ACL."},{"key":"9637_CR8","first-page":"1","volume-title":"Proceedings of the 50th Hawaii international conference on system sciences","author":"U Bretschneider","year":"2017","unstructured":"Bretschneider, U., & Peters, R. (2017). Detecting offensive statements towards foreigners in social media. In T. Bui (Ed.), Proceedings of the 50th Hawaii international conference on system sciences (pp. 1\u201310). HICSS."},{"key":"9637_CR9","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). Smote: synthetic minority over-sampling technique. J Artificial Intelligence Res, 16, 321\u2013357. https:\/\/doi.org\/10.1613\/jair.953.","journal-title":"J Artificial Intelligence Res"},{"key":"9637_CR10","volume-title":"Word translation without parallel data","author":"A Conneau","year":"2018","unstructured":"Conneau, A., Lample, G., Ranzato, M., Denoyer, L., & J\u2019egou, H. (2018). Word translation without parallel data. USA: Cornell University."},{"key":"9637_CR11","volume-title":"Proceedings of the international conference on learning representations","author":"A Conneau","year":"2018","unstructured":"Conneau, A., Lample, G., Ranzato, M., Denoyer, L., & J\u00e9gou, H. (2018). Word translation without parallel data. Proceedings of the international conference on learning representations. Cornell University."},{"key":"9637_CR12","first-page":"512","volume-title":"Proceedings of the 11th International AAAI conference on web and social media","author":"T Davidson","year":"2017","unstructured":"Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Proceedings of the 11th International AAAI conference on web and social media (pp. 512\u2013515). ICWSM\u2019 17."},{"key":"9637_CR13","doi-asserted-by":"crossref","first-page":"11","DOI":"10.18653\/v1\/W18-5102","volume-title":"Proceedings of the 2nd workshop on abusive language online (ALW2)","author":"O de Gibert","year":"2018","unstructured":"de Gibert, O., Perez, N., Garc\u00eda-Pablos, A., & Cuadros, M. (2018). Hate speech dataset from a white supremacy forum. Proceedings of the 2nd workshop on abusive language online (ALW2) (pp. 11\u201320). Association for Computational Linguistics."},{"key":"9637_CR14","first-page":"27","volume-title":"Proceedings of the GermEval 2018 workshop","author":"T De Smedt","year":"2018","unstructured":"De Smedt, T., & Jaki, S. (2018). Challenges of automatically detecting offensive language online: participation paper for the germeval shared task 2018 (HaUA). Proceedings of the GermEval 2018 workshop (pp. 27\u201332). ACM."},{"key":"9637_CR15","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American chapter of the association for computational linguistics: human language technologies","author":"J Devlin","year":"2019","unstructured":"Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American chapter of the association for computational linguistics: human language technologies (pp. 4171\u20134186). Association for Computational Linguistics."},{"key":"9637_CR16","volume-title":"Proceedings of the 2nd workshop on abusive language online (ALW2)","author":"D Fi\u0161er","year":"2018","unstructured":"Fi\u0161er, D., Huang, R., Prabhakaran, V., Voigt, R., Waseem, Z., & Wernimont, J. (2018). Proceedings of the 2nd workshop on abusive language online (ALW2). Brussels: Association for Computational Linguistics."},{"key":"9637_CR17","doi-asserted-by":"publisher","DOI":"10.1145\/3232676","author":"P Fortuna","year":"2018","unstructured":"Fortuna, P., & Nunes, S. (2018). A survey on automatic detection of hate speech in text. ACM Comput. Surv.https:\/\/doi.org\/10.1145\/3232676.","journal-title":"ACM Comput. Surv."},{"key":"9637_CR18","first-page":"6786","volume-title":"Proceedings of the 12th language resources and evaluation conference","author":"P Fortuna","year":"2020","unstructured":"Fortuna, P., Soler, J., & Wanner, L. (2020). Toxic, hateful, offensive or abusive? what are we really classifying? an empirical analysis of hate speech datasets. Proceedings of the 12th language resources and evaluation conference (pp. 6786\u20136794). Marseille: European Language Resources Association."},{"key":"9637_CR19","volume-title":"Detecting online hate speech using context aware models","author":"L Gao","year":"2017","unstructured":"Gao, L., & Huang, R. (2017). Detecting online hate speech using context aware models. Cornell University."},{"key":"9637_CR20","volume-title":"Analyzing and detecting abusive language across domains and languages","author":"G Glava\u0161","year":"2020","unstructured":"Glava\u0161, G., Karan, M., & Vulic, I. (2020). Analyzing and detecting abusive language across domains and languages. Association for Computational Linguistics."},{"key":"9637_CR21","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1145\/3270101.3270103","volume-title":"Proceedings of the 11th ACM workshop on artificial intelligence and security","author":"T Gr\u00f6ndahl","year":"2018","unstructured":"Gr\u00f6ndahl, T., Pajola, L., Juuti, M., Conti, M., & Asokan, N. (2018). All you need is love evading hate speech detection. Proceedings of the 11th ACM workshop on artificial intelligence and security (pp. 2\u201312). ACM."},{"key":"9637_CR22","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1145\/3465336.3475102","volume-title":"Proceedings of the 32nd ACM conference on hypertext and social media","author":"A Jiang","year":"2021","unstructured":"Jiang, A., & Zubiaga, A. (2021). Cross-lingual capsule network for hate speech detection in social media. Proceedings of the 32nd ACM conference on hypertext and social media (pp. 217\u2013223). ACM."},{"key":"9637_CR23","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/s40537-019-0192-5","volume":"6","author":"J Johnson","year":"2019","unstructured":"Johnson, J., & Khoshgoftaar, T. (2019). Survey on deep learning with class imbalance. J Big Data, 6, 27.","journal-title":"J Big Data"},{"key":"9637_CR24","doi-asserted-by":"crossref","first-page":"1746","DOI":"10.3115\/v1\/D14-1181","volume-title":"Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP)","author":"Y Kim","year":"2014","unstructured":"Kim, Y. (2014). Convolutional neural networks for sentence classification. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 1746\u20131751). Association for Computational Linguistics."},{"key":"9637_CR25","doi-asserted-by":"crossref","first-page":"177","DOI":"10.3115\/1557769.1557821","volume-title":"Proceedings of the 45th annual meeting of the acl on interactive poster and demonstration sessions","author":"P Koehn","year":"2007","unstructured":"Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., et al. (2007). Moses: open source toolkit for statistical machine translation. Proceedings of the 45th annual meeting of the acl on interactive poster and demonstration sessions (pp. 177\u2013180). ACL."},{"key":"9637_CR26","doi-asserted-by":"crossref","unstructured":"Kozareva, Z. (2006). Bootstrapping named entity recognition with automatically generated gazetteer lists. In: Student Research Workshop. url: https:\/\/www.aclweb.org\/anthology\/E06-3004","DOI":"10.3115\/1609039.1609041"},{"key":"9637_CR27","first-page":"1","volume-title":"Proceedings of the first workshop on trolling, aggression and cyberbullying (TRAC-2018)","author":"R Kumar","year":"2018","unstructured":"Kumar, R., Ojha, A. K., Malmasi, S., & Zampieri, M. (2018). Benchmarking aggression identification in social media. Proceedings of the first workshop on trolling, aggression and cyberbullying (TRAC-2018) (pp. 1\u201311). Association for Computational Linguistics."},{"issue":"8","key":"9637_CR28","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0221152","volume":"14","author":"S MacAvaney","year":"2019","unstructured":"MacAvaney, S., Yao, H. R., Yang, E., Russell, K., Goharian, N., & Frieder, O. (2019). Hate speech detection: challenges and solutions. PLOS ONE, 14(8), 1\u201316. https:\/\/doi.org\/10.1371\/journal.pone.0221152.","journal-title":"PLOS ONE"},{"key":"9637_CR29","volume-title":"Proceedings of the fourth workshop on online abuse and harms","author":"K Madukwe","year":"2020","unstructured":"Madukwe, K., Gao, X., & Xue, B. (2020). (2020) In data we trust: a critical analysis of hate speech detection datasets. Proceedings of the fourth workshop on online abuse and harms. Association for Computational Linguistics."},{"key":"9637_CR30","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1145\/3368567","volume-title":"Proceedings of the 11th forum for information retrieval evaluation","author":"P Majumder","year":"2019","unstructured":"Majumder, P., Patel, D., Modha, S., & Mandl, T. (2019). Overview of the HASOC track at FIRE 2019: hate speech and offensive content identification in Indo-European languages. Proceedings of the 11th forum for information retrieval evaluation (pp. 14\u201317). ACM."},{"key":"9637_CR31","volume-title":"Proceedings of the 2nd workshop on abusive language online (ALW2)","author":"P Mathur","year":"2018","unstructured":"Mathur, P., Sawhney, R., Ayyar, M., & Shah, R. (2018). Did you offend me? classification of offensive tweets in Hinglish language. Proceedings of the 2nd workshop on abusive language online (ALW2). Brussels: Association for Computational Linguistics."},{"key":"9637_CR32","unstructured":"Mikolov, T., Le, Q.V., & Sutskever, I. (2013a). Exploiting Similarities among Languages for Machine Translation. CoRR abs\/1309.4"},{"key":"9637_CR33","volume-title":"1st international conference on learning representations","author":"T Mikolov","year":"2013","unstructured":"Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013b). Efficient estimation of word representations in vector space. 1st international conference on learning representations. ICLR."},{"key":"9637_CR34","volume-title":"Machine learning with oversampling and undersampling techniques: overview study experimental results","author":"R Mohammed","year":"2020","unstructured":"Mohammed, R., Rawashdeh, J., & Abdullah, M. (2020). Machine learning with oversampling and undersampling techniques: overview study experimental results. IEEE."},{"key":"9637_CR35","volume-title":"Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing","author":"D Nozza","year":"2021","unstructured":"Nozza, D. (2021). Exposing the limits of zero-shot cross-lingual hate speech detection. Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing. Association for Computational Linguistics."},{"key":"9637_CR36","first-page":"1","volume":"2017","author":"EW Pamungkas","year":"2021","unstructured":"Pamungkas, E. W., Basile, V., & Patti, V. (2021a). Towards multidomain and multilingual abusive language detection: a survey. Personal Ubiquitous Comput, 2017, 1\u201327.","journal-title":"Personal Ubiquitous Comput"},{"issue":"4","key":"9637_CR37","doi-asserted-by":"crossref","first-page":"102544","DOI":"10.1016\/j.ipm.2021.102544","volume":"58","author":"EW Pamungkas","year":"2021","unstructured":"Pamungkas, E. W., Basile, V., & Patti, V. (2021b). A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection. Info Process Manag, 58(4), 102544.","journal-title":"Info Process Manag"},{"key":"9637_CR38","first-page":"30","volume-title":"Proceedings of the EACL hackashop on news media content analysis and automated report generation","author":"A Pelicon","year":"2021","unstructured":"Pelicon, A., Shekhar, R., Martinc, M., \u0160krlj, B., Purver, M., & Pollak, S. (2021). Zero-shot cross-lingual content filtering: offensive language and hate speech detection. Proceedings of the EACL hackashop on news media content analysis and automated report generation (pp. 30\u201334). Association for Computational Linguistics."},{"issue":"2","key":"9637_CR39","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1007\/s10579-020-09502-8","volume":"55","author":"F Poletto","year":"2021","unstructured":"Poletto, F., Basile, V., Sanguinetti, M., Bosco, C., & Patti, V. (2021). Resources and benchmark corpora for hate speech detection: a systematic review. Lang Resour Eval, 55(2), 477\u2013523.","journal-title":"Lang Resour Eval"},{"key":"9637_CR40","doi-asserted-by":"crossref","first-page":"5838","DOI":"10.18653\/v1\/2020.emnlp-main.470","volume-title":"Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP)","author":"T Ranasinghe","year":"2020","unstructured":"Ranasinghe, T., & Zampieri, M. (2020). Multilingual offensive language identification with cross-lingual embeddings. Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP) (pp. 5838\u20135844). Cornell University."},{"key":"9637_CR41","volume-title":"Proceedings of the third workshop on abusive language online","author":"ST Roberts","year":"2019","unstructured":"Roberts, S. T., Tetreault, J., Prabhakaran, V., & Waseem, Z. (2019). Proceedings of the third workshop on abusive language online. Florence: Association for Computational Linguistics."},{"key":"9637_CR42","first-page":"6","volume-title":"Proceedings of NLP4CMC III: 3rd workshop on natural languageprocessing for computer-mediated communication","author":"B Ross","year":"2016","unstructured":"Ross, B., Rist, M., Carbonell, G., Cabrera, B., Kurowsky, N., & Wojatzki, M. (2016). Measuring the reliability of hate speech annotations: the case of the European refugee crisis. In M. Bei\u00dfwenger, M. Wojatzki, & T. Zesch (Eds.), Proceedings of NLP4CMC III: 3rd workshop on natural languageprocessing for computer-mediated communication (Vol. 17, pp. 6\u20139). Bochumer Linguistische Arbeitsberichte."},{"key":"9637_CR43","doi-asserted-by":"crossref","DOI":"10.1553\/0x003a105d","volume-title":"Proceedings of the GermEval 2018 workshop","author":"J Ruppenhofer","year":"2018","unstructured":"Ruppenhofer, J., Siegel, M., & Wiegand, M. (2018). Proceedings of the GermEval 2018 workshop. Vienna: Austrian Academy of Sciences."},{"key":"9637_CR44","first-page":"1","volume-title":"Proceedings of the Fifth International workshop on natural language processing for social media","author":"A Schmidt","year":"2017","unstructured":"Schmidt, A., & Wiegand, M. (2017). A survey on hate speech detection using natural language processing. Proceedings of the Fifth International workshop on natural language processing for social media (pp. 1\u201310). Valencia: Association for Computational Linguistics."},{"key":"9637_CR45","volume-title":"Cross-lingual zero- and few-shot hate speech detection utilising frozen transformer language models and axel","author":"L Stappen","year":"2020","unstructured":"Stappen, L., Brunn, F., & Schuller, B. (2020). Cross-lingual zero- and few-shot hate speech detection utilising frozen transformer language models and axel. Cornell University."},{"key":"9637_CR46","volume-title":"Overview of germeval task 2, 2019 shared task on the identification of offensive language","author":"J Stru\u00df","year":"2019","unstructured":"Stru\u00df, J., Siegel, M., Ruppenhofer, J., Wiegand, M., & Klenner, M. (2019). Overview of germeval task 2, 2019 shared task on the identification of offensive language. University of Erlangen-Nuremberg."},{"issue":"12","key":"9637_CR47","doi-asserted-by":"crossref","first-page":"0243300","DOI":"10.1371\/journal.pone.0243300","volume":"15","author":"B Vidgen","year":"2020","unstructured":"Vidgen, B., & Derczynski, L. (2020). Directions in abusive language training data, a systematic review: garbage in, garbage out. PLOS ONE, 15(12), 0243300.","journal-title":"PLOS ONE"},{"key":"9637_CR48","first-page":"14647","volume-title":"CVF Conference on Computer Vision and Pattern Recognition","author":"X Wang","year":"2022","unstructured":"Wang, X., Wu, Z., Lian, L., & Yu, S. X. (2022). Debiased learning from naturally imbalanced pseudo-labels. CVF Conference on Computer Vision and Pattern Recognition (pp. 14647\u201314657). IEEE."},{"key":"9637_CR51","first-page":"88","volume-title":"Proceedings of the NAACL student research workshop","author":"Z Waseem","year":"2016","unstructured":"Waseem, Z., & Hovy, D. (2016). Hateful symbols or hateful people? predictive features for hate speech detection on twitter. Proceedings of the NAACL student research workshop (pp. 88\u201393). San Diego: Association for Computational Linguistics."},{"key":"9637_CR50","volume-title":"Proceedings of the first workshop on abusive language online","author":"Z Waseem","year":"2017","unstructured":"Waseem, Z., Chung, W. H. K., Hovy, D., & Tetreault, J. (2017a). Proceedings of the first workshop on abusive language online. Association for Computational Linguistics."},{"key":"9637_CR49","doi-asserted-by":"crossref","unstructured":"Waseem, Z., Davidson, T., Warmsley, D., Weber, I. (2017b). Understanding abuse: A typology of abusive language detection subtasks. arXiv preprint arXiv:1705.09899","DOI":"10.18653\/v1\/W17-3012"},{"key":"9637_CR52","first-page":"10857","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition","author":"C Wei","year":"2021","unstructured":"Wei, C., Sohn, K., Mellina, C., Yuille, A., & Yang, F. (2021). A class-rebalancing self-training framework for imbalanced semi-supervised learning. Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 10857\u201310866). IEEE."},{"key":"9637_CR53","volume-title":"Proceedings of the GermEval 2018 workshop","author":"G Wiedemann","year":"2018","unstructured":"Wiedemann, G., Ruppert, E., Jindal, R., & Biemann, C. (2018). Transfer learning from LDA to BiLSTM-CNN for offensive language detection in twitter. Proceedings of the GermEval 2018 workshop. University of Hamburg."},{"key":"9637_CR54","first-page":"21","volume-title":"Proceedings of the GermEval 2018 workshop","author":"M Wiegand","year":"2018","unstructured":"Wiegand, M., Amann, A., Anikina, T., Azoidou, A., Borisenkov, A., Kolmorgen, K., et al. (2018a). Saarland University\u2019s Participation in the GermEval Task 2018 (UdSW)-examining different types of classifiers and features. Proceedings of the GermEval 2018 workshop (pp. 21\u201326). Saarland University."},{"key":"9637_CR55","first-page":"1","volume-title":"Proceedings of GermEval 2018, 14th conference on natural language processing (KONVENS 2018)","author":"M Wiegand","year":"2018","unstructured":"Wiegand, M., Siegel, M., & Ruppenhofer, J. (2018b). Overview of the germeval 2018 shared task on the identification of offensive language. Proceedings of GermEval 2018, 14th conference on natural language processing (KONVENS 2018) (pp. 1\u201310). Austrian Academy of Sciences."},{"key":"9637_CR56","doi-asserted-by":"crossref","first-page":"1391","DOI":"10.1145\/3038912.3052591","volume-title":"Proceedings of the 26th international conference on world wide web","author":"E Wulczyn","year":"2017","unstructured":"Wulczyn, E., Thain, N., & Dixon, L. (2017). Ex machina: personal attacks seen at scale. Proceedings of the 26th international conference on world wide web (pp. 1391\u20131399). ACM."},{"key":"9637_CR57","volume-title":"Proceedings of the GermEval 2018 workshop","author":"J Xi","year":"2018","unstructured":"Xi, J., Spranger, M., & Labudde, D. (2018). CNN-based offensive language detection. Proceedings of the GermEval 2018 workshop. Austrian Academy of Sciences."},{"key":"9637_CR58","series-title":"Long and Short Papers","first-page":"1415","volume-title":"Proceedings of the 2019 Conference of the North American chapter of the association for computational linguistics: human language technologies","author":"M Zampieri","year":"2019","unstructured":"Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., & Kumar, R. (2019). Predicting the type and target of offensive posts in social media. Long and Short Papers. Proceedings of the 2019 Conference of the North American chapter of the association for computational linguistics: human language technologies (Vol. 1, pp. 1415\u20131420). Association for Computational Linguistics."},{"key":"9637_CR59","doi-asserted-by":"crossref","first-page":"1425","DOI":"10.18653\/v1\/2020.semeval-1.188","volume-title":"Proceedings of the fourteenth workshop on semantic evaluation","author":"M Zampieri","year":"2020","unstructured":"Zampieri, M., Nakov, P., Rosenthal, S., Atanasova, P., Karadzhov, G., Mubarak, H., et al. (2020). 2020) Semeval-2020 task 12: Multilingual offensive language identification in social media (offenseval 2020. Proceedings of the fourteenth workshop on semantic evaluation (pp. 1425\u20131447). Cornell University."},{"key":"9637_CR60","first-page":"1435","volume-title":"Proceedings of the International AAAI conference on web and social media","author":"HB Zia","year":"2022","unstructured":"Zia, H. B., Castro, I., Zubiaga, A., & Tyson, G. (2022). Improving zero-shot cross-lingual hate speech detection with pseudo-label fine-tuning of transformer language models. Proceedings of the International AAAI conference on web and social media (pp. 1435\u20131439). AAAI."}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-023-09637-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10579-023-09637-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-023-09637-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,17]],"date-time":"2023-11-17T15:06:46Z","timestamp":1700233606000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10579-023-09637-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,18]]},"references-count":59,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,12]]}},"alternative-id":["9637"],"URL":"https:\/\/doi.org\/10.1007\/s10579-023-09637-4","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"value":"1574-020X","type":"print"},{"value":"1574-0218","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,18]]},"assertion":[{"value":"13 January 2023","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 February 2023","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}