{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T02:35:47Z","timestamp":1769826947151,"version":"3.49.0"},"reference-count":42,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2021,12,24]],"date-time":"2021-12-24T00:00:00Z","timestamp":1640304000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2022,3,31]]},"abstract":"<jats:p>On YouTube, billions of videos are watched online and millions of short messages are posted each day. YouTube along with other social networking sites are used by individuals and extremist groups for spreading hatred among users. In this paper, we consider religion as the most targeted domain for spreading hate speech among people of different religions. We present a methodology for the detection of religion-based hate videos on YouTube. Messages posted on YouTube videos generally express the opinions of users\u2019 related to that video. We provide a novel dataset for religious hate speech detection on Youtube comments. The proposed methodology applies data mining techniques on extracted comments from religious videos in order to filter religion-oriented messages and detect those videos which are used for spreading hate. The supervised learning algorithms: Support Vector Machine (SVM), Logistic Regression (LR), and k-Nearest Neighbor (k-NN) are used for baseline results.<\/jats:p>","DOI":"10.3233\/jifs-219264","type":"journal-article","created":{"date-parts":[[2021,12,24]],"date-time":"2021-12-24T10:28:04Z","timestamp":1640341684000},"page":"4769-4777","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":3,"title":["YouTube based religious hate speech and extremism detection dataset with machine learning baselines"],"prefix":"10.1177","volume":"42","author":[{"given":"Noman","family":"Ashraf","sequence":"first","affiliation":[{"name":"CIC, Instituto Polit\u00c3l\u2019 cnico Nacional, Mexico"}]},{"given":"Abid","family":"Rafiq","sequence":"additional","affiliation":[{"name":"Department of Computer Science and IT, University of Sargodha, Pakistan"}]},{"given":"Sabur","family":"Butt","sequence":"additional","affiliation":[{"name":"CIC, Instituto Polit\u00c3l\u2019 cnico Nacional, Mexico"}]},{"given":"Hafiz Muhammad Faisal","family":"Shehzad","sequence":"additional","affiliation":[{"name":"Department of Computer Science and IT, University of Sargodha, Pakistan"}]},{"given":"Grigori","family":"Sidorov","sequence":"additional","affiliation":[{"name":"CIC, Instituto Polit\u00c3l\u2019 cnico Nacional, Mexico"}]},{"given":"Alexander","family":"Gelbukh","sequence":"additional","affiliation":[{"name":"CIC, Instituto Polit\u00c3l\u2019 cnico Nacional, Mexico"}]}],"member":"179","published-online":{"date-parts":[[2021,12,24]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"crossref","unstructured":"AgarwalS. and SurekaA. A focused crawler for mining hate and extremism promoting videos on youtube. In Proceedings of the 25th ACM conference on Hypertext and social media (2014) pp. 294\u2013296.","DOI":"10.1145\/2631775.2631776"},{"key":"e_1_3_2_3_2","doi-asserted-by":"crossref","unstructured":"AlbadiN. KurdiM. and MishraS. Are they our brothers? analysis and detection of religious hate speech in the arabic twittersphere. In 2018 IEEE\/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (2018) pp. 69\u201376. IEEE.","DOI":"10.1109\/ASONAM.2018.8508247"},{"issue":"3","key":"e_1_3_2_4_2","first-page":"02","article-title":"Multi-label emotion classification using content-based features inTwitter","volume":"24","author":"Ameer I.","year":"2021","unstructured":"AmeerI., AshrafN., SidorovG. and G\u00c3\u015fmez AdornoH., Multi-label emotion classification using content-based features inTwitter, Computaci\u00f3n y Sistemas24(3) (2021), 02.","journal-title":"Computaci\u00f3n y Sistemas"},{"key":"e_1_3_2_5_2","doi-asserted-by":"crossref","unstructured":"AshrafN. MustafaR. SidorovG. and GelbukhA. Individual vs. group violent threats classification in online discussions. In Companion Proceedings of the Web Conference 2020 WWW \u2019 20 (2020) pp. 629\u2013633 New York NY USA. Association for Computing Machinery. ISBN 9781450370240.","DOI":"10.1145\/3366424.3385778"},{"key":"e_1_3_2_6_2","unstructured":"AshrafN. ButtS. SidorovG. and GelbukhA. CIC at CheckThat! 2021: Fake news detection using machine learning and data augmentation. In CLEF 2021 \u2013 Conference and Labs of the Evaluation Forum Bucharest Romania 2021."},{"issue":"7","key":"e_1_3_2_7_2","first-page":"112","article-title":"Human ggressiveness and reactions towards uncertain decisions","volume":"6","author":"Bashir F.","year":"2019","unstructured":"BashirF., AshrafN., YaqoobA., RafiqA. and Ul MustafaR., Human ggressiveness and reactions towards uncertain decisions, International Journal of ADVANCED AND APPLIED SCIENCES6(7) (2019), 112\u2013116.","journal-title":"International Journal of ADVANCED AND APPLIED SCIENCES"},{"key":"e_1_3_2_8_2","first-page":"1","article-title":"Us and them: identifying cyber hate ontwitter across multiple protected characteristics","volume":"5","author":"Burnap P.","year":"2016","unstructured":"BurnapP. and WilliamsM.L., Us and them: identifying cyber hate ontwitter across multiple protected characteristics, EPJ DataScience5 (2016), 1\u201315.","journal-title":"EPJ DataScience"},{"key":"e_1_3_2_9_2","unstructured":"BurnapP. and Leighton WilliamsM. Hate speech machine classification and statistical modelling of information flows on twitter: Interpretation and communication for policy decision making. 2014."},{"issue":"1","key":"e_1_3_2_10_2","first-page":"2021","article-title":"Transformerbased extractive social media question answering onTweetQA","volume":"25","author":"Butt S.","unstructured":"ButtS., AshrafN., Fahim SiddiquiM.H., SidorovG. and GelbukhA., Transformerbased extractive social media question answering onTweetQA, Computaci\u00f3n y Sistemas25(1), 2021.","journal-title":"Computaci\u00f3n y Sistemas"},{"key":"e_1_3_2_11_2","unstructured":"ButtS. AshrafN. SidorovG. and GelbukhA. Sexism identification using BERT and data augmentation - EXIST2021. In International Conference of the Spanish Society for Natural Language Processing SEPLN 2021 IberLEF 2021 Spain 2021."},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1007\/s42979-020-00413-7"},{"key":"e_1_3_2_13_2","doi-asserted-by":"crossref","unstructured":"ChenH. DenningD. RobertsN. LarsonC.A. YuX. and HuangC.N. Chapter 1-revealing the hidden world of the dark web: Social media forums and videos Intelligent Systems for Security Informatics (2013) pp. 1\u201328.","DOI":"10.1016\/B978-0-12-404702-0.00001-X"},{"key":"e_1_3_2_14_2","doi-asserted-by":"crossref","unstructured":"ConwayM. and McInerneyL. Jihadi video and autoradicalisation: Evidence from an exploratory youtube study. In European Conference on Intelligence and Security Informatics (2008) pp. 108\u2013118. Springer.","DOI":"10.1007\/978-3-540-89900-6_13"},{"key":"e_1_3_2_15_2","doi-asserted-by":"crossref","unstructured":"de GibertO. PerezN. Garc\u00eda-PablosA. and CuadrosM. Hate speech dataset from a white supremacy forum. arXiv preprint arXiv:1809.04444 2018.","DOI":"10.18653\/v1\/W18-5102"},{"key":"e_1_3_2_16_2","first-page":"4171","article-title":"BERT: Pre-trainingof deep bidirectional transformers for language understanding. In(Long and Short Papers)","volume":"1","author":"Devlin J.","year":"2019","unstructured":"DevlinJ., ChangM.-W., LeeK. and ToutanovaK., BERT: Pre-trainingof deep bidirectional transformers for language understanding. In(Long and Short Papers), Proceedings of the 2019 Conference of the North AmericanChapter of the Association for Computational Linguistics: HumanLanguage Technologies1 (2019), pp. 4171\u20134186.","journal-title":"Proceedings of the 2019 Conference of the North AmericanChapter of the Association for Computational Linguistics: HumanLanguage Technologies"},{"key":"e_1_3_2_17_2","doi-asserted-by":"crossref","unstructured":"DjuricN. ZhouJ. MorrisR. GrbovicM. RadosavljevicV. and BhamidipatiN. Hate speech detection with comment embeddings. In Proceedings of the 24th International Conference on World Wide Web WWW \u0102\u017915 Companion (2015) pp. 29\u201330 New York NY USA. Association for Computing Machinery. ISBN 9781450334730.","DOI":"10.1145\/2740908.2742760"},{"key":"e_1_3_2_18_2","unstructured":"GagliardoneI. GalD. AlvesT. and MartinezG. Countering online hate speech. Unesco Publishing 2015."},{"issue":"1","key":"e_1_3_2_19_2","first-page":"10","article-title":"he weka data mining software: an update","volume":"11","author":"Hall M.","year":"2009","unstructured":"HallM., FrankE., HolmesG., PfahringerB., ReutemannP. and WittenI.H., he weka data mining software: an update, ACM SIGKDDExplorations Newsletter11(1) (2009), 10\u201318.","journal-title":"ACM SIGKDDExplorations Newsletter"},{"key":"e_1_3_2_20_2","unstructured":"HsuH. and LachenbruchP.A. Paired t test.Wiley StatsRef: statistics reference online 2014."},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","unstructured":"KandiasM. StavrouV. BozovicN. MitrouL. and GritzalisD. Can we trust this user? predicting insider\u2019s attitude via youtube usage profiling. In 2013 IEEE 10th International Conference on Ubiquitous Intelligence and Computing and 2013 IEEE 10th International Conference on Autonomic and Trusted Computing (2013) pp. 347\u2013354. IEEE.","DOI":"10.1109\/UIC-ATC.2013.12"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","unstructured":"KhanL. AmjadA. AshrafN. ChangH.-T. and GelbukhA. Urdu sentiment analysis with deep learning methods IEEE Access (2021) pp. 1\u20131. doi: 10.1109\/ACCESS.2021.3093078.","DOI":"10.1109\/ACCESS.2021.3093078"},{"key":"e_1_3_2_23_2","doi-asserted-by":"crossref","unstructured":"KwokI. and WangY. Locate the hate: Detecting tweets against blacks. In Twenty-seventh AAAI conference on artificial intelligence 2013.","DOI":"10.1609\/aaai.v27i1.8539"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.4018\/IJSWIS.2017010104"},{"key":"e_1_3_2_25_2","unstructured":"LanZ. ChenM. GoodmanS. GimpelK. SharmaP. and SoricutR. ALBERT: A lite BERT for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942 2019."},{"key":"e_1_3_2_26_2","doi-asserted-by":"crossref","unstructured":"LiuB. and ZhangL. A survey of opinion mining and sentiment analysis. In Mining text data (2012) pp. 415\u2013463. Springer.","DOI":"10.1007\/978-1-4614-3223-4_13"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1080\/03637751003758227"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.22452\/mjcs.vol30no1.5"},{"key":"e_1_3_2_29_2","doi-asserted-by":"crossref","unstructured":"Ul MustafaR. AshrafN. Shabbir AhmedF. FerzundJ. ShahzadanB. and GelbukhA. A multiclass depression detection in social media based on sentiment analysis. In Shahram Latifi editor 17th International Conference on Information Technology\u2013 NewGenerations (ITNG 2020) (2020) pp. 659\u2013662 Cham 2020. Springer International Publishing. ISBN 978-3-030-43020-7.","DOI":"10.1007\/978-3-030-43020-7_89"},{"key":"e_1_3_2_30_2","doi-asserted-by":"crossref","unstructured":"Ul MustafaR. MouraD. and Esteve RothenbergC. Machine learning approach to estimate video qoe of encrypted dash traffic in 5g networks. In 2021 IEEE Statistical Signal ProcessingWorkshop (SSP) (2021) pp. 586\u2013589. IEEE.","DOI":"10.1109\/SSP49050.2021.9513804"},{"key":"e_1_3_2_31_2","unstructured":"MuthiahS. HuangB. ArredondoJ. MaresD. GetoorL. KatzG. and RamakrishnanN. Planned protest modeling in news and social media. In Twenty- Seventh IAAI Conference 2015."},{"issue":"8","key":"e_1_3_2_32_2","first-page":"9","article-title":"Language models are unsupervised multitask learners","volume":"1","author":"Radford A.","year":"2019","unstructured":"RadfordA., WuJ., ChildR., LuanD., AmodeiD. and SutskeverI., Language models are unsupervised multitask learners, OpenAIblog1(8) (2019), 9.","journal-title":"OpenAIblog"},{"key":"e_1_3_2_33_2","unstructured":"SilvaL. MondalM. CorreaD. BenevenutoF. and WeberI. Analyzing the targets of hate in online social media. In Tenth international AAAI conference on web and social media 2016."},{"key":"e_1_3_2_34_2","doi-asserted-by":"crossref","unstructured":"SrivastavaA. HasanM. YagnikB. WalambeR. and KotechaK. Role of artificial intelligence in detection of hateful speech for Hinglish data on social media. arXiv preprint arXiv:2105.04913 2021.","DOI":"10.1007\/978-981-16-3067-5_8"},{"key":"e_1_3_2_35_2","doi-asserted-by":"crossref","unstructured":"SurekaA. KumaraguruP. GoyalA. and ChhabraS. Mining youtube to discover extremist videos users and hidden communities. In Asia information retrieval symposium (2010) pp. 13\u201324. Springer.","DOI":"10.1007\/978-3-642-17187-1_2"},{"key":"e_1_3_2_36_2","doi-asserted-by":"crossref","unstructured":"TingI-H. ChiH.-M. WuJ.-S. and WangS.- L. An approach for hate groups detection in facebook. In The 3rd International Workshop on Intelligent Data Analysis and Management (2013) pp. 101\u2013106. Springer.","DOI":"10.1007\/978-94-007-7293-9_11"},{"key":"e_1_3_2_37_2","doi-asserted-by":"crossref","unstructured":"TurneyP.D. Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. arXiv preprint cs\/0212032 2002.","DOI":"10.3115\/1073083.1073153"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.3390\/info12010005"},{"key":"e_1_3_2_39_2","unstructured":"WarnerW. and HirschbergJ. Detecting hate speech on the world wide web. In Proceedings of the second workshop on language in social media (2012) pp. 19\u201326."},{"key":"e_1_3_2_40_2","doi-asserted-by":"crossref","unstructured":"WulczynE. ThainN. and DixonL. Ex machina: Personal attacks seen at scale. In Proceedings of the 26th International Conference onWorldWideWeb WWW \u0102\u017917 pp. 1391\u20131399 Republic and Canton of Geneva CHE 2017.","DOI":"10.1145\/3038912.3052591"},{"key":"e_1_3_2_41_2","unstructured":"ZampieriM. MalmasiS. NakovP. RosenthalS. FarraN. and KumarR. Predicting the type and target of offensive posts in social media. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies1 (Long and Short Papers) pp. 1415\u20131420 Minneapolis Minnesota June 2019. Association for Computational Linguistics."},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2005.96"},{"key":"e_1_3_2_43_2","unstructured":"ZiaT. Shehbaz AkramM. Saqib NawazM. ShahzadB. AbdullatifA.M. MustafaR.U. and Ikramullah LaliM. Identification of hatred speeches on twitter. In Proceedings of 52nd The IRES International Conference (2016) pp. 27\u201332."}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-219264","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-219264","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-219264","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T14:22:47Z","timestamp":1769782967000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-219264"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,24]]},"references-count":42,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,3,31]]}},"alternative-id":["10.3233\/JIFS-219264"],"URL":"https:\/\/doi.org\/10.3233\/jifs-219264","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12,24]]}}}