{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,24]],"date-time":"2025-04-24T04:14:21Z","timestamp":1745468061811,"version":"3.40.4"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,2,25]],"date-time":"2025-02-25T00:00:00Z","timestamp":1740441600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,2,25]],"date-time":"2025-02-25T00:00:00Z","timestamp":1740441600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Hum-Cent Intell Syst"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Text classification plays a vital role in natural language processing (NLP), especially in the context of rapidly expanding textual data. This paper presents an automated approach to detecting and classifying violent incidents from a large corpus of Bangla news articles, specifically focusing on incidents such as Murder, Rape, Kidnap, Clash, and Suicide. Our methodology employs a semi-supervised learning framework, leveraging a small labeled dataset to train an initial predictive model, which is then expanded to create a comprehensive dataset suitable for deep learning applications. We explore various feature extraction techniques, including N-gram, TF-IDF, Word2Vec, FastText, and BERT, with the BERT-based classifier achieving the highest accuracy of 92% on the test set. Utilizing this classifier, we conduct a detailed trend and pattern analysis over a 5-year period (2014\u20132018), revealing significant insights into the frequency and distribution of violent events. Our findings highlight distinct trends in the occurrence of these incidents, including variations by month and age group, and geographical intensity, ultimately contributing to a better understanding of violent events in the context of Bangla news media.<\/jats:p>","DOI":"10.1007\/s44230-025-00092-8","type":"journal-article","created":{"date-parts":[[2025,2,25]],"date-time":"2025-02-25T13:54:18Z","timestamp":1740491658000},"page":"90-102","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Temporal, Demographic, and Geographical Analysis of Violent Events in Bangla News Media Using NLP Techniques"],"prefix":"10.1007","volume":"5","author":[{"given":"Iftakhar","family":"Ali Khandokar","sequence":"first","affiliation":[]},{"given":"Abdullah","family":"All Tanvir","sequence":"additional","affiliation":[]},{"given":"Md.","family":"Saddam Hossain Mukta","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0669-072X","authenticated-orcid":false,"given":"Swakkhar","family":"Shatabda","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,2,25]]},"reference":[{"key":"92_CR1","unstructured":"Liu P, Yuan W, Fu J, Jiang Z, Hayashi H, Neubig G. Pre-train, prompt, and predict: a systematic survey of prompting methods in natural language processing. arXiv:2107.13586 [arXiv preprint]. 2021."},{"key":"92_CR2","doi-asserted-by":"crossref","unstructured":"Al\u00a0Mostakim S, Ehsan F, Hasan SM, Islam S, Shatabda S. Bangla content categorization using text based supervised learning methods. In: 2018 international conference on bangla speech and language processing (ICBSLP). IEEE; 2018. p. 1\u20136.","DOI":"10.1109\/ICBSLP.2018.8554811"},{"key":"92_CR3","doi-asserted-by":"crossref","unstructured":"Bhakta D, Dash AA, Bari MF, Shatabda S. Supervised machine learning for multi-label classification of Bangla articles. In: Cyber security and computer science: second EAI international conference, ICONCS 2020, Dhaka, Bangladesh, February 15\u201316, 2020, Proceedings 2. Springer; 2020. p. 477\u201387.","DOI":"10.1007\/978-3-030-52856-0_38"},{"key":"92_CR4","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1016\/j.ijmedinf.2019.02.008","volume":"125","author":"C Dreisbach","year":"2019","unstructured":"Dreisbach C, Koleck TA, Bourne PE, Bakken S. A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data. Int J Med Inform. 2019;125:37\u201346.","journal-title":"Int J Med Inform"},{"key":"92_CR5","doi-asserted-by":"crossref","unstructured":"Chowdhury K, Shatabda S. Sentiment analysis on Bangla financial news. In: 2021 IEEE international women in engineering (WIE) conference on electrical and computer engineering (WIECON-ECE). IEEE; 2021. p. 64\u20137.","DOI":"10.1109\/WIECON-ECE54711.2021.9829684"},{"key":"92_CR6","doi-asserted-by":"publisher","first-page":"38999","DOI":"10.1109\/ACCESS.2022.3165563","volume":"10","author":"O Sen","year":"2022","unstructured":"Sen O, Fuad M, Islam MN, Rabbi J, Masud M, Hasan MK, Awal MA, Fime AA, Fuad MTH, Sikder D, et al. Bangla natural language processing: a comprehensive analysis of classical, machine learning, and deep learning-based methods. IEEE Access. 2022;10:38999\u20139044.","journal-title":"IEEE Access"},{"key":"92_CR7","unstructured":"Alam F, Hasan A, Alam T, Khan A, Tajrin J, Khan N, Chowdhury SA. A review of bangla natural language processing tasks and the utility of transformer models. arXiv:2107.03844. [arXiv preprint]. 2021."},{"issue":"4","key":"92_CR8","doi-asserted-by":"publisher","first-page":"967","DOI":"10.1147\/sj.404.0967","volume":"40","author":"T Nasukawa","year":"2001","unstructured":"Nasukawa T, Nagano T. Text analysis and knowledge mining system. IBM Syst J. 2001;40(4):967\u201384.","journal-title":"IBM Syst J"},{"key":"92_CR9","first-page":"117","volume":"13","author":"M Thangaraj","year":"2018","unstructured":"Thangaraj M, Sivakami M. Text classification techniques: a literature review. Interdiscip J Inf Knowl Manag. 2018;13:117.","journal-title":"Interdiscip J Inf Knowl Manag"},{"key":"92_CR10","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1016\/j.eswa.2018.03.058","volume":"106","author":"MM Miro\u0144czuk","year":"2018","unstructured":"Miro\u0144czuk MM, Protasiewicz J. A recent overview of the state-of-the-art elements of text classification. Expert Syst Appl. 2018;106:36\u201354.","journal-title":"Expert Syst Appl"},{"issue":"1","key":"92_CR11","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1007\/s10462-018-09677-1","volume":"52","author":"AI Kadhim","year":"2019","unstructured":"Kadhim AI. Survey on supervised machine learning techniques for automatic text classification. Artif Intell Rev. 2019;52(1):273\u201392.","journal-title":"Artif Intell Rev"},{"issue":"12","key":"92_CR12","doi-asserted-by":"publisher","first-page":"4509","DOI":"10.3390\/ijerph17124509","volume":"17","author":"Z Li","year":"2020","unstructured":"Li Z, Gurgel H, Dessay N, Luojia H, Lei X, Gong P. Semi-supervised text classification framework: an overview of dengue landscape factors and satellite earth observation. Int J Environ Res Public Health. 2020;17(12):4509.","journal-title":"Int J Environ Res Public Health"},{"key":"92_CR13","doi-asserted-by":"crossref","unstructured":"Linmei H, Yang T, Shi C, Ji H, Li X. Heterogeneous graph attention networks for semi-supervised short text classification. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP); 2019. p. 4821\u201330.","DOI":"10.18653\/v1\/D19-1488"},{"key":"92_CR14","doi-asserted-by":"crossref","unstructured":"Croce D, Castellucci G, Basili R. Gan-bert: generative adversarial learning for robust text classification with a bunch of labeled examples. In: Proceedings of the 58th annual meeting of the association for computational linguistics; 2020. p. 2114\u20139.","DOI":"10.18653\/v1\/2020.acl-main.191"},{"key":"92_CR15","doi-asserted-by":"crossref","unstructured":"Meng Y, Shen J, Zhang C, Han J. Weakly-supervised neural text classification. In: Proceedings of the 27th ACM international conference on information and knowledge management; 2018. p. 983\u201392.","DOI":"10.1145\/3269206.3271737"},{"key":"92_CR16","doi-asserted-by":"crossref","unstructured":"Meng Yu, Shen J, Zhang C, Han J. Weakly-supervised hierarchical text classification. In: Proceedings of the AAAI conference on artificial intelligence, vol. 33; 2019. p. 6826\u201333.","DOI":"10.1609\/aaai.v33i01.33016826"},{"key":"92_CR17","doi-asserted-by":"crossref","unstructured":"Cardellino C, Villata S, Alemany LA, Cabrio E. Information extraction with active learning: a case study in legal text. In: International conference on intelligent text processing and computational linguistics. Springer; 2015. p. 483\u201394.","DOI":"10.1007\/978-3-319-18117-2_36"},{"key":"92_CR18","doi-asserted-by":"crossref","unstructured":"Altschuler M, Bloodgood M. Stopping active learning based on predicted change of f measure for text classification. In: 2019 IEEE 13th international conference on semantic computing (ICSC). IEEE; 2019. p. 47\u201354.","DOI":"10.1109\/ICOSC.2019.8665646"},{"key":"92_CR19","doi-asserted-by":"publisher","first-page":"100546","DOI":"10.1016\/j.websem.2019.100546","volume":"60","author":"P Ristoski","year":"2020","unstructured":"Ristoski P, Gentile AL, Alba A, Gruhl D, Welch S. Large-scale relation extraction from web documents and knowledge graphs with human-in-the-loop. J Web Semant. 2020;60:100546.","journal-title":"J Web Semant"},{"issue":"11","key":"92_CR20","first-page":"2543","volume":"68","author":"M Kholghi","year":"2017","unstructured":"Kholghi M, De Vine L, Sitbon L, Zuccon G, Nguyen A. Clinical information extraction using small data: an active learning approach based on sequence representations and word embeddings. J Am Soc Inf Sci. 2017;68(11):2543\u201356.","journal-title":"J Am Soc Inf Sci"},{"issue":"3","key":"92_CR21","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3439726","volume":"54","author":"S Minaee","year":"2021","unstructured":"Minaee S, Kalchbrenner N, Cambria E, Nikzad N, Chenaghlu M, Gao J. Deep learning-based text classification: a comprehensive review. ACM Comput Surv (CSUR). 2021;54(3):1\u201340.","journal-title":"ACM Comput Surv (CSUR)"},{"key":"92_CR22","doi-asserted-by":"publisher","first-page":"101060","DOI":"10.1016\/j.aei.2020.101060","volume":"44","author":"W Fang","year":"2020","unstructured":"Fang W, Luo H, Xu S, Love PED, Lu Z, Ye C. Automated text classification of near-misses from safety reports: an improved deep learning approach. Adv Eng Inform. 2020;44:101060.","journal-title":"Adv Eng Inform"},{"key":"92_CR23","doi-asserted-by":"publisher","first-page":"138872","DOI":"10.1109\/ACCESS.2019.2941376","volume":"7","author":"J Dai","year":"2019","unstructured":"Dai J, Chen C, Li Y. A backdoor attack against lstm-based text classification systems. IEEE Access. 2019;7:138872\u20138.","journal-title":"IEEE Access"},{"key":"92_CR24","doi-asserted-by":"crossref","unstructured":"Yao L, Mao C, Luo Y. Graph convolutional networks for text classification. In: Proceedings of the AAAI conference on artificial intelligence, vol. 33; 2019. p. 7370\u20137.","DOI":"10.1609\/aaai.v33i01.33017370"},{"key":"92_CR25","doi-asserted-by":"publisher","first-page":"103285","DOI":"10.1016\/j.jbi.2019.103285","volume":"99","author":"V Su\u00e1rez-Paniagua","year":"2019","unstructured":"Su\u00e1rez-Paniagua V, Zavala RMR, Segura-Bedmar I, Mart\u00ednez P. A two-stage deep learning approach for extracting entities and relationships from medical texts. J Biomed Inform. 2019;99:103285.","journal-title":"J Biomed Inform"},{"issue":"6","key":"92_CR26","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102269","volume":"57","author":"I Safder","year":"2020","unstructured":"Safder I, Hassan S-U, Visvizi A, Noraset T, Nawaz R, Tuarob S. Deep learning-based extraction of algorithmic metadata in full-text scholarly documents. Inf Process Manag. 2020;57(6): 102269.","journal-title":"Inf Process Manag"},{"key":"92_CR27","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1016\/j.neucom.2020.07.077","volume":"418","author":"N Pang","year":"2020","unstructured":"Pang N, Tan Z, Zhao X, Zeng W, Xiao W. Domain relation extraction from noisy Chinese texts. Neurocomputing. 2020;418:21\u201335.","journal-title":"Neurocomputing"},{"key":"92_CR28","doi-asserted-by":"crossref","unstructured":"Goseva-Popstojanova K, Tyo J. Identification of security related bug reports via text mining using supervised and unsupervised classification. In: 2018 IEEE international conference on software quality, reliability and security (QRS). IEEE; 2018. p. 344\u201355.","DOI":"10.1109\/QRS.2018.00047"},{"key":"92_CR29","volume":"51","author":"F Greco","year":"2020","unstructured":"Greco F, Polli A. Emotional text mining: customer profiling in brand management. Int J Inf Manag. 2020;51: 101934.","journal-title":"Int J Inf Manag"},{"key":"92_CR30","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1016\/j.jbi.2018.11.007","volume":"88","author":"A Sarker","year":"2018","unstructured":"Sarker A, Gonzalez-Hernandez G. An unsupervised and customizable misspelling generator for mining noisy health-related text sources. J Biomed Inform. 2018;88:98\u2013107.","journal-title":"J Biomed Inform"},{"key":"92_CR31","doi-asserted-by":"publisher","first-page":"218","DOI":"10.1016\/j.eswa.2017.07.019","volume":"94","author":"M Kang","year":"2018","unstructured":"Kang M, Ahn J, Lee K. Opinion mining using ensemble text hidden Markov models for text classification. Expert Syst Appl. 2018;94:218\u201327.","journal-title":"Expert Syst Appl"},{"key":"92_CR32","doi-asserted-by":"publisher","first-page":"1813","DOI":"10.1007\/s10489-019-01568-4","volume":"50","author":"F Fkih","year":"2020","unstructured":"Fkih F, Omri MN. Hidden data states-based complex terminology extraction from textual web data model. Appl Intell. 2020;50:1813\u201331.","journal-title":"Appl Intell"},{"issue":"1","key":"92_CR33","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1093\/jamia\/ocz063","volume":"27","author":"Q Wei","year":"2020","unstructured":"Wei Q, Ji Z, Li Z, Jingcheng D, Wang J, Jun X, Xiang Y, Tiryaki F, Stephen W, Zhang Y, et al. A study of deep learning approaches for medication and adverse drug event extraction from clinical text. J Am Med Inform Assoc. 2020;27(1):13\u201321.","journal-title":"J Am Med Inform Assoc"},{"issue":"6","key":"92_CR34","doi-asserted-by":"publisher","first-page":"3901","DOI":"10.1007\/s10462-019-09782-9","volume":"53","author":"FN Al-Aswadi","year":"2020","unstructured":"Al-Aswadi FN, Chan HY, Gan KH. Automatic ontology construction from text: a review from shallow to deep learning trend. Artif Intell Rev. 2020;53(6):3901\u201328.","journal-title":"Artif Intell Rev"},{"issue":"1","key":"92_CR35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40691-020-00221-w","volume":"7","author":"H An","year":"2020","unstructured":"An H, Park M. Approaching fashion design trend applications using text mining and semantic network analysis. Fash Text. 2020;7(1):1\u201315.","journal-title":"Fash Text"},{"key":"92_CR36","doi-asserted-by":"crossref","unstructured":"Praciano BJG, da\u00a0Costa JPCL, Maranh\u00e3o JPA, de\u00a0Mendon\u00e7a FLL, de\u00a0Sousa\u00a0JRT, Prettz JB. Spatio-temporal trend analysis of the Brazilian elections based on twitter data. In: 2018 IEEE international conference on data mining workshops (ICDMW). IEEE; 2018. p. 1355\u201360.","DOI":"10.1109\/ICDMW.2018.00192"},{"key":"92_CR37","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1016\/j.eswa.2018.07.017","volume":"113","author":"JL Martinez-Rodriguez","year":"2018","unstructured":"Martinez-Rodriguez JL, Lopez-Arevalo I, Rios-Alvarado AB. Openie-based approach for knowledge graph construction from text. Expert Syst Appl. 2018;113:339\u201355.","journal-title":"Expert Syst Appl"},{"issue":"4","key":"92_CR38","doi-asserted-by":"publisher","first-page":"1150","DOI":"10.1016\/j.jksuci.2020.05.008","volume":"34","author":"A Boufrida","year":"2022","unstructured":"Boufrida A, Boufaida Z. Rule extraction from scientific texts: evaluation in the specialty of gynecology. J King Saud Univ Comput Inf Sci. 2022;34(4):1150\u201360.","journal-title":"J King Saud Univ Comput Inf Sci"},{"key":"92_CR39","doi-asserted-by":"publisher","first-page":"474","DOI":"10.1016\/j.chb.2018.09.031","volume":"101","author":"C Shen","year":"2019","unstructured":"Shen C, Chen M, Wang C. Analyzing the trend of o2o commerce by bilingual text mining on social media. Comput Hum Behav. 2019;101:474\u201383.","journal-title":"Comput Hum Behav"},{"key":"92_CR40","doi-asserted-by":"crossref","unstructured":"Khandokar IA, Mamun I, Chadni TIA, Anas ZA, Shatabda S. Event detection and knowledge mining from unlabelled Bengali news articles. In: 2020 Emerging technology in computing, communication and electronics (ETCCE). IEEE; 2020. p. 1\u20136.","DOI":"10.1109\/ETCCE51779.2020.9350891"}],"container-title":["Human-Centric Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44230-025-00092-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s44230-025-00092-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44230-025-00092-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,23]],"date-time":"2025-04-23T13:02:44Z","timestamp":1745413364000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s44230-025-00092-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,25]]},"references-count":40,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,3]]}},"alternative-id":["92"],"URL":"https:\/\/doi.org\/10.1007\/s44230-025-00092-8","relation":{},"ISSN":["2667-1336"],"issn-type":[{"type":"electronic","value":"2667-1336"}],"subject":[],"published":{"date-parts":[[2025,2,25]]},"assertion":[{"value":"21 November 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 February 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 February 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Authors declare that there is no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}