{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T06:50:35Z","timestamp":1778309435118,"version":"3.51.4"},"reference-count":36,"publisher":"SAGE Publications","issue":"3","license":[{"start":{"date-parts":[[2023,6,1]],"date-time":"2023-06-01T00:00:00Z","timestamp":1685577600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2023,6]]},"abstract":"<jats:p>The incredible expansion of online texts due to the Internet has intensified and revived the interest of sorting, managing and categorising the documents into their respective domains. This shows the pressing need for automatic text categorization system to assign a document into its appropriate domain. In this article, the focus is on showcasing the effectiveness of a hybrid approach that works elegantly by combining text-based and graph-based features. The hybrid approach was applied on 14,373 Bangla articles with 57,22,569 tokens collected from various online news corpora covering nine categories. This article also presents the individual application of both the features to explicate how they generally work. For classification purposes, the feature sets were passed through the Bayesian classification methods which yield satisfactory results with 98.73% accuracy for Na\u00efve Bayes Multinomial (NBM). Also, to test the robustness and language independency of the system, the experiments were performed on two popular English datasets as well.<\/jats:p>","DOI":"10.1177\/01655515211027770","type":"journal-article","created":{"date-parts":[[2023,6,8]],"date-time":"2023-06-08T09:45:40Z","timestamp":1686217540000},"page":"762-777","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":5,"title":["Hybrid approach for text categorization: A case study with Bangla news article"],"prefix":"10.1177","volume":"49","author":[{"given":"Ankita","family":"Dhar","sequence":"first","affiliation":[]},{"given":"Himadri","family":"Mukherjee","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3360-7576","authenticated-orcid":false,"given":"Kaushik","family":"Roy","sequence":"additional","affiliation":[{"name":"Department of Computer Science, West Bengal State University, Kolkata, India"}]},{"given":"KC","family":"Santosh","sequence":"additional","affiliation":[{"name":"Department of Computer Science, The University of South Dakota, Vermillion, SD, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6465-671X","authenticated-orcid":false,"given":"Niladri Sekhar","family":"Dash","sequence":"additional","affiliation":[{"name":"Linguistic Research Unit, Indian Statistical Institute, Kolkata, India"}]}],"member":"179","published-online":{"date-parts":[[2023,6,8]]},"reference":[{"key":"bibr1-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1177\/0165551516683617"},{"key":"bibr2-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1177\/0165551519828627"},{"key":"bibr3-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1177\/0165551512459919"},{"key":"bibr4-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1177\/0165551517706219"},{"key":"bibr5-01655515211027770","unstructured":"Ethnologue, https:\/\/www.ethnologue.com\/language\/ben"},{"key":"bibr6-01655515211027770","first-page":"16","author":"Parlak B","year":"2019","journal-title":"J Inform Sci"},{"key":"bibr7-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1177\/0165551517743644"},{"key":"bibr8-01655515211027770","first-page":"1473","volume-title":"Proceedings of the IEEE\/ACM international conference on advances in social networks analysis and mining","author":"Malliaros FD"},{"key":"bibr9-01655515211027770","first-page":"1702","volume-title":"Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing","author":"Rousseau F"},{"key":"bibr10-01655515211027770","first-page":"514","volume-title":"Proceedings of the international symposium on neural networks","author":"Li H"},{"key":"bibr11-01655515211027770","first-page":"920","volume-title":"Proceedings of the international conference on document analysis and recognition","author":"Luo X"},{"key":"bibr12-01655515211027770","first-page":"212","volume":"12","author":"Wu H","year":"2019","journal-title":"Int J Intel Inform Database Syst"},{"key":"bibr13-01655515211027770","first-page":"71","volume":"06","author":"Al-Tahrawi MM","year":"2015","journal-title":"Int J Intel Syst Appl"},{"key":"bibr14-01655515211027770","first-page":"1","volume-title":"Proceedings of the 7th international conference on frontiers of information technology","author":"Ali AR"},{"key":"bibr15-01655515211027770","first-page":"109","volume-title":"Proceedings of the 3rd workshop on South and South East Asian natural language processing","author":"Gupta N"},{"key":"bibr16-01655515211027770","first-page":"343","volume":"02","author":"ArunaDevi K","year":"2014","journal-title":"Int J Sci Res Develop"},{"key":"bibr17-01655515211027770","first-page":"689","volume-title":"International conference on energy systems and applications","author":"Patil JJ"},{"issue":"1","key":"bibr18-01655515211027770","first-page":"11","volume":"4","author":"Patil M","year":"2014","journal-title":"ACEEE Int J Inform Tech"},{"key":"bibr19-01655515211027770","first-page":"191","volume-title":"2017 international conference on electrical, computer and communication engineering (ECCE)","author":"Islam MS"},{"key":"bibr20-01655515211027770","first-page":"1","volume-title":"Proceedings of the international conference on Bangla speech and language processing","author":"Alam MT"},{"key":"bibr21-01655515211027770","first-page":"24","volume":"178","author":"Hassan E","year":"2019","journal-title":"Int J Comput Appl"},{"key":"bibr22-01655515211027770","first-page":"1","volume-title":"Proceedings of the international conference on internet of things: smart innovation and usages","author":"Dhar A"},{"key":"bibr23-01655515211027770","unstructured":"Stopwords, https:\/\/www.tdil-dc.in\/index.php?option=com_downloadtask=showresourceDetailstoolid=1635lang=en"},{"key":"bibr24-01655515211027770","first-page":"271","volume":"2","author":"Karegowda AG","year":"2010","journal-title":"Int J Inform Tech Knowl Manag"},{"key":"bibr25-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1145\/2812809"},{"key":"bibr26-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-011-9172-x"},{"key":"bibr27-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2006.180"},{"key":"bibr28-01655515211027770","first-page":"488","volume-title":"Proceedings of Australasian joint conference on artificial intelligence","author":"Kibriya AM"},{"key":"bibr29-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2018.07.028"},{"key":"bibr30-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-008-9069-5"},{"key":"bibr31-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"bibr32-01655515211027770","first-page":"1","volume":"7","author":"Dem\u0161ar J","year":"2006","journal-title":"J Mach Learn Res"},{"key":"bibr33-01655515211027770","first-page":"3158","volume-title":"Proceedings of the Conference of the North American chapter of the association for computational linguistics: human language technologies","author":"Mahabal A"},{"key":"bibr34-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2018.03.003"},{"key":"bibr35-01655515211027770","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-016-2401-x"},{"key":"bibr36-01655515211027770","first-page":"1029","volume-title":"Proceedings of the international ACM SIGIR conference on research and development in information retrieval","author":"Ko Y"}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01655515211027770","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/01655515211027770","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/01655515211027770","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T23:06:08Z","timestamp":1777503968000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/01655515211027770"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6]]},"references-count":36,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,6]]}},"alternative-id":["10.1177\/01655515211027770"],"URL":"https:\/\/doi.org\/10.1177\/01655515211027770","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6]]}}}