{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T00:28:27Z","timestamp":1777854507495,"version":"3.51.4"},"reference-count":30,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2020,6,18]],"date-time":"2020-06-18T00:00:00Z","timestamp":1592438400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"funder":[{"name":"Eski\u015fehir Teknik \u00dcniversitesi","award":["20DPR045"],"award-info":[{"award-number":["20DPR045"]}]}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:p>Text classification (TC) is a very important and critical task in the 21st century, as there is a high volume of electronic data on the Internet. In TC, textual data are characterised by a huge number of highly sparse features\/terms. A typical TC task consists of many steps, and one of the most important steps is undoubtedly feature selection (FS). In this study, we have comprehensively investigated the effects of various globalisation techniques on local feature selection (LFS) methods using datasets with different characteristics such as multi-class unbalanced (MCU), multi-class balanced (MCB), binary-class unbalanced (BCU) and binary-class balanced (BCB). The globalisation techniques used in this study are summation (SUM), weighted-sum (AVG), and maximum (MAX). To investigate the effect of globalisation techniques, we used three LFS methods: Discriminative Feature Selection (DFSS), odds ratio (OR) and chi-square (CHI2). In the experiments, we have utilised four different benchmark datasets (Reuters-21578, 20Newsgroup, Enron1, and Polarity) together with Support Vector Machines (SVM) and Decision Tree (DT) classifiers. 
According to the experimental results, the most successful globalisation technique is AVG when all situations are taken into account. The experimental results indicate that the DFSS method is more successful than the OR and CHI2 methods on datasets with MCU and MCB characteristics. However, the CHI2 method seems more accurate than the OR and DFSS methods on datasets with BCU and BCB characteristics. Also, the SVM classifier performed better than the DT classifier in most cases.<\/jats:p>","DOI":"10.1177\/0165551520930897","type":"journal-article","created":{"date-parts":[[2020,6,18]],"date-time":"2020-06-18T09:32:42Z","timestamp":1592472762000},"page":"727-739","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":28,"title":["The effects of globalisation techniques on feature selection for text classification"],"prefix":"10.1177","volume":"47","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8919-6481","authenticated-orcid":false,"given":"Bekir","family":"Parlak","sequence":"first","affiliation":[{"name":"Department of Computer Engineering, Faculty of Engineering, Eski\u015fehir Technical University, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alper Kursat","family":"Uysal","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Faculty of Engineering, Eski\u015fehir Technical University, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2020,6,18]]},"reference":[{"key":"bibr1-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"bibr2-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-3223-4_6"},{"key":"bibr3-0165551520930897","first-page":"295","volume-title":"In: Proceedings of the tenth ACM international conference on web search and data mining","author":"Chen 
T"},{"key":"bibr4-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-10-4765-7_61"},{"key":"bibr5-0165551520930897","author":"Parlak B","journal-title":"J Inform Sci"},{"key":"bibr6-0165551520930897","first-page":"65675","author":"Jauhiainen TS","year":"2019","journal-title":"J Artif Intel Res"},{"key":"bibr7-0165551520930897","first-page":"4","volume":"7","author":"Kawade KO","year":"2018","journal-title":"Int J Comput Eng Appl"},{"key":"bibr8-0165551520930897","first-page":"1805","volume-title":"In: Proceedings of the 33rd annual ACM symposium on applied computing","author":"Wehrmann J"},{"key":"bibr9-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2017.07.019"},{"key":"bibr10-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2017.12.016"},{"key":"bibr11-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2014.03.041"},{"key":"bibr12-0165551520930897","first-page":"137","volume-title":"European conference on machine learning","author":"Joachims T"},{"key":"bibr13-0165551520930897","first-page":"1157","volume":"3","author":"Guyon I","year":"2003","journal-title":"J Mach Learn 
Res"},{"key":"bibr14-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2009.01.051"},{"key":"bibr15-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2018.10.003"},{"key":"bibr16-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2018.07.028"},{"key":"bibr17-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2017.03.057"},{"key":"bibr18-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2015.08.050"},{"key":"bibr19-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2012.06.005"},{"key":"bibr20-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2011.07.010"},{"key":"bibr21-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2016.2522427"},{"key":"bibr22-0165551520930897","first-page":"1289","volume":"3","author":"Forman G","year":"2003","journal-title":"J Mach Learn Res"},{"key":"bibr23-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-45219-5_7"},{"key":"bibr24-0165551520930897","first-page":"606","volume-title":"International symposium on computer and information sciences","author":"\u00d6zg\u00fcr A"},{"key":"bibr25-0165551520930897","first-page":"76","volume":"10","author":"Singh SR","year":"2010","journal-title":"Feature Select Data Mining"},{"key":"bibr26-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2012.05.008"},{"key":"bibr27-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2013.02.019"},{"key":"bibr28-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2014.10.011"},{"key":"bibr29-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijpe.2014.12.035"},{"key":"bibr30-0165551520930897","doi-asserted-by":"publisher","DOI":"10.1002\/asi.21023"}],"container-title":["Journal of Information 
Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551520930897","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/0165551520930897","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551520930897","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T23:09:06Z","timestamp":1777504146000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/0165551520930897"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,18]]},"references-count":30,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["10.1177\/0165551520930897"],"URL":"https:\/\/doi.org\/10.1177\/0165551520930897","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,18]]}}}