{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T00:28:46Z","timestamp":1777854526818,"version":"3.51.4"},"reference-count":42,"publisher":"SAGE Publications","issue":"2","license":[{"start":{"date-parts":[[2015,12,1]],"date-time":"2015-12-01T00:00:00Z","timestamp":1448928000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2017,4]]},"abstract":"<jats:p>Owing to the rapid growth of the World Wide Web, the number of documents that can be accessed via the Internet explosively increases with each passing day. Considering news portals in particular, sometimes documents related to categories such as technology, sports and politics seem to be in the wrong category or documents are located in a generic category called others. At this point, text categorization (TC), which is generally addressed as a supervised learning task is needed. Although there are substantial number of studies conducted on TC in other languages, the number of studies conducted in Turkish is very limited owing to the lack of accessibility and usability of datasets created. In this paper, a new dataset named TTC-3600, which can be widely used in studies of TC of Turkish news and articles, is created. TTC-3600 is a well-documented dataset and its file formats are compatible with well-known text mining tools. Five widely used classifiers within the field of TC and two feature selection methods are evaluated on TTC-3600. The experimental results indicate that the best accuracy criterion value 91.03% is obtained with the combination of Random Forest classifier and attribute ranking-based feature selection method in all comparisons performed after pre-processing and feature selection steps. The publicly available TTC-3600 dataset and the experimental results of this study can be utilized in comparative experiments by other researchers.<\/jats:p>","DOI":"10.1177\/0165551515620551","type":"journal-article","created":{"date-parts":[[2015,12,23]],"date-time":"2015-12-23T21:34:24Z","timestamp":1450906464000},"page":"174-185","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":62,"title":["TTC-3600: A new benchmark dataset for Turkish text categorization"],"prefix":"10.1177","volume":"43","author":[{"given":"Deniz","family":"K\u0131l\u0131n\u00e7","sequence":"first","affiliation":[{"name":"Faculty of Technology, Celal Bayar University, Turkey"}]},{"given":"Ak\u0131n","family":"\u00d6z\u00e7ift","sequence":"additional","affiliation":[{"name":"Faculty of Technology, Celal Bayar University, Turkey"}]},{"given":"Fatma","family":"Bozyigit","sequence":"additional","affiliation":[{"name":"Faculty of Technology, Celal Bayar University, Turkey"}]},{"given":"Pelin","family":"Y\u0131ld\u0131r\u0131m","sequence":"additional","affiliation":[{"name":"Faculty of Technology, Celal Bayar University, Turkey"}]},{"given":"Fatih","family":"Y\u00fccalar","sequence":"additional","affiliation":[{"name":"Faculty of Technology, Celal Bayar University, Turkey"}]},{"given":"Emin","family":"Borandag","sequence":"additional","affiliation":[{"name":"Faculty of Technology, Celal Bayar University, Turkey"}]}],"member":"179","published-online":{"date-parts":[[2015,12,1]]},"reference":[{"key":"bibr1-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1177\/0165551504047928"},{"key":"bibr2-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0094137"},{"key":"bibr3-0165551515620551","volume-title":"Machine learning, neural and statistical classification","author":"Michie D","year":"1994"},{"key":"bibr4-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1145\/505282.505283"},{"key":"bibr5-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1155\/2014\/625342"},{"key":"bibr6-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515591724"},{"key":"bibr7-0165551515620551","unstructured":"Wolpert DH, Macready WG. No free lunch theorem for search. Technical Report SFI-TR-05\u2013010, Santa Fe Institute, 1995."},{"key":"bibr8-0165551515620551","doi-asserted-by":"publisher","DOI":"10.3115\/1628960.1628969"},{"key":"bibr9-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515585264"},{"key":"bibr10-0165551515620551","first-page":"161","volume-title":"Proceedings of SDAIR-94, 3rd annual symposium on document analysis and information retrieval","author":"Cavnar WB","year":"1994"},{"key":"bibr11-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1177\/0165551514558172"},{"key":"bibr12-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1177\/0165551513502417"},{"key":"bibr13-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1177\/0165551511423151"},{"key":"bibr14-0165551515620551","first-page":"369","volume-title":"Proceedings of the international symposium on innovations in intelligent systems and applications (INISTA)","author":"G\u00fcran A","year":"2009"},{"key":"bibr15-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1109\/INISTA.2011.5946084"},{"key":"bibr16-0165551515620551","first-page":"1","volume-title":"Proceedings of the ACL student research workshop","author":"Akkus BK","year":"2013"},{"key":"bibr17-0165551515620551","first-page":"1","volume-title":"Proceedings of IEEE signal processing and communications applications conference","author":"Amasyal\u0131 MF"},{"key":"bibr18-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1007\/11765448_22"},{"key":"bibr19-0165551515620551","first-page":"1","volume-title":"Proceedings of IEEE signal processing and communications applications conference (SIU)","author":"T\u00fcfek\u00e7i P"},{"key":"bibr20-0165551515620551","first-page":"1","volume-title":"Proceedings of IEEE signal processing and communications applications conference (SIU)","author":"\u00c7ataltepe Z"},{"key":"bibr21-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2011.01.023"},{"key":"bibr22-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2013.08.006"},{"key":"bibr23-0165551515620551","first-page":"1296","volume":"20","author":"Gunal S.","year":"2012","journal-title":"Turkish Journal of Electrical Engineering and Computer Sciences"},{"key":"bibr24-0165551515620551","first-page":"1","volume-title":"Proceedings of IEEE signal processing and communications applications conference (SIU)","author":"\u00d6zalp N"},{"key":"bibr25-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2004.07.004"},{"key":"bibr26-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1002\/asi.20750"},{"key":"bibr27-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2008.09.001"},{"key":"bibr28-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"bibr29-0165551515620551","first-page":"1","volume":"10","author":"Akin AA","year":"2007","journal-title":"Structure"},{"key":"bibr30-0165551515620551","first-page":"110","volume-title":"Proceedings of IEEE international symposium on innovations in intelligent systems and applications (INISTA) proceedings","author":"Yildirim P"},{"key":"bibr31-0165551515620551","doi-asserted-by":"publisher","DOI":"10.2495\/978-1-85312-995-7\/04"},{"key":"bibr32-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1007\/BF00153759"},{"issue":"3","key":"bibr33-0165551515620551","first-page":"235","volume":"16","author":"Quinlan JR","year":"1993","journal-title":"Machine Learning"},{"key":"bibr34-0165551515620551","doi-asserted-by":"publisher","DOI":"10.4304\/jcp.7.12.2913-2920"},{"key":"bibr35-0165551515620551","first-page":"51","author":"Hall M.","year":"1999","journal-title":"PhD thesis, Department of Computer Science, University of Waikato, New Zealand"},{"key":"bibr36-0165551515620551","first-page":"412","volume-title":"Proceedings of the 14th international conference on machine learning (ICML \u201897)","author":"Yang Y","year":"1997"},{"key":"bibr37-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2008.11.013"},{"key":"bibr38-0165551515620551","volume-title":"Data mining: Practical machine learning tools and techniques","author":"Witten IH","year":"2005"},{"key":"bibr39-0165551515620551","first-page":"629","volume-title":"Signal Processing and communications applications conference (SIU)","author":"Amasyal\u0131 M"},{"issue":"2","key":"bibr40-0165551515620551","first-page":"127","volume":"30","author":"Kohavi R","year":"1998","journal-title":"Machine Learning"},{"key":"bibr41-0165551515620551","first-page":"217","volume-title":"7th International conference on database theory","author":"Kevin B"},{"key":"bibr42-0165551515620551","doi-asserted-by":"publisher","DOI":"10.1177\/0165551514544096"}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551515620551","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/0165551515620551","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551515620551","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T23:09:24Z","timestamp":1777504164000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/0165551515620551"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,12,1]]},"references-count":42,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2017,4]]}},"alternative-id":["10.1177\/0165551515620551"],"URL":"https:\/\/doi.org\/10.1177\/0165551515620551","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,12,1]]}}}