{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T15:22:21Z","timestamp":1781623341170,"version":"3.54.5"},"reference-count":93,"publisher":"SAGE Publications","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["WEB"],"published-print":{"date-parts":[[2020,9,30]]},"abstract":"<jats:p>Text classification (a.k.a. text categorisation) is an effective and efficient technology for information organisation and management. As the explosion of information resources on the Web and corporate intranets continues, it has become more and more important and has attracted wide attention from many different research fields. In the literature, many feature selection methods and classification algorithms have been proposed. Text classification also has important applications in the real world. However, the dramatic increase in the availability of massive text data from various sources is creating a number of issues and challenges for text classification, such as scalability. The purpose of this report is to give an overview of existing text classification technologies for building more reliable text classification applications, and to propose a research direction for addressing the challenging problems in text mining.<\/jats:p>","DOI":"10.3233\/web-200442","type":"journal-article","created":{"date-parts":[[2020,8,7]],"date-time":"2020-08-07T05:04:50Z","timestamp":1596776690000},"page":"205-216","source":"Crossref","is-referenced-by-count":19,"title":["A survey on text classification and its applications"],"prefix":"10.1177","volume":"18","author":[{"given":"Xujuan","family":"Zhou","sequence":"first","affiliation":[{"name":"School of Management & Enterprise, The University of Southern Queensland, QLD, Australia. 
E-mails:\u00a0xujuan.zhou@usq.edu.au,\u00a0Raj.Gururajan@usq.edu.au"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Raj","family":"Gururajan","sequence":"additional","affiliation":[{"name":"School of Management & Enterprise, The University of Southern Queensland, QLD, Australia. E-mails:\u00a0xujuan.zhou@usq.edu.au,\u00a0Raj.Gururajan@usq.edu.au"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuefeng","family":"Li","sequence":"additional","affiliation":[{"name":"Science and Engineering Faculty, Queensland University of Technology, QLD, Australia. E-mail:\u00a0y2.li@qut.edu.au"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Revathi","family":"Venkataraman","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, SRM Institute of Science and Technology, India. E-mail:\u00a0revathin@srmist.edu.in"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaohui","family":"Tao","sequence":"additional","affiliation":[{"name":"Faculty of Health, Engineering and Sciences, The University of Southern Queensland, QLD, Australia. E-mail:\u00a0x.tao@usq.edu.au"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ghazal","family":"Bargshady","sequence":"additional","affiliation":[{"name":"School of Management & Enterprise, The University of Southern Queensland, QLD, Australia. E-mails:\u00a0xujuan.zhou@usq.edu.au,\u00a0Raj.Gururajan@usq.edu.au"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Prabal D.","family":"Barua","sequence":"additional","affiliation":[{"name":"School of Management & Enterprise, The University of Southern Queensland, QLD, Australia. E-mails:\u00a0xujuan.zhou@usq.edu.au,\u00a0Raj.Gururajan@usq.edu.au"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Srinivas","family":"Kondalsamy-Chennakesavan","sequence":"additional","affiliation":[{"name":"Rural Clinical School, The University of Queensland, QLD, Australia. 
E-mail:\u00a0Ghazal.Bargshady@usq.edu.au"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"179","reference":[{"key":"10.3233\/WEB-200442_ref1","first-page":"113","article-title":"Reducing multiclass to binary: A unifying approach for margin classifiers","volume":"1","author":"Allwein","year":"2000","journal-title":"Journal of Machine Learning Research"},{"issue":"1","key":"10.3233\/WEB-200442_ref2","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1001\/jama.1987.03400010071030","article-title":"DXplain: An evolving diagnostic decision-support system","volume":"258","author":"Barnett","year":"1987","journal-title":"Jama"},{"key":"10.3233\/WEB-200442_ref3","doi-asserted-by":"crossref","unstructured":"R.\u00a0Bekkerman and M.\u00a0Gavish, High-precision phrase-based document classification on a modern scale, in: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD\u201911, ACM, New York, NY, USA, 2011, pp.\u00a0231\u2013239.","DOI":"10.1145\/2020408.2020449"},{"key":"10.3233\/WEB-200442_ref4","first-page":"1137","article-title":"A neural probabilistic language model","volume":"3","author":"Bengio","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"10.3233\/WEB-200442_ref5","doi-asserted-by":"crossref","unstructured":"S.\u00a0Brady and H.\u00a0Shatkay, EpiLoc: A (working) text-based system for predicting protein subcellular location, in: Biocomputing 2008, World Scientific, 2008, pp.\u00a0604\u2013615.","DOI":"10.1142\/9789812776136_0058"},{"key":"10.3233\/WEB-200442_ref6","unstructured":"P.B.\u00a0Cerrito and J.C.\u00a0Cerrito, Data and Text Mining the Electronic Medical Record to Improve Care and to Lower Costs, 2006."},{"issue":"3, Part 1","key":"10.3233\/WEB-200442_ref7","doi-asserted-by":"publisher","first-page":"5432","DOI":"10.1016\/j.eswa.2008.06.054","article-title":"Feature selection for text classification with Na\u00efve 
Bayes","volume":"36","author":"Chen","year":"2009","journal-title":"Expert Systems with Applications"},{"key":"10.3233\/WEB-200442_ref8","doi-asserted-by":"crossref","unstructured":"W.\u00a0Chen, J.\u00a0Yan, B.\u00a0Zhang, Z.\u00a0Chen and Q.\u00a0Yang, Document transformation for multi-label feature selection in text categorization, in: Proceedings of the 2007 Seventh IEEE International Conference on Data Mining, IEEE Computer Society, Washington, DC, USA, 2007, pp.\u00a0451\u2013456. ISBN 0-7695-3018-4.","DOI":"10.1109\/ICDM.2007.18"},{"key":"10.3233\/WEB-200442_ref9","doi-asserted-by":"crossref","unstructured":"W.W.\u00a0Cohen and Y.\u00a0Singer, Context-sensitive learning methods for text categorization, in: ACM Transactions on Information Systems, ACM Press, 1996, pp.\u00a0307\u2013315.","DOI":"10.1145\/243199.243278"},{"issue":"6","key":"10.3233\/WEB-200442_ref11","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9","article-title":"Indexing by latent semantic analysis","volume":"41","author":"Deerwester","year":"1990","journal-title":"Journal of the American Society for Information Science"},{"issue":"3","key":"10.3233\/WEB-200442_ref12","first-page":"135","article-title":"Personalisation in news delivery systems: Item summarization and multi-tier item selection using relevance feedback","volume":"3","author":"D\u00edaz","year":"2005","journal-title":"Web Intelligence and Agent Systems: An International Journal"},{"key":"10.3233\/WEB-200442_ref13","unstructured":"C.\u00a0Dos Santos and M.\u00a0Gatti, Deep convolutional neural networks for sentiment analysis of short texts, in: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, 2014, pp.\u00a069\u201378."},{"key":"10.3233\/WEB-200442_ref14","doi-asserted-by":"crossref","unstructured":"S.T.\u00a0Dumais, G.W.\u00a0Furnas, T.K.\u00a0Landauer, S.\u00a0Deerwester and R.\u00a0Harshman, Using 
latent semantic analysis to improve access to textual information, in: SIGCHI Conference on Human Factors in Computing Systems, ACM, 1988, pp.\u00a0281\u2013285.","DOI":"10.1145\/57167.57214"},{"key":"10.3233\/WEB-200442_ref15","first-page":"1289","article-title":"An extensive empirical study of feature selection metrics for text classification","volume":"3","author":"Forman","year":"2003","journal-title":"Journal of Machine Learning Research"},{"issue":"3","key":"10.3233\/WEB-200442_ref16","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1145\/125187.125189","article-title":"A probabilistic learning approach for document indexing","volume":"9","author":"Fuhr","year":"1991","journal-title":"ACM Transactions on Information Systems"},{"key":"10.3233\/WEB-200442_ref17","unstructured":"N.\u00a0Fuhr, S.\u00a0Hartmann, G.\u00a0Lustig, M.\u00a0Schwantner, K.\u00a0Tzeras, T.H.\u00a0Darmstadt, F.\u00a0Informatik and G.\u00a0Knorz, AIR\/X \u2013 a rule-based multistage indexing system for large subject fields, in: Proceedings of RIAO\u201991, 1991, pp.\u00a0606\u2013623."},{"issue":"1","key":"10.3233\/WEB-200442_ref18","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1109\/TKDE.2006.16","article-title":"Text classification without negative examples revisit","volume":"18","author":"Fung","year":"2006","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"10.3233\/WEB-200442_ref19","unstructured":"J.\u00a0Furnkranz, T.\u00a0Mitchell, E.\u00a0Riloff et al., A case study in using linguistic phrases for text categorization on the WWW, in: Working Notes of the AAAI\/ICML, Workshop on Learning for Text Categorization, 1998, pp.\u00a05\u201312."},{"issue":"6","key":"10.3233\/WEB-200442_ref20","doi-asserted-by":"publisher","first-page":"1629","DOI":"10.1109\/TKDE.2014.2384497","article-title":"Pattern-based topics for document modelling in information filtering","volume":"27","author":"Gao","year":"2014","journal-title":"IEEE Transactions on 
Knowledge and Data Engineering"},{"key":"10.3233\/WEB-200442_ref21","first-page":"1","article-title":"Ontology-based personalized search and browsing","volume":"1","author":"Gauch","year":"2003","journal-title":"Web Intelligence and Agent Systems"},{"key":"10.3233\/WEB-200442_ref22","doi-asserted-by":"publisher","DOI":"10.1109\/CSCWD.2016.7565984"},{"key":"10.3233\/WEB-200442_ref23","doi-asserted-by":"publisher","DOI":"10.1186\/gb-2004-5-6-r43"},{"key":"10.3233\/WEB-200442_ref24","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1016\/j.knosys.2016.04.022","article-title":"Decision support systems for adoption in dental clinics: A survey","volume":"104","author":"Goh","year":"2016","journal-title":"Knowledge-Based Systems"},{"key":"10.3233\/WEB-200442_ref25","doi-asserted-by":"publisher","DOI":"10.1145\/2487575.2487644"},{"key":"10.3233\/WEB-200442_ref26","doi-asserted-by":"publisher","DOI":"10.1145\/2629585"},{"key":"10.3233\/WEB-200442_ref27","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1016\/j.jbi.2015.08.013","article-title":"Identifying adverse drug event information in clinical notes with distributional semantic representations of context","volume":"57","author":"Henriksson","year":"2015","journal-title":"Journal of Biomedical Informatics"},{"key":"10.3233\/WEB-200442_ref28","doi-asserted-by":"crossref","unstructured":"T.\u00a0Joachims, Text categorization with support vector machines: Learning with many relevant features, in: ECML, 1998, pp.\u00a0137\u2013142.","DOI":"10.1007\/BFb0026683"},{"key":"10.3233\/WEB-200442_ref29","unstructured":"T.\u00a0Joachims, Transductive inference for text classification using support vector machines, in: The 16th International Conference on Machine Learning, Bled Slovenia, 1999, pp.\u00a0200\u2013209."},{"key":"10.3233\/WEB-200442_ref30","doi-asserted-by":"publisher","first-page":"227","DOI":"10.1023\/A:1025554732352","article-title":"A comparison of word- and sense-based text categorization using 
several classification algorithms","volume":"21","author":"Kehagias","year":"2001","journal-title":"Journal of Intelligent Information Systems"},{"key":"10.3233\/WEB-200442_ref31","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1181"},{"key":"10.3233\/WEB-200442_ref32","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-10-129"},{"key":"10.3233\/WEB-200442_ref33","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1016\/j.knosys.2016.10.003","article-title":"A survey of the applications of text mining in financial domain","volume":"114","author":"Kumar","year":"2016","journal-title":"Knowledge-Based Systems"},{"key":"10.3233\/WEB-200442_ref34","doi-asserted-by":"crossref","unstructured":"S.\u00a0Lai, L.\u00a0Xu, K.\u00a0Liu and J.\u00a0Zhao, Recurrent convolutional neural networks for text classification, in: Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015.","DOI":"10.1609\/aaai.v29i1.9513"},{"key":"10.3233\/WEB-200442_ref35","doi-asserted-by":"publisher","first-page":"721","DOI":"10.1109\/TPAMI.2008.110","article-title":"Supervised and traditional term weighting methods for automatic text categorization","volume":"31","author":"Lan","year":"2009","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"10.3233\/WEB-200442_ref36","doi-asserted-by":"publisher","DOI":"10.1109\/HICSS.2007.570"},{"issue":"7553","key":"10.3233\/WEB-200442_ref37","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"issue":"4","key":"10.3233\/WEB-200442_ref38","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1162\/neco.1989.1.4.541","article-title":"Backpropagation applied to handwritten zip code recognition","volume":"1","author":"LeCun","year":"1989","journal-title":"Neural 
computation"},{"issue":"11","key":"10.3233\/WEB-200442_ref39","doi-asserted-by":"publisher","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proceedings of the IEEE"},{"key":"10.3233\/WEB-200442_ref40","doi-asserted-by":"crossref","unstructured":"D.D.\u00a0Lewis, An evaluation of phrasal and clustered representations on a text categorization task, in: SIGIR, 1992, pp.\u00a037\u201350.","DOI":"10.1145\/133160.133172"},{"key":"10.3233\/WEB-200442_ref42","unstructured":"D.D.\u00a0Lewis and M.\u00a0Ringuette, A comparison of two learning algorithms for text categorization, in: Proceedings of SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, US, 1994, pp.\u00a081\u201393."},{"key":"10.3233\/WEB-200442_ref43","doi-asserted-by":"publisher","DOI":"10.1145\/243199.243277"},{"key":"10.3233\/WEB-200442_ref44","doi-asserted-by":"publisher","DOI":"10.1145\/243199.243277"},{"key":"10.3233\/WEB-200442_ref45","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1278"},{"key":"10.3233\/WEB-200442_ref46","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835900"},{"issue":"7","key":"10.3233\/WEB-200442_ref47","doi-asserted-by":"publisher","first-page":"1438","DOI":"10.1109\/TKDE.2017.2681671","article-title":"Enhancing binary classification by modeling uncertain boundary in three-way decisions","volume":"29","author":"Li","year":"2017","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"10.3233\/WEB-200442_ref48","doi-asserted-by":"crossref","unstructured":"Y.\u00a0Li, X.\u00a0Zhou, P.\u00a0Bruza, Y.\u00a0Xu and R.Y.\u00a0Lau, A two-stage text mining model for information filtering, in: Proceedings of the 17th ACM Conference on Information and Knowledge Management, ACM, 2008, 
pp.\u00a01023\u20131032.","DOI":"10.1145\/1458082.1458218"},{"key":"10.3233\/WEB-200442_ref49","unstructured":"B.\u00a0Liu, Y.\u00a0Dai, X.\u00a0Li, W.S.\u00a0Lee and P.S.\u00a0Yu, Building text classifiers using positive and unlabeled examples, in: ICDM03, 2003, pp.\u00a0179\u2013186."},{"key":"10.3233\/WEB-200442_ref50","doi-asserted-by":"crossref","unstructured":"J.\u00a0Liu, W.-C.\u00a0Chang, Y.\u00a0Wu and Y.\u00a0Yang, Deep learning for extreme multi-label text classification, in: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2017, pp.\u00a0115\u2013124.","DOI":"10.1145\/3077136.3080834"},{"key":"10.3233\/WEB-200442_ref52","doi-asserted-by":"crossref","unstructured":"Y.\u00a0Meng, J.\u00a0Shen, C.\u00a0Zhang and J.\u00a0Han, Weakly-supervised hierarchical text classification, in: Proceedings of the AAAI Conference on Artificial Intelligence, Vol.\u00a033, 2019, pp.\u00a06826\u20136833.","DOI":"10.1609\/aaai.v33i01.33016826"},{"key":"10.3233\/WEB-200442_ref53","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1145\/219717.219748","article-title":"WordNet: A lexical database for English","volume":"38","author":"Miller","year":"1995","journal-title":"Communications of the ACM"},{"key":"10.3233\/WEB-200442_ref54","doi-asserted-by":"crossref","unstructured":"A.\u00a0Moschitti, Syntactic and semantic kernels for short text pair categorization, in: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL\u201909, Association for Computational Linguistics, Stroudsburg, PA, USA, 2009, pp.\u00a0576\u2013584.","DOI":"10.3115\/1609067.1609131"},{"key":"10.3233\/WEB-200442_ref55","doi-asserted-by":"crossref","unstructured":"R.\u00a0Moschitti and R.\u00a0Basili, Complex linguistic features for text classification: A comprehensive study, in: Proceedings of the 26th European Conference on Information Retrieval (ECIR), Springer Verlag, 2004, 
pp.\u00a0181\u2013196.","DOI":"10.1007\/978-3-540-24752-4_14"},{"key":"10.3233\/WEB-200442_ref56","doi-asserted-by":"crossref","unstructured":"N.\u00a0Nanas, V.S.\u00a0Uren and A.\u00a0Roeck, A comparative evaluation of term weighting methods for information filtering, in: DEXA Workshops, 2004, pp.\u00a013\u201317.","DOI":"10.1109\/DEXA.2004.1333442"},{"key":"10.3233\/WEB-200442_ref57","doi-asserted-by":"publisher","DOI":"10.1145\/258525.258537"},{"key":"10.3233\/WEB-200442_ref58","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401958"},{"issue":"12","key":"10.3233\/WEB-200442_ref59","doi-asserted-by":"publisher","first-page":"1598","DOI":"10.1016\/j.patrec.2010.05.005","article-title":"Text classification with the support of pruned dependency patterns","volume":"31","author":"\u00d6zg\u00fcr","year":"2010","journal-title":"Pattern Recognition Letters"},{"key":"10.3233\/WEB-200442_ref60","doi-asserted-by":"publisher","DOI":"10.3115\/1118693.1118704"},{"key":"10.3233\/WEB-200442_ref61","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3186005"},{"key":"10.3233\/WEB-200442_ref62","unstructured":"X.\u00a0Peng and B.\u00a0Choi, Document classifications based on word semantic hierarchies, in: Proceedings of the International Conference on Artificial Intelligence and Applications (AIA\u201905), 2005, pp.\u00a0362\u2013367."},{"key":"10.3233\/WEB-200442_ref64","unstructured":"X.\u00a0Qiu, X.\u00a0Huang, Z.\u00a0Liu and J.\u00a0Zhou, Hierarchical text classification with latent concepts, in: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers \u2013 Volume 2, Association for Computational Linguistics, 2011, pp.\u00a0598\u2013602."},{"key":"10.3233\/WEB-200442_ref66","doi-asserted-by":"crossref","unstructured":"J.\u00a0Rousu, C.\u00a0Saunders, S.\u00a0Szedmak and J.\u00a0Shawe-Taylor, Learning hierarchical multi-category text classification models, in: Proceedings of the 22nd 
International Conference on Machine Learning, ACM, 2005, pp.\u00a0744\u2013751.","DOI":"10.1145\/1102351.1102445"},{"issue":"5","key":"10.3233\/WEB-200442_ref67","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1016\/0306-4573(88)90021-0","article-title":"Term-weighting approaches in automatic text retrieval","volume":"24","author":"Salton","year":"1988","journal-title":"Inf. Process. Manage."},{"key":"10.3233\/WEB-200442_ref68","doi-asserted-by":"crossref","unstructured":"R.E.\u00a0Schapire and Y.\u00a0Singer, BoosTexter: A boosting-based system for text categorization, in: Machine Learning, 2000, pp.\u00a0135\u2013168.","DOI":"10.1023\/A:1007649029923"},{"key":"10.3233\/WEB-200442_ref69","doi-asserted-by":"crossref","unstructured":"H.\u00a0Sch\u00fctze, D.A.\u00a0Hull and J.O.\u00a0Pedersen, A comparison of classifiers and document representations for the routing problem, in: SIGIR, 1995, pp.\u00a0229\u2013237.","DOI":"10.1145\/215206.215365"},{"issue":"1","key":"10.3233\/WEB-200442_ref70","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/505282.505283","article-title":"Machine learning in automated text categorization","volume":"34","author":"Sebastiani","year":"2002","journal-title":"ACM Computing Surveys"},{"issue":"11","key":"10.3233\/WEB-200442_ref71","doi-asserted-by":"publisher","first-page":"1410","DOI":"10.1093\/bioinformatics\/btm115","article-title":"SherLoc: High-accuracy prediction of protein subcellular localization by integrating text and protein sequence data","volume":"23","author":"Shatkay","year":"2007","journal-title":"Bioinformatics"},{"key":"10.3233\/WEB-200442_ref72","doi-asserted-by":"crossref","unstructured":"D.\u00a0Shen, J.-T.\u00a0Sun, Q.\u00a0Yang, H.\u00a0Zhao and Z.\u00a0Chen, Text classification improved through automatically extracted sequences, in: ICDE\u201906, 2006, 
pp.\u00a01\u20131.","DOI":"10.1109\/ICDE.2006.158"},{"key":"10.3233\/WEB-200442_ref73","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1167"},{"issue":"3","key":"10.3233\/WEB-200442_ref74","doi-asserted-by":"publisher","first-page":"235","DOI":"10.3233\/WIA-2010-0189","article-title":"A knowledge-based model using ontologies for personalized web information gathering","volume":"8","author":"Tao","year":"2010","journal-title":"Web Intelligence and Agent Systems: An International Journal"},{"key":"10.3233\/WEB-200442_ref75","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-49586-6_59"},{"key":"10.3233\/WEB-200442_ref76","doi-asserted-by":"publisher","first-page":"1","DOI":"10.4018\/jdwm.2007070101","article-title":"Multi-label classification: An overview","volume":"2007","author":"Tsoumakas","year":"2007","journal-title":"Int J Data Warehousing and Mining"},{"key":"10.3233\/WEB-200442_ref78","unstructured":"V.\u00a0Vapnik, The Nature of Statistical Learning Theory, Springer Science & Business Media, 2013."},{"issue":"4","key":"10.3233\/WEB-200442_ref79","first-page":"431","article-title":"A study on rough set-aided feature selection for automatic web-page classification","volume":"4","author":"Wakaki","year":"2006","journal-title":"Web Intelligence and Agent Systems: An International Journal"},{"key":"10.3233\/WEB-200442_ref80","unstructured":"K.\u00a0Weedford, E.\u00a0Walter and D.\u00a0Shenton, Cambridge Learner\u2019s Dictionary, 3rd edn, Cambridge University, 2007."},{"key":"10.3233\/WEB-200442_ref81","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2006.50"},{"issue":"3","key":"10.3233\/WEB-200442_ref82","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1109\/TKDE.2008.166","article-title":"Distributional features for text categorization","volume":"21","author":"Xue","year":"2009","journal-title":"IEEE Transactions on Knowledge and Data 
Engineering"},{"key":"10.3233\/WEB-200442_ref83","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835950"},{"key":"10.3233\/WEB-200442_ref84","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1023\/A:1009982220290","article-title":"An evaluation of statistical approaches to text categorization","volume":"1","author":"Yang","year":"1999","journal-title":"Information Retrieval"},{"key":"10.3233\/WEB-200442_ref85","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1145\/183422.183424","article-title":"An example-based mapping method for text categorization and retrieval","volume":"12","author":"Yang","year":"1994","journal-title":"ACM TOIS"},{"key":"10.3233\/WEB-200442_ref86","doi-asserted-by":"publisher","DOI":"10.4249\/scholarpedia.4242"},{"key":"10.3233\/WEB-200442_ref87","unstructured":"Y.\u00a0Yang and J.O.\u00a0Pedersen, A comparative study on feature selection in text categorization, in: Icml, Vol.\u00a097, 1997, p.\u00a035."},{"key":"10.3233\/WEB-200442_ref88","doi-asserted-by":"crossref","unstructured":"Z.\u00a0Yang, D.\u00a0Yang, C.\u00a0Dyer, X.\u00a0He, A.\u00a0Smola and E.\u00a0Hovy, Hierarchical attention networks for document classification, in: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2016, pp.\u00a01480\u20131489.","DOI":"10.18653\/v1\/N16-1174"},{"issue":"4","key":"10.3233\/WEB-200442_ref89","first-page":"203","article-title":"Clustering web pages about persons and organizations","volume":"3","author":"Ye","year":"2005","journal-title":"Web Intelligence and Agent Systems: An International Journal"},{"key":"10.3233\/WEB-200442_ref90","doi-asserted-by":"crossref","unstructured":"L.\u00a0Zhang, Y.\u00a0Li, C.\u00a0Sun and W.\u00a0Nadee, Rough set based approach to text classification, in: Proceedings of the 2013 IEEE\/WIC\/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) \u2013 
Volume 03, IEEE Computer Society, 2013, pp.\u00a0245\u2013252.","DOI":"10.1109\/WI-IAT.2013.190"},{"key":"10.3233\/WEB-200442_ref91","doi-asserted-by":"publisher","DOI":"10.1109\/DSAA.2014.7058104"},{"issue":"8","key":"10.3233\/WEB-200442_ref92","doi-asserted-by":"publisher","first-page":"1819","DOI":"10.1109\/TKDE.2013.39","article-title":"A review on multi-label learning algorithms","volume":"26","author":"Zhang","year":"2013","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"10.3233\/WEB-200442_ref93","unstructured":"X.\u00a0Zhang, J.\u00a0Zhao and Y.\u00a0LeCun, Character-level convolutional networks for text classification, in: Advances in Neural Information Processing Systems, 2015, pp.\u00a0649\u2013657."},{"key":"10.3233\/WEB-200442_ref94","doi-asserted-by":"crossref","unstructured":"Y.\u00a0Zhang and J.\u00a0Callan, Maximum likelihood estimation for filtering thresholds, in: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM Press, 2001, pp.\u00a0294\u2013302.","DOI":"10.1145\/383952.384012"},{"key":"10.3233\/WEB-200442_ref96","doi-asserted-by":"crossref","unstructured":"X.\u00a0Zhou, Y.\u00a0Li, P.\u00a0Bruza, Y.\u00a0Xu and R.Y.K.\u00a0Lau, Pattern mining for a two-stage information filtering system, in: Proceedings of the 15th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining \u2013 Volume Part\u00a0I, PAKDD\u201911, Springer-Verlag, Berlin, Heidelberg, 2011, pp.\u00a0363\u2013374. 
ISBN 978-3-642-20840-9.","DOI":"10.1007\/978-3-642-20841-6_30"},{"key":"10.3233\/WEB-200442_ref97","doi-asserted-by":"crossref","unstructured":"X.\u00a0Zhou, Y.\u00a0Li, P.D.\u00a0Bruza, Y.\u00a0Xu and R.Y.\u00a0Lau, Rough sets based reasoning and pattern mining for a two-stage information filtering system, in: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, ACM, 2010, pp.\u00a01429\u20131432.","DOI":"10.1145\/1871437.1871639"},{"key":"10.3233\/WEB-200442_ref98","unstructured":"X.\u00a0Zhou, Y.\u00a0Li, Y.\u00a0Xu and R.\u00a0Lau, Relevence assessment of topic ontology, in: Proceedings of the 2006 Conference on Advances in Intelligent IT: Active Media Technology 2006, IOS Press, 2006, pp.\u00a044\u201351."},{"key":"10.3233\/WEB-200442_ref99","doi-asserted-by":"publisher","DOI":"10.1145\/3106426.3106459"},{"key":"10.3233\/WEB-200442_ref100","doi-asserted-by":"publisher","DOI":"10.1109\/CSCWD.2013.6581022"}],"container-title":["Web Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/WEB-200442","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T05:27:20Z","timestamp":1777613240000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/WEB-200442"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,30]]},"references-count":93,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.3233\/web-200442","relation":{},"ISSN":["2405-6464","2405-6456"],"issn-type":[{"value":"2405-6464","type":"electronic"},{"value":"2405-6456","type":"print"}],"subject":[],"published":{"date-parts":[[2020,9,30]]}}}