{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T16:05:34Z","timestamp":1759939534655},"reference-count":29,"publisher":"IGI Global","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,4]]},"abstract":"<jats:p>This article describes how text documents are a major data structure in the era of big data. With the explosive growth of data, the number of documents with multi-labels has increased dramatically. The popular multi-label classification technology, which is usually employed to handle multinomial text documents, is sensitive to the noise terms of text documents. Therefore, there still exists a huge room for multi-label classification of text documents. This article introduces a supervised topic model, named labeled LDA with function terms (LF-LDA), to filter out the noisy function terms from text documents, which can help to improve the performance of multi-label classification of text documents. The article also shows the derivation of the Gibbs Sampling formulas in detail, which can be generalized to other similar topic models. Based on the textual data set RCV1-v2, the article compared the proposed model with other two state-of-the-art multi-label classifiers, Tuned SVM and labeled LDA, on both Macro-F1 and Micro-F1 metrics. The result shows that LF-LDA outperforms them and has the lowest variance, which indicates the robustness of the LF-LDA classifier.<\/jats:p>","DOI":"10.4018\/ijdwm.2018040102","type":"journal-article","created":{"date-parts":[[2018,3,22]],"date-time":"2018-03-22T14:19:00Z","timestamp":1521728340000},"page":"18-36","source":"Crossref","is-referenced-by-count":6,"title":["LF-LDA"],"prefix":"10.4018","volume":"14","author":[{"given":"Yongjun","family":"Zhang","sequence":"first","affiliation":[{"name":"Faculty of Computer and Software Engineering, Huaiyin Institute of Technology, Huaian, China & College of Computer and Information, Hohai University, Nanjing, China"}]},{"given":"Zijian","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Computer and Information, Hohai University, Nanjing, China"}]},{"given":"Yongtao","family":"Yu","sequence":"additional","affiliation":[{"name":"Huaiyin Institute of Technology, Huaian, China"}]},{"given":"Bolun","family":"Chen","sequence":"additional","affiliation":[{"name":"Huaiyin Institute of Technology, Huaian, China"}]},{"given":"Jialin","family":"Ma","sequence":"additional","affiliation":[{"name":"The Laboratory for Internet of Things and Mobile Internet Technology of Jiangsu Province, Huaiyin Institute of Technology, Huaian, China & College of Computer and Information, Hohai University, Nanjing, China"}]},{"given":"Liang","family":"Shi","sequence":"additional","affiliation":[{"name":"Jiangsu Vocational College of Business, Nantong, China"}]}],"member":"2432","reference":[{"key":"IJDWM.2018040102-0","first-page":"327","article-title":"Supervised topic models.","volume":"3","author":"D. M.Blei","year":"2010","journal-title":"Advances in Neural Information Processing Systems"},{"key":"IJDWM.2018040102-1","first-page":"993","article-title":"Latent dirichlet allocation.","volume":"3","author":"D. M.Blei","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"IJDWM.2018040102-2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2004.03.009"},{"key":"IJDWM.2018040102-3","unstructured":"Brinker, K., & H\u00fcllermeier, E. (2007). Case-Based Multilabel Ranking. In IJCAI 2007, Proceedings of the, International Joint Conference on Artificial Intelligence, Hyderabad, India (pp. 702-707)."},{"key":"IJDWM.2018040102-4","first-page":"681","article-title":"A kernel method for multi-labelled classification.","volume":"Vol. 14","author":"A.Elisseeff","year":"2001","journal-title":"International Conference on Neural Information Processing Systems: Natural and Synthetic"},{"key":"IJDWM.2018040102-5","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2017.2692728"},{"key":"IJDWM.2018040102-6","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2016.2596138"},{"key":"IJDWM.2018040102-7","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0307752101"},{"key":"IJDWM.2018040102-8","unstructured":"Guo, Y., & Gu, S. (2011). Multi-Label Classification Using Conditional Dependency Networks. In IJCAI 2011, Proceedings of the, International Joint Conference on Artificial Intelligence, Barcelona, Catalonia, Spain (pp. 1300-1305)."},{"key":"IJDWM.2018040102-9","doi-asserted-by":"publisher","DOI":"10.1108\/IJWIS-01-2016-0002"},{"key":"IJDWM.2018040102-10","first-page":"897","article-title":"Disclda: discriminative learning for dimensionality reduction and classification.","author":"S.Lacoste-Julien","year":"2008","journal-title":"Proceedings of NIPS Neural Information Processing Systems"},{"key":"IJDWM.2018040102-11","doi-asserted-by":"publisher","DOI":"10.1504\/IJSSC.2012.047466"},{"key":"IJDWM.2018040102-12","doi-asserted-by":"publisher","DOI":"10.3115\/1699510.1699543"},{"key":"IJDWM.2018040102-13","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-011-5256-5"},{"key":"IJDWM.2018040102-14","doi-asserted-by":"publisher","DOI":"10.1504\/IJGUC.2016.077487"},{"key":"IJDWM.2018040102-15","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007649029923"},{"key":"IJDWM.2018040102-16","doi-asserted-by":"publisher","DOI":"10.1504\/IJSSC.2016.082760"},{"key":"IJDWM.2018040102-17","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2007.01.029"},{"key":"IJDWM.2018040102-18","doi-asserted-by":"publisher","DOI":"10.1002\/9780470391365"},{"key":"IJDWM.2018040102-19","doi-asserted-by":"publisher","DOI":"10.4018\/jdwm.2007070101"},{"key":"IJDWM.2018040102-20","doi-asserted-by":"crossref","unstructured":"Tsoumakas, G., Katakis, I., & Vlahavas, I. (2009). Mining multi-label data.","DOI":"10.1007\/978-0-387-09823-4_34"},{"key":"IJDWM.2018040102-21","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2010.164"},{"key":"IJDWM.2018040102-22","doi-asserted-by":"crossref","unstructured":"Vinod, D. S., & P, M. (2015). Support vector machine-based stuttering dysfluency classification using gmm supervectors. International Journal of Grid & Utility Computing, 6(3\/4), 143-149.","DOI":"10.1504\/IJGUC.2015.070680"},{"key":"IJDWM.2018040102-23","doi-asserted-by":"publisher","DOI":"10.1108\/IJWIS-04-2014-0013"},{"key":"IJDWM.2018040102-24","doi-asserted-by":"publisher","DOI":"10.1007\/s11063-009-9095-3"},{"key":"IJDWM.2018040102-25","doi-asserted-by":"crossref","first-page":"999","DOI":"10.1145\/1835804.1835930","article-title":"Multi-label learning by exploiting label dependency.","author":"M. L.Zhang","year":"2010","journal-title":"ACM SIGKDD International Conference on Knowledge Discovery and Data Mining"},{"key":"IJDWM.2018040102-26","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2006.162"},{"key":"IJDWM.2018040102-27","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2006.12.019"},{"key":"IJDWM.2018040102-28","doi-asserted-by":"publisher","DOI":"10.1145\/1553374.1553535"}],"container-title":["International Journal of Data Warehousing and Mining"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=202996","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,6]],"date-time":"2022-05-06T11:51:49Z","timestamp":1651837909000},"score":1,"resource":{"primary":{"URL":"http:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/IJDWM.2018040102"}},"subtitle":["A Supervised Topic Model for Multi-Label Documents Classification"],"short-title":[],"issued":{"date-parts":[[2018,4]]},"references-count":29,"journal-issue":{"issue":"2"},"URL":"https:\/\/doi.org\/10.4018\/ijdwm.2018040102","relation":{},"ISSN":["1548-3924","1548-3932"],"issn-type":[{"value":"1548-3924","type":"print"},{"value":"1548-3932","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,4]]}}}