{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,2]],"date-time":"2026-02-02T14:47:50Z","timestamp":1770043670031,"version":"3.49.0"},"reference-count":22,"publisher":"SAGE Publications","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IFS"],"published-print":{"date-parts":[[2021,8,11]]},"abstract":"<jats:p>Topic models are widely used in building clusters of documents for more than a decade, yet problems occurring in choosing the optimal number of topics. The main problem is the lack of a stable metric of the quality of topics obtained during the construction of topic models. The authors analyzed from previous works, most of the models used in determining the number of topics are non-parametric and the quality of topics determined by using perplexity and coherence measures and concluded that they are not applicable in solving this problem. In this paper, we used the parametric method, which is an extension of the traditional topic model with visual access tendency for visualization of the number of topics (clusters) to complement clustering and to choose the optimal number of topics based on results of cluster validity indices. Developed hybrid topic models are demonstrated with different Twitter datasets on various topics in obtaining the optimal number of topics and in measuring the quality of clusters. The experimental results showed that the Visual Non-negative Matrix Factorization (VNMF) topic model performs well in determining the optimal number of topics with interactive visualization and in performance measure of the quality of clusters with validity indices.<\/jats:p>","DOI":"10.3233\/jifs-202707","type":"journal-article","created":{"date-parts":[[2021,5,21]],"date-time":"2021-05-21T13:19:55Z","timestamp":1621603195000},"page":"803-817","source":"Crossref","is-referenced-by-count":6,"title":["Visualization and performance measure to determine number of topics in twitter data clustering using hybrid topic modeling"],"prefix":"10.1177","volume":"41","author":[{"given":"R.M.","family":"Noorullah","sequence":"first","affiliation":[{"name":"Department of CSE, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, India"}]},{"given":"Moulana","family":"Mohammed","sequence":"additional","affiliation":[{"name":"Department of CSE, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, India"}]}],"member":"179","reference":[{"key":"10.3233\/JIFS-202707_ref1","doi-asserted-by":"publisher","DOI":"10.1145\/2808797.2809344"},{"key":"10.3233\/JIFS-202707_ref2","doi-asserted-by":"publisher","first-page":"993","DOI":"10.1162\/jmlr.2003.3.4-5.993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn"},{"key":"10.3233\/JIFS-202707_ref4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/1667053.1667056","article-title":"The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies","volume":"57","author":"Blei","year":"2010","journal-title":"J. ACM"},{"key":"10.3233\/JIFS-202707_ref5","doi-asserted-by":"publisher","DOI":"10.1080\/21670811.2015.1093271"},{"key":"10.3233\/JIFS-202707_ref6","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-3110"},{"key":"10.3233\/JIFS-202707_ref9","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1016\/j.eswa.2018.07.063","article-title":"Document-based topic coherence measures for news media text","volume":"114","author":"Damir Korenci","year":"2018","journal-title":"Expert systems with Applications"},{"key":"10.3233\/JIFS-202707_ref11","doi-asserted-by":"crossref","first-page":"498","DOI":"10.1007\/978-3-662-44848-9_32","article-title":"How many topics? stability analysis for topic models","volume":"8724","author":"Greene","year":"2014","journal-title":"Machine Learning and Knowledge Discovery in Databases"},{"key":"10.3233\/JIFS-202707_ref12","doi-asserted-by":"publisher","DOI":"10.1145\/2911451.2914714"},{"key":"10.3233\/JIFS-202707_ref13","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-8659.2012.03108.x"},{"issue":"3","key":"10.3233\/JIFS-202707_ref14","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1007\/s10994-013-5413-0","article-title":"Interactive topic modeling","volume":"95","author":"Hu","year":"2014","journal-title":"Machine Learning"},{"key":"10.3233\/JIFS-202707_ref15","doi-asserted-by":"publisher","first-page":"1992","DOI":"10.1109\/TVCG.2013.212","article-title":"UTOPIAN: User-driven topic modeling based on interactive nonnegative matrix factorization","volume":"19","author":"Jaegul Choo","year":"2013","journal-title":"IEEE Transaction on Visualization and Computer Graphics"},{"key":"10.3233\/JIFS-202707_ref16","first-page":"2579","article-title":"Laurens van der Maaten and Jeoffrey Hinton, Visualizing data using t-SNE","volume":"9","author":"Laurens van der Maaten","year":"2008","journal-title":"Journal of Machine Learning Research"},{"key":"10.3233\/JIFS-202707_ref17","doi-asserted-by":"publisher","DOI":"10.1109\/icis.2016.7550929"},{"key":"10.3233\/JIFS-202707_ref20","doi-asserted-by":"publisher","DOI":"10.1109\/acdt.2016.7437660"},{"key":"10.3233\/JIFS-202707_ref21","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s12065-019-00300-y","article-title":"Visual topic models for healthcare data clustering","volume":"1","author":"Rajendra Prasad","year":"2019","journal-title":"Evolutionary Intelligence"},{"issue":"11","key":"10.3233\/JIFS-202707_ref22","doi-asserted-by":"publisher","first-page":"491","DOI":"10.14569\/IJACSA.2019.0101168","article-title":"Hybrid topic cluster models for social healthcare data","volume":"10","author":"Rajendra Prasad","year":"2019","journal-title":"International Journal of Advanced Computer Science and Applications (IJACSA)"},{"key":"10.3233\/JIFS-202707_ref23","doi-asserted-by":"publisher","DOI":"10.14569\/IJACSA.2015.060121"},{"key":"10.3233\/JIFS-202707_ref24","doi-asserted-by":"publisher","DOI":"10.1109\/ICRTCCM.2017.60"},{"key":"10.3233\/JIFS-202707_ref25","doi-asserted-by":"publisher","DOI":"10.23919\/FRUCT.2017.8071303"},{"key":"10.3233\/JIFS-202707_ref28","doi-asserted-by":"publisher","DOI":"10.1145\/1553374.1553515"},{"key":"10.3233\/JIFS-202707_ref29","unstructured":"Wongkot Sriurai , Phayung Meesad I. and Choochart Haruechaiyasak R. , Web Page Classification Based on a Topic Model and Neighboring Pages Integration, International Journal of Computer Science and Information Security (IJCSIS) 7(2) (2010), DOI: arXiv:1003.1510[cs.LG]."},{"key":"10.3233\/JIFS-202707_ref30","doi-asserted-by":"publisher","first-page":"58407","DOI":"10.1109\/access.2019.2914097","article-title":"Research on topic detection and tracking for online news texts","volume":"7","author":"Xu","year":"2019","journal-title":"IEEE Access"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/JIFS-202707","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,2]],"date-time":"2026-02-02T03:17:28Z","timestamp":1770002248000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/JIFS-202707"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,11]]},"references-count":22,"journal-issue":{"issue":"1"},"URL":"https:\/\/doi.org\/10.3233\/jifs-202707","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,11]]}}}