{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:50:21Z","timestamp":1777704621108,"version":"3.51.4"},"reference-count":31,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2018,7,24]],"date-time":"2018-07-24T00:00:00Z","timestamp":1532390400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2018,12,24]]},"abstract":"<jats:p>This article proposes the modified AHC (Agglomerative Hierarchical Clustering) algorithm which considers the feature similarity and is applied to the text clustering. The words which are given as features for encoding texts into numerical vectors are semantic related entities, rather than independent ones, and the synergy effect between the word clustering and the text clustering is expected by combining both of them with each other. In this research, we define the similarity metric between numerical vectors considering the feature similarity, and modify the AHC algorithm by adopting the proposed similarity metric as the approach to the text clustering. The proposed AHC algorithm is empirically validated as the better approach in clustering texts in news articles and opinions. The significance of this research is to improve the clustering performance by utilizing the feature similarities.<\/jats:p>","DOI":"10.3233\/jifs-169840","type":"journal-article","created":{"date-parts":[[2018,7,27]],"date-time":"2018-07-27T19:29:24Z","timestamp":1532719764000},"page":"5993-6003","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":3,"title":["Clustering texts using feature similarity based AHC algorithm"],"prefix":"10.1177","volume":"35","author":[{"given":"Taeho","family":"Jo","sequence":"first","affiliation":[{"name":"School of Game, Hongik University, 2639, Sejongro, Sejong, South Korea"}]}],"member":"179","published-online":{"date-parts":[[2018,7,24]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"crossref","unstructured":"AbainiaK. OuamourS. and SayoudH. Neural Text Categorizer for topic identification of noisy Arabic Texts Proceedings of 12th IEEE Conference on Computer Systems and Applications2015 1\u20138.","DOI":"10.1109\/AICCSA.2015.7507237"},{"key":"e_1_3_2_3_2","doi-asserted-by":"crossref","unstructured":"Ah-PineJ. and WangX. Similarity Based Hierarchical Clustering with an Application to Text Collections Proceedings of International Symposium on Intelligent Data Analysis (2016) pp. 320\u2013331.","DOI":"10.1007\/978-3-319-46349-0_28"},{"key":"e_1_3_2_4_2","unstructured":"Baeza-YatesR. Ribeiro-NetoB. Modern Information Retrieval: The Concepts and Technology behind Search Addison-Wesley 2011."},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2006.06.026"},{"key":"e_1_3_2_6_2","doi-asserted-by":"crossref","unstructured":"DhillonI.S. MallelaS. and KumarR. Enhanced Word Clustering for Hierarchical Text Classification Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining2002 pp. 191\u2013200.","DOI":"10.1145\/775047.775076"},{"issue":"7","key":"e_1_3_2_7_2","first-page":"92","article-title":"Web document clustering using hybrid approach in data mining","volume":"3","author":"Gamare P.S.","year":"2015","unstructured":"GamareP.S. and PatilG.A., Web document clustering using hybrid approach in data mining, International Journal of Advent Technology3(7) (2015), 92\u201387.","journal-title":"International Journal of Advent Technology"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.4156\/jdcta.vol4.issue3.9"},{"key":"e_1_3_2_9_2","unstructured":"JoT. Neuro text categorizer: A new model of neural network for text categorization The Proceedings of ICONIP (2000) 280\u2013285."},{"key":"e_1_3_2_10_2","author":"Jo T.","year":"2006","unstructured":"JoT., The Implementation of Dynamic Document Organization using Text Categorization and Text Clustering. PhD Dissertation of University of Ottawa. 2006.","journal-title":"The Implementation of Dynamic Document Organization using Text Categorization and Text Clustering"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.5391\/IJFIS.2008.8.3.231"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.3745\/JIPS.2008.4.2.077"},{"key":"e_1_3_2_13_2","first-page":"13","article-title":"Modification of classification algorithm in favor of text categorization","volume":"2","author":"Jo T.","year":"2009","unstructured":"JoT., Modification of classification algorithm in favor of text categorization, International Journal of Computer Science and Software Technology2 (2009), 13\u201323.","journal-title":"International Journal of Computer Science and Software Technology"},{"key":"e_1_3_2_14_2","first-page":"21","article-title":"Modification of clustering algorithms for text clustering","volume":"3","author":"Jo T.","year":"2010","unstructured":"JoT., Modification of clustering algorithms for text clustering, International Journal of Computer Science and Software Technology3 (2010), 21\u201333.","journal-title":"International Journal of Computer Science and Software Technology"},{"key":"e_1_3_2_15_2","first-page":"83","article-title":"NTC (Neural Text Categorizer): Neural network for text categorization","volume":"2","author":"Jo T.","year":"2010","unstructured":"JoT., NTC (Neural Text Categorizer): Neural network for text categorization, International Journal of Information Studies2 (2010), 83\u201396.","journal-title":"International Journal of Information Studies"},{"key":"e_1_3_2_16_2","first-page":"31","article-title":"NTSO (Neural Text Self Organizer): A new neural network for text clustering","volume":"1","author":"Jo T.","year":"2010","unstructured":"JoT., NTSO (Neural Text Self Organizer): A new neural network for text clustering, Journal of Network Technology1 (2010), 31\u201343.","journal-title":"Journal of Network Technology"},{"key":"e_1_3_2_17_2","unstructured":"JoT. Device and Method for Categorizing Electronic Document Automatically 10-2009-0041272 10-1071495 2011."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-014-1411-9"},{"key":"e_1_3_2_19_2","first-page":"45585","article-title":"Simulation of numerical semantic operations on string in text collection","volume":"10","author":"Jo T.","year":"2015","unstructured":"JoT., Simulation of numerical semantic operations on string in text collection, International Journal of Applied Engineering Research10 (2015), 45585\u201345591.","journal-title":"International Journal of Applied Engineering Research"},{"key":"e_1_3_2_20_2","unstructured":"JoT. and JapkowiczN. Text clustering using NTSO The Proceedings of IJCNN (2005) pp. 558\u2013563."},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","unstructured":"JoT. and LeeM. The evaluation measure of text clustering for the variable number of clusters Lecture Notes in Computer Science2007(4492) 871\u2013879.","DOI":"10.1007\/978-3-540-72393-6_104"},{"key":"e_1_3_2_22_2","doi-asserted-by":"crossref","unstructured":"KateR.J. and MooneyR.J. Using String Kernels for Learning Semantic Parsers Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics (2006) pp. 913\u2013920.","DOI":"10.3115\/1220175.1220290"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btg431"},{"key":"e_1_3_2_24_2","first-page":"419","article-title":"Text classification with string kernels","volume":"2","author":"Lodhi H.","year":"2002","unstructured":"LodhiH., SaundersC., Shawe-TaylorJ., CristianiniN. and WatkinsC., Text classification with string kernels, Journal of Machine Learning Research2 (2002), 419\u2013444.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_25_2","unstructured":"MitchellT. Machine Learning 1st ed 1997McGraw-Hill."},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.7763\/IJMLC.2012.V2.158"},{"key":"e_1_3_2_27_2","doi-asserted-by":"crossref","unstructured":"SebastianiF. Machine learning in automated text categorization ACM Computing Survey (2002) 1\u201347.","DOI":"10.1145\/505282.505283"},{"key":"e_1_3_2_28_2","unstructured":"SlonimN. and TishbyN. The power of word clusters for text classification Proceedings of 23rd European Colloquium on Information Retrieval Research (2001) pp. 200\u2013200."},{"key":"e_1_3_2_29_2","unstructured":"WienerE.D. A Neural Network Approach to Topic Spotting in Text Master Thesis the Faculty of the Graduate School of the University of Colorado. 1995."},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009982220290"},{"key":"e_1_3_2_31_2","doi-asserted-by":"crossref","unstructured":"ZhengY. ChengX. HuangR. and ManY. A comparative study on text clustering methods Advanced Data Mining and Applications (2006) 644\u2013651.","DOI":"10.1007\/11811305_71"},{"key":"e_1_3_2_32_2","doi-asserted-by":"crossref","unstructured":"ZhouE. ZhongN. LiY. and HuangJ. Hot Topic Detection in News Blog Based on W2T Methodology Proceedings of International Conference on Wisdom Web of Things (2016) pp. 237\u2013258.","DOI":"10.1007\/978-3-319-44198-6_10"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169840","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-169840","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169840","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:41:38Z","timestamp":1777455698000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-169840"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,24]]},"references-count":31,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,12,24]]}},"alternative-id":["10.3233\/JIFS-169840"],"URL":"https:\/\/doi.org\/10.3233\/jifs-169840","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,7,24]]}}}