{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:50:21Z","timestamp":1777704621102,"version":"3.51.4"},"reference-count":35,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2018,7,24]],"date-time":"2018-07-24T00:00:00Z","timestamp":1532390400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2018,12,24]]},"abstract":"<jats:p>This article proposes the modified KNN (K Nearest Neighbor) algorithm which receives a string vector as its input data and is applied to the text summarization. The results from applying the string vector based algorithms to the text categorizations were successful in previous works and the text summarization is able to be viewed as a binary classification where each paragraph is classified into summary or non-summary. In the proposed system, a text which is given as the input is partitioned into a list of paragraphs, each paragraph is classified by the proposed KNN version, and the paragraphs which are classified into summary are extracted as the output. The proposed KNN version is empirically validated as the better approach in deciding whether each paragraph is essential or not in news articles and opinions. 
We need to define and characterize mathematically more operations on string vectors for modifying more advanced machine learning algorithms.<\/jats:p>","DOI":"10.3233\/jifs-169841","type":"journal-article","created":{"date-parts":[[2018,7,27]],"date-time":"2018-07-27T19:29:29Z","timestamp":1532719769000},"page":"6005-6016","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":5,"title":["Automatic text summarization using string vector based K nearest neighbor"],"prefix":"10.1177","volume":"35","author":[{"given":"Taeho","family":"Jo","sequence":"first","affiliation":[{"name":"School of Game, Hongik University, 2639, Sejongro, Sejong, South Korea"}]}],"member":"179","published-online":{"date-parts":[[2018,7,24]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"crossref","unstructured":"AbainiaK. OuamourS. and SayoudH. Neural Text Categorizer for topic identification of noisy Arabic Texts Proceedings of 12th IEEE Conference on Computer Systems and Applications 2015 pp. 1\u20138.","DOI":"10.1109\/AICCSA.2015.7507237"},{"key":"e_1_3_2_3_2","volume-title":"Retrieval: The Concepts and Technology behind Search","author":"Baeza-Yates R.","year":"2011","unstructured":"Baeza-YatesR. and Ribeiro-NetoB., Modern Information, Retrieval: The Concepts and Technology behind Search, 2011Addison-Wesley."},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","unstructured":"ChuangW. and YangJ. Extracting sentence segments for text summarization: a machine learning approach Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval 2000 pp. 152\u2013159.","DOI":"10.1145\/345508.345566"},{"key":"e_1_3_2_5_2","doi-asserted-by":"crossref","unstructured":"FirteL. LemnaruC. and PotoleaR. Spam detection filter using KNN algorithm and resampling Proceedings of IEEE International Conference on Intelligent Computer Communication and Processing 2010 pp. 
27\u201333.","DOI":"10.1109\/ICCP.2010.5606466"},{"key":"e_1_3_2_6_2","doi-asserted-by":"crossref","unstructured":"HanE. KarypisS.G. and KumarV. Text categorization using weight adjusted k-nearest neighbor classification Proceedings of Pacific-Asia conference on knowledge discovery and data mining 2001 pp. 53\u201365.","DOI":"10.1007\/3-540-45357-1_9"},{"key":"e_1_3_2_7_2","unstructured":"JamesC. KoprinskaI. and PoonJ. A neural network based approach to automated e-mail classification Proceedings of IEEE International Conferences on Web Intelligence 2003 pp. 702\u2013705."},{"key":"e_1_3_2_8_2","unstructured":"JoT. NeuroTextCategorizer: A New Model of Neural Network for Text Categorization The Proceedings of ICONIP (2000) pp. 280\u2013285."},{"key":"e_1_3_2_9_2","unstructured":"JoT. The Implementation of Dynamic Document Organization using Text Categorization and Text Clustering PhD Dissertation of University of Ottawa 2006."},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.5391\/IJFIS.2008.8.3.231"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.3745\/JIPS.2008.4.2.077"},{"key":"e_1_3_2_12_2","first-page":"13","article-title":"Modification of classification algorithm in favor of text categorization","volume":"2","author":"Jo T.","year":"2009","unstructured":"JoT., Modification of classification algorithm in favor of text categorization, International Journal of Computer Science and Software Technology, 2 (2009) 13\u201323.","journal-title":"International Journal of Computer Science and Software Technology"},{"key":"e_1_3_2_13_2","first-page":"21","article-title":"Modification of clustering algorithms for text clustering","volume":"3","author":"Jo T.","year":"2010","unstructured":"JoT., Modification of clustering algorithms for text clustering, International Journal of Computer Science and Software Technology, 3 (2010) 21\u201333.","journal-title":"International Journal of Computer Science and Software 
Technology"},{"key":"e_1_3_2_14_2","first-page":"83","article-title":"NTC (Neural Text Categorizer): Neural network for text categorization","volume":"2","author":"Jo T.","year":"2010","unstructured":"JoT., NTC (Neural Text Categorizer): Neural network for text categorization, International Journal of Information Studies, 2 (2010) 83\u201396.","journal-title":"International Journal of Information Studies"},{"key":"e_1_3_2_15_2","first-page":"31","article-title":"NTSO (Neural Text Self Organizer): A new neural network for text clustering","volume":"1","author":"Jo T.","year":"2010","unstructured":"JoT., NTSO (Neural Text Self Organizer): A new neural network for text clustering, Journal of Network Technology, 1 (2010) 31\u201343.","journal-title":"Journal of Network Technology"},{"key":"e_1_3_2_16_2","unstructured":"JoT. Device and Method for Categorizing Electronic Document Automatically 10-2009-0041272 10-1071495 2011."},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-014-1411-9"},{"key":"e_1_3_2_18_2","first-page":"45585","article-title":"Simulation of numerical semantic operations on string in text collection","volume":"10","author":"Jo T.","year":"2015","unstructured":"JoT., Simulation of numerical semantic operations on string in text collection, International Journal of Applied Engineering Research, 10 (2015) 45585\u201345591.","journal-title":"International Journal of Applied Engineering Research"},{"key":"e_1_3_2_19_2","first-page":"127","article-title":"Index based approach for text categorization","volume":"2","author":"Jo T.","year":"2008","unstructured":"JoT. and ChoD., Index based approach for text categorization, International Journal of Mathematics and Computers in Simulation, 2 (2008) 127\u2013132.","journal-title":"International Journal of Mathematics and Computers in Simulation"},{"key":"e_1_3_2_20_2","first-page":"558","article-title":"Text Clustering using NTSO","author":"Jo T.","year":"2005","unstructured":"JoT. 
and JapkowiczN., Text Clustering using NTSO, The Proceedings of IJCNN, (2005) pp. 558\u2013563.","journal-title":"The Proceedings of IJCNN"},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","unstructured":"KateR.J. and MooneyR.J. Using String Kernels for Learning Semantic Parsers Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics 2006 pp. 913\u2013920.","DOI":"10.3115\/1220175.1220290"},{"key":"e_1_3_2_22_2","first-page":"4","article-title":"A review of machine learning algorithms for text-documents classification","volume":"1","author":"Khan A.","year":"2010","unstructured":"KhanA., BaharudinB., LeeL.H. and KhanK., A review of machine learning algorithms for text-documents classification, Journal of Advances in Information Technology, 1 (2010) 4\u201320.","journal-title":"Journal of Advances in Information Technology"},{"key":"e_1_3_2_23_2","first-page":"199","article-title":"Collocation dictionary optimization using WordNet and k-nearest neighbor learning","volume":"16","author":"Kim Y.","year":"2001","unstructured":"KimY., ZhangB. and KimY.T., Collocation dictionary optimization using WordNet and k-nearest neighbor learning, Machine Translation, 16 (2001) 199\u2013108.","journal-title":"Machine Translation"},{"key":"e_1_3_2_24_2","first-page":"44","article-title":"An empirical performance comparison of machine learning methods for spam e-mail categorization","author":"Lai C.","year":"2004","unstructured":"LaiC. and TsaiM., An empirical performance comparison of machine learning methods for spam e-mail categorization, Proceedings of IEEE International Conference on Hybrid Intelligent Systems, (2004) pp. 
44\u201348.","journal-title":"Proceedings of IEEE International Conference on Hybrid Intelligent Systems"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btg431"},{"key":"e_1_3_2_26_2","first-page":"419","article-title":"Text classification with string kernels","volume":"2","author":"Lodhi H.","year":"2002","unstructured":"LodhiH., SaundersC., Shawe-TaylorJ., CristianiniN. and WatkinsC., Text classification with string kernels, Journal of Machine Learning Research, 2 (2002) 419\u2013444.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_27_2","unstructured":"ManningC.D. and SchutzeH. Foundations of Statistical Natural Language Processing MIT Press 1999."},{"key":"e_1_3_2_28_2","unstructured":"MitchellT. Machine Learning 1st ed. McGraw-Hill 1997."},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.7763\/IJMLC.2012.V2.158"},{"key":"e_1_3_2_30_2","doi-asserted-by":"crossref","unstructured":"PekarV. and StaabS. Word classification based on combined measures of distributional and semantic similarity Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics 2003 pp. 147\u2013150.","DOI":"10.3115\/1067737.1067770"},{"key":"e_1_3_2_31_2","doi-asserted-by":"crossref","unstructured":"SebastianiF. Machine learning in automated text categorization ACM Computing Survey (2002) pp. 1\u201347.","DOI":"10.1145\/505282.505283"},{"key":"e_1_3_2_32_2","doi-asserted-by":"crossref","unstructured":"StaufferM. FischerA. and RiesenK. A novel graph database for handwritten word images Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (2016) pp. 553\u2013563.","DOI":"10.1007\/978-3-319-49055-7_49"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.14257\/ijdta.2014.7.1.06"},{"key":"e_1_3_2_34_2","unstructured":"D.WienerE. A Neural Network Approach to Topic Spotting in Text. 
Master Thesis the Faculty of the Graduate School of the University of Colorado. 1995."},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009982220290"},{"key":"e_1_3_2_36_2","doi-asserted-by":"crossref","unstructured":"ZhengY. ChengX. HuangR. and ManY. A comparative study on text clustering methods Advanced Data Mining and Applications (2006) 644\u2013651.","DOI":"10.1007\/11811305_71"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169841","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-169841","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-169841","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:41:38Z","timestamp":1777455698000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-169841"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,24]]},"references-count":35,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,12,24]]}},"alternative-id":["10.3233\/JIFS-169841"],"URL":"https:\/\/doi.org\/10.3233\/jifs-169841","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,7,24]]}}}