{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T06:54:13Z","timestamp":1777704853891,"version":"3.51.4"},"reference-count":31,"publisher":"SAGE Publications","issue":"2","license":[{"start":{"date-parts":[[2020,6,11]],"date-time":"2020-06-11T00:00:00Z","timestamp":1591833600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"published-print":{"date-parts":[[2020,8,31]]},"abstract":"<jats:p>Currently, the semantic analysis is used by different fields, such as information retrieval, the biomedical domain, and natural language processing. The primary focus of this research work is on using semantic methods, the cosine similarity algorithm, and fuzzy logic to improve the matching of documents. The algorithms were applied to plain texts in this case CVs (resumes) and job descriptions. Synsets of WordNet were used to enrich the semantic similarity methods such as the Wu-Palmer Similarity (WUP), Leacock-Chodorow similarity (LCH), and path similarity (hypernym\/hyponym). Additionally, keyword extraction was used to create a postings list where keywords were weighted. The task of recruiting new personnel in the companies that publish job descriptions and reciprocally finding a company when workers publish their resumes is discussed in this research work. The creation of a new gold standard was required to achieve a comparison of the proposed methods. A web application was designed to match the documents manually, creating the new gold standard. Thereby the new gold standard confirming benefits of enriching the cosine algorithm semantically. Finally, the results were compared with the new gold standard to check the efficiency of the new methods proposed. The measures used for the analysis were precision, recall, and f-measure, concluding that the cosine similarity weighted semantically can be used to get better similarity scores.<\/jats:p>","DOI":"10.3233\/jifs-179889","type":"journal-article","created":{"date-parts":[[2020,6,12]],"date-time":"2020-06-12T12:40:19Z","timestamp":1591965619000},"page":"2263-2278","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":3,"title":["Measuring semantic similarity of documents with weighted cosine and fuzzy logic"],"prefix":"10.1177","volume":"39","author":[{"given":"Juan","family":"Huetle-Figueroa","sequence":"first","affiliation":[{"name":"Department of Computing, Technological University Dublin, Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fernando","family":"Perez-Tellez","sequence":"additional","affiliation":[{"name":"Department of Computing, Technological University Dublin, Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Pinto","sequence":"additional","affiliation":[{"name":"Faculty of Computer Science, Benem\u00e9rita Universidad Aut\u00f3noma de Puebla, PUE, Mexico"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2020,6,11]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"crossref","unstructured":"B\u00e9lohl\u00e1vekR. DaubenJ.W. and KlirG.J. Fuzzy logic and mathematics: a historical perspective. Oxford University Press 2017.","DOI":"10.1093\/oso\/9780190200015.001.0001"},{"key":"e_1_3_1_3_2","unstructured":"BojanowskiP. GraveE. JoulinA. and MikolovT. Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016)."},{"key":"e_1_3_1_4_2","unstructured":"BojanowskiP. JoulinA. and MikolovT. Alternative structures for character-level rnns. arXiv preprint arXiv:1511.06303 (2015)."},{"key":"e_1_3_1_5_2","doi-asserted-by":"crossref","unstructured":"De BoomC. Van CanneytS. BohezS. DemeesterT. and DhoedtB. Learning semantic similarity for very short texts. In 2015 ieee international conference on data mining workshop (icdmw) (2015) IEEE pp. 1229\u20131234.","DOI":"10.1109\/ICDMW.2015.86"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1017\/S0269888917000029"},{"key":"e_1_3_1_7_2","unstructured":"FerreiraJ.D. and CoutoF.M. Semantic similarity in cheminformatics. In Cheminformatics and its Applications. IntechOpen 2019."},{"key":"e_1_3_1_8_2","unstructured":"FinlaysonM. Java libraries for accessing the princeton wordnet: Comparison and evaluation. In Proceedings of the Seventh Global Wordnet Conference (2014) pp. 78\u201385."},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.4018\/jswis.2006070104"},{"key":"e_1_3_1_10_2","article-title":"Vector based approaches to semantic similarity measures","volume":"163","author":"Huerta J.M.","year":"2008","unstructured":"HuertaJ.M., Vector based approaches to semantic similarity measures, Advances in Natural Language Processing and Applications163 (2008).","journal-title":"Advances in Natural Language Processing and Applications"},{"key":"e_1_3_1_11_2","unstructured":"Huetle-FigueroaJ. PerezF. and PintoD. On detecting keywords for concept mapping in plain text International Journal of Computational Linguistics and Applications (IJCLA) (2018). In 6th International Symposium on Language & Knowledge Engineering."},{"key":"e_1_3_1_12_2","unstructured":"KandolaJ. CristianiniN. and Shawe-TaylorJ.S. Learning semantic similarity. In Advances in neural information processing systems (2003) pp. 673\u2013680."},{"key":"e_1_3_1_13_2","doi-asserted-by":"crossref","unstructured":"LeacockC. and ChodorowM. Combining local context and wordnet sense similarity for word sense identification. wordnet an electronic lexical database. The MIT Press (1998).","DOI":"10.7551\/mitpress\/7287.003.0018"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2003.1209005"},{"key":"e_1_3_1_15_2","unstructured":"LinD. et al. An information-theoretic definition of similarity. In Icml (1998) vol. 98 Citeseer pp. 296\u2013304."},{"key":"e_1_3_1_16_2","doi-asserted-by":"crossref","unstructured":"LuoC. ZhanJ. XueX. WangL. RenR. and YangQ. Cosine normalization: Using cosine similarity instead of dot product in neural networks. In International Conference on Artificial Neural Networks (2018) Springer pp. 382\u2013391.","DOI":"10.1007\/978-3-030-01418-6_38"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cogsys.2016.01.001"},{"issue":"1","key":"e_1_3_1_18_2","first-page":"1","article-title":"A review of semantic similarity measures in wordnet","volume":"6","author":"Meng L.","year":"2013","unstructured":"MengL., HuangR. and GuJ., A review of semantic similarity measures in wordnet, International Journal of Hybrid Information Technology6(1) (2013), 1\u201312.","journal-title":"International Journal of Hybrid Information Technology"},{"key":"e_1_3_1_19_2","first-page":"775","article-title":"Corpus-based and knowledge-based measures of text semantic similarity","volume":"6","author":"Mihalcea R.","year":"2006","unstructured":"MihalceaR., CorleyC., StrapparavaC., et al., Corpus-based and knowledge-based measures of text semantic similarity. In AAAI6 (2006), pp. 775\u2013780.","journal-title":"AAAI"},{"key":"e_1_3_1_20_2","unstructured":"BabarMR.S.A. and PatilMS.P.D. Fuzzy approach for document summarization Journal of Information Knowledge and Research in Computer Engineering Vol. 03 ISSN: 0975 \u00e2\u0102\u015e 6760 (nov 2014) 630\u2013634."},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1080\/17517575.2018.1464666"},{"key":"e_1_3_1_22_2","unstructured":"PawarA. and MagoV. Calculating the similarity between words and sentences using a lexical database and corpus statistics. arXiv preprint arXiv:1802.05667 (2018)."},{"key":"e_1_3_1_23_2","unstructured":"PerkinsJ. Python 3 text processing with NLTK3 cookbook. Packt Publishing Ltd 2014."},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1000443"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2017.03.016"},{"key":"e_1_3_1_26_2","first-page":"133","article-title":"Using tf-idf to determine word relevance in document queries","volume":"242","author":"Ramos J.","year":"2003","unstructured":"RamosJ., et al., Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning242 (2003), pp. 133\u2013142.","journal-title":"Proceedings of the first instructional conference on machine learning"},{"key":"e_1_3_1_27_2","unstructured":"RusV. LinteanM. BanjadeR. NiraulaN. and StefanescuD. Semilar: The semantic similarity toolkit. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations (2013) pp. 163\u2013168."},{"issue":"4","key":"e_1_3_1_28_2","first-page":"35","article-title":"Modern information retrieval: A brief overview","volume":"24","author":"Singhal A.","year":"2001","unstructured":"SinghalA., et al., Modern information retrieval: A brief overview, IEEE Data Eng Bull24(4) (2001), 35\u201343.","journal-title":"IEEE Data Eng Bull"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/1328854.1328855"},{"key":"e_1_3_1_30_2","doi-asserted-by":"crossref","unstructured":"WuZ. and PalmerM. Verbs semantics and lexical selection. In Proceedings of the 32nd annual meeting on Association for Computational Linguistics (1994) Association for Computational Linguistics pp. 133\u2013138.","DOI":"10.3115\/981732.981751"},{"key":"e_1_3_1_31_2","doi-asserted-by":"crossref","unstructured":"YanH. DingS. and SuelT. Inverted index compression and query processing with optimized document ordering. In Proceedings of the 18th international conference on World wide web (2009) ACM pp. 401\u2013410.","DOI":"10.1145\/1526709.1526764"},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bty410"}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179889","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.3233\/JIFS-179889","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/JIFS-179889","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:42:10Z","timestamp":1777455730000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/JIFS-179889"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,11]]},"references-count":31,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,8,31]]}},"alternative-id":["10.3233\/JIFS-179889"],"URL":"https:\/\/doi.org\/10.3233\/jifs-179889","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,11]]}}}