{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T20:54:25Z","timestamp":1777928065715,"version":"3.51.4"},"reference-count":65,"publisher":"SAGE Publications","issue":"3","license":[{"start":{"date-parts":[[2023,4,29]],"date-time":"2023-04-29T00:00:00Z","timestamp":1682726400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Librarianship and Information Science"],"published-print":{"date-parts":[[2024,9]]},"abstract":"<jats:p>The aim of this study is to identify the power of text-based metrics (Cosine and Lucene similarity) and linked-based (Co-citation, bibliographic coupling, Amsler, PageRank, and HITS) and their combination in estimating the similarity of articles with each other. The experiments were conducted on a test collection of 26,262 articles in the PubMed Central Open Access Subset (PMC OAS) of CITREC that, in addition to having linked-based metrics, their full text was available for calculating text-based metrics. Thirty articles were selected as primary articles, and articles related to each of them were retrieved based on the mesh similarity metric. Then, the similarity of the retrieved documents based on text-based and linked-based metrics was also extracted. In the next stage, text-based, linked-based, and hybrid metrics were entered into the generalized regression model to estimate the similarity of the articles to determine their power; finally, the performance of the models was compared based on the mean squared error and correlation. The results showed that the model included Cosine and Lucene similarity metrics in text-based metrics. In linked-based metrics, HITS (Hub), HITS (authority), PageRank, and co-citation had the highest power, respectively; but the bibliographic coupling and Amsler could not enter the model. In general, a comparison of text-based, linked-based, and hybrid metrics performance indicated that the linked-based model estimates similarity between articles better than the text-based model, and the combination of text-based and linked-based metrics makes little change in improving the power of the articles. Despite the importance and application of text-based and linked-based metrics to measure the similarity of articles, a study that examines the power of these metrics alone and in comparison with each other in estimating the similarity of articles was not observed.<\/jats:p>","DOI":"10.1177\/09610006231165759","type":"journal-article","created":{"date-parts":[[2023,4,29]],"date-time":"2023-04-29T08:24:33Z","timestamp":1682756673000},"page":"760-772","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":2,"title":["Comparison of text-based and linked-based metrics in terms of estimating the similarity of articles"],"prefix":"10.1177","volume":"56","author":[{"given":"Marzieh","family":"Goltaji","sequence":"first","affiliation":[{"name":"Shiraz University, Iran"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9253-4071","authenticated-orcid":false,"given":"Javad","family":"Abbaspour","sequence":"additional","affiliation":[{"name":"Shiraz University, Iran"}]},{"given":"Abdolrasool","family":"Jowkar","sequence":"additional","affiliation":[{"name":"Shiraz University, Iran"}]},{"given":"Seyed Mostafa","family":"Fakhrahmad","sequence":"additional","affiliation":[{"name":"Shiraz University, Iran"}]}],"member":"179","published-online":{"date-parts":[[2023,4,29]]},"reference":[{"key":"bibr1-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1145\/1454008.1454052"},{"key":"bibr2-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1162\/qss_a_00027"},{"key":"bibr3-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.joi.2008.11.003"},{"key":"bibr4-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/s11192-007-1935-1"},{"key":"bibr5-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/s11192-012-0630-z"},{"key":"bibr6-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/asi.20941"},{"key":"bibr7-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/asi.20608"},{"key":"bibr8-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1108\/00220410610714930"},{"key":"bibr9-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1197\/jamia.M1909"},{"key":"bibr10-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630310408"},{"key":"bibr11-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/asi.21419"},{"key":"bibr12-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1162\/qss_a_00085"},{"key":"bibr13-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0018029"},{"key":"bibr14-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1186\/1742-5581-3-1"},{"key":"bibr15-09610006231165759","unstructured":"Char DC, Ajiferuke I (2013) Comparison of the effectiveness of related functions in Web of Science and Scopus. In: Proceedings of the annual conference of CAIS\/Actes du Congr\u00e8s Annuel de l'ACSI. Available at: http:\/\/citeseerx.ist.psu.edu\/viewdoc\/download;jsessionid=12BDD2E4D8D78A777A4BCDD5E8FD38B1?doi=10.1.1.181.382&rep=rep1&type=pdf (accessed 30 October 2020)."},{"key":"bibr16-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.joi.2006.06.001"},{"key":"bibr17-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.physa.2019.02.032"},{"key":"bibr18-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1093\/comnet\/cnx023"},{"issue":"2","key":"bibr19-09610006231165759","first-page":"5749","volume":"3","author":"Devi P","year":"2014","journal-title":"International Journal of Advanced Research in Computer and Communication Engineering"},{"key":"bibr20-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/s11192-012-0756-z"},{"key":"bibr21-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2019.05.007"},{"key":"bibr22-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827502412875"},{"key":"bibr23-09610006231165759","unstructured":"Gipp B, Beel J (2009) Citation Proximity Analysis (CPA): A new approach for identifying related work based on co-citation analysis. In: Proceedings of the 12th international conference on scientometrics and informetrics (ISSI\u201909) (eds Larsen B, Leta J), Rio de Janeiro, Brazil, International Society for Scientometrics and Informetrics, pp. 571\u2013575. Available at: https:\/\/isg.beel.org\/pubs\/Citation%20Proximity%20Analysis%20(CPA)%20-%20A%20new%20approach%20for%20identifying%20related%20work%20based%20on%20Co-Citation%20Analysis%20\u2014%20preprint.pdf (accessed 30 September 2020)."},{"key":"bibr24-09610006231165759","unstructured":"Gipp B, Meuschke N, Lipinski M (2015) CITREC: An evaluation framework for citation-based similarity measures based on TREC Genomics and PubMed Central. In: Proceedings of the iConference 2015, Newport Beach, CA, USA. Available at: https:\/\/kops.uni-konstanz.de\/server\/api\/core\/bitstreams\/fac0e9bf-a4c2-40ee-9bad-6a3fcf269281\/content (accessed 15 January 2021)."},{"key":"bibr25-09610006231165759","first-page":"21","volume":"3","author":"Goswami P","year":"2017","journal-title":"International Journal of HIT Transaction on ECCN"},{"key":"bibr26-09610006231165759","doi-asserted-by":"publisher","DOI":"10.3389\/frma.2018.00027"},{"key":"bibr27-09610006231165759","first-page":"316","volume-title":"AMIA Annual Symposium proceedings, Hilton Washington & Towers","author":"Herskovic JR","year":"2005"},{"key":"bibr28-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-15-0751-9_32"},{"key":"bibr29-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1186\/s12874-020-0907-5"},{"issue":"7","key":"bibr30-09610006231165759","first-page":"1679","volume":"67","author":"Jiang X","year":"2016","journal-title":"Journal of the American Society for Information Science and Technology"},{"key":"bibr31-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/asi.5090140103"},{"key":"bibr32-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2017.09.014"},{"key":"bibr33-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1108\/00220410810912451"},{"key":"bibr34-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-47852-4_4"},{"key":"bibr35-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-9-270"},{"key":"bibr36-09610006231165759","doi-asserted-by":"publisher","DOI":"10.3390\/app9235176"},{"key":"bibr37-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2398555"},{"key":"bibr38-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1109\/BIBE.2007.4375740"},{"key":"bibr39-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-006-0023-9"},{"key":"bibr40-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2007.06.006"},{"key":"bibr41-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(198903)40:2<110::AID-ASI5>3.0.CO;2-T"},{"key":"bibr42-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1145\/1013367.1013521"},{"key":"bibr43-09610006231165759","first-page":"903","volume-title":"Proceedings of the seventeeth international joint conference on artificial intelligence","author":"Ng AY","year":"2001"},{"key":"bibr44-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.joi.2011.08.001"},{"key":"bibr45-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-018-0163-2"},{"key":"bibr46-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(198907)40:4<226::AID-ASI2>3.0.CO;2-6"},{"key":"bibr47-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.physa.2020.125344"},{"key":"bibr48-09610006231165759","doi-asserted-by":"publisher","DOI":"10.3390\/app11010162"},{"key":"bibr49-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2015.12.001"},{"key":"bibr50-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1145\/2513228.2513321"},{"key":"bibr51-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/asi.20994"},{"key":"bibr52-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630240406"},{"key":"bibr53-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/s11192-007-1961-z"},{"issue":"4","key":"bibr54-09610006231165759","first-page":"202","volume":"2","author":"Thada V","year":"2013","journal-title":"International Journal of Innovations in Engineering and Technology"},{"key":"bibr55-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1108\/00220410310463491"},{"issue":"1","key":"bibr56-09610006231165759","first-page":"1","volume":"5","author":"Vallez M","year":"2007","journal-title":"Hipertext. net"},{"key":"bibr57-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1109\/ISCIT.2004.1412885"},{"key":"bibr58-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2010.03.010"},{"key":"bibr59-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-01307-2_100"},{"key":"bibr60-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1145\/1963192.1963278"},{"key":"bibr61-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2015.07.036"},{"key":"bibr62-09610006231165759","doi-asserted-by":"publisher","DOI":"10.1016\/j.joi.2022.101291"},{"key":"bibr63-09610006231165759","doi-asserted-by":"publisher","DOI":"10.4236\/iim.2012.44016"},{"issue":"9","key":"bibr64-09610006231165759","first-page":"1824","volume":"61","author":"Zhuge H","year":"2010","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"bibr65-09610006231165759","volume-title":"Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology","author":"Zipf GK","year":"1949"}],"container-title":["Journal of Librarianship and Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/09610006231165759","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/09610006231165759","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/09610006231165759","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T22:59:03Z","timestamp":1777676343000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/09610006231165759"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,29]]},"references-count":65,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,9]]}},"alternative-id":["10.1177\/09610006231165759"],"URL":"https:\/\/doi.org\/10.1177\/09610006231165759","relation":{},"ISSN":["0961-0006","1741-6477"],"issn-type":[{"value":"0961-0006","type":"print"},{"value":"1741-6477","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,29]]}}}