{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T15:48:09Z","timestamp":1781020089431,"version":"3.54.1"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,2,1]],"date-time":"2022-02-01T00:00:00Z","timestamp":1643673600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,2,1]],"date-time":"2022-02-01T00:00:00Z","timestamp":1643673600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Besides Boolean retrieval with medical subject headings (MeSH), PubMed provides users with an alternative way called \u201cRelated Articles\u201d to access and collect relevant documents based on semantic similarity. To explore the functionality more efficiently and more accurately, we proposed an improved algorithm by measuring the semantic similarity of PubMed citations based on the MeSH-concept network model.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>Three article similarity networks are obtained using MeSH-concept random walk with restart (MCRWR), MeSH random walk with restart (MRWR) and PubMed related article (PMRA) respectively. The area under receiver operating characteristic (ROC) curve of MCRWR, MRWR and PMRA is 0.93, 0.90, and 0.67 respectively. Precisions of MCRWR and MRWR under various similarity thresholds are higher than that of PMRA. Mean value of P5 of MCRWR is 0.742, which is much higher than those of MRWR (0.692) and PMRA (0.223). In the article semantic similarity network of \u201cGenes &amp; Function of organ &amp; Disease\u201d based on MCRWR algorithm, four topics are identified according to golden standards.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>MeSH-concept random walk with restart algorithm has better performance in constructing article semantic similarity network, which can reveal the implicitly semantic association between documents. The efficiency and accuracy of retrieving semantic-related documents have been improved a lot.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-022-04578-1","type":"journal-article","created":{"date-parts":[[2022,2,1]],"date-time":"2022-02-01T08:03:09Z","timestamp":1643702589000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["MCRWR: a new method to measure the similarity of documents based on semantic network"],"prefix":"10.1186","volume":"23","author":[{"given":"Xianwei","family":"Pan","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Peng","family":"Huang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Shan","family":"Li","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lei","family":"Cui","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2022,2,1]]},"reference":[{"key":"4578_CR1","unstructured":"PubMed Overview. https:\/\/pubmed.ncbi.nlm.nih.gov\/about\/. Accessed 20 Mar 2021."},{"key":"4578_CR2","doi-asserted-by":"publisher","first-page":"204","DOI":"10.1016\/j.jbi.2015.07.015","volume":"57","author":"LJ Garcia Castro","year":"2015","unstructured":"Garcia Castro LJ, Berlanga R, Garcia A. In the pursuit of a semantic similarity metric based on UMLS annotations for articles in PubMed Central Open Access. J Biomed Inform. 2015;57:204\u201318.","journal-title":"J Biomed Inform"},{"issue":"4","key":"4578_CR3","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1002\/asi.4630240406","volume":"24","author":"H Small","year":"1973","unstructured":"Small H. Co-citation in the scientific literature: a new measure of the relationship between two documents. J Am Soc Inform Sci. 1973;24(4):265\u20139.","journal-title":"J Am Soc Inform Sci"},{"issue":"2","key":"4578_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3440755","volume":"54","author":"D Chandrasekaran","year":"2021","unstructured":"Chandrasekaran D, Mago V. Evolution of semantic similarity\u2014a survey. ACM Comput Surv. 2021;54(2):1\u201337.","journal-title":"ACM Comput Surv"},{"issue":"3","key":"4578_CR5","doi-asserted-by":"publisher","first-page":"e18029","DOI":"10.1371\/journal.pone.0018029","volume":"6","author":"KW Boyack","year":"2011","unstructured":"Boyack KW, Newman D, Duhon RJ, Klavans R, Patek M, Biberstine JR, Schijvenaars B, Skupin A, Ma N, B\u00f6rner K. Clustering more than two million biomedical publications: comparing the accuracies of nine text-based similarity approaches. PLoS ONE. 2011;6(3):e18029.","journal-title":"PLoS ONE"},{"key":"4578_CR6","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1016\/0306-4573(88)90021-0","volume":"24","author":"G Salton","year":"1988","unstructured":"Salton G, Buckley C. Term-weighting approaches in automatic text retrieval. Inf Process Manag. 1988;24:513\u201323.","journal-title":"Inf Process Manag"},{"key":"4578_CR7","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9","volume":"41","author":"S Deerwester","year":"1990","unstructured":"Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R. Indexing by latent semantic analysis. J Am Soc Inf Sci. 1990;41:391\u2013407.","journal-title":"J Am Soc Inf Sci"},{"key":"4578_CR8","first-page":"993","volume":"3","author":"DM Blei","year":"2003","unstructured":"Blei DM, Ng AY, Jordan MI. Latent Dirichlet allocation. J Mach Learn. 2003;3:993\u20131022.","journal-title":"J Mach Learn"},{"key":"4578_CR9","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1016\/S0306-4573(00)00015-7","volume":"36","author":"K Sparck Jones","year":"2000","unstructured":"Sparck Jones K, Walker S, Robertson SE. A probabilistic model of information retrieval: development and comparative experiments Part 1. Inf Process Manag. 2000;36:779\u2013808.","journal-title":"Inf Process Manag"},{"key":"4578_CR10","doi-asserted-by":"publisher","first-page":"809","DOI":"10.1016\/S0306-4573(00)00016-9","volume":"36","author":"K Sparck Jones","year":"2000","unstructured":"Sparck Jones K, Walker S, Robertson SE. A probabilistic model of information retrieval: development and comparative experiments Part 2. Inf Process Manag. 2000;36:809\u201340.","journal-title":"Inf Process Manag"},{"key":"4578_CR11","doi-asserted-by":"publisher","first-page":"423","DOI":"10.1186\/1471-2105-8-423","volume":"8","author":"J Lin","year":"2007","unstructured":"Lin J, Wilbur WJ. PubMed related articles: a probabilistic topic-based model for content similarity. BMC Bioinf. 2007;8:423.","journal-title":"BMC Bioinf"},{"key":"4578_CR12","first-page":"114","volume":"51","author":"F Rogers","year":"1963","unstructured":"Rogers F. Medical subject headings. Bull Med Libr Assoc. 1963;51:114\u20136.","journal-title":"Bull Med Libr Assoc"},{"key":"4578_CR13","doi-asserted-by":"crossref","unstructured":"Bodenreider O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2004;32(Database issue):D267\u201370.","DOI":"10.1093\/nar\/gkh061"},{"issue":"6","key":"4578_CR14","first-page":"48","volume":"34","author":"XW Pan","year":"2013","unstructured":"Pan XW, Yang Y, Cui L. Research review of scientific paper network and conception of constructing paper similarity network. J Med Inf. 2013;34(6):48\u201354.","journal-title":"J Med Inf"},{"key":"4578_CR15","unstructured":"Pan XW. Comparison and evaluation of content and semantic similarity article network construction methods. China Medical University, 2014."},{"key":"4578_CR16","doi-asserted-by":"publisher","first-page":"175","DOI":"10.4137\/BBI.S35237","volume":"9","author":"A Suratanee","year":"2015","unstructured":"Suratanee A, Plaimas K. DDA: a novel network-based scoring method to identify disease-disease associations. Bioinform Biol Insights. 2015;9:175\u201386.","journal-title":"Bioinform Biol Insights"},{"issue":"8","key":"4578_CR17","doi-asserted-by":"publisher","first-page":"2074","DOI":"10.1039\/C3MB70608G","volume":"10","author":"J Sun","year":"2014","unstructured":"Sun J, Shi H, Wang Z, Zhang C, Liu L, Wang L, He W, Hao D, Liu S, Zhou M. Inferring novel lncRNA-disease associations based on a random walk model of a lncRNA functional similarity network. Mol Biosyst. 2014;10(8):2074\u201381.","journal-title":"Mol Biosyst"},{"key":"4578_CR18","unstructured":"Lov\u00e1sz L. Random walks on graphs: a survey. Combinatorics. 1996: 353\u2013398."},{"key":"4578_CR19","first-page":"14","volume":"2006","author":"WR Hersh","year":"2005","unstructured":"Hersh WR, Bhupatiraju RTTREC. Genomics track overview. Trec Proc. 2005;2006:14\u201325.","journal-title":"Trec Proc"},{"key":"4578_CR20","unstructured":"Rijsbergen C, Robertson SE, Porter MF. New models in probabilistic information retrieval. 1980."},{"key":"4578_CR21","first-page":"247","volume":"45","author":"R Berlanga","year":"2010","unstructured":"Berlanga R, Nebot V, Jimenez E. Semantic annotation of biomedical texts through concept retrieval. Procesamiento Del Lenguaje Natural. 2010;45:247\u201350.","journal-title":"Procesamiento Del Lenguaje Natural"},{"key":"4578_CR22","unstructured":"MetaMap\u2014a tool for recognizing UMLS concepts in text. https:\/\/lhncbc.nlm.nih.gov\/ii\/tools\/MetaMap.html. Accessed 8 Oct 2021."},{"key":"4578_CR23","unstructured":"Jiang JJ, Conrath DW. Semantic similarity based on corpus statistics and lexical taxonomy. Rocling. 1997:11512\u20130."},{"issue":"15","key":"4578_CR24","doi-asserted-by":"publisher","first-page":"1944","DOI":"10.1093\/bioinformatics\/btp338","volume":"25","author":"SF Zhu","year":"2009","unstructured":"Zhu SF, Zeng J, Mamitsuka H. Enhancing MEDLINE document clustering by incorporating MeSH semantic similarity. Bioinformatics. 2009;25(15):1944\u201351.","journal-title":"Bioinformatics"},{"issue":"10","key":"4578_CR25","doi-asserted-by":"publisher","first-page":"1274","DOI":"10.1093\/bioinformatics\/btm087","volume":"23","author":"JZ Wang","year":"2007","unstructured":"Wang JZ, Du Z, Payattakool R, Yu PS, Chen CF. A new method to measure the semantic similarity of GO terms. Bioinformatics. 2007;23(10):1274\u201381.","journal-title":"Bioinformatics"},{"issue":"4","key":"4578_CR26","doi-asserted-by":"publisher","first-page":"1118","DOI":"10.1073\/pnas.0706851105","volume":"105","author":"M Rosvall","year":"2008","unstructured":"Rosvall M, Bergstrom CT. Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci USA. 2008;105(4):1118\u201323.","journal-title":"Proc Natl Acad Sci USA"},{"key":"4578_CR27","unstructured":"Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal. 2006:1695."},{"key":"4578_CR28","unstructured":"Team R. R: A language and environment for statistical computing. 2013. Computing. 2011;1:12\u201321."},{"key":"4578_CR29","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1140\/epjst\/e2010-01179-1","volume":"178","author":"M Rosvall","year":"2009","unstructured":"Rosvall M, Axelsson D, Bergstrom CT. The map equation. Eur Phys J Spec Top. 2009;178:13.","journal-title":"Eur Phys J Spec Top"},{"issue":"6","key":"4578_CR30","doi-asserted-by":"publisher","first-page":"1542002","DOI":"10.1142\/S0219720015420020","volume":"13","author":"J Zhou","year":"2015","unstructured":"Zhou J, Shui Y, Peng S, Li X, Mamitsuka H, Zhu S. MeSHSim: an R\/Bioconductor package for measuring semantic similarity over MeSH headings and MEDLINE documents. J Bioinform Comput Biol. 2015;13(6):1542002.","journal-title":"J Bioinform Comput Biol"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04578-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04578-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04578-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,2,1]],"date-time":"2022-02-01T08:04:25Z","timestamp":1643702665000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04578-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,1]]},"references-count":30,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["4578"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04578-1","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,1]]},"assertion":[{"value":"29 April 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 January 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 February 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"56"}}