{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T08:49:07Z","timestamp":1770972547186,"version":"3.50.1"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2019,10,29]],"date-time":"2019-10-29T00:00:00Z","timestamp":1572307200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2019,10,29]],"date-time":"2019-10-29T00:00:00Z","timestamp":1572307200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n              <jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Biomedical literature concerns a wide range of concepts, requiring controlled vocabularies to maintain a consistent terminology across different research groups. However, as new concepts are introduced, biomedical literature is prone to ambiguity, specifically in fields that are advancing more rapidly, for example, drug design and development. Entity linking is a text mining task that aims at linking entities mentioned in the literature to concepts in a knowledge base. For example, entity linking can help finding all documents that mention the same concept and improve relation extraction methods. Existing approaches focus on the local similarity of each entity and the global coherence of all entities in a document, but do not take into account the semantics of the domain.<\/jats:p>\n              <\/jats:sec>\n              <jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We propose a method, PPR-SSM, to link entities found in documents to concepts from domain-specific ontologies. Our method is based on Personalized PageRank (PPR), using the relations of the ontology to generate a graph of candidate concepts for the mentioned entities. We demonstrate how the knowledge encoded in a domain-specific ontology can be used to calculate the coherence of a set of candidate concepts, improving the accuracy of entity linking. Furthermore, we explore weighting the edges between candidate concepts using semantic similarity measures (SSM). We show how PPR-SSM can be used to effectively link named entities to biomedical ontologies, namely chemical compounds, phenotypes, and gene-product localization and processes.<\/jats:p>\n              <\/jats:sec>\n              <jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>We demonstrated that PPR-SSM outperforms state-of-the-art entity linking methods in four distinct gold standards, by taking advantage of the semantic information contained in ontologies. Moreover, PPR-SSM is a graph-based method that does not require training data. Our method improved the entity linking accuracy of chemical compounds by 0.1385 when compared to a method that does not use SSMs.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-019-3157-y","type":"journal-article","created":{"date-parts":[[2019,10,30]],"date-time":"2019-10-30T20:44:43Z","timestamp":1572468283000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["PPR-SSM: personalized PageRank and semantic similarity measures for entity linking"],"prefix":"10.1186","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7965-6536","authenticated-orcid":false,"given":"Andre","family":"Lamurias","sequence":"first","affiliation":[]},{"given":"Pedro","family":"Ruas","sequence":"additional","affiliation":[]},{"given":"Francisco M.","family":"Couto","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,10,29]]},"reference":[{"key":"3157_CR1","doi-asserted-by":"publisher","unstructured":"Shen W, Wang J, Han J. Entity linking with a knowledge base: Issues, techniques, and solutions. IEEE Trans Knowl Data Eng. 2015; 27(2):443\u201360. \n                    https:\/\/doi.org\/10.1109\/TKDE.2014.2327028\n                    \n                  .","DOI":"10.1109\/TKDE.2014.2327028"},{"key":"3157_CR2","volume-title":"Multi-source, Multilingual Information Extraction and Summarization","author":"D Rao","year":"2013","unstructured":"Rao D, McNamee P, Dredze M. Entity linking: Finding extracted entities in a knowledge base. In: Multi-source, Multilingual Information Extraction and Summarization. London: Springer: 2013. p. 93\u2013115."},{"key":"3157_CR3","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-31164-2","volume-title":"Data Matching: Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection","author":"P Christen","year":"2012","unstructured":"Christen P. Data Matching: Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection. London: Springer; 2012."},{"key":"3157_CR4","doi-asserted-by":"publisher","unstructured":"Kouki P, Pujara J, Marcum C, Koehly L, Getoor L. Collective Entity Resolution in Familial Networks. 2017 IEEE Int Conf Data Min (ICDM). 2017:227\u2013236. \n                    https:\/\/doi.org\/10.1109\/ICDM.2017.32\n                    \n                  .","DOI":"10.1109\/ICDM.2017.32"},{"key":"3157_CR5","doi-asserted-by":"publisher","unstructured":"Ran C, Shen W, Wang J, Zhu X. Domain-specific knowledge base enrichment using wikipedia tables. Proc IEEE Int Conf Data Min ICDM. 2016; 2016-January:349\u2013358. \n                    https:\/\/doi.org\/10.1109\/ICDM.2015.124\n                    \n                  .","DOI":"10.1109\/ICDM.2015.124"},{"key":"3157_CR6","doi-asserted-by":"publisher","unstructured":"Wang J, Tong W, Yu H, Li M, Ma X, Cai H, Hanratty T, Han J. Mining multi-aspect reflection of news events in twitter: Discovery, linking and presentation. In: 2015 IEEE International Conference on Data Mining: 2015. p. 429\u2013438. \n                    https:\/\/doi.org\/10.1109\/ICDM.2015.112\n                    \n                  .","DOI":"10.1109\/ICDM.2015.112"},{"key":"3157_CR7","doi-asserted-by":"publisher","unstructured":"Chan SK, Lam W, Yu X. A cascaded approach to biomedical named entity recognition using a unified model. In: Seventh IEEE International Conference on Data Mining (ICDM 2007): 2007. p. 93\u2013102. \n                    https:\/\/doi.org\/10.1109\/ICDM.2007.20\n                    \n                  .","DOI":"10.1109\/ICDM.2007.20"},{"key":"3157_CR8","doi-asserted-by":"publisher","unstructured":"Krallinger M, Rabal O, Louren\u00e7o A, Oyarzabal J, Valencia A. Information Retrieval and Text Mining Technologies for Chemistry. Chem Rev. 2017:6\u201300851. \n                    https:\/\/doi.org\/10.1021\/acs.chemrev.6b00851\n                    \n                  .","DOI":"10.1021\/acs.chemrev.6b00851"},{"key":"3157_CR9","doi-asserted-by":"publisher","unstructured":"Rodriguez-Esteban R. Biomedical text mining and its applications. PLoS Comput Biol. 2009; 5(12):1\u20135. \n                    https:\/\/doi.org\/10.1371\/journal.pcbi.1000597\n                    \n                  .","DOI":"10.1371\/journal.pcbi.1000597"},{"key":"3157_CR10","doi-asserted-by":"publisher","unstructured":"Garcia ACB, Ferraz IN, Pinto F. The role of domain ontology in text mining applications: The addminer project. In: Sixth IEEE International Conference on Data Mining - Workshops (ICDMW\u201906): 2006. p. 34\u20138. \n                    https:\/\/doi.org\/10.1109\/ICDMW.2006.157\n                    \n                  .","DOI":"10.1109\/ICDMW.2006.157"},{"issue":"D1","key":"3157_CR11","doi-asserted-by":"publisher","first-page":"865","DOI":"10.1093\/nar\/gkw1039","volume":"45","author":"S K\u00f6hler","year":"2016","unstructured":"K\u00f6hler S, Vasilevsky NA, Engelstad M, Foster E, McMurry J, Aym\u00e9 S, Baynam G, Bello SM, Boerkoel CF, Boycott KM, et al.The human phenotype ontology in 2017. Nucleic Acids Res. 2016; 45(D1):865\u201376.","journal-title":"Nucleic Acids Res"},{"key":"3157_CR12","doi-asserted-by":"publisher","unstructured":"Hastings J, Owen G, Dekker A, Ennis M, Kale N, Muthukrishnan V, Turner S, Swainston N, Mendes P, Steinbeck C. ChEBI in 2016: Improved services and an expanding collection of metabolites. Nucleic Acids Res. 2016; 44(D1):1214\u20139. \n                    https:\/\/doi.org\/10.1093\/nar\/gkv1031\n                    \n                  .","DOI":"10.1093\/nar\/gkv1031"},{"key":"3157_CR13","doi-asserted-by":"publisher","unstructured":"Hastings J, De Matos P, Dekker A, Ennis M, Harsha B, Kale N, Muthukrishnan V, Owen G, Turner S, Williams M, Steinbeck C. The ChEBI reference database and ontology for biologically relevant chemistry: Enhancements for 2013. Nucleic Acids Res. 2013; 41(D1):456\u201363. \n                    https:\/\/doi.org\/10.1093\/nar\/gks1146\n                    \n                  .","DOI":"10.1093\/nar\/gks1146"},{"issue":"1-7","key":"3157_CR14","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1016\/S0169-7552(98)00110-X","volume":"30","author":"S Brin","year":"1998","unstructured":"Brin S, Page L. The anatomy of a large-scale hypertextual web search engine. Comput Netw ISDN Syst. 1998; 30(1-7):107\u201317.","journal-title":"Comput Netw ISDN Syst"},{"key":"3157_CR15","doi-asserted-by":"crossref","unstructured":"Fogaras D, R\u00e1cz B. Towards Scaling Fully Personalized PageRank. Science. 2002:105\u201317.","DOI":"10.1007\/978-3-540-30216-2_9"},{"key":"3157_CR16","doi-asserted-by":"publisher","unstructured":"Sinha R, Mihalcea R. Unsupervised graph-based word sense disambiguation using measures of word semantic similarity. In: International Conference on Semantic Computing (ICSC 2007). IEEE: 2007. p. 363\u20139. \n                    https:\/\/doi.org\/10.1109\/icsc.2007.87\n                    \n                  .","DOI":"10.1109\/icsc.2007.87"},{"key":"3157_CR17","doi-asserted-by":"crossref","unstructured":"Alhelbawy A, Gaizauskas R. Graph ranking for collective named entity disambiguation. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers): 2014. p. 75\u201380.","DOI":"10.3115\/v1\/P14-2013"},{"key":"3157_CR18","doi-asserted-by":"publisher","unstructured":"Lamurias A, Couto F. Text mining for bioinformatics using biomedical literature In: Ranganathan S., Nakai K., Sch\u00f6nbach C., Gribskov M., editors. Encyclopedia of Bioinformatics and Computational Biology vol. 1. Oxford: Oxford: Elsevier: 2019. \n                    https:\/\/doi.org\/10.1016\/B978-0-12-809633-8.20409-3\n                    \n                  .","DOI":"10.1016\/B978-0-12-809633-8.20409-3"},{"key":"3157_CR19","unstructured":"Ratinov L, Roth D, Downey D, Anderson M. Local and global algorithms for disambiguation to wikipedia. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, HLT \u201911. Stroudsburg: Association for Computational Linguistics: 2011. p. 1375\u20131384. \n                    http:\/\/dl.acm.org\/citation.cfm?id=2002472.2002642\n                    \n                  ."},{"key":"3157_CR20","doi-asserted-by":"publisher","unstructured":"Radhakrishnan P, Talukdar P, Varma V. ELDEN: Improved Entity Linking Using Densified Knowledge Graphs; 2018. \n                    https:\/\/doi.org\/10.18653\/v1\/N18-1167\n                    \n                  .","DOI":"10.18653\/v1\/N18-1167"},{"key":"3157_CR21","unstructured":"Bunescu R, Pasca M. Using Encyclopedic Knowledge for Named Entity Disambiguation. Proc 11th Conf Eur Chapter Assoc Comput Linguist. 2006; April:3\u20137. \n                    https:\/\/www.aclweb.org\/anthology\/E06-1002\/\n                    \n                  ."},{"key":"3157_CR22","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1","author":"L Ratinov","year":"2011","unstructured":"Ratinov L, Roth D, Downey D, Anderson M. Local and global algorithms for disambiguation to wikipedia. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. Portland, Oregon: Association for Computational Linguistics: 2011. p. 1375\u20131384."},{"key":"3157_CR23","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing","author":"J Hoffart","year":"2011","unstructured":"Hoffart J, Yosef MA, Bordino I, F\u00fcrstenau H, Pinkal M, Spaniol M, Taneva B, Thater S, Weikum G. Robust disambiguation of named entities in text. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. Edinburgh: Association for Computational Linguistics: 2011. p. 782\u201392."},{"key":"3157_CR24","volume-title":"Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing","author":"X Cheng","year":"2013","unstructured":"Cheng X, Roth D. Relational inference for wikification. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Seattle: Association for Computational Linguistics: 2013. p. 1787\u201396."},{"key":"3157_CR25","doi-asserted-by":"publisher","unstructured":"Pershina M, He Y, Grishman R. Personalized page rank for named entity disambiguation. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Denver: Association for Computational Linguistics: 2015. p. 238\u201343. \n                    https:\/\/doi.org\/10.3115\/v1\/N15-1026\n                    \n                  .","DOI":"10.3115\/v1\/N15-1026"},{"key":"3157_CR26","volume-title":"Proceedings of the Thirtieth International Conference on Very Large Data bases-Volume 30","author":"A Balmin","year":"2004","unstructured":"Balmin A, Hristidis V, Papakonstantinou Y. Objectrank: Authority-based keyword search in databases. In: Proceedings of the Thirtieth International Conference on Very Large Data bases-Volume 30. Toronto: VLDB Endowment: 2004. p. 564\u2013575."},{"key":"3157_CR27","volume-title":"International Semantic Web Conference","author":"G Wu","year":"2008","unstructured":"Wu G, Li J, Feng L, Wang K. Identifying potentially important concepts and relations in an ontology. In: International Semantic Web Conference. Karlsruhe: Springer: 2008. p. 33\u201349."},{"key":"3157_CR28","doi-asserted-by":"publisher","unstructured":"Singla P, Domingos P. Entity resolution with markov logic. In: Sixth International Conference on Data Mining (ICDM\u201906): 2006. p. 572\u201382. \n                    https:\/\/doi.org\/10.1109\/ICDM.2006.65\n                    \n                  .","DOI":"10.1109\/ICDM.2006.65"},{"issue":"Suppl 2","key":"3157_CR29","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1186\/gb-2008-9-s2-s3","volume":"9","author":"A Morgan","year":"2008","unstructured":"Morgan A, Lu Z, Wang X, Cohen A, Fluck J, Ruch P, Divoli A, Fundel K, Leaman R, Hakenberg J, et al.Overview of BioCreative II gene normalization. Genome Biol. 2008; 9(Suppl 2):3.","journal-title":"Genome Biol"},{"issue":"8","key":"3157_CR30","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1186\/1471-2105-12-S8-S2","volume":"12","author":"Z Lu","year":"2011","unstructured":"Lu Z, Kao H-Y, Wei C-H, Huang M, Liu J, Kuo C-J, Hsu C-N, Tsai RT-H, Dai H-J, Okazaki N, et al.The gene normalization task in biocreative iii. BMC Bioinformatics. 2011; 12(8):2.","journal-title":"BMC Bioinformatics"},{"key":"3157_CR31","doi-asserted-by":"publisher","unstructured":"Tsuruoka Y, Miwa M, Hamamoto K, Tsujii J, Ananiadou S. Discovering and visualizing indirect associations between biomedical concepts. Bioinformatics. 2011; 27(13):111\u20139. \n                    https:\/\/doi.org\/10.1093\/bioinformatics\/btr214\n                    \n                  .","DOI":"10.1093\/bioinformatics\/btr214"},{"key":"3157_CR32","doi-asserted-by":"publisher","unstructured":"Smith L, Tanabe LK, Ando RJn, Kuo C-jJ, Chung I-fF, Hsu C-NN, Lin Y-sS, Klinger R, Friedrich CM, Ganchev K, Torii M, Liu H, Haddow B, Struble CA, Povinelli RJ, Vlachos A, Baumgartner WA, Hunter L, Carpenter B, Tsai RT-h, Dai H-J, Liu F, Chen Y, Sun C, Katrenko S, Adriaans P, Blaschke C, Torres R, Neves M, Nakov P, Divoli A, Ma\u00f1a-L\u00f3pez M, Mata J, Wilbur WJ, et al.Overview of BioCreative II gene mention recognition,. Genome Biol. 2008; 9 Suppl 2(Suppl 2):2. \n                    https:\/\/doi.org\/10.1186\/gb-2008-9-s2-s2\n                    \n                  .","DOI":"10.1186\/gb-2008-9-s2-s2"},{"key":"3157_CR33","doi-asserted-by":"crossref","unstructured":"Ferreira JD, In\u00e1cio B, Salek RM, Couto FM. Assessing public metabolomics metadata, towards improving quality. J Integr Bioinformatics. 2017; 14(4).","DOI":"10.1515\/jib-2017-0054"},{"issue":"1","key":"3157_CR34","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1186\/1472-6947-15-S1-S4","volume":"15","author":"JG Zheng","year":"2015","unstructured":"Zheng JG, Howsmon D, Zhang B, Hahn J, McGuinness D, Hendler J, Ji H. Entity linking for biomedical literature. BMC Med Inform Decis Making. 2015; 15(1):4.","journal-title":"BMC Med Inform Decis Making"},{"key":"3157_CR35","doi-asserted-by":"publisher","unstructured":"Lobo M, Lamurias A, Couto F. Identifying human phenotype terms by combining machine learning and validation rules. BioMed Res Int. 2017; 2017. \n                    https:\/\/doi.org\/10.1155\/2017\/8565739\n                    \n                  .","DOI":"10.1155\/2017\/8565739"},{"key":"3157_CR36","doi-asserted-by":"publisher","unstructured":"Bada M, Eckert M, Evans D, Garcia K, Shipley K, Sitnikov D, Baumgartner Jr WA, Cohen KB, Verspoor K, Blake JA, Hunter LE. Concept annotation in the CRAFT corpus. BMC Bioinformatics. 2012; 61(Suppl 13). \n                    https:\/\/doi.org\/10.1186\/1471-2105-13-161\n                    \n                  .","DOI":"10.1186\/1471-2105-13-161"},{"key":"3157_CR37","doi-asserted-by":"publisher","unstructured":"Boguslav M, Cohen KB, Jr. WAB, Hunter LE. Improving precision in concept normalization:566\u201377. \n                    https:\/\/doi.org\/10.1142\/9789813235533_0052\n                    \n                  .","DOI":"10.1142\/9789813235533_0052"},{"key":"3157_CR38","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1162\/tacl_a_00089","volume":"4","author":"C-T Tsai","year":"2016","unstructured":"Tsai C-T, Roth D. Concept grounding to multiple knowledge bases via indirect supervision. Trans Assoc Comput Linguist. 2016; 4:141\u201354.","journal-title":"Trans Assoc Comput Linguist"},{"key":"3157_CR39","volume-title":"International Joint Conference on Artificial Intelligence","author":"P Resnik","year":"1995","unstructured":"Resnik P. Using information content to evaluate semantic similarity in a taxonomy. In: International Joint Conference on Artificial Intelligence. Montreal: Citeseer: 1995. p. 448\u201353."},{"key":"3157_CR40","doi-asserted-by":"publisher","unstructured":"Couto F, Lamurias A. Encyclopedia of Bioinformatics and Computational Biology In: Ranganathan S., Nakai K., Sch\u00f6nbach C., Gribskov M., editors. Oxford: Oxford: Elsevier: 2019. p. 870\u20136. \n                    https:\/\/doi.org\/10.1016\/B978-0-12-809633-8.20401-9\n                    \n                  .","DOI":"10.1016\/B978-0-12-809633-8.20401-9"},{"key":"3157_CR41","volume-title":"ICML","author":"D Lin","year":"1998","unstructured":"Lin D, et al.An information-theoretic definition of similarity. In: ICML. Madison, Wisconsin: Citeseer: 1998. p. 296\u2013304."},{"key":"3157_CR42","unstructured":"Jiang JJ, Conrath DW. Semantic similarity based on corpus statistics and lexical taxonomy. In: Proceedings of the 10th Research on Computational Linguistics International Conference. Taipei: The Association for Computational Linguistics and Chinese Language Processing (ACLCLP): 1997. p. 19\u201333. \n                    http:\/\/www.aclweb.org\/anthology\/O97-1002\n                    \n                  ."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3157-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12859-019-3157-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-019-3157-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,10,28]],"date-time":"2020-10-28T00:09:12Z","timestamp":1603843752000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-019-3157-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,10,29]]},"references-count":42,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["3157"],"URL":"https:\/\/doi.org\/10.1186\/s12859-019-3157-y","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,10,29]]},"assertion":[{"value":"17 April 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 October 2019","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 October 2019","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"534"}}