{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:31Z","timestamp":1772138071281,"version":"3.50.1"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2025,3,18]],"date-time":"2025-03-18T00:00:00Z","timestamp":1742256000000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"European Molecular Biology Laboratory-European Bioinformatics Institute"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,3,29]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Summary<\/jats:title>\n                    <jats:p>The lit-OTAR framework, developed through a collaboration between Europe PMC and Open Targets, leverages deep learning to revolutionize drug discovery by extracting evidence from scientific literature for drug target identification and validation. This novel framework combines named entity recognition for identifying gene\/protein (target), disease, organism, and chemical\/drug within scientific texts, and entity normalization to map these entities to databases like Ensembl, Experimental Factor Ontology, and ChEMBL. Continuously operational, it has processed over 39 million abstracts and 4.5 million full-text articles and preprints to date, identifying more than 48.5 million unique associations that significantly help accelerate the drug discovery process and scientific research &amp;gt;29.9 m distinct target\u2013disease, 11.8\u2009m distinct target\u2013drug, and 8.3\u2009m distinct disease\u2013drug relationships.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The results are accessible through Europe PMC\u2019s SciLite web app (https:\/\/europepmc.org\/) and its annotations API (https:\/\/europepmc.org\/annotationsapi), as well as via the Open Targets Platform (https:\/\/platform.opentargets.org\/). The daily pipeline is available at https:\/\/github.com\/ML4LitS\/otar-maintenance, and the Open Targets ETL processes are available at https:\/\/github.com\/opentargets.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf113","type":"journal-article","created":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T08:22:19Z","timestamp":1741854139000},"source":"Crossref","is-referenced-by-count":8,"title":["Lit-OTAR framework for extracting biological evidences from literature"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9064-1965","authenticated-orcid":false,"given":"Santosh","family":"Tirunagari","sequence":"first","affiliation":[{"name":"Literature Services Team, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"given":"Shyamasree","family":"Saha","sequence":"additional","affiliation":[{"name":"Literature Services Team, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"given":"Aravind","family":"Venkatesan","sequence":"additional","affiliation":[{"name":"Literature Services Team, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"given":"Daniel","family":"Suveges","sequence":"additional","affiliation":[{"name":"Open Targets, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"given":"Miguel","family":"Carmona","sequence":"additional","affiliation":[{"name":"Open Targets, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4623-8642","authenticated-orcid":false,"given":"Annalisa","family":"Buniello","sequence":"additional","affiliation":[{"name":"Open Targets, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1857-278X","authenticated-orcid":false,"given":"David","family":"Ochoa","sequence":"additional","affiliation":[{"name":"Open Targets, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"given":"Johanna","family":"McEntyre","sequence":"additional","affiliation":[{"name":"Literature Services Team, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5806-6174","authenticated-orcid":false,"given":"Ellen","family":"McDonagh","sequence":"additional","affiliation":[{"name":"Open Targets, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3523-4408","authenticated-orcid":false,"given":"Melissa","family":"Harrison","sequence":"additional","affiliation":[{"name":"Literature Services Team, European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Wellcome Trust Genome Campus , Cambridge CB10 1SD,","place":["United Kingdom"]}]}],"member":"286","published-online":{"date-parts":[[2025,3,17]]},"reference":[{"key":"2025041602165855300_btaf113-B1","doi-asserted-by":"crossref","first-page":"W594","DOI":"10.1093\/nar\/gkz289","article-title":"LitSense: making sense of biomedical literature at sentence level","volume":"47","author":"Allot","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B2","doi-asserted-by":"publisher","first-page":"D1467","DOI":"10.1093\/nar\/gkae1128","article-title":"Open targets platform: facilitating therapeutic hypotheses building in drug discovery","volume":"53","author":"Buniello","year":"2025","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B3","doi-asserted-by":"publisher","first-page":"3539","DOI":"10.1093\/bioinformatics\/bty356","article-title":"D3NER: biomedical named entity recognition using CRF-biLSTM improved with fine-tuned embeddings of various linguistic information","volume":"34","author":"Dang","year":"2018","journal-title":"Bioinformatics"},{"key":"2025041602165855300_btaf113-B4","first-page":"4171","author":"Devlin"},{"key":"2025041602165855300_btaf113-B5","doi-asserted-by":"publisher","first-page":"D948","DOI":"10.1093\/nar\/gkae1071","article-title":"Ensembl 2025","volume":"53","author":"Dyer","year":"2025","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B6","author":"Fang","year":"2023"},{"key":"2025041602165855300_btaf113-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3458754","article-title":"Domain-specific language model pretraining for biomedical natural language processing","volume":"3","author":"Gu","year":"2022","journal-title":"ACM Trans Comput Healthcare"},{"key":"2025041602165855300_btaf113-B8","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/s13326-017-0131-3","article-title":"Literature evidence in open targets\u2014a target validation platform","volume":"8","author":"Kafkas","year":"2017","journal-title":"J Biomed Semantics"},{"key":"2025041602165855300_btaf113-B9","doi-asserted-by":"crossref","first-page":"3158","DOI":"10.1093\/bioinformatics\/bts591","article-title":"SemMedDB: a PubMed-scale repository of biomedical semantic predications","volume":"28","author":"Kilicoglu","year":"2012","journal-title":"Bioinformatics"},{"key":"2025041602165855300_btaf113-B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1758-2946-7-S1-S1","article-title":"The CHEMDNER corpus of chemicals and drugs and its annotation principles","volume":"7","author":"Krallinger","year":"2015","journal-title":"J Cheminf"},{"key":"2025041602165855300_btaf113-B11","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: a pre-trained biomedical language representation model for biomedical text mining","volume":"36","author":"Lee","year":"2020","journal-title":"Bioinformatics"},{"key":"2025041602165855300_btaf113-B12","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1109\/TKDE.2020.2981314","article-title":"A survey on deep learning for named entity recognition","volume":"34","author":"Li","year":"2022","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"2025041602165855300_btaf113-B13","doi-asserted-by":"publisher","first-page":"1112","DOI":"10.1093\/bioinformatics\/btq099","article-title":"Modeling sample variables with an experimental factor ontology","volume":"26","author":"Malone","year":"2010","journal-title":"Bioinformatics"},{"key":"2025041602165855300_btaf113-B14","author":"Mikolov"},{"key":"2025041602165855300_btaf113-B15","first-page":"39","article-title":"Distributional semantics resources for biomedical text processing","author":"Moen"},{"key":"2025041602165855300_btaf113-B16","doi-asserted-by":"crossref","first-page":"D1353","DOI":"10.1093\/nar\/gkac1046","article-title":"The next-generation open targets platform: reimagined, redesigned, rebuilt","volume":"51","author":"Ochoa","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B17","doi-asserted-by":"publisher","first-page":"D845","DOI":"10.1093\/nar\/gkz1021","article-title":"The DisGeNET knowledge platform for disease genomics: 2019 update","volume":"48","author":"Pi\u00f1ero","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B18","doi-asserted-by":"crossref","first-page":"D1668","DOI":"10.1093\/nar\/gkad1085","article-title":"Europe PMC in 2023","volume":"52","author":"Rosonovski","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B19","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1016\/j.jbi.2014.05.007","article-title":"Lessons learnt from the ddiextraction-2013 shared task","volume":"51","author":"Segura-Bedmar","year":"2014","journal-title":"J Biomed Inform"},{"key":"2025041602165855300_btaf113-B20","doi-asserted-by":"crossref","first-page":"25","DOI":"10.12688\/wellcomeopenres.10210.1","article-title":"SciLite: a platform for displaying text-mined annotations as a means to link research articles with biological data","volume":"1","author":"Venkatesan","year":"2016","journal-title":"Wellcome Open Res"},{"key":"2025041602165855300_btaf113-B21","doi-asserted-by":"publisher","first-page":"W587","DOI":"10.1093\/nar\/gkz389","article-title":"PubTator central: automated concept annotation for biomedical full text articles","volume":"47","author":"Wei","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B22","doi-asserted-by":"publisher","first-page":"W540","DOI":"10.1093\/nar\/gkae235","article-title":"PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge","volume":"52","author":"Wei","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2025041602165855300_btaf113-B23","doi-asserted-by":"publisher","DOI":"10.1038\/s41597-023-02617-x","volume-title":"Sci Data","author":"Yang","year":"2023"},{"key":"2025041602165855300_btaf113-B24","doi-asserted-by":"publisher","first-page":"D1180","DOI":"10.1093\/nar\/gkad1004","article-title":"The ChEMBL database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods","volume":"52","author":"Zdrazil","year":"2024","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf113\/62432066\/btaf113.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/4\/btaf113\/62432066\/btaf113.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/4\/btaf113\/62432066\/btaf113.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,16]],"date-time":"2025-04-16T02:17:04Z","timestamp":1744769824000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf113\/8082101"}},"subtitle":[],"editor":[{"given":"Zhiyong","family":"Lu","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,3,17]]},"references-count":24,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,3,29]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf113","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2024.03.06.583722","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,4]]},"published":{"date-parts":[[2025,3,17]]},"article-number":"btaf113"}}