{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T19:35:40Z","timestamp":1754163340014,"version":"3.41.2"},"reference-count":24,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,7,29]],"date-time":"2025-07-29T00:00:00Z","timestamp":1753747200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,7,29]],"date-time":"2025-07-29T00:00:00Z","timestamp":1753747200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100012320","name":"Otto-von-Guericke-Universit\u00e4t Magdeburg","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100012320","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Rapid extraction and visualization of cell-specific gene expression is important for automatic cell type annotation, e.g. in single cell analysis. There is an emerging field in which tools such as curated databases or machine learning methods are used to support cell type annotation. However, complementing approaches to efficiently incorporate the latest knowledge of free-text articles from literature databases, such as PubMed, are understudied.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>This work introduces the PubMed Gene\/Cell type-Relation Atlas (PuMA) which provides a local, easy-to-use web-interface to facilitate literature-driven cell type annotation. It utilizes a pretrained machine learning based named entity recognition model in order to extract gene and cell type concepts from PubMed, links biomedical ontologies, and suggests gene to cell type relations based on a ranking score. It includes a search tool for genes and cell types, additionally providing an interactive graph visualization for exploring cross-relations. Each result is fully traceable by linking the relevant PubMed articles.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>This work enables researchers to analyse and automatize cell type annotation based on PubMed articles. It complements manual curated marker gene databases and enables interactive visualizations. The evaluation shows that PuMA is competitive against an extensive manual curated database across three gold standard datasets and two species\u2014mouse and human. The software framework is freely available and enables regular article imports for incremental knowledge updates.GitLab: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/imigitlab.uni-muenster.de\/published\/PuMA\/\" ext-link-type=\"uri\">https:\/\/imigitlab.uni-muenster.de\/published\/PuMA\/<\/jats:ext-link>\n            <\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/s12859-025-06236-8","type":"journal-article","created":{"date-parts":[[2025,7,29]],"date-time":"2025-07-29T16:13:05Z","timestamp":1753805585000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["PuMA: PubMed gene\/cell type-relation Atlas"],"prefix":"10.1186","volume":"26","author":[{"given":"Lucas","family":"Bickmann","sequence":"first","affiliation":[]},{"given":"Sarah","family":"Sandmann","sequence":"additional","affiliation":[]},{"given":"Carolin","family":"Walter","sequence":"additional","affiliation":[]},{"given":"Julian","family":"Varghese","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,7,29]]},"reference":[{"issue":"2","key":"6236_CR1","doi-asserted-by":"publisher","first-page":"961","DOI":"10.1016\/j.csbj.2021.01.015","volume":"19","author":"G Pasquini","year":"2021","unstructured":"Pasquini G, Rojo Arias JE, Sch\u00e4fer P, Busskamp V. Automated methods for cell type annotation on scRNA-seq data. Comput Struct Biotechnol J. 2021;19(2):961\u20139. https:\/\/doi.org\/10.1016\/j.csbj.2021.01.015.","journal-title":"Comput Struct Biotechnol J"},{"key":"6236_CR2","doi-asserted-by":"publisher","first-page":"870","DOI":"10.1093\/nar\/gkac947","volume":"251","author":"C Hu","year":"2022","unstructured":"Hu C, et al. Cell Marker 2.0: an upyeard database of manually curated cell markers in human\/mouse and web tools based on scRNA-seq data. Nucleic Acids Res. 2022;251:870\u20136. https:\/\/doi.org\/10.1093\/nar\/gkac947.","journal-title":"Nucleic Acids Res"},{"key":"6236_CR3","doi-asserted-by":"publisher","unstructured":"Franz\u00e9n O, Bj\u00f6rkegren GL-M, JLM. PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data. Database: The Journal of Biological Databases and Curation. 2019-04-05;2019:baz046. https:\/\/doi.org\/10.1093\/database\/baz046.","DOI":"10.1093\/database\/baz046"},{"key":"6236_CR4","doi-asserted-by":"publisher","first-page":"5855","DOI":"10.7150\/thno.27285","volume":"8","author":"M Li","year":"2018","unstructured":"Li M, et al. A circular transcript of ncx1 gene mediates ischemic myocardial injury by targeting mir-133a-3p. Theranostics. 2018;8:5855\u201369. https:\/\/doi.org\/10.7150\/thno.27285.","journal-title":"Theranostics"},{"key":"6236_CR5","doi-asserted-by":"publisher","first-page":"864","DOI":"10.1161\/CIRCULATIONAHA.118.038944","volume":"140","author":"K van Duijvenboden","year":"2019","unstructured":"van Duijvenboden K, et al. Conserved NPPB+ border zone switches from MEF2- to AP-1-driven gene program. Circulation. 2019;140:864\u201379. https:\/\/doi.org\/10.1161\/CIRCULATIONAHA.118.038944.","journal-title":"Circulation"},{"key":"6236_CR6","doi-asserted-by":"publisher","first-page":"1393","DOI":"10.1093\/bioinformatics\/btab834","volume":"38","author":"S Mao","year":"2022","unstructured":"Mao S, Zhang Y, Seelig G, Kannan S. CellMeSH probabilistic cell-type identification using indexed literature. Bioinformatics (Oxford England). 2022;38:1393\u2013402. https:\/\/doi.org\/10.1093\/bioinformatics\/btab834.","journal-title":"Bioinformatics (Oxford England)"},{"key":"6236_CR7","doi-asserted-by":"publisher","first-page":"D26","DOI":"10.1093\/nar\/gkl993","volume":"35","author":"D Maglott","year":"2007","unstructured":"Maglott D, Ostell J, Pruitt KD, Tatusova T. Entrez gene: gene-centered information at NCBI. Nucleic Acids Res. 2007;35:D26-31. https:\/\/doi.org\/10.1093\/nar\/gkl993.","journal-title":"Nucleic Acids Res"},{"key":"6236_CR8","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1186\/s13326-016-0088-7","volume":"7","author":"AD Diehl","year":"2016","unstructured":"Diehl AD, et al. The cell ontology 2016: enhanced content, modularization, and ontology interoperability. J Biomed Semant. 2016;7:44. https:\/\/doi.org\/10.1186\/s13326-016-0088-7.","journal-title":"J Biomed Semant"},{"key":"6236_CR9","doi-asserted-by":"publisher","first-page":"W540","DOI":"10.1093\/nar\/gkae235","volume":"52","author":"C-H Wei","year":"2024","unstructured":"Wei C-H, et al. An ai-powered literature resource for unlocking biomedical knowledge. Pubtator 3.0. Nucleic Acids Res. 2024;52:W540-6. https:\/\/doi.org\/10.1093\/nar\/gkae235.","journal-title":"Nucleic Acids Res"},{"key":"6236_CR10","doi-asserted-by":"publisher","DOI":"10.3389\/fcell.2020.00673","author":"N Perera","year":"2020","unstructured":"Perera N, Dehmer M, Emmert-Streib F. Named entity recognition and relation detection for biomedical information extraction. Front Cell Develop Biol. 2020. https:\/\/doi.org\/10.3389\/fcell.2020.00673.","journal-title":"Front Cell Develop Biol"},{"key":"6236_CR11","doi-asserted-by":"publisher","DOI":"10.1016\/j.jhep.2023.07.028","author":"J Varghese","year":"2023","unstructured":"Varghese J, Chapiro J. ChatGPT: the transformative influence of generative AI on science and healthcare. J Hepatol. 2023. https:\/\/doi.org\/10.1016\/j.jhep.2023.07.028.","journal-title":"J Hepatol"},{"key":"6236_CR12","doi-asserted-by":"publisher","unstructured":"Touvron H, et al. Llama: Open and efficient foundation language models (2023). https:\/\/doi.org\/10.48550\/arXiv.2302.13971.","DOI":"10.48550\/arXiv.2302.13971"},{"key":"6236_CR13","doi-asserted-by":"publisher","first-page":"vbac035","DOI":"10.1093\/bioadv\/vbac035","volume":"2","author":"W Gu","year":"2022","unstructured":"Gu W, et al. MarkerGenie: an NLP-enabled text-mining system for biomedical entity relation extraction. Bioinform Adv. 2022;2:vbac035. https:\/\/doi.org\/10.1093\/bioadv\/vbac035.","journal-title":"Bioinform Adv"},{"key":"6236_CR14","doi-asserted-by":"publisher","first-page":"4837","DOI":"10.1093\/bioinformatics\/btac598","volume":"38","author":"M Sung","year":"2022","unstructured":"Sung M, et al. BERN2: an advanced neural biomedical named entity recognition and normalization tool. Bioinformatics. 2022;38:4837\u20139. https:\/\/doi.org\/10.1093\/bioinformatics\/btac598.","journal-title":"Bioinformatics"},{"key":"6236_CR15","unstructured":"Kans J. Entrez direct: E-utilities on the unix command line (2023-08-22). https:\/\/www.ncbi.nlm.nih.gov\/books\/NBK179288\/."},{"key":"6236_CR16","doi-asserted-by":"publisher","first-page":"D267","DOI":"10.1093\/nar\/gkh061","volume":"32","author":"O Bodenreider","year":"2004","unstructured":"Bodenreider O. The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2004;32:D267\u201370. https:\/\/doi.org\/10.1093\/nar\/gkh061.","journal-title":"Nucleic Acids Res"},{"key":"6236_CR17","doi-asserted-by":"publisher","first-page":"282","DOI":"10.3233\/978-1-61499-678-1-282","volume":"228","author":"J Varghese","year":"2016","unstructured":"Varghese J, et al. Key data elements in myeloid leukemia. Stud Health Technol Inform. 2016;228:282\u20136. https:\/\/doi.org\/10.3233\/978-1-61499-678-1-282.","journal-title":"Stud Health Technol Inform"},{"key":"6236_CR18","unstructured":"Jupp S, Burdett T, Leroy C, Parkinson HE. A new ontology lookup service at EMBL-EBI. SWAT4LS. 2015;2:118\u2013119."},{"key":"6236_CR19","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1016\/S0306-4573(02)00021-3","volume":"39","author":"A Aizawa","year":"2003","unstructured":"Aizawa A. An information-theoretic perspective of TF\u2013IDF measures. Inform Process Manag. 2003;39:45\u201365. https:\/\/doi.org\/10.1016\/S0306-4573(02)00021-3.","journal-title":"Inform Process Manag"},{"key":"6236_CR20","doi-asserted-by":"publisher","first-page":"980","DOI":"10.1016\/j.ins.2008.11.017","volume":"179","author":"U Dogrusoz","year":"2009","unstructured":"Dogrusoz U, Giral E, Cetintas A, Civril A, Demir E. A layout algorithm for undirected compound graphs. Inf Sci. 2009;179:980\u201394. https:\/\/doi.org\/10.1016\/j.ins.2008.11.017.","journal-title":"Inf Sci"},{"key":"6236_CR21","doi-asserted-by":"publisher","unstructured":"Consortium TM, et al. Single-cell transcriptomics of 20 mouse organs creates a tabula muris. Nature. 2018;562:367\u201372. https:\/\/doi.org\/10.1038\/s41586-018-0590-4.","DOI":"10.1038\/s41586-018-0590-4"},{"key":"6236_CR22","doi-asserted-by":"publisher","first-page":"1091","DOI":"10.1016\/j.cell.2018.02.001","volume":"172","author":"X Han","year":"2018","unstructured":"Han X, et al. Mapping the mouse cell atlas by microwell-seq. Cell. 2018;172:1091-1107.e17. https:\/\/doi.org\/10.1016\/j.cell.2018.02.001.","journal-title":"Cell"},{"key":"6236_CR23","doi-asserted-by":"publisher","first-page":"14049","DOI":"10.1038\/ncomms14049","volume":"8","author":"GXY Zheng","year":"2017","unstructured":"Zheng GXY, et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8:14049. https:\/\/doi.org\/10.1038\/ncomms14049.","journal-title":"Nat Commun"},{"key":"6236_CR24","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1007\/BF01386390","volume":"1","author":"EW Dijkstra","year":"1959","unstructured":"Dijkstra EW. A note on two problems in connexion with graphs. Numer Math. 1959;1:269\u201371. https:\/\/doi.org\/10.1007\/BF01386390.","journal-title":"Numer Math"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-025-06236-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-025-06236-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-025-06236-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,31]],"date-time":"2025-07-31T02:13:29Z","timestamp":1753928009000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06236-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,29]]},"references-count":24,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["6236"],"URL":"https:\/\/doi.org\/10.1186\/s12859-025-06236-8","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2025,7,29]]},"assertion":[{"value":"7 April 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 July 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 July 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"201"}}