{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T00:02:56Z","timestamp":1774137776284,"version":"3.50.1"},"reference-count":27,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T00:00:00Z","timestamp":1772409600000},"content-version":"vor","delay-in-days":2,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,2,28]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Enrichment analysis across multiple databases often results in a high level of redundancy due to overlapping terms, complicating the interpretation of biological data. To address this, we developed SummArIzeR, an R package to cluster and annotate enrichment results across multiple databases, enabling fast, intuitive interpretation and comparison across multiple conditions. SummArIzeR clusters enrichment results based on shared genes, calculates a pooled P-value for each cluster and facilitates the cluster annotation using large-language models. It further allows an easily interpretable visualization of the results.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Compared to existing tools, SummArIzeR provides unbiased and fast cluster annotation using large language models. We demonstrate that SummArIzeR achieves clustering comparable to manual curation while offering superior grouping based on shared underlying genes.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The SummArIzeR package is available as an open-source R package, with a comprehensive user manual provided in its GitHub repository: https:\/\/github.com\/bonellilab\/SummArIzeR.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btag102","type":"journal-article","created":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T12:44:02Z","timestamp":1772196242000},"source":"Crossref","is-referenced-by-count":0,"title":["SummArIzeR: simplifying cross-database enrichment result clustering and annotation via large language models"],"prefix":"10.1093","volume":"42","author":[{"given":"Marie","family":"Brinkmann","sequence":"first","affiliation":[{"name":"Division of Rheumatology, Department of Internal Medicine III, Medical University of Vienna , 1090 Vienna,","place":["Austria"]}]},{"given":"Michael","family":"Bonelli","sequence":"additional","affiliation":[{"name":"Division of Rheumatology, Department of Internal Medicine III, Medical University of Vienna , 1090 Vienna,","place":["Austria"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0892-7068","authenticated-orcid":false,"given":"Anela","family":"Tosevska","sequence":"additional","affiliation":[{"name":"Division of Rheumatology, Department of Internal Medicine III, Medical University of Vienna , 1090 Vienna,","place":["Austria"]}]}],"member":"286","published-online":{"date-parts":[[2026,2,28]]},"reference":[{"key":"2026032119465838900_btag102-B1","doi-asserted-by":"crossref","first-page":"1091","DOI":"10.1093\/bioinformatics\/btp101","article-title":"ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks","volume":"25","author":"Bindea","year":"2009","journal-title":"Bioinformatics"},{"key":"2026032119465838900_btag102-B2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s10115-012-0522-9","article-title":"Overlapping correlation clustering","volume":"35","author":"Bonchi","year":"2013","journal-title":"Knowl Inf Syst"},{"key":"2026032119465838900_btag102-B4","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/s13040-019-0204-1","article-title":"ViSEAGO: a Bioconductor package for clustering biological functions using Gene Ontology and semantic similarity","volume":"12","author":"Brionne","year":"2019","journal-title":"BioData Min"},{"key":"2026032119465838900_btag102-B5","doi-asserted-by":"crossref","first-page":"911861","DOI":"10.3389\/fmed.2022.911861","article-title":"Advances and trends in omics technology development","volume":"9","author":"Dai","year":"2022","journal-title":"Front Med"},{"key":"2026032119465838900_btag102-B6","doi-asserted-by":"crossref","first-page":"2340","DOI":"10.1038\/s41467-021-22544-y","article-title":"Epigenomic landscape of human colorectal cancer unveils an aberrant core of pan-cancer enhancers orchestrated by YAP\/TAZ","volume":"12","author":"Della Chiara","year":"2021","journal-title":"Nat Commun"},{"key":"2026032119465838900_btag102-B7","doi-asserted-by":"publisher","author":"Bhuva","DOI":"10.18129\/B9.BIOC.MSIGDB"},{"key":"2026032119465838900_btag102-B8","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1038\/s41592-024-02526-w","article-title":"Enhancing functional gene set analysis with large language models","volume":"22","year":"2025","journal-title":"Nat Methods"},{"key":"2026032119465838900_btag102-B9","doi-asserted-by":"crossref","first-page":"868067","DOI":"10.3389\/fimmu.2022.868067","article-title":"Computed tomography imaging-based radiogenomics analysis reveals hypoxia patterns and immunological characteristics in ovarian cancer","volume":"13","author":"Feng","year":"2022","journal-title":"Front Immunol"},{"key":"2026032119465838900_btag102-B10","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1007\/978-1-4612-4380-9_6","volume-title":"Breakthroughs in Statistics","author":"Fisher","year":"1992"},{"key":"2026032119465838900_btag102-B11","doi-asserted-by":"crossref","first-page":"2628","DOI":"10.1093\/bioinformatics\/btz931","article-title":"ShinyGO: a graphical gene-set enrichment tool for animals and plants","volume":"36","author":"Ge","year":"2020","journal-title":"Bioinformatics"},{"key":"2026032119465838900_btag102-B12","doi-asserted-by":"crossref","first-page":"2847","DOI":"10.1093\/bioinformatics\/btw313","article-title":"Complex heatmaps reveal patterns and correlations in multidimensional genomic data","volume":"32","author":"Gu","year":"2016","journal-title":"Bioinformatics"},{"key":"2026032119465838900_btag102-B13","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1016\/j.gpb.2022.04.008","article-title":"simplifyEnrichment: a bioconductor package for clustering and visualizing functional enrichment results","volume":"21","author":"Gu","year":"2023","journal-title":"Genomics Proteomics Bioinf"},{"key":"2026032119465838900_btag102-B14","doi-asserted-by":"crossref","first-page":"1551","DOI":"10.3390\/genes15121551","article-title":"From omics to multi-omics: a review of advantages and tradeoffs","volume":"15","author":"Hayes","year":"2024","journal-title":"Genes"},{"key":"2026032119465838900_btag102-B15","doi-asserted-by":"crossref","first-page":"1462","DOI":"10.1038\/s41592-024-02235-4","article-title":"Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis","volume":"21","author":"Hou","year":"2024","journal-title":"Nat Methods"},{"key":"2026032119465838900_btag102-B16","doi-asserted-by":"crossref","first-page":"10872","DOI":"10.1038\/s41598-018-28948-z","article-title":"GOATOOLS: a Python library for Gene Ontology analyses","volume":"8","author":"Klopfenstein","year":"2018","journal-title":"Sci Rep"},{"key":"2026032119465838900_btag102-B17","doi-asserted-by":"crossref","first-page":"1142","DOI":"10.1136\/ard-2022-223396","article-title":"Cytokine-directed cellular cross-talk imprints synovial pathotypes in rheumatoid arthritis","volume":"82","author":"Kugler","year":"2023","journal-title":"Ann Rheum Dis"},{"key":"2026032119465838900_btag102-B18","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1080\/01621459.2018.1554485","article-title":"Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures","volume":"115","author":"Liu","year":"2020","journal-title":"J Am Stat Assoc"},{"key":"2026032119465838900_btag102-B19","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1007\/11569596_31","volume-title":"Computer and Information Sciences\u2014ISCIS 2005","author":"Pons","year":"2005"},{"key":"2026032119465838900_btag102-B20","doi-asserted-by":"crossref","first-page":"3498","DOI":"10.1016\/j.csbj.2024.09.031","article-title":"Large language models and their applications in bioinformatics","volume":"23","author":"Sarumi","year":"2024","journal-title":"Comput Struct Biotechnol J"},{"key":"2026032119465838900_btag102-B21","doi-asserted-by":"crossref","first-page":"W216","DOI":"10.1093\/nar\/gkac194","article-title":"DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update)","volume":"50","author":"Sherman","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2026032119465838900_btag102-B22","author":"Stouffer","year":"1949"},{"key":"2026032119465838900_btag102-B23","doi-asserted-by":"crossref","first-page":"e21800","DOI":"10.1371\/journal.pone.0021800","article-title":"REVIGO summarizes and visualizes long lists of gene ontology terms","volume":"6","author":"Supek","year":"2011","journal-title":"PloS One"},{"key":"2026032119465838900_btag102-B24","doi-asserted-by":"crossref","first-page":"D330","DOI":"10.1093\/nar\/gky1055","article-title":"The Gene Ontology Resource: 20 years and still GOing strong","volume":"47","author":"The Gene Ontology Consortium","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2026032119465838900_btag102-B25","doi-asserted-by":"crossref","first-page":"e90","DOI":"10.1002\/cpz1.90","article-title":"Gene set knowledge discovery with Enrichr","volume":"1","author":"Xie","year":"2021","journal-title":"Curr Protoc"},{"key":"2026032119465838900_btag102-B26","doi-asserted-by":"crossref","first-page":"3292","DOI":"10.1038\/s41596-024-01020-z","article-title":"Using clusterProfiler to characterize multiomics data","volume":"19","author":"Xu","year":"2024","journal-title":"Nat Protoc"},{"key":"2026032119465838900_btag102-B27","doi-asserted-by":"crossref","first-page":"1836","DOI":"10.1111\/j.1420-9101.2011.02297.x","article-title":"Optimally weighted Z-test is a powerful method for combining probabilities in meta-analysis: Optimally weighted Z-test is a powerful method","volume":"24","author":"Zaykin","year":"2011","journal-title":"J Evol Biol"},{"key":"2026032119465838900_btag102-B28","doi-asserted-by":"crossref","first-page":"1523","DOI":"10.1038\/s41467-019-09234-6","article-title":"Metascape provides a biologist-oriented resource for the analysis of systems-level datasets","volume":"10","author":"Zhou","year":"2019","journal-title":"Nat Commun"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btag102\/67195590\/btag102.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/3\/btag102\/67195590\/btag102.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/3\/btag102\/67195590\/btag102.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T23:47:06Z","timestamp":1774136826000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btag102\/8503160"}},"subtitle":[],"editor":[{"given":"Peter","family":"Robinson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2026,2,28]]},"references-count":27,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2026,2,28]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btag102","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,3]]},"published":{"date-parts":[[2026,2,28]]},"article-number":"btag102"}}