{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,22]],"date-time":"2025-12-22T09:47:00Z","timestamp":1766396820247,"version":"3.48.0"},"publisher-location":"New York, NY, USA","reference-count":14,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,9,19]]},"DOI":"10.1145\/3774976.3774993","type":"proceedings-article","created":{"date-parts":[[2025,12,22]],"date-time":"2025-12-22T09:41:32Z","timestamp":1766396492000},"page":"98-102","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["From Thousands to a Handful: Streamlining Breast Cancer Pathway Enrichment with LLMs"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3605-0398","authenticated-orcid":false,"given":"Mateusz","family":"Kania","sequence":"first","affiliation":[{"name":"Department of Applied Informatics, Silesian University of Technology, Gliwice, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2895-7969","authenticated-orcid":false,"given":"Joanna","family":"Zyla","sequence":"additional","affiliation":[{"name":"Department of Data Science and Engineering, Silesian University of Technology, Gliwice, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-2098-2232","authenticated-orcid":false,"given":"Karolina","family":"Widzisz","sequence":"additional","affiliation":[{"name":"Department of Computer Graphics, Vision and Digital Systems, Silesian University of Technology, Gliwice, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1793-9546","authenticated-orcid":false,"given":"Andrzej","family":"Polanski","sequence":"additional","affiliation":[{"name":"Department of Computer Graphics, Vision and Digital 
Systems, Silesian University of Technology, Gliwice, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,12,22]]},"reference":[{"key":"e_1_3_3_1_2_2","doi-asserted-by":"publisher","unstructured":"Minkyung Baek Frank DiMaio Ivan Anishchenko Justas Dauparas Sergey Ovchinnikov Gyu\u00a0Rie Lee Jue Wang Qi Cong Lisa\u00a0N. Kinch R.\u00a0Dusty Schaeffer Cloe Mill\u00e1n Heeseok Park Claire\u00a0M. Adams Carla\u00a0R.K. Glassman Andy DeGiovanni Jose\u00a0H. Pereira Andriy\u00a0V. Rodrigues Adriaan\u00a0A. Van\u00a0Dijk Anselm\u00a0C. Ebrecht David\u00a0J. Opperman Tam\u00e1s Sagmeister Christiane Buhlheller Tea Pavkov-Keller Madhumitha\u00a0K. Rathinaswamy Udit Dalwadi Christopher\u00a0K. Yip John\u00a0E. Burke Ka-Yaw\u00a0A. Garcia Nick\u00a0V. Grishin Paul\u00a0D. Adams Randy\u00a0J. Read and David Baker. 2021. Accurate prediction of protein structures and interactions using a three-track neural network. Science 373 6557 (2021) 871\u2013876. 10.1126\/science.abj8754","DOI":"10.1126\/science.abj8754"},{"key":"e_1_3_3_1_3_2","doi-asserted-by":"publisher","unstructured":"Cancer Genome Atlas Network. 2012. Comprehensive molecular portraits of human breast tumours. Nature 490 7418 (2012) 61\u201370. 10.1038\/nature11412","DOI":"10.1038\/nature11412"},{"key":"e_1_3_3_1_4_2","volume-title":"org.Hs.eg.db: Genome-wide annotation for Human","author":"Carlson Marc","year":"2025","unstructured":"Marc Carlson. 2025. org.Hs.eg.db: Genome-wide annotation for Human. Bioconductor, Seattle, WA. https:\/\/bioconductor.org\/packages\/org.Hs.eg.db R package version 3.21.0."},{"key":"e_1_3_3_1_5_2","doi-asserted-by":"crossref","unstructured":"Mengzhou Hu Sahar Alkhairy Ingoo Lee Rudolf\u00a0T Pillich Dylan Fong Kevin Smith Robin Bachelder Trey Ideker and Dexter Pratt. 2025. Evaluation of large language models for discovery of gene set function. 
Nature methods 22 1 (2025) 82\u201391.","DOI":"10.1038\/s41592-024-02525-x"},{"key":"e_1_3_3_1_6_2","doi-asserted-by":"crossref","unstructured":"Marcin\u00a0P Joachimiak J\u00a0Harry Caufield Nomi\u00a0L Harris Hyeongsik Kim and Christopher\u00a0J Mungall. 2024. Gene set summarization using large language models. ArXiv (2024) arXiv\u20132305.","DOI":"10.7490\/f1000research.1120059.1"},{"key":"e_1_3_3_1_7_2","doi-asserted-by":"publisher","unstructured":"John Jumper Richard Evans Alexander Pritzel Tim Green Michael Figurnov Olaf Ronneberger Kathryn Tunyasuvunakool Russ Bates Augustin \u017d\u00eddek Anna Potapenko et\u00a0al. 2021. Highly accurate protein structure prediction with AlphaFold. Nature 596 7873 (2021) 583\u2013589. 10.1038\/s41586-021-03819-2","DOI":"10.1038\/s41586-021-03819-2"},{"key":"e_1_3_3_1_8_2","doi-asserted-by":"publisher","unstructured":"Purvesh Khatri Marina Sirota and Atul\u00a0J. Butte. 2012. Ten Years of Pathway Analysis: Current Approaches and Outstanding Challenges. PLOS Computational Biology 8 2 (2012) e1002375. 10.1371\/journal.pcbi.1002375","DOI":"10.1371\/journal.pcbi.1002375"},{"key":"e_1_3_3_1_9_2","unstructured":"OpenAI. 2024. GPT-3.5-turbo Model Specification. https:\/\/platform.openai.com\/docs\/models\/gpt-3-5 context window 16\u00a0385 tokens accessed 24 May 2025."},{"key":"e_1_3_3_1_10_2","unstructured":"OpenAI. 2024. GPT-4-turbo Model Card. https:\/\/platform.openai.com\/docs\/models\/gpt-4o-and-gpt-4-turbo context window 128 k tokens accessed 24 May 2025."},{"key":"e_1_3_3_1_11_2","unstructured":"OpenAI. 2025. GPT-4.1-mini Beta Documentation. context window 1 M tokens accessed 24 May 2025."},{"key":"e_1_3_3_1_12_2","doi-asserted-by":"publisher","unstructured":"Kaumadi Wijesooriya Sameer\u00a0A. Jadaan Kaushalya\u00a0L. Perera Tanuveer Kaur and Mark Ziemann. 2022. Urgent need for consistent standards in functional enrichment analysis. PLOS Computational Biology 18 3 (2022) e1009935. 
10.1371\/journal.pcbi.1009935","DOI":"10.1371\/journal.pcbi.1009935"},{"key":"e_1_3_3_1_13_2","doi-asserted-by":"publisher","unstructured":"Guangchuang Yu Li-Gen Wang Yi Han and Qing-Yu He. 2010. GOSemSim: an R package for measuring semantic similarity among GO terms and gene products. Bioinformatics 26 7 (2010) 976\u2013978. 10.1093\/bioinformatics\/btq064 R package version 2.35.0.","DOI":"10.1093\/bioinformatics\/btq064"},{"key":"e_1_3_3_1_14_2","doi-asserted-by":"publisher","unstructured":"Guangchuang Yu Li-Gen Wang Yanyan Han and Qing-Yu He. 2012. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: A Journal of Integrative Biology 16 5 (2012) 284\u2013287. 10.1089\/omi.2011.0118","DOI":"10.1089\/omi.2011.0118"},{"key":"e_1_3_3_1_15_2","doi-asserted-by":"crossref","unstructured":"Jiqing Zhu Rebecca\u00a0Y Wang Xiaoting Wang Ricardo Azevedo Alexander Moreno Julia\u00a0A Kuhn and Zia Khan. 2025. Enhancing gene set overrepresentation analysis with large language models. 
Bioinformatics Advances 5 1 (2025) vbaf054.","DOI":"10.1093\/bioadv\/vbaf054"}],"event":{"name":"ICBRA 2025: The 12th International Conference on Bioinformatics Research and Applications","location":"Prague Czech Republic","acronym":"ICBRA 2025"},"container-title":["Proceedings of the 12th International Conference on Bioinformatics Research and Applications"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3774976.3774993","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,22]],"date-time":"2025-12-22T09:42:10Z","timestamp":1766396530000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3774976.3774993"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,19]]},"references-count":14,"alternative-id":["10.1145\/3774976.3774993","10.1145\/3774976"],"URL":"https:\/\/doi.org\/10.1145\/3774976.3774993","relation":{},"subject":[],"published":{"date-parts":[[2025,9,19]]},"assertion":[{"value":"2025-12-22","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}