{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,12]],"date-time":"2026-01-12T11:18:13Z","timestamp":1768216693714,"version":"3.49.0"},"reference-count":17,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2026,1,12]],"date-time":"2026-01-12T00:00:00Z","timestamp":1768176000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>Spatial transcriptomics (ST) technologies enable the profiling of gene expression while preserving spatial context, offering unprecedented insights into tissue organization. However, traditional spatial domain identification methods primarily rely on gene expression matrices and spatial coordinates while overlooking the rich biological knowledge encoded in gene functional descriptions. Here, we propose SpaLLM, a general framework that integrates large language model (LLM) embeddings of gene descriptions with conventional spatial transcriptomics analysis. Our approach leverages pre-computed GenePT embeddings from NCBI gene summaries to create biologically-informed gene representations. SpaLLM combines these LLM-derived gene features with cell-gene expression matrices through matrix multiplication, generating enriched cell representations that capture both expression patterns and functional knowledge. These enriched features are then integrated with existing graph-based spatial analysis methods for improved spatial domain identification. Extensive validation on 12 sequencing-based Visium sections and an independent imaging-based osmFISH dataset demonstrate that SpaLLM consistently enhances spatial domain identification. Our modular framework can be seamlessly integrated with existing spatial analysis pipelines, making it broadly applicable to diverse research scenarios.<\/jats:p>","DOI":"10.3389\/fbinf.2025.1713975","type":"journal-article","created":{"date-parts":[[2026,1,12]],"date-time":"2026-01-12T08:15:02Z","timestamp":1768205702000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["SpaLLM: a general framework for spatial domain identification with large language models"],"prefix":"10.3389","volume":"5","author":[{"given":"Zeyu","family":"Zou","sequence":"first","affiliation":[]},{"given":"Ziheng","family":"Duan","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2026,1,12]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","DOI":"10.1101\/2023.10.16.562533","article-title":"Genept: a simple but effective foundation model for genes and cells built from chatgpt","author":"Chen","year":"2024","journal-title":"bioRxiv"},{"key":"B2","doi-asserted-by":"publisher","first-page":"1777","DOI":"10.1016\/j.cell.2022.04.003","article-title":"Spatiotemporal transcriptomic atlas of mouse organogenesis using dna nanoball-patterned arrays","volume":"185","author":"Chen","year":"2022","journal-title":"Cell."},{"key":"B3","doi-asserted-by":"publisher","first-page":"932","DOI":"10.1038\/s41592-018-0175-z","article-title":"Spatial organization of the somatosensory cortex revealed by osmfish","volume":"15","author":"Codeluppi","year":"2018","journal-title":"Nat. Methods"},{"key":"B4","doi-asserted-by":"publisher","first-page":"1739","DOI":"10.1038\/s41467-022-29439-6","article-title":"Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder","volume":"13","author":"Dong","year":"2022","journal-title":"Nat. Commun."},{"key":"B5","doi-asserted-by":"publisher","first-page":"btae339","DOI":"10.1093\/bioinformatics\/btae339","article-title":"Impeller: a path-based heterogeneous graph learning method for spatial transcriptomic data imputation","volume":"40","author":"Duan","year":"","journal-title":"Bioinformatics"},{"key":"B6","first-page":"538","article-title":"Imiracle: an iterative multi-view graph neural network to model intercellular gene regulation from spatial transcriptomic data","volume-title":"Proceedings of the 33rd ACM international conference on information and knowledge management","author":"Duan","year":""},{"key":"B7","first-page":"625","article-title":"Muse: a multi-slice joint analysis method for spatial transcriptomics experiments","volume-title":"Proceedings of the 34th ACM international conference on information and knowledge management","author":"Duan","year":""},{"key":"B8","first-page":"19","article-title":"Disco: a diffusion model for spatial transcriptomics data completion","volume-title":"2025 IEEE international conference on image processing (ICIP)","author":"Duan","year":""},{"key":"B9","doi-asserted-by":"publisher","first-page":"eadi5199","DOI":"10.1126\/science.adi5199","article-title":"Single-cell genomics and regulatory networks for 388 human brains","volume":"384","author":"Emani","year":"2024","journal-title":"Science"},{"key":"B10","doi-asserted-by":"publisher","first-page":"1342","DOI":"10.1038\/s41592-021-01255-8","article-title":"Spagcn: integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network","volume":"18","author":"Hu","year":"2021","journal-title":"Nat. Methods"},{"key":"B11","doi-asserted-by":"publisher","first-page":"1155","DOI":"10.1038\/s41467-023-36796-3","article-title":"Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with graphst","volume":"14","author":"Long","year":"2023","journal-title":"Nat. Commun."},{"key":"B12","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1038\/s41593-020-00787-0","article-title":"Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex","volume":"24","author":"Maynard","year":"2021","journal-title":"Nat. Neurosci."},{"key":"B13","doi-asserted-by":"publisher","first-page":"4076","DOI":"10.1038\/s41467-022-31739-w","article-title":"Identifying multicellular spatiotemporal organization of cells with spaceflow","volume":"13","author":"Ren","year":"2022","journal-title":"Nat. Commun."},{"key":"B14","doi-asserted-by":"publisher","first-page":"1463","DOI":"10.1126\/science.aaw1219","article-title":"Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution","volume":"363","author":"Rodriques","year":"2019","journal-title":"Science"},{"key":"B15","doi-asserted-by":"publisher","first-page":"eadg5136","DOI":"10.1126\/science.adg5136","article-title":"Single-cell multi-cohort dissection of the schizophrenia transcriptome","volume":"384","author":"Ruzicka","year":"2024","journal-title":"Science"},{"key":"B16","doi-asserted-by":"publisher","first-page":"bbae329","DOI":"10.1093\/bib\/bbae329","article-title":"Accurately deciphering spatial domains for spatially resolved transcriptomics with stcluster","volume":"25","author":"Wang","year":"2024","journal-title":"Briefings Bioinforma."},{"key":"B17","doi-asserted-by":"publisher","first-page":"1375","DOI":"10.1038\/s41587-021-00935-2","article-title":"Spatial transcriptomics at subspot resolution with bayesspace","volume":"39","author":"Zhao","year":"2021","journal-title":"Nat. Biotechnol."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1713975\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,12]],"date-time":"2026-01-12T08:15:03Z","timestamp":1768205703000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1713975\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1,12]]},"references-count":17,"alternative-id":["10.3389\/fbinf.2025.1713975"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1713975","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,1,12]]},"article-number":"1713975"}}