{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T20:43:29Z","timestamp":1769114609709,"version":"3.49.0"},"reference-count":46,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2025,6,5]],"date-time":"2025-06-05T00:00:00Z","timestamp":1749081600000},"content-version":"vor","delay-in-days":35,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100007225","name":"Ministry of Science and Technology","doi-asserted-by":"publisher","award":["BT\/GenomeIndia\/2018"],"award-info":[{"award-number":["BT\/GenomeIndia\/2018"]}],"id":[{"id":"10.13039\/100007225","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Cell-type annotation remains a major challenge in single-cell and spatial omics analysis. Most existing methods rely on single-cell RNA sequencing (scRNA-seq) references or predefined marker sets. However, the scarcity of high-quality scRNA-seq references and marker sets makes relying on a single approach prone to bias and limits usability. Furthermore, available methods for cell-type annotation in single-cell ATAC-sequencing (scATAC-seq) and spatial transcriptomics datasets perform poorly. Here, we present ScInfeR, a graph-based cell-type annotation method that combines information from both scRNA-seq references and marker sets. By integrating these two data sources, ScInfeR can accurately annotate broad range of cell-types. It employs a hierarchical framework inspired by message-passing layers in graph neural networks to accurately identify cell subtypes. ScInfeR is highly versatile, supporting cell annotation across scRNA-seq, scATAC-seq, and spatial omics datasets. For scATAC-seq, it effectively utilizes chromatin accessibility data, while for spatial transcriptomics, it incorporates spatial coordinate information. Additionally, ScInfeR supports weighted positive and negative markers, allowing users to define marker importance in cell-type classification. Our extensive benchmarking across multiple atlas-scale scRNA-seq, scATAC-seq, and spatial datasets, evaluating 10 existing tools in over 100 cell-type prediction tasks, demonstrated ScInfeR\u2019s superior performance. Noteworthy, it exhibits robustness against batch effects arising in these datasets. To facilitate seamless annotation, we developed ScInfeRDB, an interactive database containing manually curated scRNA-seq references and marker sets for 329 cell-types, covering 2497 gene markers in 28 tissue types from human and plant. ScInfeR is available as an R package, with both the tool and database publicly accessible at https:\/\/www.swainasish.in\/scinfer.<\/jats:p>","DOI":"10.1093\/bib\/bbaf253","type":"journal-article","created":{"date-parts":[[2025,6,5]],"date-time":"2025-06-05T16:48:59Z","timestamp":1749142139000},"source":"Crossref","is-referenced-by-count":3,"title":["ScInfeR: an efficient method for annotating cell types and sub-types in single-cell RNA-seq, ATAC-seq, and spatial omics"],"prefix":"10.1093","volume":"26","author":[{"given":"Asish","family":"Kumar Swain","sequence":"first","affiliation":[{"name":"Department of Bioscience and Bioengineering, Indian Institute of Technology (IIT) , N.H. 62, Nagaur Road, Karwar, Jodhpur 342030, Rajasthan ,","place":["India"]}]},{"given":"Rajveer","family":"Singh Shekhawat","sequence":"additional","affiliation":[{"name":"Department of Bioscience and Bioengineering, Indian Institute of Technology (IIT) , N.H. 62, Nagaur Road, Karwar, Jodhpur 342030, Rajasthan ,","place":["India"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7160-9209","authenticated-orcid":false,"given":"Pankaj","family":"Yadav","sequence":"additional","affiliation":[{"name":"Department of Bioscience and Bioengineering, Indian Institute of Technology (IIT) , N.H. 62, Nagaur Road, Karwar, Jodhpur 342030, Rajasthan ,","place":["India"]},{"name":"School of Artificial Intelligence and Data Science, Indian Institute of Technology (IIT) , N.H. 62, Nagaur Road, Karwar, Jodhpur 342030, Rajasthan ,","place":["India"]}]}],"member":"286","published-online":{"date-parts":[[2025,6,5]]},"reference":[{"key":"2025062805314017700_ref1","doi-asserted-by":"publisher","first-page":"4307","DOI":"10.1038\/s41467-020-18158-5","article-title":"Single cell transcriptomics comes of age","volume":"11","author":"Aldridge","year":"2020","journal-title":"Nat Commun"},{"key":"2025062805314017700_ref2","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1038\/s41368-021-00146-0","article-title":"From bulk, single-cell to spatial RNA sequencing","volume":"13","author":"Li","year":"2021","journal-title":"Int J Oral Sci"},{"key":"2025062805314017700_ref3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-019-1854-5","article-title":"Assessment of computational methods for the analysis of single-cell ATAC-seq data","volume":"20","author":"Chen","year":"2019","journal-title":"Genome Biol"},{"key":"2025062805314017700_ref4","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1038\/s41586-021-03634-9","article-title":"Exploring tissue architecture using spatial transcriptomics","volume":"596","author":"Rao","year":"2021","journal-title":"Nature."},{"key":"2025062805314017700_ref5","doi-asserted-by":"publisher","first-page":"8845","DOI":"10.1093\/nar\/gku555","article-title":"Single-cell RNA-seq: advances and future challenges","volume":"42","author":"Saliba","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2025062805314017700_ref6","doi-asserted-by":"crossref","first-page":"bbae662","DOI":"10.1093\/bib\/bbae662","article-title":"scRGCL: a cell type annotation method for single-cell RNA-seq data using residual graph convolutional neural network with contrastive learning","volume":"26","author":"Yuan","year":"2025","journal-title":"Brief Bioinform"},{"key":"2025062805314017700_ref7","doi-asserted-by":"publisher","first-page":"1246","DOI":"10.1038\/s41467-022-28803-w","article-title":"Fully-automated and ultra-fast cell-type identification using specific marker combinations from single-cell transcriptomic data","volume":"13","author":"Ianevski","year":"2022","journal-title":"Nat Commun"},{"key":"2025062805314017700_ref8","doi-asserted-by":"publisher","first-page":"531","DOI":"10.3390\/genes10070531","article-title":"SCINA: a semi-supervised subtyping algorithm of single cells and bulk samples","volume":"10","author":"Zhang","year":"2019","journal-title":"Genes."},{"key":"2025062805314017700_ref9","doi-asserted-by":"publisher","first-page":"852","DOI":"10.1038\/s42256-022-00534-z","article-title":"scBERT as a large-scale pretrained deep language model for cell type annotation of single-cell RNA-seq data","volume":"4","author":"Yang","year":"2022","journal-title":"Nat Mach Intell"},{"key":"2025062805314017700_ref10","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1016\/j.gpb.2020.07.004","article-title":"Evaluation of cell type annotation R packages on single-cell RNA-seq data","volume":"19","author":"Huang","year":"2021","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2025062805314017700_ref11","doi-asserted-by":"publisher","first-page":"baz046","DOI":"10.1093\/database\/baz046","article-title":"PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data","volume":"2019","author":"Franz\u00e9n","year":"2019","journal-title":"Database."},{"key":"2025062805314017700_ref12","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1186\/s13073-023-01249-5","article-title":"Annotation of cell types (ACT): a convenient web server for cell type annotation","volume":"15","author":"Quan","year":"2023","journal-title":"Genome Med"},{"key":"2025062805314017700_ref13","doi-asserted-by":"publisher","first-page":"D870","DOI":"10.1093\/nar\/gkac947","article-title":"CellMarker 2.0: an updated database of manually curated cell markers in human\/mouse and web tools based on scRNA-seq data","volume":"51","author":"Hu","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2025062805314017700_ref14","doi-asserted-by":"crossref","first-page":"bbad179","DOI":"10.1093\/bib\/bbad179","article-title":"scAnno: a deconvolution strategy-based automatic cell type annotation tool for single-cell RNA-sequencing data sets","volume":"24","author":"Liu","year":"2023","journal-title":"Brief Bioinform"},{"key":"2025062805314017700_ref15","doi-asserted-by":"publisher","first-page":"2257","DOI":"10.1634\/stemcells.2006-0744","article-title":"Markers and methods for cell sorting of human embryonic stem cell-derived neural cell populations","volume":"25","author":"Pruszak","year":"2007","journal-title":"Stem Cells"},{"key":"2025062805314017700_ref16","doi-asserted-by":"publisher","first-page":"750","DOI":"10.1038\/nbt.1931","article-title":"Cell-surface markers for the isolation of pancreatic cell types derived from human embryonic stem cells","volume":"29","author":"Kelly","year":"2011","journal-title":"Nat Biotechnol"},{"key":"2025062805314017700_ref17","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1038\/s41590-018-0276-y","article-title":"Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage","volume":"20","author":"Aran","year":"2019","journal-title":"Nat Immunol"},{"key":"2025062805314017700_ref18","doi-asserted-by":"publisher","first-page":"4688","DOI":"10.1093\/bioinformatics\/btz292","article-title":"scMatch: a single-cell gene expression profile annotation tool using reference datasets","volume":"35","author":"Hou","year":"2019","journal-title":"Bioinformatics."},{"key":"2025062805314017700_ref19","doi-asserted-by":"publisher","first-page":"495","DOI":"10.1038\/nbt.3192","article-title":"Spatial reconstruction of single-cell gene expression data","volume":"33","author":"Satija","year":"2015","journal-title":"Nat Biotechnol"},{"key":"2025062805314017700_ref20","doi-asserted-by":"crossref","first-page":"bbae188","DOI":"10.1093\/bib\/bbae188","article-title":"scBOL: a universal cell type identification framework for single-cell and spatial transcriptomics data","volume":"25","author":"Zhai","year":"2024","journal-title":"Brief Bioinform"},{"key":"2025062805314017700_ref21","doi-asserted-by":"publisher","first-page":"983","DOI":"10.1038\/s41592-019-0535-3","article-title":"Supervised classification enables rapid annotation of cell atlases","volume":"16","author":"Pliner","year":"2019","journal-title":"Nat Methods"},{"key":"2025062805314017700_ref22","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1186\/s13059-021-02281-7","article-title":"scSorter: assigning cells to known cell types according to marker genes","volume":"22","author":"Guo","year":"2021","journal-title":"Genome Biol"},{"key":"2025062805314017700_ref23","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbad268","article-title":"AtacAnnoR: a reference-based annotation tool for single cell ATAC-seq data","volume":"24","author":"Tian","year":"2023","journal-title":"Brief Bioinform"},{"key":"2025062805314017700_ref24","doi-asserted-by":"publisher","first-page":"1864","DOI":"10.1038\/s41467-023-37439-3","article-title":"Cellcano: supervised cell type identification for single cell ATAC-seq data","volume":"14","author":"Ma","year":"2023","journal-title":"Nat Commun"},{"key":"2025062805314017700_ref25","doi-asserted-by":"crossref","first-page":"bbad533","DOI":"10.1093\/bib\/bbad533","article-title":"SPANN: annotating single-cell resolution spatial transcriptome data with scRNA-seq data","volume":"25","author":"Yuan","year":"2024","journal-title":"Brief Bioinform"},{"key":"2025062805314017700_ref26","doi-asserted-by":"publisher","first-page":"1465","DOI":"10.1038\/s41587-023-01657-3","article-title":"TACCO unifies annotation transfer and decomposition of cell identities for single-cell and spatial omics","volume":"41","author":"Mages","year":"2023","journal-title":"Nat Biotechnol"},{"key":"2025062805314017700_ref27","doi-asserted-by":"publisher","first-page":"3458","DOI":"10.1038\/s41467-020-17281-7","article-title":"Searching large-scale scRNA-seq databases via unbiased cell embedding with cell BLAST","volume":"11","author":"Cao","year":"2020","journal-title":"Nat Commun"},{"key":"2025062805314017700_ref28","doi-asserted-by":"publisher","first-page":"1333","DOI":"10.1038\/s41592-021-01282-5","article-title":"Single-cell chromatin state analysis with Signac","volume":"18","author":"Stuart","year":"2021","journal-title":"Nat Methods"},{"key":"2025062805314017700_ref29","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1038\/s41588-021-00790-6","article-title":"ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis","volume":"53","author":"Granja","year":"2021","journal-title":"Nat Genet"},{"key":"2025062805314017700_ref30","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-017-1382-0","article-title":"SCANPY: large-scale single-cell gene expression data analysis","volume":"19","author":"Wolf","year":"2018","journal-title":"Genome Biol"},{"key":"2025062805314017700_ref31","doi-asserted-by":"crossref","first-page":"eabl4896","DOI":"10.1126\/science.abl4896","article-title":"The tabula sapiens: A multiple-organ, single-cell transcriptomic atlas of humans","volume":"376","author":"Consortium*","year":"2022","journal-title":"Science"},{"key":"2025062805314017700_ref32","doi-asserted-by":"publisher","first-page":"925","DOI":"10.1038\/s41587-019-0206-z","article-title":"Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion","volume":"37","author":"Satpathy","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2025062805314017700_ref33","doi-asserted-by":"publisher","first-page":"916","DOI":"10.1038\/s41587-019-0147-6","article-title":"Droplet-based combinatorial indexing for massive-scale single-cell chromatin accessibility","volume":"37","author":"Lareau","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2025062805314017700_ref34","doi-asserted-by":"crossref","first-page":"eaat5691","DOI":"10.1126\/science.aat5691","article-title":"Three-dimensional intact-tissue sequencing of single-cell transcriptional states","volume":"361","author":"Wang","year":"2018","journal-title":"Science."},{"key":"2025062805314017700_ref35","doi-asserted-by":"publisher","first-page":"74","DOI":"10.1038\/s41587-021-01006-2","article-title":"Integration of spatial and single-cell transcriptomic data elucidates mouse organogenesis","volume":"40","author":"Lohoff","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2025062805314017700_ref36","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1038\/s41593-020-00787-0","article-title":"Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex","volume":"24","author":"Maynard","year":"2021","journal-title":"Nat Neurosci"},{"key":"2025062805314017700_ref37","doi-asserted-by":"publisher","first-page":"1683","DOI":"10.1038\/s41592-023-02035-2","article-title":"Population-level integration of single-cell datasets enables multi-scale analysis across samples","volume":"20","author":"De Donno","year":"2023","journal-title":"Nat Methods"},{"key":"2025062805314017700_ref38","doi-asserted-by":"publisher","first-page":"2241","DOI":"10.1016\/j.celrep.2019.04.054","article-title":"High-throughput single-cell transcriptome profiling of plant cell types","volume":"27","author":"Shulse","year":"2019","journal-title":"Cell Rep"},{"key":"2025062805314017700_ref39","doi-asserted-by":"publisher","first-page":"D1629","DOI":"10.1093\/nar\/gkad706","article-title":"scPlantDB: a comprehensive database for exploring cell types and markers of plant cell atlases","volume":"52","author":"He","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2025062805314017700_ref40","doi-asserted-by":"publisher","first-page":"1289","DOI":"10.1038\/s41592-019-0619-0","article-title":"Fast, sensitive and accurate integration of single-cell data with harmony","volume":"16","author":"Korsunsky","year":"2019","journal-title":"Nat Methods"},{"key":"2025062805314017700_ref41","doi-asserted-by":"publisher","first-page":"639","DOI":"10.1038\/s42003-024-06349-5","article-title":"SpatialPrompt: spatially aware scalable and accurate tool for spot deconvolution and domain identification in spatial transcriptomics","volume":"7","author":"Swain","year":"2024","journal-title":"Commun Biol"},{"key":"2025062805314017700_ref42","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbab567","article-title":"A comprehensive comparison of supervised and unsupervised methods for cell type identification in single-cell RNA-seq","volume":"23","author":"Sun","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025062805314017700_ref43","doi-asserted-by":"publisher","first-page":"D596","DOI":"10.1093\/nar\/gkab1020","article-title":"DISCO: a database of deeply integrated human single-cell omics data","volume":"50","author":"Li","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025062805314017700_ref44","doi-asserted-by":"publisher","first-page":"D1448","DOI":"10.1093\/nar\/gkab949","article-title":"PCMDB: a curated and comprehensive resource of plant cell markers","volume":"50","author":"Jin","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025062805314017700_ref45","volume-title":"Shiny: Web Application Framework for R; 2024. R Package Version 1.8.1.1","author":"Chang"},{"key":"2025062805314017700_ref46","doi-asserted-by":"publisher","first-page":"2338","DOI":"10.1038\/s41467-020-15851-3","article-title":"Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis","volume":"11","author":"Li","year":"2020","journal-title":"Nat Commun"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/3\/bbaf253\/63441672\/bbaf253.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/26\/3\/bbaf253\/63441672\/bbaf253.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,28]],"date-time":"2025-06-28T09:31:48Z","timestamp":1751103108000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaf253\/8157322"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,1]]},"references-count":46,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaf253","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,5]]},"published":{"date-parts":[[2025,5,1]]},"article-number":"bbaf253"}}