{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T11:58:49Z","timestamp":1772539129152,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"22","license":[{"start":{"date-parts":[[2019,4,27]],"date-time":"2019-04-27T00:00:00Z","timestamp":1556323200000},"content-version":"vor","delay-in-days":1,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Cancer Research Trust \u2018Enabling"},{"DOI":"10.13039\/501100001170","name":"Cancer Council of Western Australia","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001170","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Australian Government Research Training Programme"},{"DOI":"10.13039\/100011719","name":"Cancer Research Trust","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100011719","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Australian National Health and Medical Research Council Fellowship","award":["APP1154524"],"award-info":[{"award-number":["APP1154524"]}]},{"name":"Australian Government and the Government of Western Australia"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Single-cell RNA sequencing (scRNA-seq) measures gene expression at the resolution of individual cells. Massively multiplexed single-cell profiling has enabled large-scale transcriptional analyses of thousands of cells in complex tissues. In most cases, the true identity of individual cells is unknown and needs to be inferred from the transcriptomic data. Existing methods typically cluster (group) cells based on similarities of their gene expression profiles and assign the same identity to all cells within each cluster using the averaged expression levels. However, scRNA-seq experiments typically produce low-coverage sequencing data for each cell, which hinders the clustering process.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We introduce scMatch, which directly annotates single cells by identifying their closest match in large reference datasets. We used this strategy to annotate various single-cell datasets and evaluated the impacts of sequencing depth, similarity metric and reference datasets. We found that scMatch can rapidly and robustly annotate single cells with comparable accuracy to another recent cell annotation tool (SingleR), but that it is quicker and can handle larger reference datasets. We demonstrate how scMatch can handle large customized reference gene expression profiles that combine data from multiple sources, thus empowering researchers to identify cell populations in any complex tissue with the desired precision.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>scMatch (Python code) and the FANTOM5 reference dataset are freely available to the research community here https:\/\/github.com\/forrest-lab\/scMatch.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz292","type":"journal-article","created":{"date-parts":[[2019,4,21]],"date-time":"2019-04-21T11:06:59Z","timestamp":1555844819000},"page":"4688-4695","source":"Crossref","is-referenced-by-count":118,"title":["scMatch: a single-cell gene expression profile annotation tool using reference datasets"],"prefix":"10.1093","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6571-1514","authenticated-orcid":false,"given":"Rui","family":"Hou","sequence":"first","affiliation":[{"name":"Harry Perkins Institute of Medical Research, QEII Medical Centre and Centre for Medical Research, The University of Western Australia , Nedlands, Perth, WA 6009, Australia"}]},{"given":"Elena","family":"Denisenko","sequence":"additional","affiliation":[{"name":"Harry Perkins Institute of Medical Research, QEII Medical Centre and Centre for Medical Research, The University of Western Australia , Nedlands, Perth, WA 6009, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4543-1675","authenticated-orcid":false,"given":"Alistair R R","family":"Forrest","sequence":"additional","affiliation":[{"name":"Harry Perkins Institute of Medical Research, QEII Medical Centre and Centre for Medical Research, The University of Western Australia , Nedlands, Perth, WA 6009, Australia"}]}],"member":"286","published-online":{"date-parts":[[2019,4,26]]},"reference":[{"key":"2023013108315814000_btz292-B1","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1038\/s41590-018-0276-y","article-title":"Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage","volume":"20","author":"Aran","year":"2019","journal-title":"Nat. Immunol"},{"key":"2023013108315814000_btz292-B2","doi-asserted-by":"crossref","first-page":"1010","DOI":"10.1126\/science.1259418","article-title":"Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells","volume":"347","author":"Arner","year":"2015","journal-title":"Science"},{"key":"2023013108315814000_btz292-B3","doi-asserted-by":"crossref","first-page":"714","DOI":"10.1016\/j.cell.2014.04.005","article-title":"Single-cell trajectory detection uncovers progression and regulatory coordination in human B cell development","volume":"157","author":"Bendall","year":"2014","journal-title":"Cell"},{"key":"2023013108315814000_btz292-B4","doi-asserted-by":"crossref","first-page":"1681","DOI":"10.1016\/j.cell.2015.05.044","article-title":"Genomic classification of cutaneous melanoma","volume":"161","year":"2015","journal-title":"Cell"},{"key":"2023013108315814000_btz292-B5","doi-asserted-by":"crossref","first-page":"892","DOI":"10.1038\/s41467-018-03214-y","article-title":"Reconstruction of complex single-cell trajectories using CellRouter","volume":"9","author":"da Rocha","year":"2018","journal-title":"Nat. Commun"},{"key":"2023013108315814000_btz292-B6","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1186\/s13326-016-0088-7","article-title":"The Cell Ontology 2016: enhanced content, modularization, and ontology interoperability","volume":"7","author":"Diehl","year":"2016","journal-title":"J. Biomed. Semantics"},{"key":"2023013108315814000_btz292-B7","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1016\/j.cels.2016.10.021","article-title":"The BLUEPRINT data analysis portal","volume":"3","author":"Fernandez","year":"2016","journal-title":"Cell Syst"},{"key":"2023013108315814000_btz292-B8","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1038\/nature13182","article-title":"A promoter-level mammalian expression atlas","volume":"507","author":"Forrest","year":"2014","journal-title":"Nature"},{"key":"2023013108315814000_btz292-B9","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.12688\/f1000research.15809.1","article-title":"Comparison of clustering tools in R for medium-sized 10x Genomics single-cell RNA-sequencing data","volume":"7","author":"Freytag","year":"2018","journal-title":"F1000Res"},{"key":"2023013108315814000_btz292-B10","doi-asserted-by":"crossref","first-page":"845","DOI":"10.1038\/nmeth.3971","article-title":"Diffusion pseudotime robustly reconstructs lineage branching","volume":"13","author":"Haghverdi","year":"2016","journal-title":"Nat. Methods"},{"key":"2023013108315814000_btz292-B11","doi-asserted-by":"crossref","first-page":"1091","DOI":"10.1016\/j.cell.2018.02.001","article-title":"Mapping the mouse cell atlas by Microwell-seq","volume":"172","author":"Han","year":"2018","journal-title":"Cell"},{"key":"2023013108315814000_btz292-B12","doi-asserted-by":"crossref","first-page":"666","DOI":"10.1016\/j.celrep.2012.08.003","article-title":"CEL-Seq: single-cell RNA-seq by multiplexed linear amplification","volume":"2","author":"Hashimshony","year":"2012","journal-title":"Cell Rep"},{"key":"2023013108315814000_btz292-B13","article-title":"Impact of similarity metrics on single-cell RNA-seq data clustering","author":"Kim","year":"2018","journal-title":"Brief. Bioinform"},{"key":"2023013108315814000_btz292-B14","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1016\/j.cell.2015.04.044","article-title":"Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells","volume":"161","author":"Klein","year":"2015","journal-title":"Cell"},{"key":"2023013108315814000_btz292-B15","doi-asserted-by":"crossref","first-page":"708","DOI":"10.1038\/ng.3818","article-title":"Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors","volume":"49","author":"Li","year":"2017","journal-title":"Nat. Genet"},{"key":"2023013108315814000_btz292-B16","doi-asserted-by":"crossref","first-page":"D737","DOI":"10.1093\/nar\/gkw995","article-title":"Update of the FANTOM web resource: high resolution transcriptome of diverse cell types in mammals","volume":"45","author":"Lizio","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023013108315814000_btz292-B17","doi-asserted-by":"crossref","first-page":"632","DOI":"10.1186\/1471-2164-14-632","article-title":"An expression atlas of human primary cells: inference of gene function from coexpression networks","volume":"14","author":"Mabbott","year":"2013","journal-title":"BMC Genomics"},{"key":"2023013108315814000_btz292-B18","doi-asserted-by":"crossref","first-page":"1202","DOI":"10.1016\/j.cell.2015.05.002","article-title":"Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets","volume":"161","author":"Macosko","year":"2015","journal-title":"Cell"},{"key":"2023013108315814000_btz292-B19","doi-asserted-by":"crossref","first-page":"1096","DOI":"10.1038\/nmeth.2639","article-title":"Smart-seq2 for sensitive full-length transcriptome profiling in single cells","volume":"10","author":"Picelli","year":"2013","journal-title":"Nat. Methods"},{"key":"2023013108315814000_btz292-B20","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/nbt.2967","article-title":"Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex","volume":"32","author":"Pollen","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023013108315814000_btz292-B21","first-page":"2018","article-title":"The Human Cell Atlas White Paper","volume":"05192","author":"Regev","year":"2018","journal-title":"arXiv Preprint arXiv"},{"key":"2023013108315814000_btz292-B22","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1126\/science.aam8999","article-title":"Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding","volume":"360","author":"Rosenberg","year":"2018","journal-title":"Science"},{"key":"2023013108315814000_btz292-B23","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1038\/nbt.3569","article-title":"Wishbone identifies bifurcating developmental trajectories from single-cell data","volume":"34","author":"Setty","year":"2016","journal-title":"Nat. Biotechnol"},{"key":"2023013108315814000_btz292-B24","doi-asserted-by":"crossref","first-page":"360","DOI":"10.1016\/j.stem.2015.07.013","article-title":"Single-cell RNA-seq with waterfall reveals molecular cascades underlying adult neurogenesis","volume":"17","author":"Shin","year":"2015","journal-title":"Cell Stem Cell"},{"key":"2023013108315814000_btz292-B25","doi-asserted-by":"crossref","first-page":"36014","DOI":"10.1038\/srep36014","article-title":"Vertical flow array chips reliably identify cell types from single-cell mRNA sequencing experiments","volume":"6","author":"Shirai","year":"2016","journal-title":"Sci. Rep"},{"key":"2023013108315814000_btz292-B26","doi-asserted-by":"crossref","first-page":"D726","DOI":"10.1093\/nar\/gkv1160","article-title":"ENCODE data at the ENCODE portal","volume":"44","author":"Sloan","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023013108315814000_btz292-B27","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1038\/nmeth.4220","article-title":"Power analysis of single-cell RNA-sequencing experiments","volume":"14","author":"Svensson","year":"2017","journal-title":"Nat. Methods"},{"key":"2023013108315814000_btz292-B28","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1038\/s41586-018-0590-4","article-title":"Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris","volume":"562","author":"Tabula Muris","year":"2018","journal-title":"Nature"},{"key":"2023013108315814000_btz292-B29","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1038\/nmeth.1315","article-title":"mRNA-Seq whole-transcriptome analysis of a single cell","volume":"6","author":"Tang","year":"2009","journal-title":"Nat. Methods"},{"key":"2023013108315814000_btz292-B30","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1038\/nature09145","article-title":"Single-cell NF-kappa B dynamics reveal digital activation and analogue information processing","volume":"466","author":"Tay","year":"2010","journal-title":"Nature"},{"key":"2023013108315814000_btz292-B31","doi-asserted-by":"crossref","first-page":"12308","DOI":"10.1021\/ac5035924","article-title":"Self-digitization microfluidic chip for absolute quantification of mRNA in single cells","volume":"86","author":"Thompson","year":"2014","journal-title":"Anal. Chem"},{"key":"2023013108315814000_btz292-B32","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1126\/science.aad0501","article-title":"Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq","volume":"352","author":"Tirosh","year":"2016","journal-title":"Science"},{"key":"2023013108315814000_btz292-B33","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1038\/nbt.2859","article-title":"The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells","volume":"32","author":"Trapnell","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023013108315814000_btz292-B34","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1038\/nature13173","article-title":"Reconstructing lineage hierarchies of the distal lung epithelium using single-cell RNA-seq","volume":"509","author":"Treutlein","year":"2014","journal-title":"Nature"},{"key":"2023013108315814000_btz292-B35","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1038\/nbt.2857","article-title":"Microfluidic high-throughput culturing of single cells for selection based on extracellular metabolite production or consumption","volume":"32","author":"Wang","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023013108315814000_btz292-B36","doi-asserted-by":"crossref","first-page":"14049","DOI":"10.1038\/ncomms14049","article-title":"Massively parallel digital transcriptional profiling of single cells","volume":"8","author":"Zheng","year":"2017","journal-title":"Nat. Commun"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz292\/28665318\/btz292.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/22\/4688\/48978090\/bioinformatics_35_22_4688.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/22\/4688\/48978090\/bioinformatics_35_22_4688.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T17:38:55Z","timestamp":1675186735000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/22\/4688\/5480299"}},"subtitle":[],"editor":[{"given":"Janet","family":"Kelso","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,4,26]]},"references-count":36,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2019,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz292","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,11,15]]},"published":{"date-parts":[[2019,4,26]]}}}