{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T17:18:05Z","timestamp":1773249485713,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2020,7,29]],"date-time":"2020-07-29T00:00:00Z","timestamp":1595980800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,4,9]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Single-cell RNA sequencing allows us to study cell heterogeneity at an unprecedented cell-level resolution and identify known and new cell populations. Current cell labeling pipeline uses unsupervised clustering and assigns labels to clusters by manual inspection. However, this pipeline does not utilize available gold-standard labels because there are usually too few of them to be useful to most computational methods. This article aims to facilitate cell labeling with a semi-supervised method in an alternative pipeline, in which a few gold-standard labels are first identified and then extended to the rest of the cells computationally.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We built a semi-supervised dimensionality reduction method, a network-enhanced autoencoder (netAE). Tested on three public datasets, netAE outperforms various dimensionality reduction baselines and achieves satisfactory classification accuracy even when the labeled set is very small, without disrupting the similarity structure of the original space.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The code of netAE is available on GitHub: https:\/\/github.com\/LeoZDong\/netAE.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa669","type":"journal-article","created":{"date-parts":[[2020,7,17]],"date-time":"2020-07-17T19:11:23Z","timestamp":1595013083000},"page":"43-49","source":"Crossref","is-referenced-by-count":18,"title":["netAE: semi-supervised dimensionality reduction of single-cell RNA sequencing to facilitate cell labeling"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6368-8103","authenticated-orcid":false,"given":"Zhengyang","family":"Dong","sequence":"first","affiliation":[{"name":"Department of Computer Science, Stanford University , Stanford, CA 94305"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0495-7059","authenticated-orcid":false,"given":"Gil","family":"Alterovitz","sequence":"additional","affiliation":[{"name":"Department of Medicine, Brigham and Women's Hospital\/Harvard Medical School , Boston, MA 021153"},{"name":"National Artificial Intelligence Institute, U.S Department of Veterans Affairs , Washington, DC 20571"}]}],"member":"286","published-online":{"date-parts":[[2020,7,29]]},"reference":[{"key":"2023051510493094100_btaa669-B1","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1186\/s13059-019-1795-z","article-title":"A comparison of automatic cell identification methods for single-cell RNA sequencing data","volume":"20","author":"Abdelaal","year":"2019","journal-title":"Genome Biol"},{"key":"2023051510493094100_btaa669-B2","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.omtm.2018.07.003","article-title":"An introduction to the analysis of single-cell RNA-sequencing data","volume":"10","author":"AlJanahi","year":"2018","journal-title":"Mol. Ther. Methods Clin. Dev"},{"key":"2023051510493094100_btaa669-B3","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1016\/j.mam.2017.07.002","article-title":"Identifying cell populations with scRNASeq","volume":"59","author":"Andrews","year":"2018","journal-title":"Mol. Aspects Med"},{"key":"2023051510493094100_btaa669-B4","doi-asserted-by":"crossref","first-page":"2865","DOI":"10.1093\/bioinformatics\/bty1044","article-title":"M3drop: dropout-based feature selection for scRNASeq","volume":"35","author":"Andrews","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051510493094100_btaa669-B5","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.coisb.2017.07.004","article-title":"Single cells make big data: new challenges and opportunities in transcriptomics","volume":"4","author":"Angerer","year":"2017","journal-title":"Curr. Opin. Syst. Biol"},{"key":"2023051510493094100_btaa669-B6","first-page":"279","article-title":"Modular learning in neural networks","author":"Ballard","year":"1987","journal-title":"AAAI"},{"key":"2023051510493094100_btaa669-B7","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1126\/science.153.3731.34","article-title":"Dynamic programming","volume":"153","author":"Bellman","year":"1966","journal-title":"Science"},{"key":"2023051510493094100_btaa669-B8","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1038\/nbt.3102","article-title":"Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells","volume":"33","author":"Buettner","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023051510493094100_btaa669-B9","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1016\/j.cels.2017.03.006","article-title":"What is your conceptual definition of \u201ccell type\u201d in the context of a mature organism?","volume":"4","author":"Clevers","year":"2017","journal-title":"Cell Syst"},{"key":"2023051510493094100_btaa669-B10","doi-asserted-by":"crossref","first-page":"2567","DOI":"10.1093\/bioinformatics\/btw227","article-title":"densityCut: an efficient and versatile topological approach for automatic clustering of biological data","volume":"32","author":"Ding","year":"2016","journal-title":"Bioinformatics"},{"key":"2023051510493094100_btaa669-B11","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1038\/s41467-018-07931-2","article-title":"Single-cell RNA-seq denoising using a deep count autoencoder","volume":"10","author":"Eraslan","year":"2019","journal-title":"Nat. Commun"},{"key":"2023051510493094100_btaa669-B12","doi-asserted-by":"crossref","first-page":"7821","DOI":"10.1073\/pnas.122653799","article-title":"Community structure in social and biological networks","volume":"99","author":"Girvan","year":"2002","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051510493094100_btaa669-B13","author":"Greene","year":"1994"},{"key":"2023051510493094100_btaa669-B14","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1007\/BF02289588","article-title":"Hierarchical clustering schemes","volume":"32","author":"Johnson","year":"1967","journal-title":"Psychometrika"},{"key":"2023051510493094100_btaa669-B15","author":"Kingma","year":"2014"},{"key":"2023051510493094100_btaa669-B16","author":"Kingma","year":"2013"},{"key":"2023051510493094100_btaa669-B17","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1038\/nmeth.4644","article-title":"scmap: projection of single-cell RNA-seq data across data sets","volume":"15","author":"Kiselev","year":"2018","journal-title":"Nat. Methods"},{"key":"2023051510493094100_btaa669-B18","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1038\/s41576-018-0088-9","article-title":"Challenges in unsupervised clustering of single-cell RNA-seq data","volume":"20","author":"Kiselev","year":"2019","journal-title":"Nat. Rev. Genet"},{"key":"2023051510493094100_btaa669-B19","doi-asserted-by":"crossref","first-page":"1458","DOI":"10.1016\/j.celrep.2018.10.047","article-title":"Analysis of single-cell RNA-seq identifies cell-cell communication associated with tumor characteristics","volume":"25","author":"Kumar","year":"2018","journal-title":"Cell Rep"},{"key":"2023051510493094100_btaa669-B20","first-page":"1","author":"Li","year":"2020"},{"key":"2023051510493094100_btaa669-B21","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/s41592-018-0229-2","article-title":"Deep generative modeling for single-cell transcriptomics","volume":"15","author":"Lopez","year":"2018","journal-title":"Nat. Methods"},{"key":"2023051510493094100_btaa669-B22","doi-asserted-by":"crossref","first-page":"714","DOI":"10.1093\/bib\/bbq090","article-title":"Principal component analysis based methods in bioinformatics studies","volume":"12","author":"Ma","year":"2011","journal-title":"Brief. Bioinformatics"},{"key":"2023051510493094100_btaa669-B23","doi-asserted-by":"crossref","first-page":"e20","DOI":"10.1182\/blood-2016-05-716480","article-title":"A single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation","volume":"128","author":"Nestorowa","year":"2016","journal-title":"Blood"},{"key":"2023051510493094100_btaa669-B24","doi-asserted-by":"crossref","first-page":"8577","DOI":"10.1073\/pnas.0601602103","article-title":"Modularity and community structure in networks","volume":"103","author":"Newman","year":"2006","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051510493094100_btaa669-B25","doi-asserted-by":"crossref","first-page":"026113","DOI":"10.1103\/PhysRevE.69.026113","article-title":"Finding and evaluating community structure in networks","volume":"69","author":"Newman","year":"2004","journal-title":"Phys. Rev. E"},{"key":"2023051510493094100_btaa669-B26","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1080\/14786440109462720","article-title":"LIII. On lines and planes of closest fit to systems of points in space","volume":"2","author":"Pearson","year":"1901","journal-title":"Lond. Edinb. Dublin Philos. Mag. J. Sci"},{"key":"2023051510493094100_btaa669-B27","doi-asserted-by":"crossref","first-page":"1012","DOI":"10.1016\/j.cell.2016.03.023","article-title":"Single-cell RNA-seq reveals lineage and x chromosome dynamics in human preimplantation embryos","volume":"165","author":"Petropoulos","year":"2016","journal-title":"Cell"},{"key":"2023051510493094100_btaa669-B28","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1186\/s13059-015-0805-z","article-title":"Zifa: dimensionality reduction for zero-inflated single-cell gene expression analysis","volume":"16","author":"Pierson","year":"2015","journal-title":"Genome Biol"},{"key":"2023051510493094100_btaa669-B29","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1038\/s41467-017-02554-5","article-title":"A general and flexible method for signal extraction from single-cell RNA-seq data","volume":"9","author":"Risso","year":"2018","journal-title":"Nat. Commun"},{"key":"2023051510493094100_btaa669-B30","doi-asserted-by":"crossref","first-page":"2032","DOI":"10.1038\/s41467-017-02289-3","article-title":"Estimation of immune cell content in tumour tissue using single-cell RNA-seq data","volume":"8","author":"Schelker","year":"2017","journal-title":"Nat. Commun"},{"key":"2023051510493094100_btaa669-B31","doi-asserted-by":"crossref","first-page":"39921","DOI":"10.1038\/srep39921","article-title":"Batch effects and the effective design of single-cell gene expression studies","volume":"7","author":"Tung","year":"2017","journal-title":"Sci. Rep"},{"key":"2023051510493094100_btaa669-B32","first-page":"85","article-title":"Visualizing data using t-SNE","volume":"9","author":"Van der Maaten","year":"2008","journal-title":"J. Mach. Learn. Res"},{"key":"2023051510493094100_btaa669-B33","first-page":"573782","author":"Way","year":"2019"},{"key":"2023051510493094100_btaa669-B34","first-page":"532895","author":"Xu","year":"2019"},{"key":"2023051510493094100_btaa669-B35","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1126\/science.aaa1934","article-title":"Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq","volume":"347","author":"Zeisel","year":"2015","journal-title":"Science"},{"key":"2023051510493094100_btaa669-B36","first-page":"1","article-title":"Introduction to semi-supervised learning","volume":"3","author":"Zhu","year":"2009","journal-title":"Synth. Lect. Artif. Intell. Mach. Learn"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa669\/34774523\/btaa669.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/1\/43\/50322329\/btaa669.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/1\/43\/50322329\/btaa669.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,15]],"date-time":"2023-05-15T10:52:05Z","timestamp":1684147925000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/1\/43\/5877940"}},"subtitle":[],"editor":[{"given":"Jan","family":"Gorodkin","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,7,29]]},"references-count":36,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,4,9]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa669","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,1,1]]},"published":{"date-parts":[[2020,7,29]]}}}