{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T17:40:47Z","timestamp":1775324447362,"version":"3.50.1"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2020,4,4]],"date-time":"2020-04-04T00:00:00Z","timestamp":1585958400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"NSFC","doi-asserted-by":"publisher","award":["61772394"],"award-info":[{"award-number":["61772394"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Single-cell RNA-sequencing (scRNA-seq) profiles the transcriptome of individual cells, which enables the discovery of cell types or subtypes by using unsupervised clustering. Current algorithms perform dimension reduction before cell clustering because of the noise, high dimensionality and linear inseparability of scRNA-seq data. 
However, performing dimension reduction and clustering independently fails to fully characterize patterns in the data, resulting in suboptimal performance.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this study, we propose a flexible and accurate algorithm for scRNA-seq data that jointly learns dimension reduction and cell clustering (DRjCC), where dimension reduction is performed by projected matrix decomposition and cell type clustering by non-negative matrix factorization. We first formulate joint learning of dimension reduction and cell clustering as a constrained optimization problem and then derive the optimization rules. The advantage of DRjCC is that feature selection in dimension reduction is guided by cell clustering, significantly improving the performance of cell type discovery. Eleven scRNA-seq datasets are used to validate the performance of the algorithms, where the number of single cells varies from 49 to 68\u00a0579 and the number of cell types ranges from 3 to 14. The experimental results demonstrate that DRjCC significantly outperforms 13 state-of-the-art methods in terms of various measurements on cell type clustering (an average improvement of 17.44%). Furthermore, DRjCC is efficient and robust across different scRNA-seq datasets from various tissues. 
The proposed model and methods provide an effective strategy to analyze scRNA-seq data.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The software is implemented in MATLAB and is freely available for academic use at https:\/\/github.com\/xkmaxidian\/DRjCC.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa231","type":"journal-article","created":{"date-parts":[[2020,3,31]],"date-time":"2020-03-31T11:12:59Z","timestamp":1585653179000},"page":"3825-3832","source":"Crossref","is-referenced-by-count":53,"title":["Joint learning dimension reduction and clustering of single-cell RNA-sequencing data"],"prefix":"10.1093","volume":"36","author":[{"given":"Wenming","family":"Wu","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology , Xidian University, Xi\u2019an, China"}]},{"given":"Xiaoke","family":"Ma","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology , Xidian University, Xi\u2019an, China"}]}],"member":"286","published-online":{"date-parts":[[2020,4,4]]},"reference":[{"key":"2023063011293752000_btaa231-B1","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cels.2016.08.011","article-title":"A single-cell transcriptomic map of the human and mouse pancreas reveals inter-and intra-cell population structure","volume":"3","author":"Baron","year":"2016","journal-title":"Cell Syst"},{"key":"2023063011293752000_btaa231-B2","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1038\/nbt.4314","article-title":"Dimensionality reduction for visualizing single-cell data using 
UMAP","volume":"37","author":"Becht","year":"2019","journal-title":"Nat. Biotechnol"},{"key":"2023063011293752000_btaa231-B3","first-page":"2399","article-title":"Manifold regularization: a geometric framework for learning from labeled and unlabeled examples","volume":"7","author":"Belkin","year":"2006","journal-title":"J. Mach. Learn. Res"},{"key":"2023063011293752000_btaa231-B4","doi-asserted-by":"crossref","first-page":"1787","DOI":"10.1101\/gr.177725.114","article-title":"Cell fate inclination within 2-cell and 4-cell mouse embryos revealed by single-cell RNA sequencing","volume":"24","author":"Biase","year":"2014","journal-title":"Genome Res"},{"key":"2023063011293752000_btaa231-B5","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1038\/nature22330","article-title":"Assembly of functionally integrated human forebrain spheroids","volume":"545","author":"Birey","year":"2017","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B6","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1038\/nmeth.2645","article-title":"Accounting for technical noise in single-cell RNA-seq experiments","volume":"10","author":"Brennecke","year":"2013","journal-title":"Nat. Methods"},{"key":"2023063011293752000_btaa231-B7","doi-asserted-by":"crossref","first-page":"1548","DOI":"10.1109\/TPAMI.2010.231","article-title":"Graph regularized non-negative matrix factorization for data representation","volume":"33","author":"Cai","year":"2011","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023063011293752000_btaa231-B8","doi-asserted-by":"crossref","first-page":"15672","DOI":"10.1073\/pnas.1520760112","article-title":"Human cerebral organoids recapitulate gene expression programs of fetal neocortex development","volume":"112","author":"Camp","year":"2015","journal-title":"Proc. Natl. Acad. Sci. 
USA"},{"key":"2023063011293752000_btaa231-B9","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1038\/nature25981","article-title":"The cis-regulatory dynamics of embryonic development at single-cell resolution","volume":"555","author":"Cusanovich","year":"2018","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B10","doi-asserted-by":"crossref","first-page":"2002","DOI":"10.1038\/s41467-018-04368-5","article-title":"Interpretable dimensionality reduction of single cell transcriptome data with deep generative models","volume":"9","author":"Ding","year":"2018","journal-title":"Nat. Commun"},{"key":"2023063011293752000_btaa231-B11","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1109\/TPAMI.2008.277","article-title":"Convex and semi-nonnegative matrix factorizations","volume":"32","author":"Ding","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023063011293752000_btaa231-B12","doi-asserted-by":"crossref","first-page":"7723","DOI":"10.1073\/pnas.1805681115","article-title":"Integrative analysis of single-cell genomics data by coupled nonnegative matrix factorizations","volume":"115","author":"Duren","year":"2018","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023063011293752000_btaa231-B13","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1038\/nature11979","article-title":"Ovarian surface epithelium at the junction area contains a cancer-prone stem cell niche","volume":"495","author":"Flesken-Nikitin","year":"2013","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B14","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1002\/widm.32","article-title":"Cluster ensembles","volume":"1","author":"Ghosh","year":"2011","journal-title":"Data Mining Knowl. 
Discov"},{"key":"2023063011293752000_btaa231-B15","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1038\/nature14966","article-title":"Single-cell messenger RNA sequencing reveals rare intestinal cell types","volume":"525","author":"Grun","year":"2015","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B16","doi-asserted-by":"crossref","first-page":"e1004575","DOI":"10.1371\/journal.pcbi.1004575","article-title":"SINCEAR: a pipeline for single-cell RNA-seq profiling analysis","volume":"11","author":"Guo","year":"2015","journal-title":"PLoS Comput. Biol"},{"key":"2023063011293752000_btaa231-B17","doi-asserted-by":"crossref","first-page":"1091","DOI":"10.1016\/j.cell.2018.02.001","article-title":"Mapping the mouse cell atlas by Microwell-Seq","volume":"172","author":"Han","year":"2018","journal-title":"Cell"},{"key":"2023063011293752000_btaa231-B18","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1038\/nature06952","article-title":"Mechanism of shape determination in motile cells","volume":"453","author":"Keren","year":"2008","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B19","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1038\/nmeth.4236","article-title":"SC3: consensus clustering of single-cell RNA-seq data","volume":"14","author":"Kiselev","year":"2017","journal-title":"Nat. Methods"},{"key":"2023063011293752000_btaa231-B20","doi-asserted-by":"crossref","first-page":"310","DOI":"10.1038\/s41576-019-0095-5","article-title":"Challenges in unsupervised clustering of single-cell RNA-seq data","volume":"20","author":"Kiselev","year":"2019","journal-title":"Nat. Gene. Rev"},{"key":"2023063011293752000_btaa231-B21","doi-asserted-by":"crossref","first-page":"2439","DOI":"10.1093\/cercor\/bhn260","article-title":"Intermediate neuronal progenitors (basal progenitors) produce pyramidal-projection neurons for all layers of cerebral cortex","volume":"19","author":"Kowalczyk","year":"2009","journal-title":"Cereb. 
Cortex"},{"key":"2023063011293752000_btaa231-B22","doi-asserted-by":"crossref","first-page":"W90","DOI":"10.1093\/nar\/gkw377","article-title":"Enrichr: a comprehensive gene set enrichment analysis web server 2016 update","volume":"44","author":"Kuleshov","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023063011293752000_btaa231-B23","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature13920","article-title":"Deconstructing transcriptional heterogeneity in pluripotent stem cells","volume":"516","author":"Kumar","year":"2014","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B24","first-page":"1675","author":"Lakkaraju","year":"2016"},{"key":"2023063011293752000_btaa231-B25","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1038\/44565","article-title":"Learning the parts of objects by non-negative matrix factorization","volume":"401","author":"Lee","year":"1999","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B26","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1016\/j.neucom.2013.03.034","article-title":"Locally discriminative spectral clustering with composite manifold","volume":"119","author":"Li","year":"2013","journal-title":"Neurocomputing"},{"key":"2023063011293752000_btaa231-B27","doi-asserted-by":"crossref","first-page":"2809","DOI":"10.1093\/bioinformatics\/bty1056","article-title":"Single-cell RNA-seq interpretations using evolutionary multiobjective ensemble pruning","volume":"35","author":"Li","year":"2019","journal-title":"Bioinformatics"},{"key":"2023063011293752000_btaa231-B28","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1186\/s13059-017-1188-0","article-title":"CIDR: ultrafast and accurate clustering through imputation for single-cell RNA-seq data","volume":"18","author":"Lin","year":"2017","journal-title":"Genome 
Biol"},{"key":"2023063011293752000_btaa231-B29","doi-asserted-by":"crossref","first-page":"1045","DOI":"10.1109\/TKDE.2017.2657752","article-title":"Evolutionary nonnegative matrix factorization algorithms for community detection in dynamic networks","volume":"29","author":"Ma","year":"2017","journal-title":"IEEE Trans. Knowl. Data Eng"},{"key":"2023063011293752000_btaa231-B30","first-page":"2579","article-title":"Visualizing high-dimensional data using t-SNE","volume":"9","author":"Maaten","year":"2008","journal-title":"J. Mach. Learn. Res"},{"key":"2023063011293752000_btaa231-B31","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1007\/s10618-010-0191-9","article-title":"Accelerating spectral clustering with partial supervision","volume":"21","author":"Mavroedis","year":"2010","journal-title":"Data Mining Knowl. Discov"},{"key":"2023063011293752000_btaa231-B32","doi-asserted-by":"crossref","first-page":"983","DOI":"10.1038\/s41592-019-0535-3","article-title":"Supervised classification enables rapid annotation of cell atlases","volume":"16","author":"Pliner","year":"2019","journal-title":"Nat. Methods"},{"key":"2023063011293752000_btaa231-B33","first-page":"2007","author":"Rajapakse","year":"2004"},{"key":"2023063011293752000_btaa231-B34","doi-asserted-by":"crossref","first-page":"777","DOI":"10.1038\/nbt.2282","article-title":"Full-length mRNA-seq from single-cell levels of RNA and individual circulating tumor cells","volume":"30","author":"Ramskold","year":"2012","journal-title":"Nat. 
Biotechnol"},{"key":"2023063011293752000_btaa231-B35","doi-asserted-by":"crossref","first-page":"2323","DOI":"10.1126\/science.290.5500.2323","article-title":"Nonlinear dimensionality reduction by locally linear embedding","volume":"290","author":"Roweis","year":"2000","journal-title":"Science"},{"key":"2023063011293752000_btaa231-B37","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1038\/nbt.3192","article-title":"Spatial reconstruction of single-cell gene expression data","volume":"33","author":"Satija","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023063011293752000_btaa231-B40","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/bioinformatics\/btw607","article-title":"Robust classification of single-cell transcriptome data by nonnegative matrix factorization","volume":"33","author":"Shao","year":"2017","journal-title":"Bioinformatics"},{"key":"2023063011293752000_btaa231-B44","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1038\/nmeth.1315","article-title":"mRNA-seq whole-transcriptome analysis of a single cell","volume":"6","author":"Tang","year":"2009","journal-title":"Nat. 
Methods"},{"key":"2023063011293752000_btaa231-B45","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1186\/s13059-018-1431-3","article-title":"GiniClust2: a cluster-aware, weighted ensemble clustering method for cell-type detection","volume":"19","author":"Tsoucas","year":"2018","journal-title":"Genome Biol"},{"key":"2023063011293752000_btaa231-B46","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1038\/40805","article-title":"A multivalent PDZ-domain protein assembles signalling complexes in a G-protein-coupled cascade","volume":"388","author":"Tsunoda","year":"1997","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B47","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","article-title":"SCANPY: large-scale single-cell gene expression data analysis","volume":"19","author":"Wolf","year":"2018","journal-title":"Genome Biol"},{"key":"2023063011293752000_btaa231-B48","doi-asserted-by":"crossref","first-page":"4290","DOI":"10.1073\/pnas.1521171113","article-title":"Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks","volume":"113","author":"Wu","year":"2016","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023063011293752000_btaa231-B49","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1093\/bioinformatics\/bty793","article-title":"SAFE-clustering: single-cell aggregated (from ensemble) clustering for single-cell RNA-seq data","volume":"35","author":"Yang","year":"2019","journal-title":"Bioinformatics"},{"key":"2023063011293752000_btaa231-B50","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1186\/s13059-017-1305-0","article-title":"Splatter: simulation of single-cell RNA sequencing data","volume":"18","author":"Zappia","year":"2017","journal-title":"Genome Biol"},{"key":"2023063011293752000_btaa231-B51","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1126\/science.aaa1934","article-title":"Brain structure. 
Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq","volume":"347","author":"Zeisel","year":"2015","journal-title":"Science"},{"key":"2023063011293752000_btaa231-B52","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/nrn.2017.85","article-title":"Neuronal cell-type classification: challenges, opportunities and the path forward","volume":"18","author":"Zeng","year":"2017","journal-title":"Nat. Rev. Neurosci"},{"key":"2023063011293752000_btaa231-B53","doi-asserted-by":"crossref","first-page":"526","DOI":"10.1038\/s41586-019-1576-6","article-title":"Synaptic proximity enables NMDAR signalling to promote brain metastasis","volume":"573","author":"Zeng","year":"2019","journal-title":"Nature"},{"key":"2023063011293752000_btaa231-B54","doi-asserted-by":"crossref","first-page":"1690","DOI":"10.1109\/TPAMI.2016.2613924","article-title":"Sparse representation-based open set recognition","volume":"39","author":"Zhang","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023063011293752000_btaa231-B56","doi-asserted-by":"crossref","first-page":"1007","DOI":"10.1038\/s41592-019-0529-1","article-title":"Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling","volume":"16","author":"Zhang","year":"2019","journal-title":"Nat. Methods"},{"key":"2023063011293752000_btaa231-B57","doi-asserted-by":"crossref","first-page":"14049","DOI":"10.1038\/ncomms14049","article-title":"Massively parallel digital transcriptional profiling of single cells","volume":"8","author":"Zheng","year":"2017","journal-title":"Nat. 
Commun"},{"key":"2023063011293752000_btaa231-B58","doi-asserted-by":"crossref","first-page":"e2888","DOI":"10.7717\/peerj.2888","article-title":"Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization","volume":"5","author":"Zhu","year":"2017","journal-title":"Peerj"},{"key":"2023063011293752000_btaa231-B59","doi-asserted-by":"crossref","first-page":"466","DOI":"10.1073\/pnas.1817715116","article-title":"Semisoft clustering of single-cell data","volume":"116","author":"Zhu","year":"2019","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023063011293752000_btaa231-B60","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1186\/s12859-016-0984-y","article-title":"pcaReduce: hierarchical clustering of single cell transcriptional profiles","volume":"17","author":"Zurauskiene","year":"2013","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa231\/33151304\/btaa231.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/12\/3825\/50750654\/bioinformatics_36_12_3825.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/12\/3825\/50750654\/bioinformatics_36_12_3825.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T11:30:19Z","timestamp":1688124619000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/12\/3825\/5815975"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,4
,4]]},"references-count":53,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2020,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa231","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,6,15]]},"published":{"date-parts":[[2020,4,4]]}}}