{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,15]],"date-time":"2025-11-15T10:35:17Z","timestamp":1763202917260,"version":"3.41.2"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2024,6,26]],"date-time":"2024-06-26T00:00:00Z","timestamp":1719360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62076109"],"award-info":[{"award-number":["62076109"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Jilin Province Outstanding Young Scientist Program","award":["20230508098RC"],"award-info":[{"award-number":["20230508098RC"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The annotation of cell types from single-cell transcriptomics is essential for understanding the biological identity and functionality of cellular populations. Although manual annotation remains the gold standard, the advent of automatic pipelines has become crucial for scalable, unbiased, and cost-effective annotations. Nonetheless, the effectiveness of these automatic methods, particularly those employing deep learning, significantly depends on the architecture of the classifier and the quality and diversity of the training datasets.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To address these limitations, we present a Pruning-enabled Gene-Cell Net (PredGCN) incorporating a Coupled Gene-Cell Net (CGCN) to enable representation learning and information storage. PredGCN integrates a Gene Splicing Net (GSN) and a Cell Stratification Net (CSN), employing a pruning operation (PrO) to dynamically tackle the complexity of heterogeneous cell identification. Among them, GSN leverages multiple statistical and hypothesis-driven feature extraction methods to selectively assemble genes with specificity for scRNA-seq data while CSN unifies elements based on diverse region demarcation principles, exploiting the representations from GSN and precise identification from different regional homogeneity perspectives. Furthermore, we develop a multi-objective Pareto pruning operation (Pareto PrO) to expand the dynamic capabilities of CGCN, optimizing the sub-network structure for accurate cell type annotation. Multiple comparison experiments on real scRNA-seq datasets from various species have demonstrated that PredGCN surpasses existing state-of-the-art methods, including its scalability to cross-species datasets. Moreover, PredGCN can uncover unknown cell types and provide functional genomic analysis by quantifying the influence of genes on cell clusters, bringing new insights into cell type identification and characterizing scRNA-seq data from different perspectives.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code is available at https:\/\/github.com\/IrisQi7\/PredGCN and test data is available at https:\/\/figshare.com\/articles\/dataset\/PredGCN\/25251163.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae421","type":"journal-article","created":{"date-parts":[[2024,6,26]],"date-time":"2024-06-26T19:27:26Z","timestamp":1719430046000},"source":"Crossref","is-referenced-by-count":2,"title":["PredGCN: a Pruning-enabled Gene-Cell Net for automatic cell annotation of single cell transcriptome data"],"prefix":"10.1093","volume":"40","author":[{"given":"Qi","family":"Qi","sequence":"first","affiliation":[{"name":"School of Artificial Intelligence, Jilin University , Changchun 130012, China"}]},{"given":"Yunhe","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Hebei University of Technology , Tianjin 300401, China"}]},{"given":"Yujian","family":"Huang","sequence":"additional","affiliation":[{"name":"College of Computer Science and Cyber Security, Chengdu University of Technology , Chengdu 610059, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8620-2735","authenticated-orcid":false,"given":"Yi","family":"Fan","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Jilin University , Changchun 130012, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8716-9823","authenticated-orcid":false,"given":"Xiangtao","family":"Li","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence, Jilin University , Changchun 130012, China"}]}],"member":"286","published-online":{"date-parts":[[2024,6,26]]},"reference":[{"key":"2024071019300479800_btae421-B1","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1186\/s13059-019-1862-5","article-title":"scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data","volume":"20","author":"Alquicira-Hernandez","year":"2019","journal-title":"Genome Biol"},{"key":"2024071019300479800_btae421-B2","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1038\/s41590-018-0276-y","article-title":"Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage","volume":"20","author":"Aran","year":"2019","journal-title":"Nat Immunol"},{"key":"2024071019300479800_btae421-B3","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1016\/j.cels.2016.08.011","article-title":"A single-cell transcriptomic map of the human and mouse pancreas reveals inter-and intra-cell population structure","volume":"3","author":"Baron","year":"2016","journal-title":"Cell Syst"},{"key":"2024071019300479800_btae421-B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.inffus.2018.11.008","article-title":"Ensembles for feature selection: a review and future trends","volume":"52","author":"Bol\u00f3n-Canedo","year":"2019","journal-title":"Inf Fusion"},{"key":"2024071019300479800_btae421-B5","doi-asserted-by":"crossref","first-page":"e95","DOI":"10.1093\/nar\/gkz543","article-title":"CHETAH: a selective, hierarchical cell type identification method for single-cell RNA sequencing","volume":"47","author":"de Kanter","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024071019300479800_btae421-B6","doi-asserted-by":"crossref","first-page":"103477","DOI":"10.1016\/j.ebiom.2021.103477","article-title":"Expression characteristics of interferon-stimulated genes and possible regulatory mechanisms in lupus patients using transcriptomics analyses","volume":"70","author":"Deng","year":"2021","journal-title":"EBioMedicine"},{"key":"2024071019300479800_btae421-B94330770","doi-asserted-by":"crossref","first-page":"737","DOI":"10.1038\/s41587-020-0465-8","article-title":"Systematic comparison of single-cell and single-nucleus RNA-sequencing methods","volume":"38","author":"Ding","year":"2020","journal-title":"Nat Biotechnol"},{"key":"2024071019300479800_btae421-B8","doi-asserted-by":"crossref","first-page":"2205442","DOI":"10.1002\/advs.202205442","article-title":"Reliable identification and interpretation of single-cell molecular heterogeneity and transcriptional regulation using dynamic ensemble pruning","volume":"10","author":"Fan","year":"2023","journal-title":"Adv Sci"},{"key":"2024071019300479800_btae421-B9","doi-asserted-by":"crossref","first-page":"baz046","DOI":"10.1093\/database\/baz046","article-title":"PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data","volume":"2019","author":"Franz\u00e9n","year":"2019","journal-title":"Database"},{"key":"2024071019300479800_btae421-B10","doi-asserted-by":"crossref","first-page":"e694","DOI":"10.1002\/ctm2.694","article-title":"Single-cell RNA sequencing technologies and applications: a brief overview","volume":"12","author":"Jovic","year":"2022","journal-title":"Clin Transl Med"},{"key":"2024071019300479800_btae421-B11","doi-asserted-by":"crossref","first-page":"1419","DOI":"10.1038\/s12276-020-00499-2","article-title":"Single-cell sequencing techniques from individual to multiomics analyses","volume":"52","author":"Kashima","year":"2020","journal-title":"Exp Mol Med"},{"key":"2024071019300479800_btae421-B12","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1038\/nmeth.4644","article-title":"scmap: projection of single-cell RNA-seq data across data sets","volume":"15","author":"Kiselev","year":"2018","journal-title":"Nat Methods"},{"key":"2024071019300479800_btae421-B13","doi-asserted-by":"crossref","first-page":"e0205499","DOI":"10.1371\/journal.pone.0205499","article-title":"CaSTLe \u2013 classification of single cells by transfer learning: harnessing the power of publicly available single cell RNA sequencing experiments to annotate new experiments","volume":"13","author":"Lieberman","year":"2018","journal-title":"PLoS One"},{"key":"2024071019300479800_btae421-B14","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1093\/bioinformatics\/btz592","article-title":"ACTINN: automated identification of cell types in single cell RNA sequencing","volume":"36","author":"Ma","year":"2020","journal-title":"Bioinformatics"},{"key":"2024071019300479800_btae421-B15","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1016\/j.cels.2016.09.002","article-title":"A single-cell transcriptome atlas of the human pancreas","volume":"3","author":"Muraro","year":"2016","journal-title":"Cell Syst"},{"key":"2024071019300479800_btae421-B16","doi-asserted-by":"crossref","first-page":"1094","DOI":"10.1038\/s41590-020-0743-0","article-title":"Mapping systemic lupus erythematosus heterogeneity at the single-cell level","volume":"21","author":"Nehar-Belaid","year":"2020","journal-title":"Nat Immunol"},{"key":"2024071019300479800_btae421-B17","doi-asserted-by":"crossref","first-page":"961","DOI":"10.1016\/j.csbj.2021.01.015","article-title":"Automated methods for cell type annotation on scRNA-seq data","volume":"19","author":"Pasquini","year":"2021","journal-title":"Comput Struct Biotechnol J"},{"key":"2024071019300479800_btae421-B18","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1016\/j.cmet.2016.08.020","article-title":"Single-cell transcriptome profiling of human pancreatic islets in health and type 2 diabetes","volume":"24","author":"Segerstolpe","year":"2016","journal-title":"Cell Metab"},{"key":"2024071019300479800_btae421-B19","doi-asserted-by":"crossref","first-page":"100882","DOI":"10.1016\/j.isci.2020.100882","article-title":"scCATCH: automatic annotation on cell types of clusters from single-cell RNA sequencing data","volume":"23","author":"Shao","year":"2020","journal-title":"iScience"},{"key":"2024071019300479800_btae421-B20","doi-asserted-by":"crossref","first-page":"bbab034","DOI":"10.1093\/bib\/bbab034","article-title":"Accurate feature selection improves single-cell RNA-seq cell clustering","volume":"22","author":"Su","year":"2021","journal-title":"Brief Bioinform"},{"key":"2024071019300479800_btae421-B21","doi-asserted-by":"crossref","first-page":"2307280","DOI":"10.1002\/advs.202307280","article-title":"Distribution-agnostic deep learning enables accurate single-cell data recovery and transcriptional regulation interpretation","volume":"11","author":"Su","year":"2024","journal-title":"Adv Sci"},{"key":"2024071019300479800_btae421-B22","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1016\/j.cels.2019.06.004","article-title":"SingleCellNet: a computational tool to classify single cell RNA-seq data across platforms and across species","volume":"9","author":"Tan","year":"2019","journal-title":"Cell Syst"},{"key":"2024071019300479800_btae421-B23","doi-asserted-by":"crossref","first-page":"3222","DOI":"10.1038\/s41467-019-11181-1","article-title":"Genetic mapping of cell type specificity for complex traits","volume":"10","author":"Watanabe","year":"2019","journal-title":"Nat Commun"},{"key":"2024071019300479800_btae421-B24","doi-asserted-by":"crossref","first-page":"608","DOI":"10.1016\/j.cmet.2016.08.018","article-title":"RNA sequencing of single human islet cells reveals type 2 diabetes genes","volume":"24","author":"Xin","year":"2016","journal-title":"Cell Metab"},{"key":"2024071019300479800_btae421-B25","doi-asserted-by":"crossref","first-page":"bbaa393","DOI":"10.1093\/bib\/bbaa393","article-title":"Identification of haploinsufficient genes from epigenomic data using deep forest","volume":"22","author":"Yang","year":"2021","journal-title":"Brief Bioinform"},{"key":"2024071019300479800_btae421-B26","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1038\/s41467-023-36134-7","article-title":"Topological identification and interpretation for single-cell gene regulation elucidation across multiple platforms using scMGCA","volume":"14","author":"Yu","year":"2023","journal-title":"Nat Commun"},{"key":"2024071019300479800_btae421-B27","doi-asserted-by":"crossref","first-page":"1007","DOI":"10.1038\/s41592-019-0529-1","article-title":"Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling","volume":"16","author":"Zhang","year":"2019","journal-title":"Nat Methods"},{"key":"2024071019300479800_btae421-B28","doi-asserted-by":"crossref","first-page":"D721","DOI":"10.1093\/nar\/gky900","article-title":"CellMarker: a manually curated resource of cell markers in human and mouse","volume":"47","author":"Zhang","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024071019300479800_btae421-B29","doi-asserted-by":"crossref","first-page":"531","DOI":"10.3390\/genes10070531","article-title":"SCINA: a semi-supervised subtyping algorithm of single cells and bulk samples","volume":"10","author":"Zhang","year":"2019","journal-title":"Genes (Basel)"},{"key":"2024071019300479800_btae421-B30","doi-asserted-by":"crossref","first-page":"eabq7599","DOI":"10.1126\/sciadv.abq7599","article-title":"Human PBMC scRNA-seq\u2013based aging clocks reveal ribosome to inflammation balance as a single-cell aging hallmark and super longevity","volume":"9","author":"Zhu","year":"2023","journal-title":"Sci Adv"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae421\/58342069\/btae421.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/7\/btae421\/58499509\/btae421.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/7\/btae421\/58499509\/btae421.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,10]],"date-time":"2024-07-10T19:30:50Z","timestamp":1720639850000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae421\/7699793"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,6,26]]},"references-count":30,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2024,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae421","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,6,26]]},"article-number":"btae421"}}