{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T21:17:06Z","timestamp":1772918226034,"version":"3.50.1"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2024,2,5]],"date-time":"2024-02-05T00:00:00Z","timestamp":1707091200000},"content-version":"vor","delay-in-days":4,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62141207"],"award-info":[{"award-number":["62141207"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Dropout events bring challenges in analyzing single-cell RNA sequencing data as they introduce noise and distort the true distributions of gene expression profiles. Recent studies focus on estimating dropout probability and imputing dropout events by leveraging information from similar cells or genes. However, the number of dropout events differs in different cells, due to the complex factors, such as different sequencing protocols, cell types, and batch effects. The dropout event differences are not fully considered in assessing the similarities between cells and genes, which compromises the reliability of downstream analysis.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>This work proposes a hybrid Generative Adversarial Network with dropouts identification to impute single-cell RNA sequencing data, named AGImpute. First, the numbers of dropout events in different cells in scRNA-seq data are differentially estimated by using a dynamic threshold estimation strategy. Next, the identified dropout events are imputed by a hybrid deep learning model, combining Autoencoder with a Generative Adversarial Network. To validate the efficiency of the AGImpute, it is compared with seven state-of-the-art dropout imputation methods on two simulated datasets and seven real single-cell RNA sequencing datasets. The results show that AGImpute imputes the least number of dropout events than other methods. Moreover, AGImpute enhances the performance of downstream analysis, including clustering performance, identifying cell-specific marker genes, and inferring trajectory in the time-course dataset.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code can be obtained from https:\/\/github.com\/xszhu-lab\/AGImpute.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae068","type":"journal-article","created":{"date-parts":[[2024,2,6]],"date-time":"2024-02-06T04:26:53Z","timestamp":1707193613000},"source":"Crossref","is-referenced-by-count":17,"title":["AGImpute: imputation of scRNA-seq data based on a hybrid GAN with dropouts identification"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7696-9112","authenticated-orcid":false,"given":"Xiaoshu","family":"Zhu","sequence":"first","affiliation":[{"name":"School of Computer and Information Security, Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology , Guilin 541004, China"}]},{"given":"Shuang","family":"Meng","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Guangxi Normal University , Guilin 541006, China"}]},{"given":"Gaoshi","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Guangxi Normal University , Guilin 541006, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1516-0480","authenticated-orcid":false,"given":"Jianxin","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Hunan Provincial Key Lab on Bioinformatics, Central South University , Changsha 400083, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4099-5183","authenticated-orcid":false,"given":"Xiaoqing","family":"Peng","sequence":"additional","affiliation":[{"name":"School of Life Sciences, Center for Medical Genetics, Central South University , Changsha 400083, China"}]}],"member":"286","published-online":{"date-parts":[[2024,2,5]]},"reference":[{"key":"2024022006445019600_btae068-B1","first-page":"3625","article-title":"Psychrophilic proteases dramatically reduce single-cell RNA-seq artifacts: a molecular atlas of kidney development","volume":"144","author":"Adam","year":"2017","journal-title":"Development"},{"key":"2024022006445019600_btae068-B2","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1152\/physiolgenomics.00126.2022","article-title":"Single-cell transcriptomic heterogeneity between conduit and resistance mesenteric arteries in rats","volume":"55","author":"Anderson","year":"2023","journal-title":"Physiol Genomics"},{"key":"2024022006445019600_btae068-B3","first-page":"3568","author":"Berrevoets","year":"2023"},{"key":"2024022006445019600_btae068-B4","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1038\/nbt.3102","article-title":"Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells","volume":"33","author":"Buettner","year":"2015","journal-title":"Nat Biotechnol"},{"key":"2024022006445019600_btae068-B5","doi-asserted-by":"crossref","first-page":"295","DOI":"10.3389\/fgene.2020.00295","article-title":"Single-cell transcriptome data clustering via multinomial modeling and adaptive fuzzy k-means algorithm","volume":"11","author":"Chen","year":"2020","journal-title":"Front Genet"},{"key":"2024022006445019600_btae068-B6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-018-1575-1","article-title":"VIPER: variability-preserving imputation for accurate gene expression recovery in single-cell RNA sequencing studies","volume":"19","author":"Chen","year":"2018","journal-title":"Genome Biol"},{"key":"2024022006445019600_btae068-B7","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1186\/s13059-016-1033-x","article-title":"Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm","volume":"17","author":"Chu","year":"2016","journal-title":"Genome Biol"},{"key":"2024022006445019600_btae068-B8","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1126\/science.1245316","article-title":"Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells","volume":"343","author":"Deng","year":"2014","journal-title":"Science"},{"key":"2024022006445019600_btae068-B9","doi-asserted-by":"crossref","first-page":"119562","DOI":"10.1016\/j.eswa.2023.119562","article-title":"A novel combined approach based on deep Autoencoder and deep classifiers for credit card fraud detection","volume":"217","author":"Fanai","year":"2023","journal-title":"Expert Syst Appl"},{"key":"2024022006445019600_btae068-B10","doi-asserted-by":"crossref","first-page":"440","DOI":"10.2174\/1574893617666220330151024","article-title":"DSAE-Impute: learning discriminative stacked autoencoders for imputing single-cell RNA-seq data","volume":"17","author":"Gan","year":"2022","journal-title":"Curr Bioinform"},{"key":"2024022006445019600_btae068-B11","first-page":"27","author":"Goodfellow","year":"2014"},{"key":"2024022006445019600_btae068-B12","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1038\/nmeth.2930","article-title":"Validation of noise models for single-cell transcriptomics","volume":"11","author":"Gr\u00fcn","year":"2014","journal-title":"Nat Methods"},{"key":"2024022006445019600_btae068-B13","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nature24489","article-title":"A single-cell survey of the small intestinal epithelium","volume":"551","author":"Haber","year":"2017","journal-title":"Nature"},{"key":"2024022006445019600_btae068-B14","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/j.cels.2017.10.012","article-title":"Unsupervised trajectory analysis of single-cell RNA-seq and imaging data reveals alternative tuft cell origins in the gut","volume":"6","author":"Herring","year":"2017","journal-title":"Cell Syst"},{"key":"2024022006445019600_btae068-B15","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1038\/s41592-018-0033-z","article-title":"SAVER: gene expression recovery for single-cell RNA sequencing","volume":"15","author":"Huang","year":"2018","journal-title":"Nat Methods"},{"key":"2024022006445019600_btae068-B16","first-page":"S11","article-title":"065 Longitudinal analysis of T cell dynamics in alopecia areata at single-cell resolution","volume":"142","author":"Lee","year":"2022","journal-title":"J Investig Dermatol"},{"key":"2024022006445019600_btae068-B17","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1038\/s41467-018-03405-7","article-title":"An accurate and robust imputation method scImpute for single-cell RNA-seq data","volume":"9","author":"Li","year":"2018","journal-title":"Nat Commun"},{"key":"2024022006445019600_btae068-B18","doi-asserted-by":"crossref","first-page":"2619","DOI":"10.1007\/s11135-022-01484-9","article-title":"Cyclic clustering approach to impute missing values for cyclostationary hydrological time series","volume":"57","author":"Mahmoudi","year":"2023","journal-title":"Qual Quant"},{"key":"2024022006445019600_btae068-B19","first-page":"665323","author":"Miao","year":"2019"},{"key":"2024022006445019600_btae068-B20","doi-asserted-by":"crossref","first-page":"860","DOI":"10.1186\/s12864-021-08101-3","article-title":"ScLRTC: imputation for single-cell RNA-seq data via low-rank tensor completion","volume":"22","author":"Pan","year":"2021","journal-title":"BMC Genomics"},{"key":"2024022006445019600_btae068-B21","doi-asserted-by":"crossref","first-page":"bbad149","DOI":"10.1093\/bib\/bbad149","article-title":"SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data","volume":"24","author":"Qiu","year":"2023","journal-title":"Brief Bioinform"},{"key":"2024022006445019600_btae068-B22","doi-asserted-by":"crossref","first-page":"6229","DOI":"10.3390\/ijms24076229","article-title":"Epi-Impute: single-cell RNA-seq imputation via integration with single-cell ATAC-seq","volume":"24","author":"Raevskiy","year":"2023","journal-title":"Int J Mol Sci"},{"key":"2024022006445019600_btae068-B23","doi-asserted-by":"crossref","DOI":"10.1093\/jleuko\/qiad069","volume-title":"Human \u03b3\u03b4 T Cell Identification from Single-Cell RNA Sequencing Datasets by Modular TCR Expression","author":"Song","year":"2023"},{"key":"2024022006445019600_btae068-B24","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1038\/nmeth.4292","article-title":"Normalizing single-cell RNA sequencing data: challenges and opportunities","volume":"14","author":"Vallejos","year":"2017","journal-title":"Nat Methods"},{"key":"2024022006445019600_btae068-B25","doi-asserted-by":"crossref","first-page":"716","DOI":"10.1016\/j.cell.2018.05.061","article-title":"Recovering gene interactions from single-cell data using data diffusion","volume":"174","author":"van Dijk","year":"2018","journal-title":"Cell"},{"key":"2024022006445019600_btae068-B26","first-page":"1","author":"Wagner","year":"2019"},{"key":"2024022006445019600_btae068-B27","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1038\/nature24029","article-title":"The neuropeptide NMU amplifies ILC2-driven allergic lung inflammation","volume":"549","author":"Wallrapp","year":"2017","journal-title":"Nature"},{"key":"2024022006445019600_btae068-B28","first-page":"E6437","article-title":"Gene expression distribution deconvolution in single-cell RNA sequencing","volume":"115","author":"Wang","year":"2018","journal-title":"Proc Natl Acad Sci USA"},{"key":"2024022006445019600_btae068-B29","doi-asserted-by":"crossref","first-page":"e85","DOI":"10.1093\/nar\/gkaa506","article-title":"scIGANs: single-cell RNA-seq imputation using generative adversarial networks","volume":"48","author":"Xu","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2024022006445019600_btae068-B30","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1186\/s13059-017-1305-0","article-title":"Splatter: simulation of single-cell RNA sequencing data","volume":"18","author":"Zappia","year":"2017","journal-title":"Genome Biol"},{"key":"2024022006445019600_btae068-B31","doi-asserted-by":"crossref","first-page":"727","DOI":"10.1016\/j.devcel.2023.03.011","article-title":"Understanding cell fate acquisition in stem-cell-derived pancreatic islets using single-cell multiome-inferred regulomes","volume":"58","author":"Zhu","year":"2023","journal-title":"Dev Cell"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae068\/56588217\/btae068.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/2\/btae068\/56708222\/btae068.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/2\/btae068\/56708222\/btae068.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,20]],"date-time":"2024-02-20T06:45:18Z","timestamp":1708411518000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae068\/7601322"}},"subtitle":[],"editor":[{"given":"Inanc","family":"Birol","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,2,1]]},"references-count":31,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae068","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,2,1]]},"published":{"date-parts":[[2024,2,1]]},"article-number":"btae068"}}