{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:05Z","timestamp":1772138045640,"version":"3.50.1"},"reference-count":56,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2021,5,11]],"date-time":"2021-05-11T00:00:00Z","timestamp":1620691200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"Andrew J. Viterbi Fellowship"},{"name":"Computational Biology and Bioinformatics"},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["DMS 2045327"],"award-info":[{"award-number":["DMS 2045327"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,10,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Methods to model dynamic changes in gene expression at a genome-wide level are not currently sufficient for large (temporally rich or single-cell) datasets. Variational autoencoders offer means to characterize large datasets and have been used effectively to characterize features of single-cell datasets. Here, we extend these methods for use with gene expression time series data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We present RVAgene: a recurrent variational autoencoder to model gene expression dynamics. RVAgene learns to accurately and efficiently reconstruct temporal gene profiles. It also learns a low dimensional representation of the data via a recurrent encoder network that can be used for biological feature discovery, and from which we can generate new gene expression data by sampling the latent space. We test RVAgene on simulated and real biological datasets, including embryonic stem cell differentiation and kidney injury response dynamics. In all cases, RVAgene accurately reconstructed complex gene expression temporal profiles. Via cross validation, we show that a low-error latent space representation can be learnt using only a fraction of the data. Through clustering and gene ontology term enrichment analysis on the latent space, we demonstrate the potential of RVAgene for unsupervised discovery. In particular, RVAgene identifies new programs of shared gene regulation of Lox family genes in response to kidney injury.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>All datasets analyzed in this manuscript are publicly available and have been published previously. RVAgene is available in Python, at GitHub: https:\/\/github.com\/maclean-lab\/RVAgene; Zenodo archive: http:\/\/doi.org\/10.5281\/zenodo.4271097.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab260","type":"journal-article","created":{"date-parts":[[2021,4,22]],"date-time":"2021-04-22T20:51:50Z","timestamp":1619124710000},"page":"3252-3262","source":"Crossref","is-referenced-by-count":27,"title":["RVAgene: generative modeling of gene expression time series data"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1182-3742","authenticated-orcid":false,"given":"Raktim","family":"Mitra","sequence":"first","affiliation":[{"name":"University of Southern California , Los Angeles, CA 90007, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0689-7907","authenticated-orcid":false,"given":"Adam L","family":"MacLean","sequence":"additional","affiliation":[{"name":"University of Southern California , Los Angeles, CA 90007, USA"}]}],"member":"286","published-online":{"date-parts":[[2021,5,11]]},"reference":[{"key":"2023051608274835700_btab260-B1","first-page":"119","author":"Ainsworth","year":"2018"},{"key":"2023051608274835700_btab260-B2","doi-asserted-by":"crossref","first-page":"4774","DOI":"10.1093\/bioinformatics\/btaa576","article-title":"Gene regulation inference from single-cell RNA-seq data with linear differential equations and velocity inference","volume":"36","author":"Aubin-Frankowski","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051608274835700_btab260-B3","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1007\/BF00993164","article-title":"Approximation and estimation bounds for artificial neural networks","volume":"14","author":"Barron","year":"1994","journal-title":"Mach. Learn"},{"key":"2023051608274835700_btab260-B4","author":"Botchkarev","year":"2018"},{"key":"2023051608274835700_btab260-B5","first-page":"280","author":"Bouchard-C\u00f4t\u00e9","year":"2010"},{"key":"2023051608274835700_btab260-B6","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1186\/s12859-018-2217-z","article-title":"Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data","volume":"19","author":"Chen","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023051608274835700_btab260-B7","first-page":"6571","author":"Chen","year":"2018"},{"key":"2023051608274835700_btab260-B8","doi-asserted-by":"crossref","first-page":"20170387","DOI":"10.1098\/rsif.2017.0387","article-title":"Opportunities and obstacles for deep learning in biology and medicine","volume":"15","author":"Ching","year":"2018","journal-title":"J. R. Soc. Interface"},{"key":"2023051608274835700_btab260-B9","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1186\/1471-2105-12-399","article-title":"Bayesian hierarchical clustering for microarray time series data with replicates and outlier measurements","volume":"12","author":"Cooke","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023051608274835700_btab260-B10","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/BF02551274","article-title":"Approximation by superpositions of a sigmoidal function","volume":"2","author":"Cybenko","year":"1989","journal-title":"Math. Control Sign. Syst"},{"key":"2023051608274835700_btab260-B11","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1038\/s41592-019-0353-7","article-title":"Scalable analysis of cell-type composition from single-cell transcriptomics using deep recurrent learning","volume":"16","author":"Deng","year":"2019","journal-title":"Nat. Methods"},{"key":"2023051608274835700_btab260-B12","doi-asserted-by":"crossref","DOI":"10.1101\/534834","article-title":"Network inference with granger causality ensembles on single-cell transcriptomic data","author":"Deshpande","year":"2019"},{"key":"2023051608274835700_btab260-B13","author":"Dilokthanakul","year":"2016"},{"key":"2023051608274835700_btab260-B14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-018-04368-5","article-title":"Interpretable dimensionality reduction of single cell transcriptome data with deep generative models","volume":"9","author":"Ding","year":"2018","journal-title":"Nat. Commun"},{"key":"2023051608274835700_btab260-B15","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-018-07931-2","article-title":"Single-cell RNA-seq denoising using a deep count autoencoder","volume":"10","author":"Eraslan","year":"2019","journal-title":"Nat. Commun"},{"key":"2023051608274835700_btab260-B16","author":"Fabius","year":"2014"},{"key":"2023051608274835700_btab260-B17","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1214\/aos\/1176342360","article-title":"A Bayesian analysis of some nonparametric problems","volume":"1","author":"Ferguson","year":"1973","journal-title":"Ann. Stat"},{"key":"2023051608274835700_btab260-B18","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/0893-6080(89)90003-8","article-title":"On the approximate realization of continuous mappings by neural networks","volume":"2","author":"Funahashi","year":"1989","journal-title":"Neural networks"},{"key":"2023051608274835700_btab260-B19","doi-asserted-by":"crossref","first-page":"845","DOI":"10.1038\/nmeth.3971","article-title":"Diffusion pseudotime robustly reconstructs lineage branching","volume":"13","author":"Haghverdi","year":"2016","journal-title":"Nat. Methods"},{"key":"2023051608274835700_btab260-B20","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/s41586-020-2649-2","article-title":"Array programming with numpy","volume":"585","author":"Harris","year":"2020","journal-title":"Nature"},{"key":"2023051608274835700_btab260-B21","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1186\/1471-2105-14-252","article-title":"Hierarchical Bayesian modelling of gene expression time series across irregularly sampled replicates and clusters","volume":"14","author":"Hensman","year":"2013","journal-title":"BMC Bioinformatics"},{"key":"2023051608274835700_btab260-B22","first-page":"IV","author":"Hershey","year":"2007"},{"key":"2023051608274835700_btab260-B23","author":"Higgins","year":"2016"},{"key":"2023051608274835700_btab260-B24","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1126\/science.1127647","article-title":"Reducing the dimensionality of data with neural networks","volume":"313","author":"Hinton","year":"2006","journal-title":"Science"},{"key":"2023051608274835700_btab260-B25","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput"},{"key":"2023051608274835700_btab260-B26","first-page":"1303","article-title":"Stochastic variational inference","volume":"14","author":"Hoffman","year":"2013","journal-title":"J. Mach. Learn. Res"},{"key":"2023051608274835700_btab260-B27","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1016\/0893-6080(89)90020-8","article-title":"Multilayer feedforward networks are universal approximators","volume":"2","author":"Hornik","year":"1989","journal-title":"Neural Netw"},{"key":"2023051608274835700_btab260-B28","first-page":"1607","author":"Ingraham","year":"2017"},{"key":"2023051608274835700_btab260-B29","doi-asserted-by":"crossref","first-page":"e20487","DOI":"10.7554\/eLife.20487","article-title":"Dynamics of embryonic stem cell differentiation inferred from single-cell transcriptomics show a series of transitions through discrete cell states","volume":"6","author":"Jang","year":"2017","journal-title":"eLife"},{"key":"2023051608274835700_btab260-B30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-018-07085-1","article-title":"An information-theoretic framework for deciphering pleiotropic and noisy biochemical signaling","volume":"9","author":"Jetka","year":"2018","journal-title":"Nat. Commun"},{"key":"2023051608274835700_btab260-B31","doi-asserted-by":"crossref","first-page":"e1","DOI":"10.1093\/nar\/gkaa1014","article-title":"TENET: gene network reconstruction using transfer entropy reveals key regulatory factors from single cell transcriptomic data","volume":"49","author":"Kim","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023051608274835700_btab260-B32","author":"Kingma","year":"2014"},{"key":"2023051608274835700_btab260-B33","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1016\/j.cell.2015.04.044","article-title":"Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells","volume":"161","author":"Klein","year":"2015","journal-title":"Cell"},{"key":"2023051608274835700_btab260-B34","doi-asserted-by":"crossref","first-page":"e156","DOI":"10.1093\/nar\/gkx681","article-title":"Using neural networks for reducing the dimensions of single-cell RNA-seq data","volume":"45","author":"Lin","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023051608274835700_btab260-B35","doi-asserted-by":"crossref","first-page":"e9471","DOI":"10.1172\/jci.insight.94716","article-title":"Molecular characterization of the transition from acute to chronic kidney injury following ischemia\/reperfusion","volume":"2","author":"Liu","year":"2017","journal-title":"JCI Insight"},{"key":"2023051608274835700_btab260-B36","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/s41592-018-0229-2","article-title":"Deep generative modeling for single-cell transcriptomics","volume":"15","author":"Lopez","year":"2018","journal-title":"Nat. Methods"},{"key":"2023051608274835700_btab260-B37","doi-asserted-by":"crossref","first-page":"4885","DOI":"10.1093\/bioinformatics\/btaa032","article-title":"Inference of gene regulatory networks based on nonlinear ordinary differential equations","volume":"36","author":"Ma","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051608274835700_btab260-B38","first-page":"89","author":"Malhotra","year":"2015"},{"key":"2023051608274835700_btab260-B39","doi-asserted-by":"crossref","first-page":"2314","DOI":"10.1093\/bioinformatics\/btx194","article-title":"SCODE: an efficient regulatory network inference algorithm from single-cell RNA-Seq during differentiation","volume":"33","author":"Matsumoto","year":"2017","journal-title":"Bioinformatics"},{"key":"2023051608274835700_btab260-B40","doi-asserted-by":"crossref","first-page":"e1005896","DOI":"10.1371\/journal.pcbi.1005896","article-title":"Clustering gene expression time series data using an infinite Gaussian process mixture model","volume":"14","author":"McDowell","year":"2018","journal-title":"PLoS Comput. Biol"},{"key":"2023051608274835700_btab260-B41","author":"Nallapati","year":"2016"},{"key":"2023051608274835700_btab260-B42","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1681\/ASN.V112319","article-title":"Effect of gender on the progression of nondiabetic renal disease: a meta-analysis","volume":"11","author":"Neugarten","year":"2000","journal-title":"J. Am. Soc. Nephrol"},{"key":"2023051608274835700_btab260-B43","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/j.cell.2008.09.050","article-title":"Nature, nurture, or chance: stochastic gene expression and its consequences","volume":"135","author":"Raj","year":"2008","journal-title":"Cell"},{"key":"2023051608274835700_btab260-B44","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1016\/j.devcel.2019.10.005","article-title":"Single-cell profiling reveals sex, lineage, and regional diversity in the mouse kidney","volume":"51","author":"Ransick","year":"2019","journal-title":"Dev. Cell"},{"key":"2023051608274835700_btab260-B45","first-page":"5321","author":"Rubanova","year":"2019"},{"key":"2023051608274835700_btab260-B46","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/323533a0","article-title":"Learning representations by back-propagating errors","volume":"323","author":"Rumelhart","year":"1986","journal-title":"Nature"},{"key":"2023051608274835700_btab260-B47","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1038\/s41587-019-0071-9","article-title":"A comparison of single-cell trajectory inference methods","volume":"37","author":"Saelens","year":"2019","journal-title":"Nat. Biotechnol"},{"key":"2023051608274835700_btab260-B48","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1038\/nprot.2017.149","article-title":"Exponential scaling of single-cell RNA-seq in the past decade","volume":"13","author":"Svensson","year":"2018","journal-title":"Nat. Protoc"},{"key":"2023051608274835700_btab260-B49","doi-asserted-by":"crossref","first-page":"3418","DOI":"10.1093\/bioinformatics\/btaa169","article-title":"Interpretable factor models of single-cell RNA-seq via variational autoencoders","volume":"36","author":"Svensson","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051608274835700_btab260-B50","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-018-34688-x","article-title":"Autoimpute: autoencoder based imputation of single-cell RNA-seq data","volume":"8","author":"Talwar","year":"2018","journal-title":"Sci. Rep"},{"key":"2023051608274835700_btab260-B51","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1016\/j.gpb.2018.08.003","article-title":"VASC: dimension reduction and visualization of single-cell RNA-seq data by deep variational autoencoder","volume":"16","author":"Wang","year":"2018","journal-title":"Genomics Proteomics Bioinf"},{"key":"2023051608274835700_btab260-B52","doi-asserted-by":"crossref","first-page":"875","DOI":"10.1038\/s41592-019-0537-1","article-title":"Data denoising with transfer learning in single-cell transcriptomics","volume":"16","author":"Wang","year":"2019","journal-title":"Nat. Methods"},{"key":"2023051608274835700_btab260-B53","doi-asserted-by":"crossref","DOI":"10.1101\/174474","article-title":"Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders","author":"Way","year":"2017"},{"key":"2023051608274835700_btab260-B54","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1016\/j.jbi.2018.05.016","article-title":"Modeling asynchronous event sequences with RNNs","volume":"83","author":"Wu","year":"2018","journal-title":"J. Biomed. Inf"},{"key":"2023051608274835700_btab260-B55","doi-asserted-by":"crossref","first-page":"2008","DOI":"10.1109\/TPAMI.2018.2889774","article-title":"Advances in variational inference","volume":"41","author":"Zhang","year":"2019","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023051608274835700_btab260-B56","doi-asserted-by":"crossref","first-page":"466","DOI":"10.1073\/pnas.1817715116","article-title":"Semisoft clustering of single-cell data","volume":"116","author":"Zhu","year":"2019","journal-title":"Proc. Natl. Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab260\/39270725\/btab260.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3252\/50338443\/btab260.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3252\/50338443\/btab260.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T04:43:46Z","timestamp":1684212226000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/19\/3252\/6273572"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,5,11]]},"references-count":56,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2021,10,11]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab260","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.11.10.375436","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,10,1]]},"published":{"date-parts":[[2021,5,11]]}}}