{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T16:55:04Z","timestamp":1776185704693,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2021,12,7]],"date-time":"2021-12-07T00:00:00Z","timestamp":1638835200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,2,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Recent advancements in single-cell RNA sequencing (scRNA-seq) have enabled time-efficient transcriptome profiling in individual cells. To optimize sequencing protocols and develop reliable analysis methods for various application scenarios, solid simulation methods for scRNA-seq data are required. However, due to the noisy nature of scRNA-seq data, currently available simulation methods cannot sufficiently capture and simulate important properties of real data, especially the biological variation. In this study, we developed scRNA-seq information producer (SCRIP), a novel simulator for scRNA-seq that is accurate and enables simulation of bursting kinetics.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Compared to existing simulators, SCRIP showed a significantly higher accuracy of stimulating key data features, including mean\u2013variance dependency in all experiments. SCRIP also outperformed other methods in recovering cell\u2013cell distances. The application of SCRIP in evaluating differential expression analysis methods showed that edgeR outperformed other examined methods in differential expression analyses, and ZINB-WaVE improved the AUC at high dropout rates. Collectively, this study provides the research community with a rigorous tool for scRNA-seq data simulation.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>https:\/\/CRAN.R-project.org\/package=SCRIP.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab824","type":"journal-article","created":{"date-parts":[[2021,12,1]],"date-time":"2021-12-01T20:22:18Z","timestamp":1638390138000},"page":"1304-1311","source":"Crossref","is-referenced-by-count":12,"title":["SCRIP: an accurate simulator for single-cell RNA sequencing data"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3678-2879","authenticated-orcid":false,"given":"Fei","family":"Qin","sequence":"first","affiliation":[{"name":"Department of Epidemiology and Biostatistics, Arnold School of Public Health, University of South Carolina , Columbia, SC 29208, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2787-8185","authenticated-orcid":false,"given":"Xizhi","family":"Luo","sequence":"additional","affiliation":[{"name":"Department of Epidemiology and Biostatistics, Arnold School of Public Health, University of South Carolina , Columbia, SC 29208, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1597-4719","authenticated-orcid":false,"given":"Feifei","family":"Xiao","sequence":"additional","affiliation":[{"name":"Department of Epidemiology and Biostatistics, Arnold School of Public Health, University of South Carolina , Columbia, SC 29208, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guoshuai","family":"Cai","sequence":"additional","affiliation":[{"name":"Department of Environmental Health Sciences, Arnold School of Public Health, University of South Carolina , Columbia, SC 29208, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2021,12,7]]},"reference":[{"key":"2023020108550038200_btab824-B1","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/j.biocel.2017.07.006","article-title":"Delineating biological and technical variance in single cell expression data","volume":"90","author":"Arzalluz-Luque","year":"2017","journal-title":"Int. J. Biochem. Cell Biol"},{"key":"2023020108550038200_btab824-B2","doi-asserted-by":"crossref","first-page":"1468","DOI":"10.1093\/bioinformatics\/btz752","article-title":"SPARSim single cell: a count data simulator for scRNA-seq data","volume":"36","author":"Baruzzo","year":"2020","journal-title":"Bioinformatics"},{"key":"2023020108550038200_btab824-B4","doi-asserted-by":"crossref","first-page":"2131","DOI":"10.1093\/bioinformatics\/btv124","article-title":"SimSeq: a nonparametric approach to simulation of RNA-sequence datasets","volume":"31","author":"Benidt","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020108550038200_btab824-B5","doi-asserted-by":"crossref","first-page":"S5","DOI":"10.1186\/1471-2105-13-S13-S5","article-title":"Accuracy of RNA-Seq and its dependence on sequencing depth","volume":"13","author":"Cai","year":"2012","journal-title":"BMC Bioinform"},{"key":"2023020108550038200_btab824-B6","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1186\/s12859-017-1780-z","article-title":"Local sequence and sequencing depth dependent accuracy of RNA-seq reads","volume":"18","author":"Cai","year":"2017","journal-title":"BMC Bioinform"},{"key":"2023020108550038200_btab824-B7","doi-asserted-by":"crossref","first-page":"15672","DOI":"10.1073\/pnas.1520760112","article-title":"Human cerebral organoids recapitulate gene expression programs of fetal neocortex development","volume":"112","author":"Camp","year":"2015","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020108550038200_btab824-B8","doi-asserted-by":"crossref","first-page":"3942","DOI":"10.1038\/s41467-021-24152-2","article-title":"Spearheading future omics analyses using dyngen: a multi-modal simulator of single cells","volume":"12","author":"Cannoodt","year":"2021","journal-title":"Nat. Commun"},{"key":"2023020108550038200_btab824-B9","doi-asserted-by":"crossref","first-page":"1018","DOI":"10.1016\/j.cub.2006.03.092","article-title":"Transcriptional pulsing of a developmental gene","volume":"16","author":"Chubb","year":"2006","journal-title":"Curr. Biol"},{"key":"2023020108550038200_btab824-B10","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1016\/j.cels.2020.08.003","article-title":"SERGIO: a single-cell expression simulator guided by gene regulatory networks","volume":"11","author":"Dibaeinia","year":"2020","journal-title":"Cell Syst"},{"key":"2023020108550038200_btab824-B11","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1186\/s13059-015-0683-4","article-title":"Deep sequencing reveals cell-type-specific patterns of single-cell transcriptome variation","volume":"16","author":"Dueck","year":"2015","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B12","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/s13059-015-0844-5","article-title":"MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B13","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1186\/s12859-020-3450-9","article-title":"Data-based RNA-seq simulations by binomial thinning","volume":"21","author":"Gerard","year":"2020","journal-title":"BMC Bioinform"},{"key":"2023020108550038200_btab824-B14","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/j.mam.2017.07.003","article-title":"Single-cell RNA sequencing: technical advancements and biological applications","volume":"59","author":"Hedlund","year":"2018","journal-title":"Mol. Aspects Med"},{"key":"2023020108550038200_btab824-B15","first-page":"735","article-title":"Comparison of methods to detect differentially expressed genes between single-cell populations","volume":"18","author":"Jaakkola","year":"2017","journal-title":"Brief. Bioinform"},{"key":"2023020108550038200_btab824-B16","doi-asserted-by":"crossref","first-page":"R7","DOI":"10.1186\/gb-2013-14-1-r7","article-title":"Inferring the kinetics of stochastic gene expression from single-cell RNA-sequencing data","volume":"14","author":"Kim","year":"2013","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B17","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1016\/j.cell.2015.04.044","article-title":"Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells","volume":"161","author":"Klein","year":"2015","journal-title":"Cell"},{"key":"2023020108550038200_btab824-B18","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1186\/s13059-016-1077-y","article-title":"A statistical approach for identifying differential distributions in single-cell RNA-seq experiments","volume":"17","author":"Korthauer","year":"2016","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B19","doi-asserted-by":"crossref","first-page":"R29","DOI":"10.1186\/gb-2014-15-2-r29","article-title":"voom: precision weights unlock linear model analysis tools for RNA-seq read counts","volume":"15","author":"Law","year":"2014","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B20","doi-asserted-by":"crossref","first-page":"i41","DOI":"10.1093\/bioinformatics\/btz321","article-title":"A statistical simulator scDesign for rational scRNA-seq experimental design","volume":"35","author":"Li","year":"2019","journal-title":"Bioinformatics"},{"key":"2023020108550038200_btab824-B21","doi-asserted-by":"crossref","first-page":"555","DOI":"10.1007\/s13253-015-0227-0","article-title":"A semi-parametric Bayesian approach for differential expression analysis of RNA-seq data","volume":"20","author":"Liu","year":"2015","journal-title":"J. Agric. Biol. Environ. Stat"},{"key":"2023020108550038200_btab824-B22","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B23","doi-asserted-by":"crossref","first-page":"4288","DOI":"10.1093\/nar\/gks042","article-title":"Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation","volume":"40","author":"McCarthy","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023020108550038200_btab824-B24","doi-asserted-by":"crossref","first-page":"eaav2501","DOI":"10.1126\/science.aav2501","article-title":"Identification of a mesenchymal progenitor cell hierarchy in adipose tissue","volume":"364","author":"Merrick","year":"2019","journal-title":"Science"},{"key":"2023020108550038200_btab824-B25","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1038\/nri.2017.76","article-title":"Single-cell RNA sequencing to explore immune cell heterogeneity","volume":"18","author":"Papalexi","year":"2018","journal-title":"Nat. Rev. Immunol"},{"key":"2023020108550038200_btab824-B26","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/j.cell.2008.09.050","article-title":"Nature, nurture, or chance: stochastic gene expression and its consequences","volume":"135","author":"Raj","year":"2008","journal-title":"Cell"},{"key":"2023020108550038200_btab824-B27","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020108550038200_btab824-B28","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1038\/nbt.3192","article-title":"Spatial reconstruction of single-cell gene expression data","volume":"33","author":"Satija","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023020108550038200_btab824-B29","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1038\/nrg3542","article-title":"Single-cell sequencing-based technologies will revolutionize whole-organism science","volume":"14","author":"Shapiro","year":"2013","journal-title":"Nat. Rev. Genet"},{"key":"2023020108550038200_btab824-B30","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1186\/1471-2105-14-91","article-title":"A comparison of methods for differential expression analysis of RNA-seq data","volume":"14","author":"Soneson","year":"2013","journal-title":"BMC Bioinform"},{"key":"2023020108550038200_btab824-B31","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1038\/nmeth.4612","article-title":"Bias, robustness and scalability in single-cell differential expression analysis","volume":"15","author":"Soneson","year":"2018","journal-title":"Nat. Methods"},{"key":"2023020108550038200_btab824-B32","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1016\/j.stem.2019.11.010","article-title":"Arterial Sca1(+) vascular stem cells generate de novo smooth muscle for artery repair and regeneration","volume":"26","author":"Tang","year":"2020","journal-title":"Cell Stem Cell"},{"key":"2023020108550038200_btab824-B33","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1126\/science.aad0501","article-title":"Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq","volume":"352","author":"Tirosh","year":"2016","journal-title":"Science"},{"key":"2023020108550038200_btab824-B34","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1038\/nbt.2859","article-title":"The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells","volume":"32","author":"Trapnell","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023020108550038200_btab824-B35","doi-asserted-by":"crossref","first-page":"39921","DOI":"10.1038\/srep39921","article-title":"Batch effects and the effective design of single-cell gene expression studies","volume":"7","author":"Tung","year":"2017","journal-title":"Sci. Rep"},{"key":"2023020108550038200_btab824-B36","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1186\/s13059-018-1406-4","article-title":"Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications","volume":"19","author":"Van den Berge","year":"2018","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B37","doi-asserted-by":"crossref","first-page":"3486","DOI":"10.1093\/bioinformatics\/btx435","article-title":"powsimR: power analysis for bulk and single cell RNA-seq experiments","volume":"33","author":"Vieth","year":"2017","journal-title":"Bioinformatics"},{"key":"2023020108550038200_btab824-B38","doi-asserted-by":"crossref","first-page":"2128","DOI":"10.1093\/bioinformatics\/btw202","article-title":"Beta-Poisson model for single-cell RNA-seq data analyses","volume":"32","author":"Vu","year":"2016","journal-title":"Bioinformatics"},{"key":"2023020108550038200_btab824-B39","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1186\/s12859-019-2599-6","article-title":"Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data","volume":"20","author":"Wang","year":"2019","journal-title":"BMC Bioinform"},{"key":"2023020108550038200_btab824-B40","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1093\/biostatistics\/kxs033","article-title":"A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data","volume":"14","author":"Wu","year":"2013","journal-title":"Biostatistics"},{"key":"2023020108550038200_btab824-B41","doi-asserted-by":"crossref","first-page":"608","DOI":"10.1016\/j.cmet.2016.08.018","article-title":"RNA sequencing of single human islet cells reveals type 2 diabetes genes","volume":"24","author":"Xin","year":"2016","journal-title":"Cell Metab"},{"key":"2023020108550038200_btab824-B42","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1186\/s13059-017-1305-0","article-title":"Splatter: simulation of single-cell RNA sequencing data","volume":"18","author":"Zappia","year":"2017","journal-title":"Genome Biol"},{"key":"2023020108550038200_btab824-B43","doi-asserted-by":"crossref","first-page":"2611","DOI":"10.1038\/s41467-019-10500-w","article-title":"Simulating multiple faceted variability in single cell RNA sequencing","volume":"10","author":"Zhang","year":"2019","journal-title":"Nat. Commun"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab824\/41764721\/btab824.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/5\/1304\/49008416\/btab824.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/5\/1304\/49008416\/btab824.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,13]],"date-time":"2023-11-13T13:08:53Z","timestamp":1699880933000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/5\/1304\/6454945"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2021,12,7]]},"references-count":42,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,2,7]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab824","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,3,1]]},"published":{"date-parts":[[2021,12,7]]}}}