{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,15]],"date-time":"2026-04-15T08:00:34Z","timestamp":1776240034757,"version":"3.50.1"},"reference-count":52,"publisher":"Oxford University Press (OUP)","issue":"14","license":[{"start":{"date-parts":[[2019,7,8]],"date-time":"2019-07-08T00:00:00Z","timestamp":1562544000000},"content-version":"vor","delay-in-days":7,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["DMS-1613338"],"award-info":[{"award-number":["DMS-1613338"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["DBI-1846216"],"award-info":[{"award-number":["DBI-1846216"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01GM120507"],"award-info":[{"award-number":["R01GM120507"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Single-cell RNA sequencing (scRNA-seq) has revolutionized biological sciences by revealing genome-wide gene expression levels within individual cells. However, a critical challenge faced by researchers is how to optimize the choices of sequencing platforms, sequencing depths and cell numbers in designing scRNA-seq experiments, so as to balance the exploration of the depth and breadth of transcriptome information.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here we present a flexible and robust simulator, scDesign, the first statistical framework for researchers to quantitatively assess practical scRNA-seq experimental design in the context of differential gene expression analysis. In addition to experimental design, scDesign also assists computational method development by generating high-quality synthetic scRNA-seq datasets under customized experimental settings. In an evaluation based on 17 cell types and 6 different protocols, scDesign outperformed four state-of-the-art scRNA-seq simulation methods and led to rational experimental design. In addition, scDesign demonstrates reproducibility across biological replicates and independent studies. We also discuss the performance of multiple differential expression and dimension reduction methods based on the protocol-dependent scRNA-seq data generated by scDesign. scDesign is expected to be an effective bioinformatic tool that assists rational scRNA-seq experimental design and comparison of scRNA\u2013seq computational methods based on specific research goals.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>We have implemented our method in the R package scDesign, which is freely available at https:\/\/github.com\/Vivianstats\/scDesign.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz321","type":"journal-article","created":{"date-parts":[[2019,5,9]],"date-time":"2019-05-09T15:21:53Z","timestamp":1557415313000},"page":"i41-i50","source":"Crossref","is-referenced-by-count":72,"title":["A statistical simulator scDesign for rational scRNA-seq experimental design"],"prefix":"10.1093","volume":"35","author":[{"given":"Wei Vivian","family":"Li","sequence":"first","affiliation":[{"name":"Department of Statistics, University of California, Los Angeles, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9288-5648","authenticated-orcid":false,"given":"Jingyi Jessica","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Statistics, University of California, Los Angeles, CA, USA"},{"name":"Department of Human Genetics, University of California, Los Angeles, CA, USA"}]}],"member":"286","published-online":{"date-parts":[[2019,7,5]]},"reference":[{"key":"2023062712385672500_btz321-B1","doi-asserted-by":"crossref","first-page":"D781","DOI":"10.1093\/nar\/gkx949","article-title":"SCPortalen: human and mouse single-cell centric database","volume":"46","author":"Abugessaisa","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023062712385672500_btz321-B2","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1186\/s13059-016-0927-y","article-title":"Design and computational analysis of single-cell RNA-sequencing experiments","volume":"17","author":"Bacher","year":"2016","journal-title":"Genome Biol"},{"key":"2023062712385672500_btz321-B3","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1093\/bfgp\/elx035","article-title":"Experimental design for single-cell RNA sequencing","volume":"17","author":"Baran-Gale","year":"2017","journal-title":"Brief. Funct. Genomics"},{"key":"2023062712385672500_btz321-B4","doi-asserted-by":"crossref","first-page":"411.","DOI":"10.1038\/nbt.4096","article-title":"Integrating single-cell transcriptomic data across different conditions, technologies, and species","volume":"36","author":"Butler","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023062712385672500_btz321-B5","volume-title":"Single Cell Biology, Breckenridge, CO","author":"Cannoodt","year":"2019"},{"key":"2023062712385672500_btz321-B6","doi-asserted-by":"crossref","first-page":"368.","DOI":"10.3390\/genes8120368","article-title":"scRNASeqDB: a database for RNA-seq based gene expression profiles in human single cells","volume":"8","author":"Cao","year":"2017","journal-title":"Genes"},{"key":"2023062712385672500_btz321-B7","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1007\/978-1-4939-7834-2_3","volume-title":"Gene Expression Analysis","author":"Chatterjee","year":"2018"},{"key":"2023062712385672500_btz321-B8","doi-asserted-by":"crossref","first-page":"7285","DOI":"10.1073\/pnas.1507125112","article-title":"A survey of human brain transcriptome diversity at the single cell level","volume":"112","author":"Darmanis","year":"2015","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023062712385672500_btz321-B9","author":"Dumitrascu","year":"2018"},{"key":"2023062712385672500_btz321-B10","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1093\/nar\/30.1.207","article-title":"Gene Expression Omnibus: NCBI gene expression and hybridization array data repository","volume":"30","author":"Edgar","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2023062712385672500_btz321-B11","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1186\/s13059-015-0844-5","article-title":"Mast: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2023062712385672500_btz321-B12","doi-asserted-by":"crossref","first-page":"395.","DOI":"10.1038\/nmeth.4179","article-title":"Seq-well: portable, low-cost RNA sequencing of single cells at high throughput","volume":"14","author":"Gierahn","year":"2017","journal-title":"Nat. Methods"},{"key":"2023062712385672500_btz321-B13","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1016\/j.cell.2015.10.039","article-title":"Design and analysis of single-cell sequencing experiments","volume":"163","author":"Gr\u00fcn","year":"2015","journal-title":"Cell"},{"key":"2023062712385672500_btz321-B14","doi-asserted-by":"crossref","first-page":"421.","DOI":"10.1038\/nbt.4091","article-title":"Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors","volume":"36","author":"Haghverdi","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023062712385672500_btz321-B15","doi-asserted-by":"crossref","first-page":"75.","DOI":"10.1186\/s13073-017-0467-4","article-title":"A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications","volume":"9","author":"Haque","year":"2017","journal-title":"Genome Med"},{"key":"2023062712385672500_btz321-B16","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1016\/S0893-6080(00)00026-5","article-title":"Independent component analysis: algorithms and applications","volume":"13","author":"Hyv\u00e4rinen","year":"2000","journal-title":"Neural Netw"},{"key":"2023062712385672500_btz321-B17","doi-asserted-by":"crossref","first-page":"776","DOI":"10.1126\/science.1247651","article-title":"Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types","volume":"343","author":"Jaitin","year":"2014","journal-title":"Science"},{"key":"2023062712385672500_btz321-B18","doi-asserted-by":"crossref","first-page":"1543","DOI":"10.1101\/gr.121095.111","article-title":"Synthetic spike-in standards for RNA-seq experiments","volume":"21","author":"Jiang","year":"2011","journal-title":"Genome Res"},{"key":"2023062712385672500_btz321-B19","doi-asserted-by":"crossref","first-page":"740","DOI":"10.1038\/nmeth.2967","article-title":"Bayesian approach to single-cell differential expression analysis","volume":"11","author":"Kharchenko","year":"2014","journal-title":"Nat. Methods"},{"key":"2023062712385672500_btz321-B20","doi-asserted-by":"crossref","first-page":"72.","DOI":"10.1038\/nmeth.1778","article-title":"Counting absolute numbers of molecules using unique molecular identifiers","volume":"9","author":"Kivioja","year":"2012","journal-title":"Nat. Methods"},{"key":"2023062712385672500_btz321-B21","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1016\/j.cell.2015.04.044","article-title":"Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells","volume":"161","author":"Klein","year":"2015","journal-title":"Cell"},{"key":"2023062712385672500_btz321-B22","doi-asserted-by":"crossref","first-page":"610","DOI":"10.1016\/j.molcel.2015.04.005","article-title":"The technology and biology of single-cell RNA sequencing","volume":"58","author":"Kolodziejczyk","year":"2015","journal-title":"Mol. Cell"},{"key":"2023062712385672500_btz321-B23","doi-asserted-by":"crossref","first-page":"222.","DOI":"10.1186\/s13059-016-1077-y","article-title":"A statistical approach for identifying differential distributions in single-cell RNA-seq experiments","volume":"17","author":"Korthauer","year":"2016","journal-title":"Genome Biol"},{"key":"2023062712385672500_btz321-B24","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1038\/s41467-018-03405-7","article-title":"An accurate and robust imputation method scimpute for single-cell RNA-seq data","volume":"9","author":"Li","year":"2018","journal-title":"Nat. Commun"},{"key":"2023062712385672500_btz321-B25","author":"Li","year":"2018"},{"key":"2023062712385672500_btz321-B26","doi-asserted-by":"crossref","first-page":"550.","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023062712385672500_btz321-B27","doi-asserted-by":"crossref","first-page":"75.","DOI":"10.1186\/s13059-016-0947-7","article-title":"Pooling across cells to normalize single-cell RNA sequencing data with many zero counts","volume":"17","author":"Lun","year":"2016","journal-title":"Genome Biol"},{"key":"2023062712385672500_btz321-B28","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Maaten","year":"2008","journal-title":"J. Mach. Learn. Res"},{"key":"2023062712385672500_btz321-B29","doi-asserted-by":"crossref","first-page":"1202","DOI":"10.1016\/j.cell.2015.05.002","article-title":"Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets","volume":"161","author":"Macosko","year":"2015","journal-title":"Cell"},{"key":"2023062712385672500_btz321-B30","doi-asserted-by":"crossref","first-page":"1326","DOI":"10.1126\/science.aaf6463","article-title":"Oligodendrocyte heterogeneity in the mouse juvenile and adult central nervous system","volume":"352","author":"Marques","year":"2016","journal-title":"Science"},{"key":"2023062712385672500_btz321-B31","doi-asserted-by":"crossref","first-page":"1179","DOI":"10.1093\/bioinformatics\/btw777","article-title":"Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R","volume":"33","author":"McCarthy","year":"2017","journal-title":"Bioinformatics"},{"key":"2023062712385672500_btz321-B32","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btz078","article-title":"PROSSTT: probabilistic simulation of single-cell RNA-seq data for complex differentiation processes","author":"Papadopoulos","year":"2019","journal-title":"Bioinformatics"},{"key":"2023062712385672500_btz321-B33","doi-asserted-by":"crossref","first-page":"1096.","DOI":"10.1038\/nmeth.2639","article-title":"Smart-seq2 for sensitive full-length transcriptome profiling in single cells","volume":"10","author":"Picelli","year":"2013","journal-title":"Nat. Methods"},{"key":"2023062712385672500_btz321-B34","doi-asserted-by":"crossref","first-page":"241.","DOI":"10.1186\/s13059-015-0805-z","article-title":"ZIFA: dimensionality reduction for zero-inflated single-cell gene expression analysis","volume":"16","author":"Pierson","year":"2015","journal-title":"Genome Biol"},{"key":"2023062712385672500_btz321-B35","doi-asserted-by":"crossref","first-page":"1053.","DOI":"10.1038\/nbt.2967","article-title":"Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex","volume":"32","author":"Pollen","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023062712385672500_btz321-B36","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511806384","volume-title":"Experimental Design and Data Analysis for Biologists","author":"Quinn","year":"2002"},{"key":"2023062712385672500_btz321-B37","doi-asserted-by":"crossref","first-page":"284.","DOI":"10.1038\/s41467-017-02554-5","article-title":"A general and flexible method for signal extraction from single-cell RNA-seq data","volume":"9","author":"Risso","year":"2018","journal-title":"Nat. Commun"},{"key":"2023062712385672500_btz321-B38","doi-asserted-by":"crossref","first-page":"12781.","DOI":"10.1038\/s41598-017-12989-x","article-title":"Impact of sequencing depth and read length on single cell RNA sequencing data of t cells","volume":"7","author":"Rizzetto","year":"2017","journal-title":"Sci. Rep"},{"key":"2023062712385672500_btz321-B39","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edger: a bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062712385672500_btz321-B40","doi-asserted-by":"crossref","first-page":"1308","DOI":"10.1016\/j.cell.2016.07.054","article-title":"Comprehensive classification of retinal bipolar neurons by single-cell transcriptomics","volume":"166","author":"Shekhar","year":"2016","journal-title":"Cell"},{"key":"2023062712385672500_btz321-B41","doi-asserted-by":"crossref","first-page":"825","DOI":"10.1038\/s41588-018-0129-5","article-title":"Genetic identification of brain cell types underlying schizophrenia","volume":"50","author":"Skene","year":"2018","journal-title":"Nat. Genet"},{"key":"2023062712385672500_btz321-B42","doi-asserted-by":"crossref","first-page":"381.","DOI":"10.1038\/nmeth.4220","article-title":"Power analysis of single-cell RNA-sequencing experiments","volume":"14","author":"Svensson","year":"2017","journal-title":"Nat. Methods"},{"key":"2023062712385672500_btz321-B43","doi-asserted-by":"crossref","first-page":"377.","DOI":"10.1038\/nmeth.1315","article-title":"mRNA-seq whole-transcriptome analysis of a single cell","volume":"6","author":"Tang","year":"2009","journal-title":"Nat. Methods"},{"key":"2023062712385672500_btz321-B44","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1126\/science.aad0501","article-title":"Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq","volume":"352","author":"Tirosh","year":"2016","journal-title":"Science"},{"key":"2023062712385672500_btz321-B45","author":"Van den Berge","year":"2017"},{"key":"2023062712385672500_btz321-B46","doi-asserted-by":"crossref","first-page":"3486","DOI":"10.1093\/bioinformatics\/btx435","article-title":"powsimr: power analysis for bulk and single cell RNA-seq experiments","volume":"33","author":"Vieth","year":"2017","journal-title":"Bioinformatics"},{"key":"2023062712385672500_btz321-B47","doi-asserted-by":"crossref","first-page":"1145.","DOI":"10.1038\/nbt.3711","article-title":"Revealing the vectors of cellular identity with single-cell genomics","volume":"34","author":"Wagner","year":"2016","journal-title":"Nat. Biotechnol"},{"key":"2023062712385672500_btz321-B48","doi-asserted-by":"crossref","first-page":"e0150606.","DOI":"10.1371\/journal.pone.0150606","article-title":"A protocol for the comprehensive flow cytometric analysis of immune cells in normal and inflamed murine non-lymphoid tissues","volume":"11","author":"Yen-Rei","year":"2016","journal-title":"PLoS One"},{"key":"2023062712385672500_btz321-B49","doi-asserted-by":"crossref","first-page":"174.","DOI":"10.1186\/s13059-017-1305-0","article-title":"Splatter: simulation of single-cell RNA sequencing data","volume":"18","author":"Zappia","year":"2017","journal-title":"Genome Biol"},{"key":"2023062712385672500_btz321-B50","author":"Zhang","year":"2018"},{"key":"2023062712385672500_btz321-B51","doi-asserted-by":"crossref","first-page":"14049","DOI":"10.1038\/ncomms14049","article-title":"Massively parallel digital transcriptional profiling of single cells","volume":"8","author":"Zheng","year":"2017","journal-title":"Nat. Commun"},{"key":"2023062712385672500_btz321-B52","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1016\/j.molcel.2017.01.023","article-title":"Comparative analysis of single-cell RNA sequencing methods","volume":"65","author":"Ziegenhain","year":"2017","journal-title":"Mol. Cell"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/14\/i41\/50721496\/bioinformatics_35_14_i41.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/14\/i41\/50721496\/bioinformatics_35_14_i41.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T08:39:59Z","timestamp":1687855199000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/14\/i41\/5529133"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,7]]},"references-count":52,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2019,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz321","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/437095","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,7]]},"published":{"date-parts":[[2019,7]]}}}