{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,26]],"date-time":"2026-06-26T16:01:34Z","timestamp":1782489694314,"version":"3.54.5"},"reference-count":22,"publisher":"Oxford University Press (OUP)","issue":"14","license":[{"start":{"date-parts":[[2016,4,19]],"date-time":"2016-04-19T00:00:00Z","timestamp":1461024000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Swedish Science Council"},{"DOI":"10.13039\/100012538","name":"Swedish Cancer Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100012538","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Motivation: Single-cell RNA-sequencing technology allows detection of gene expression at the single-cell level. One typical feature of the data is a bimodality in the cellular distribution even for highly expressed genes, primarily caused by a proportion of non-expressing cells. The standard and the over-dispersed gamma-Poisson models that are commonly used in bulk-cell RNA-sequencing are not able to capture this property.<\/jats:p>\n                  <jats:p>Results: We introduce a beta-Poisson mixture model that can capture the bimodality of the single-cell gene expression distribution. We further integrate the model into the generalized linear model framework in order to perform differential expression analyses. The whole analytical procedure is called BPSC. The results from several real single-cell RNA-seq datasets indicate that \u223c90% of the transcripts are well characterized by the beta-Poisson model; the model-fit from BPSC is better than the fit of the standard gamma-Poisson model in\u2009&amp;gt;\u200980% of the transcripts. Moreover, in differential expression analyses of simulated and real datasets, BPSC performs well against edgeR, a conventional method widely used in bulk-cell RNA-sequencing data, and against scde and MAST, two recent methods specifically designed for single-cell RNA-seq data.<\/jats:p>\n                  <jats:p>Availability and Implementation: An R package BPSC for model fitting and differential expression analyses of single-cell RNA-seq data is available under GPL-3 license at https:\/\/github.com\/nghiavtr\/BPSC .<\/jats:p>\n                  <jats:p>Contact: \u00a0yudi.pawitan@ki.se or mattias.rantalainen@ki.se<\/jats:p>\n                  <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw202","type":"journal-article","created":{"date-parts":[[2016,4,19]],"date-time":"2016-04-19T21:30:19Z","timestamp":1461101419000},"page":"2128-2135","source":"Crossref","is-referenced-by-count":160,"title":["Beta-Poisson model for single-cell RNA-seq data analyses"],"prefix":"10.1093","volume":"32","author":[{"given":"Trung Nghia","family":"Vu","sequence":"first","affiliation":[{"name":"1 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, 17177 Stockholm, Sweden"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Quin F.","family":"Wills","sequence":"additional","affiliation":[{"name":"2 Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK"},{"name":"3 Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Krishna R.","family":"Kalari","sequence":"additional","affiliation":[{"name":"4 Department of Health Sciences Research, Mayo Clinic, Rochester, MN 55905, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nifang","family":"Niu","sequence":"additional","affiliation":[{"name":"5 Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Liewei","family":"Wang","sequence":"additional","affiliation":[{"name":"5 Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mattias","family":"Rantalainen","sequence":"additional","affiliation":[{"name":"1 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, 17177 Stockholm, Sweden"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yudi","family":"Pawitan","sequence":"additional","affiliation":[{"name":"1 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, 17177 Stockholm, Sweden"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2016,4,19]]},"reference":[{"key":"2026032300460881000_btw202-B1","doi-asserted-by":"crossref","first-page":"1765","DOI":"10.1038\/nprot.2013.099","article-title":"Count-based differential expression analysis of RNA sequencing data using R and Bioconductor","volume":"8","author":"Anders","year":"2013","journal-title":"Nat. Protoc"},{"key":"2026032300460881000_btw202-B2","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1093\/bioinformatics\/btu638","article-title":"HTSeqa Python framework to work with high-throughput sequencing data","volume":"31","author":"Anders","year":"2015","journal-title":"Bioinformatics"},{"key":"2026032300460881000_btw202-B3","doi-asserted-by":"crossref","first-page":"D472","DOI":"10.1093\/nar\/gkt1102","article-title":"The Reactome pathway knowledgebase","volume":"42","author":"Croft","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2026032300460881000_btw202-B4","first-page":"btv007","article-title":"Inferring single-cell gene expression mechanisms using stochastic simulation","author":"Daigle","year":"2015","journal-title":"Bioinformatics"},{"key":"2026032300460881000_btw202-B5","doi-asserted-by":"crossref","first-page":"278.","DOI":"10.1186\/s13059-015-0844-5","article-title":"MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2026032300460881000_btw202-B6","volume-title":"Introduction to Numerical Analysis. 2nd edition","author":"Hildebrand","year":"1974"},{"key":"2026032300460881000_btw202-B7","doi-asserted-by":"crossref","first-page":"740","DOI":"10.1038\/nmeth.2967","article-title":"Bayesian approach to single-cell differential expression analysis","volume":"11","author":"Kharchenko","year":"2014","journal-title":"Nat. Methods"},{"key":"2026032300460881000_btw202-B8","doi-asserted-by":"crossref","first-page":"8977","DOI":"10.1073\/pnas.1201689109","article-title":"Distinct perturbation of the translatome by the antidiabetic drug metformin","volume":"109","author":"Larsson","year":"2012","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2026032300460881000_btw202-B9","doi-asserted-by":"crossref","first-page":"R29.","DOI":"10.1186\/gb-2014-15-2-r29","article-title":"voom: precision weights unlock linear model analysis tools for RNA-seq read counts","volume":"15","author":"Law","year":"2014","journal-title":"Genome Biol"},{"key":"2026032300460881000_btw202-B10","doi-asserted-by":"crossref","first-page":"4288","DOI":"10.1093\/nar\/gks042","article-title":"Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation","volume":"40","author":"McCarthy","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2026032300460881000_btw202-B11","doi-asserted-by":"crossref","first-page":"219","DOI":"10.3892\/ijo.2014.2398","article-title":"Peroxiredoxin proteins protect MCF-7 breast cancer cells from doxorubicin-induced toxicity","volume":"45","author":"McDonald","year":"2014","journal-title":"Int. J. Oncol"},{"key":"2026032300460881000_btw202-B12","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1038\/nbt.2862","article-title":"Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms","volume":"32","author":"Patro","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2026032300460881000_btw202-B13","volume-title":"In All Likelihood: Statistical Modelling and Inference Using Likelihood","author":"Pawitan","year":"2013"},{"key":"2026032300460881000_btw202-B14","doi-asserted-by":"crossref","first-page":"3017","DOI":"10.1093\/bioinformatics\/bti448","article-title":"False discovery rate, sensitivity and sample size for microarray studies","volume":"21","author":"Pawitan","year":"2005","journal-title":"Bioinformatics"},{"key":"2026032300460881000_btw202-B15","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2026032300460881000_btw202-B16","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.ymeth.2013.03.026","article-title":"Stochastic models of transcription: from single molecules to single cells","volume":"62","author":"Sanchez","year":"2013","journal-title":"Methods"},{"key":"2026032300460881000_btw202-B17","doi-asserted-by":"crossref","first-page":"17256","DOI":"10.1073\/pnas.0803850105","article-title":"Analytical distributions for stochastic gene expression","volume":"105","author":"Shahrezaei","year":"2008","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2026032300460881000_btw202-B18","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1038\/nature12172","article-title":"Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells","volume":"498","author":"Shalek","year":"2013","journal-title":"Nature"},{"key":"2026032300460881000_btw202-B19","doi-asserted-by":"crossref","first-page":"511","DOI":"10.1038\/nbt.1621","article-title":"Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation","volume":"28","author":"Trapnell","year":"2010","journal-title":"Nat. Biotechnol"},{"key":"2026032300460881000_btw202-B20","doi-asserted-by":"crossref","first-page":"812","DOI":"10.15252\/msb.20156198","article-title":"Single-cell polyadenylation site mapping reveals 3 isoform choice variability","volume":"11","author":"Velten","year":"2015","journal-title":"Mol. Syst. Biol"},{"key":"2026032300460881000_btw202-B21","doi-asserted-by":"crossref","first-page":"748","DOI":"10.1038\/nbt.2642","article-title":"Single-cell gene expression analysis reveals genetic associations masked in whole-tissue experiments","volume":"31","author":"Wills","year":"2013","journal-title":"Nat. Biotechnol"},{"key":"2026032300460881000_btw202-B22","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1038\/nmeth.2694","article-title":"Quantitative assessment of single-cell RNA-sequencing methods","volume":"11","author":"Wu","year":"2014","journal-title":"Nat. Methods"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/14\/2128\/49019666\/bioinformatics_32_14_2128.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/14\/2128\/49019666\/bioinformatics_32_14_2128.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T04:46:19Z","timestamp":1774241179000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/14\/2128\/2288270"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,4,19]]},"references-count":22,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2016,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw202","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,7,15]]},"published":{"date-parts":[[2016,4,19]]}}}