{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T22:08:32Z","timestamp":1774476512393,"version":"3.50.1"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"22","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,11,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Single-cell DNA sequencing is necessary for examining genetic variation at the cellular level, which remains hidden in bulk sequencing experiments. But because they begin with such small amounts of starting material, the amount of information that is obtained from single-cell sequencing experiment is highly sensitive to the choice of protocol employed and variability in library preparation. In particular, the fraction of the genome represented in single-cell sequencing libraries exhibits extreme variability due to quantitative biases in amplification and loss of genetic material.<\/jats:p>\n               <jats:p>Results: We propose a method to predict the genome coverage of a deep sequencing experiment using information from an initial shallow sequencing experiment mapped to a reference genome. The observed coverage statistics are used in a non-parametric empirical Bayes Poisson model to estimate the gain in coverage from deeper sequencing. This approach allows researchers to know statistical features of deep sequencing experiments without actually sequencing deeply, providing a basis for optimizing and comparing single-cell sequencing protocols or screening libraries.<\/jats:p>\n               <jats:p>Availability and implementation: The method is available as part of the preseq software package. Source code is available at http:\/\/smithlabresearch.org\/preseq .<\/jats:p>\n               <jats:p>Contact: \u00a0andrewds@usc.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary material is available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu540","type":"journal-article","created":{"date-parts":[[2014,8,9]],"date-time":"2014-08-09T01:36:21Z","timestamp":1407548181000},"page":"3159-3165","source":"Crossref","is-referenced-by-count":59,"title":["Modeling genome coverage in single-cell sequencing"],"prefix":"10.1093","volume":"30","author":[{"given":"Timothy","family":"Daley","sequence":"first","affiliation":[{"name":"1 Department of Mathematics and 2 Department of Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA"}]},{"given":"Andrew D.","family":"Smith","sequence":"additional","affiliation":[{"name":"1 Department of Mathematics and 2 Department of Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA"}]}],"member":"286","published-online":{"date-parts":[[2014,8,8]]},"reference":[{"key":"2023012711593709900_btu540-B1","doi-asserted-by":"crossref","first-page":"1843","DOI":"10.1214\/aop\/1176989531","article-title":"Compound poisson approximation for nonnegative random variables via Stein's method","volume":"20","author":"Barbour","year":"1992","journal-title":"Ann. Probab."},{"key":"2023012711593709900_btu540-B2","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1111\/1574-6976.12015","article-title":"The future is now: single-cell genomics of bacteria and archaea","volume":"37","author":"Blainey","year":"2013","journal-title":"FEMS Microbiol. Rev."},{"key":"2023012711593709900_btu540-B3","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1007\/BF00058655","article-title":"Bagging predictors","volume":"24","author":"Breiman","year":"1996","journal-title":"Mach. Learn."},{"key":"2023012711593709900_btu540-B4","first-page":"679","article-title":"Transformation to normality of the null distribution of G1","volume":"57","author":"D\u2019Agostino","year":"1970","journal-title":"Biometrika"},{"key":"2023012711593709900_btu540-B5","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1038\/nmeth.2375","article-title":"Predicting the molecular complexity of sequencing libraries","volume":"10","author":"Daley","year":"2013","journal-title":"Nat. Methods"},{"key":"2023012711593709900_btu540-B6","doi-asserted-by":"crossref","first-page":"e30377","DOI":"10.1371\/journal.pone.0030377","article-title":"Fast computation and applications of genome mappability","volume":"7","author":"Derrien","year":"2012","journal-title":"PLoS One"},{"key":"2023012711593709900_btu540-B7","first-page":"435","article-title":"Estimating the number of unseen species: how many words did Shakespeare know?","volume":"63","author":"Efron","year":"1976","journal-title":"Biometrika"},{"key":"2023012711593709900_btu540-B8","doi-asserted-by":"crossref","first-page":"1292","DOI":"10.1093\/molbev\/msu074","article-title":"Ancient whole genome enrichment using baits built from modern DNA","volume":"31","author":"Enk","year":"2014","journal-title":"Mol. Biol. Evol."},{"key":"2023012711593709900_btu540-B9","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1016\/j.cell.2012.09.035","article-title":"Single-neuron sequencing analysis of L1 retrotransposition and somatic mutation in the human brain","volume":"151","author":"Evrony","year":"2012","journal-title":"Cell"},{"key":"2023012711593709900_btu540-B10","doi-asserted-by":"crossref","first-page":"e105","DOI":"10.1093\/nar\/gkp526","article-title":"Identification of small gains and losses in single cells after whole genome amplification on tiling oligo arrays","volume":"37","author":"Geigl","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012711593709900_btu540-B11","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1111\/j.1399-0004.2009.01273.x","article-title":"Preimplantation genetic diagnosis","volume":"76","author":"Geraedts","year":"2009","journal-title":"Clin. Genet."},{"key":"2023012711593709900_btu540-B12","doi-asserted-by":"crossref","first-page":"1126","DOI":"10.1038\/nbt.2720","article-title":"Massively parallel polymerase cloning and genome sequencing of single cells using nanoliter microwells","volume":"31","author":"Gole","year":"2013","journal-title":"Nat. Biotechnol."},{"key":"2023012711593709900_btu540-B13","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1093\/biomet\/43.1-2.45","article-title":"The number of new species, and the increase in population coverage, when a sample is increased","volume":"43","author":"Good","year":"1956","journal-title":"Biometrika"},{"key":"2023012711593709900_btu540-B14","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1101\/gr.147686.112","article-title":"Single molecule molecular inversion probes for targeted, high-accuracy detection of low-frequency variation","volume":"23","author":"Hiatt","year":"2013","journal-title":"Genome Res."},{"key":"2023012711593709900_btu540-B15","doi-asserted-by":"crossref","first-page":"954","DOI":"10.1101\/gr.816903","article-title":"Unbiased whole-genome amplification directly from clinical samples","volume":"13","author":"Hosono","year":"2003","journal-title":"Genome Res."},{"key":"2023012711593709900_btu540-B16","doi-asserted-by":"crossref","first-page":"1492","DOI":"10.1016\/j.cell.2013.11.040","article-title":"Genome analyses of single human oocytes","volume":"155","author":"Hou","year":"2013","journal-title":"Cell"},{"key":"2023012711593709900_btu540-B17","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1126\/science.1248575","article-title":"Single-cell genomics reveals hundreds of coexisting subpopulations in wild Prochlorococcus","volume":"344","author":"Kashtan","year":"2014","journal-title":"Science"},{"key":"2023012711593709900_btu540-B18","doi-asserted-by":"crossref","first-page":"826","DOI":"10.1101\/gr.144600.112","article-title":"Sequencing of isolated sperm cells for direct haplotyping of a human genome","volume":"23","author":"Kirkness","year":"2013","journal-title":"Genome Res."},{"key":"2023012711593709900_btu540-B39","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1038\/nmeth.1778","article-title":"Counting absolute numbers of molecules using unique molecular identifiers","volume":"9","author":"Kivioja","year":"2012","journal-title":"Nature methods"},{"key":"2023012711593709900_btu540-B19","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/0888-7543(88)90007-9","article-title":"Genomic mapping by fingerprinting random clones: a mathematical analysis","volume":"2","author":"Lander","year":"1988","journal-title":"Genomics"},{"key":"2023012711593709900_btu540-B20","doi-asserted-by":"crossref","first-page":"1123","DOI":"10.1111\/j.0006-341X.2003.00129.x","article-title":"Nonidentifiability of population size from capture-recapture data with heterogeneous detection probabilities","volume":"59","author":"Link","year":"2003","journal-title":"Biometrics"},{"key":"2023012711593709900_btu540-B21","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1126\/science.1229112","article-title":"Probing meiotic recombination and aneuploidy of single sperm cells by whole-genome sequencing","volume":"338","author":"Lu","year":"2012","journal-title":"Science"},{"key":"2023012711593709900_btu540-B22","doi-asserted-by":"crossref","first-page":"632","DOI":"10.1126\/science.1243472","article-title":"Mosaic copy number variation in human neurons","volume":"342","author":"McConnell","year":"2013","journal-title":"Science"},{"key":"2023012711593709900_btu540-B23","doi-asserted-by":"crossref","first-page":"3492","DOI":"10.1158\/0008-5472.CAN-11-4037","article-title":"Ultrasensitive measurement of hotspot mutations in tumor DNA in blood using error-suppressed multiplexed deep sequencing","volume":"72","author":"Narayan","year":"2012","journal-title":"Cancer Res."},{"key":"2023012711593709900_btu540-B24","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1038\/nature09807","article-title":"Tumour evolution inferred by single-cell sequencing","volume":"472","author":"Navin","year":"2011","journal-title":"Nature"},{"key":"2023012711593709900_btu540-B25","doi-asserted-by":"crossref","first-page":"21083","DOI":"10.1073\/pnas.1320659110","article-title":"Reproducible copy number variation patterns among single circulating tumor cells of lung cancer patients","volume":"110","author":"Ni","year":"2013","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711593709900_btu540-B26","doi-asserted-by":"crossref","first-page":"1107","DOI":"10.1101\/gr.131482.111","article-title":"Single-cell sequencing provides clues about the host interactions of segmented filamentous bacteria (SFB)","volume":"22","author":"Pamp","year":"2012","journal-title":"Genome Res."},{"key":"2023012711593709900_btu540-B27","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1101\/gr.124016.111","article-title":"Preparation of high-quality next-generation sequencing libraries from picogram quantities of target DNA","volume":"22","author":"Parkinson","year":"2012","journal-title":"Genome Res."},{"key":"2023012711593709900_btu540-B28","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1186\/1471-2164-7-216","article-title":"Assessment of whole genome amplification-induced bias through high-throughput, massively parallel whole genome sequencing","volume":"7","author":"Pinard","year":"2006","journal-title":"BMC Genomics"},{"key":"2023012711593709900_btu540-B29","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1038\/nature12886","article-title":"The complete genome sequence of a Neanderthal from the Altai Mountains","volume":"505","author":"Pr\u00fcfer","year":"2014","journal-title":"Nature"},{"key":"2023012711593709900_btu540-B30","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1093\/bioinformatics\/btq033","article-title":"BEDTools: a flexible suite of utilities for comparing genomic features","volume":"26","author":"Quinlan","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012711593709900_btu540-B31","doi-asserted-by":"crossref","first-page":"1633","DOI":"10.1016\/S0140-6736(04)16209-0","article-title":"Preimplantation genetic diagnosis","volume":"363","author":"Sermon","year":"2004","journal-title":"Lancet"},{"key":"2023012711593709900_btu540-B32","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1038\/nrg3542","article-title":"Single-cell sequencing-based technologies will revolutionize whole-organism science","volume":"14","author":"Shapiro","year":"2013","journal-title":"Nat. Rev. Genet."},{"key":"2023012711593709900_btu540-B33","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1038\/nrg3642","article-title":"Sequencing depth and coverage: key considerations in genomic analyses","volume":"15","author":"Sims","year":"2014","journal-title":"Nat. Rev. Genet."},{"key":"2023012711593709900_btu540-B40","doi-asserted-by":"crossref","first-page":"3034","DOI":"10.1093\/nar\/23.15.3034","article-title":"Whole genome amplification of single cells: mathematical analysis of PEP and tagged PCR","volume":"23","author":"Sun","year":"1995","journal-title":"Nucleic acids research"},{"key":"2023012711593709900_btu540-B34","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1016\/j.cell.2012.06.030","article-title":"Genome-wide single-cell analysis of recombination activity and de novo mutation rates in human sperm","volume":"150","author":"Wang","year":"2012","journal-title":"Cell"},{"key":"2023012711593709900_btu540-B35","doi-asserted-by":"crossref","first-page":"942","DOI":"10.1198\/016214504000002005","article-title":"A penalized nonparametric maximum likelihood approach to species richness estimation","volume":"100","author":"Wang","year":"2005","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012711593709900_btu540-B36","doi-asserted-by":"crossref","first-page":"886","DOI":"10.1016\/j.cell.2012.02.025","article-title":"Single-cell exome sequencing reveals single-nucleotide mutation characteristics of a kidney tumor","volume":"148","author":"Xu","year":"2012","journal-title":"Cell"},{"key":"2023012711593709900_btu540-B37","doi-asserted-by":"crossref","first-page":"680","DOI":"10.1038\/nbt1214","article-title":"Sequencing genomes from single cells by polymerase cloning","volume":"24","author":"Zhang","year":"2006","journal-title":"Nat. Biotechnol."},{"key":"2023012711593709900_btu540-B38","doi-asserted-by":"crossref","first-page":"1622","DOI":"10.1126\/science.1229164","article-title":"Genome-wide detection of single-nucleotide and copy-number variations of a single human cell","volume":"338","author":"Zong","year":"2012","journal-title":"Science"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/22\/3159\/48931016\/bioinformatics_30_22_3159.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/22\/3159\/48931016\/bioinformatics_30_22_3159.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T12:54:59Z","timestamp":1674824099000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/22\/3159\/2391301"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,8,8]]},"references-count":40,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2014,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu540","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2014,11,15]]},"published":{"date-parts":[[2014,8,8]]}}}