{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,3]],"date-time":"2024-06-03T22:33:44Z","timestamp":1717454024185},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"14","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Whole-genome sequencing (WGS) allows direct interrogation of previously undetected uncommon or rare variants, which potentially contribute to the missing heritability of human disease. However, cost of sequencing large numbers of samples limits its application in case\u2013control association studies. Here, we describe theoretical and empirical design considerations for such sequencing studies, aimed at maximizing the power of detecting association under the constraint of study-wide cost.<\/jats:p>\n               <jats:p>Results: We consider two cost regimes. First, assuming cost is proportional to the total amount of base pairs to be sequenced across all samples, which is a practical model for whole-genome sequencing, we explored the tradeoff in terms of study power between increasing the number of subjects and increasing depth coverage. We demonstrate that the optimal power of detecting association is achieved at medium depth coverage under a wide range of realistic conditions for case-only sequencing designs. Second, if cost is fixed per sample, which is approximately the case in exome sequencing, we show that in a simple case+control sequencing study, the optimal design should include cases totaling 1\/e of all subjects.<\/jats:p>\n               <jats:p>Availability: A web tool implementing the methods is available at http:\/\/www.cs.columbia.edu\/~itsik\/OPERA\/.<\/jats:p>\n               <jats:p>Contact: \u00a0yshen@c2b2.columbia.edu; itsik@cs.columbia.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr305","type":"journal-article","created":{"date-parts":[[2011,6,3]],"date-time":"2011-06-03T11:53:17Z","timestamp":1307101997000},"page":"1995-1997","source":"Crossref","is-referenced-by-count":7,"title":["Coverage tradeoffs and power estimation in the design of whole-genome sequencing experiments for detecting association"],"prefix":"10.1093","volume":"27","author":[{"given":"Yufeng","family":"Shen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruijie","family":"Song","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Itsik","family":"Pe'er","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2011,6,2]]},"reference":[{"key":"2023012712450422800_B1","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1126\/science.1156409","article-title":"Genetic mapping in human disease","volume":"322","author":"Altshuler","year":"2008","journal-title":"Science"},{"key":"2023012712450422800_B2","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1038\/nature07517","article-title":"Accurate whole human genome sequencing using reversible terminator chemistry","volume":"456","author":"Bentley","year":"2008","journal-title":"Nature"},{"key":"2023012712450422800_B3","doi-asserted-by":"crossref","first-page":"695","DOI":"10.1038\/ng.f.136","article-title":"Common and rare variants in multifactorial susceptibility to common diseases","author":"Bodmer","year":"2008","journal-title":"Nat. Genet."},{"key":"2023012712450422800_B4","doi-asserted-by":"crossref","first-page":"1264","DOI":"10.1056\/NEJMoa054013","article-title":"Sequence variations in PCSK9, low LDL, and protection against coronary heart disease","volume":"354","author":"Cohen","year":"2006","journal-title":"N. Engl. J. Med."},{"key":"2023012712450422800_B5","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1038\/nrg2809","article-title":"Missing heritability and strategies for finding the underlying causes of complex disease","volume":"11","author":"Eichler","year":"2010","journal-title":"Nat. Rev. Genet."},{"key":"2023012712450422800_B6","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1038\/nrg2554","article-title":"Human genetic variation and its contribution to complex traits","volume":"10","author":"Frazer","year":"2009","journal-title":"Nat. Rev. Genet."},{"key":"2023012712450422800_B7","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1016\/0888-7543(88)90007-9","article-title":"Genomic mapping by fingerprinting random clones: a mathematical analysis","volume":"2","author":"Lander","year":"1998","journal-title":"Genomics"},{"key":"2023012712450422800_B8","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1016\/j.ajhg.2008.06.024","article-title":"Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data","volume":"83","author":"Li","year":"2008","journal-title":"Am. J. Hum. Genet."},{"key":"2023012712450422800_B9","doi-asserted-by":"crossref","first-page":"1181","DOI":"10.1056\/NEJMoa0908094","article-title":"Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy","volume":"362","author":"Lupski","year":"2010","journal-title":"N. Engl. J. Med."},{"key":"2023012712450422800_B10","doi-asserted-by":"crossref","first-page":"e1000384","DOI":"10.1371\/journal.pgen.1000384","article-title":"A groupwise association test for rare mutations using a weighted sum statistic","volume":"5","author":"Madsen","year":"2009","journal-title":"PLoS Genet."},{"key":"2023012712450422800_B11","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1101\/gr.091868.109","article-title":"Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding","volume":"19","author":"Mckernan","year":"2009","journal-title":"Genome Res."},{"key":"2023012712450422800_B12","doi-asserted-by":"crossref","first-page":"e1001322","DOI":"10.1371\/journal.pgen.1001322","article-title":"Testing for an unusual distribution of rare variants","volume":"7","author":"Neale","year":"2011","journal-title":"PLoS Genet."},{"key":"2023012712450422800_B13","doi-asserted-by":"crossref","first-page":"272","DOI":"10.1038\/nature08250","article-title":"Targeted capture and massively parallel sequencing of 12 human exomes","volume":"461","author":"Ng","year":"2009","journal-title":"Nature"},{"key":"2023012712450422800_B14","doi-asserted-by":"crossref","first-page":"832","DOI":"10.1016\/j.ajhg.2010.04.005","article-title":"Pooled association tests for rare variants in exon-resequencing studies","volume":"86","author":"Price","year":"2010","journal-title":"Am. J. Hum. Genet."},{"key":"2023012712450422800_B15","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1086\/321272","article-title":"Are rare variants responsible for susceptibility to complex diseases?","volume":"69","author":"Pritchard","year":"2001","journal-title":"Am. J. Hum. Genet."},{"key":"2023012712450422800_B16","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1038\/nature09115","article-title":"Functionally defective germline variants of sialic acid acetylesterase in autoimmunity","volume":"466","author":"Surolia","year":"2010","journal-title":"Nature"},{"key":"2023012712450422800_B17","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1186\/1471-2164-10-485","article-title":"The theory of discovering rare variants via DNA sequencing","volume":"10","author":"Wendl","year":"2009","journal-title":"BMC Genomics"},{"key":"2023012712450422800_B18","doi-asserted-by":"crossref","first-page":"872","DOI":"10.1038\/nature06884","article-title":"The complete genome of an individual by massively parallel DNA sequencing","volume":"452","author":"Wheeler","year":"2008","journal-title":"Nature"},{"key":"2023012712450422800_B19","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1038\/ng.608","article-title":"Common SNPs explain a large proportion of the heritability for human height","volume":"42","author":"Yang","year":"2010","journal-title":"Nat. Genet."},{"key":"2023012712450422800_B20","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1002\/gepi.20554","article-title":"Bayesian analysis of rare variants in genetic association studies","volume":"35","author":"Yi","year":"2011","journal-title":"Genet. Epidemiol."},{"key":"2023012712450422800_B21","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature09534","article-title":"A map of human genome variation from population-scale sequencing","volume":"467","author":"1000 Genomes Project Consortium et al.","year":"2010","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/14\/1995\/48933134\/bioinformatics_27_14_1995.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/14\/1995\/48933134\/bioinformatics_27_14_1995.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T13:37:21Z","timestamp":1674826641000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/14\/1995\/194535"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,6,2]]},"references-count":21,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2011,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr305","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2011,7]]},"published":{"date-parts":[[2011,6,2]]}}}