{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,18]],"date-time":"2025-10-18T10:36:43Z","timestamp":1760783803269},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"15","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Sufficiently powered case\u2013control studies with next-generation sequence (NGS) data remain prohibitively expensive for many investigators. If feasible, a more efficient strategy would be to include publicly available sequenced controls. However, these studies can be confounded by differences in sequencing platform; alignment, single nucleotide polymorphism and variant calling algorithms; read depth; and selection thresholds. Assuming one can match cases and controls on the basis of ethnicity and other potential confounding factors, and one has access to the aligned reads in both groups, we investigate the effect of systematic differences in read depth and selection threshold when comparing allele frequencies between cases and controls. We propose a novel likelihood-based method, the robust variance score (RVS), that substitutes genotype calls by their expected values given observed sequence data.<\/jats:p><jats:p>Results: We show theoretically that the RVS eliminates read depth bias in the estimation of minor allele frequency. We also demonstrate that, using simulated and real NGS data, the RVS method controls Type I error and has comparable power to the \u2018gold standard\u2019 analysis with the true underlying genotypes for both common and rare variants.<\/jats:p><jats:p>Availability and implementation: An RVS R script and instructions can be found at strug.research.sickkids.ca , and at https:\/\/github.com\/strug-lab\/RVS .<\/jats:p><jats:p>Contact: \u00a0lisa.strug@utoronto.ca<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu196","type":"journal-article","created":{"date-parts":[[2014,4,15]],"date-time":"2014-04-15T01:46:51Z","timestamp":1397526411000},"page":"2179-2188","source":"Crossref","is-referenced-by-count":29,"title":["Association analysis using next-generation sequence data from publicly available control groups: the robust variance score statistic"],"prefix":"10.1093","volume":"30","author":[{"given":"Andriy","family":"Derkach","sequence":"first","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Theodore","family":"Chiang","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Jiafen","family":"Gong","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Laura","family":"Addis","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Sara","family":"Dobbins","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Ian","family":"Tomlinson","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Richard","family":"Houlston","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Deb K.","family":"Pal","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]},{"given":"Lisa J.","family":"Strug","sequence":"additional","affiliation":[{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"},{"name":"1 Department of Statistical Science, University of Toronto, Toronto, ON, Canada, 2 Program in Child Health Evaluative Sciences, the Hospital for Sick Children Research Institute, Toronto, ON, Canada, 3 Department of Clinical Neuroscience, Institute of Psychiatry, King's College London, London, 4 Division of Genetics and Epidemiology, Institute of Cancer Research, Sutton, Surrey, 5 Molecular and Population Genetics and NIHR Comprehensive Biomedical Research Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK, 6 Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada"}]}],"member":"286","published-online":{"date-parts":[[2014,4,14]]},"reference":[{"key":"2023012711332925500_btu196-B1","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature11632","article-title":"An integrated map of genetic variation from 1,092 human genomes","volume":"491","author":"Abecasis","year":"2012","journal-title":"Nature"},{"key":"2023012711332925500_btu196-B2","doi-asserted-by":"crossref","first-page":"375","DOI":"10.2307\/3001775","article-title":"Tests for linear trends in proportions and frequencies","volume":"11","author":"Armitage","year":"1955","journal-title":"Biometrics"},{"key":"2023012711332925500_btu196-B3","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1002\/gepi.20609","article-title":"Comparison of statistical tests for disease association with rare variants","volume":"35","author":"Basu","year":"2011","journal-title":"Genet. Epidemiol."},{"key":"2023012711332925500_btu196-B4","doi-asserted-by":"crossref","first-page":"e60","DOI":"10.1093\/nar\/gks024","article-title":"A powerful test for multiple rare variants association studies that incorporates sequencing qualities","volume":"40","author":"Daye","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023012711332925500_btu196-B5","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.806","article-title":"A framework for variation discovery and genotyping using next-generation DNA sequencing data","volume":"43","author":"DePristo","year":"2011","journal-title":"Nat. Genet."},{"key":"2023012711332925500_btu196-B6","article-title":"Pooled association tests for rare genetic variants: a review and some new results","author":"Derkach","year":"2012"},{"key":"2023012711332925500_btu196-B7","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1126\/science.1181498","article-title":"Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays","volume":"327","author":"Drmanac","year":"2010","journal-title":"Science"},{"key":"2023012711332925500_btu196-B8","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1002\/gepi.20574","article-title":"Confounded by sequencing depth in association studies of rare alleles","volume":"35","author":"Garner","year":"2011","journal-title":"Genet. Epidemiol."},{"key":"2023012711332925500_btu196-B9","doi-asserted-by":"crossref","first-page":"1039","DOI":"10.1080\/01621459.1990.10474974","article-title":"Bootstrap test for difference between means in nonparametric regression","volume":"85","author":"Hall","year":"1990","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012711332925500_btu196-B10","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1186\/1471-2105-12-231","article-title":"Estimation of allele frequency and association mapping using next-generation sequencing data","volume":"12","author":"Kim","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012711332925500_btu196-B11","doi-asserted-by":"crossref","first-page":"762","DOI":"10.1093\/biostatistics\/kxs014","article-title":"Optimal tests for rare variant effects in sequencing association studies","volume":"13","author":"Lee","year":"2012","journal-title":"Biostatistics"},{"key":"2023012711332925500_btu196-B12","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The Sequence Alignment\/Map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012711332925500_btu196-B13","doi-asserted-by":"crossref","first-page":"1745","DOI":"10.1093\/bioinformatics\/bts263","article-title":"SEQCHIP: a powerful method to integrate sequence and genotype data for the detection of rare variant associations","volume":"28","author":"Liu","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012711332925500_btu196-B14","doi-asserted-by":"crossref","first-page":"e14318","DOI":"10.1371\/journal.pone.0014318","article-title":"Three ways of combining genotyping and resequencing in case-control association studies","volume":"5","author":"Longmate","year":"2010","journal-title":"PLoS One"},{"key":"2023012711332925500_btu196-B15","doi-asserted-by":"crossref","first-page":"e1000384","DOI":"10.1371\/journal.pgen.1000384","article-title":"A groupwise association test for rare mutations using a weighted sum statistic","volume":"5","author":"Madsen","year":"2009","journal-title":"PLoS Genet."},{"key":"2023012711332925500_btu196-B16","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data","volume":"20","author":"McKenna","year":"2010","journal-title":"Genome. Res."},{"key":"2023012711332925500_btu196-B17","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.mrfmmm.2006.09.003","article-title":"A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST)","volume":"615","author":"Morgenthaler","year":"2007","journal-title":"Mutat. Res."},{"key":"2023012711332925500_btu196-B18","doi-asserted-by":"crossref","first-page":"e1001322","DOI":"10.1371\/journal.pgen.1001322","article-title":"Testing for an unusual distribution of rare variants","volume":"7","author":"Neale","year":"2011","journal-title":"PLoS. Genet."},{"key":"2023012711332925500_btu196-B19","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1038\/nrg2986","article-title":"Genotype and SNP calling from next-generation sequencing data","volume":"12","author":"Nielsen","year":"2011","journal-title":"Nat. Rev. Genet."},{"key":"2023012711332925500_btu196-B20","doi-asserted-by":"crossref","first-page":"e1002198","DOI":"10.1371\/journal.pgen.1002198","article-title":"Fine mapping of five loci associated with low-density lipoprotein cholesterol detects variants that double the explained heritability","volume":"7","author":"Sanna","year":"2011","journal-title":"PLoS Genet."},{"key":"2023012711332925500_btu196-B21","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1126\/science.333.6041.404-a","article-title":"Retraction","volume":"333","author":"Sebastiani","year":"2011","journal-title":"Science"},{"key":"2023012711332925500_btu196-B22","doi-asserted-by":"crossref","first-page":"430","DOI":"10.1002\/gepi.21636","article-title":"Association testing for next-generation sequencing data using score statistics","volume":"36","author":"Skotte","year":"2012","journal-title":"Genet. Epidemiol."},{"key":"2023012711332925500_btu196-B23","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.1038\/ejhg.2008.267","article-title":"Centrotemporal sharp wave EEG trait in rolandic epilepsy maps to Elongator Protein Complex 4 (ELP4)","volume":"17","author":"Strug","year":"2009","journal-title":"Eur. J. Hum. Genet."},{"key":"2023012711332925500_btu196-B24","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1038\/nature05911","article-title":"Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls","volume":"447","author":"The Wellcome Trust Case Control Consortium","year":"2007","journal-title":"Nature"},{"key":"2023012711332925500_btu196-B25","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.ajhg.2011.05.029","article-title":"Rare-variant association testing for sequencing data with the sequence kernel association test","volume":"89","author":"Wu","year":"2011","journal-title":"Am. J. Hum. Genet."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/15\/2179\/48926102\/bioinformatics_30_15_2179.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/15\/2179\/48926102\/bioinformatics_30_15_2179.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,25]],"date-time":"2024-05-25T22:18:05Z","timestamp":1716675485000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/15\/2179\/2391316"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,4,14]]},"references-count":25,"journal-issue":{"issue":"15","published-print":{"date-parts":[[2014,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu196","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2014,8,1]]},"published":{"date-parts":[[2014,4,14]]}}}