{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T23:30:09Z","timestamp":1775259009417,"version":"3.50.1"},"reference-count":50,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2019,8,1]],"date-time":"2019-08-01T00:00:00Z","timestamp":1564617600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["U01 HG007900"],"award-info":[{"award-number":["U01 HG007900"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01 DK097534"],"award-info":[{"award-number":["R01 DK097534"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["NIH R01 DK099820"],"award-info":[{"award-number":["NIH R01 DK099820"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["1UM HG009428"],"award-info":[{"award-number":["1UM HG009428"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["NIH F32 1F32DK115188"],"award-info":[{"award-number":["NIH F32 1F32DK115188"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["FP059028-01-PR"],"award-info":[{"award-number":["FP059028-01-PR"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100006510","name":"Duke University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100006510","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Biostatistics and Bioinformatics Department"},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>High-throughput reporter assays dramatically improve our ability to assign function to noncoding genetic variants, by measuring allelic effects on gene expression in the controlled setting of a reporter gene. Unlike genetic association tests, such assays are not confounded by linkage disequilibrium when loci are independently assayed. These methods can thus improve the identification of causal disease mutations. While work continues on improving experimental aspects of these assays, less effort has gone into developing methods for assessing the statistical significance of assay results, particularly in the case of rare variants captured from patient DNA.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We describe a Bayesian hierarchical model, called Bayesian Inference of Regulatory Differences, which integrates prior information and explicitly accounts for variability between experimental replicates. The model produces substantially more accurate predictions than existing methods when allele frequencies are low, which is of clear advantage in the search for disease-causing variants in DNA captured from patient cohorts. Using the model, we demonstrate a clear tradeoff between variant sequencing coverage and numbers of biological replicates, and we show that the use of additional biological replicates decreases variance in estimates of effect size, due to the properties of the Poisson-binomial distribution. We also provide a power and sample size calculator, which facilitates decision making in experimental design parameters.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>The software is freely available from www.geneprediction.org\/bird. The experimental design web tool can be accessed at http:\/\/67.159.92.22:8080<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz545","type":"journal-article","created":{"date-parts":[[2019,7,24]],"date-time":"2019-07-24T19:17:27Z","timestamp":1563995847000},"page":"331-338","source":"Crossref","is-referenced-by-count":6,"title":["Bayesian estimation of genetic regulatory effects in high-throughput reporter assays"],"prefix":"10.1093","volume":"36","author":[{"given":"William H","family":"Majoros","sequence":"first","affiliation":[{"name":"Duke Center for Statistical Genetics and Genomics , Duke University"},{"name":"Division of Integrative Genomics , Department of Biostatistics and Bioinformatics, Duke University Medical School"},{"name":"Center for Genomic and Computational Biology , Duke University Medical School"}]},{"given":"Young-Sook","family":"Kim","sequence":"additional","affiliation":[{"name":"Center for Genomic and Computational Biology , Duke University Medical School"},{"name":"Program in Computational Biology & Bioinformatics , Duke University, Durham, NC 27710"}]},{"given":"Alejandro","family":"Barrera","sequence":"additional","affiliation":[{"name":"Center for Genomic and Computational Biology , Duke University Medical School"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6183-1893","authenticated-orcid":false,"given":"Fan","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Yale University , New Haven, CT 06520"}]},{"given":"Xingyan","family":"Wang","sequence":"additional","affiliation":[{"name":"Masters Program in Biostatistics, Department of Biostatistics and Bioinformatics , Duke University Medical School, Durham, NC 27710"}]},{"given":"Sarah J","family":"Cunningham","sequence":"additional","affiliation":[{"name":"University Program in Genetics and Genomics , Duke University"}]},{"given":"Graham D","family":"Johnson","sequence":"additional","affiliation":[{"name":"Center for Genomic and Computational Biology , Duke University Medical School"},{"name":"Department of Biostatistics and Bioinformatics , Duke University, Durham, NC 27710"}]},{"given":"Cong","family":"Guo","sequence":"additional","affiliation":[{"name":"University Program in Genetics and Genomics , Duke University"}]},{"given":"William L","family":"Lowe","sequence":"additional","affiliation":[{"name":"Division of Endocrinology Metabolism and Molecular Medicine , Northwestern University Feinberg School of Medicine, Chicago"}]},{"given":"Denise M","family":"Scholtens","sequence":"additional","affiliation":[{"name":"Division of Biostatistics, Department of Preventive Medicine , Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA"}]},{"given":"M Geoffrey","family":"Hayes","sequence":"additional","affiliation":[{"name":"Division of Endocrinology Metabolism and Molecular Medicine , Northwestern University Feinberg School of Medicine, Chicago"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7629-061X","authenticated-orcid":false,"given":"Timothy E","family":"Reddy","sequence":"additional","affiliation":[{"name":"Duke Center for Statistical Genetics and Genomics , Duke University"},{"name":"Division of Integrative Genomics , Department of Biostatistics and Bioinformatics, Duke University Medical School"},{"name":"Center for Genomic and Computational Biology , Duke University Medical School"}]},{"given":"Andrew S","family":"Allen","sequence":"additional","affiliation":[{"name":"Duke Center for Statistical Genetics and Genomics , Duke University"},{"name":"Division of Integrative Genomics , Department of Biostatistics and Bioinformatics, Duke University Medical School"},{"name":"Center for Genomic and Computational Biology , Duke University Medical School"}]}],"member":"286","published-online":{"date-parts":[[2019,8,1]]},"reference":[{"key":"2023013112082245800_btz545-B1","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1038\/nmeth0410-248","article-title":"A method and server for predicting damaging missense mutations","volume":"7","author":"Adzhubei","year":"2010","journal-title":"Nat. Methods"},{"key":"2023013112082245800_btz545-B2","doi-asserted-by":"crossref","first-page":"1074","DOI":"10.1126\/science.1232542","article-title":"Genome-wide quantitative enhancer activity maps identified by STARR-seq","volume":"339","author":"Arnold","year":"2013","journal-title":"Science"},{"key":"2023013112082245800_btz545-B4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v076.i01","article-title":"Stan: a probabilistic programming language","volume":"76","author":"Carpenter","year":"2017","journal-title":"J. Stat. Softw"},{"key":"2023013112082245800_btz545-B5","doi-asserted-by":"crossref","first-page":"195.","DOI":"10.1186\/s13059-015-0762-6","article-title":"Tools and best practices for data processing in allelic expression analysis","volume":"16","author":"Castel","year":"2015","journal-title":"Genome Biol"},{"key":"2023013112082245800_btz545-B6","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1016\/j.cell.2016.09.005","article-title":"Enhancer variants synergistically drive dysfunction of a gene regulatory network in hirschsprung disease","volume":"167","author":"Chatterjee","year":"2016","journal-title":"Cell"},{"key":"2023013112082245800_btz545-B7","doi-asserted-by":"crossref","first-page":"11101.","DOI":"10.1038\/ncomms11101","article-title":"A uniform survey of allele-specific binding and expression over 1000-genomes-project individuals","volume":"7","author":"Chen","year":"2016","journal-title":"Nat. Commun"},{"key":"2023013112082245800_btz545-B8","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/0370-2693(87)91197-X","article-title":"Hybrid Monte Carlo","volume":"195","author":"Duane","year":"1987","journal-title":"Phys. Lett. B"},{"key":"2023013112082245800_btz545-B9","doi-asserted-by":"crossref","first-page":"1383","DOI":"10.1101\/gr.133702.111","article-title":"Human genomic disease variants: a neutral evolutionary explanation","volume":"22","author":"Dudley","year":"2012","journal-title":"Genome Res"},{"key":"2023013112082245800_btz545-B10","author":"Durmus","year":"2017"},{"key":"2023013112082245800_btz545-B11","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1038\/nature13835","article-title":"Genetic and epigenetic fine-mapping of causal autoimmune disease variants","volume":"518","author":"Farh","year":"2015","journal-title":"Nature"},{"key":"2023013112082245800_btz545-B12","doi-asserted-by":"crossref","first-page":"430","DOI":"10.1038\/ng.567","article-title":"Variants in ADCY5 and near CCNL1 are associated with fetal growth and birth weight","volume":"42","author":"Freathy","year":"2010","journal-title":"Nat. Genet"},{"key":"2023013112082245800_btz545-B13","doi-asserted-by":"crossref","first-page":"394.","DOI":"10.1186\/s12864-017-3785-4","article-title":"Transversions have larger regulatory effects than transitions","volume":"18","author":"Guo","year":"2017","journal-title":"BMC Genomics"},{"key":"2023013112082245800_btz545-B14","volume-title":"Principles of Population Genetics","author":"Hartl","year":"1997"},{"key":"2023013112082245800_btz545-B15","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1093\/biomet\/57.1.97","article-title":"Monte Carlo sampling methods using Markov chains and their applications","volume":"57","author":"Hastings","year":"1970","journal-title":"Biometrika"},{"key":"2023013112082245800_btz545-B16","doi-asserted-by":"crossref","first-page":"1369","DOI":"10.1016\/j.cell.2016.09.037","article-title":"Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters","volume":"167","author":"Javierre","year":"2016","journal-title":"Cell"},{"key":"2023013112082245800_btz545-B17","doi-asserted-by":"crossref","first-page":"1701","DOI":"10.1101\/gr.237354.118","article-title":"High-throughput characterization of genetic effects on DNA-protein binding and gene transcription","volume":"28","author":"Kalita","year":"2018","journal-title":"Genome Res"},{"key":"2023013112082245800_btz545-B18","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1093\/bioinformatics\/btx598","article-title":"QuASAR-MPRA: accurate allele-specific analysis for massively parallel reporter assays","volume":"34","author":"Kalita","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013112082245800_btz545-B19","volume-title":"Probabilistic Graphical Models: Principles and Techniques","author":"Koller","year":"2009"},{"key":"2023013112082245800_btz545-B20","doi-asserted-by":"crossref","first-page":"1073","DOI":"10.1038\/nprot.2009.86","article-title":"Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm","volume":"4","author":"Kumar","year":"2009","journal-title":"Nat. Protoc"},{"key":"2023013112082245800_btz545-B21","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1038\/ng.3467","article-title":"Fine-mapping cellular QTLs with RASQUAL and ATAC-seq","volume":"48","author":"Kumasaka","year":"2016","journal-title":"Nat. Genet"},{"key":"2023013112082245800_btz545-B22","doi-asserted-by":"crossref","first-page":"19498","DOI":"10.1073\/pnas.1210678109","article-title":"Complex effects of nucleotide variants in a mammalian cis-regulatory element","volume":"109","author":"Kwasnieski","year":"2012","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013112082245800_btz545-B23","doi-asserted-by":"crossref","first-page":"194.","DOI":"10.1186\/s13059-017-1322-z","article-title":"Systematic identification of regulatory variants associated with cancer risk","volume":"18","author":"Liu","year":"2017","journal-title":"Genome Biol"},{"key":"2023013112082245800_btz545-B24","doi-asserted-by":"crossref","first-page":"1432","DOI":"10.1101\/gr.190603.115","article-title":"Genomic approaches for understanding the genetics of complex disease","volume":"25","author":"Lowe","year":"2015","journal-title":"Genome Res"},{"key":"2023013112082245800_btz545-B25","doi-asserted-by":"crossref","first-page":"3049","DOI":"10.1002\/sim.3680","article-title":"The BUGS project: evolution, critique and future directions","volume":"28","author":"Lunn","year":"2009","journal-title":"Stat. Med"},{"key":"2023013112082245800_btz545-B26","doi-asserted-by":"crossref","first-page":"1437","DOI":"10.1093\/bioinformatics\/btw799","article-title":"High-throughput interpretation of gene structure changes in human and nonhuman resequencing data, using ACE","volume":"33","author":"Majoros","year":"2017","journal-title":"Bioinformatics"},{"key":"2023013112082245800_btz545-B27","doi-asserted-by":"crossref","first-page":"3616","DOI":"10.1093\/bioinformatics\/bty324","article-title":"Predicting gene structure changes resulting from genetic variants via exon definition features","volume":"34","author":"Majoros","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013112082245800_btz545-B28","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1016\/j.tig.2012.06.008","article-title":"Relating human genetic variation to variation in drug responses","volume":"28","author":"Madian","year":"2012","journal-title":"Trends Genet"},{"key":"2023013112082245800_btz545-B29","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1038\/nbt.2137","article-title":"Systematic dissection and optimization of inducible enhancers in human cells using a massively parallel reporter assay","volume":"30","author":"Melnikov","year":"2012","journal-title":"Nat. Biotechnol"},{"key":"2023013112082245800_btz545-B30","volume-title":"Machine Learning: A Probabilistic Perspective","author":"Murphy","year":"2012"},{"key":"2023013112082245800_btz545-B31","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1038\/nmeth.2885","article-title":"FIREWACh: high-throughput functional detection of transcriptional regulatory modules in mammalian cells","volume":"11","author":"Murtha","year":"2014","journal-title":"Nat. Methods"},{"key":"2023013112082245800_btz545-B32","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1038\/nbt.2136","article-title":"Massively parallel functional dissection of mammalian enhancers in vivo","volume":"30","author":"Patwardhan","year":"2012","journal-title":"Nat. Biotechnol"},{"key":"2023013112082245800_btz545-B33","author":"Plummer","year":"2003"},{"key":"2023013112082245800_btz545-B34","volume-title":"R\u00e9cherches sur la probabilit\u00e9 des jugements","author":"Poisson","year":"1837"},{"key":"2023013112082245800_btz545-B35","first-page":"20151684.","article-title":"Progress and promise in understanding the genetic basis of common diseases","volume":"282","author":"Price","year":"2015","journal-title":"Proc. Biol. Sci"},{"key":"2023013112082245800_btz545-B37","doi-asserted-by":"crossref","first-page":"1661","DOI":"10.1161\/CIRCULATIONAHA.109.914820","article-title":"Pharmacogenomics: the genetics of variable drug responses","volume":"123","author":"Roden","year":"2011","journal-title":"Circulation"},{"key":"2023013112082245800_btz545-B38","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1038\/nbt.2205","article-title":"Inferring gene regulatory logic from high-throughput measurements of thousands of systematically designed promoters","volume":"30","author":"Sharon","year":"2012","journal-title":"Nat. Biotechnol"},{"key":"2023013112082245800_btz545-B39","doi-asserted-by":"crossref","first-page":"1728","DOI":"10.1101\/gr.119784.110","article-title":"A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data","volume":"21","author":"Skelly","year":"2011","journal-title":"Genome Res"},{"key":"2023013112082245800_btz545-B40","doi-asserted-by":"crossref","first-page":"R111","DOI":"10.1093\/hmg\/ddv260","article-title":"Strategies for fine-mapping complex traits","volume":"24","author":"Spain","year":"2015","journal-title":"Hum. Mol. Genet"},{"key":"2023013112082245800_btz545-B41","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1016\/j.ajhg.2017.07.014","article-title":"Variant interpretation: functional assays to the rescue","volume":"101","author":"Starita","year":"2017","journal-title":"Am. J. Hum. Genet"},{"key":"2023013112082245800_btz545-B42","doi-asserted-by":"crossref","first-page":"1519","DOI":"10.1016\/j.cell.2016.04.027","article-title":"Direct identification of hundreds of expression-modulating variants using a multiplexed reporter assay","volume":"165","author":"Tewhey","year":"2016","journal-title":"Cell"},{"key":"2023013112082245800_btz545-B43","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","year":"2015","journal-title":"Nature"},{"key":"2023013112082245800_btz545-B44","author":"Tran","year":"2016"},{"key":"2023013112082245800_btz545-B45","doi-asserted-by":"crossref","first-page":"1530","DOI":"10.1016\/j.cell.2016.04.048","article-title":"Systematic functional dissection of common genetic variation affecting red blood cell traits","volume":"165","author":"Ulirsch","year":"2016","journal-title":"Cell"},{"key":"2023013112082245800_btz545-B46","doi-asserted-by":"crossref","first-page":"3583","DOI":"10.1093\/hmg\/ddt168","article-title":"The chromosome 3q25 genomic region is associated with measures of adiposity in newborns in a multi-ethnic genome-wide association study","volume":"22","author":"Urbanek","year":"2013","journal-title":"Hum. Mol. Genet"},{"key":"2023013112082245800_btz545-B47","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nmeth.3582","article-title":"WASP: allele-specific software for robust discovery of molecular quantitative trait loci","volume":"12","author":"van de Geijn","year":"2015","journal-title":"Nat. Methods"},{"key":"2023013112082245800_btz545-B52","doi-asserted-by":"crossref","first-page":"1206","DOI":"10.1101\/gr.190090.115","article-title":"Massively parallel quantification of the regulatory effects of noncoding genetic variation in a human cohort","volume":"25","author":"Vockley","year":"2015","journal-title":"Genome Res."},{"key":"2023013112082245800_btz545-B48","doi-asserted-by":"crossref","first-page":"11952","DOI":"10.1073\/pnas.1307449110","article-title":"Massively parallel in vivo enhancer assay reveals that highly local features determine the cis-regulatory function of ChIP-seq peaks","volume":"110","author":"White","year":"2013","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023013112082245800_btz545-B49","doi-asserted-by":"crossref","first-page":"R102","DOI":"10.1093\/hmg\/ddv259","article-title":"Non-coding genetic variants in human disease","volume":"24","author":"Zhang","year":"2015","journal-title":"Hum. Mol. Genet"},{"key":"2023013112082245800_btz545-B50","doi-asserted-by":"crossref","first-page":"2022.","DOI":"10.1038\/s41467-018-04451-x","article-title":"High-throughput screening of prostate cancer risk loci by single nucleotide polymorphisms sequencing","volume":"9","author":"Zhang","year":"2018","journal-title":"Nat. Commun"},{"key":"2023013112082245800_btz545-B51","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1177\/1535370217713750","article-title":"Challenges and progress in interpretation of non-coding genetic variants associated with human disease","volume":"242","author":"Zhu","year":"2017","journal-title":"Exp. Biol. Med"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz545\/29154776\/btz545.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/331\/48991562\/btz545.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/331\/48991562\/btz545.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,18]],"date-time":"2023-09-18T14:21:11Z","timestamp":1695046871000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/2\/331\/5542384"}},"subtitle":[],"editor":[{"given":"Bonnie","family":"Berger","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,8,1]]},"references-count":50,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz545","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,1,15]]},"published":{"date-parts":[[2019,8,1]]}}}