{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,4]],"date-time":"2024-08-04T02:23:10Z","timestamp":1722738190184},"reference-count":22,"publisher":"Oxford University Press (OUP)","issue":"21","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The development of next generation sequencing technology provides an efficient and powerful approach to rare variant detection. To identify genetic variations, the essential question is how to quantity the sequencing error rate in the data. Because of the advantage of easy implementation and the ability to integrate data from different sources, the empirical Bayes method is popularly employed to estimate the sequencing error rate for SNP detection.<\/jats:p>\n               <jats:p>Results: We propose a novel statistical model to fit the observed non-reference allele frequency data, and utilize the empirical Bayes method for both genotyping and SNP detection, where an ECM algorithm is implemented to estimate the model parameters. The performance of our proposed method is investigated via simulations and real data analysis. It is shown that our method makes less genotype-call errors, and with the parameter estimates from the ECM algorithm, it attains high detection power with FDR being well controlled.<\/jats:p>\n               <jats:p>Availability and implementation : The proposed algorithm is wrapped in the R package ebGenotyping, which can be downloaded from http:\/\/cran.r-project.org\/web\/packages\/ebGenotyping\/ .<\/jats:p>\n               <jats:p>Contact: \u00a0youn@mail.sysu.edu.cn<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw409","type":"journal-article","created":{"date-parts":[[2016,7,5]],"date-time":"2016-07-05T01:59:34Z","timestamp":1467683974000},"page":"3240-3245","source":"Crossref","is-referenced-by-count":5,"title":["An empirical Bayes method for genotyping and SNP detection using multi-sample next-generation sequencing data"],"prefix":"10.1093","volume":"32","author":[{"given":"Gongyi","family":"Huang","sequence":"first","affiliation":[{"name":"1 School of Mathematics, Sun Yat-sen University, Guangzhou 510275, China"}]},{"given":"Shaoli","family":"Wang","sequence":"additional","affiliation":[{"name":"2 School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai 200433, China"},{"name":"3 Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, China"}]},{"given":"Xueqin","family":"Wang","sequence":"additional","affiliation":[{"name":"1 School of Mathematics, Sun Yat-sen University, Guangzhou 510275, China"},{"name":"4 South China Center for Statistical Science, Sun Yat-sen University, Guangzhou 510275, China"},{"name":"5 Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China"}]},{"given":"Na","family":"You","sequence":"additional","affiliation":[{"name":"1 School of Mathematics, Sun Yat-sen University, Guangzhou 510275, China"},{"name":"4 South China Center for Statistical Science, Sun Yat-sen University, Guangzhou 510275, China"}]}],"member":"286","published-online":{"date-parts":[[2016,7,4]]},"reference":[{"key":"2023020113514343600_btw409-B1","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1089\/cmb.2014.0029","article-title":"Joint variant and de novo mutation identification on pedigrees from high-throughput sequencing data","volume":"21","author":"Cleary","year":"2014","journal-title":"J. Comput. Biol"},{"key":"2023020113514343600_btw409-B2","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.806","article-title":"A framework for variation discovery and genotyping using next-generation DNA sequencing data","volume":"43","author":"DePristo","year":"2011","journal-title":"Nat. Genet"},{"key":"2023020113514343600_btw409-B3","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1198\/016214504000000089","article-title":"Large-scale simultaneous hypothesis testing: The choice of a null hypothesis","volume":"99","author":"Efron","year":"2004","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020113514343600_btw409-B4","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1198\/016214506000001211","article-title":"Correlation and large-scale simultaneous significance testing","volume":"102","author":"Efron","year":"2007","journal-title":"J. Am. Stat. Assoc"},{"key":"2023020113514343600_btw409-B5","doi-asserted-by":"crossref","first-page":"1351","DOI":"10.1214\/009053606000001460","article-title":"Size, power and false discovery rates","volume":"35","author":"Efron","year":"2007","journal-title":"Ann. Stat"},{"key":"2023020113514343600_btw409-B6","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511761362","volume-title":"Large-Scale Inference: Empirical Bayes Methods for Estimation, Testing, and Prediction","author":"Efron","year":"2010","edition":"1st edn"},{"key":"2023020113514343600_btw409-B7","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1093\/bioinformatics\/btq040","article-title":"Snvmix: predicting single nucleotide variants from next-generation sequencing of tumors","volume":"26","author":"Goya","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020113514343600_btw409-B8","doi-asserted-by":"crossref","first-page":"952","DOI":"10.1101\/gr.113084.110","article-title":"SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples","volume":"21","author":"Le","year":"2011","journal-title":"Genome Res"},{"key":"2023020113514343600_btw409-B9","doi-asserted-by":"crossref","first-page":"e1000481","DOI":"10.1371\/journal.pgen.1000481","article-title":"Discovery of rare variants via sequencing: implications for the design of complex trait association studies","volume":"5","author":"Li","year":"2009","journal-title":"PloS Genet"},{"key":"2023020113514343600_btw409-B10","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and samtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020113514343600_btw409-B11","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1101\/gr.088013.108","article-title":"Snp detection for massively parallel whole-genome resequencing","volume":"19","author":"Li","year":"2009","journal-title":"Genome Res"},{"key":"2023020113514343600_btw409-B12","doi-asserted-by":"crossref","first-page":"2803","DOI":"10.1093\/bioinformatics\/btq526","article-title":"Seqem: An adaptive genotype-calling approach for next-generation sequencing studies","volume":"26","author":"Martin","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020113514343600_btw409-B13","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1093\/biomet\/80.2.267","article-title":"Maximum likelihood estimation via the ECM algorithm: a general framework","volume":"80","author":"Meng","year":"1993","journal-title":"Biometrika"},{"key":"2023020113514343600_btw409-B14","doi-asserted-by":"crossref","first-page":"1047","DOI":"10.1214\/12-AOAS538","article-title":"Detecting mutations in mixed sample sequencing data using empirical Bayes","volume":"6","author":"Muralidharan","year":"2012","journal-title":"Ann. Appl. Stat"},{"key":"2023020113514343600_btw409-B15","doi-asserted-by":"crossref","first-page":"e5.","DOI":"10.1093\/nar\/gkr851","article-title":"A cross-sample statistical model for snp detection in short-read sequencing data","volume":"40","author":"Muralidharan","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023020113514343600_btw409-B16","doi-asserted-by":"crossref","first-page":"1486","DOI":"10.1093\/bioinformatics\/btv753","article-title":"Multigems: Detection of snvs from multiple samples using model selection on high-throughput sequencing data","volume":"32","author":"Murillo","year":"2016","journal-title":"Bioinformatics"},{"key":"2023020113514343600_btw409-B17","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1038\/nmeth1156","article-title":"Next-generation sequencing transforms today\u2019s biology","volume":"5","author":"Schuster","year":"2008","journal-title":"Nat. Methods"},{"key":"2023020113514343600_btw409-B18","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1101\/gr.096388.109","article-title":"A SNP discovery method to assess variant allele probability from next-generation resequencing data","volume":"20","author":"Shen","year":"2010","journal-title":"Genome Res"},{"key":"2023020113514343600_btw409-B19","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1101\/gad.1864110","article-title":"Personal genome sequencing: current approaches and challenges","volume":"24","author":"Snyder","year":"2010","journal-title":"Genes Dev"},{"key":"2023020113514343600_btw409-B20","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1093\/bioinformatics\/bts001","article-title":"Snp calling using genotype model selection on high-throughput sequencing data","volume":"28","author":"You","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020113514343600_btw409-B21","doi-asserted-by":"crossref","first-page":"2229","DOI":"10.1214\/13-AOAS660","article-title":"An empirical Bayes testing procedure for detecting variants in analysis of next generation sequencing data","volume":"7","author":"Zhao","year":"2013","journal-title":"Ann. Appl. Stat"},{"key":"2023020113514343600_btw409-B22","doi-asserted-by":"crossref","first-page":"2569","DOI":"10.1093\/bioinformatics\/bts501","article-title":"An empirical Bayes mixture model for snp detection in pooled sequencing data","volume":"28","author":"Zhou","year":"2012","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/21\/3240\/49021832\/bioinformatics_32_21_3240.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/21\/3240\/49021832\/bioinformatics_32_21_3240.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T23:54:35Z","timestamp":1675295675000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/21\/3240\/2415185"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,4]]},"references-count":22,"journal-issue":{"issue":"21","published-print":{"date-parts":[[2016,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw409","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,11,1]]},"published":{"date-parts":[[2016,7,4]]}}}