{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T16:04:23Z","timestamp":1761581063996},"reference-count":17,"publisher":"Oxford University Press (OUP)","issue":"7","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Next-generation sequencing (NGS) technologies have enabled whole-genome discovery and analysis of genetic variants in many species of interest. Individuals are often sequenced at low coverage for detecting novel variants, phasing haplotypes and inferring population structures. Although several tools have been developed for SNP and genotype calling in NGS data, haplotype phasing is often done separately on the called genotypes.<\/jats:p><jats:p>Results: We propose a dynamic Bayesian Markov model (DBM) for simultaneous genotype calling and haplotype phasing in low-coverage NGS data of unrelated individuals. Our method is fully probabilistic that produces consistent inference of genotypes, haplotypes and recombination probabilities. Using data from the 1000 Genomes Project, we demonstrate that DBM not only yields more accurate results than some popular methods, but also provides novel characterization of haplotype structures at the individual level for visualization, interpretation and comparison in downstream analysis. DBM is a powerful and flexible tool that can be applied to many sequencing studies. Its statistical framework can also be extended to accommodate broader scopes of data.<\/jats:p><jats:p>Availability and implementation: \u00a0http:\/\/stat.psu.edu\/\u223cyuzhang\/software\/dbm.tar<\/jats:p><jats:p>Contact: \u00a0yuzhang@stat.psu.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt065","type":"journal-article","created":{"date-parts":[[2013,2,14]],"date-time":"2013-02-14T04:21:04Z","timestamp":1360815664000},"page":"878-885","source":"Crossref","is-referenced-by-count":9,"title":["A dynamic Bayesian Markov model for phasing and characterizing haplotypes in next-generation sequencing"],"prefix":"10.1093","volume":"29","author":[{"given":"Yu","family":"Zhang","sequence":"first","affiliation":[{"name":"Department of Statistics, The Pennsylvania State University, 325 Thomas, University Park, PA 16802, USA"}]}],"member":"286","published-online":{"date-parts":[[2013,2,25]]},"reference":[{"key":"2023020303310721900_btt065-B1","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1093\/bioinformatics\/bth457","article-title":"Haploview: analysis and visualization of LD and haplotype maps","volume":"21","author":"Barrett","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020303310721900_btt065-B2","doi-asserted-by":"crossref","first-page":"577","DOI":"10.7551\/mitpress\/1120.003.0079","article-title":"The infinite hidden Markov model","volume-title":"Advances in Neural Information Processing Systems 14","author":"Beal","year":"2002"},{"key":"2023020303310721900_btt065-B3","doi-asserted-by":"crossref","first-page":"1084","DOI":"10.1086\/521987","article-title":"Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering","volume":"81","author":"Browning","year":"2007","journal-title":"Am. J. Hum. Genet."},{"key":"2023020303310721900_btt065-B4","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1038\/nrg3054","article-title":"Haplotype phasing: existing methods and new developments","volume":"12","author":"Browning","year":"2011","journal-title":"Nat. Rev. Genet."},{"key":"2023020303310721900_btt065-B5","doi-asserted-by":"crossref","first-page":"1042","DOI":"10.1198\/jasa.2009.tm08439","article-title":"Nonparametric Bayes modeling of multivariate categorical data","volume":"104","author":"Dunson","year":"2009","journal-title":"J. Am. Stat. Assoc."},{"key":"2023020303310721900_btt065-B6","doi-asserted-by":"crossref","first-page":"S3","DOI":"10.1186\/1471-2105-13-S6-S3","article-title":"Haplotype reconstruction using perfect phylogeny and sequence data","volume":"13","author":"Efros","year":"2012","journal-title":"BMC Bioinformatics"},{"key":"2023020303310721900_btt065-B8","doi-asserted-by":"crossref","first-page":"e1000529","DOI":"10.1371\/journal.pgen.1000529","article-title":"A flexible and accurate genotype imputation method for the next generation of genome-wide association studies","volume":"5","author":"Howie","year":"2009","journal-title":"Plos Genet."},{"key":"2023020303310721900_btt065-B9","volume-title":"Monte Carlo Strategies in Scientific Computing","author":"Liu","year":"2001"},{"key":"2023020303310721900_btt065-B10","doi-asserted-by":"crossref","first-page":"2213","DOI":"10.1093\/genetics\/165.4.2213","article-title":"Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data","volume":"165","author":"Li","year":"2003","journal-title":"Genetics"},{"key":"2023020303310721900_btt065-B11","doi-asserted-by":"crossref","first-page":"816","DOI":"10.1002\/gepi.20533","article-title":"MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes","volume":"34","author":"Li","year":"2009","journal-title":"Genet. Epi."},{"key":"2023020303310721900_btt065-B12","doi-asserted-by":"crossref","first-page":"940","DOI":"10.1101\/gr.117259.110","article-title":"Low-coverage sequencing: implications of design of complex trait association studies","volume":"21","author":"Li","year":"2011","journal-title":"Genome Res."},{"key":"2023020303310721900_btt065-B14","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1038\/nrg2986","article-title":"Genotype and SNP calling from next-generation sequencing data","volume":"12","author":"Nielsen","year":"2011","journal-title":"Nat. Rev. Genet."},{"key":"2023020303310721900_btt065-B15","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1086\/502802","article-title":"A fast and flexible statistical model for large scale population genotype data: applications to inferring missing genotypes and haplotypic phase","volume":"78","author":"Scheet","year":"2006","journal-title":"Am. J. Hum. Genet."},{"key":"2023020303310721900_btt065-B16","first-page":"639","article-title":"A Constructive Definition of Dirichlet Priors","volume":"4","author":"Sethuraman","year":"1994","journal-title":"Statistica Sinica"},{"key":"2023020303310721900_btt065-B17","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature09534","article-title":"A map of human genome variation from population-scale sequencing","volume":"467","author":"The 1000 Genomes Project Consortium","year":"2010","journal-title":"Nature"},{"key":"2023020303310721900_btt065-B18","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1016\/j.ajhg.2012.06.013","article-title":"Phasing of many thousands of genotyped samples","volume":"91","author":"Williams","year":"2012","journal-title":"Am. J. Hum. Genet."},{"key":"2023020303310721900_btt065-B19","doi-asserted-by":"crossref","first-page":"938","DOI":"10.1093\/bioinformatics\/bts047","article-title":"Genotype calling from next-generation sequencing data using haplotype information of reads","volume":"28","author":"Zhi","year":"2012","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/7\/878\/49060920\/bioinformatics_29_7_878.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/7\/878\/49060920\/bioinformatics_29_7_878.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,6]],"date-time":"2024-05-06T01:39:48Z","timestamp":1714959588000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/7\/878\/253555"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,2,25]]},"references-count":17,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2013,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt065","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2013,4,1]]},"published":{"date-parts":[[2013,2,25]]}}}