{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,4]],"date-time":"2024-08-04T02:29:25Z","timestamp":1722738565162},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"22","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,11,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Variant calling from genome-wide sequencing data is essential for the analysis of disease-causing mutations and elucidation of disease mechanisms. However, variant calling in low coverage regions is difficult due to sequence read errors and mapping errors. Hence, variant calling approaches that are robust to low coverage data are demanded.<\/jats:p>\n               <jats:p>Results: We propose a new variant calling approach that considers pedigree information and haplotyping based on sequence reads spanning two or more heterozygous positions termed phase informative reads. In our approach, genotyping and haplotyping by the assignment of each read to a haplotype based on phase informative reads are simultaneously performed. Therefore, positions with low evidence for heterozygosity are rescued by phase informative reads, and such rescued positions contribute to haplotyping in a synergistic way. In addition, pedigree information supports more accurate haplotyping as well as genotyping, especially in low coverage regions. Although heterozygous positions are useful for haplotyping, homozygous positions are not informative and weaken the information from heterozygous positions, as majority of positions are homozygous. Thus, we introduce latent variables that determine zygosity at each position to filter out homozygous positions for haplotyping. In performance evaluation with a parent\u2013offspring trio sequencing data, our approach outperforms existing approaches in accuracy on the agreement with single nucleotide polymorphism array genotyping results. Also, performance analysis considering distance between variants showed that the use of phase informative reads is effective for accurate variant calling, and further performance improvement is expected with longer sequencing data.<\/jats:p>\n               <jats:p>Contact: nagasaki@megabank.tohoku.ac.jp or kojima@megabank.tohoku.ac.jp<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btt503","type":"journal-article","created":{"date-parts":[[2013,9,4]],"date-time":"2013-09-04T00:15:30Z","timestamp":1378253730000},"page":"2835-2843","source":"Crossref","is-referenced-by-count":11,"title":["A statistical variant calling approach from pedigree information and local haplotyping with phase informative reads"],"prefix":"10.1093","volume":"29","author":[{"given":"Kaname","family":"Kojima","sequence":"first","affiliation":[{"name":"Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8573, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Naoki","family":"Nariai","sequence":"additional","affiliation":[{"name":"Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8573, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Takahiro","family":"Mimori","sequence":"additional","affiliation":[{"name":"Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8573, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mamoru","family":"Takahashi","sequence":"additional","affiliation":[{"name":"Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8573, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yumi","family":"Yamaguchi-Kabata","sequence":"additional","affiliation":[{"name":"Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8573, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yukuto","family":"Sato","sequence":"additional","affiliation":[{"name":"Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8573, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Masao","family":"Nagasaki","sequence":"additional","affiliation":[{"name":"Department of Integrative Genomics, Tohoku Medical Megabank Organization, Tohoku University, 2-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8573, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2013,9,3]]},"reference":[{"key":"2023012810474490700_btt503-B1","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1713","article-title":"A family-based probabilistic method for capturing de novo mutations from high-throughput short-read sequencing data","volume":"11","author":"Cartwright","year":"2012","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023012810474490700_btt503-B2","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1101\/gr.142455.112","article-title":"Genotype calling and haplotyping in parent-offspring trios","volume":"23","author":"Chen","year":"2013","journal-title":"Genome Res."},{"key":"2023012810474490700_btt503-B3","doi-asserted-by":"crossref","first-page":"14771","DOI":"10.1073\/pnas.93.25.14771","article-title":"A metric map of humans: 23,500 loci in 850 bands","volume":"93","author":"Collins","year":"1996","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012810474490700_btt503-B4","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1038\/ng.862","article-title":"Variation in genome-wide mutation rates within and between human families","volume":"43","author":"Conrad","year":"2011","journal-title":"Nat. Genet."},{"key":"2023012810474490700_btt503-B5","doi-asserted-by":"crossref","first-page":"2156","DOI":"10.1093\/bioinformatics\/btr330","article-title":"The variant call format and VCFtools","volume":"27","author":"Danecek","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012810474490700_btt503-B6","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.806","article-title":"A framework for variation discovery and genotyping using next-generation DNA sequencing data","volume":"43","author":"DePristo","year":"2011","journal-title":"Nat. Genet."},{"key":"2023012810474490700_btt503-B7","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1007\/978-3-642-29627-7_8","article-title":"Hap-seq: an optimal algorithm for haplotype phasing with imputation using sequencing data","volume":"7262","author":"He","year":"2012","journal-title":"Lect. Notes Comput. Sci."},{"key":"2023012810474490700_btt503-B8","doi-asserted-by":"crossref","first-page":"1157","DOI":"10.1093\/bioinformatics\/btr076","article-title":"Improving SNP discovery by base alignment quality","volume":"27","author":"Li","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012810474490700_btt503-B9","doi-asserted-by":"crossref","first-page":"e1002944","DOI":"10.1371\/journal.pgen.1002944","article-title":"A likelihood-based framework for variant calling and de novo mutation detection in families","volume":"8","author":"Li","year":"2012","journal-title":"PLoS Genet."},{"key":"2023012810474490700_btt503-B10","article-title":"Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM","author":"Li","year":"2013"},{"key":"2023012810474490700_btt503-B11","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short-read alignment with Burrows-Wheeler Transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012810474490700_btt503-B12","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012810474490700_btt503-B13","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1101\/gr.088013.108","article-title":"SNP detection for massively parallel whole-genome resequencing","volume":"19","author":"Li","year":"2009","journal-title":"Genome Res."},{"key":"2023012810474490700_btt503-B14","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data","volume":"20","author":"McKenna","year":"2010","journal-title":"Genome Res."},{"key":"2023012810474490700_btt503-B15","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1093\/bioinformatics\/bts632","article-title":"Genotype calling and phasing using next-generation sequencing reads and a haplotype scaffold","volume":"29","author":"Menelaou","year":"2013","journal-title":"Bioinformatics"},{"key":"2023012810474490700_btt503-B16","first-page":"467","article-title":"Loopy belief propagation for approximate inference: an empirical study","volume-title":"Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence","author":"Murphy","year":"1999"},{"key":"2023012810474490700_btt503-B17","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1186\/1471-2164-13-341","article-title":"A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers","volume":"13","author":"Quail","year":"2012","journal-title":"BMC Genomics"},{"key":"2023012810474490700_btt503-B18","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1038\/nrg3117","article-title":"Repetitive DNA and next-generation sequencing: computational challenges and solutions","volume":"13","author":"Treangen","year":"2012","journal-title":"Nat. Rev. Genet."},{"key":"2023012810474490700_btt503-B19","doi-asserted-by":"crossref","first-page":"736","DOI":"10.1109\/18.910585","article-title":"On the optimality of solutions of the max-product belief-propagation algorithm in arbitrary graphs","volume":"47","author":"Weiss","year":"2001","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2023012810474490700_btt503-B20","doi-asserted-by":"crossref","first-page":"2282","DOI":"10.1109\/TIT.2005.850085","article-title":"Constructing free-energy approximations and generalized belief propagation algorithms","volume":"51","author":"Yedidia","year":"2005","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2023012810474490700_btt503-B21","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1093\/bioinformatics\/bts001","article-title":"SNP calling using genotype model selection on high-throughput sequencing data","volume":"28","author":"You","year":"2012","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/22\/2835\/48899876\/bioinformatics_29_22_2835.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/22\/2835\/48899876\/bioinformatics_29_22_2835.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,28]],"date-time":"2023-01-28T12:44:30Z","timestamp":1674909870000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/22\/2835\/315776"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,9,3]]},"references-count":21,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2013,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btt503","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2013,11,15]]},"published":{"date-parts":[[2013,9,3]]}}}