{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,26]],"date-time":"2025-09-26T13:12:32Z","timestamp":1758892352346},"reference-count":18,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>The allele frequencies of single-nucleotide polymorphisms (SNPs) are needed to select an optimal subset of common SNPs for use in association studies. Sequence-based methods for finding SNPs with allele frequencies may need to handle thousands of sequences from the same genome location (sequences of deep coverage).<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We describe a computational method for finding common SNPs with allele frequencies in single-pass sequences of deep coverage. The method enhances a widely used program named PolyBayes in several aspects. We present results from our method and PolyBayes on eighteen data sets of human expressed sequence tags (ESTs) with deep coverage. The results indicate that our method used almost all single-pass sequences in computation of the allele frequencies of SNPs.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>The new method is able to handle single-pass sequences of deep coverage efficiently. Our work shows that it is possible to analyze sequences of deep coverage by using pairwise alignments of the sequences with the finished genome sequence, instead of multiple sequence alignments.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/1471-2105-6-220","type":"journal-article","created":{"date-parts":[[2005,9,7]],"date-time":"2005-09-07T18:13:51Z","timestamp":1126116831000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["A method for finding single-nucleotide polymorphisms with allele frequencies in sequences of deep coverage"],"prefix":"10.1186","volume":"6","author":[{"given":"Jianmin","family":"Wang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaoqiu","family":"Huang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2005,9,7]]},"reference":[{"key":"545_CR1","doi-asserted-by":"publisher","first-page":"1072","DOI":"10.1126\/science.1105436","volume":"307","author":"DA Hinds","year":"2005","unstructured":"Hinds DA, Stuve LL, Nilsen GB, Halperin E, Eskin E, Ballinger DG, Frazer KA, Cox DR: Whole-genome patterns of common DNA variation in three human populations. Science 2005, 307: 1072\u20131079. 10.1126\/science.1105436","journal-title":"Science"},{"key":"545_CR2","doi-asserted-by":"publisher","first-page":"1182","DOI":"10.1126\/science.307.5713.1182a","volume":"307","author":"J Kaiser","year":"2005","unstructured":"Kaiser J: NCI gears up for cancer genome project. Science 2005, 307: 1182. 10.1126\/science.307.5713.1182a","journal-title":"Science"},{"key":"545_CR3","doi-asserted-by":"publisher","first-page":"452","DOI":"10.1038\/70570","volume":"23","author":"GT Marth","year":"1999","unstructured":"Marth GT, Korf I, Yandell MD, Yeh RT, Gu Z, Zakeri H, Stitziel NO, Hillier L, Kwok P, Gish WR: A general approach to single-nucleotide polymorphism discovery. Nat Genet 1999, 23: 452\u2013456. 10.1038\/70570","journal-title":"Nat Genet"},{"key":"545_CR4","doi-asserted-by":"publisher","first-page":"928","DOI":"10.1038\/35057149","volume":"409","author":"The International SNP Map Working Group","year":"2001","unstructured":"The International SNP Map Working Group: A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 2001, 409: 928\u2013933. 10.1038\/35057149","journal-title":"Nature"},{"key":"545_CR5","unstructured":"PolyFreq Program[http:\/\/bioinformatics.iastate.edu\/aat\/PolyFreq\/]"},{"key":"545_CR6","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1093\/nar\/gkg033","volume":"31","author":"DL Wheeler","year":"2003","unstructured":"Wheeler DL, Church DM, Federhen S, Lash AE, Madden TL, Pontius JU, Schuler GD, Schriml LM, Sequeira E, Tatusova TA, Wagner L: Database resources of the National Center for Biotechnology. Nucleic Acids Res 2003, 31: 28\u201333. 10.1093\/nar\/gkg033","journal-title":"Nucleic Acids Res"},{"key":"545_CR7","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1101\/gr.8.3.175","volume":"8","author":"B Ewing","year":"1998","unstructured":"Ewing B, Hillier L, Wendl M, Green P: Basecalling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res 1998, 8: 175\u2013185.","journal-title":"Genome Res"},{"key":"545_CR8","doi-asserted-by":"publisher","first-page":"186","DOI":"10.1101\/gr.8.3.186","volume":"8","author":"B Ewing","year":"1998","unstructured":"Ewing B, Green P: Basecalling of automated sequencer traces using phred. II. Error probabilities. Genome Res 1998, 8: 186\u2013194.","journal-title":"Genome Res"},{"key":"545_CR9","doi-asserted-by":"publisher","first-page":"16899","DOI":"10.1073\/pnas.242603899","volume":"26","author":"Mammalian Gene Collection (MGC) Program Team","year":"2002","unstructured":"Mammalian Gene Collection (MGC) Program Team: Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc Natl Acad Sci USA 2002, 26: 16899\u201316903. 10.1073\/pnas.242603899","journal-title":"Proc Natl Acad Sci USA"},{"key":"545_CR10","doi-asserted-by":"publisher","first-page":"656","DOI":"10.1101\/gr.229202. Article published online before March 2002","volume":"12","author":"WJ Kent","year":"2002","unstructured":"Kent WJ: BLAT \u2013 The BLAST-like alignment tool. Genome Res 2002, 12: 656\u2013664. 10.1101\/gr.229202. Article published online before March 2002","journal-title":"Genome Res"},{"key":"545_CR11","volume-title":"RepeatMasker Open-3.0","author":"AFA Smit","year":"1996","unstructured":"Smit AFA, Hubley R, Green P: RepeatMasker Open-3.0.1996. [http:\/\/www.repeatmasker.org]"},{"key":"545_CR12","unstructured":"dbSNP[http:\/\/www.ncbi.nlm.nih.gov\/SNP]"},{"key":"545_CR13","doi-asserted-by":"publisher","first-page":"D501","DOI":"10.1093\/nar\/gki025","volume":"33","author":"KD Pruitt","year":"2005","unstructured":"Pruitt KD, Tatusova T, Maglott DR: NCBI Reference Sequence (RefSeq): A curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 2005, 33: D501-D504. 10.1093\/nar\/gki025","journal-title":"Nucleic Acids Res"},{"key":"545_CR14","doi-asserted-by":"publisher","first-page":"228","DOI":"10.1093\/bioinformatics\/19.2.228","volume":"19","author":"X Huang","year":"2003","unstructured":"Huang X, Chao K-M: A generalized global alignment algorithm. Bioinformatics 2003, 19: 228\u2013233. 10.1093\/bioinformatics\/19.2.228","journal-title":"Bioinformatics"},{"key":"545_CR15","doi-asserted-by":"publisher","first-page":"2529","DOI":"10.1093\/bioinformatics\/bth279","volume":"20","author":"X Huang","year":"2004","unstructured":"Huang X, Ye L, Chou H-H, Yang I-H, Chao K-M: Effcient combination of multiple word models for improved sequence comparison. Bioinformatics 2004, 20: 2529\u20132533. 10.1093\/bioinformatics\/bth279","journal-title":"Bioinformatics"},{"key":"545_CR16","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1006\/geno.1997.4984","volume":"46","author":"X Huang","year":"1997","unstructured":"Huang X, Adams MD, Zhou H, Kerlavage AR: A tool for analyzing and annotating genomic sequences. Genomics 1997, 46: 37\u201345. 10.1006\/geno.1997.4984","journal-title":"Genomics"},{"key":"545_CR17","first-page":"219","volume":"10","author":"X Huang","year":"1994","unstructured":"Huang X: An algorithm for identifying regions of a DNA sequence that satisfy a content requirement. Comput Appl Biosci 1994, 10: 219\u2013225.","journal-title":"Comput Appl Biosci"},{"key":"545_CR18","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1038\/35035083","volume":"407","author":"D Altshuler","year":"2000","unstructured":"Altshuler D, Pollara VJ, Cowles CR, Van Etten WJ, Baldwin J, Linton L, Lander ES: An SNP map of the human genome generated by reduced representation shotgun sequencing. Nature 2000, 407: 513\u2013516. 10.1038\/35035083","journal-title":"Nature"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-220.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,1]],"date-time":"2024-02-01T17:57:25Z","timestamp":1706810245000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-6-220"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,9,7]]},"references-count":18,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2005,12]]}},"alternative-id":["545"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-6-220","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2005,9,7]]},"assertion":[{"value":"1 February 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 September 2005","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 September 2005","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"220"}}