{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T22:48:32Z","timestamp":1767998912300,"version":"3.49.0"},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2013,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Genotype imputation has become an indispensable step in genome-wide association studies (GWAS). Imputation accuracy, directly influencing downstream analysis, has been shown to improve with re-sequencing-based reference panels; however, this comes at the cost of a high computational burden due to the huge number of potentially imputable markers (tens of millions) discovered through sequencing a large number of individuals. Therefore, there is an increasing need for access to imputation quality information without actually conducting imputation. To facilitate this process, we have established a publicly available SNP and indel imputability database, aiming to provide direct access to imputation accuracy information for markers identified by the 1000 Genomes Project across four major populations and covering multiple GWAS genotyping platforms.<\/jats:p>\n               <jats:p>Results: SNP and indel imputability information can be retrieved through a user-friendly interface by providing the ID(s) of the desired variant(s) or by specifying the desired genomic region. The query results can be refined by selecting relevant GWAS genotyping platform(s). This is the first database providing variant imputability information specific to each continental group and to each genotyping platform. 
In Filipino individuals from the Cebu Longitudinal Health and Nutrition Survey, our database can achieve an area under the receiver-operating characteristic curve of 0.97, 0.91, 0.88 and 0.79 for markers with minor allele frequency &gt;5%, 3\u20135%, 1\u20133% and 0.5\u20131%, respectively. Specifically, by filtering out 48.6% of markers (corresponding to a reduction of up to 48.6% in computational costs for actual imputation) based on the imputability information in our database, we can remove 77%, 58%, 51% and 42% of the poorly imputed markers at the cost of only 0.3%, 0.8%, 1.5% and 4.6% of the well-imputed markers with minor allele frequency &gt;5%, 3\u20135%, 1\u20133% and 0.5\u20131%, respectively.<\/jats:p>\n               <jats:p>Availability: http:\/\/www.unc.edu\/~yunmli\/imputability.html<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <jats:p>Contact: yunli@med.unc.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts724","type":"journal-article","created":{"date-parts":[[2013,1,5]],"date-time":"2013-01-05T01:14:19Z","timestamp":1357348459000},"page":"528-531","source":"Crossref","is-referenced-by-count":20,"title":["A comprehensive SNP and indel imputability database"],"prefix":"10.1093","volume":"29","author":[{"given":"Qing","family":"Duan","sequence":"first","affiliation":[{"name":"1 Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA, 2Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599, USA and 3Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eric Yi","family":"Liu","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA, 2Department of Computer Science, University of North Carolina, Chapel 
Hill, NC 27599, USA and 3Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Damien C.","family":"Croteau-Chonka","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA, 2Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599, USA and 3Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Karen L.","family":"Mohlke","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA, 2Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599, USA and 3Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yun","family":"Li","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA, 2Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599, USA and 3Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599, 
USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2013,1,3]]},"reference":[{"key":"2023012810251391600_bts724-B1","doi-asserted-by":"crossref","first-page":"619","DOI":"10.1093\/ije\/dyq085","article-title":"Cohort profile: the Cebu longitudinal health and nutrition survey","volume":"40","author":"Adair","year":"2011","journal-title":"Int. J. Epidemiol."},{"key":"2023012810251391600_bts724-B2","doi-asserted-by":"crossref","first-page":"847","DOI":"10.1016\/j.ajhg.2009.11.004","article-title":"Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies","volume":"85","author":"Browning","year":"2009","journal-title":"Am. J. Hum. Genet."},{"key":"2023012810251391600_bts724-B3","doi-asserted-by":"crossref","first-page":"463","DOI":"10.1093\/hmg\/ddr480","article-title":"Population-specific coding variant underlies genome-wide association with adiponectin level","volume":"21","author":"Croteau-Chonka","year":"2012","journal-title":"Hum. Mol. Genet."},{"key":"2023012810251391600_bts724-B4","doi-asserted-by":"crossref","first-page":"e1000899","DOI":"10.1371\/journal.pgen.1000899","article-title":"Chromosome 9p21 SNPs associated with multiple disease phenotypes correlate with ANRIL expression","volume":"6","author":"Cunnington","year":"2010","journal-title":"PLoS Genet."},{"key":"2023012810251391600_bts724-B5","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1016\/j.ajhg.2011.08.001","article-title":"A variant in MCF2L is associated with osteoarthritis","volume":"89","author":"Day-Williams","year":"2011","journal-title":"Am. J. Hum. 
Genet."},{"key":"2023012810251391600_bts724-B6","doi-asserted-by":"crossref","first-page":"e11018","DOI":"10.1371\/journal.pone.0011018","article-title":"Utilizing genotype imputation for the augmentation of sequence data","volume":"5","author":"Fridley","year":"2010","journal-title":"PloS ONE"},{"key":"2023012810251391600_bts724-B7","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1534\/g3.111.001198","article-title":"Genotype imputation with thousands of genomes","volume":"1","author":"Howie","year":"2011","journal-title":"G3 (Bethesda, MD.)"},{"key":"2023012810251391600_bts724-B8","doi-asserted-by":"crossref","first-page":"955","DOI":"10.1038\/ng.2354","article-title":"Fast and accurate genotype imputation in genome-wide association studies through pre-phasing","volume":"44","author":"Howie","year":"2012","journal-title":"Nat. Genet."},{"key":"2023012810251391600_bts724-B9","doi-asserted-by":"crossref","first-page":"316","DOI":"10.1038\/ng.781","article-title":"A rare variant in MYH6 is associated with high risk of sick sinus syndrome","volume":"43","author":"Holm","year":"2011","journal-title":"Nat. Genet."},{"key":"2023012810251391600_bts724-B10","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1038\/ejhg.2012.3","article-title":"1000 Genomes-based imputation identifies novel and refined associations for the Wellcome Trust Case Control Consortium phase 1 Data","volume":"20","author":"Huang","year":"2012","journal-title":"Eur. J. Hum. Genet."},{"key":"2023012810251391600_bts724-B11","doi-asserted-by":"crossref","first-page":"2050","DOI":"10.1093\/hmg\/ddq062","article-title":"Genome-wide association study of homocysteine levels in Filipinos provides evidence for CPS1 in women and a stronger MTHFR effect in young adults","volume":"19","author":"Lange","year":"2010","journal-title":"Hum. Mol. 
Genet."},{"key":"2023012810251391600_bts724-B12","doi-asserted-by":"crossref","first-page":"e24945","DOI":"10.1371\/journal.pone.0024945","article-title":"Performance of genotype imputation for rare variants identified in exons and flanking regions of genes","volume":"6","author":"Li","year":"2011","journal-title":"PloS ONE"},{"key":"2023012810251391600_bts724-B25","doi-asserted-by":"crossref","first-page":"2213","DOI":"10.1093\/genetics\/165.4.2213","article-title":"Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data","volume":"165","author":"Li","year":"2003","journal-title":"Genetics"},{"key":"2023012810251391600_bts724-B13","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1146\/annurev.genom.9.081307.164242","article-title":"Genotype imputation","volume":"10","author":"Li","year":"2009","journal-title":"Ann. Rev. Genomics Hum. Genet."},{"key":"2023012810251391600_bts724-B14","doi-asserted-by":"crossref","first-page":"940","DOI":"10.1101\/gr.117259.110","article-title":"Low-coverage sequencing: implications for design of complex trait association studies","volume":"21","author":"Li","year":"2011","journal-title":"Genome Res."},{"key":"2023012810251391600_bts724-B15","doi-asserted-by":"crossref","first-page":"816","DOI":"10.1002\/gepi.20533","article-title":"MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes","volume":"34","author":"Li","year":"2010","journal-title":"Genet. Epidemiol."},{"key":"2023012810251391600_bts724-B16","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1002\/gepi.21603","article-title":"Genotype imputation of metabochip SNPs using a study-specific reference panel of \u223c4,000 haplotypes in African Americans from the women\u2019s health initiative","volume":"117","author":"Liu","year":"2012","journal-title":"Genet. 
Epidemiol."},{"key":"2023012810251391600_bts724-B17","first-page":"1","article-title":"MaCH-admix: genotype imputation for admixed populations","volume":"00","author":"Liu","year":"2012","journal-title":"Genet. Epidemiol."},{"key":"2023012810251391600_bts724-B18","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1038\/nrg2796","article-title":"Genotype imputation for genome-wide association studies","volume":"11","author":"Marchini","year":"2010","journal-title":"Nature reviews. Genetics"},{"key":"2023012810251391600_bts724-B19","doi-asserted-by":"crossref","first-page":"729","DOI":"10.1007\/s10038-007-0175-9","article-title":"Comparison of ENCODE region SNPs between Cebu Filipino and Asian HapMap samples","volume":"52","author":"Marvelle","year":"2007","journal-title":"J. Hum. Genet."},{"key":"2023012810251391600_bts724-B20","doi-asserted-by":"crossref","first-page":"1488","DOI":"10.1126\/science.1142447","article-title":"A common allele on chromosome 9 associated with coronary heart disease","volume":"316","author":"McPherson","year":"2007","journal-title":"Science"},{"key":"2023012810251391600_bts724-B21","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1002\/gepi.21634","article-title":"A two-platform design for next generation genome-wide association studies","volume":"36","author":"Sampson","year":"2012","journal-title":"Genet. 
Epidemiol."},{"key":"2023012810251391600_bts724-B22","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature09534","article-title":"A map of human genome variation from population-scale sequencing","volume":"467","author":"The 1000 Genomes Project Consortium","year":"2010","journal-title":"Nature"},{"key":"2023012810251391600_bts724-B23","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1038\/nature09298","article-title":"Integrating common and rare genetic variation in diverse human populations","volume":"467","author":"The International HapMap 3 Consortium","year":"2010","journal-title":"Nature"},{"key":"2023012810251391600_bts724-B24","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1002\/gepi.20552","article-title":"A comparison of approaches to account for uncertainty in analysis of imputed genotypes","volume":"35","author":"Zheng","year":"2011","journal-title":"Genet. Epidemiol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/4\/528\/48896983\/bioinformatics_29_4_528.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/29\/4\/528\/48896983\/bioinformatics_29_4_528.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,28]],"date-time":"2023-01-28T11:57:02Z","timestamp":1674907022000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/29\/4\/528\/199693"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,1,3]]},"references-count":25,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2013,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts724","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"136
7-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2013,2,15]]},"published":{"date-parts":[[2013,1,3]]}}}