{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,29]],"date-time":"2026-03-29T12:23:21Z","timestamp":1774787001099,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Limited availability of data has hindered the development of algorithms that can identify functionally meaningful regulatory single nucleotide polymorphisms (rSNPs). Given the large number of common polymorphisms known to reside in the human genome, the identification of functional rSNPs via laboratory assays will be costly and time-consuming. Therefore appropriate bioinformatics strategies for predicting functional rSNPs are necessary. Recent data from the Encyclopedia of DNA Elements (ENCODE) Project has significantly expanded the amount of available functional information relevant to non-coding regions of the genome, and, importantly, led to the conclusion that many functional elements in the human genome are not conserved.<\/jats:p>\n               <jats:p>Results: In this article we describe how ENCODE data can be leveraged to probabilistically determine the functional and phenotypic significance of non-coding SNPs (ncSNPs). The method achieves excellent sensitivity (\u223c80%) and specificity (\u223c99%) based on a set of known phenotypically relevant and non-functional SNPs. In addition, we show that our method is not overtrained through the use of cross-validation analyses.<\/jats:p>\n               <jats:p>Availability: The software platforms used in our analyses are freely available (http:\/\/www.cs.waikato.ac.nz\/ml\/weka\/). In addition, we provide the training dataset (Supplementary Table 3), and our predictions (Supplementary Table 6), in the Supplementary Material.<\/jats:p>\n               <jats:p>Contact: \u00a0nschork@scripps.edu.<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn311","type":"journal-article","created":{"date-parts":[[2008,6,19]],"date-time":"2008-06-19T00:14:16Z","timestamp":1213834456000},"page":"1787-1792","source":"Crossref","is-referenced-by-count":18,"title":["Predicting functional regulatory polymorphisms"],"prefix":"10.1093","volume":"24","author":[{"given":"Ali","family":"Torkamani","sequence":"first","affiliation":[{"name":"Scripps Genomic Medicine and the Scripps Translational Science Institute, Scripps Health and Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA"}]},{"given":"Nicholas J.","family":"Schork","sequence":"additional","affiliation":[{"name":"Scripps Genomic Medicine and the Scripps Translational Science Institute, Scripps Health and Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA"}]}],"member":"286","published-online":{"date-parts":[[2008,6,18]]},"reference":[{"key":"2023020210495576300_B1","doi-asserted-by":"crossref","first-page":"e5","DOI":"10.1371\/journal.pcbi.0040005","article-title":"In silico detection of sequence variations modifying transcriptional regulation","volume":"4","author":"Andersen","year":"2008","journal-title":"PLoS Comput. Biol"},{"key":"2023020210495576300_B2","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1002\/humu.20207","article-title":"Strong bias in the location of functional promoter polymorphisms","volume":"26","author":"Buckland","year":"2005","journal-title":"Hum. Mutat"},{"key":"2023020210495576300_B3","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1016\/j.bbadis.2005.10.004","article-title":"The importance and identification of regulatory polymorphisms and their mechanisms of action","volume":"1762","author":"Buckland","year":"2006","journal-title":"Biochim. Biophys. Acta"},{"key":"2023020210495576300_B4","doi-asserted-by":"crossref","first-page":"16433","DOI":"10.1073\/pnas.162342499","article-title":"The insulation of genes from external enhancers and silencing chromatin","volume":"99","author":"Burgess-Beusse","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210495576300_B5","doi-asserted-by":"crossref","first-page":"1121","DOI":"10.1016\/S0140-6736(05)67424-7","article-title":"Genetic association studies","volume":"366","author":"Cordell","year":"2005","journal-title":"Lancet"},{"key":"2023020210495576300_B6","doi-asserted-by":"crossref","first-page":"1933","DOI":"10.1016\/j.jacc.2007.07.062","article-title":"Future use of genomics in coronary artery disease","volume":"50","author":"Damani","year":"2007","journal-title":"J. Am. Coll. Cardiol"},{"key":"2023020210495576300_B7","doi-asserted-by":"crossref","first-page":"1827","DOI":"10.1371\/journal.pgen.0030170","article-title":"Power to detect risk alleles using genome-wide tag SNP panels","volume":"3","author":"Eberle","year":"2007","journal-title":"PLoS Genet"},{"key":"2023020210495576300_B8","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1038\/nature05874","article-title":"Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project","volume":"447","author":"ENCODE Project Consortium","year":"2007","journal-title":"Nature"},{"key":"2023020210495576300_B9","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/S0092-8674(00)80073-2","article-title":"Chromatin unfolds","volume":"86","author":"Felsenfeld","year":"1996","journal-title":"Cell"},{"key":"2023020210495576300_B10","doi-asserted-by":"crossref","first-page":"D707","DOI":"10.1093\/nar\/gkm988","article-title":"Ensembl 2008","volume":"36","author":"Flicek","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2023020210495576300_B11","first-page":"338","article-title":"Estimating continuous distributions in Bayesian classifiers","author":"George","year":"1995"},{"key":"2023020210495576300_B12","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.ajhg.2007.09.006","article-title":"Shifting paradigm of association studies, value of rare singlenucleotide polymorphisms","volume":"82","author":"Gorlov","year":"2008","journal-title":"Am. J. Hum. Genet"},{"key":"2023020210495576300_B13","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1146\/annurev.bi.57.070188.001111","article-title":"Nuclease hypersensitive sites in chromatin","volume":"57","author":"Gross","year":"1988","journal-title":"Annu. Rev. Biochem"},{"key":"2023020210495576300_B14","doi-asserted-by":"crossref","first-page":"2249","DOI":"10.1093\/hmg\/ddg246","article-title":"Functional analysis of human promoter polymorphisms","volume":"12","author":"Hoogendoorn","year":"2003","journal-title":"Hum. Mol. Genet"},{"key":"2023020210495576300_B15","doi-asserted-by":"crossref","first-page":"D773","DOI":"10.1093\/nar\/gkm966","article-title":"The UCSC Genome Browser Database, 2008 update","volume":"36","author":"Karolchik","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2023020210495576300_B16","doi-asserted-by":"crossref","first-page":"3576","DOI":"10.1093\/nar\/gkg585","article-title":"MATCH, a tool for searching transcription factor binding sites in DNA sequences","volume":"31","author":"Kel","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023020210495576300_B17","first-page":"656","article-title":"BLAT\u2013the BLAST-like alignment tool","volume":"12","author":"Kent","year":"2002","journal-title":"Genome Res"},{"key":"2023020210495576300_B18","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1016\/S0065-2660(07)00417-8","article-title":"Study designs for genome-wide association studies","volume":"60","author":"Kraft","year":"2008","journal-title":"Adv. Genet"},{"key":"2023020210495576300_B19","doi-asserted-by":"crossref","first-page":"191","DOI":"10.2307\/2347628","article-title":"Ridge estimators in logistic regression","volume":"41","author":"Cessie","year":"1992","journal-title":"Appl. Stat"},{"key":"2023020210495576300_B20","doi-asserted-by":"crossref","first-page":"7357","DOI":"10.1073\/pnas.0401866101","article-title":"Distinct localization of histone H3 acetylation and H3-K4 methylation to the transcription start sites in the human genome","volume":"101","author":"Liang","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210495576300_B21","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1016\/j.ajhg.2007.10.012","article-title":"Accommodating linkage disequilibrium in genetic-association analyses via ridge regression","volume":"82","author":"Malo","year":"2008","journal-title":"Am. J. Hum. Genet"},{"key":"2023020210495576300_B22","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1038\/nrg2203","article-title":"New links to the pathogenesis of Crohn disease provided by genome-wide association scans","volume":"9","author":"Mathew","year":"2008","journal-title":"Nat. Rev. Genet"},{"key":"2023020210495576300_B23","doi-asserted-by":"crossref","first-page":"e1000018","DOI":"10.1371\/journal.pgen.1000018","article-title":"Functional characterization of variations on regulatory motifs","volume":"4","author":"Michal","year":"2008","journal-title":"PLoS Genet"},{"key":"2023020210495576300_B24","doi-asserted-by":"crossref","first-page":"1797","DOI":"10.1101\/gr.6761107","article-title":"28-way vertebrate alignment and conservation track in the UCSC Genome Browser","volume":"17","author":"Miller","year":"2007","journal-title":"Genome Res"},{"key":"2023020210495576300_B25","doi-asserted-by":"crossref","first-page":"e106","DOI":"10.1371\/journal.pcbi.0030106","article-title":"A survey of genomic properties for the detection of regulatory polymorphisms","volume":"3","author":"Montgomery","year":"2007","journal-title":"PLoS Comput. Biol"},{"key":"2023020210495576300_B26","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1093\/bib\/6.1.44","article-title":"Bioinformatics approaches and resources for single nucleotide polymorphism functional analysis","volume":"6","author":"Mooney","year":"2005","journal-title":"Brief. Bioinform"},{"key":"2023020210495576300_B27","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1146\/annurev.genom.7.080505.115630","article-title":"Predicting the effects of amino acid substitutions on protein function","volume":"7","author":"Ng","year":"2006","journal-title":"Annu. Rev. Genomics Hum. Genet"},{"key":"2023020210495576300_B28","doi-asserted-by":"crossref","first-page":"1991","DOI":"10.1093\/oxfordjournals.molbev.a004023","article-title":"Abundant raw material for cis-regulatory evolution in humans","volume":"19","author":"Rockman","year":"2002","journal-title":"Mol. Biol. Evol"},{"key":"2023020210495576300_B29","doi-asserted-by":"crossref","first-page":"939","DOI":"10.1038\/nbt1098-939","article-title":"Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation","volume":"16","author":"Roth","year":"1998","journal-title":"Nat. Biotechnol"},{"key":"2023020210495576300_B30","doi-asserted-by":"crossref","first-page":"85","DOI":"10.2174\/156652407779940413","article-title":"DNA methylation in health, disease, and cancer","volume":"7","author":"Shames","year":"2007","journal-title":"Curr. Mol. Med"},{"key":"2023020210495576300_B31","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1101\/gr.3715005","article-title":"Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes","volume":"15","author":"Siepel","year":"2005","journal-title":"Genome Res"},{"key":"2023020210495576300_B32","doi-asserted-by":"crossref","first-page":"577","DOI":"10.1002\/humu.10212","article-title":"Human Gene Mutation Database (HGMD), 2003 update","volume":"21","author":"Stenson","year":"2003","journal-title":"Hum. Mutat"},{"key":"2023020210495576300_B33","doi-asserted-by":"crossref","first-page":"789","DOI":"10.1038\/nature02168","article-title":"The International HapMap Project","volume":"426","author":"The International HapMap Consortium","year":"2003","journal-title":"Nature"},{"key":"2023020210495576300_B34","doi-asserted-by":"crossref","first-page":"2918","DOI":"10.1093\/bioinformatics\/btm437","article-title":"Accurate prediction of deleterious protein kinase polymorphisms","volume":"23","author":"Torkamani","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020210495576300_B35","volume-title":"Data Mining, Practical Machine Learning Tools and Techniques.","author":"Witten","year":"2005","edition":"2nd"},{"key":"2023020210495576300_B36","doi-asserted-by":"crossref","first-page":"1520","DOI":"10.1101\/gr.6665407","article-title":"Prediction of individual genetic risk to disease from genomewide association studies","volume":"17","author":"Wray","year":"2007","journal-title":"Genome Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/16\/1787\/49049774\/bioinformatics_24_16_1787.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/16\/1787\/49049774\/bioinformatics_24_16_1787.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T12:37:07Z","timestamp":1675341427000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/16\/1787\/202236"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,6,18]]},"references-count":36,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2008,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn311","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,8,15]]},"published":{"date-parts":[[2008,6,18]]}}}