{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T12:18:52Z","timestamp":1767961132949,"version":"3.49.0"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"21","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Contemporary, high-throughput sequencing efforts have identified a rich source of naturally occurring single nucleotide polymorphisms (SNPs), a subset of which occur in the coding region of genes and result in a change in the encoded amino acid sequence (non-synonymous coding SNPs or \u2018nsSNPs\u2019). It is hypothesized that a subset of these nsSNPs may underlie common human disease. Testing all these polymorphisms for disease association would be time consuming and expensive. Thus, computational methods have been developed to both prioritize candidate nsSNPs and make sense of their likely molecular physiologic impact.<\/jats:p><jats:p>Results: We have developed a method to prioritize nsSNPs and have applied it to the human protein kinase gene family. The results of our analyses provide high quality predictions and outperform available whole genome prediction methods (74% versus 83% prediction accuracy). Our analyses and methods consider both DNA sequence conservation, which most traditional methods are based on, as well unique structural and functional features of kinases. We provide a ranked list of common kinase nsSNPs that have a higher probability of impacting human disease based on our analyses.<\/jats:p><jats:p>Contact: \u00a0nschork@scripps.edu<\/jats:p><jats:p>Supplementary information: Supplementary data are available on Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm437","type":"journal-article","created":{"date-parts":[[2007,9,14]],"date-time":"2007-09-14T00:23:31Z","timestamp":1189729411000},"page":"2918-2925","source":"Crossref","is-referenced-by-count":56,"title":["Accurate prediction of deleterious protein kinase polymorphisms"],"prefix":"10.1093","volume":"23","author":[{"given":"Ali","family":"Torkamani","sequence":"first","affiliation":[{"name":"1 Department of Medicine and Center for Human Genetics and Genomics and 2Scripps Genomic Medicine and Department of Molecular and Experimental Medicine, The Scripps Research Institute, University of California, San Diego, La Jolla, CA 92093, USA"}]},{"given":"Nicholas J.","family":"Schork","sequence":"additional","affiliation":[{"name":"1 Department of Medicine and Center for Human Genetics and Genomics and 2Scripps Genomic Medicine and Department of Molecular and Experimental Medicine, The Scripps Research Institute, University of California, San Diego, La Jolla, CA 92093, USA"}]}],"member":"286","published-online":{"date-parts":[[2007,9,12]]},"reference":[{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"6395","DOI":"10.1073\/pnas.0408677102","article-title":"Solving the protein sequence metric problem","volume":"102","author":"Atchley","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1016\/S0306-9877(03)00332-3","article-title":"The common variants\/multiple disease hypothesis of common complex genetic disorders","volume":"62","author":"Becker","year":"2004","journal-title":"Med. Hypotheses"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"664","DOI":"10.1093\/bioinformatics\/btl649","article-title":"Deleterious SNP prediction: be mindful of your training data!","volume":"23","author":"Care","year":"2007","journal-title":"Bioinformatics"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1038\/10290","article-title":"Characterization of single-nucleotide polymorphisms in coding regions of the human genes","volume":"22","author":"Cargill","year":"1999","journal-title":"Nat. Genet."},{"key":"2023041107264848400_","first-page":"1399","article-title":"Genetics moves into the medical mainstream","volume":"294","author":"Collins","year":"2001","journal-title":"JAMA"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"820","DOI":"10.1126\/science.316.5826.820","article-title":"Genome-wide association. Closing the net on common disease genes","volume":"316","author":"Couzin","year":"2007","journal-title":"Science"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"3176","DOI":"10.1093\/bioinformatics\/bti486","article-title":"PMUT: a web-based tool for the annotation of pathological mutations on proteins","volume":"21","author":"Ferrer-Costa","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"e90","DOI":"10.1371\/journal.pcbi.0020090","article-title":"Wiggle \u2013 predicting functionally flexible regions from primary sequence","volume":"2","author":"Gu","year":"2006","journal-title":"PLoS Comput. Biol."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1038\/10297","article-title":"Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis","volume":"22","author":"Halushka","year":"1999","journal-title":"Nat. Genet."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1096\/fasebj.9.8.7768349","article-title":"Protein kinases 6. The eukaryotic protein kinase superfamily: kinase (catalytic domain structure and classification)","volume":"9","author":"Hanks","year":"1995","journal-title":"FASEB J."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1016\/S0969-2126(00)00065-4","article-title":"Volume changes on protein folding","volume":"2","author":"Harpaz","year":"1994","journal-title":"Structure"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"727","DOI":"10.1038\/nrd892","article-title":"The druggable genome","volume":"1","author":"Hopkins","year":"2002","journal-title":"Nat. Rev. Drug Discov."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1098\/rstb.1998.0228","article-title":"Croonian lecture: the phosphorylation of proteins on tyrosine \u2013 its role in cell growth and disease","volume":"353","author":"Hunter","year":"1998","journal-title":"Philos. Trans. R. Soc. Lond. B Biol. Sci."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1086\/519747","article-title":"Sequence-based prioritization of nonsynonymous single-nucleotide polymorphisms for the study of disease mutations","volume":"81","author":"Jian","year":"2007","journal-title":"Am. J. Hum. Genet."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1093\/bib\/5.2.150","article-title":"MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment","volume":"5","author":"Kumar","year":"2004","journal-title":"Brief. Bioinformatics"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/0022-2836(82)90515-0","article-title":"A simple method for displaying the hydropathic character of a protein","volume":"157","author":"Kyte","year":"1982","journal-title":"J. Mol. Biol."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"49045","DOI":"10.1074\/jbc.M409358200","article-title":"Direct binding of DNA by tumor suppressor menin","volume":"279","author":"La","year":"2004","journal-title":"J. Biol. Chem."},{"key":"2023041107264848400_","first-page":"860","article-title":"Initial sequencing and analysis of the human genome","volume":"209","author":"Lander","year":"2001","journal-title":"Nature"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1016\/j.jbi.2005.02.008","article-title":"The use of receiver operating characteristic curves in biomedical informatics","volume":"38","author":"Lasko","year":"2005","journal-title":"J. Biomed. Inform."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1016\/S0888-7543(03)00009-0","article-title":"Distribution analysis of nonsynonymous polymorphisms within the G-protein-coupled receptor gene family","volume":"81","author":"Lee","year":"2003","journal-title":"Genomics"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"1821","DOI":"10.1101\/gr.2730004","article-title":"Pattern of sequence variation across 213 environmental response genes","volume":"14","author":"Livingston","year":"2004","journal-title":"Genome Res."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"2129","DOI":"10.1056\/NEJMoa040938","article-title":"Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib","volume":"21","author":"Lynch","year":"2004","journal-title":"N. Engl. J. Med."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1186\/1471-2105-7-301","article-title":"Improving the accuracy of protein secondary structure prediction using structural alignment","volume":"14","author":"Montgomerie","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1101\/gr.212802","article-title":"Accounting for human polymorphisms predicted to affect protein function","volume":"12","author":"Ng","year":"2002","journal-title":"Genome Res."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1146\/annurev.genom.7.080505.115630","article-title":"Predicting the effects of amino acid substitutions on protein function","volume":"7","author":"Ng","year":"2006","journal-title":"Annu. Rev. Genomics Hum. Genet."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1046\/j.1469-1809.2002.00119.x","article-title":"The expected power of genome-wide linkage disequilibrium testing using single nucleotide polymorphism markers for detecting a low-frequency disease variant","volume":"66","author":"Ohashi","year":"2002","journal-title":"Ann. Hum. Genet."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1186\/1471-2105-7-312","article-title":"Prediction of catalytic residues using support vector machine with selected protein sequence and structural properties","volume":"21","author":"Petrova","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1086\/321272","article-title":"Are rare variants responsible for susceptibility to common diseases?","volume":"69","author":"Pritchard","year":"2001","journal-title":"Am. J. Hum. Genet."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"2417","DOI":"10.1093\/hmg\/11.20.2417","article-title":"The allelic architecture of human disease genes: common disease-common variant \u2026 or not?","volume":"20","author":"Pritchard","year":"2002","journal-title":"Hum. Mol. Genet."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1016\/S0168-9525(01)02410-6","article-title":"On the allelic spectrum of human disease","volume":"17","author":"Reich","year":"2001","journal-title":"Trends Genet."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"W321","DOI":"10.1093\/nar\/gkh377","article-title":"The PredictProtein server","volume":"32","author":"Rost","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1038\/35057149","article-title":"A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms","volume":"409","author":"Sachidanandam","year":"2001","journal-title":"Nature"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1038\/nature05874","article-title":"Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project","volume":"447","author":"The ENCODE Project Consortium","year":"2007","journal-title":"Nature"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"789","DOI":"10.1038\/nature02168","article-title":"The international HapMap project","volume":"426","author":"The International HapMap Consortium","year":"2003","journal-title":"Nature"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"15398","DOI":"10.1073\/pnas.0404380101","article-title":"Coding single-nucleotide polymorphisms associated with complex vs. Mendelian disease: evolutionary evidence for differences in molecular effects","volume":"101","author":"Thomas","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"2129","DOI":"10.1101\/gr.772403","article-title":"PANTHER: a library of protein families and subfamilies indexed by function","volume":"13","author":"Thomas","year":"2003","journal-title":"Genome Res."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1016\/j.ygeno.2007.03.006","article-title":"Distribution analysis of nonsynonymous polymorphisms within the human kinase gene family","volume":"90","author":"Torkamani","year":"2007","journal-title":"Genomics"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"1304","DOI":"10.1126\/science.1058040","article-title":"The sequence of the human genome","volume":"291","author":"Venter","year":"2001","journal-title":"Science"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1146\/annurev.biophys.28.1.319","article-title":"Membrane protein folding and stability: physical principles","volume":"28","author":"White","year":"1999","journal-title":"Ann. Rev. Biophys. Biomol. Struct."},{"key":"2023041107264848400_","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten","year":"2005","edition":"2nd"},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"1129","DOI":"10.1093\/ije\/dyi130","article-title":"How many genes underlie the occurrence of common complex diseases in the population?","volume":"34","author":"Yang","year":"2005","journal-title":"Int. J. Epidemiol."},{"key":"2023041107264848400_","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1186\/1471-2105-7-166","article-title":"SNPs3D: candidate gene and SNP selection for association studies","volume":"7","author":"Yue","year":"2006","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/21\/2918\/49822625\/bioinformatics_23_21_2918.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/21\/2918\/49822625\/bioinformatics_23_21_2918.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,14]],"date-time":"2023-05-14T04:11:52Z","timestamp":1684037512000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/21\/2918\/372426"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,9,12]]},"references-count":42,"journal-issue":{"issue":"21","published-print":{"date-parts":[[2007,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm437","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,11,1]]},"published":{"date-parts":[[2007,9,12]]}}}