{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T13:35:11Z","timestamp":1770903311297,"version":"3.50.1"},"reference-count":48,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Enzyme catalysis is involved in numerous biological processes and the disruption of enzymatic activity has been implicated in human disease. Despite this, various aspects of catalytic reactions are not completely understood, such as the mechanics of reaction chemistry and the geometry of catalytic residues within active sites. As a result, the computational prediction of catalytic residues has the potential to identify novel catalytic pockets, aid in the design of more efficient enzymes and also predict the molecular basis of disease.<\/jats:p>\n               <jats:p>Results: We propose a new kernel-based algorithm for the prediction of catalytic residues based on protein sequence, structure and evolutionary information. The method relies upon explicit modeling of similarity between residue-centered neighborhoods in protein structures. We present evidence that this algorithm evaluates favorably against established approaches, and also provides insights into the relative importance of the geometry, physicochemical properties and evolutionary conservation of catalytic residue activity. The new algorithm was used to identify known mutations associated with inherited disease whose molecular mechanism might be predicted to operate specifically though the loss or gain of catalytic residues. It should, therefore, provide a viable approach to identifying the molecular basis of disease in which the loss or gain of function is not caused solely by the disruption of protein stability. Our analysis suggests that both mechanisms are actively involved in human inherited disease.<\/jats:p>\n               <jats:p>Availability and Implementation: Source code for the structural kernel is available at www.informatics.indiana.edu\/predrag\/<\/jats:p>\n               <jats:p>Contact: \u00a0predrag@indiana.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq319","type":"journal-article","created":{"date-parts":[[2010,6,16]],"date-time":"2010-06-16T01:09:58Z","timestamp":1276650598000},"page":"1975-1982","source":"Crossref","is-referenced-by-count":15,"title":["Structure-based kernels for the prediction of catalytic residues and their involvement in human inherited disease"],"prefix":"10.1093","volume":"26","author":[{"given":"Fuxiao","family":"Xin","sequence":"first","affiliation":[{"name":"1 School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA, 2Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK and 3Buck Institute for Age Research, Novato, CA 94945, USA"}]},{"given":"Steven","family":"Myers","sequence":"additional","affiliation":[{"name":"1 School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA, 2Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK and 3Buck Institute for Age Research, Novato, CA 94945, USA"}]},{"given":"Yong Fuga","family":"Li","sequence":"additional","affiliation":[{"name":"1 School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA, 2Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK and 3Buck Institute for Age Research, Novato, CA 94945, USA"}]},{"given":"David N.","family":"Cooper","sequence":"additional","affiliation":[{"name":"1 School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA, 2Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK and 3Buck Institute for Age Research, Novato, CA 94945, USA"}]},{"given":"Sean D.","family":"Mooney","sequence":"additional","affiliation":[{"name":"1 School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA, 2Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK and 3Buck Institute for Age Research, Novato, CA 94945, USA"}]},{"given":"Predrag","family":"Radivojac","sequence":"additional","affiliation":[{"name":"1 School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA, 2Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff CF14 4XN, UK and 3Buck Institute for Age Research, Novato, CA 94945, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,6,15]]},"reference":[{"key":"2023012508022489600_B1","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1186\/1471-2105-10-197","article-title":"Resboost: characterizing and predicting catalytic residues in enzymes","volume":"10","author":"Alterovitz","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012508022489600_B2","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/S0022-2836(02)01036-7","article-title":"Analysis of catalytic residues in enzyme active sites","volume":"324","author":"Bartlett","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B3","doi-asserted-by":"crossref","first-page":"1196","DOI":"10.1126\/science.1085515","article-title":"A perspective on enzyme catalysis","volume":"301","author":"Benkovic","year":"2003","journal-title":"Science"},{"key":"2023012508022489600_B4","doi-asserted-by":"crossref","first-page":"3317","DOI":"10.1021\/bi800049z","article-title":"Free-energy landscape of enzyme catalysis","volume":"47","author":"Benkovic","year":"2008","journal-title":"Biochemistry"},{"issue":"Suppl. 1","key":"2023012508022489600_B5","doi-asserted-by":"crossref","first-page":"i47","DOI":"10.1093\/bioinformatics\/bti1007","article-title":"Protein function prediction via graph kernels","volume":"21","author":"Borgwardt","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012508022489600_B6","first-page":"137","article-title":"Non-mercer kernels for SVM object recognition","volume-title":"British Machine Vision Conference (BMVC)","author":"Boughorbel","year":"2004"},{"key":"2023012508022489600_B7","doi-asserted-by":"crossref","first-page":"W306","DOI":"10.1093\/nar\/gki375","article-title":"I-mutant2.0: predicting stability changes upon mutation from the protein sequence or structure","volume":"33","author":"Capriotti","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012508022489600_B8","doi-asserted-by":"crossref","first-page":"885","DOI":"10.1006\/jmbi.2001.5009","article-title":"Prediction of functionally important residues based solely on the computed energetics of protein structure","volume":"312","author":"Elcock","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B9","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.atherosclerosis.2008.10.027","article-title":"Degradation of ldlr protein mediated by \u2018gain of function\u2019 PCSK9 mutants in normal and ARH cells","volume":"203","author":"Fasano","year":"2009","journal-title":"Atherosclerosis"},{"key":"2023012508022489600_B10","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1145\/1102351.1102380","article-title":"Optimal assignment kernels for attributed molecular graphs","volume-title":"Proceedings of the 22nd international conference on Machine learning","author":"Fr\u00f6hlich","year":"2005"},{"key":"2023012508022489600_B11","article-title":"Kernel Methods in Chemo- and Bioinformatics","volume-title":"PhD. thesis","author":"Fr\u00f6hlich","year":"2006"},{"key":"2023012508022489600_B12","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1126\/science.1088172","article-title":"How enzymes work: analysis by modern rate theory and computer simulations","volume":"303","author":"Garcia-Viloca","year":"2004","journal-title":"Science"},{"key":"2023012508022489600_B13","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1093\/protein\/6.1.29","article-title":"The prediction and characterization of metal binding sites in proteins","volume":"6","author":"Gregory","year":"1993","journal-title":"Protein Eng."},{"key":"2023012508022489600_B14","doi-asserted-by":"crossref","first-page":"707","DOI":"10.1006\/jmbi.1993.1074","article-title":"Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm","volume":"229","author":"Grindley","year":"1993","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B15","first-page":"154","article-title":"Neural net representations of empirical protein potentials","volume":"3","author":"Grossman","year":"1995","journal-title":"Proc. Int. Conf. Intell Syst. Mol. Biol."},{"key":"2023012508022489600_B16","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.jmb.2004.11.013","article-title":"Conformational changes observed in enzyme crystal structures upon substrate binding","volume":"346","author":"Gutteridge","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B17","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1016\/S0022-2836(03)00515-1","article-title":"Using a neural network and spatial clustering to predict the location of active sites in enzymes","volume":"330","author":"Gutteridge","year":"2003","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B18","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4615-0907-3","volume-title":"Learning to classify text using support vector machines: methods, theory, and algorithms.","author":"Joachims","year":"2002"},{"key":"2023012508022489600_B19","doi-asserted-by":"crossref","first-page":"2995","DOI":"10.1172\/JCI29383","article-title":"Secreted pcsk9 decreases the number of ldl receptors in hepatocytes and in livers of parabiotic mice","volume":"116","author":"Lagace","year":"2006","journal-title":"J. Clin. Invest."},{"key":"2023012508022489600_B20","first-page":"1435","article-title":"Fast string kernels using inexact matching for protein sequences","volume":"5","author":"Leslie","year":"2004","journal-title":"J. Mach. Learn. Res."},{"key":"2023012508022489600_B21","doi-asserted-by":"crossref","first-page":"2744","DOI":"10.1093\/bioinformatics\/btp528","article-title":"Automated inference of molecular mechanisms of disease from amino acid substitutions","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012508022489600_B22","doi-asserted-by":"crossref","first-page":"e1000497","DOI":"10.1371\/journal.pcbi.1000497","article-title":"Influence of sequence changes and environment on intrinsically disordered proteins","volume":"5","author":"Mohan","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023012508022489600_B23","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1002\/humu.21192","article-title":"In silico functional profiling of human disease-associated and polymorphic amino acid substitutions","volume":"31","author":"Mort","year":"2010","journal-title":"Hum. Mutat."},{"key":"2023012508022489600_B24","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1016\/S0022-2836(05)80134-2","article-title":"SCOP: a structural classification of proteins database for the investigation of sequences and structures","volume":"247","author":"Murzin","year":"1995","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B25","doi-asserted-by":"crossref","first-page":"1135","DOI":"10.1038\/nbt1209-1135","article-title":"How does multiple testing correction work?","volume":"27","author":"Noble","year":"2009","journal-title":"Nature Biotechnology"},{"key":"2023012508022489600_B26","doi-asserted-by":"crossref","first-page":"12473","DOI":"10.1073\/pnas.211436698","article-title":"Thematics: a simple computational predictor of enzyme function from structure","volume":"98","author":"Ondrechen","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508022489600_B27","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1016\/S0022-2836(03)00207-9","article-title":"Prediction of catalytic residues in enzymes based on known tertiary structure, stability profile, and sequence conservation","volume":"327","author":"Ota","year":"2003","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B28","doi-asserted-by":"crossref","first-page":"14754","DOI":"10.1073\/pnas.0404569101","article-title":"Automated prediction of protein function and detection of functional sites from structure","volume":"101","author":"Pazos","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508022489600_B29","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1186\/1471-2105-7-312","article-title":"Prediction of catalytic residues using support vector machine with selected protein sequence and structural properties","volume":"7","author":"Petrova","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508022489600_B30","doi-asserted-by":"crossref","first-page":"D129","DOI":"10.1093\/nar\/gkh028","article-title":"The catalytic site atlas: a resource of catalytic sites and residues identified in enzymes using structural data","volume":"32","author":"Porter","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012508022489600_B31","doi-asserted-by":"crossref","first-page":"i241","DOI":"10.1093\/bioinformatics\/btn267","article-title":"Gain and loss of phosphorylation sites in human cancer","volume":"24","author":"Radivojac","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508022489600_B32","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1093\/bioinformatics\/btq008","article-title":"Active site prediction using evolutionary and structural information","volume":"26","author":"Sankararaman","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012508022489600_B33","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/4057.001.0001","volume-title":"Kernel methods in computational biology.","author":"Sch\u00f6lkopf","year":"2004"},{"key":"2023012508022489600_B34","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1186\/gm13","article-title":"The human gene mutation database: 2008 update","volume":"1","author":"Stenson","year":"2009","journal-title":"Genome Med."},{"key":"2023012508022489600_B35","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1093\/protein\/gzn003","article-title":"An improved prediction of catalytic residues in enzyme structures","volume":"21","author":"Tang","year":"2008","journal-title":"Protein Eng. Des. Sel."},{"key":"2023012508022489600_B36","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1110\/ps.073213608","article-title":"Enhanced performance in prediction of protein active sites with thematics and support vector machines","volume":"17","author":"Tong","year":"2008","journal-title":"Protein Sci."},{"key":"2023012508022489600_B37","doi-asserted-by":"crossref","first-page":"e1000266","DOI":"10.1371\/journal.pcbi.1000266","article-title":"Partial order optimum likelihood (pool): maximum likelihood prediction of protein active site residues using 3D structure and sequence properties","volume":"5","author":"Tong","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023012508022489600_B38","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1016\/j.jmb.2005.01.044","article-title":"Using a library of structural templates to recognise catalytic sites and explore their evolution in homologous families","volume":"347","author":"Torrance","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B39","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1089\/cmb.2009.0029","article-title":"Graphlet kernels for prediction of functional residues in protein structures","volume":"17","author":"Vacic","year":"2010","journal-title":"J. Comput. Biol."},{"key":"2023012508022489600_B40","article-title":"The optimal assignment kernel is not positive definite","author":"Vert","year":"2008","journal-title":"CoRR"},{"key":"2023012508022489600_B41","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1002\/pro.5560050603","article-title":"Derivation of 3d coordinate templates for searching structural databases: application to ser-his-asp catalytic triads in the serine proteinases and lipases","volume":"5","author":"Wallace","year":"1996","journal-title":"Protein Sci."},{"key":"2023012508022489600_B42","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1002\/humu.22","article-title":"SNPS, protein structure, and disease","volume":"17","author":"Wang","year":"2001","journal-title":"Hum. Mutat."},{"key":"2023012508022489600_B43","doi-asserted-by":"crossref","first-page":"938","DOI":"10.1021\/ar000058i","article-title":"The depth of chemical time and the power of enzymes as catalysts","volume":"34","author":"Wolfenden","year":"2001","journal-title":"Acc. Chem. Res."},{"key":"2023012508022489600_B44","doi-asserted-by":"crossref","first-page":"R8","DOI":"10.1186\/gb-2008-9-1-r8","article-title":"The seqfeature library of 3D functional site models: comparison to existing methods and applications to protein function annotation","volume":"9","author":"Wu","year":"2008","journal-title":"Genome Biol."},{"key":"2023012508022489600_B45","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1110\/ps.062523907","article-title":"Evaluation of features for catalytic residue prediction in novel folds","volume":"16","author":"Youn","year":"2007","journal-title":"Protein Sci."},{"key":"2023012508022489600_B46","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1016\/j.jmb.2005.12.025","article-title":"Identification and analysis of deleterious human SNPS","volume":"356","author":"Yue","year":"2006","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B47","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1016\/j.jmb.2005.08.020","article-title":"Loss of protein structure stability as a major causative factor in monogenic disease","volume":"353","author":"Yue","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012508022489600_B48","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1093\/protein\/2.2.127","article-title":"Analysis and prediction of the location of catalytic residues in enzymes","volume":"2","author":"Zvelebil","year":"1988","journal-title":"Protein Eng."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/16\/1975\/48854328\/bioinformatics_26_16_1975.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/16\/1975\/48854328\/bioinformatics_26_16_1975.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:03:35Z","timestamp":1674633815000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/16\/1975\/216235"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,6,15]]},"references-count":48,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2010,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq319","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,8,15]]},"published":{"date-parts":[[2010,6,15]]}}}