{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,8]],"date-time":"2025-11-08T12:09:47Z","timestamp":1762603787269},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-13-235","type":"journal-article","created":{"date-parts":[[2012,9,15]],"date-time":"2012-09-15T09:55:15Z","timestamp":1347702915000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":34,"title":["Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction"],"prefix":"10.1186","volume":"13","author":[{"given":"Elin","family":"Teppa","sequence":"first","affiliation":[]},{"given":"Angela D","family":"Wilkins","sequence":"additional","affiliation":[]},{"given":"Morten","family":"Nielsen","sequence":"additional","affiliation":[]},{"given":"Cristina Marino","family":"Buslje","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,9,14]]},"reference":[{"key":"5460_CR1","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1093\/nar\/gkh028","volume":"32","author":"CT Porter","year":"2004","unstructured":"Porter CT, Bartlett GJ, Thornton JM, The Catalytic Site Atlas: Nucleic Acids Res. 2004, 32: 129-133. Database issue","journal-title":"Nucleic Acids Res"},{"issue":"3-4","key":"5460_CR2","first-page":"159","volume":"5","author":"W Oliveira L","year":"1997","unstructured":"Oliveira L W, Vriend G, Ljzerman AP: Identification of class-determining residues in G protein-coupled receptors by sequence analysis. Receptors Channels. 5th edition. 1997, 5 (3-4): 159-174.","journal-title":"Receptors Channels. 5th edition"},{"issue":"22","key":"5460_CR3","doi-asserted-by":"publisher","first-page":"6540","DOI":"10.1093\/nar\/gkl901","volume":"34","author":"W Pirovano","year":"2006","unstructured":"Pirovano W, Feenstra KA, Heringa J: Sequence comparison by sequence harmony identifies subtype-specific functional sites. Nucleic Acids Res. 2006, 34 (22): 6540-6548. 10.1093\/nar\/gkl901.","journal-title":"Nucleic Acids Res"},{"key":"5460_CR4","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1002\/prot.22239","volume":"75","author":"S Chakrabarti","year":"2009","unstructured":"Chakrabarti S, Panchenko AR: Coevolution in defining the functional specificity. Proteins. 2009, 75: 231-240. 10.1002\/prot.22239.","journal-title":"Proteins"},{"issue":"2","key":"5460_CR5","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1038\/nsb0295-171","volume":"2","author":"G Casari","year":"1995","unstructured":"Casari G, Sander C, Valencia A: A method to predict functional residues in proteins. Nat Struct Mol Biol. 1995, 2 (2): 171-178. 10.1038\/nsb0295-171.","journal-title":"Nat Struct Mol Biol"},{"issue":"1","key":"5460_CR6","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1006\/jmbi.2000.4036","volume":"303","author":"SS Hannenhalli","year":"2000","unstructured":"Hannenhalli SS, Russell RB: Analysis and prediction of functional sub-types from protein sequence alignments. J Mol Biol. 2000, 303 (1): 61-76. 10.1006\/jmbi.2000.4036.","journal-title":"J Mol Biol"},{"key":"5460_CR7","doi-asserted-by":"publisher","first-page":"e160","DOI":"10.1371\/journal.pcbi.0030160","volume":"3","author":"DP Brown","year":"2007","unstructured":"Brown DP, Krishnamurthy N, Sjolander K: Automated protein subfamily identification and classification. PLoS Comput Biol. 2007, 3: e160-10.1371\/journal.pcbi.0030160.","journal-title":"PLoS Comput Biol"},{"issue":"8","key":"5460_CR8","doi-asserted-by":"publisher","first-page":"1435","DOI":"10.1093\/oxfordjournals.molbev.a003929","volume":"18","author":"N Wicker","year":"2001","unstructured":"Wicker N, et al: Secator: A Program for Inferring Protein Subfamilies from Phylogenetic Trees. Mol Biol Evol. 2001, 18 (8): 1435-1441. 10.1093\/oxfordjournals.molbev.a003929.","journal-title":"Mol Biol Evol"},{"key":"5460_CR9","doi-asserted-by":"publisher","first-page":"1473","DOI":"10.1093\/bioinformatics\/btn214","volume":"24","author":"JA Capra","year":"2008","unstructured":"Capra JA, Singh M: Characterization and prediction of residues determining protein functional specificity. Bioinformatics. 2008, 24: 1473-1480. 10.1093\/bioinformatics\/btn214.","journal-title":"Bioinformatics"},{"issue":"1","key":"5460_CR10","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1186\/1748-7188-5-29","volume":"5","author":"P Mazin","year":"2010","unstructured":"Mazin P, et al: An automated stochastic approach to the identification of the protein specificity determinants and functional subfamilies. Algorithms for Molecular Biology. 2010, 5 (1): 29-10.1186\/1748-7188-5-29.","journal-title":"Algorithms for Molecular Biology"},{"key":"5460_CR11","doi-asserted-by":"publisher","first-page":"2466","DOI":"10.1093\/bioinformatics\/btl411","volume":"22","author":"P Marttinen","year":"2006","unstructured":"Marttinen P, et al: Bayesian search of functionally divergent protein subgroups and their function specific residues. Bioinformatics. 2006, 22: 2466-2474. 10.1093\/bioinformatics\/btl411.","journal-title":"Bioinformatics"},{"issue":"2","key":"5460_CR12","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1006\/jmbi.1996.0167","volume":"257","author":"O Lichtarge","year":"1996","unstructured":"Lichtarge O, Bourne HR, Cohen FE: An Evolutionary Trace Method Defines Binding Surfaces Common to Protein Families. J Mol Biol. 1996, 257 (2): 342-358. 10.1006\/jmbi.1996.0167.","journal-title":"J Mol Biol"},{"issue":"5","key":"5460_CR13","doi-asserted-by":"publisher","first-page":"1265","DOI":"10.1016\/j.jmb.2003.12.078","volume":"336","author":"I Mihalek","year":"2004","unstructured":"Mihalek I, Res I, Lichtarge O: A Family of Evolution-Entropy Hybrid Methods for Ranking Protein Residues by Importance. J Mol Biol. 2004, 336 (5): 1265-1282. 10.1016\/j.jmb.2003.12.078.","journal-title":"J Mol Biol"},{"key":"5460_CR14","doi-asserted-by":"publisher","first-page":"164","DOI":"10.1093\/bioinformatics\/bti766","volume":"22","author":"J Pei","year":"2006","unstructured":"Pei J, et al: Prediction of functional specificity determinants from protein sequences using log-likelihood ratios. Bioinformatics. 2006, 22: 164-171. 10.1093\/bioinformatics\/bti766.","journal-title":"Bioinformatics"},{"issue":"7","key":"5460_CR15","doi-asserted-by":"publisher","first-page":"908","DOI":"10.1093\/bioinformatics\/btn057","volume":"24","author":"K Ye","year":"2008","unstructured":"Ye K, Vriend G, Ijzerman AP: Tracing evolutionary pressure. Bioinformatics. 2008, 24 (7): 908-915. 10.1093\/bioinformatics\/btn057.","journal-title":"Bioinformatics"},{"issue":"11","key":"5460_CR16","doi-asserted-by":"publisher","first-page":"e1000978","DOI":"10.1371\/journal.pcbi.1000978","volume":"6","author":"C Marino Buslje","year":"2010","unstructured":"Marino Buslje C, et al: Networks of High Mutual Information Define the Structural Proximity of Catalytic Sites: Implications for Catalytic Residue Identification. PLoS Comput Biol. 2010, 6 (11): e1000978-10.1371\/journal.pcbi.1000978.","journal-title":"PLoS Comput Biol"},{"issue":"16","key":"5460_CR17","doi-asserted-by":"publisher","first-page":"2049","DOI":"10.1093\/bioinformatics\/btl285","volume":"22","author":"DH Morgan","year":"2006","unstructured":"Morgan DH, et al: ET viewer: an application for predicting and visualizing functional sites in protein structures. Bioinformatics. 2006, 22 (16): 2049-2050. 10.1093\/bioinformatics\/btl285.","journal-title":"Bioinformatics"},{"key":"5460_CR18","doi-asserted-by":"publisher","first-page":"2445","DOI":"10.1093\/bioinformatics\/btn474","volume":"24","author":"S Sankararaman","year":"2008","unstructured":"Sankararaman S, Sjolander K: INTREPID - INformation-theoretic TREe traversal for Protein functional site IDentification. Bioinformatics. 2008, 24: 2445-2452. 10.1093\/bioinformatics\/btn474.","journal-title":"Bioinformatics"},{"issue":"12","key":"5460_CR19","doi-asserted-by":"publisher","first-page":"1440","DOI":"10.1093\/bioinformatics\/btl104","volume":"22","author":"F Pazos","year":"2006","unstructured":"Pazos F, Rausell A, Valencia A: Phylogeny-independent detection of functional residues. Bioinformatics. 2006, 22 (12): 1440-1448. 10.1093\/bioinformatics\/btl104.","journal-title":"Bioinformatics"},{"issue":"suppl 1","key":"5460_CR20","doi-asserted-by":"publisher","first-page":"D211","DOI":"10.1093\/nar\/gkp985","volume":"38","author":"RD Finn","year":"2010","unstructured":"Finn RD, et al: The Pfam protein families database. Nucleic Acids Res. 2010, 38 (suppl 1): D211-D222.","journal-title":"Nucleic Acids Res"},{"key":"5460_CR21","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1093\/bioinformatics\/btm537","volume":"24","author":"K Ye","year":"2008","unstructured":"Ye K, et al: Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting. Bioinformatics. 2008, 24: 18-25. 10.1093\/bioinformatics\/btm537.","journal-title":"Bioinformatics"},{"issue":"1","key":"5460_CR22","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1186\/1471-2105-10-207","volume":"10","author":"S Chakrabarti","year":"2009","unstructured":"Chakrabarti S, Panchenko A: Ensemble approach to predict specificity determinants: benchmarking and validation. BMC Bioinforma. 2009, 10 (1): 207-10.1186\/1471-2105-10-207.","journal-title":"BMC Bioinforma"},{"issue":"2","key":"5460_CR23","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1110\/ps.03191704","volume":"13","author":"OV Kalinina","year":"2004","unstructured":"Kalinina OV, et al: Automated selection of positions determining functional specificity of proteins by comparative analysis of orthologous groups in protein families. Protein Sci. 2004, 13 (2): 443-456. 10.1110\/ps.03191704.","journal-title":"Protein Sci"},{"issue":"17","key":"5460_CR24","doi-asserted-by":"publisher","first-page":"7787","DOI":"10.1073\/pnas.0914877107","volume":"107","author":"GJ Rodriguez","year":"2010","unstructured":"Rodriguez GJ, et al: Evolution-guided discovery and recoding of allosteric pathway specificity determinants in psychoactive bioamine receptors. Proc Natl Acad Sci. 2010, 107 (17): 7787-7792. 10.1073\/pnas.0914877107.","journal-title":"Proc Natl Acad Sci"},{"issue":"1","key":"5460_CR25","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","volume":"302","author":"C Notredame","year":"2000","unstructured":"Notredame C, Higgins DG, Heringa J: T-coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000, 302 (1): 205-217. 10.1006\/jmbi.2000.4042.","journal-title":"J Mol Biol"},{"issue":"4","key":"5460_CR26","doi-asserted-by":"publisher","first-page":"1289","DOI":"10.1016\/S0022-2836(02)01451-1","volume":"326","author":"A del Sol Mesa","year":"2003","unstructured":"del Sol Mesa A, Pazos F, Valencia A: Automatic Methods for Predicting Functionally Important Residues. J Mol Biol. 2003, 326 (4): 1289-1302. 10.1016\/S0022-2836(02)01451-1.","journal-title":"J Mol Biol"},{"issue":"1","key":"5460_CR27","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1214\/aoms\/1177729694","volume":"22","author":"S Kullback","year":"1951","unstructured":"Kullback S, Leibler R: On Information and Sufficiency. Ann. Math. Statist. 1951, 22 (1): 7-","journal-title":"Ann. Math. Statist"},{"issue":"6","key":"5460_CR28","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1007\/s00251-010-0441-4","volume":"62","author":"T Stranzl","year":"2010","unstructured":"Stranzl T, et al: NetCTLpan: pan-specific MHC class I pathway epitope predictions. Immunogenetics. 2010, 62 (6): 357-368. 10.1007\/s00251-010-0441-4.","journal-title":"Immunogenetics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-13-235.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T20:30:48Z","timestamp":1630528248000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-13-235"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9,14]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["5460"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-13-235","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,9,14]]},"assertion":[{"value":"8 May 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 September 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 September 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"235"}}