{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,8]],"date-time":"2025-11-08T12:05:51Z","timestamp":1762603551659},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Current projects for the massive characterization of proteomes are generating protein sequences and structures with unknown function. The difficulty of experimentally determining functionally important sites calls for the development of computational methods. The first techniques, based on the search for fully conserved positions in multiple sequence alignments (MSAs), were followed by methods for locating family-dependent conserved positions. These rely on the functional classification implicit in the alignment for locating these positions related with functional specificity. The next obvious step, still scarcely explored, is to detect these positions using a functional classification different from the one implicit in the sequence relationships between the proteins. Here, we present two new methods for locating functional positions which can incorporate an arbitrary external functional classification which may or may not coincide with the one implicit in the MSA. The Xdet method is able to use a functional classification with an associated hierarchy or similarity between functions to locate positions related to that classification. The MCdet method uses multivariate statistical analysis to locate positions responsible for each one of the functions within a multifunctional family.<\/jats:p>\n               <jats:p>Results: We applied the methods to different cases, illustrating scenarios where there is a disagreement between the functional and the phylogenetic relationships, and demonstrated their usefulness for the phylogeny-independent prediction of functional positions.<\/jats:p>\n               <jats:p>Availability: All computer programs and datasets used in this work are available from the authors for academic use.<\/jats:p>\n               <jats:p>Contact: \u00a0pazos@cnb.uam.es<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl104","type":"journal-article","created":{"date-parts":[[2006,3,22]],"date-time":"2006-03-22T02:33:29Z","timestamp":1142994809000},"page":"1440-1448","source":"Crossref","is-referenced-by-count":56,"title":["Phylogeny-independent detection of functional residues"],"prefix":"10.1093","volume":"22","author":[{"given":"Florencio","family":"Pazos","sequence":"first","affiliation":[{"name":"Protein Design Group, National Centre for Biotechnology (CNB-CSIC) \u00a0 C\/Darwin, 3. Campus U. Aut\u00f3noma, 28049 Cantoblanco, Madrid, Spain"}]},{"given":"Antonio","family":"Rausell","sequence":"additional","affiliation":[{"name":"Protein Design Group, National Centre for Biotechnology (CNB-CSIC) \u00a0 C\/Darwin, 3. Campus U. Aut\u00f3noma, 28049 Cantoblanco, Madrid, Spain"}]},{"given":"Alfonso","family":"Valencia","sequence":"additional","affiliation":[{"name":"Protein Design Group, National Centre for Biotechnology (CNB-CSIC) \u00a0 C\/Darwin, 3. Campus U. Aut\u00f3noma, 28049 Cantoblanco, Madrid, Spain"}]}],"member":"286","published-online":{"date-parts":[[2006,3,21]]},"reference":[{"key":"2023012408411237900_b1","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1006\/jmbi.2001.4870","article-title":"Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking","volume":"311","author":"Aloy","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b2","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1007\/s004220050357","article-title":"Classification of protein families and detection of the determinant residues with an improved self-organizing map","volume":"76","author":"Andrade","year":"1997","journal-title":"Biol. Cybern."},{"key":"2023012408411237900_b3","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1006\/jmbi.2000.4474","article-title":"ConSurf: an algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information","volume":"307","author":"Armon","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b4","doi-asserted-by":"crossref","first-page":"D138","DOI":"10.1093\/nar\/gkh121","article-title":"The Pfam protein families database","volume":"32","author":"Bateman","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408411237900_b5","doi-asserted-by":"crossref","first-page":"14764","DOI":"10.1073\/pnas.222508899","article-title":"Finding important sites in protein sequences","volume":"99","author":"Bickel","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408411237900_b6","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1038\/35093574","article-title":"A tour of structural genomics","volume":"2","author":"Brenner","year":"2001","journal-title":"Nat. Rev. Genet."},{"key":"2023012408411237900_b7","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1038\/nsb0295-171","article-title":"A method to predict functional residues in proteins","volume":"2","author":"Casari","year":"1995","journal-title":"Nat. Struct. Biol."},{"key":"2023012408411237900_b8","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1016\/S0014-5793(01)03307-5","article-title":"Can we infer peptide recognition specificity mediated by SH3 domains?","volume":"513","author":"Cesareni","year":"2002","journal-title":"FEBS Lett."},{"key":"2023012408411237900_b9","doi-asserted-by":"crossref","first-page":"672","DOI":"10.1002\/prot.20348","article-title":"Small-world network approach to identify key residues in protein\u2013protein interaction","volume":"58","author":"Del Sol","year":"2004","journal-title":"Proteins"},{"key":"2023012408411237900_b10","doi-asserted-by":"crossref","first-page":"1289","DOI":"10.1016\/S0022-2836(02)01451-1","article-title":"Automatic methods for predicting functionally important residues","volume":"326","author":"del Sol Mesa","year":"2003","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b11","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1006\/jsbi.2001.4391","article-title":"Enhanced functional annotation of protein sequences via the use of structural descriptors","volume":"134","author":"Di Gennaro","year":"2001","journal-title":"J. Struct. Biol."},{"key":"2023012408411237900_b12","doi-asserted-by":"crossref","first-page":"885","DOI":"10.1006\/jmbi.2001.5009","article-title":"Prediction of functionally important residues based solely on the computed energetics of protein structure","volume":"312","author":"Elcock","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b13","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1006\/jmbi.1997.1599","article-title":"Crystal structure of a catalytic-site mutant alpha-amylase from Bacillus subtilis complexed with maltopentaose","volume":"277","author":"Fujimoto","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b14","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1002\/prot.20769","article-title":"A method for localizing ligand binding pockets in protein structures","volume":"62","author":"Glaser","year":"2006","journal-title":"Proteins"},{"key":"2023012408411237900_b15","volume-title":"Theory and Application of Correspondence Analysis","author":"Greenacre","year":"1984"},{"key":"2023012408411237900_b16","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1006\/jmbi.2000.4036","article-title":"Analysis and prediction of functional sub-types from protein sequence alignments","volume":"303","author":"Hannenhalli","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b17","doi-asserted-by":"crossref","first-page":"D258","DOI":"10.1093\/nar\/gkh036","article-title":"The Gene Ontology (GO) database and informatics resource","volume":"32","author":"Harris","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408411237900_b18","doi-asserted-by":"crossref","first-page":"155","DOI":"10.2174\/1386207024607338","article-title":"Grouping of coefficients for the calculation of inter-molecular similarity and dissimilarity using 2D fragment bit-strings","volume":"5","author":"Holliday","year":"2002","journal-title":"Comb. Chem. High Throughput Screen"},{"key":"2023012408411237900_b19","first-page":"3600","article-title":"The FSSP database of structurally aligned protein fold families","volume":"22","author":"Holm","year":"1994","journal-title":"Nucl. Acids Res."},{"key":"2023012408411237900_b20","doi-asserted-by":"crossref","first-page":"3570","DOI":"10.1093\/bioinformatics\/bti561","article-title":"P-cats: prediction of catalytic residues in proteins from their tertiary structures","volume":"21","author":"Kinoshita","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408411237900_b21","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1002\/prot.20321","article-title":"Predicting protein functional sites with phylogenetic motifs","volume":"58","author":"La","year":"2005","journal-title":"Proteins"},{"key":"2023012408411237900_b22","doi-asserted-by":"crossref","first-page":"1487","DOI":"10.1006\/jmbi.2001.4540","article-title":"Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins","volume":"307","author":"Landgraf","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b23","first-page":"175","volume-title":"Multivariate Descriptive Statistical Analysis","author":"Lebart","year":"1984"},{"key":"2023012408411237900_b24","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1006\/jmbi.1996.0167","article-title":"An evolutionary trace method defines binding surfaces common to protein families","volume":"257","author":"Lichtarge","year":"1996","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b25","first-page":"645","article-title":"Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation","volume":"6","author":"Livingstone","year":"1993","journal-title":"Comput. Appl. Biosci."},{"key":"2023012408411237900_b26","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1016\/S0022-2836(02)00587-9","article-title":"Using orthologous and paralogous proteins to identify specificity-determining residues in bacterial transcription factors","volume":"321","author":"Mirny","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b27","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1093\/nar\/gkg046","article-title":"The InterPro Database, 2003 brings increased coverage and new features","volume":"31","author":"Mulder","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012408411237900_b28","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1016\/S0014-5793(03)00456-3","article-title":"Predicted protein\u2013protein interaction sites from local sequence information","volume":"544","author":"Ofran","year":"2003","journal-title":"FEBS Lett."},{"key":"2023012408411237900_b29","doi-asserted-by":"crossref","first-page":"14754","DOI":"10.1073\/pnas.0404569101","article-title":"Automated prediction of protein function and detection of functional sites from structure","volume":"101","author":"Pazos","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408411237900_b30","volume-title":"Analisis de Datos Multivariantes","author":"Pe\u00f1a","year":"2002"},{"key":"2023012408411237900_b31","doi-asserted-by":"crossref","first-page":"D129","DOI":"10.1093\/nar\/gkh028","article-title":"The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data","volume":"32","author":"Porter","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408411237900_b32","volume-title":"Numerical Recipes in C: The Art of Scientific Computing","author":"Press","year":"1992"},{"key":"2023012408411237900_b33","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1016\/S0968-0004(00)89080-5","article-title":"RASMOL: biomolecular graphics for all","volume":"20","author":"Sayle","year":"1995","journal-title":"Trends Biochem. Sci."},{"key":"2023012408411237900_b34","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1126\/science.1093857","article-title":"Environmental genome shotgun sequencing of the Sargasso Sea","volume":"304","author":"Venter","year":"2004","journal-title":"Science"},{"key":"2023012408411237900_b35","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1016\/j.jmb.2005.08.008","article-title":"In silico discovery of enzyme\u2013substrate specificity-determining residue clusters","volume":"352","author":"Yu","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012408411237900_b36","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/B978-1-4832-2734-4.50017-6","article-title":"Evolutionary divergence and convergence in proteins","volume-title":"Evolving Genes and Proteins","author":"Zuckerkandl","year":"1965"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/12\/1440\/48838998\/bioinformatics_22_12_1440.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/12\/1440\/48838998\/bioinformatics_22_12_1440.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,24]],"date-time":"2023-01-24T08:57:00Z","timestamp":1674550620000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/12\/1440\/206950"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,3,21]]},"references-count":36,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2006,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl104","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2006,6,15]]},"published":{"date-parts":[[2006,3,21]]}}}