{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:09:44Z","timestamp":1772165384469,"version":"3.50.1"},"reference-count":80,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2010,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>\n                      Structural variations caused by a wide range of physico-chemical and biological sources directly influence the function of a protein. For enzymatic proteins, the structure and chemistry of the catalytic binding site residues can be loosely defined as a\n                      <jats:italic>substructure<\/jats:italic>\n                      of the protein. Comparative analysis of drug-receptor substructures across and within species has been used for lead evaluation. Substructure-level similarity between the binding sites of functionally similar proteins has also been used to identify instances of convergent evolution among proteins. In functionally homologous protein families, shared chemistry and geometry at catalytic sites provide a common, local point of comparison among proteins that may differ significantly at the sequence, fold, or domain topology levels.\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>\n                      This paper describes two key results that can be used separately or in combination for protein function analysis. The Family-wise Analysis of SubStructural Templates (FASST) method uses all-against-all substructure comparison to determine Substructural Clusters (SCs). SCs characterize the binding site substructural variation within a protein family. In this paper we focus on examples of automatically determined SCs that can be linked to phylogenetic distance between family members, segregation by conformation, and organization by homology among convergent protein lineages. The Motif Ensemble Statistical Hypothesis (MESH) framework constructs a representative motif for each protein cluster among the SCs determined by FASST to build\n                      <jats:italic>motif ensembles<\/jats:italic>\n                      that are shown through a series of function prediction experiments to improve the function prediction power of existing motifs.\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>FASST contributes a critical feedback and assessment step to existing binding site substructure identification methods and can be used for the thorough investigation of structure-function relationships. The application of MESH allows for an automated, statistically rigorous procedure for incorporating structural variation data into protein function prediction pipelines. Our work provides an unbiased, automated assessment of the structural variability of identified binding site substructures among protein structure families and a technique for exploring the relation of substructural variation to protein function. As available proteomic data continues to expand, the techniques proposed will be indispensable for the large-scale analysis and interpretation of structural data.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/1471-2105-11-242","type":"journal-article","created":{"date-parts":[[2010,5,12]],"date-time":"2010-05-12T02:13:59Z","timestamp":1273630439000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction"],"prefix":"10.1186","volume":"11","author":[{"given":"Drew H","family":"Bryant","sequence":"first","affiliation":[]},{"given":"Mark","family":"Moll","sequence":"additional","affiliation":[]},{"given":"Brian Y","family":"Chen","sequence":"additional","affiliation":[]},{"given":"Viacheslav Y","family":"Fofanov","sequence":"additional","affiliation":[]},{"given":"Lydia E","family":"Kavraki","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2010,5,11]]},"reference":[{"issue":"4","key":"3699_CR1","doi-asserted-by":"publisher","first-page":"962","DOI":"10.1002\/prot.20099","volume":"55","author":"EC Meng","year":"2004","unstructured":"Meng EC, Polacco BJ, Babbitt PC: Superfamily active site templates. Proteins 2004, 55(4):962\u2013976. 10.1002\/prot.20099","journal-title":"Proteins"},{"issue":"8","key":"3699_CR2","doi-asserted-by":"publisher","first-page":"2545","DOI":"10.1021\/bi052101l","volume":"45","author":"SCH Pegg","year":"2006","unstructured":"Pegg SCH, Brown SD, Ojha S, Seffernick J, Meng EC, Morris JH, Chang PJ, Huang CC, Ferrin TE, Babbitt PC: Leveraging enzyme structure-function relationships for functional inference and experimental design: the structure-function linkage database. Biochemistry 2006, 45(8):2545\u20132555. 10.1021\/bi052101l","journal-title":"Biochemistry"},{"key":"3699_CR3","doi-asserted-by":"publisher","first-page":"38","DOI":"10.1038\/sj.bjp.0707307","volume":"152","author":"D Rognan","year":"2007","unstructured":"Rognan D: Chemogenomic approaches to rational drug design. British Journal of Pharmacology 2007, 152: 38\u201352. 10.1038\/sj.bjp.0707307","journal-title":"British Journal of Pharmacology"},{"key":"3699_CR4","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1038\/sj.bjp.0707308","volume":"152","author":"T Klabunde","year":"2007","unstructured":"Klabunde T: Chemogenomic approaches to drug discovery: similar receptors bind similar ligands. British Journal of Pharmacology 2007, 152: 5\u20137. 10.1038\/sj.bjp.0707308","journal-title":"British Journal of Pharmacology"},{"issue":"12","key":"3699_CR5","doi-asserted-by":"publisher","first-page":"1528","DOI":"10.1016\/j.str.2007.11.006","volume":"15","author":"W Hendrickson","year":"2007","unstructured":"Hendrickson W: Impact of structures from the Protein Structure Initiative. Structure 2007, 15(12):1528\u20131529. 10.1016\/j.str.2007.11.006","journal-title":"Structure"},{"key":"3699_CR6","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1016\/0076-6879(94)44004-2","volume":"244","author":"ND Rawlings","year":"1994","unstructured":"Rawlings ND, Barrett AJ: Families of serine proteases. Methods in Enzymology 1994, 244: 19\u201361. full_text","journal-title":"Methods in Enzymology"},{"issue":"6","key":"3699_CR7","doi-asserted-by":"publisher","first-page":"1001","DOI":"10.1002\/pro.5560050603","volume":"5","author":"AC Wallace","year":"1996","unstructured":"Wallace AC, Laskowski RA, Thornton JM: Derivation of 3D coordinate templates for searching structural databases: Application to Ser-His-Asp catalytic triads in the serine proteinases and lipases. Protein Science 1996, 5(6):1001\u20131013. 10.1002\/pro.5560050603","journal-title":"Protein Science"},{"issue":"5","key":"3699_CR8","doi-asserted-by":"publisher","first-page":"741","DOI":"10.1016\/S0022-2836(02)00649-6","volume":"321","author":"N Nagano","year":"2002","unstructured":"Nagano N, Orengo CA, Thornton JM: One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions. Journal of Molecular Biology 2002, 321(5):741\u2013765. 10.1016\/S0022-2836(02)00649-6","journal-title":"Journal of Molecular Biology"},{"issue":"6","key":"3699_CR9","doi-asserted-by":"publisher","first-page":"723","DOI":"10.1093\/bioinformatics\/btk038","volume":"22","author":"BJ Polacco","year":"2006","unstructured":"Polacco BJ, Babbitt PC: Automated discovery of 3D motifs for protein function annotation. Bioinformatics 2006, 22(6):723\u2013730. 10.1093\/bioinformatics\/btk038","journal-title":"Bioinformatics"},{"issue":"12","key":"3699_CR10","doi-asserted-by":"publisher","first-page":"3634","DOI":"10.1021\/ja068256d","volume":"129","author":"AL Bowman","year":"2007","unstructured":"Bowman AL, Lerner MG, Carlson HA: Protein flexibility and species specificity in structure-based drug discovery: dihydrofolate reductase as a test system. Journal of the American Chemical Society 2007, 129(12):3634\u20133640. 10.1021\/ja068256d","journal-title":"Journal of the American Chemical Society"},{"issue":"3","key":"3699_CR11","doi-asserted-by":"publisher","first-page":"550","DOI":"10.1021\/jm030912m","volume":"47","author":"A Weber","year":"2004","unstructured":"Weber A, Casini A, Heine A, Kuhn D, Supuran CT, Scozzafava A, Klebe G: Unexpected nanomolar inhibition of carbonic anhydrase by COX-2-selective celecoxib: new pharmacological opportunities due to related binding site recognition. Journal of Medicinal Chemistry 2004, 47(3):550\u2013557. 10.1021\/jm030912m","journal-title":"Journal of Medicinal Chemistry"},{"issue":"5","key":"3699_CR12","doi-asserted-by":"publisher","first-page":"e1000387","DOI":"10.1371\/journal.pcbi.1000387","volume":"5","author":"L Xie","year":"2009","unstructured":"Xie L, Li J, Xie L, Bourne PE: Drug Discovery Using Chemical Systems Biology: Identification of the Protein-Ligand Binding Network To Explain the Side Effects of CETP Inhibitors. PLoS Comput Biol 2009, 5(5):e1000387. 10.1371\/journal.pcbi.1000387","journal-title":"PLoS Comput Biol"},{"issue":"1-2","key":"3699_CR13","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1016\/j.mce.2005.11.043","volume":"248","author":"M Hult","year":"2006","unstructured":"Hult M, Shafqat N, Elleby B, Mitschke D, Svensson S, Forsgren M, Barf T, Vallgarda J, Abrahmsen L, Oppermann U: Active site variability of type 1 11beta-hydroxysteroid dehydrogenase revealed by selective inhibitors and cross-species comparisons. Molecular and Cellular Endocrinology 2006, 248(1\u20132):26\u201333. 10.1016\/j.mce.2005.11.043","journal-title":"Molecular and Cellular Endocrinology"},{"issue":"5","key":"3699_CR14","doi-asserted-by":"publisher","first-page":"1211","DOI":"10.1006\/jmbi.1998.1844","volume":"279","author":"RB Russell","year":"1998","unstructured":"Russell RB: Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution. Journal of Molecular Biology 1998, 279(5):1211\u20131227. 10.1006\/jmbi.1998.1844","journal-title":"Journal of Molecular Biology"},{"issue":"13","key":"3699_CR15","doi-asserted-by":"publisher","first-page":"1644","DOI":"10.1093\/bioinformatics\/btg226","volume":"19","author":"JA Barker","year":"2003","unstructured":"Barker JA, Thornton JM: An algorithm for constraint-based structural template matching: application to 3D templates with statistical analysis. Bioinformatics 2003, 19(13):1644\u20131649. 10.1093\/bioinformatics\/btg226","journal-title":"Bioinformatics"},{"issue":"5","key":"3699_CR16","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1016\/j.copbio.2006.07.004","volume":"17","author":"DJ Rigden","year":"2006","unstructured":"Rigden DJ: Understanding the cell in terms of structure and function: insights from structural genomics. Current Opinion in Biotechnology 2006, 17(5):457\u2013464. 10.1016\/j.copbio.2006.07.004","journal-title":"Current Opinion in Biotechnology"},{"issue":"3","key":"3699_CR17","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1016\/j.sbi.2006.04.003","volume":"16","author":"A Andreeva","year":"2006","unstructured":"Andreeva A, Murzin AG: Evolution of protein fold in the presence of functional constraints. Current Opinion in Structural Biology 2006, 16(3):399\u2013408. 10.1016\/j.sbi.2006.04.003","journal-title":"Current Opinion in Structural Biology"},{"issue":"3","key":"3699_CR18","doi-asserted-by":"publisher","first-page":"423","DOI":"10.1006\/jmbi.1997.1019","volume":"269","author":"RB Russell","year":"1997","unstructured":"Russell RB, Saqi MAS, Sayle RA, Bates PA, Sternberg MJE: Recognition of analogous and homologous protein folds: analysis of sequence and structure conservation. Journal of Molecular Biology 1997, 269(3):423\u2013439. 10.1006\/jmbi.1997.1019","journal-title":"Journal of Molecular Biology"},{"issue":"2-3","key":"3699_CR19","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1006\/jsbi.2001.4335","volume":"134","author":"NV Grishin","year":"2001","unstructured":"Grishin NV: Fold change in evolution of protein structures. Journal of Structural Biology 2001, 134(2\u20133):167\u2013185. 10.1006\/jsbi.2001.4335","journal-title":"Journal of Structural Biology"},{"issue":"14","key":"3699_CR20","doi-asserted-by":"publisher","first-page":"5441","DOI":"10.1073\/pnas.0704422105","volume":"105","author":"L Xie","year":"2008","unstructured":"Xie L, Bourne P: Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments. Proceedings of the National Academy of Sciences 2008, 105(14):5441. 10.1073\/pnas.0704422105","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"3699_CR21","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"H Berman","year":"2000","unstructured":"Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H, Shindyalov I, Bourne P: The Protein Data Bank. Nucleic Acids Research 2000, 28: 235\u2013242. 10.1093\/nar\/28.1.235","journal-title":"Nucleic Acids Research"},{"issue":"2","key":"3699_CR22","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1016\/S0022-2836(02)00811-2","volume":"323","author":"S Schmitt","year":"2002","unstructured":"Schmitt S, Kuhn D, Klebe G: A new method to detect related function among proteins independent of sequence and fold homology. Journal of Molecular Biology 2002, 323(2):387\u2013406. 10.1016\/S0022-2836(02)00811-2","journal-title":"Journal of Molecular Biology"},{"key":"3699_CR23","first-page":"W116","volume-title":"Nucleic Acids Research","author":"J Dundas","year":"2006","unstructured":"Dundas J, Ouyang Z, Tseng J, Binkowski A, Turpaz Y, Liang J: CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues. Nucleic Acids Research 2006, (34 Web Server):W116\u20138. 10.1093\/nar\/gkl282"},{"issue":"Suppl 2","key":"3699_CR24","doi-asserted-by":"publisher","first-page":"S2","DOI":"10.1186\/1471-2164-9-S2-S2","volume":"9","author":"I Halperin","year":"2008","unstructured":"Halperin I, Glazer DS, Wu S, Altman RB: The FEATURE framework for protein function annotation: modeling new functions, improving performance, and extending to novel applications. BMC Genomics 2008, 9(Suppl 2):S2. 10.1186\/1471-2164-9-S2-S2","journal-title":"BMC Genomics"},{"issue":"8","key":"3699_CR25","doi-asserted-by":"publisher","first-page":"e1000485","DOI":"10.1371\/journal.pcbi.1000485","volume":"5","author":"OC Redfern","year":"2009","unstructured":"Redfern OC, Dessailly BH, Dallman TJ, Sillitoe I, Orengo CA: FLORA: a novel method to predict protein function from structure in diverse superfamilies. PLoS Comput Biol 2009, 5(8):e1000485. 10.1371\/journal.pcbi.1000485","journal-title":"PLoS Comput Biol"},{"issue":"16","key":"3699_CR26","doi-asserted-by":"publisher","first-page":"i207","DOI":"10.1093\/bioinformatics\/btn268","volume":"24","author":"Y Bromberg","year":"2008","unstructured":"Bromberg Y, Rost B: Comprehensive in silico mutagenesis highlights functionally important residues in proteins. Bioinformatics 2008, 24(16):i207\u201312. 10.1093\/bioinformatics\/btn268","journal-title":"Bioinformatics"},{"issue":"2","key":"3699_CR27","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1006\/jmbi.1996.0167","volume":"257","author":"O Lichtarge","year":"1996","unstructured":"Lichtarge O, Bourne HR, Cohen FE: An evolutionary trace method defines binding surfaces common to protein families. Journal of Molecular Biology 1996, 257(2):342\u2013358. 10.1006\/jmbi.1996.0167","journal-title":"Journal of Molecular Biology"},{"key":"3699_CR28","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1093\/bioinformatics\/19.1.163","volume":"19","author":"F Glaser","year":"2003","unstructured":"Glaser F, Pupko T, Paz I, Bell RE, Bechor-Shental D, Martz E, Ben-Tal N: ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information. Bioinformatics 2003, 19: 163\u2013164. 10.1093\/bioinformatics\/19.1.163","journal-title":"Bioinformatics"},{"key":"3699_CR29","first-page":"D129","volume-title":"Nucleic Acids Research","author":"CT Porter","year":"2004","unstructured":"Porter CT, Bartlett GJ, Thornton JM: The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data. Nucleic Acids Research 2004, (32 Database):D129\u201333. 10.1093\/nar\/gkh028"},{"key":"3699_CR30","doi-asserted-by":"publisher","first-page":"200","DOI":"10.1093\/bioinformatics\/18.1.200","volume":"18","author":"AC Stuart","year":"2002","unstructured":"Stuart AC, Ilyin VA, Sali A: LigBase: a database of families of aligned ligand binding sites in known protein sequences and structures. Bioinformatics 2002, 18: 200\u2013201. 10.1093\/bioinformatics\/18.1.200","journal-title":"Bioinformatics"},{"key":"3699_CR31","first-page":"D667","volume-title":"Nucleic Acids Research","author":"BH Dessailly","year":"2008","unstructured":"Dessailly BH, Lensink MF, Orengo CA, Wodak SJ: LigASite-a database of biologically relevant binding sites in proteins with known apo-structures. Nucleic Acids Research 2008, (36 Database):D667\u201373."},{"issue":"4","key":"3699_CR32","doi-asserted-by":"publisher","first-page":"1887","DOI":"10.1006\/jmbi.1998.2393","volume":"285","author":"GJ Kleywegt","year":"1999","unstructured":"Kleywegt GJ: Recognition of spatial motifs in protein structures. Journal of Molecular Biology 1999, 285(4):1887\u20131897. 10.1006\/jmbi.1998.2393","journal-title":"Journal of Molecular Biology"},{"issue":"2","key":"3699_CR33","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1021\/ci0255984","volume":"43","author":"RV Spriggs","year":"2003","unstructured":"Spriggs RV, Artymiuk PJ, Willett P: Searching for patterns of amino acids in 3D protein structures. Journal of Chemical Information and Computer Sciences 2003, 43(2):412\u2013421.","journal-title":"Journal of Chemical Information and Computer Sciences"},{"issue":"13","key":"3699_CR34","doi-asserted-by":"publisher","first-page":"3341","DOI":"10.1093\/nar\/gkg506","volume":"31","author":"A Stark","year":"2003","unstructured":"Stark A, Russell RB: Annotation in three dimensions. PINTS: Patterns in Non-homologous Tertiary Structures. Nucleic Acids Research 2003, 31(13):3341\u20133344. 10.1093\/nar\/gkg506","journal-title":"Nucleic Acids Research"},{"issue":"3","key":"3699_CR35","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1016\/j.jmb.2004.04.012","volume":"339","author":"A Shulman-Peleg","year":"2004","unstructured":"Shulman-Peleg A, Nussinov R, Wolfson HJ: Recognition of functional sites in protein structures. Journal of Molecular Biology 2004, 339(3):607\u2013633. 10.1016\/j.jmb.2004.04.012","journal-title":"Journal of Molecular Biology"},{"issue":"4","key":"3699_CR36","doi-asserted-by":"publisher","first-page":"S5","DOI":"10.1186\/1471-2105-6-S4-S5","volume":"6","author":"G Ausiello","year":"2005","unstructured":"Ausiello G, Via A, Helmer-Citterich M: Query3d: a new method for high-throughput analysis of functional residues in protein structures. BMC Bioinformatics 2005, 6(4):S5. 10.1186\/1471-2105-6-S4-S5","journal-title":"BMC Bioinformatics"},{"key":"3699_CR37","doi-asserted-by":"publisher","first-page":"W89","DOI":"10.1093\/nar\/gki414","volume":"33","author":"R Laskowski","year":"2005","unstructured":"Laskowski R, Watson J, Thornton J: ProFunc: a server for predicting protein function from 3D structure. Nucleic Acids Research 2005, 33: W89. 10.1093\/nar\/gki414","journal-title":"Nucleic Acids Research"},{"issue":"3","key":"3699_CR38","doi-asserted-by":"publisher","first-page":"614","DOI":"10.1016\/j.jmb.2005.05.067","volume":"351","author":"RA Laskowski","year":"2005","unstructured":"Laskowski RA, Watson JD, Thornton JM: Protein function prediction using local 3D templates. Journal of Molecular Biology 2005, 351(3):614\u2013626. 10.1016\/j.jmb.2005.05.067","journal-title":"Journal of Molecular Biology"},{"key":"3699_CR39","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1016\/j.str.2004.10.015","volume":"13","author":"D Pal","year":"2005","unstructured":"Pal D, Eisenberg D: Inference of protein function from protein structure. Structure 2005, 13: 121\u2013130. 10.1016\/j.str.2004.10.015","journal-title":"Structure"},{"issue":"5","key":"3699_CR40","doi-asserted-by":"publisher","first-page":"1112","DOI":"10.1016\/j.jmb.2005.11.044","volume":"355","author":"ND Gold","year":"2006","unstructured":"Gold ND, Jackson RM: Fold independent structural comparisons of protein-ligand binding sites for exploring functional relationships. Journal of Molecular Biology 2006, 355(5):1112\u20131124. 10.1016\/j.jmb.2005.11.044","journal-title":"Journal of Molecular Biology"},{"key":"3699_CR41","doi-asserted-by":"publisher","first-page":"75","DOI":"10.2142\/biophysics.3.75","volume":"3","author":"AR Kinjo","year":"2007","unstructured":"Kinjo AR, Nakamura H: Similarity search for local protein structures at atomic resolution by exploiting a database management system. Biophysics 2007, 3: 75\u201384. 10.2142\/biophysics.3.75","journal-title":"Biophysics"},{"issue":"6","key":"3699_CR42","doi-asserted-by":"publisher","first-page":"791","DOI":"10.1089\/cmb.2007.R017","volume":"14","author":"BY Chen","year":"2007","unstructured":"Chen BY, Fofanov VY, Bryant DH, Dodson BD, Kristensen DM, Lisewski AM, Kimmel M, Lichtarge O, Kavraki LE: The MASH pipeline for protein function prediction and an algorithm for the geometric refinement of 3D motifs. Journal of Computational Biology 2007, 14(6):791\u2013816. 10.1089\/cmb.2007.R017","journal-title":"Journal of Computational Biology"},{"key":"3699_CR43","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1142\/9781848162648_0014","volume-title":"Proc. of the Seventh Annual Intl. Conf. on Computational Systems Bioinformatics","author":"M Moll","year":"2008","unstructured":"Moll M, Kavraki LE: Matching of structural motifs using hashing on residue labels and geometric filtering for protein function prediction. Proc. of the Seventh Annual Intl. Conf. on Computational Systems Bioinformatics 2008, 157\u2013168."},{"issue":"2","key":"3699_CR44","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1016\/j.jmb.2008.12.072","volume":"387","author":"YY Tseng","year":"2009","unstructured":"Tseng YY, Dundas J, Liang J: Predicting protein function and binding profile via matching of local evolutionary and geometric surface patterns. Journal of Molecular Biology 2009, 387(2):451\u2013464. 10.1016\/j.jmb.2008.12.072","journal-title":"Journal of Molecular Biology"},{"issue":"2","key":"3699_CR45","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1089\/cmb.2006.13.407","volume":"13","author":"M Shatsky","year":"2006","unstructured":"Shatsky M, Shulman-Peleg A, Nussinov R, Wolfson HJ: The multiple common point set problem and its application to molecule binding pattern detection. Journal of Computational Biology 2006, 13(2):407\u2013428. 10.1089\/cmb.2006.13.407","journal-title":"Journal of Computational Biology"},{"key":"3699_CR46","volume-title":"Proteins: Structure, Function, and Bioinformatics","author":"A Brakoulias","year":"2004","unstructured":"Brakoulias A, Jackson R: Towards a structural classification of phosphate binding sites in protein-nucleotide complexes: an automated all-against-all structural comparison using geometric matching. Proteins: Structure, Function, and Bioinformatics 2004., 56(2): 10.1002\/prot.20123"},{"issue":"2","key":"3699_CR47","doi-asserted-by":"publisher","first-page":"234","DOI":"10.1016\/j.str.2008.11.009","volume":"17","author":"AR Kinjo","year":"2009","unstructured":"Kinjo AR, Nakamura H: Comprehensive structural classification of ligand-binding motifs in proteins. Structure 2009, 17(2):234\u2013246. 10.1016\/j.str.2008.11.009","journal-title":"Structure"},{"issue":"2","key":"3699_CR48","doi-asserted-by":"publisher","first-page":"470","DOI":"10.1002\/prot.20752","volume":"62","author":"Z Zhang","year":"2006","unstructured":"Zhang Z, Grigorov MG: Similarity networks of protein binding sites. Proteins 2006, 62(2):470\u2013478. 10.1002\/prot.20752","journal-title":"Proteins"},{"issue":"5275","key":"3699_CR49","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1126\/science.273.5275.595","volume":"273","author":"L Holm","year":"1996","unstructured":"Holm L, Sander C: Mapping the Protein Universe. Science 1996, 273(5275):595\u2013603. 10.1126\/science.273.5275.595","journal-title":"Science"},{"issue":"11","key":"3699_CR50","doi-asserted-by":"publisher","first-page":"478","DOI":"10.1016\/S0968-0004(00)89105-7","volume":"20","author":"L Holm","year":"1995","unstructured":"Holm L, Sander C: Dali: a network tool for protein structure comparison. Trends in Biochemical Sciences 1995, 20(11):478\u2013480. 10.1016\/S0968-0004(00)89105-7","journal-title":"Trends in Biochemical Sciences"},{"key":"3699_CR51","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1186\/1471-2148-8-101","volume":"8","author":"NB Loughran","year":"2008","unstructured":"Loughran NB, O'Connor B, \u00d3F\u00e1g\u00e1in C, O'Connell MJ: The phylogeny of the mammalian heme peroxidases and the evolution of their diverse functions. BMC Evolutionary Biology 2008, 8: 101. 10.1186\/1471-2148-8-101","journal-title":"BMC Evolutionary Biology"},{"issue":"5","key":"3699_CR52","doi-asserted-by":"publisher","first-page":"567","DOI":"10.1016\/j.ygeno.2007.01.006","volume":"89","author":"F Passardi","year":"2007","unstructured":"Passardi F, Bakalovic N, Teixeira FK, Margis-Pinheiro M, Penel C, Dunand C: Prokaryotic origins of the non-animal peroxidase superfamily and organelle-mediated transmission to eukaryotes. Genomics 2007, 89(5):567\u2013579. 10.1016\/j.ygeno.2007.01.006","journal-title":"Genomics"},{"issue":"37","key":"3699_CR53","doi-asserted-by":"publisher","first-page":"21884","DOI":"10.1074\/jbc.270.37.21884","volume":"270","author":"K Fukuyama","year":"1995","unstructured":"Fukuyama K, Kunishima N, Amada F, Kubota T, Matsubara H: Crystal structures of cyanide-and triiodide-bound forms of Arthromyces ramosus peroxidase at different pH values. Journal of Biological Chemistry 1995, 270(37):21884\u201321892. 10.1074\/jbc.270.37.21884","journal-title":"Journal of Biological Chemistry"},{"issue":"8","key":"3699_CR54","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1016\/S0969-2126(97)00260-8","volume":"5","author":"CA Orengo","year":"1997","unstructured":"Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM: CATH-a hierarchic classification of protein domain structures. Structure 1997, 5(8):1093\u20131108. 10.1016\/S0969-2126(97)00260-8","journal-title":"Structure"},{"key":"3699_CR55","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1186\/1475-2859-6-5","volume":"6","author":"K Karhumaa","year":"2007","unstructured":"Karhumaa K, Sanchez RG, Hahn-H\u00e4gerdal B, Gorwa-Grauslund MF: Comparison of the xylose reductase-xylitol dehydrogenase and the xylose isomerase pathways for xylose fermentation by recombinant Saccharomyces cerevisiae. Microbial Cell Factories 2007, 6: 5. 10.1186\/1475-2859-6-5","journal-title":"Microbial Cell Factories"},{"key":"3699_CR56","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1007\/10_2007_057","volume":"108","author":"AJ Van Maris","year":"2007","unstructured":"Van Maris AJ, Winkler AA, Kuyper M, De Laat WT, Van Dijken JP, Pronk JT: Development of efficient xylose fermentation in Saccharomyces cerevisiae: xylose isomerase as a key component. Advances in Biochemical Engineering\/Biotechnology 2007, 108: 179\u2013204. full_text","journal-title":"Advances in Biochemical Engineering\/Biotechnology"},{"issue":"26","key":"3699_CR57","doi-asserted-by":"publisher","first-page":"8542","DOI":"10.1021\/bi00400a008","volume":"26","author":"HM Holden","year":"1987","unstructured":"Holden HM, Tronrud DE, Monzingo AF, Weaver LH, Matthews BW: Slow-and fast-binding inhibitors of thermolysin display diffierent modes of binding: crystallographic analysis of extended phosphonamidate transition-state analogs. Biochemistry 1987, 26(26):8542\u20138553. 10.1021\/bi00400a008","journal-title":"Biochemistry"},{"issue":"10","key":"3699_CR58","doi-asserted-by":"publisher","first-page":"1955","DOI":"10.1002\/pro.5560041001","volume":"4","author":"DR Holland","year":"1995","unstructured":"Holland DR, Hausrath AC, Juers D, Matthews BW: Structural analysis of zinc substitutions in the active site of thermolysin. Protein Science 1995, 4(10):1955\u20131965. 10.1002\/pro.5560041001","journal-title":"Protein Science"},{"issue":"6260","key":"3699_CR59","doi-asserted-by":"publisher","first-page":"694","DOI":"10.1038\/343694a0","volume":"343","author":"D Blow","year":"1990","unstructured":"Blow D: More of the catalytic triad. Nature 1990, 343(6260):694\u2013695. 10.1038\/343694a0","journal-title":"Nature"},{"issue":"6","key":"3699_CR60","doi-asserted-by":"publisher","first-page":"3452","DOI":"10.1074\/jbc.M510564200","volume":"281","author":"A Dementiev","year":"2006","unstructured":"Dementiev A, Dobo J, Gettins PGW: Active site distortion is sufficient for proteinase inhibition by serpins: structure of the covalent complex of \u03b11-proteinase inhibitor with porcine pancreatic elastase. Journal of Biological Chemistry 2006, 281(6):3452\u20133457. 10.1074\/jbc.M510564200","journal-title":"Journal of Biological Chemistry"},{"issue":"44","key":"3699_CR61","doi-asserted-by":"publisher","first-page":"43357","DOI":"10.1074\/jbc.M306944200","volume":"278","author":"A Schmidt","year":"2003","unstructured":"Schmidt A, Jelsch C, Ostergaard P, Rypniewski W, Lamzin VS: Trypsin revisited: crystallography at (sub) atomic resolution and quantum chemistry revealing details of catalysis. Journal of Biological Chemistry 2003, 278(44):43357\u201343362. 10.1074\/jbc.M306944200","journal-title":"Journal of Biological Chemistry"},{"key":"3699_CR62","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1142\/9781860948732_0035","volume-title":"Proc. of the Sixth Annual Intl. Conf. on Computational Systems Bioinformatics","author":"BY Chen","year":"2007","unstructured":"Chen BY, Bryant DH, Cruess AE, Bylund JH, Fofanov VY, Kristensen DM, Kimmel M, Lichtarge O, Kavraki LE: Composite motifs integrating multiple protein structures increase sensitivity for function prediction. Proc. of the Sixth Annual Intl. Conf. on Computational Systems Bioinformatics 2007, 343\u2013355. full_text"},{"issue":"4","key":"3699_CR63","first-page":"536","volume":"247","author":"AG Murzin","year":"1995","unstructured":"Murzin AG, Brenner SE, Hubbard T, Chothia C: SCOP: A structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology 1995, 247(4):536\u2013540.","journal-title":"Journal of Molecular Biology"},{"issue":"22","key":"3699_CR64","doi-asserted-by":"publisher","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","volume":"22","author":"JD Thompson","year":"1994","unstructured":"Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 1994, 22(22):4673\u20134680. 10.1093\/nar\/22.22.4673","journal-title":"Nucleic Acids Research"},{"issue":"9","key":"3699_CR65","doi-asserted-by":"publisher","first-page":"739","DOI":"10.1093\/protein\/11.9.739","volume":"11","author":"I Shindyalov","year":"1998","unstructured":"Shindyalov I, Bourne P: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Engineering Design and Selection 1998, 11(9):739\u2013747. 10.1093\/protein\/11.9.739","journal-title":"Protein Engineering Design and Selection"},{"issue":"5","key":"3699_CR66","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1038\/nbt0596-595","volume":"14","author":"C Mattos","year":"1996","unstructured":"Mattos C, Ringe D: Locating and characterizing binding sites on proteins. Nat Biotechnol 1996, 14(5):595\u2013599. 10.1038\/nbt0596-595","journal-title":"Nat Biotechnol"},{"issue":"5","key":"3699_CR67","doi-asserted-by":"publisher","first-page":"1471","DOI":"10.1016\/j.jmb.2006.01.039","volume":"357","author":"C Mattos","year":"2006","unstructured":"Mattos C, Bellamacina CR, Peisach E, Pereira A, Vitkup D, Petsko GA, Ringe D: Multiple solvent crystal structures: probing binding sites, plasticity and hydration. J Mol Biol 2006, 357(5):1471\u20131482. 10.1016\/j.jmb.2006.01.039","journal-title":"J Mol Biol"},{"issue":"4","key":"3699_CR68","doi-asserted-by":"publisher","first-page":"628","DOI":"10.1002\/(SICI)1097-0134(19991201)37:4<628::AID-PROT13>3.0.CO;2-G","volume":"37","author":"AC English","year":"1999","unstructured":"English AC, Done SH, Caves LS, Groom CR, Hubbard RE: Locating interaction sites on proteins: the crystal structure of thermolysin soaked in 2% to 100% isopropanol. Proteins 1999, 37(4):628\u2013640. 10.1002\/(SICI)1097-0134(19991201)37:4<628::AID-PROT13>3.0.CO;2-G","journal-title":"Proteins"},{"key":"3699_CR69","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1093\/protein\/14.1.47","volume":"14","author":"AC English","year":"2001","unstructured":"English AC, Groom CR, Hubbard RE: Experimental and computational mapping of the binding surface of a crystalline protein. Protein Eng 2001, 14: 47\u201359. 10.1093\/protein\/14.1.47","journal-title":"Protein Eng"},{"key":"3699_CR70","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-1904-8","volume-title":"Principal Components Analysis","author":"IT Jolliffe","year":"1986","unstructured":"Jolliffe IT: Principal Components Analysis. New York: Springer-Verlag; 1986."},{"key":"3699_CR71","doi-asserted-by":"publisher","first-page":"611","DOI":"10.1198\/016214502760047131","volume":"97","author":"C Fraley","year":"2002","unstructured":"Fraley C, Raftery AE: Model-based clustering, discriminant analysis and density estimation. Journal of the American Statistical Association 2002, 97: 611\u2013631. 10.1198\/016214502760047131","journal-title":"Journal of the American Statistical Association"},{"key":"3699_CR72","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/BIBMW.2008.4686202","volume-title":"IEEE International Conference on Bioinformatics and Biomedicine Workshop, 2008","author":"VY Fofanov","year":"2008","unstructured":"Fofanov VY, Chen BY, Bryant DH, Moll M, Lichtarge O, Kavraki LE, Kimmel M: A statistical model to correct systematic bias introduced by algorithmic thresholds in protein structural comparison algorithms. IEEE International Conference on Bioinformatics and Biomedicine Workshop, 2008 2008, 1\u20138. full_text"},{"issue":"26","key":"3699_CR73","doi-asserted-by":"publisher","first-page":"9885","DOI":"10.1073\/pnas.0603553103","volume":"103","author":"P Das","year":"2006","unstructured":"Das P, Moll M, Stamati H, Kavraki LE, Clementi C: Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction. Proceedings of the National Academy of Sciences 2006, 103(26):9885. 10.1073\/pnas.0603553103","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"4","key":"3699_CR74","doi-asserted-by":"publisher","first-page":"897","DOI":"10.1002\/prot.21337","volume":"67","author":"E Plaku","year":"2007","unstructured":"Plaku E, Stamati H, Clementi C, Kavraki LE: Fast and reliable analysis of molecular motion using proximity relations and dimensionality reduction. Proteins 2007, 67(4):897\u2013907. 10.1002\/prot.21337","journal-title":"Proteins"},{"key":"3699_CR75","first-page":"D281","volume-title":"Nucleic Acid Research","author":"R Finn","year":"2008","unstructured":"Finn R, Tate J, Mistry J, Coggill P, Sammut S, et al.: The Pfam protein family database. Nucleic Acid Research 2008, (36 Database):D281\u201388."},{"key":"3699_CR76","volume-title":"Proc. of the Fifth Annual Intl. Conf. on Computational Systems Bioinformatics","author":"X Wang","year":"2006","unstructured":"Wang X, Snoeyink J: Multiple structure alignment by optimal RMSD implies that the average structure is a consensus. In Proc. of the Fifth Annual Intl. Conf. on Computational Systems Bioinformatics. Imperial College Press; 2006."},{"issue":"3","key":"3699_CR77","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1111\/j.2517-6161.1991.tb01857.x","volume":"53","author":"SJ Sheather","year":"1991","unstructured":"Sheather SJ, Jones MC: A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society. Series B. Methodological 1991, 53(3):683\u2013690.","journal-title":"Journal of the Royal Statistical Society. Series B. Methodological"},{"issue":"4","key":"3699_CR78","doi-asserted-by":"publisher","first-page":"800","DOI":"10.1093\/biomet\/75.4.800","volume":"75","author":"Y Hochberg","year":"1988","unstructured":"Hochberg Y: A sharper Bonferroni procedure for multiple tests of significance. Biometrika 1988, 75(4):800\u2013802. 10.1093\/biomet\/75.4.800","journal-title":"Biometrika"},{"issue":"440","key":"3699_CR79","doi-asserted-by":"publisher","first-page":"1601","DOI":"10.1080\/01621459.1997.10473682","volume":"92","author":"SK Sarkar","year":"1997","unstructured":"Sarkar SK, Chang CK: The Simes method for multiple hypothesis testing with positively dependent test statistics. Journal of the American Statistical Association 1997, 92(440):1601\u20131608. 10.2307\/2965431","journal-title":"Journal of the American Statistical Association"},{"issue":"13","key":"3699_CR80","doi-asserted-by":"publisher","first-page":"1605","DOI":"10.1002\/jcc.20084","volume":"25","author":"EF Pettersen","year":"2004","unstructured":"Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE: UCSF Chimera-a visualization system for exploratory research and analysis. Journal of Computational Chemistry 2004, 25(13):1605\u20131612. 10.1002\/jcc.20084","journal-title":"Journal of Computational Chemistry"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-11-242.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T15:31:01Z","timestamp":1740065461000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-11-242"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,5,11]]},"references-count":80,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2010,12]]}},"alternative-id":["3699"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-11-242","relation":{"has-review":[{"id-type":"doi","id":"10.3410\/f.3674970.3396073","asserted-by":"object"}]},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,5,11]]},"assertion":[{"value":"13 September 2009","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 May 2010","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 May 2010","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"242"}}