{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,12]],"date-time":"2024-09-12T11:53:58Z","timestamp":1726142038347},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2007,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Matching functional sites is a key problem for the understanding of protein function and evolution. The commonly used graph theoretic approach, and other related approaches, require adjustment of a matching distance threshold <jats:italic>a priori<\/jats:italic> according to the noise in atomic positions. This is difficult to pre-determine when matching sites related by varying evolutionary distances and crystallographic precision. Furthermore, sometimes the graph method is unable to identify alternative but important solutions in the neighbourhood of the distance based solution because of strict distance constraints. We consider the Bayesian approach to improve graph based solutions. In principle this approach applies to other methods with strict distance matching constraints. The Bayesian method can flexibly incorporate all types of prior information on specific binding sites (e.g. amino acid types) in contrast to combinatorial formulations.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We present a new meta-algorithm for matching protein functional sites (active sites and ligand binding sites) based on an initial graph matching followed by refinement using a Markov chain Monte Carlo (MCMC) procedure. This procedure is an innovative extension to our recent work. The method accounts for the 3-dimensional structure of the site as well as the physico-chemical properties of the constituent amino acids. The MCMC procedure can lead to a significant increase in the number of significant matches compared to the graph method as measured independently by rigorously derived p-values.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>MCMC refinement step is able to significantly improve graph based matches. We apply the method to matching NAD(P)(H) binding sites within single Rossmann fold families, between different families in the same superfamily, and in different folds. Within families sites are often well conserved, but there are examples where significant shape based matches do not retain similar amino acid chemistry, indicating that even within families the same ligand may be bound using substantially different physico-chemistry. We also show that the procedure finds significant matches between binding sites for the same co-factor in different families and different folds.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-8-257","type":"journal-article","created":{"date-parts":[[2007,7,17]],"date-time":"2007-07-17T18:13:02Z","timestamp":1184695982000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Bayesian refinement of protein functional site matching"],"prefix":"10.1186","volume":"8","author":[{"given":"Kanti V","family":"Mardia","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vysaul B","family":"Nyirongo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter J","family":"Green","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nicola D","family":"Gold","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David R","family":"Westhead","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2007,7,17]]},"reference":[{"key":"1629_CR1","doi-asserted-by":"publisher","first-page":"739","DOI":"10.1093\/protein\/11.9.739","volume":"11","author":"IN Shindyalov","year":"1998","unstructured":"Shindyalov IN, Bourne PE: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 1998, 11: 739-47. 10.1093\/protein\/11.9.739.","journal-title":"Protein Eng"},{"key":"1629_CR2","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1006\/jmbi.1993.1489","volume":"233","author":"L Holm","year":"1993","unstructured":"Holm L, Sander C: Protein structure comparison by alignment of distance matrices. J Mol Biol. 1993, 233: 123-38. 10.1006\/jmbi.1993.1489.","journal-title":"J Mol Biol"},{"key":"1629_CR3","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1006\/jmbi.1994.1657","volume":"243","author":"PJ Artymiuk","year":"1994","unstructured":"Artymiuk PJ, Poirrette AR, Grindley HM, Rice DW, Willett P: A graph-theoretic approach to the identification of three-dimensional patterns of amino acid side-chains in protein structures. J Mol Biol. 1994, 243: 327-44. 10.1006\/jmbi.1994.1657.","journal-title":"J Mol Biol"},{"key":"1629_CR4","doi-asserted-by":"publisher","first-page":"505","DOI":"10.1016\/S0022-2836(03)00882-9","volume":"332","author":"TA Binkowski","year":"2003","unstructured":"Binkowski TA, Adamian L, Liang J: Inferring functional relationships of proteins from local sequence and spatial surface patterns. J Mol Biol. 2003, 332: 505-26. 10.1016\/S0022-2836(03)00882-9.","journal-title":"J Mol Biol"},{"key":"1629_CR5","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1093\/protein\/12.1.11","volume":"12","author":"K Kinoshita","year":"1999","unstructured":"Kinoshita K, Sadanami K, Kidera A, Go N: Structural motif of phosphate-binding site common to various protein superfamilies: all-against-all structural comparison of protein-mononucleotide complexes. Protein Eng. 1999, 12: 11-4. 10.1093\/protein\/12.1.11.","journal-title":"Protein Eng"},{"key":"1629_CR6","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1023\/A:1011318527094","volume":"2","author":"K Kinoshita","year":"2002","unstructured":"Kinoshita K, Furui J, Nakamura H: Identification of protein functions from a molecular surface database, eF-site. J Struct Funct Genomics. 2002, 2: 9-22. 10.1023\/A:1011318527094.","journal-title":"J Struct Funct Genomics"},{"key":"1629_CR7","doi-asserted-by":"publisher","first-page":"1887","DOI":"10.1006\/jmbi.1998.2393","volume":"285","author":"GJ Kleywegt","year":"1999","unstructured":"Kleywegt GJ: Recognition of spatial motifs in protein structures. J Mol Biol. 1999, 285: 1887-97. 10.1006\/jmbi.1998.2393.","journal-title":"J Mol Biol"},{"key":"1629_CR8","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1016\/j.jmb.2004.04.012","volume":"339","author":"A Shulman-Peleg","year":"2004","unstructured":"Shulman-Peleg A, Nussinov R, Wolfson HJ: Recognition of functional sites in protein structures. J Mol Biol. 2004, 339: 607-33. 10.1016\/j.jmb.2004.04.012.","journal-title":"J Mol Biol"},{"issue":"5","key":"1629_CR9","doi-asserted-by":"publisher","first-page":"1307","DOI":"10.1016\/S0022-2836(03)00045-7","volume":"326","author":"A Stark","year":"2003","unstructured":"Stark A, Sunyaev S, Russell RB: A model for Statistical Significance of Local Similarities in Structure. J Mol Biol. 2003, 326 (5): 1307-1316. 10.1016\/S0022-2836(03)00045-7.","journal-title":"J Mol Biol"},{"key":"1629_CR10","doi-asserted-by":"publisher","first-page":"2308","DOI":"10.1002\/pro.5560061104","volume":"6","author":"AC Wallace","year":"1997","unstructured":"Wallace AC, Borkakoti N, Thornton JM: TESS: a geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Protein Sci. 1997, 6: 2308-23.","journal-title":"Protein Sci"},{"key":"1629_CR11","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1038\/221337a0","volume":"221","author":"DM Blow","year":"1969","unstructured":"Blow DM, Birktoft JJ, Hartley BS: Role of a buried acid group in the mechanism of action of chymotrypsin. Nature. 1969, 221: 337-40. 10.1038\/221337a0.","journal-title":"Nature"},{"key":"1629_CR12","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1038\/221235a0","volume":"221","author":"CS Wright","year":"1969","unstructured":"Wright CS, Alden RA, Kraut J: Structure of subtilisin BPN' at 2.5 angstrom resolution. Nature. 1969, 221: 235-42. 10.1038\/221235a0.","journal-title":"Nature"},{"key":"1629_CR13","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1038\/372631a0","volume":"372","author":"CA Orengo","year":"1994","unstructured":"Orengo CA, Jones DT, Thornton JM: Protein superfamilies and domain superfolds. Nature. 1994, 372: 631-4. 10.1038\/372631a0.","journal-title":"Nature"},{"issue":"2","key":"1629_CR14","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1016\/S0022-2836(02)00811-2","volume":"323","author":"S Schmitt","year":"2002","unstructured":"Schmitt S, Kuhn D, Klebe G: A New Method to Detect Related Function Among Proteins Independent of Sequence and Fold Homology. J Mol Biol. 2002, 323 (2): 387-406. 10.1016\/S0022-2836(02)00811-2.","journal-title":"J Mol Biol"},{"issue":"5","key":"1629_CR15","doi-asserted-by":"publisher","first-page":"1112","DOI":"10.1016\/j.jmb.2005.11.044","volume":"355","author":"ND Gold","year":"2006","unstructured":"Gold ND, Jackson RM: Fold Independent Structural Comparisons of Protein-Ligand Binding Sites for Exploring Functional Relationships. J Mol Biol. 2006, 355 (5): 1112-1124. 10.1016\/j.jmb.2005.11.044.","journal-title":"J Mol Biol"},{"issue":"2","key":"1629_CR16","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/biomet\/93.2.235","volume":"93","author":"PJ Green","year":"2006","unstructured":"Green PJ, Mardia KV: Bayesian alignment using hierarchical models, with applications in protein bioinformatics. Biometrika. 2006, 93 (2): 235-254. 10.1093\/biomet\/93.2.235.","journal-title":"Biometrika"},{"key":"1629_CR17","volume-title":"Computational approaches to similarity searching in a functional site database for protein function prediction","author":"ND Gold","year":"2003","unstructured":"Gold ND: Computational approaches to similarity searching in a functional site database for protein function prediction. 2003, Ph.D thesis, Leeds University, School of Biochemistry and Microbiology"},{"issue":"5","key":"1629_CR18","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1016\/S0959-440X(96)80058-3","volume":"6","author":"JF Gibrat","year":"1996","unstructured":"Gibrat JF, Madej T, Bryant SH: Surprising similarities in structure comparison. Curr Opin Struct Biol. 1996, 6 (5): 377-385. 10.1016\/S0959-440X(96)80058-3.","journal-title":"Curr Opin Struct Biol"},{"issue":"1","key":"1629_CR19","doi-asserted-by":"publisher","first-page":"2256","DOI":"10.1107\/S0907444904026460","volume":"60","author":"E Krissinel","year":"2004","unstructured":"Krissinel E, Henrick K: Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallographica Section D. 2004, 60 (1): 2256-2268.","journal-title":"Acta Crystallographica Section D"},{"key":"1629_CR20","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"HM Berman","year":"2000","unstructured":"Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne NE: The Protein Data Bank. Nucleic Acids Research. 2000, 28: 235-242. 10.1093\/nar\/28.1.235.","journal-title":"Nucleic Acids Research"},{"key":"1629_CR21","doi-asserted-by":"publisher","first-page":"665","DOI":"10.1093\/biomet\/59.3.665","volume":"59","author":"TD Downs","year":"1972","unstructured":"Downs TD: Orientation statistics. Biometrika. 1972, 59: 665-676. 10.1093\/biomet\/59.3.665.","journal-title":"Biometrika"},{"key":"1629_CR22","volume-title":"Directional Statistics","author":"KV Mardia","year":"2000","unstructured":"Mardia KV, Jupp PE: Directional Statistics. 2000, Chichester: John Wiley and Sons Ltd"},{"key":"1629_CR23","volume-title":"Three-Dimensional Chemical Structure Handling","author":"P Willett","year":"1991","unstructured":"Willett P: Three-Dimensional Chemical Structure Handling. 1991, New York: John Wiley and Sons Inc"},{"issue":"5","key":"1629_CR24","doi-asserted-by":"publisher","first-page":"350","DOI":"10.1002\/jcc.540060504","volume":"6","author":"AK Ghose","year":"1985","unstructured":"Ghose AK, Crippen GM: Geometrically feasible binding modes of a flexible ligand molecule at the receptor site. Journal of Computational Chemistry. 1985, 6 (5): 350-359. 10.1002\/jcc.540060504.","journal-title":"Journal of Computational Chemistry"},{"issue":"1","key":"1629_CR25","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1002\/jcc.540050105","volume":"5","author":"FS Kuhl","year":"1984","unstructured":"Kuhl FS, Crippen GM, Friesen DK: A Combinatorial Algorithm for Calculating Ligand Binding. Journal of Computational Chemistry. 1984, 5 (1): 24-34. 10.1002\/jcc.540050105.","journal-title":"Journal of Computational Chemistry"},{"key":"1629_CR26","volume-title":"Logical and Combinatorial Algorithms for Drug Design","author":"V Golender","year":"1983","unstructured":"Golender V, Rozenblit A: Logical and Combinatorial Algorithms for Drug Design. 1983, Letchworth: Research Studies Press"},{"key":"1629_CR27","doi-asserted-by":"publisher","first-page":"572","DOI":"10.1109\/PROC.1981.12026","volume":"69","author":"HG Barrow","year":"1981","unstructured":"Barrow HG, Tenenbaum JM: Computational vision. Proceedings of the IEEE. 1981, 69: 572-595.","journal-title":"Proceedings of the IEEE"},{"key":"1629_CR28","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1016\/0020-0190(76)90049-1","volume":"4","author":"HG Barrow","year":"1976","unstructured":"Barrow HG, Burstall RM: Subgraph isomorphism, matching relational structures and maximal cliques. Information Processing Letters. 1976, 4: 83-84. 10.1016\/0020-0190(76)90049-1.","journal-title":"Information Processing Letters"},{"key":"1629_CR29","first-page":"9","volume-title":"Operations Research Letters","author":"R Carraghan","year":"1990","unstructured":"Carraghan R, Pardalos PM: Exact algorithm for the maximum clique problem. Operations Research Letters. 1990, 9-375."},{"key":"1629_CR30","doi-asserted-by":"publisher","first-page":"827","DOI":"10.1107\/S0567739478001680","volume":"A34","author":"W Kabsch","year":"1978","unstructured":"Kabsch W: A discussion of the solution for the best rotation to relate two sets of vectors. Acta Cryst A. 1978, A34: 827-828. 10.1107\/S0567739478001680.","journal-title":"Acta Cryst A"},{"issue":"3","key":"1629_CR31","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1016\/j.jmb.2005.01.044","volume":"347","author":"JW Torrance","year":"2005","unstructured":"Torrance JW, Bartlett GJ, Porter CT, Thornton JM: Using a Library of Structural Templates to Recognise Catalytic Sites and Explore their Evolution in Homologous Families. J Mol Biol. 2005, 347 (3): 565-581. 10.1016\/j.jmb.2005.01.044.","journal-title":"J Mol Biol"},{"key":"1629_CR32","volume-title":"Statistical Shape Analysis","author":"IL Dryden","year":"1998","unstructured":"Dryden IL, Mardia KV: Statistical Shape Analysis. 1998, Chichester: John Wiley"},{"issue":"1","key":"1629_CR33","doi-asserted-by":"publisher","first-page":"D226","DOI":"10.1093\/nar\/gkh039","volume":"32","author":"A Andreeva","year":"2004","unstructured":"Andreeva A, Howorth D, Brenner SE, Hubbard TJP, Chothia C, Murzin AG: SCOP database in 2004: refinements integrate structure and sequence family data. Nucl Acid Res. 2004, 32 (1): D226-D229. 10.1093\/nar\/gkh039.","journal-title":"Nucl Acid Res"},{"key":"1629_CR34","volume-title":"lpsolve \u2013 Simplex-based code for linear and integer programming","author":"M Berkelaar","year":"1996","unstructured":"Berkelaar M: lpsolve \u2013 Simplex-based code for linear and integer programming. 1996, [http:\/\/www.cs.sunysb.edu\/~algorith\/implement\/lpsolve\/implement.shtml]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-8-257.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T01:55:09Z","timestamp":1630461309000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-8-257"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,7,17]]},"references-count":34,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2007,12]]}},"alternative-id":["1629"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-8-257","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,7,17]]},"assertion":[{"value":"24 November 2006","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 July 2007","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 July 2007","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"257"}}