{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,31]],"date-time":"2025-12-31T11:19:49Z","timestamp":1767179989017,"version":"build-2238731810"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009901","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,3,8]],"date-time":"2022-03-08T00:00:00Z","timestamp":1646697600000}}],"reference-count":41,"publisher":"Public Library of Science (PLoS)","issue":"2","license":[{"start":{"date-parts":[[2022,2,24]],"date-time":"2022-02-24T00:00:00Z","timestamp":1645660800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Studying similarities in protein molecules has become a fundamental activity in much of biology and biomedical research, for which methods such as multiple sequence alignments are widely used. Most methods available for such comparisons cater to studying proteins which have clearly recognizable evolutionary relationships but not to proteins that recognize the same or similar ligands but do not share similarities in their sequence or structural folds. In many cases, proteins in the latter class share structural similarities only in their binding sites. While several algorithms are available for comparing binding sites, there are none for deriving structural motifs of the binding sites, independent of the whole proteins. We report the development of SiteMotif, a new algorithm that compares binding sites from multiple proteins and derives sequence-order independent structural site motifs. We have tested the algorithm at multiple levels of complexity and demonstrate its performance in different scenarios. We have benchmarked against 3 current methods available for binding site comparison and demonstrate superior performance of our algorithm. We show that SiteMotif identifies new structural motifs of spatially conserved residues in proteins, even when there is no sequence or fold-level similarity. We expect SiteMotif to be useful for deriving key mechanistic insights into the mode of ligand interaction, predict the ligand type that a protein can bind and improve the sensitivity of functional annotation.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1009901","type":"journal-article","created":{"date-parts":[[2022,2,24]],"date-time":"2022-02-24T13:50:31Z","timestamp":1645710631000},"page":"e1009901","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":10,"title":["SiteMotif: A graph-based algorithm for deriving structural motifs in Protein Ligand binding sites"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2755-5052","authenticated-orcid":true,"given":"Santhosh","family":"Sankar","sequence":"first","affiliation":[]},{"given":"Nagasuma","family":"Chandra","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,2,24]]},"reference":[{"issue":"12","key":"pcbi.1009901.ref001","doi-asserted-by":"crossref","first-page":"4412","DOI":"10.1073\/pnas.86.12.4412","article-title":"A tool for multiple sequence alignment","volume":"86","author":"D Lipman","year":"1989","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1009901.ref002","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/0022-2836(89)90084-3","article-title":"Protein structure alignment","volume":"208","author":"WR Taylor","year":"1989","journal-title":"Journal of molecular biology"},{"issue":"4","key":"pcbi.1009901.ref003","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1002\/prot.24450","article-title":"Protein function annotation by local binding site surface similarity","volume":"82","author":"R Spitzer","year":"2014","journal-title":"Proteins"},{"issue":"3","key":"pcbi.1009901.ref004","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"SF Altschul","year":"1990","journal-title":"Journal of molecular biology"},{"issue":"2\u20133","key":"pcbi.1009901.ref005","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1006\/jsbi.2001.4393","article-title":"On the evolution of protein folds: Are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world?","volume":"134","author":"AN Lupas","year":"2001","journal-title":"Journal of structural biology"},{"issue":"9","key":"pcbi.1009901.ref006","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1093\/bioinformatics\/btq100","article-title":"ProBiS algorithm for detection of structurally similar protein binding sites by local structural alignment","volume":"26","author":"J Konc","year":"2010","journal-title":"Bioinformatics"},{"issue":"5","key":"pcbi.1009901.ref007","doi-asserted-by":"crossref","first-page":"597","DOI":"10.1093\/bioinformatics\/btt024","article-title":"APoc: Large-scale identification of similar protein pockets","volume":"29","author":"M Gao","year":"2013","journal-title":"Bioinformatics"},{"issue":"4","key":"pcbi.1009901.ref008","doi-asserted-by":"crossref","first-page":"865","DOI":"10.1002\/pro.2890","article-title":"G-LoSA: An efficient computational tool for local structure-centric biological studies and drug design","volume":"25","author":"HS Lee","year":"2016","journal-title":"Protein Science"},{"issue":"7","key":"pcbi.1009901.ref009","doi-asserted-by":"crossref","first-page":"1725","DOI":"10.1021\/ci200132z","article-title":"PocketAlign A Novel Algorithm for Aligning Binding Sites in Protein Structures","volume":"51","author":"K Yeturu","year":"2011","journal-title":"Journal of Chemical Information and Modeling"},{"key":"pcbi.1009901.ref010","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1093\/nar\/gkp281","article-title":"SITEHOUND-web: a server for ligand binding site identification in protein structures","volume":"37","author":"M Hernandez","year":"2009","journal-title":"Nucleic Acids Research"},{"issue":"1","key":"pcbi.1009901.ref011","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/j.jsb.2007.09.005","article-title":"PocketDepth: A new depth based algorithm for identification of ligand binding sites in proteins","volume":"161","author":"Y Kalidas","year":"2008","journal-title":"Journal of Structural Biology"},{"issue":"1","key":"pcbi.1009901.ref012","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1186\/1471-2105-10-168","article-title":"Fpocket: An open source platform for ligand pocket detection.","volume":"10","author":"V Le Guilloux","year":"2009","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"pcbi.1009901.ref013","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1016\/j.str.2018.02.001","article-title":"An Augmented Pocketome: Detection and Analysis of Small-Molecule Binding Pockets in Proteins of Known 3D Structure.","volume":"26","author":"R Bhagavat","year":"2018","journal-title":"Structure"},{"issue":"6","key":"pcbi.1009901.ref014","doi-asserted-by":"crossref","first-page":"922","DOI":"10.1107\/S0567739476001873","article-title":"A solution for the best rotation to relate two sets of vectors","author":"W. Kabsch","year":"1976","journal-title":"Acta Crystallographica Section A"},{"key":"pcbi.1009901.ref015","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1016\/S0065-3233(00)54003-0","article-title":"Amino acid substitution matrices","volume":"54","author":"S Henikoff","year":"2000","journal-title":"Advances in protein chemistry"},{"issue":"1","key":"pcbi.1009901.ref016","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1093\/nar\/25.1.236","article-title":"SCOP: a Structural Classification of Proteins database","volume":"25","author":"TJP Hubbard","year":"1997","journal-title":"Nucleic Acids Research"},{"issue":"22","key":"pcbi.1009901.ref017","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"S Henikoff","year":"1992","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"11","key":"pcbi.1009901.ref018","doi-asserted-by":"crossref","first-page":"2498","DOI":"10.1101\/gr.1239303","article-title":"Cytoscape: a software environment for integrated models of biomolecular interaction networks","volume":"13","author":"P Shannon","year":"2003","journal-title":"Genome Research"},{"issue":"2","key":"pcbi.1009901.ref019","first-page":"1","article-title":"An automated method for finding molecular complexes in large protein interaction networks","volume":"4","author":"GD Bader","year":"2003","journal-title":"BMC Bioinformatics"},{"issue":"5","key":"pcbi.1009901.ref020","doi-asserted-by":"crossref","first-page":"655","DOI":"10.1016\/j.jpdc.2007.09.005","article-title":"MPI for Python: Performance improvements and MPI-2 extensions","volume":"68","author":"L Dalc\u00edn","year":"2008","journal-title":"Journal of Parallel and Distributed Computing"},{"issue":"6","key":"pcbi.1009901.ref021","doi-asserted-by":"crossref","first-page":"899","DOI":"10.1107\/S0907444902003451","article-title":"The protein data bank","volume":"58","author":"HM Berman","year":"2002","journal-title":"Acta crystallographica. Section D, Biological crystallography"},{"issue":"3","key":"pcbi.1009901.ref022","doi-asserted-by":"crossref","first-page":"625","DOI":"10.1110\/ps.37802","article-title":"Crystal structure of human L-isoaspartyl-O-methyl-transferase with S-adenosyl homocysteine at 1.6-\u00c5 resolution and modeling of an isoaspartyl-containing peptide at the active site","volume":"11","author":"CD Smith","year":"2002","journal-title":"Protein Science"},{"issue":"34","key":"pcbi.1009901.ref023","doi-asserted-by":"crossref","first-page":"35622","DOI":"10.1074\/jbc.M403152200","article-title":"The structure of the pantothenate kinase\u00b7ADP\u00b7pantothenate ternary complex reveals the relationship between the binding sites for substrate, allosteric regulator, and antimetabolites","volume":"279","author":"RA Ivey","year":"2004","journal-title":"The Journal of biological chemistry"},{"issue":"8","key":"pcbi.1009901.ref024","doi-asserted-by":"crossref","first-page":"945","DOI":"10.1002\/j.1460-2075.1982.tb01276.x","article-title":"Distantly related sequences in the alpha- and beta-subunits of ATP synthase, myosin, kinases and other ATP-requiring enzymes and a common nucleotide binding fold","volume":"1","author":"JE Walker","year":"1982","journal-title":"The EMBO journal"},{"issue":"4","key":"pcbi.1009901.ref025","first-page":"1","article-title":"PLIC: Protein-ligand interaction clusters.","volume":"2014","author":"P Anand","year":"2014","journal-title":"Database"},{"issue":"29","key":"pcbi.1009901.ref026","doi-asserted-by":"crossref","first-page":"26258","DOI":"10.1074\/jbc.M111.256305","article-title":"An interaction between the walker A and D-loop motifs is critical to ATP hydrolysis and cooperativity in bacteriophage T4 Rad50","volume":"286","author":"MB De La Rosa","year":"2011","journal-title":"The Journal of biological chemistry"},{"issue":"1","key":"pcbi.1009901.ref027","first-page":"1","article-title":"Local comparison of protein structures highlights cases of convergent evolution in analogous functional sites","volume":"8","author":"G Ausiello","year":"2007","journal-title":"BMC Bioinformatics"},{"issue":"4","key":"pcbi.1009901.ref028","first-page":"1","article-title":"Query3d: A new method for high-throughput analysis of functional residues in protien structures","volume":"6","author":"G Ausiello","year":"2005","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"pcbi.1009901.ref029","first-page":"1","article-title":"Structural analysis of heme proteins: Implication for design and prediction","volume":"11","author":"T Li","year":"2011","journal-title":"BMC structural biology"},{"issue":"9","key":"pcbi.1009901.ref030","doi-asserted-by":"crossref","first-page":"1699","DOI":"10.1002\/prot.25328","article-title":"Deciphering common recognition principles of nucleoside mono\/di and tri-phosphates binding in diverse proteins via structural matching of their binding sites","volume":"85","author":"R Bhagavat","year":"2017","journal-title":"Proteins"},{"issue":"2","key":"pcbi.1009901.ref031","doi-asserted-by":"crossref","first-page":"751","DOI":"10.1042\/BJ20040697","article-title":"Catalytic and structural contributions for glutathione-binding residues in a Delta class glutathione S-transferase","volume":"382","author":"P Winayanuwattikun","year":"2004","journal-title":"The Biochemical journal"},{"issue":"8","key":"pcbi.1009901.ref032","first-page":"1","article-title":"The still mysterious roles of cysteine-containing glutathione transferases in plants","volume":"5","author":"PA Lallement","year":"2014","journal-title":"Frontiers in pharmacology"},{"issue":"1","key":"pcbi.1009901.ref033","doi-asserted-by":"crossref","first-page":"1","DOI":"10.3390\/genes11010025","article-title":"Effects of Substrate-Binding Site Residues on the Biochemical Properties of a Tau Class Glutathione S-Transferase from Oryza sativa.","volume":"11","author":"X Yang","year":"2019","journal-title":"Genes (Basel)"},{"issue":"2\u20133","key":"pcbi.1009901.ref034","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1016\/S0014-5793(99)00147-7","article-title":"Functional analysis of the evolutionarily conserved proline 53 residue in Proteus mirabilis glutathione transferase B1-1","volume":"445","author":"N Allocati","year":"1999","journal-title":"FEBS Letters"},{"issue":"1","key":"pcbi.1009901.ref035","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1002\/prot.340140110","article-title":"A structural model for human dihydrolipoamide dehydrogenase","volume":"14","author":"JE Jentoft","year":"1992","journal-title":"Proteins"},{"issue":"8","key":"pcbi.1009901.ref036","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1016\/S0969-2126(97)00260-8","article-title":"CATH\u2014A hierarchic classification of protein domain structures","volume":"5","author":"CA Orengo","year":"1997","journal-title":"Structure"},{"issue":"7","key":"pcbi.1009901.ref037","first-page":"1","article-title":"S-glutathionylation proteome profiling reveals a crucial role of a thioredoxin-like protein in interspecies competition and cariogenecity of Streptococcus mutans.","volume":"16","author":"Z Li","year":"2020","journal-title":"PLoS pathogens."},{"issue":"1","key":"pcbi.1009901.ref038","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1186\/s12859-019-2988-x","article-title":"FunFam protein families improve residue level molecular function prediction","volume":"20","author":"L Scheibenreif","year":"2019","journal-title":"BMC Bioinformatics"},{"issue":"22","key":"pcbi.1009901.ref039","doi-asserted-by":"crossref","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","article-title":"CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice","volume":"22","author":"JD Thompson","year":"1994","journal-title":"Nucleic Acids Research"},{"issue":"113","key":"pcbi.1009901.ref040","first-page":"1","article-title":"MUSCLE: A multiple sequence alignment method with reduced time and space complexity","volume":"5","author":"RC Edgar","year":"2004","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"pcbi.1009901.ref041","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-coffee: A novel method for fast and accurate multiple sequence alignment","volume":"302","author":"C Notredame","year":"2000","journal-title":"Journal of Molecular Biology"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009901","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,3,8]],"date-time":"2022-03-08T00:00:00Z","timestamp":1646697600000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009901","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,8]],"date-time":"2022-03-08T13:52:56Z","timestamp":1646747576000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009901"}},"subtitle":[],"editor":[{"given":"Bert L.","family":"de Groot","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,2,24]]},"references-count":41,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2022,2,24]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009901","relation":{},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,24]]}}}