{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,6]],"date-time":"2026-04-06T07:05:33Z","timestamp":1775459133879,"version":"3.50.1"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"23","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: An approach for identifying similarities of protein\u2013protein binding sites is presented. The geometric shape of a binding site is described by computing a feature vector based on moment invariants. In order to search for similarities, feature vectors of binding sites are compared. Similar feature vectors indicate binding sites with similar shapes.<\/jats:p><jats:p>Results: The approach is validated on a representative set of protein\u2013protein binding sites, extracted from the SCOPPI database. When querying binding sites from a representative set, we search for known similarities among 2819 binding sites. A median area under the ROC curve of 0.98 is observed. For half of the queries, a similar binding site is identified among the first two of 2819 when sorting all binding sites according the proposed similarity measure. Typical examples identified by this method are analyzed and discussed. The nitrogenase iron protein-like SCOP family is clustered hierarchically according to the proposed similarity measure as a case study.<\/jats:p><jats:p>Availability: Python code is available on request from the authors.<\/jats:p><jats:p>Contact: \u00a0sommer@mpi-inf.mpg.de<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm503","type":"journal-article","created":{"date-parts":[[2007,11,1]],"date-time":"2007-11-01T00:33:58Z","timestamp":1193877238000},"page":"3139-3146","source":"Crossref","is-referenced-by-count":46,"title":["Moment invariants as shape recognition technique for comparing protein binding sites"],"prefix":"10.1093","volume":"23","author":[{"given":"Ingolf","family":"Sommer","sequence":"first","affiliation":[{"name":"1 Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbr\u00fccken and 2Faculty of Mathematics and Computer Science, Saarland University, Building E1.1, 66041 Saarbr\u00fccken, Germany"}]},{"given":"Oliver","family":"M\u00fcller","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbr\u00fccken and 2Faculty of Mathematics and Computer Science, Saarland University, Building E1.1, 66041 Saarbr\u00fccken, Germany"}]},{"given":"Francisco S.","family":"Domingues","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbr\u00fccken and 2Faculty of Mathematics and Computer Science, Saarland University, Building E1.1, 66041 Saarbr\u00fccken, Germany"}]},{"given":"Oliver","family":"Sander","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbr\u00fccken and 2Faculty of Mathematics and Computer Science, Saarland University, Building E1.1, 66041 Saarbr\u00fccken, Germany"}]},{"given":"Joachim","family":"Weickert","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbr\u00fccken and 2Faculty of Mathematics and Computer Science, Saarland University, Building E1.1, 66041 Saarbr\u00fccken, Germany"}]},{"given":"Thomas","family":"Lengauer","sequence":"additional","affiliation":[{"name":"1 Max-Planck-Institute for Informatics, Stuhlsatzenhausweg 85, 66123 Saarbr\u00fccken and 2Faculty of Mathematics and Computer Science, Saarland University, Building E1.1, 66041 Saarbr\u00fccken, Germany"}]}],"member":"286","published-online":{"date-parts":[[2007,10,31]]},"reference":[{"key":"2023041107510500200_","first-page":"34","article-title":"Nearest neighbor classification in 3D protein data bases","volume-title":"Proceedings 7th International Conference on Intelligent Systems for Molecular Biology","author":"Ankerst","year":"1999"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"518","DOI":"10.1002\/asi.20140","article-title":"Graph theoretic methods for the analysis of structural relationships in biological macromolecules","volume":"56","author":"Artymiuk","year":"2005","journal-title":"J. Am. Soc. Inf. Sci. Technol"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1711","DOI":"10.1002\/jcc.20681","article-title":"Ultrafast shape recognition to search compound databases for similar molecular shapes","volume":"28","author":"Ballester","year":"2007","journal-title":"J. Comput. Chem"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1007\/11496656_36","article-title":"Identifying similar surface patches on proteins using a spin-image surface representation","volume-title":"Combinatorial Pattern Matching: 16th Annual Symposium, CPM 2005, Lecture Notes in Computer Science 3537","author":"Bock","year":"2005"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1089\/cmb.2006.0145","article-title":"Discovery of similar regions on protein surfaces","volume":"14","author":"Bock","year":"2007","journal-title":"J. Comput. Biol"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1016\/S1093-3263(01)00134-6","article-title":"Protein-ligand recognition using spherical harmonic molecular surfaces: towards a fast and efficient filter for large virtual throughput screening","volume":"20","author":"Cai","year":"2002","journal-title":"J. Mol. Graph. Model"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"2082","DOI":"10.1110\/ps.062245906","article-title":"Revisiting the voronoi description of protein\u2013protein interfaces","volume":"15","author":"Cazals","year":"2006","journal-title":"Protein Sci"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1038\/254304a0","article-title":"Structural invariants in protein folding","volume":"254","author":"Chothia","year":"1975","journal-title":"Nature"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","DOI":"10.1016\/j.cag.2004.08.015","article-title":"A barcode shape descriptor for curve point cloud data","volume-title":"Eurographics Symposium on Point-Based Graphics","author":"Collins","year":"2004"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1145\/174462.156635","article-title":"Three-dimensional alpha shapes","volume":"13","author":"Edelsbrunner","year":"1994","journal-title":"ACM Trans. Graph"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"861","DOI":"10.1016\/j.patrec.2005.10.010","article-title":"An introduction to ROC analysis","volume":"27","author":"Fawcett","year":"2006","journal-title":"Pattern Recognit. Lett"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1109\/TPAMI.2003.1177154","article-title":"Moment forms invariant to rotation and blur in arbitrary number of dimensions","volume":"25","author":"Flusser","year":"2003","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1653","DOI":"10.1002\/(SICI)1096-987X(19961115)17:14<1653::AID-JCC7>3.0.CO;2-K","article-title":"A fast method of molecular shape comparison: a simple application of a gaussian description of molecular shape","volume":"17","author":"Grant","year":"1996","journal-title":"J. Comput. Chem"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"414","DOI":"10.1021\/ci0497049","article-title":"Sh2 binding site comparison: a new application of the surfcomp method","volume":"45","author":"Hofbauer","year":"2005","journal-title":"J. Chem. Inf. Model"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"837","DOI":"10.1021\/ci0342371","article-title":"Surfcomp: a novel graph-based approach to molecular surface comparison","volume":"44","author":"Hofbauer","year":"2004","journal-title":"J. Chem. Inf. Comput. Sci"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1006\/jmbi.1993.1489","article-title":"Protein structure comparison by alignment of distance matrices","volume":"233","author":"Holm","year":"1993","journal-title":"J. Mol. Biol"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1671","DOI":"10.1109\/PROC.1984.13073","article-title":"Extended Gaussian images","volume":"72","author":"Horn","year":"1984","journal-title":"Proc. IEEE"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1109\/TIT.1962.1057692","article-title":"Visual pattern recognition by moment invatiants","volume":"8","author":"Hu","year":"1962","journal-title":"IRE. Tran. Inf. Theory"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1080\/10618600.1996.10474713","article-title":"R: a language for data analysis and graphics","volume":"5","author":"Ihaka","year":"1996","journal-title":"J. Comput. Graph. Stat"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"3929","DOI":"10.1093\/bioinformatics\/bti645","article-title":"The SuMo server: 3D search for protein functional sites","volume":"21","author":"Jambon","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"D580","DOI":"10.1093\/nar\/gkl836","article-title":"SNAPPI-DB: a database and api of structures, interfaces and alignments for protein\u2013protein interactions","volume":"35","author":"Jefferson","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/0079-6107(94)00008-W","article-title":"Protein-protein interactions: a review of protein dimer structures","volume":"63","author":"Jones","year":"1995","journal-title":"Prog. Biophys. Mol. Biol"},{"key":"2023041107510500200_","article-title":"Rotation invariant spherical harmonic representation of 3D shape descriptors","author":"Kazhdan","year":"2003"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1043","DOI":"10.1110\/ps.03484604","article-title":"A new, structurally nonredundant, diverse data set of protein\u2013protein interfaces and its implications","volume":"13","author":"Keskin","year":"2004","journal-title":"Protein Sci"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1075","DOI":"10.1002\/prot.20693","article-title":"Survey of the geometric association of domain-domain interfaces","volume":"61","author":"Kim","year":"2005","journal-title":"Proteins"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1371\/journal.pcbi.0020124","article-title":"The many faces of protein\u2013protein interactions: a compendium of interface geometry","volume":"2","author":"Kim","year":"2006","journal-title":"PLoS. Comput. Biol"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1887","DOI":"10.1006\/jmbi.1998.2393","article-title":"Recognition of spatial motifs in protein structures","volume":"285","author":"Kleywegt","year":"1999","journal-title":"J. Mol. Biol"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1109\/34.709598","article-title":"n-dimensional moment invariants and conceptual mathematical theory of recognition of n-dimensional data sets","volume":"20","author":"Mamistvalov","year":"1998","journal-title":"IEEE. Trans. Pattern Anal. Mach. Intell"},{"key":"2023041107510500200_","volume-title":"Data Structures and Efficient Algorithms. Volume 3: Multi-dimensional Searching and Computational Geometry, volume 3 of EATCS Monographs on Theoretical Computer Science","author":"Mehlhorn","year":"1984"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"2347","DOI":"10.1093\/bioinformatics\/bti337","article-title":"Real spherical harmonic expansion coefficients as 3D shape descriptors for protein binding pocket and ligand comparisons","volume":"21","author":"Morris","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041107510500200_","volume-title":"Moment Functions in Image Analysis","author":"Mukundan","year":"1998"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1016\/S0022-2836(05)80134-2","article-title":"SCOP: a structural classification of proteins database for the investigation of sequences and structures","volume":"247","author":"Murzin","year":"1995","journal-title":"J. Mol. Biol"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"807","DOI":"10.1145\/571647.571648","article-title":"Shape distributions","volume":"21","author":"Osada","year":"2002","journal-title":"ACM Trans. Graph"},{"key":"2023041107510500200_","volume-title":"Taschenbuch der Statistik","author":"Rinne","year":"2003"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"739","DOI":"10.1093\/protein\/11.9.739","article-title":"Protein structure alignment by incremental combinatorial extension (CE) of the optimal path","volume":"11","author":"Shindyalov","year":"1998","journal-title":"Protein Eng"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1007\/978-3-540-30219-3_17","article-title":"Protein-protein interfaces: recognition of similar spatial and chemical organizations","volume-title":"Algorithms in Bioinformatics: 4th International Workshop, WABI 2004, Bergen, Norway, 2004, Volume 3240 of Lecture Notes in Computer Science","author":"Shulman-Peleg","year":"2004"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1016\/j.jmb.2004.04.012","article-title":"Recognition of functional sites in protein structures","volume":"339","author":"Shulman-Peleg","year":"2004","journal-title":"J. Mol. Biol"},{"issue":"Web Server issue","key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"W337","DOI":"10.1093\/nar\/gki482","article-title":"SiteEngines: recognition and comparison of binding sites and protein\u2013protein interfaces","volume":"33","author":"Shulman-Peleg","year":"2005","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"2023041107510500200_","first-page":"2103","article-title":"D\u00e9j\u00e1 vu all over again: finding and analyzing protein structure similarities","volume":"12","author":"Sierk","year":"2004","journal-title":"Structure"},{"issue":"20","key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"3940","DOI":"10.1093\/bioinformatics\/bti623","article-title":"ROCR: visualizing classifier performance in R","volume":"21","author":"Sing","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1307","DOI":"10.1016\/S0022-2836(03)00045-7","article-title":"A model for statistical significance of local similarities in structure","volume":"326","author":"Stark","year":"2003","journal-title":"J. Mol. Biol"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1970","DOI":"10.1007\/PL00000677","article-title":"Protein surface similarities: a survey of methods to describe and compare protein surfaces","volume":"57","author":"Via","year":"2000","journal-title":"Cell. Mol. Life Sci"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"D310","DOI":"10.1093\/nar\/gkj099","article-title":"Scoppi: a structural classification of protein\u2013protein interfaces","volume":"34","author":"Winter","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1109\/99.641604","article-title":"Geometric hashing: an overview","volume":"97","author":"Wolfson","year":"1997","journal-title":"IEEE Comput. Sci. Eng"},{"key":"2023041107510500200_","doi-asserted-by":"crossref","first-page":"1029","DOI":"10.1073\/pnas.0407152101","article-title":"The protein structure prediction problem could be solved using the current PDB library","volume":"102","author":"Zhang","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/23\/3139\/49821715\/bioinformatics_23_23_3139.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/23\/3139\/49821715\/bioinformatics_23_23_3139.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,22]],"date-time":"2025-01-22T01:42:47Z","timestamp":1737510167000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/23\/3139\/290489"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,10,31]]},"references-count":45,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2007,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm503","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,12,1]]},"published":{"date-parts":[[2007,10,31]]}}}