{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,4]],"date-time":"2024-08-04T13:14:42Z","timestamp":1722777282507},"reference-count":58,"publisher":"Oxford University Press (OUP)","issue":"24","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: While protein secondary structure is well understood, representing the repetitive nature of tertiary packing in proteins remains difficult. We have developed a construct called the relative packing group (RPG) that applies the clique concept from graph theory as a natural basis for defining the packing motifs in proteins. An RPG is defined as a clique of residues, where every member contacts all others as determined by the Delaunay tessellation. Geometrically similar RPGs define a regular element of tertiary structure or tertiary motif (TerMo). This intuitive construct provides a simple approach to characterize general repetitive elements of tertiary structure.<\/jats:p>\n               <jats:p>Results: A dataset of over 4 million tetrahedral RPGs was clustered using different criteria to characterize the various aspects of regular tertiary structure in TerMos. Grouping this data within the SCOP classification levels of Family, Superfamily, Fold, Class and PDB showed that similar packing is shared across different folds. Classification of RPGs based on residue sequence locality reveals topological preferences according to protein sizes and secondary structure. We find that larger proteins favor RPGs with three local residues packed against a non-local residue. Classifying by secondary structure, helices prefer mostly local residues, sheets favor at least two local residues, while turns and coil populate with more local residues. To depict these TerMos, we have developed 2 complementary and intuitive representations: (i) Dirichlet process mixture density estimation of the torsion angle distributions and (ii) kernel density estimation of the Cartesian coordinate distribution. The TerMo library and representations software are available upon request.<\/jats:p>\n               <jats:p>Contact: \u00a0jtsai@pacific.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq573","type":"journal-article","created":{"date-parts":[[2010,11,4]],"date-time":"2010-11-04T00:15:19Z","timestamp":1288829719000},"page":"3059-3066","source":"Crossref","is-referenced-by-count":11,"title":["Characterizing the regularity of tetrahedral packing motifs in protein tertiary structure"],"prefix":"10.1093","volume":"26","author":[{"given":"Ryan","family":"Day","sequence":"first","affiliation":[{"name":"1 Department of Chemistry, University of the Pacific, Stockton, CA 95211, 2Department of Statistics, Texas A&M University, College Station, TX 77843 and 3Department of Statistics, Rice University, Houston, TX 77251, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kristin P.","family":"Lennox","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, University of the Pacific, Stockton, CA 95211, 2Department of Statistics, Texas A&M University, College Station, TX 77843 and 3Department of Statistics, Rice University, Houston, TX 77251, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David B.","family":"Dahl","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, University of the Pacific, Stockton, CA 95211, 2Department of Statistics, Texas A&M University, College Station, TX 77843 and 3Department of Statistics, Rice University, Houston, TX 77251, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marina","family":"Vannucci","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, University of the Pacific, Stockton, CA 95211, 2Department of Statistics, Texas A&M University, College Station, TX 77843 and 3Department of Statistics, Rice University, Houston, TX 77251, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jerry W.","family":"Tsai","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, University of the Pacific, Stockton, CA 95211, 2Department of Statistics, Texas A&M University, College Station, TX 77843 and 3Department of Statistics, Rice University, Houston, TX 77251, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2010,11,2]]},"reference":[{"key":"2023012508030734500_B1","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1006\/jmbi.1994.1657","article-title":"A graph-theoretic approach to the identification of three-dimensional patterns of amino acid side-chains in protein structures","volume":"243","author":"Artymiuk","year":"1994","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B2","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1002\/prot.10435","article-title":"The origin and extent of coarse-grained regularities in protein internal packing","volume":"53","author":"Bagci","year":"2003","journal-title":"Proteins"},{"key":"2023012508030734500_B3","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1002\/pro.5560040404","article-title":"Characterizing the microenvironment surrounding protein sites","volume":"4","author":"Bagley","year":"1995","journal-title":"Protein Sci."},{"key":"2023012508030734500_B4","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1007\/s10822-009-9273-4","article-title":"Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development","volume":"23","author":"Bandyopadhyay","year":"2009","journal-title":"J. Comput. Aided Mol. Des."},{"key":"2023012508030734500_B5","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1007\/s10822-009-9277-0","article-title":"Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications","volume":"23","author":"Bandyopadhyay","year":"2009","journal-title":"J. Comput. Aided Mol. Des."},{"key":"2023012508030734500_B6","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1089\/cmb.1995.2.125","article-title":"Algorithms for protein structural motif recognition","volume":"2","author":"Berger","year":"1995","journal-title":"J. Comput. Biol."},{"key":"2023012508030734500_B7","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1089\/cmb.1997.4.261","article-title":"An iterative method for improved protein structural motif recognition","volume":"4","author":"Berger","year":"1997","journal-title":"J. Comput. Biol."},{"key":"2023012508030734500_B8","doi-asserted-by":"crossref","first-page":"8500","DOI":"10.1073\/pnas.112221999","article-title":"TRILOGY: discovery of sequence-structure patterns across diverse proteins","volume":"99","author":"Bradley","year":"2002","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030734500_B9","doi-asserted-by":"crossref","first-page":"1868","DOI":"10.1126\/science.1113801","article-title":"Toward high-resolution de novo structure prediction for small proteins","volume":"309","author":"Bradley","year":"2005","journal-title":"Science"},{"key":"2023012508030734500_B10","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1145\/362342.362367","article-title":"Finding all cliques of an undirected graph","volume":"16","author":"Bron","year":"1973","journal-title":"Commun. ACM"},{"key":"2023012508030734500_B11","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1002\/prot.340210302","article-title":"Funnels, pathways, and the energy landscape of protein folding: a synthesis","volume":"21","author":"Bryngelson","year":"1995","journal-title":"Proteins"},{"key":"2023012508030734500_B12","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1006\/jmbi.1998.1943","article-title":"Prediction of local structure in proteins using a library of sequence-structure motifs","volume":"281","author":"Bystroff","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B13","doi-asserted-by":"crossref","first-page":"D189","DOI":"10.1093\/nar\/gkh034","article-title":"The ASTRAL Compendium in 2004","volume":"32","author":"Chandonia","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012508030734500_B14","first-page":"793","article-title":"Sur la sphere vide [The Empty Sphere]","volume":"7","author":"Delaunay","year":"1934","journal-title":"Izv Akad Nauk SSSR, Otdelenie Matematicheskikh i Estestvennykh Nauk"},{"key":"2023012508030734500_B15","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1186\/1471-2105-8-106","article-title":"Discovering structural motifs using a structural alphabet: application to magnesium-binding sites","volume":"8","author":"Dudev","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012508030734500_B16","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s12033-008-9127-7","article-title":"Data deposition and annotation at the worldwide protein data bank","volume":"42","author":"Dutta","year":"2009","journal-title":"Mol. Biotechnol."},{"key":"2023012508030734500_B17","doi-asserted-by":"crossref","first-page":"1792","DOI":"10.1093\/nar\/gkh340","article-title":"MUSCLE: multiple sequence alignment with high accuracy and high throughput","volume":"32","author":"Edgar","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012508030734500_B18","doi-asserted-by":"crossref","first-page":"4721","DOI":"10.1021\/bi00181a032","article-title":"Two crystal structures of the B1 immunoglobulin-binding domain of streptococcal protein G and comparison with NMR","volume":"33","author":"Gallagher","year":"1994","journal-title":"Biochemistry"},{"key":"2023012508030734500_B19","doi-asserted-by":"crossref","first-page":"47500","DOI":"10.1074\/jbc.M206105200","article-title":"Evidence for plasticity and structural mimicry at the immunoglobulin light chain-protein L interface","volume":"277","author":"Graille","year":"2002","journal-title":"J. Biol. Chem."},{"key":"2023012508030734500_B20","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1093\/protein\/6.1.29","article-title":"The prediction and characterization of metal binding sites in proteins","volume":"6","author":"Gregory","year":"1993","journal-title":"Protein Eng."},{"key":"2023012508030734500_B21","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1002\/prot.10520","article-title":"Sequence and structural analysis of cellular retinoic acid-binding proteins reveals a network of conserved hydrophobic interactions","volume":"54","author":"Gunasekaran","year":"2004","journal-title":"Proteins"},{"key":"2023012508030734500_B22","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1016\/0022-2836(91)90388-M","article-title":"Side-chain clusters in protein structures and their role in protein folding","volume":"220","author":"Heringa","year":"1991","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B23","doi-asserted-by":"crossref","first-page":"706","DOI":"10.1016\/j.jmb.2005.09.081","article-title":"Characterizing conserved structural contacts by pair-wise relative contacts and relative packing groups","volume":"354","author":"Holmes","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B24","doi-asserted-by":"crossref","first-page":"657","DOI":"10.1089\/cmb.2005.12.657","article-title":"Comparing graph representations of protein structure for mining family-specific residue-based packing motifs","volume":"12","author":"Huan","year":"2005","journal-title":"J. Comput. Biol."},{"key":"2023012508030734500_B25","first-page":"411","article-title":"Accurate classification of protein structural families using coherent subgraph analysis","volume":"2004","author":"Huan","year":"2004","journal-title":"Pac. Symp. Biocomput."},{"key":"2023012508030734500_B26","doi-asserted-by":"crossref","first-page":"2057","DOI":"10.1110\/ps.0302503","article-title":"Contact order revisited: influence of protein size on the folding rate","volume":"12","author":"Ivankov","year":"2003","journal-title":"Protein Sci."},{"key":"2023012508030734500_B27","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1006\/jmbi.1999.3058","article-title":"Identification of side-chain clusters in protein structures by a graph spectral method","volume":"292","author":"Kannan","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B28","doi-asserted-by":"crossref","first-page":"1887","DOI":"10.1006\/jmbi.1998.2393","article-title":"Recognition of spatial motifs in protein structures","volume":"285","author":"Kleywegt","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B29","first-page":"427","article-title":"The structure of hydrophobic cores of globins","volume":"8","author":"Kozitsyn","year":"1975","journal-title":"Mol. Biol."},{"key":"2023012508030734500_B30","doi-asserted-by":"crossref","first-page":"10383","DOI":"10.1073\/pnas.97.19.10383","article-title":"Native protein sequences are close to optimal for their structures","volume":"97","author":"Kuhlman","year":"2000","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030734500_B31","doi-asserted-by":"crossref","first-page":"1364","DOI":"10.1126\/science.1089427","article-title":"Design of a novel globular protein fold with atomic-level accuracy","volume":"302","author":"Kuhlman","year":"2003","journal-title":"Science"},{"key":"2023012508030734500_B32","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1006\/jmbi.2001.5229","article-title":"Accurate computer-based design of a new backbone conformation in the second turn of protein L","volume":"315","author":"Kuhlman","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B33","doi-asserted-by":"crossref","first-page":"9429","DOI":"10.1073\/pnas.89.20.9429","article-title":"Three-dimensional structure of two crystal forms of FabR19.9 from a monoclonal anti-arsonate antibody","volume":"89","author":"Lascombe","year":"1992","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030734500_B34","doi-asserted-by":"crossref","first-page":"586","DOI":"10.1198\/jasa.2009.0024","article-title":"Density estimation for protein conformational angles using a bivariate von Mises distribution and Bayesian nonparametrics","volume":"104","author":"Lennox","year":"2009","journal-title":"J. Am. Stat. Soc."},{"key":"2023012508030734500_B35","doi-asserted-by":"crossref","first-page":"639","DOI":"10.1089\/cmb.2008.0176","article-title":"Conditional graphical models for protein structural motif recognition","volume":"16","author":"Liu","year":"2009","journal-title":"J. Comput. Biol."},{"key":"2023012508030734500_B36","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1016\/S0022-2836(05)80134-2","article-title":"SCOP: a structural classification of proteins database for the investigation of sequences and structures","volume":"247","author":"Murzin","year":"1995","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B37","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1093\/protein\/6.3.247","article-title":"Atomic environments of arginine side chains in proteins","volume":"6","author":"Nandi","year":"1993","journal-title":"Protein Eng."},{"key":"2023012508030734500_B38","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1186\/1471-2105-8-321","article-title":"Automatic generation of 3D motifs for classification of protein binding sites","volume":"8","author":"Nebel","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012508030734500_B39","doi-asserted-by":"crossref","first-page":"2606","DOI":"10.1110\/ps.0215902","article-title":"MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison","volume":"11","author":"Ortiz","year":"2002","journal-title":"Protein Sci."},{"key":"2023012508030734500_B40","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1073\/pnas.37.5.235","article-title":"Atomic coordinates and structure factors for two helical configurations of polypeptide chains","volume":"37","author":"Pauling","year":"1951","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030734500_B41","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1073\/pnas.37.5.251","article-title":"The pleated sheet, a new layer configuration of polypeptide chains","volume":"37","author":"Pauling","year":"1951","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508030734500_B42","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1007\/s007750100214","article-title":"Structural characteristics of protein binding sites for calcium and lanthanide ions","volume":"6","author":"Pidcock","year":"2001","journal-title":"J. Biol. Inorg. Chem."},{"key":"2023012508030734500_B43","doi-asserted-by":"crossref","first-page":"985","DOI":"10.1006\/jmbi.1998.1645","article-title":"Contact order, transition state placement and the refolding rates of single domain proteins","volume":"277","author":"Plaxco","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B44","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1016\/S0022-2836(63)80023-6","article-title":"Stereochemistry of polypeptide chain configurations","volume":"7","author":"Ramachandran","year":"1963","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B45","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1002\/prot.20479","article-title":"Structure alignment via Delaunay tetrahedralization","volume":"60","author":"Roach","year":"2005","journal-title":"Proteins"},{"key":"2023012508030734500_B46","doi-asserted-by":"crossref","first-page":"1211","DOI":"10.1006\/jmbi.1998.1844","article-title":"Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution","volume":"279","author":"Russell","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B47","doi-asserted-by":"crossref","first-page":"903","DOI":"10.1006\/jmbi.1998.2043","article-title":"Supersites within superfolds. Binding site similarity in the absence of homology","volume":"282","author":"Russell","year":"1998","journal-title":"J. Mol. Biol."},{"key":"2023012508030734500_B48","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1089\/cmb.1994.1.121","article-title":"Three-dimensional searching for recurrent structural motifs in data bases of protein structures","volume":"1","author":"Rustici","year":"1994","journal-title":"J. Comput. Biol."},{"key":"2023012508030734500_B49","doi-asserted-by":"crossref","first-page":"1919","DOI":"10.1016\/S0006-3495(03)75000-0","article-title":"Role of hydrophobic clusters and long-range contact networks in the folding of (alpha\/beta)8 barrel proteins","volume":"84","author":"Selvaraj","year":"2003","journal-title":"Biophys. J."},{"key":"2023012508030734500_B50","doi-asserted-by":"crossref","first-page":"3320","DOI":"10.1093\/bioinformatics\/btm527","article-title":"Support vector machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs","volume":"23","author":"Shamim","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508030734500_B51","doi-asserted-by":"crossref","first-page":"1331","DOI":"10.1093\/bioinformatics\/btm121","article-title":"Searching for three-dimensional secondary structural patterns in proteins with ProSMoS","volume":"23","author":"Shi","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508030734500_B52","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1093\/biomet\/89.3.719","article-title":"Probabilistic model for two dependent circular variables","volume":"89","author":"Singh","year":"2002","journal-title":"Biometrika"},{"key":"2023012508030734500_B53","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1089\/cmb.1996.3.213","article-title":"Delaunay tessellation of proteins: four body nearest-neighbor propensities of amino acid residues","volume":"3","author":"Singh","year":"1996","journal-title":"J. Comput. Biol."},{"key":"2023012508030734500_B54","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1021\/ci0255984","article-title":"Searching for patterns of amino acids in 3D protein structures","volume":"43","author":"Spriggs","year":"2003","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023012508030734500_B55","doi-asserted-by":"crossref","first-page":"15558","DOI":"10.1021\/bi961409x","article-title":"Solution structure of the E-domain of staphylococcal protein A","volume":"35","author":"Starovasnik","year":"1996","journal-title":"Biochemistry"},{"key":"2023012508030734500_B56","doi-asserted-by":"crossref","first-page":"763","DOI":"10.1093\/protein\/10.7.763","article-title":"Prediction of protein supersecondary structures based on the artificial neural network method","volume":"10","author":"Sun","year":"1997","journal-title":"Protein Eng."},{"key":"2023012508030734500_B57","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1515\/crll.1908.134.198","article-title":"Nouveles applications des param\u00e9tres continus \u00e0 la th\u00e9orie des formes quadratiques [New applications of continuous parameters in the theory of quadratic forms]","volume":"134","author":"Voronoi","year":"1908","journal-title":"J. Reine Angew. Math."},{"key":"2023012508030734500_B58","doi-asserted-by":"crossref","first-page":"3009","DOI":"10.1016\/j.csda.2005.06.019","article-title":"A Bayesian approach to bandwidth selection for multivariate kernel density estimation","volume":"50","author":"Zhang","year":"2006","journal-title":"Comput. Stat. Data Anal."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/24\/3059\/48854084\/bioinformatics_26_24_3059.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/24\/3059\/48854084\/bioinformatics_26_24_3059.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:05:19Z","timestamp":1674633919000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/24\/3059\/287386"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,11,2]]},"references-count":58,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2010,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq573","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,12,15]]},"published":{"date-parts":[[2010,11,2]]}}}