{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,24]],"date-time":"2025-08-24T01:23:31Z","timestamp":1755998611772},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"9","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1671,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Structural alignment methods are widely used to generate gold standard alignments for improving multiple sequence alignments and transferring functional annotations, as well as for assigning structural distances between proteins. However, the correctness of the alignments generated by these methods is difficult to assess objectively since little is known about the exact evolutionary history of most proteins. Since homology is an equivalence relation, an upper bound on alignment quality can be found by assessing the consistency of alignments. Measuring the consistency of current methods of structure alignment and determining the causes of inconsistencies can, therefore, provide information on the quality of current methods and suggest possibilities for further improvement.<\/jats:p><jats:p>Results: We analyze the self-consistency of seven widely-used structural alignment methods (SAP, TM-align, Fr-TM-align, MAMMOTH, DALI, CE and FATCAT) on a diverse, non-redundant set of 1863 domains from the SCOP database and demonstrate that even for relatively similar proteins the degree of inconsistency of the alignments on a residue level is high (30%). We further show that levels of consistency vary substantially between methods, with two methods (SAP and Fr-TM-align) producing more consistent alignments than the rest. Inconsistency is found to be higher near gaps and for proteins of low structural complexity, as well as for helices. The ability of the methods to identify good structural alignments is also assessed using geometric measures, for which FATCAT (flexible mode) is found to be the best performer despite being highly inconsistent. We conclude that there is substantial scope for improving the consistency of structural alignment methods.<\/jats:p><jats:p>Contact: \u00a0msadows@nimr.mrc.ac.uk<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts103","type":"journal-article","created":{"date-parts":[[2012,3,8]],"date-time":"2012-03-08T01:14:56Z","timestamp":1331169296000},"page":"1209-1215","source":"Crossref","is-referenced-by-count":15,"title":["Evolutionary inaccuracy of pairwise structural alignments"],"prefix":"10.1093","volume":"28","author":[{"given":"M. I.","family":"Sadowski","sequence":"first","affiliation":[{"name":"Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW71AA, UK"}]},{"given":"W. R.","family":"Taylor","sequence":"additional","affiliation":[{"name":"Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW71AA, UK"}]}],"member":"286","published-online":{"date-parts":[[2012,3,6]]},"reference":[{"key":"2023012512234442200_B1","doi-asserted-by":"crossref","first-page":"1103","DOI":"10.1093\/protein\/9.12.1103","article-title":"Detection of non-topological motifs in protein structures","volume":"9","author":"Alesker","year":"1996","journal-title":"Protein Eng."},{"key":"2023012512234442200_B2","doi-asserted-by":"crossref","first-page":"D419","DOI":"10.1093\/nar\/gkm993","article-title":"Data growth and its impact on the SCOP database: new developments","volume":"36","author":"Andreeva","year":"2008","journal-title":"Nucleic Acids Res."},{"issue":"Suppl. 2","key":"2023012512234442200_B3","doi-asserted-by":"crossref","first-page":"W604","DOI":"10.1093\/nar\/gkl092","article-title":"Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee","volume":"34","author":"Armougom","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012512234442200_B4","doi-asserted-by":"crossref","first-page":"E205","DOI":"10.1093\/bioinformatics\/btl294","article-title":"Vorolign-fast structural alignment using Voronoi contacts","volume":"23","author":"Birzele","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012512234442200_B5","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1093\/nar\/28.1.254","article-title":"The ASTRAL compendium for protein structure and sequence analysis","volume":"28","author":"Brenner","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012512234442200_B6","doi-asserted-by":"crossref","first-page":"3481","DOI":"10.1073\/pnas.0914097107","article-title":"FragBag, an accurate representation of protein structure, retrieves structural neighbours from the entire PDB quickly and accurately","volume":"107","author":"Budowski-Tal","year":"2010","journal-title":"Proc. Natl Acad. Sci."},{"key":"2023012512234442200_B7","doi-asserted-by":"crossref","first-page":"219","DOI":"10.2174\/138920307780831839","article-title":"Recent progress in measuring structural similarity between proteins","volume":"8","author":"Carugo","year":"2007","journal-title":"Curr. Protein Pept. Sci."},{"key":"2023012512234442200_B8","doi-asserted-by":"crossref","first-page":"2935","DOI":"10.1110\/ps.051428205","article-title":"A novel approach to structural alignment using realistic structural and environmental information","volume":"14","author":"Chen","year":"2005","journal-title":"Protein Sci."},{"key":"2023012512234442200_B9","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1016\/S0959-440X(96)80058-3","article-title":"Suprising similarities in structure comparison","volume":"6","author":"Gibrat","year":"1996","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023012512234442200_B10","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1002\/pro.5560050711","article-title":"The structural alignment between two proteins: is there a unique answer?","volume":"5","author":"Godzik","year":"1996","journal-title":"Protein Sci."},{"key":"2023012512234442200_B11","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1006\/jmbi.1993.1489","article-title":"Protein-structure comparison by alignment of distance matrices","volume":"233","author":"Holm","year":"1993","journal-title":"J. Mol. Biol."},{"key":"2023012512234442200_B12","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1016\/j.compbiolchem.2011.04.008","article-title":"Exploring the limits of fold discrimination by structural alignment: a large scale benchmark using decoys of known fold","volume":"35","author":"Hollup","year":"2011","journal-title":"Comput. Biol. Chem."},{"key":"2023012512234442200_B13","doi-asserted-by":"crossref","first-page":"2577","DOI":"10.1002\/bip.360221211","article-title":"Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features","volume":"22","author":"Kabsch","year":"1983","journal-title":"Biopolymers"},{"key":"2023012512234442200_B14","doi-asserted-by":"crossref","first-page":"925","DOI":"10.1093\/bioinformatics\/btr044","article-title":"GOSSIP: a method for fast and accurate global alignment of protein structures","volume":"27","author":"Kifer","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012512234442200_B15","doi-asserted-by":"crossref","first-page":"1173","DOI":"10.1016\/j.jmb.2004.12.032","article-title":"Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures","volume":"346","author":"Kolodny","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012512234442200_B16","doi-asserted-by":"crossref","first-page":"2256","DOI":"10.1107\/S0907444904026460","article-title":"Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions","volume":"60","author":"Krissinel","year":"2004","journal-title":"Acta Crystallogr. D."},{"key":"2023012512234442200_B17","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1093\/protein\/13.11.745","article-title":"ProSup: a refined tool for protein structure alignment","volume":"13","author":"Lackner","year":"2000","journal-title":"Protein Eng."},{"key":"2023012512234442200_B18","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1186\/1472-6807-7-50","article-title":"Comparative analysis of protein structure alignments","volume":"7","author":"Mayr","year":"2007","journal-title":"BMC Struct. Biol."},{"key":"2023012512234442200_B19","doi-asserted-by":"crossref","first-page":"D427","DOI":"10.1093\/nar\/gkq1130","article-title":"Superfamily 1.75 including a domain-centric gene ontology method","volume":"39","author":"Morais","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012512234442200_B20","doi-asserted-by":"crossref","first-page":"352","DOI":"10.1186\/1471-2105-9-352","article-title":"Alignment of protein structures in the presence of domain motions","volume":"9","author":"Mosca","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012512234442200_B21","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-Coffee: a novel method for fast and accurate multiple sequence alignment","volume":"302","author":"Notredame","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012512234442200_B22","doi-asserted-by":"crossref","first-page":"1378","DOI":"10.1109\/TITB.2010.2079939","article-title":"Searching protein 3D structures for optimal structure alignment using intelligent algorithms and data structures","volume":"14","author":"Novosad","year":"2010","journal-title":"IEEE Trans. Inf. Technol. Biomed."},{"key":"2023012512234442200_B23","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1016\/j.jmb.2004.04.058","article-title":"3DCoffee: Combining protein sequences and structures within multiple sequence alignments","volume":"340","author":"O'Sullivan","year":"2004","journal-title":"J. Mol. Biol."},{"key":"2023012512234442200_B24","first-page":"3255","article-title":"MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison","volume":"21","author":"Ortiz","year":"2002","journal-title":"Protein Sci."},{"key":"2023012512234442200_B25","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1186\/1471-2105-9-531","article-title":"Fr-TM-align: a new protein structural alingment methods based on fragment alignments and the TM-score","volume":"9","author":"Pandit","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012512234442200_B26","doi-asserted-by":"crossref","first-page":"W30","DOI":"10.1093\/nar\/gkn322","article-title":"PROMALS3D web server for accurate multiple protein sequence and structure alignments","volume":"36","author":"Pei","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012512234442200_B27","doi-asserted-by":"crossref","first-page":"D129","DOI":"10.1093\/nar\/gkh028","article-title":"The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data","volume":"32","author":"Porter","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012512234442200_B28","doi-asserted-by":"crossref","first-page":"1625","DOI":"10.1093\/bioinformatics\/btp296","article-title":"Flexible structural protein alignment by a sequence of local transformations","volume":"25","author":"Rocha","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512234442200_B29","doi-asserted-by":"crossref","first-page":"033103","DOI":"10.1088\/0953-8984\/22\/3\/033103","article-title":"Protein structures, folds and fold spaces","volume":"22","author":"Sadowski","year":"2010","journal-title":"J. Phys. Condens. Matter"},{"key":"2023012512234442200_B30","doi-asserted-by":"crossref","first-page":"244","DOI":"10.1016\/j.jsb.2010.07.016","article-title":"On the evolutionary origins of \u201cfold space continuity\u201d: a study of topological convergence and divergence in mixed alpha-beta domains","volume":"172","author":"Sadowski","year":"2010","journal-title":"J. Struct. Biol."},{"key":"2023012512234442200_B31","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/1748-7188-5-12","article-title":"FlexSnap: flexible non-sequential protein structure alignment","volume":"5","author":"Salem","year":"2010","journal-title":"Algorithm. Mol. Biol."},{"key":"2023012512234442200_B32","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1007\/BF01796096","article-title":"Recognition of phylogenetic relationships from polypeptide chain fold similarities","volume":"9","author":"Schulz","year":"1977","journal-title":"J. Mol. Evol."},{"key":"2023012512234442200_B33","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1089\/106652704773416902","article-title":"FlexProt: alignment of flexible protein structures without a predefinition of hinge regions","volume":"11","author":"Shatsky","year":"2004","journal-title":"J. Comput. Biol."},{"key":"2023012512234442200_B34","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1109\/TCBB.2011.24","article-title":"A spectral approach to protein structure alignment","volume":"8","author":"Shibberu","year":"2011","journal-title":"IEEE Trans. Comput. Biol. Bioinf."},{"key":"2023012512234442200_B35","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1002\/prot.20124","article-title":"Alternative alignments from comparison of protein structures","volume":"56","author":"Shih","year":"2004","journal-title":"Proteins"},{"key":"2023012512234442200_B36","doi-asserted-by":"crossref","first-page":"739","DOI":"10.1093\/protein\/11.9.739","article-title":"Protein structure alignment by incremental combinatorial extension (CE) of the optimal path","volume":"11","author":"Shindyalov","year":"1998","journal-title":"Prot. Eng."},{"key":"2023012512234442200_B37","doi-asserted-by":"crossref","first-page":"654","DOI":"10.1110\/ps.8.3.654","article-title":"Protein structure comparison using iterated double dynamic programming","volume":"8","author":"Taylor","year":"1999","journal-title":"Protein Sci."},{"key":"2023012512234442200_B38","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1002\/9780470619902.ch7","article-title":"Protein products of tandem gene duplication: a structural view","volume-title":"Evolution After Gene Duplication.","author":"Taylor","year":"2010"},{"key":"2023012512234442200_B39","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/S0097-8485(00)80003-0","article-title":"Multiple protein sequence alignment using double-dynamic programming","volume":"24","author":"Taylor","year":"2000","journal-title":"Comput. Chem."},{"key":"2023012512234442200_B40","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1093\/protein\/15.2.79","article-title":"A Fourier analysis of symmetry in protein structure","volume":"15","author":"Taylor","year":"2002","journal-title":"Prot. Eng."},{"key":"2023012512234442200_B41","doi-asserted-by":"crossref","first-page":"358","DOI":"10.1186\/1471-2105-9-358","article-title":"TOPS++FATCAT: fast flexible structural alignment using constraints derived from TOPS+ strings model","volume":"9","author":"Veeramalai","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012512234442200_B42","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1089\/cmb.2006.13.43","article-title":"Flexible secondary structure based protein structure comparison applied to the detection of circular permutation","volume":"13","author":"Vesterstrom","year":"2006","journal-title":"J. Comput. Biol."},{"key":"2023012512234442200_B43","doi-asserted-by":"crossref","first-page":"ii246","DOI":"10.1093\/bioinformatics\/btg1086","article-title":"Flexible structure alignment by chaining aligned fragment pairs allowing twists","volume":"19","author":"Ye","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012512234442200_B44","doi-asserted-by":"crossref","first-page":"702","DOI":"10.1002\/prot.20264","article-title":"Scoring function for the assessment of protein structure template quality","volume":"52","author":"Zhang","year":"2004","journal-title":"Proteins"},{"key":"2023012512234442200_B45","doi-asserted-by":"crossref","first-page":"2302","DOI":"10.1093\/nar\/gki524","article-title":"TM-align: a protein structure alignment algorithm based on the TM-score","volume":"33","author":"Zhang","year":"2005","journal-title":"Nucleic Acids Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/9\/1209\/48879814\/bioinformatics_28_9_1209.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/9\/1209\/48879814\/bioinformatics_28_9_1209.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,21]],"date-time":"2024-04-21T05:57:47Z","timestamp":1713679067000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/9\/1209\/311088"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,3,6]]},"references-count":45,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2012,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts103","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,5,1]]},"published":{"date-parts":[[2012,3,6]]}}}