{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T08:19:22Z","timestamp":1760170762600},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"24","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":1801,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,12,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Evaluating alternative multiple protein sequence alignments is an important unsolved problem in Biology. The most accurate way of doing this is to use structural information. Unfortunately, most methods require at least two structures to be embedded in the alignment, a condition rarely met when dealing with standard datasets.<\/jats:p>\n               <jats:p>Result: We developed STRIKE, a method that determines the relative accuracy of two alternative alignments of the same sequences using a single structure. We validated our methodology on three commonly used reference datasets (BAliBASE, Homestrad and Prefab). Given two alignments, STRIKE manages to identify the most accurate one in 70% of the cases on average. This figure increases to 79% when considering very challenging datasets like the RV11 category of BAliBASE. This discrimination capacity is significantly higher than that reported for other metrics such as Contact Accepted mutation or Blosum. We show that this increased performance results both from a refined definition of the contacts and from the use of an improved contact substitution score.<\/jats:p>\n               <jats:p>Contact: \u00a0cedric.notredame@crg.eu<\/jats:p>\n               <jats:p>Availability: STRIKE is an open source freeware available from www.tcoffee.org<\/jats:p>\n               <jats:p>Supplementary Information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr587","type":"journal-article","created":{"date-parts":[[2011,10,29]],"date-time":"2011-10-29T02:04:25Z","timestamp":1319853865000},"page":"3385-3391","source":"Crossref","is-referenced-by-count":23,"title":["STRIKE: evaluation of protein MSAs using a single 3D structure"],"prefix":"10.1093","volume":"27","author":[{"given":"Carsten","family":"Kemena","sequence":"first","affiliation":[{"name":"1 Bioinformatics and Genomics Program, Centre for Genomic Regulation (CRG) and UPF, Aiguader, 88, 08003 Barcelona, Spain and 2Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK"}]},{"given":"Jean-Francois","family":"Taly","sequence":"additional","affiliation":[{"name":"1 Bioinformatics and Genomics Program, Centre for Genomic Regulation (CRG) and UPF, Aiguader, 88, 08003 Barcelona, Spain and 2Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK"}]},{"given":"Jens","family":"Kleinjung","sequence":"additional","affiliation":[{"name":"1 Bioinformatics and Genomics Program, Centre for Genomic Regulation (CRG) and UPF, Aiguader, 88, 08003 Barcelona, Spain and 2Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK"}]},{"given":"Cedric","family":"Notredame","sequence":"additional","affiliation":[{"name":"1 Bioinformatics and Genomics Program, Centre for Genomic Regulation (CRG) and UPF, Aiguader, 88, 08003 Barcelona, Spain and 2Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK"}]}],"member":"286","published-online":{"date-parts":[[2011,10,28]]},"reference":[{"key":"2023012511312859800_B1","doi-asserted-by":"crossref","first-page":"555","DOI":"10.1016\/0022-2836(91)90193-A","article-title":"Amino acid substitution matrices from an information theoretic perspective","volume":"219","author":"Altschul","year":"1991","journal-title":"J. Mol. Biol."},{"key":"2023012511312859800_B2","doi-asserted-by":"crossref","first-page":"6338","DOI":"10.1093\/nar\/gkq526","article-title":"AlexSys: a knowledge-based expert system for multiple sequence alignment construction and analysis","volume":"38","author":"Aniba","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B3","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The Protein Data Bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B4","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1126\/science.1853201","article-title":"A method to identify protein sequences that fold into a known three-dimensional structure","volume":"253","author":"Bowie","year":"1991","journal-title":"Science"},{"key":"2023012511312859800_B5","doi-asserted-by":"crossref","first-page":"D189","DOI":"10.1093\/nar\/gkh034","article-title":"The ASTRAL Compendium in 2004","volume":"32","author":"Chandonia","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B6","doi-asserted-by":"crossref","first-page":"W606","DOI":"10.1093\/nar\/gkh400","article-title":"CaspR: a web server for automated molecular replacement using homology modelling","volume":"32","author":"Claude","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B7","doi-asserted-by":"crossref","first-page":"709","DOI":"10.1126\/science.6879170","article-title":"Solvent-accessible surfaces of proteins and nucleic acids","volume":"221","author":"Connolly","year":"1983","journal-title":"Science"},{"key":"2023012511312859800_B8","first-page":"353","article-title":"A model of evolutionary change in proteins. Detecting distant relationships: computer methods and results","volume-title":"Atlas of Protein Sequence and Structure.","author":"Dayhoff","year":"1979"},{"key":"2023012511312859800_B9","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1101\/gr.2821705","article-title":"ProbCons: probabilistic consistency-based multiple sequence alignment","volume":"15","author":"Do","year":"2005","journal-title":"Genome Res."},{"key":"2023012511312859800_B10","doi-asserted-by":"crossref","first-page":"1792","DOI":"10.1093\/nar\/gkh340","article-title":"MUSCLE: multiple sequence alignment with high accuracy and high throughput","volume":"32","author":"Edgar","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B11","doi-asserted-by":"crossref","first-page":"1546","DOI":"10.1093\/bioinformatics\/bth126","article-title":"Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems","volume":"20","author":"Grasso","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012511312859800_B12","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511312859800_B13","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1038\/358086a0","article-title":"A new approach to protein fold recognition","volume":"358","author":"Jones","year":"1992","journal-title":"Nature"},{"key":"2023012511312859800_B14","doi-asserted-by":"crossref","first-page":"511","DOI":"10.1093\/nar\/gki198","article-title":"MAFFT version 5: improvement in accuracy of multiple sequence alignment","volume":"33","author":"Katoh","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B15","doi-asserted-by":"crossref","first-page":"7120","DOI":"10.1093\/nar\/gki1020","article-title":"Automatic assessment of alignment quality","volume":"33","author":"Lassmann","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B16","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1016\/S1476-9271(03)00022-7","article-title":"Testing homology with Contact Accepted mutatiOn (CAO): a contact-based Markov model of protein evolution","volume":"27","author":"Lin","year":"2003","journal-title":"Comput. Biol. Chem."},{"key":"2023012511312859800_B17","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1038\/356083a0","article-title":"Assessment of protein models with three-dimensional profiles","volume":"356","author":"L\u00fcthy","year":"1992","journal-title":"Nature"},{"key":"2023012511312859800_B18","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1002\/prot.10231","article-title":"FROST: a filter-based fold recognition method","volume":"49","author":"Marin","year":"2002","journal-title":"Proteins"},{"key":"2023012511312859800_B19","doi-asserted-by":"crossref","first-page":"863","DOI":"10.1101\/gr.115949.110","article-title":"High sensitivity to aligner and high rate of false positives in the estimates of positive selection in the 12 Drosophila genomes","volume":"21","author":"Markova-Raina","year":"2011","journal-title":"Genome Res."},{"key":"2023012511312859800_B20","doi-asserted-by":"crossref","first-page":"2469","DOI":"10.1002\/pro.5560071126","article-title":"HOMSTRAD: a database of protein structure alignments for homologous families","volume":"7","author":"Mizuguchi","year":"1998","journal-title":"Protein Sci."},{"key":"2023012511312859800_B21","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1016\/S0022-2836(05)80134-2","article-title":"SCOP: a structural classification of proteins database for the investigation of sequences and structures","volume":"247","author":"Murzin","year":"1995","journal-title":"J. Mol. Biol."},{"key":"2023012511312859800_B22","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-Coffee: a novel method for fast and accurate multiple sequence alignment","volume":"302","author":"Notredame","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012511312859800_B23","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1016\/j.jmb.2004.04.058","article-title":"3DCoffee: combining protein sequences and structures within multiple sequence alignments","volume":"340","author":"O'Sullivan","year":"2004","journal-title":"J. Mol. Biol."},{"key":"2023012511312859800_B24","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1093\/bioinformatics\/btg008","article-title":"PCMA: fast and accurate multiple sequence alignment based on profile consistency","volume":"19","author":"Pei","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012511312859800_B25","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1006\/jmbi.2001.4762","article-title":"FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties","volume":"310","author":"Shi","year":"2001","journal-title":"J. Mol. Biol."},{"key":"2023012511312859800_B26","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1186\/1471-2105-11-146","article-title":"Improving pairwise sequence alignment accuracy using near-optimal protein sequence alignments","volume":"11","author":"Sierk","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012511312859800_B27","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1002\/prot.340170404","article-title":"Recognition of errors in three-dimensional structures of proteins","volume":"17","author":"Sippl","year":"1993","journal-title":"Proteins Struct. Funct. Genet."},{"key":"2023012511312859800_B28","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1186\/1471-2105-9-6","article-title":"Can molecular dynamics simulations help in discriminating correct from erroneous protein 3D models?","volume":"9","author":"Taly","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012511312859800_B29","doi-asserted-by":"crossref","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","article-title":"CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice","volume":"22","author":"Thompson","year":"1994","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B30","article-title":"Multiple sequence alignment using ClustalW and ClustalX","author":"Thompson","year":"2002","journal-title":"Curr. Protoc. Bioinformatics"},{"key":"2023012511312859800_B31","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1093\/bioinformatics\/btg133","article-title":"RASCAL: rapid scanning and correction of multiple sequence alignments","volume":"19","author":"Thompson","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012511312859800_B32","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1002\/prot.20527","article-title":"BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark","volume":"61","author":"Thompson","year":"2005","journal-title":"Proteins"},{"key":"2023012511312859800_B33","doi-asserted-by":"crossref","first-page":"1692","DOI":"10.1093\/nar\/gkl091","article-title":"M-Coffee: combining multiple sequence alignment methods with T-Coffee","volume":"34","author":"Wallace","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012511312859800_B34","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1126\/science.1151532","article-title":"Alignment uncertainty and genomic analysis","volume":"319","author":"Wong","year":"2008","journal-title":"Science"},{"key":"2023012511312859800_B35","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1002\/prot.21945","article-title":"MUSTER: improving protein sequence profile-profile alignments by using multiple sources of structure information","volume":"72","author":"Wu","year":"2008","journal-title":"Proteins"},{"key":"2023012511312859800_B36","doi-asserted-by":"crossref","first-page":"15688","DOI":"10.1073\/pnas.2533904100","article-title":"The compositional adjustment of amino acid substitution matrices","volume":"100","author":"Yu","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511312859800_B37","doi-asserted-by":"crossref","first-page":"7594","DOI":"10.1073\/pnas.0305695101","article-title":"Automated structure prediction of weakly homologous proteins on a genomic scale","volume":"101","author":"Zhang","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/24\/3385\/48861993\/bioinformatics_27_24_3385.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/24\/3385\/48861993\/bioinformatics_27_24_3385.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T11:32:18Z","timestamp":1674646338000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/24\/3385\/306640"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10,28]]},"references-count":37,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2011,12,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr587","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,12,15]]},"published":{"date-parts":[[2011,10,28]]}}}