{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,11]],"date-time":"2026-01-11T12:03:59Z","timestamp":1768133039075,"version":"3.49.0"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Multiple sequence alignment (MSA) is a core method in bioinformatics. The accuracy of such alignments may influence the success of downstream analyses such as phylogenetic inference, protein structure prediction, and functional prediction. The importance of MSA has lead to the proliferation of MSA methods, with different objective functions and heuristics to search for the optimal MSA. Different methods of inferring MSAs produce different results in all but the most trivial cases. By measuring the differences between inferred alignments, we may be able to develop an understanding of how these differences (i) relate to the objective functions and heuristics used in MSA methods, and (ii) affect downstream analyses.<\/jats:p>\n               <jats:p>Results: We introduce four metrics to compare MSAs, which include the position in a sequence where a gap occurs or the location on a phylogenetic tree where an insertion or deletion (indel) event occurs. We use both real and synthetic data to explore the information given by these metrics and demonstrate how the different metrics in combination can yield more information about MSA methods and the differences between them.<\/jats:p>\n               <jats:p>Availability: \u00a0MetAl is a free software implementation of these metrics in Haskell. Source and binaries for Windows, Linux and Mac OS X are available from http:\/\/kumiho.smith.man.ac.uk\/whelan\/software\/metal\/.<\/jats:p>\n               <jats:p>Contact: \u00a0simon.whelan@manchester.ac.uk<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr701","type":"journal-article","created":{"date-parts":[[2011,12,25]],"date-time":"2011-12-25T01:23:15Z","timestamp":1324776195000},"page":"495-502","source":"Crossref","is-referenced-by-count":64,"title":["Measuring the distance between multiple sequence alignments"],"prefix":"10.1093","volume":"28","author":[{"given":"Benjamin P.","family":"Blackburne","sequence":"first","affiliation":[{"name":"Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Manchester M13 9PT"}]},{"given":"Simon","family":"Whelan","sequence":"additional","affiliation":[{"name":"Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Manchester M13 9PT"}]}],"member":"286","published-online":{"date-parts":[[2011,12,23]]},"reference":[{"key":"2023012512183095000_B1","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1093\/bioinformatics\/bti770","article-title":"The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling","volume":"22","author":"Arnold","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012512183095000_B2","doi-asserted-by":"crossref","first-page":"2090","DOI":"10.1093\/molbev\/msl080","article-title":"Exploring the relationship between sequence similarity and accurate phylogenetic trees","volume":"23","author":"Cantarel","year":"2006","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B3","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-00234-2","volume-title":"Encyclopedia of Distances.","author":"Deza","year":"2009"},{"key":"2023012512183095000_B4","doi-asserted-by":"crossref","first-page":"e11082","DOI":"10.1371\/journal.pone.0011082","article-title":"Identifying and seeing beyond multiple sequence alignment errors using intra-molecular protein covariation","volume":"5","author":"Dickson","year":"2010","journal-title":"PLoS One"},{"key":"2023012512183095000_B5","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1101\/gr.2821705","article-title":"ProbCons: probabilistic consistency-based multiple sequence alignment","volume":"15","author":"Do","year":"2005","journal-title":"Genome Res."},{"key":"2023012512183095000_B6","first-page":"205","article-title":"A new generation of homology search tools based on probabilistic inference","volume":"23","author":"Eddy","year":"2009","journal-title":"Genome Inform."},{"key":"2023012512183095000_B7","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1016\/j.sbi.2006.04.004","article-title":"Multiple sequence alignment","volume":"16","author":"Edgar","year":"2006","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023012512183095000_B8","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1186\/1471-2105-5-113","article-title":"MUSCLE: a multiple sequence alignment method with reduced time and space complexity","volume":"5","author":"Edgar","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023012512183095000_B9","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1101\/gr.8.3.163","article-title":"Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis","volume":"8","author":"Eisen","year":"1998","journal-title":"Genome Res."},{"key":"2023012512183095000_B10","volume-title":"Inferring Phylogenies.","author":"Felsenstein","year":"2003"},{"key":"2023012512183095000_B11","doi-asserted-by":"crossref","first-page":"1879","DOI":"10.1093\/molbev\/msp098","article-title":"INDELible: a flexible simulator of biological sequence evolution","volume":"26","author":"Fletcher","year":"2009","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B12","doi-asserted-by":"crossref","first-page":"2257","DOI":"10.1093\/molbev\/msq115","article-title":"The effect of insertions, deletions, and alignment errors on the branch-site test of positive selection","volume":"27","author":"Fletcher","year":"2010","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B13","doi-asserted-by":"crossref","first-page":"2433","DOI":"10.1093\/molbev\/msm176","article-title":"Mind the gaps: evidence of bias in estimates of multiple sequence alignments","volume":"24","author":"Golubchik","year":"2007","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B14","doi-asserted-by":"crossref","first-page":"W29","DOI":"10.1093\/nar\/gkq298","article-title":"SATCHMO-JS: a webserver for simultaneous protein multiple sequence alignment and phylogenetic tree construction","volume":"38","author":"Hagopian","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012512183095000_B15","doi-asserted-by":"crossref","first-page":"792","DOI":"10.1093\/molbev\/msi066","article-title":"Comparison of the accuracies of several phylogenetic methods using protein and DNA sequences","volume":"22","author":"Hall","year":"2005","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B16","doi-asserted-by":"crossref","first-page":"2044","DOI":"10.1093\/bioinformatics\/bth198","article-title":"Phylogenetic trees based on gene content","volume":"20","author":"Huson","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012512183095000_B17","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1093\/bib\/bbn013","article-title":"Recent developments in the MAFFT multiple sequence alignment program","volume":"9","author":"Katoh","year":"2008","journal-title":"Brief. Bioinformatics"},{"key":"2023012512183095000_B18","doi-asserted-by":"crossref","first-page":"1380","DOI":"10.1093\/molbev\/msm060","article-title":"Heads or tails: a simple reliability check for multiple sequence alignments","volume":"24","author":"Landan","year":"2007","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B19","doi-asserted-by":"crossref","first-page":"2947","DOI":"10.1093\/bioinformatics\/btm404","article-title":"Clustal W and Clustal X version 2.0","volume":"23","author":"Larkin","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012512183095000_B20","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1016\/S0014-5793(02)03189-7","article-title":"Quality assessment of multiple alignment programs","volume":"529","author":"Lassmann","year":"2002","journal-title":"FEBS Lett."},{"key":"2023012512183095000_B21","volume-title":"An Introduction to Chemoinformatics.","author":"Leach","year":"2003"},{"key":"2023012512183095000_B22","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1023\/A:1019154432472","article-title":"A proof of the triangle inequality for the Tanimoto distance","volume":"26","author":"Lipkus","year":"1999","journal-title":"J. Math. Chem."},{"key":"2023012512183095000_B23","doi-asserted-by":"crossref","first-page":"1561","DOI":"10.1126\/science.1171243","article-title":"Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees","volume":"324","author":"Liu","year":"2009","journal-title":"Science"},{"key":"2023012512183095000_B24","doi-asserted-by":"crossref","first-page":"1632","DOI":"10.1126\/science.1158395","article-title":"Phylogeny-aware gap placement prevents errors in sequence alignment and evolutionary analysis","volume":"320","author":"L\u00f6ytynoja","year":"2008","journal-title":"Science"},{"key":"2023012512183095000_B25","doi-asserted-by":"crossref","first-page":"863","DOI":"10.1101\/gr.115949.110","article-title":"High sensitivity to aligner and high rate of false positives in the estimates of positive selection in the 12 drosophila genomes","volume":"21","author":"Markova-Raina","year":"2011","journal-title":"Genome Res."},{"key":"2023012512183095000_B26","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1186\/1471-2105-9-137","article-title":"How reliably can we predict the reliability of protein structure predictions?","volume":"9","author":"Miklos","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012512183095000_B27","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1093\/oxfordjournals.molbev.a025779","article-title":"Effects of nucleotide sequence alignment on phylogeny estimation: a case study of 18S rDNAs of apicomplexa","volume":"14","author":"Morrison","year":"1997","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B28","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-Coffee: a novel method for fast and accurate multiple sequence alignment","volume":"302","author":"Notredame","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012512183095000_B29","doi-asserted-by":"crossref","first-page":"e123","DOI":"10.1371\/journal.pcbi.0030123","article-title":"Recent evolutions of multiple sequence alignment algorithms","volume":"3","author":"Notredame","year":"2007","journal-title":"PLoS Comput. Biol."},{"key":"2023012512183095000_B30","doi-asserted-by":"crossref","first-page":"314","DOI":"10.1080\/10635150500541730","article-title":"Multiple sequence alignment accuracy and phylogenetic inference","volume":"55","author":"Ogden","year":"2006","journal-title":"Syst. Biol."},{"key":"2023012512183095000_B31","doi-asserted-by":"crossref","first-page":"1759","DOI":"10.1093\/molbev\/msq066","article-title":"An alignment confidence score capturing robustness to guide tree uncertainty","volume":"27","author":"Penn","year":"2010","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B32","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1186\/1471-2105-7-376","article-title":"Detecting the limits of regulatory element conservation and divergence estimation using pairwise and multiple alignments","volume":"7","author":"Pollard","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012512183095000_B33","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1080\/10635150590947041","article-title":"Joint Bayesian estimation of alignment and phylogeny","volume":"54","author":"Redelings","year":"2005","journal-title":"Syst. Biol."},{"key":"2023012512183095000_B34","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1093\/gbe\/evp012","article-title":"Estimates of positive darwinian selection are inflated by errors in sequencing, annotation, and alignment","volume":"1","author":"Schneider","year":"2009","journal-title":"Genome Biol. Evol."},{"key":"2023012512183095000_B35","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1093\/sysbio\/49.2.369","article-title":"Gaps as characters in sequence-based phylogenetic analyses","volume":"49","author":"Simmons","year":"2000","journal-title":"Syst. Biol."},{"key":"2023012512183095000_B36","doi-asserted-by":"crossref","first-page":"2688","DOI":"10.1093\/bioinformatics\/btl446","article-title":"RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models","volume":"22","author":"Stamatakis","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012512183095000_B37","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1186\/1748-7188-3-6","article-title":"DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment","volume":"3","author":"Subramanian","year":"2008","journal-title":"Algorithms Mol. Biol."},{"key":"2023012512183095000_B38","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1002\/prot.20527","article-title":"Balibase 3.0: latest developments of the multiple sequence alignment benchmark","volume":"61","author":"Thompson","year":"2005","journal-title":"Proteins"},{"key":"2023012512183095000_B39","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1109\/TCBB.2009.68","article-title":"The impact of multiple protein sequence alignment on phylogenetic estimation","volume":"8","author":"Wang","year":"2011","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"2023012512183095000_B40","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1093\/oxfordjournals.molbev.a003851","article-title":"A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach","volume":"18","author":"Whelan","year":"2001","journal-title":"Mol. Biol. Evol."},{"key":"2023012512183095000_B41","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1111\/j.1096-0031.2009.00292.x","article-title":"No so hot - heads or tails is not able to reliably compare multiple sequence alignments","volume":"26","author":"Wise","year":"2010","journal-title":"Cladistics"},{"key":"2023012512183095000_B42","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1126\/science.1151532","article-title":"Alignment uncertainty and genomic analysis","volume":"319","author":"Wong","year":"2008","journal-title":"Science"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/4\/495\/48879457\/bioinformatics_28_4_495.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/4\/495\/48879457\/bioinformatics_28_4_495.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T15:01:35Z","timestamp":1674658895000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/4\/495\/212883"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,12,23]]},"references-count":42,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr701","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,2,15]]},"published":{"date-parts":[[2011,12,23]]}}}