{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,18]],"date-time":"2025-12-18T14:30:27Z","timestamp":1766068227232,"version":"3.43.0"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T00:00:00Z","timestamp":1753401600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100018537","name":"National Science and Technology Major Project","doi-asserted-by":"publisher","award":["2022ZD0117700"],"award-info":[{"award-number":["2022ZD0117700"]}],"id":[{"id":"10.13039\/501100018537","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62425107","62450002"],"award-info":[{"award-number":["62425107","62450002"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Municipal Government of Quzhou","award":["2024D001"],"award-info":[{"award-number":["2024D001"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,8,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Reliable protein multiple sequence alignment (MSA) is essential for downstream biomedical research and directly impacts the accuracy of analytical results. However, protein sequences often exhibit low similarity and complex alignment patterns, and existing general alignment tools frequently fall short in terms of accuracy. Many current realignment methods are outdated, suffering from issues such as code obsolescence and inadequate precision. As a result, there is a pressing need for realignment methods that can better address these challenges.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>This study introduces ReAlign-P, a realignment tool designed specifically for protein MSA. ReAlign-P first divides the initial alignment into three regions and applies a novel vertical iterative realignment strategy to optimize the more conserved middle region. This method is by default compatible with MUSCLE5 for realignment, leading to a significant improvement in accuracy. We evaluated initial alignments generated using 10 different MSA parameter configurations across four protein benchmark datasets. The results demonstrate that ReAlign-P consistently outperforms or matches the quality of the initial alignments in all cases. In contrast, RASCAL\u2014the only other currently functional protein realignment tool\u2014sometimes even reduces alignment quality. ReAlign-P not only delivers more substantial improvements but also exhibits greater stability, effectively addressing the gap in available protein realignment tools.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code and test data for ReAlign-P are available on GitHub (https:\/\/github.com\/malabz\/ReAlign-P).<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf421","type":"journal-article","created":{"date-parts":[[2025,7,25]],"date-time":"2025-07-25T18:02:27Z","timestamp":1753466547000},"source":"Crossref","is-referenced-by-count":2,"title":["ReAlign-P: a vertical iterative realignment method for protein multiple sequence alignment"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5068-5672","authenticated-orcid":false,"given":"Yixiao","family":"Zhai","sequence":"first","affiliation":[{"name":"Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China , Chengdu, 610054,","place":["China"]},{"name":"Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China , Quzhou, 324003,","place":["China"]}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-1788-3084","authenticated-orcid":false,"given":"Pinglu","family":"Zhang","sequence":"additional","affiliation":[{"name":"Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China , Chengdu, 610054,","place":["China"]},{"name":"Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China , Quzhou, 324003,","place":["China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6406-1142","authenticated-orcid":false,"given":"Quan","family":"Zou","sequence":"additional","affiliation":[{"name":"Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China , Chengdu, 610054,","place":["China"]},{"name":"Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China , Quzhou, 324003,","place":["China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2956-6799","authenticated-orcid":false,"given":"Ximei","family":"Luo","sequence":"additional","affiliation":[{"name":"Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China , Chengdu, 610054,","place":["China"]}]}],"member":"286","published-online":{"date-parts":[[2025,7,25]]},"reference":[{"key":"2025081218215032500_btaf421-B1","doi-asserted-by":"publisher","first-page":"369","DOI":"10.1089\/cmb.1997.4.369","article-title":"ReAligner: A program for refining DNA sequence multi-alignments","volume":"4","author":"Anson","year":"1997","journal-title":"J Comput Biol"},{"key":"2025081218215032500_btaf421-B2","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1101\/gr.926603","article-title":"LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA","volume":"13","author":"Brudno","year":"2003","journal-title":"Genome Res"},{"key":"2025081218215032500_btaf421-B3","doi-asserted-by":"crossref","first-page":"2598","DOI":"10.1093\/nar\/gkl274","article-title":"Refining multiple sequence alignments with conserved core regions","volume":"34","author":"Chakrabarti","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2025081218215032500_btaf421-B4","doi-asserted-by":"crossref","first-page":"546","DOI":"10.3390\/biom12040546","article-title":"Developments in algorithms for sequence alignment: a review","volume":"12","author":"Chao","year":"2022","journal-title":"Biomolecules"},{"key":"2025081218215032500_btaf421-B5","doi-asserted-by":"crossref","first-page":"33964","DOI":"10.1038\/srep33964","article-title":"FAMSA: fast and accurate multiple sequence alignment of huge protein families","volume":"6","author":"Deorowicz","year":"2016","journal-title":"Sci Rep"},{"key":"2025081218215032500_btaf421-B6","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1101\/gr.2821705","article-title":"ProbCons: probabilistic consistency-based multiple sequence alignment","volume":"15","author":"Do","year":"2005","journal-title":"Genome Res"},{"key":"2025081218215032500_btaf421-B7","doi-asserted-by":"crossref","first-page":"1792","DOI":"10.1093\/nar\/gkh340","article-title":"MUSCLE: multiple sequence alignment with high accuracy and high throughput","volume":"32","author":"Edgar","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2025081218215032500_btaf421-B8","doi-asserted-by":"crossref","first-page":"6968","DOI":"10.1038\/s41467-022-34630-w","article-title":"Muscle5: high-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny","volume":"13","author":"Edgar","year":"2022","journal-title":"Nat Commun"},{"key":"2025081218215032500_btaf421-B9","doi-asserted-by":"crossref","first-page":"41553","DOI":"10.1038\/srep41553","article-title":"QuickProbs 2: towards rapid construction of high-quality alignments of large protein families","volume":"7","author":"Gudy\u015b","year":"2017","journal-title":"Sci Rep"},{"key":"2025081218215032500_btaf421-B10","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2025081218215032500_btaf421-B11","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1093\/molbev\/mst010","article-title":"MAFFT multiple sequence alignment software version 7: improvements in performance and usability","volume":"30","author":"Katoh","year":"2013","journal-title":"Mol Biol Evol"},{"key":"2025081218215032500_btaf421-B12","doi-asserted-by":"crossref","first-page":"1958","DOI":"10.1093\/bioinformatics\/btq338","article-title":"MSAProbs: multiple sequence alignment based on pair hidden markov models and partition function posterior probabilities","volume":"26","author":"Liu","year":"2010","journal-title":"Bioinformatics"},{"key":"2025081218215032500_btaf421-B13","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1186\/1471-2105-15-265","article-title":"ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach","volume":"15","author":"Lyras","year":"2014","journal-title":"BMC Bioinformatics"},{"first-page":"111","year":"2005","author":"Manohar","key":"2025081218215032500_btaf421-B14"},{"key":"2025081218215032500_btaf421-B15","doi-asserted-by":"crossref","first-page":"527","DOI":"10.31449\/inf.v43i4.2621","article-title":"Refin-Align: new refinement algorithm for multiple sequence alignment","volume":"43","author":"Mokaddem","year":"2019","journal-title":"Informatica"},{"key":"2025081218215032500_btaf421-B16","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-Coffee: a novel method for fast and accurate multiple sequence alignment","volume":"302","author":"Notredame","year":"2000","journal-title":"J Mol Biol"},{"key":"2025081218215032500_btaf421-B17","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1186\/1471-2105-4-47","article-title":"OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy","volume":"4","author":"Raghava","year":"2003","journal-title":"BMC Bioinform"},{"key":"2025081218215032500_btaf421-B18","doi-asserted-by":"crossref","first-page":"13.1","DOI":"10.1002\/0471250953.bi0313s48","article-title":"Clustal omega","volume":"48","author":"Sievers","year":"2014","journal-title":"Curr Protoc Bioinform"},{"key":"2025081218215032500_btaf421-B19","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1002\/prot.20527","article-title":"BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark","volume":"61","author":"Thompson","year":"2005","journal-title":"Proteins"},{"key":"2025081218215032500_btaf421-B20","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1093\/bioinformatics\/btg133","article-title":"RASCAL: rapid scanning and correction of multiple sequence alignments","volume":"19","author":"Thompson","year":"2003","journal-title":"Bioinformatics"},{"key":"2025081218215032500_btaf421-B21","doi-asserted-by":"crossref","first-page":"1267","DOI":"10.1093\/bioinformatics\/bth493","article-title":"SABmark\u2014a benchmark for sequence alignment that covers the entire known fold space","volume":"21","author":"Van Walle","year":"2005","journal-title":"Bioinformatics"},{"key":"2025081218215032500_btaf421-B22","doi-asserted-by":"crossref","first-page":"1408","DOI":"10.1093\/bioinformatics\/bti159","article-title":"Evaluation of iterative alignment algorithms for multiple alignment","volume":"21","author":"Wallace","year":"2005","journal-title":"Bioinformatics"},{"key":"2025081218215032500_btaf421-B23","doi-asserted-by":"crossref","first-page":"212106","DOI":"10.1007\/s11432-024-4171-9","article-title":"SBSM-Pro: support bio-sequence machine for proteins","volume":"67","author":"Wang","year":"2024","journal-title":"Sci China Inf Sci"},{"key":"2025081218215032500_btaf421-B24","doi-asserted-by":"crossref","first-page":"2240002","DOI":"10.1142\/S0219720022400029","article-title":"RPfam: a refiner towards curated-like multiple sequence alignments of the pfam protein families","volume":"20","author":"Wei","year":"2022","journal-title":"J Bioinform Comput Biol"},{"key":"2025081218215032500_btaf421-B25","doi-asserted-by":"crossref","first-page":"1435","DOI":"10.1093\/oxfordjournals.molbev.a003929","article-title":"Secator: a program for inferring protein subfamilies from phylogenetic trees","volume":"18","author":"Wicker","year":"2001","journal-title":"Mol Biol Evol"},{"key":"2025081218215032500_btaf421-B26","first-page":"434","article-title":"On the PAM matrix model of protein evolution","volume":"2","author":"Wilbur","year":"1985","journal-title":"Mol Biol Evol"},{"key":"2025081218215032500_btaf421-B27","doi-asserted-by":"crossref","first-page":"e1011988","DOI":"10.1371\/journal.pcbi.1011988","article-title":"TPMA: a two pointers meta-alignment tool to ensemble different multiple nucleic acid sequence alignments","volume":"20","author":"Zhai","year":"2024","journal-title":"PLOS Comput Biol"},{"key":"2025081218215032500_btaf421-B28","doi-asserted-by":"crossref","first-page":"lqae170","DOI":"10.1093\/nargab\/lqae170","article-title":"ReAlign-N: an integrated realignment approach for multiple nucleic acid sequence alignment, combining global and local realignments","volume":"6","author":"Zhai","year":"2024","journal-title":"NAR Genomics Bioinform"},{"key":"2025081218215032500_btaf421-B29","doi-asserted-by":"crossref","first-page":"295","DOI":"10.2174\/0929866526666190806143959","article-title":"SpliVert: a protein multiple sequence alignment refinement method based on splitting-splicing vertically","volume":"27","author":"Zhan","year":"2020","journal-title":"Protein Pept Lett"},{"key":"2025081218215032500_btaf421-B30","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1142\/9781860948732_0026","article-title":"MANGO: a new approach to multiple sequence alignment","volume":"6","author":"Zhang","year":"2007","journal-title":"Comput Syst Bioinform"},{"key":"2025081218215032500_btaf421-B31","doi-asserted-by":"crossref","first-page":"btae718","DOI":"10.1093\/bioinformatics\/btae718","article-title":"HAlign 4: a new strategy for rapidly aligning millions of sequences","volume":"40","author":"Zhou","year":"2024","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf421\/63852766\/btaf421.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/8\/btaf421\/63852766\/btaf421.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/8\/btaf421\/63852766\/btaf421.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,12]],"date-time":"2025-08-12T22:21:59Z","timestamp":1755037319000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf421\/8213638"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,7,25]]},"references-count":31,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2025,8,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf421","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2025,8]]},"published":{"date-parts":[[2025,7,25]]},"article-number":"btaf421"}}