{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T11:05:07Z","timestamp":1775041507349,"version":"3.50.1"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2025,5,28]],"date-time":"2025-05-28T00:00:00Z","timestamp":1748390400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Belgian Fund for Scientific Research"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,6,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Predicting how mutations impact protein biophysical properties remains a significant challenge in computational biology. In recent years, numerous predictors, primarily deep learning models, have been developed to address this problem; however, issues such as their lack of interpretability and limited accuracy persist.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We showed that a simple evolutionary score, based on the log-odd ratio of wild-type and mutated residue frequencies in evolutionary related proteins, when scaled by the residue\u2019s relative solvent accessibility, performs on par with or slightly outperforms most of the benchmarked predictors, many of which are considerably more complex. The evaluation is performed on mutations from the ProteinGym deep mutational scanning dataset collection, which measures various properties such as stability, activity or fitness. This raises further questions about what these complex models actually learn and highlights their limitations in addressing prediction of mutational landscape.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The RSALOR model is available as a user-friendly Python package that can be installed from the PyPI repository. The code is freely available at https:\/\/github.com\/3BioCompBio\/RSALOR.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf322","type":"journal-article","created":{"date-parts":[[2025,5,27]],"date-time":"2025-05-27T07:32:08Z","timestamp":1748331128000},"source":"Crossref","is-referenced-by-count":9,"title":["Residue conservation and solvent accessibility are (almost) all you need for predicting mutational effects in proteins"],"prefix":"10.1093","volume":"41","author":[{"given":"Matsvei","family":"Tsishyn","sequence":"first","affiliation":[{"name":"Computational Biology and Bioinformatics, Universit\u00e9 Libre de Bruxelles , Brussels 1050,","place":["Belgium"]},{"name":"Interuniversity Institute of Bioinformatics in Brussels , Bruxelles 1050,","place":["Belgium"]}]},{"given":"Pauline","family":"Hermans","sequence":"additional","affiliation":[{"name":"Computational Biology and Bioinformatics, Universit\u00e9 Libre de Bruxelles , Brussels 1050,","place":["Belgium"]},{"name":"Interuniversity Institute of Bioinformatics in Brussels , Bruxelles 1050,","place":["Belgium"]}]},{"given":"Marianne","family":"Rooman","sequence":"additional","affiliation":[{"name":"Computational Biology and Bioinformatics, Universit\u00e9 Libre de Bruxelles , Brussels 1050,","place":["Belgium"]},{"name":"Interuniversity Institute of Bioinformatics in Brussels , Bruxelles 1050,","place":["Belgium"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2916-022X","authenticated-orcid":false,"given":"Fabrizio","family":"Pucci","sequence":"additional","affiliation":[{"name":"Computational Biology and Bioinformatics, Universit\u00e9 Libre de Bruxelles , Brussels 1050,","place":["Belgium"]},{"name":"Interuniversity Institute of Bioinformatics in Brussels , Bruxelles 1050,","place":["Belgium"]}]}],"member":"286","published-online":{"date-parts":[[2025,5,28]]},"reference":[{"key":"2025070408280409700_btaf322-B1","doi-asserted-by":"crossref","first-page":"4480","DOI":"10.1038\/s41598-018-22531-2","article-title":"Prediction and interpretation of deleterious coding variants in terms of protein structural stability","volume":"8","author":"Ancien","year":"2018","journal-title":"Sci Rep"},{"key":"2025070408280409700_btaf322-B2","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1186\/s13059-023-02986-x","article-title":"An atlas of variant effects to understand the genome at nucleotide resolution","volume":"24","author":"Fowler","year":"2023","journal-title":"Genome Biology"},{"key":"2025070408280409700_btaf322-B3","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1038\/s41586-021-04043-8","article-title":"Disease variant prediction with deep generative models of evolutionary data","volume":"599","author":"Frazer","year":"2021","journal-title":"Nature"},{"key":"2025070408280409700_btaf322-B4","doi-asserted-by":"crossref","first-page":"102713","DOI":"10.1016\/j.copbio.2022.102713","article-title":"Machine learning to navigate fitness landscapes for protein engineering","volume":"75","author":"Freschlin","year":"2022","journal-title":"Curr Opin Biotechnol"},{"key":"2025070408280409700_btaf322-B5","first-page":"msae267","article-title":"Exploring evolution to uncover insights into protein mutational stability","author":"Hermans","year":"2024","journal-title":"Mol Biol Evol"},{"key":"2025070408280409700_btaf322-B6","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with alphafold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2025070408280409700_btaf322-B7","doi-asserted-by":"crossref","first-page":"2604","DOI":"10.1093\/molbev\/msz179","article-title":"GEMME: a simple and fast global epistatic model predicting mutational effects","volume":"36","author":"Laine","year":"2019","journal-title":"Mol Biol Evol"},{"key":"2025070408280409700_btaf322-B8","author":"Li"},{"key":"2025070408280409700_btaf322-B9","doi-asserted-by":"crossref","first-page":"1123","DOI":"10.1126\/science.ade2574","article-title":"Evolutionary-scale prediction of atomic-level protein structure with a language model","volume":"379","author":"Lin","year":"2023","journal-title":"Science"},{"key":"2025070408280409700_btaf322-B10","doi-asserted-by":"crossref","first-page":"e11474","DOI":"10.15252\/msb.202211474","article-title":"Updated benchmarking of variant effect predictors using deep mutational scanning","volume":"19","author":"Livesey","year":"2023","journal-title":"Mol Syst Biol"},{"key":"2025070408280409700_btaf322-B11","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btae621","article-title":"Expert-guided protein language models enable accurate and blazingly fast fitness prediction","volume":"40","author":"Marquet","year":"2024","journal-title":"Bioinformatics"},{"key":"2025070408280409700_btaf322-B12","doi-asserted-by":"crossref","first-page":"E1293","DOI":"10.1073\/pnas.1111471108","article-title":"Direct-coupling analysis of residue coevolution captures native contacts across many protein families","volume":"108","author":"Morcos","year":"2011","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2025070408280409700_btaf322-B13","article-title":"ProteinGym: large-scale benchmarks for protein fitness prediction and design","volume":"36","author":"Notin","year":"2024","journal-title":"Adv Neural Inform Process Syst"},{"key":"2025070408280409700_btaf322-B14","doi-asserted-by":"crossref","article-title":"TranceptEVE: combining family-specific and family-agnostic models of protein sequences for improved fitness prediction","author":"Notin","DOI":"10.1101\/2022.12.07.519495"},{"key":"2025070408280409700_btaf322-B15","doi-asserted-by":"crossref","first-page":"3659","DOI":"10.1093\/bioinformatics\/bty348","article-title":"Quantification of biases in predictions of protein stability changes upon mutations","volume":"34","author":"Pucci","year":"2018","journal-title":"Bioinformatics"},{"key":"2025070408280409700_btaf322-B16","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/j.sbi.2021.11.001","article-title":"Artificial intelligence challenges for predicting the impact of mutations on protein stability","volume":"72","author":"Pucci","year":"2022","journal-title":"Curr Opin Struct Biol"},{"key":"2025070408280409700_btaf322-B17","first-page":"281","author":"Rastogi"},{"key":"2025070408280409700_btaf322-B18","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1093\/protein\/12.2.85","article-title":"Twilight zone of protein sequence alignments","volume":"12","author":"Rost","year":"1999","journal-title":"Protein Eng"},{"key":"2025070408280409700_btaf322-B19","doi-asserted-by":"crossref","first-page":"440","DOI":"10.1126\/science.aba3304","article-title":"An evolution-based model for designing chorismate mutase enzymes","volume":"369","author":"Russ","year":"2020","journal-title":"Science"},{"key":"2025070408280409700_btaf322-B20","article-title":"Protein stability is determined by single-site bias rather than pairwise covariance","author":"Sternke","year":"2025","journal-title":"bioRxiv"},{"key":"2025070408280409700_btaf322-B21","doi-asserted-by":"crossref","article-title":"SaProt: protein language modeling with structure-aware vocabulary","author":"Su","DOI":"10.1101\/2023.10.01.560349"},{"key":"2025070408280409700_btaf322-B22","first-page":"77379","article-title":"Poet: a generative model of protein families as sequences-of-sequences","volume":"36","author":"Truong","year":"2023","journal-title":"Adv Neural Inform Process Syst"},{"key":"2025070408280409700_btaf322-B23","article-title":"Quantification of biases in predictions of protein\u2013protein binding affinity changes upon mutations","volume":"25","author":"Tsishyn","year":"2025","journal-title":"Brief Bioinform"},{"key":"2025070408280409700_btaf322-B24","doi-asserted-by":"crossref","first-page":"3653","DOI":"10.1093\/bioinformatics\/bty340","article-title":"Self-consistency test reveals systematic bias in programs for prediction change of stability upon mutation","volume":"34","author":"Usmanova","year":"2018","journal-title":"Bioinformatics"},{"key":"2025070408280409700_btaf322-B25","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1002\/prot.24176","article-title":"Prediction of phenotypes of missense mutations in human proteins from biological assemblies","volume":"81","author":"Wei","year":"2013","journal-title":"Proteins: Struct Funct Bioinf"},{"key":"2025070408280409700_btaf322-B26","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1073\/pnas.0805923106","article-title":"Identification of direct residue contacts in protein\u2013protein interaction by message passing","volume":"106","author":"Weigt","year":"2009","journal-title":"Proc Natl Acad Sci U S A"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf322\/63393354\/btaf322.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/6\/btaf322\/63393354\/btaf322.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/6\/btaf322\/63393354\/btaf322.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,4]],"date-time":"2025-07-04T08:28:11Z","timestamp":1751617691000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf322\/8152299"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,5,28]]},"references-count":26,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2025,6,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf322","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2025.02.03.636212","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,6]]},"published":{"date-parts":[[2025,5,28]]},"article-number":"btaf322"}}