{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:20:59Z","timestamp":1772173259295,"version":"3.50.1"},"reference-count":56,"publisher":"Public Library of Science (PLoS)","issue":"2","license":[{"start":{"date-parts":[[2024,2,20]],"date-time":"2024-02-20T00:00:00Z","timestamp":1708387200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"European Union Next-GenerationEU"},{"DOI":"10.13039\/100010661","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["734439"],"award-info":[{"award-number":["734439"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>\n                    The design of proteins with specific tasks is a major challenge in molecular biology with important diagnostic and therapeutic applications. High-throughput screening methods have been developed to systematically evaluate protein activity, but only a small fraction of possible protein variants can be tested using these techniques. Computational models that explore the sequence space\n                    <jats:italic>in-silico<\/jats:italic>\n                    to identify the fittest molecules for a given function are needed to overcome this limitation. In this article, we propose AnnealDCA, a machine-learning framework to learn the protein fitness landscape from sequencing data derived from a broad range of experiments that use selection and sequencing to quantify protein activity. We demonstrate the effectiveness of our method by applying it to antibody Rep-Seq data of immunized mice and screening experiments, assessing the quality of the fitness landscape reconstructions. 
Our method can be applied to several experimental cases where a population of protein variants undergoes various rounds of selection and sequencing, without relying on the computation of variants enrichment ratios, and thus can be used even in cases of disjoint sequence samples.\n                  <\/jats:p>","DOI":"10.1371\/journal.pcbi.1011812","type":"journal-article","created":{"date-parts":[[2024,2,20]],"date-time":"2024-02-20T13:20:52Z","timestamp":1708435252000},"page":"e1011812","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":3,"title":["Inference of annealed protein fitness landscapes with AnnealDCA"],"prefix":"10.1371","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6051-0972","authenticated-orcid":true,"given":"Luca","family":"Sesta","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6509-0807","authenticated-orcid":true,"given":"Andrea","family":"Pagnani","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4476-805X","authenticated-orcid":true,"given":"Jorge","family":"Fernandez-de-Cossio-Diaz","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4192-2864","authenticated-orcid":true,"given":"Guido","family":"Uguzzoni","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2024,2,20]]},"reference":[{"issue":"9","key":"pcbi.1011812.ref001","doi-asserted-by":"crossref","first-page":"e1010561","DOI":"10.1371\/journal.pcbi.1010561","article-title":"Generative and interpretable machine learning for aptamer design and analysis of in vitro sequence selection","volume":"18","author":"A Di Gioacchino","year":"2022","journal-title":"PLoS computational biology"},{"issue":"32","key":"pcbi.1011812.ref002","doi-asserted-by":"crossref","first-page":"E7550","DOI":"10.1073\/pnas.1804015115","article-title":"Inferring the shape of global 
epistasis","volume":"115","author":"J Otwinowski","year":"2018","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"10","key":"pcbi.1011812.ref003","doi-asserted-by":"crossref","first-page":"2345","DOI":"10.1093\/molbev\/msy141","article-title":"Biophysical Inference of Epistasis and the Effects of Mutations on Protein Stability and Function","volume":"35","author":"J Otwinowski","year":"2018","journal-title":"Molecular Biology and Evolution"},{"issue":"22","key":"pcbi.1011812.ref004","doi-asserted-by":"crossref","first-page":"E2301","DOI":"10.1073\/pnas.1400849111","article-title":"Inferring fitness landscapes by regression produces biased estimates of epistasis","volume":"111","author":"J Otwinowski","year":"2014","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1011812.ref005","doi-asserted-by":"crossref","first-page":"msab321","DOI":"10.1093\/molbev\/msab321","article-title":"Modeling sequence-space exploration and emergence of epistatic signals in protein evolution","volume":"39","author":"M Bisardi","year":"2022","journal-title":"Molecular biology and evolution"},{"issue":"20","key":"pcbi.1011812.ref006","doi-asserted-by":"crossref","first-page":"10908","DOI":"10.3390\/ijms222010908","article-title":"Amala: Analysis of directed evolution experiments via annealed mutational approximated landscape","volume":"22","author":"L Sesta","year":"2021","journal-title":"International journal of molecular sciences"},{"key":"pcbi.1011812.ref007","article-title":"Unsupervised Inference of Protein Fitness Landscape from Deep Mutational Scan","author":"J Fernandez-de Cossio-Diaz","year":"2020","journal-title":"Molecular Biology and Evolution"},{"issue":"42","key":"pcbi.1011812.ref008","doi-asserted-by":"crossref","first-page":"16858","DOI":"10.1073\/pnas.1209751109","article-title":"A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein 
function","volume":"109","author":"CL Araya","year":"2012","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"22","key":"pcbi.1011812.ref009","doi-asserted-by":"crossref","first-page":"2643","DOI":"10.1016\/j.cub.2014.09.072","article-title":"A comprehensive biophysical description of pairwise epistasis throughout an entire protein domain","volume":"24","author":"CA Olson","year":"2014","journal-title":"Current Biology"},{"issue":"2","key":"pcbi.1011812.ref010","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1534\/genetics.115.175802","article-title":"Massively parallel functional analysis of BRCA1 RING domain variants","volume":"200","author":"LM Starita","year":"2015","journal-title":"Genetics"},{"issue":"3","key":"pcbi.1011812.ref011","doi-asserted-by":"crossref","first-page":"594","DOI":"10.1016\/j.cell.2015.09.055","article-title":"Evolving new protein-protein interaction specificity through promiscuous intermediates","volume":"163","author":"CD Aakre","year":"2015","journal-title":"Cell"},{"issue":"3","key":"pcbi.1011812.ref012","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/nmeth.3223","article-title":"Massively parallel single-amino-acid mutagenesis","volume":"12","author":"JO Kitzman","year":"2015","journal-title":"Nature methods"},{"issue":"23","key":"pcbi.1011812.ref013","doi-asserted-by":"crossref","first-page":"7159","DOI":"10.1073\/pnas.1422285112","article-title":"Dissecting enzyme function with microfluidic-based deep mutational scanning","volume":"112","author":"PA Romero","year":"2015","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"14","key":"pcbi.1011812.ref014","doi-asserted-by":"crossref","first-page":"E1263","DOI":"10.1073\/pnas.1303309110","article-title":"Activity-enhancing mutations in an E3 ubiquitin ligase identified by high-throughput mutagenesis","volume":"110","author":"LM Starita","year":"2013","journal-title":"Proceedings of the National Academy of 
Sciences"},{"issue":"32","key":"pcbi.1011812.ref015","doi-asserted-by":"crossref","first-page":"13067","DOI":"10.1073\/pnas.1215206110","article-title":"Capturing the mutational landscape of the beta-lactamase TEM-1","volume":"110","author":"H Jacquier","year":"2013","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"6","key":"pcbi.1011812.ref016","doi-asserted-by":"crossref","first-page":"1581","DOI":"10.1093\/molbev\/msu081","article-title":"A comprehensive, high-resolution map of a gene\u2019s fitness landscape","volume":"31","author":"E Firnberg","year":"2014","journal-title":"Molecular biology and evolution"},{"issue":"5","key":"pcbi.1011812.ref017","doi-asserted-by":"crossref","first-page":"1295","DOI":"10.1016\/j.cell.2020.08.012","article-title":"Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding","volume":"182","author":"TN Starr","year":"2020","journal-title":"cell"},{"issue":"11","key":"pcbi.1011812.ref018","doi-asserted-by":"crossref","first-page":"e1010951","DOI":"10.1371\/journal.ppat.1010951","article-title":"Deep mutational scans for ACE2 binding, RBD expression, and antibody escape in the SARS-CoV-2 Omicron BA. 1 and BA. 
2 receptor-binding domains","volume":"18","author":"TN Starr","year":"2022","journal-title":"PLoS pathogens"},{"issue":"1","key":"pcbi.1011812.ref019","doi-asserted-by":"crossref","first-page":"4162","DOI":"10.1038\/s41467-019-12101-z","article-title":"The mutational landscape of a prion-like domain","volume":"10","author":"B Bolognesi","year":"2019","journal-title":"Nature communications"},{"key":"pcbi.1011812.ref020","doi-asserted-by":"crossref","first-page":"e34420","DOI":"10.7554\/eLife.34420","article-title":"Mapping mutational effects along the evolutionary landscape of HIV envelope","volume":"7","author":"HK Haddox","year":"2018","journal-title":"Elife"},{"issue":"7904","key":"pcbi.1011812.ref021","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1038\/s41586-022-04586-4","article-title":"Mapping the energetic and allosteric landscapes of protein binding domains","volume":"604","author":"AJ Faure","year":"2022","journal-title":"Nature"},{"issue":"4","key":"pcbi.1011812.ref022","doi-asserted-by":"crossref","first-page":"1179","DOI":"10.1093\/molbev\/msz256","article-title":"Protein Structural Information and Evolutionary Landscape by In Vitro Evolution","volume":"37","author":"M Fantini","year":"2019","journal-title":"Molecular Biology and Evolution"},{"issue":"1","key":"pcbi.1011812.ref023","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1016\/j.cels.2019.11.008","article-title":"Protein structure from experimental evolution","volume":"10","author":"MA Stiffler","year":"2020","journal-title":"Cell Systems"},{"issue":"10","key":"pcbi.1011812.ref024","doi-asserted-by":"crossref","DOI":"10.1172\/jci.insight.135112","article-title":"In vivo\u2013directed evolution of adeno-associated virus in the primate retina","volume":"5","author":"LC Byrne","year":"2020","journal-title":"JCI insight"},{"issue":"3","key":"pcbi.1011812.ref025","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1111\/j.1365-2567.2011.03527.x","article-title":"Rep-Seq: 
uncovering the immunological repertoire through next-generation sequencing","volume":"135","author":"J Benichou","year":"2012","journal-title":"Immunology"},{"issue":"9","key":"pcbi.1011812.ref026","doi-asserted-by":"crossref","first-page":"741","DOI":"10.1038\/nmeth.1492","article-title":"High-resolution mapping of protein sequence-function relationships","volume":"7","author":"DM Fowler","year":"2010","journal-title":"Nature methods"},{"key":"pcbi.1011812.ref027","article-title":"In-silico monitoring of directed evolution convergence to unveil best performing variants with credibility score","author":"T Nemoto","year":"2023","journal-title":"bioRxiv"},{"issue":"1","key":"pcbi.1011812.ref028","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1186\/s13059-017-1272-5","article-title":"A statistical framework for analyzing deep mutational scanning data","volume":"18","author":"AF Rubin","year":"2017","journal-title":"Genome biology"},{"issue":"1","key":"pcbi.1011812.ref029","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-020-02091-3","article-title":"DiMSum: an error model and pipeline for analyzing deep mutational scanning data and diagnosing common experimental pathologies","volume":"21","author":"AJ Faure","year":"2020","journal-title":"Genome Biology"},{"issue":"4598","key":"pcbi.1011812.ref030","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1126\/science.220.4598.671","article-title":"Optimization by simulated annealing","volume":"220","author":"S Kirkpatrick","year":"1983","journal-title":"science"},{"key":"pcbi.1011812.ref031","first-page":"2022","article-title":"Learning the differences: a transfer-learning approach to predict antigen immunogenicity and T-cell receptor specificity","author":"B Bravi","year":"2022","journal-title":"bioRxiv"},{"issue":"14","key":"pcbi.1011812.ref032","doi-asserted-by":"crossref","first-page":"e2023141118","DOI":"10.1073\/pnas.2023141118","article-title":"Deep generative selection models 
of T and B cell receptor repertoires with soNNia","volume":"118","author":"G Isacchini","year":"2021","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"4","key":"pcbi.1011812.ref033","doi-asserted-by":"crossref","first-page":"1283","DOI":"10.1103\/RevModPhys.83.1283","article-title":"Statistical genetics and evolution of quantitative traits","volume":"83","author":"RA Neher","year":"2011","journal-title":"Reviews of Modern Physics"},{"issue":"49","key":"pcbi.1011812.ref034","doi-asserted-by":"crossref","first-page":"E1293","DOI":"10.1073\/pnas.1111471108","article-title":"Direct-coupling analysis of residue coevolution captures native contacts across many protein families","volume":"108","author":"F Morcos","year":"2011","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1011812.ref035","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1073\/pnas.0805923106","article-title":"Identification of direct residue contacts in protein\u2013protein interaction by message passing","volume":"106","author":"M Weigt","year":"2009","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"4","key":"pcbi.1011812.ref036","doi-asserted-by":"crossref","first-page":"e1004870","DOI":"10.1371\/journal.pcbi.1004870","article-title":"Maximum-entropy models of sequenced immune repertoires predict antigen-antibody affinity","volume":"12","author":"L Asti","year":"2016","journal-title":"PLoS computational biology"},{"issue":"1","key":"pcbi.1011812.ref037","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1093\/molbev\/msv211","article-title":"Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1","volume":"33","author":"M Figliuzzi","year":"2016","journal-title":"Molecular biology and evolution"},{"issue":"2","key":"pcbi.1011812.ref038","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1038\/nbt.3769","article-title":"Mutation effects 
predicted from sequence co-variation","volume":"35","author":"TA Hopf","year":"2017","journal-title":"Nature biotechnology"},{"issue":"7","key":"pcbi.1011812.ref039","doi-asserted-by":"crossref","first-page":"1260","DOI":"10.1002\/pro.2876","article-title":"How mutational epistasis impairs predictability in protein evolution and design","volume":"25","author":"CM Miton","year":"2016","journal-title":"Protein Science"},{"issue":"7","key":"pcbi.1011812.ref040","doi-asserted-by":"crossref","first-page":"1204","DOI":"10.1002\/pro.2897","article-title":"Epistasis in protein evolution","volume":"25","author":"TN Starr","year":"2016","journal-title":"Protein Science"},{"issue":"4","key":"pcbi.1011812.ref041","doi-asserted-by":"crossref","first-page":"e2113118119","DOI":"10.1073\/pnas.2113118119","article-title":"Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes","volume":"119","author":"J Rodriguez-Rivas","year":"2022","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1011812.ref042","doi-asserted-by":"crossref","first-page":"012707","DOI":"10.1103\/PhysRevE.87.012707","article-title":"Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models","volume":"87","author":"M Ekeberg","year":"2013","journal-title":"Physical Review E"},{"key":"pcbi.1011812.ref043","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1016\/j.jcp.2014.07.024","article-title":"Fast pseudolikelihood maximization for direct-coupling analysis of protein structure from many homologous amino-acid sequences","volume":"276","author":"M Ekeberg","year":"2014","journal-title":"Journal of Computational Physics"},{"issue":"3","key":"pcbi.1011812.ref044","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1016\/j.immuni.2012.11.022","article-title":"Translating HIV sequences into quantitative fitness landscapes predicts viral vulnerabilities for rational immunogen design","volume":"38","author":"AL 
Ferguson","year":"2013","journal-title":"Immunity"},{"issue":"1","key":"pcbi.1011812.ref045","doi-asserted-by":"crossref","DOI":"10.1016\/j.isci.2021.103569","article-title":"Evolutionary modeling reveals enhanced mutational flexibility of HCV subtype 1b compared with 1a","volume":"25","author":"H Zhang","year":"2022","journal-title":"Iscience"},{"issue":"8","key":"pcbi.1011812.ref046","doi-asserted-by":"crossref","first-page":"e1003776","DOI":"10.1371\/journal.pcbi.1003776","article-title":"The fitness landscape of HIV-1 gag: advanced modeling approaches and validation of model predictions by in vitro testing","volume":"10","author":"JK Mann","year":"2014","journal-title":"PLoS computational biology"},{"issue":"1","key":"pcbi.1011812.ref047","doi-asserted-by":"crossref","first-page":"2073","DOI":"10.1038\/s41467-019-09819-1","article-title":"Identifying immunologically-vulnerable regions of the HCV E2 glycoprotein and broadly neutralizing antibodies that target them","volume":"10","author":"AA Quadeer","year":"2019","journal-title":"Nature communications"},{"issue":"1","key":"pcbi.1011812.ref048","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1038\/s41467-019-14174-2","article-title":"Deconvolving mutational patterns of poliovirus outbreaks reveals its intrinsic fitness landscape","volume":"11","author":"AA Quadeer","year":"2020","journal-title":"Nature communications"},{"issue":"3","key":"pcbi.1011812.ref049","doi-asserted-by":"crossref","first-page":"e1501371","DOI":"10.1126\/sciadv.1501371","article-title":"Accurate and predictive antibody repertoire profiling by molecular amplification fingerprinting","volume":"2","author":"TA Khan","year":"2016","journal-title":"Science advances"},{"issue":"6","key":"pcbi.1011812.ref050","doi-asserted-by":"crossref","first-page":"715","DOI":"10.1038\/s41587-020-0466-7","article-title":"High-throughput single-cell activity-based screening and sequencing of antibodies using droplet 
microfluidics","volume":"38","author":"A G\u00e9rard","year":"2020","journal-title":"Nature biotechnology"},{"issue":"8","key":"pcbi.1011812.ref051","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1038\/nmeth.3027","article-title":"Deep mutational scanning: a new style of protein science","volume":"11","author":"DM Fowler","year":"2014","journal-title":"Nature methods"},{"issue":"13","key":"pcbi.1011812.ref052","doi-asserted-by":"crossref","first-page":"3482","DOI":"10.1073\/pnas.1517813113","article-title":"Hierarchy and extremes in selections from pools of randomized proteins","volume":"113","author":"S Boyer","year":"2016","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"pcbi.1011812.ref053","doi-asserted-by":"crossref","first-page":"e16965","DOI":"10.7554\/eLife.16965","article-title":"Adaptation in protein fitness landscapes is facilitated by indirect paths","volume":"5","author":"NC Wu","year":"2016","journal-title":"Elife"},{"issue":"8","key":"pcbi.1011812.ref054","doi-asserted-by":"crossref","first-page":"2502","DOI":"10.4049\/jimmunol.1800708","article-title":"Observed antibody space: a resource for data mining next-generation sequencing of antibody repertoires","volume":"201","author":"A Kovaltsuk","year":"2018","journal-title":"The Journal of Immunology"},{"issue":"1-2","key":"pcbi.1011812.ref055","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.jim.2009.10.002","article-title":"Comparison of the results obtained by ELISA and surface plasmon resonance for the determination of antibody affinity","volume":"352","author":"L Heinrich","year":"2010","journal-title":"Journal of immunological methods"},{"issue":"9","key":"pcbi.1011812.ref056","doi-asserted-by":"crossref","first-page":"e106699","DOI":"10.1371\/journal.pone.0106699","article-title":"Diversity of the antibody response to tetanus toxoid: comparison of hybridoma library to phage display library","volume":"9","author":"M 
Sorouri","year":"2014","journal-title":"PloS one"}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1011812","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,20]],"date-time":"2024-02-20T13:21:19Z","timestamp":1708435279000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1011812"}},"subtitle":[],"editor":[{"given":"Sushmita","family":"Roy","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,2,20]]},"references-count":56,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2024,2,20]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1011812","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.05.19.541442","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,20]]}}}