{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:34Z","timestamp":1772138074529,"version":"3.50.1"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2025,4,12]],"date-time":"2025-04-12T00:00:00Z","timestamp":1744416000000},"content-version":"vor","delay-in-days":14,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,3,29]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Evolutionary rates in protein-coding genes vary widely, reflecting functional and\/or structural constraints. Essential or highly expressed proteins tend to evolve more slowly, and within a protein, different amino acid sites experience distinct selective pressures. Accurately modeling this variation is critical for identifying functional and\/or structurally important amino acid sites. Standard methods assume independent substitution rates across sites, and the most conserved ones are widely distributed in protein tertiary structure. This is biologically unrealistic, as functional sites tend to cluster in 3D space.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here, we developed CONSTRUCT, an improved strategy for detecting functional and structurally important regions in protein tertiary structure. Given a set of orthologous sequences, CONSTRUCT first estimates site-specific substitution rates using the Rate4site model. These rates are then weighted by the rates of neighboring amino acid sites within an optimally defined window size, determined by the strongest spatial correlation. To refine clustering detection, CONSTRUCT can analyze either C\u03b1 atoms or the center of mass of amino acid sites, accounting for side chain orientation. Extensive simulations and validation on 14 functionally characterized proteins of diverse sizes, interspecies conservation levels, and taxonomic origins demonstrated the robustness of CONSTRUCT. The results highlight CONSTRUCT as a powerful tool for guiding site-directed mutagenesis experiments aimed at elucidating protein function.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The CONSTRUCT program and documentation are freely available at https:\/\/github.com\/Rcoppee\/CONSTRUCT.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf166","type":"journal-article","created":{"date-parts":[[2025,4,10]],"date-time":"2025-04-10T07:44:45Z","timestamp":1744271085000},"source":"Crossref","is-referenced-by-count":2,"title":["CONSTRUCT: an algorithmic tool for identifying functional or structurally important regions in protein tertiary structure"],"prefix":"10.1093","volume":"41","author":[{"given":"Lucas","family":"Chivot","sequence":"first","affiliation":[{"name":"Universit\u00e9 de Rouen Normandie, Laboratoire de Parasitologie-Mycologie, ESCAPE , F-76000 Rouen,","place":["France"]}]},{"given":"No\u00e9","family":"Mathieux","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Rouen Normandie, Laboratoire de Parasitologie-Mycologie, ESCAPE , F-76000 Rouen,","place":["France"]}]},{"given":"Anna","family":"Cosson","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Rouen Normandie, Laboratoire de Parasitologie-Mycologie, ESCAPE , F-76000 Rouen,","place":["France"]}]},{"given":"Antoine","family":"Bridier-Nahmias","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris Cit\u00e9 et Sorbonne Paris Nord, Inserm, IAME , F-75018 Paris,","place":["France"]}]},{"given":"Lo\u00efc","family":"Favennec","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Rouen Normandie, Laboratoire de Parasitologie-Mycologie, ESCAPE , F-76000 Rouen,","place":["France"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5138-361X","authenticated-orcid":false,"given":"Jean-Christophe","family":"Gelly","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris Cit\u00e9 et Universit\u00e9 des Antilles et Universit\u00e9 de la R\u00e9union, Inserm, BIGR , F-75015 Paris,","place":["France"]}]},{"given":"J\u00e9r\u00f4me","family":"Clain","sequence":"additional","affiliation":[{"name":"Universit\u00e9 Paris Cit\u00e9, IRD, Inserm, MERIT , F-75006 Paris,","place":["France"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3024-5928","authenticated-orcid":false,"given":"Romain","family":"Copp\u00e9e","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Rouen Normandie, Laboratoire de Parasitologie-Mycologie, ESCAPE , F-76000 Rouen,","place":["France"]}]}],"member":"286","published-online":{"date-parts":[[2025,4,12]]},"reference":[{"key":"2025042719320300900_btaf166-B1","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1016\/S0014-5793(99)01177-1","article-title":"Molecular characterization of Vig4\/Vrg4 GDP-mannose transporter of the yeast Saccharomyces cerevisiae","volume":"458","author":"Abe","year":"1999","journal-title":"FEBS Lett"},{"key":"2025042719320300900_btaf166-B2","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1038\/s41586-024-07487-w","article-title":"Accurate structure prediction of biomolecular interactions with AlphaFold 3","volume":"630","author":"Abramson","year":"2024","journal-title":"Nature"},{"key":"2025042719320300900_btaf166-B3","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1007\/s00239-004-0223-4","article-title":"Tertiary windowing to detect positive diversifying selection","volume":"60","author":"Berglund","year":"2005","journal-title":"J Mol Evol"},{"key":"2025042719320300900_btaf166-B4","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1016\/j.freeradbiomed.2015.05.034","article-title":"Structural basis of Keap1 interactions with Nrf2","volume":"88","author":"Canning","year":"2015","journal-title":"Free Radic Biol Med"},{"key":"2025042719320300900_btaf166-B5","doi-asserted-by":"crossref","first-page":"1875","DOI":"10.1093\/bioinformatics\/btm270","article-title":"Predicting functionally important residues from sequence conservation","volume":"23","author":"Capra","year":"2007","journal-title":"Bioinformatics"},{"key":"2025042719320300900_btaf166-B6","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1023\/A:1022430906284","article-title":"CFTR: domains, structure, and function","volume":"29","author":"Devidas","year":"1997","journal-title":"J Bioenerg Biomembr"},{"key":"2025042719320300900_btaf166-B7","doi-asserted-by":"crossref","first-page":"5306","DOI":"10.1046\/j.1432-1327.2000.01614.x","article-title":"Nucleotide-binding domain 1 of cystic fibrosis transmembrane conductance regulator production of a suitable protein for structural studies","volume":"267","author":"Duffieux","year":"2000","journal-title":"Eur J Biochem"},{"key":"2025042719320300900_btaf166-B8","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1093\/molbev\/msy244","article-title":"Beyond stability constraints: a biophysical model of enzyme evolution with selection on stability and activity","volume":"36","author":"Echave","year":"2019","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B9","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1038\/nrg.2015.18","article-title":"Causes of evolutionary rate variation among protein sites","volume":"17","author":"Echave","year":"2016","journal-title":"Nat Rev Genet"},{"key":"2025042719320300900_btaf166-B10","doi-asserted-by":"crossref","first-page":"msae026","DOI":"10.1093\/molbev\/msae026","article-title":"Substitution models of protein evolution with selection on enzymatic activity","volume":"41","author":"Ferreiro","year":"2024","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B11","doi-asserted-by":"crossref","first-page":"2387","DOI":"10.1093\/molbev\/msp146","article-title":"Structural determinants of protein evolution are context-sensitive at the residue level","volume":"26","author":"Franzosa","year":"2009","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B12","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1093\/bioinformatics\/btu673","article-title":"FuncPatch: a web server for the fast Bayesian inference of conserved functional patches in protein 3D structures","volume":"31","author":"Huang","year":"2015","journal-title":"Bioinformatics"},{"key":"2025042719320300900_btaf166-B13","doi-asserted-by":"crossref","first-page":"e1003429","DOI":"10.1371\/journal.pcbi.1003429","article-title":"Phylogenetic gaussian process model for the inference of functionally important regions in protein tertiary structures","volume":"10","author":"Huang","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2025042719320300900_btaf166-B14","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1016\/j.jmb.2004.01.053","article-title":"Prediction of functional sites in proteins using conserved functional group analysis","volume":"337","author":"Innis","year":"2004","journal-title":"J Mol Biol"},{"key":"2025042719320300900_btaf166-B15","doi-asserted-by":"crossref","first-page":"2848","DOI":"10.1073\/pnas.71.7.2848","article-title":"On some principles governing molecular evolution","volume":"71","author":"Kimura","year":"1974","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025042719320300900_btaf166-B16","doi-asserted-by":"crossref","first-page":"906","DOI":"10.3390\/biom13060906","article-title":"xProtCAS: a toolkit for extracting conserved accessible surfaces from protein structures","volume":"13","author":"Kotb","year":"2023","journal-title":"Biomolecules"},{"key":"2025042719320300900_btaf166-B17","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1038\/s41586-023-05854-7","article-title":"CFTR function, pathology and pharmacology at single-molecule resolution","volume":"616","author":"Levring","year":"2023","journal-title":"Nature"},{"key":"2025042719320300900_btaf166-B18","doi-asserted-by":"crossref","first-page":"1346","DOI":"10.1074\/jbc.M410968200","article-title":"Impact of the \u0394F508 mutation in first nucleotide-binding domain of human cystic fibrosis transmembrane conductance regulator on domain folding and structure","volume":"280","author":"Lewis","year":"2005","journal-title":"J Biol Chem"},{"key":"2025042719320300900_btaf166-B20","doi-asserted-by":"crossref","first-page":"msac092","DOI":"10.1093\/molbev\/msac092","article-title":"AliSim: a fast and versatile phylogenetic sequence simulator for the genomic era","volume":"39","author":"Ly-Trong","year":"2022","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B21","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1006\/jmbi.2001.5327","article-title":"Structural clusters of evolutionary trace residues are statistically significant and common in proteins","volume":"316","author":"Madabushi","year":"2002","journal-title":"J Mol Biol"},{"key":"2025042719320300900_btaf166-B22","doi-asserted-by":"crossref","first-page":"e911","DOI":"10.7717\/peerj.911","article-title":"Too packed to change: side-chain packing and site-specific substitution rates in protein evolution","volume":"3","author":"Marcos","year":"2015","journal-title":"PeerJ"},{"key":"2025042719320300900_btaf166-B23","doi-asserted-by":"crossref","first-page":"1530","DOI":"10.1093\/molbev\/msaa015","article-title":"IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era","volume":"37","author":"Minh","year":"2020","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B24","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1111\/eva.12052","article-title":"Structural dynamics flexibility informs function and evolution at a proteome scale","volume":"6","author":"Nevin Gerek","year":"2013","journal-title":"Evol Appl"},{"key":"2025042719320300900_btaf166-B25","doi-asserted-by":"crossref","first-page":"i328","DOI":"10.1093\/bioinformatics\/bti1023","article-title":"In silico identification of functional regions in proteins","volume":"21","author":"Nimrod","year":"2005","journal-title":"Bioinformatics"},{"key":"2025042719320300900_btaf166-B26","doi-asserted-by":"crossref","first-page":"884","DOI":"10.1110\/ps.03465504","article-title":"Prediction of functional sites by analysis of sequence and structure conservation","volume":"13","author":"Panchenko","year":"2004","journal-title":"Protein Sci"},{"key":"2025042719320300900_btaf166-B27","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1038\/nature24464","article-title":"Structural basis of nucleotide sugar transport across the Golgi membrane","volume":"551","author":"Parker","year":"2017","journal-title":"Nature"},{"key":"2025042719320300900_btaf166-B28","doi-asserted-by":"crossref","first-page":"S71","DOI":"10.1093\/bioinformatics\/18.suppl_1.S71","article-title":"Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues","volume":"18","author":"Pupko","year":"2002","journal-title":"Bioinformatics"},{"key":"2025042719320300900_btaf166-B30","doi-asserted-by":"crossref","first-page":"4258","DOI":"10.1038\/s41467-020-18101-8","article-title":"CFTR trafficking mutations disrupt cotranslational protein folding by targeting biosynthetic intermediates","volume":"11","author":"Shishido","year":"2020","journal-title":"Nat Commun"},{"key":"2025042719320300900_btaf166-B31","doi-asserted-by":"crossref","first-page":"2912","DOI":"10.1073\/pnas.042692299","article-title":"Inference of functional regions in proteins by quantification of evolutionary constraints","volume":"99","author":"Simon","year":"2002","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025042719320300900_btaf166-B32","doi-asserted-by":"crossref","first-page":"2352","DOI":"10.1093\/molbev\/msh249","article-title":"Three-dimensional window analysis for detecting positive selection at structural regions of proteins","volume":"21","author":"Suzuki","year":"2004","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B33","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1038\/nsmb881","article-title":"Side chain and backbone contributions of Phe508 to CFTR folding","volume":"12","author":"Thibodeau","year":"2005","journal-title":"Nat Struct Mol Biol"},{"key":"2025042719320300900_btaf166-B34","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1038\/nature17991","article-title":"Structural basis for amino acid export by DMT superfamily transporter YddG","volume":"534","author":"Tsuchiya","year":"2016","journal-title":"Nature"},{"key":"2025042719320300900_btaf166-B35","doi-asserted-by":"crossref","first-page":"2714","DOI":"10.1093\/molbev\/mst151","article-title":"Spatial distribution of selection pressure on a protein based on the hierarchical Bayesian model","volume":"30","author":"Watabe","year":"2013","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B36","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1016\/0169-5347(96)10041-0","article-title":"Among-site rate variation and its impact on phylogenetic analyses","volume":"11","author":"Yang","year":"1996","journal-title":"Trends Ecol Evol"},{"key":"2025042719320300900_btaf166-B37","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1007\/BF00160154","article-title":"Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods","volume":"39","author":"Yang","year":"1994","journal-title":"J Mol Evol"},{"key":"2025042719320300900_btaf166-B38","doi-asserted-by":"crossref","first-page":"1586","DOI":"10.1093\/molbev\/msm088","article-title":"PAML 4: phylogenetic analysis by maximum likelihood","volume":"24","author":"Yang","year":"2007","journal-title":"Mol. Biol. Evol"},{"key":"2025042719320300900_btaf166-B39","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1093\/genetics\/155.1.431","article-title":"Codon-substitution models for heterogeneous selection pressure at amino acid sites","volume":"155","author":"Yang","year":"2000","journal-title":"Genetics"},{"key":"2025042719320300900_btaf166-B40","doi-asserted-by":"crossref","first-page":"694","DOI":"10.1164\/ajrccm.186.7.694","article-title":"A young hispanic with c.1646G&gt;a mutation exhibits severe cystic fibrosis lung disease: is ivacaftor an option for therapy?","volume":"186","author":"Yarlagadda","year":"2012","journal-title":"Am J Respir Crit Care Med"},{"key":"2025042719320300900_btaf166-B41","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1093\/molbev\/mst178","article-title":"Site-specific structural constraints on protein sequence evolutionary divergence: local packing density versus solvent exposure","volume":"31","author":"Yeh","year":"2014","journal-title":"Mol Biol Evol"},{"key":"2025042719320300900_btaf166-B42","doi-asserted-by":"crossref","first-page":"409","DOI":"10.1038\/nrg3950","article-title":"Determinants of the rate of protein sequence evolution","volume":"16","author":"Zhang","year":"2015","journal-title":"Nat Rev Genet"},{"key":"2025042719320300900_btaf166-B43","doi-asserted-by":"crossref","first-page":"12757","DOI":"10.1073\/pnas.1815287115","article-title":"Molecular structure of the ATP-bound, phosphorylated human CFTR","volume":"115","author":"Zhang","year":"2018","journal-title":"Proc Natl Acad Sci USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf166\/62922544\/btaf166.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/4\/btaf166\/62922544\/btaf166.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/4\/btaf166\/62922544\/btaf166.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,27]],"date-time":"2025-04-27T19:32:13Z","timestamp":1745782333000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf166\/8112831"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,3,29]]},"references-count":41,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,3,29]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf166","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2024.10.07.617015","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,4]]},"published":{"date-parts":[[2025,3,29]]},"article-number":"btaf166"}}