{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:05Z","timestamp":1740185105543,"version":"3.37.3"},"reference-count":56,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2019,7,15]],"date-time":"2019-07-15T00:00:00Z","timestamp":1563148800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Spanish Government","award":["RYC-2015-18241","IJCI-2016-29550"],"award-info":[{"award-number":["RYC-2015-18241","IJCI-2016-29550"]}]},{"DOI":"10.13039\/501100010801","name":"Xunta de Galicia","doi-asserted-by":"publisher","award":["ED431F 2018\/08"],"award-info":[{"award-number":["ED431F 2018\/08"]}],"id":[{"id":"10.13039\/501100010801","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The nonsynonymous\/synonymous substitution rate ratio (dN\/dS) is a commonly used parameter to quantify molecular adaptation in protein-coding data. It is known that the estimation of dN\/dS can be biased if some evolutionary processes are ignored. In this concern, common ML methods to estimate dN\/dS assume invariable codon frequencies among sites, despite this characteristic is rare in nature, and it could bias the estimation of this parameter.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Here we studied the influence of variable codon frequencies among genetic regions on the estimation of dN\/dS. We explored scenarios varying the number of genetic regions that differ in codon frequencies, the amount of variability of codon frequencies among regions and the nucleotide frequencies at each codon position among regions. We found that ignoring heterogeneous codon frequencies among regions overall leads to underestimation of dN\/dS and the bias increases with the level of heterogeneity of codon frequencies. Interestingly, we also found that varying nucleotide frequencies among regions at the first or second codon position leads to underestimation of dN\/dS while variation at the third codon position leads to overestimation of dN\/dS. Next, we present a methodology to reduce this bias based on the analysis of partitions presenting similar codon frequencies and we applied it to analyze four real datasets. We conclude that accounting for heterogeneous codon frequencies along sequences is required to obtain realistic estimates of molecular adaptation through this relevant evolutionary parameter.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The applied frameworks for the computer simulations of protein-coding data and estimation of molecular adaptation are SGWE and PAML, respectively. Both are publicly available and referenced in the study.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz558","type":"journal-article","created":{"date-parts":[[2019,7,11]],"date-time":"2019-07-11T11:10:55Z","timestamp":1562843455000},"page":"430-436","source":"Crossref","is-referenced-by-count":2,"title":["The influence of heterogeneous codon frequencies along sequences on the estimation of molecular adaptation"],"prefix":"10.1093","volume":"36","author":[{"given":"Roberto","family":"Del Amparo","sequence":"first","affiliation":[{"name":"Department of Biochemistry , Genetics and Immunology"},{"name":"Biomedical Research Center (CINBIO), University of Vigo , 36310 Vigo, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6718-3233","authenticated-orcid":false,"given":"Alberto","family":"Vicens","sequence":"additional","affiliation":[{"name":"Department of Biochemistry , Genetics and Immunology"},{"name":"Biomedical Research Center (CINBIO), University of Vigo , 36310 Vigo, Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0516-2717","authenticated-orcid":false,"given":"Miguel","family":"Arenas","sequence":"additional","affiliation":[{"name":"Department of Biochemistry , Genetics and Immunology"},{"name":"Biomedical Research Center (CINBIO), University of Vigo , 36310 Vigo, Spain"}]}],"member":"286","published-online":{"date-parts":[[2019,7,15]]},"reference":[{"key":"2023013112063518900_btz558-B1","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1186\/1471-2148-13-161","article-title":"State-of the art methodologies dictate new standards for phylogenetic analysis","volume":"13","author":"Anisimova","year":"2013","journal-title":"BMC Evol. Biol"},{"key":"2023013112063518900_btz558-B2","doi-asserted-by":"crossref","first-page":"1229","DOI":"10.1093\/genetics\/164.3.1229","article-title":"Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites","volume":"164","author":"Anisimova","year":"2003","journal-title":"Genetics"},{"key":"2023013112063518900_btz558-B3","doi-asserted-by":"crossref","first-page":"e1002495.","DOI":"10.1371\/journal.pcbi.1002495","article-title":"Simulation of molecular data under diverse evolutionary scenarios","volume":"8","author":"Arenas","year":"2012","journal-title":"PLoS Comput. Biol"},{"key":"2023013112063518900_btz558-B4","doi-asserted-by":"crossref","first-page":"319.","DOI":"10.3389\/fgene.2015.00319","article-title":"Trends in substitution models of molecular evolution","volume":"6","author":"Arenas","year":"2015","journal-title":"Front Genet"},{"key":"2023013112063518900_btz558-B5","doi-asserted-by":"crossref","first-page":"746","DOI":"10.1016\/B978-0-12-809633-8.20260-4","volume-title":"Encyclopedia of Bioinformatics and Computational Biology","author":"Arenas","year":"2019"},{"key":"2023013112063518900_btz558-B6","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1093\/molbev\/msu411","article-title":"CodABC: a computational framework to coestimate recombination, substitution, and molecular adaptation rates by approximate Bayesian computation","volume":"32","author":"Arenas","year":"2015","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B7","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1534\/genetics.109.109736","article-title":"Coalescent simulation of intracodon recombination","volume":"184","author":"Arenas","year":"2010","journal-title":"Genetics"},{"key":"2023013112063518900_btz558-B8","doi-asserted-by":"crossref","first-page":"1133","DOI":"10.1534\/genetics.109.113423","article-title":"The effect of recombination on the reconstruction of ancestral sequences","volume":"184","author":"Arenas","year":"2010","journal-title":"Genetics"},{"key":"2023013112063518900_btz558-B9","doi-asserted-by":"crossref","first-page":"126","DOI":"10.1093\/acprof:osobl\/9780199601165.003.0009","volume-title":"Codon Evolution","author":"Arenas","year":"2012"},{"key":"2023013112063518900_btz558-B10","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1201\/b17795-7","volume-title":"Natural Selection: Methods and Applications","author":"Arenas","year":"2014"},{"key":"2023013112063518900_btz558-B11","doi-asserted-by":"crossref","first-page":"1295","DOI":"10.1093\/molbev\/msu078","article-title":"Simulation of genome-wide evolution under heterogeneous substitution models and complex multispecies coalescent histories","volume":"31","author":"Arenas","year":"2014","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B12","doi-asserted-by":"crossref","first-page":"2195","DOI":"10.1093\/molbev\/msv085","article-title":"Maximum-likelihood phylogenetic inference with selection on protein folding stability","volume":"32","author":"Arenas","year":"2015","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B13","doi-asserted-by":"crossref","first-page":"145.","DOI":"10.1186\/1471-2148-11-145","article-title":"Context-dependent codon partition models provide significant increases in model fit in atpB and rbcL protein-coding genes","volume":"11","author":"Baele","year":"2011","journal-title":"BMC Evol. Biol"},{"key":"2023013112063518900_btz558-B14","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1146\/annurev-ecolsys-102209-144621","article-title":"Approximate Bayesian computation in evolution and ecology","volume":"41","author":"Beaumont","year":"2010","journal-title":"Annu. Rev. Ecol. Evol. Syst"},{"key":"2023013112063518900_btz558-B15","doi-asserted-by":"crossref","first-page":"e5179.","DOI":"10.1371\/journal.pone.0005179","article-title":"Evolutionary conservation of orthoretroviral long terminal repeats (LTRs) and ab initio detection of single LTRs in genomic data","volume":"4","author":"Benachenhou","year":"2009","journal-title":"PLoS One"},{"key":"2023013112063518900_btz558-B16","doi-asserted-by":"crossref","first-page":"817","DOI":"10.1093\/molbev\/msj102","article-title":"Recombination estimation under complex evolutionary models with the coalescent composite-likelihood method","volume":"23","author":"Carvajal-Rodriguez","year":"2006","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B17","doi-asserted-by":"crossref","first-page":"e1007166.","DOI":"10.1371\/journal.pgen.1007166","article-title":"The impact of ribosomal interference, codon usage, and exit tunnel interactions on translation elongation rate variation","volume":"14","author":"Dao Duc","year":"2018","journal-title":"PLoS Genet"},{"key":"2023013112063518900_btz558-B18","doi-asserted-by":"crossref","first-page":"1172","DOI":"10.1126\/science.1102036","article-title":"Prospects for building the tree of life from large sequence databases","volume":"306","author":"Driskell","year":"2004","journal-title":"Science"},{"key":"2023013112063518900_btz558-B19","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1093\/genetics\/149.1.445","article-title":"Assessing the impact of secondary structure and solvent accessibility on protein evolution","volume":"149","author":"Goldman","year":"1998","journal-title":"Genetics"},{"key":"2023013112063518900_btz558-B20","first-page":"725","article-title":"A codon-based model of nucleotide substitution for protein-coding DNA sequences","volume":"11","author":"Goldman","year":"1994","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B21","doi-asserted-by":"crossref","first-page":"734","DOI":"10.1093\/molbev\/msx308","article-title":"Unbiased estimate of synonymous and non-synonymous substitution rates with non-stationary base composition","volume":"35","author":"Gueguen","year":"2018","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B22","doi-asserted-by":"crossref","first-page":"5413","DOI":"10.1128\/JVI.02554-06","article-title":"Independent evolution of human immunodeficiency virus type 1 env V1\/V2 and V4\/V5 hypervariable regions during chronic infection","volume":"81","author":"Harrington","year":"2007","journal-title":"J. Virol"},{"key":"2023013112063518900_btz558-B23","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1038\/nrg3130","article-title":"Computer simulations: tools for population and evolutionary genetics","volume":"13","author":"Hoban","year":"2012","journal-title":"Nat. Rev. Genet"},{"key":"2023013112063518900_btz558-B24","doi-asserted-by":"crossref","first-page":"486.","DOI":"10.1016\/S0168-9525(02)02722-1","article-title":"The Ka\/Ks ratio: diagnosing the form of sequence evolution","volume":"18","author":"Hurst","year":"2002","journal-title":"Trends Genet"},{"key":"2023013112063518900_btz558-B25","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1007\/978-1-4939-1438-8_4","article-title":"A beginners guide to estimating the non-synonymous to synonymous rate ratio of all protein-coding genes in a genome","volume":"1201","author":"Jeffares","year":"2015","journal-title":"Methods Mol. Biol"},{"key":"2023013112063518900_btz558-B26","doi-asserted-by":"crossref","first-page":"e5549","DOI":"10.7717\/peerj.5549","article-title":"Influence of mutation bias and hydrophobicity on the substitution rates and sequence entropies of protein evolution","volume":"6","author":"Jimenez-Santos","year":"2018","journal-title":"PeerJ"},{"key":"2023013112063518900_btz558-B27","first-page":"134","article-title":"Standard codon substitution models overestimate purifying selection for nonstationary data","volume":"9","author":"Kaehler","year":"2017","journal-title":"Genome Biol. Evol"},{"key":"2023013112063518900_btz558-B28","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1093\/molbev\/mst010","article-title":"MAFFT multiple sequence alignment software version 7: improvements in performance and usability","volume":"30","author":"Katoh","year":"2013","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B29","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/0304-4149(82)90011-4","article-title":"The coalescent","volume":"13","author":"Kingman","year":"1982","journal-title":"Stoch. Process. Their Appl"},{"key":"2023013112063518900_btz558-B30","doi-asserted-by":"crossref","first-page":"1260","DOI":"10.1089\/aid.2006.22.1260","article-title":"Continued evolution in gp41 after interruption of enfuvirtide in subjects with advanced HIV type 1 disease","volume":"22","author":"Kitchen","year":"2006","journal-title":"AIDS Res. Hum. Retroviruses"},{"key":"2023013112063518900_btz558-B31","doi-asserted-by":"crossref","first-page":"7","DOI":"10.2307\/2992432","article-title":"A concern for evidence and a phylogenetic hypothesis of relationships among Epicrates (Boidae, Serpentes","volume":"38","author":"Kluge","year":"1989","journal-title":"Syst. Zool"},{"key":"2023013112063518900_btz558-B32","doi-asserted-by":"crossref","first-page":"1208","DOI":"10.1093\/molbev\/msi105","article-title":"Not so different after all: a comparison of methods for detecting amino acid sites under selection","volume":"22","author":"Kosakovsky Pond","year":"2005","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B33","doi-asserted-by":"crossref","first-page":"3096","DOI":"10.1093\/bioinformatics\/btl474","article-title":"GARD: a genetic algorithm for recombination detection","volume":"22","author":"Kosakovsky Pond","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013112063518900_btz558-B34","doi-asserted-by":"crossref","first-page":"6752","DOI":"10.1073\/pnas.1706021114","article-title":"Human genome-wide repair map of DNA damage caused by the cigarette smoke carcinogen benzo[a]pyrene","volume":"114","author":"Li","year":"2017","journal-title":"Proc. Natl. Acad. Sci. U.S.A"},{"key":"2023013112063518900_btz558-B35","first-page":"150","article-title":"A new method for estimating sysnonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes","volume":"2","author":"Li","year":"1985","journal-title":"Mol. Biol. Evol"},{"key":"2023013112063518900_btz558-B36","doi-asserted-by":"crossref","first-page":"e88339.","DOI":"10.1371\/journal.pone.0088339","article-title":"Variation, evolution, and correlation analysis of C+G content and genome or chromosome size in different kingdoms and phyla","volume":"9","author":"Li","year":"2014","journal-title":"PLoS One"},{"key":"2023013112063518900_btz558-B37","doi-asserted-by":"crossref","first-page":"769","DOI":"10.1002\/pro.2071","article-title":"The interface of protein structure, protein biophysics, and molecular evolution","volume":"21","author":"Liberles","year":"2012","journal-title":"Protein Sci"},{"key":"2023013112063518900_btz558-B38","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1038\/hdy.2013.101","article-title":"Coestimation of recombination, substitution and molecular adaptation rates by approximate Bayesian computation","volume":"112","author":"Lopes","year":"2014","journal-title":"Heredity"},{"key":"2023013112063518900_btz558-B39","first-page":"439","volume-title":"Pattern Recognition in Computational Molecular Biology","author":"Mallo","year":"2016"},{"key":"2023013112063518900_btz558-B40","doi-asserted-by":"crossref","first-page":"460","DOI":"10.1101\/gr.229799.117","article-title":"Evolutionary origins and diversification of testis-specific short histone H2A variants in mammals","volume":"28","author":"Molaro","year":"2018","journal-title":"Genome Res"},{"key":"2023013112063518900_btz558-B42","doi-asserted-by":"crossref","first-page":"e0196267.","DOI":"10.1371\/journal.pone.0196267","article-title":"Correction: molecular adaptation in Rubisco: discriminating between convergent evolution and positive selection using mechanistic and classical codon models","volume":"13","author":"Parto","year":"2018","journal-title":"PLoS One"},{"key":"2023013112063518900_btz558-B43","doi-asserted-by":"crossref","first-page":"35275.","DOI":"10.1038\/srep35275","article-title":"Molecular evolution of a widely-adopted taxonomic marker (COI) across the animal tree of life","volume":"6","author":"Pentinsaari","year":"2016","journal-title":"Sci. Rep"},{"key":"2023013112063518900_btz558-B44","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1186\/1742-4690-6-67","article-title":"Ethnic differences in the adaptation rate of HIV gp120 from a vaccine trial","volume":"6","author":"Perez-Losada","year":"2009","journal-title":"Retrovirology"},{"key":"2023013112063518900_btz558-B45","doi-asserted-by":"crossref","first-page":"e16902","DOI":"10.1371\/journal.pone.0016902","article-title":"Phylodynamics of HIV-1 from a phase III AIDS vaccine trial in Bangkok, Thailand","volume":"6","author":"Perez-Losada","year":"2011","journal-title":"PLoS One"},{"key":"2023013112063518900_btz558-B46","doi-asserted-by":"crossref","first-page":"13598","DOI":"10.1128\/JVI.01570-07","article-title":"Mapping protease inhibitor resistance to human immunodeficiency virus type 1 sequence polymorphisms within patients","volume":"81","author":"Poon","year":"2007","journal-title":"J. Virol"},{"key":"2023013112063518900_btz558-B47","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1016\/j.gene.2004.12.011","article-title":"Site interdependence attributed to tertiary structure in amino acid sequence evolution","volume":"347","author":"Rodrigue","year":"2005","journal-title":"Gene"},{"key":"2023013112063518900_btz558-B48","doi-asserted-by":"crossref","first-page":"879","DOI":"10.1093\/genetics\/156.2.879","article-title":"Consequences of recombination on traditional phylogenetic analysis","volume":"156","author":"Schierup","year":"2000","journal-title":"Genetics"},{"volume-title":"Statistical Methods in Molecular Evolution","year":"2005","author":"Siepel","key":"2023013112063518900_btz558-B49"},{"key":"2023013112063518900_btz558-B50","article-title":"Evolutionary divergence of neuroanatomical organization and related genes in chimpanzees and bonobos","author":"Staes","year":"2018","journal-title":"Cortex"},{"key":"2023013112063518900_btz558-B51","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1095\/biolreprod.113.116871","article-title":"Coevolution of positively selected IZUMO1 and CD9 in rodents: evidence of interaction between gamete fusion proteins?","volume":"90","author":"Vicens","year":"2014","journal-title":"Biol. Reprod"},{"key":"2023013112063518900_btz558-B52","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1007\/s00239-018-9872-6","article-title":"Positive selection in the evolution of mammalian CRISPs","volume":"86","author":"Vicens","year":"2018","journal-title":"J. Mol. Evol"},{"key":"2023013112063518900_btz558-B53","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/j.ympev.2018.12.026","article-title":"Analyzing the functional divergence of Slo1 and Slo3 channel subfamilies","volume":"133","author":"Vicens","year":"2019","journal-title":"Mol. Phylogenet. Evol"},{"key":"2023013112063518900_btz558-B54","doi-asserted-by":"crossref","first-page":"331.","DOI":"10.1186\/1471-2148-8-331","article-title":"A class frequency mixture model that adjusts for site-specific amino acid frequencies and improves inference of protein phylogeny","volume":"8","author":"Wang","year":"2008","journal-title":"BMC Evol. Biol"},{"key":"2023013112063518900_btz558-B55","doi-asserted-by":"crossref","first-page":"1474","DOI":"10.1093\/hmg\/ddy058","article-title":"Predominant patterns of splicing evolution on human, chimpanzee and macaque evolutionary lineages","volume":"27","author":"Xiong","year":"2018","journal-title":"Hum. Mol. Genet"},{"key":"2023013112063518900_btz558-B56","doi-asserted-by":"crossref","DOI":"10.1093\/acprof:oso\/9780198567028.001.0001","volume-title":"Computational Molecular Evolution","author":"Yang","year":"2006"},{"key":"2023013112063518900_btz558-B57","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1093\/genetics\/155.1.431","article-title":"Codon-substitution models for heterogeneous selection pressure at amino acid sites","volume":"155","author":"Yang","year":"2000","journal-title":"Genetics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz558\/29097107\/btz558.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/430\/48990900\/btz558.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/430\/48990900\/btz558.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:23:07Z","timestamp":1675200187000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/2\/430\/5532222"}},"subtitle":[],"editor":[{"given":"Russell","family":"Schwartz","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,7,15]]},"references-count":56,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz558","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2020,1,15]]},"published":{"date-parts":[[2019,7,15]]}}}