{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,22]],"date-time":"2025-11-22T11:10:56Z","timestamp":1763809856467,"version":"3.37.3"},"reference-count":51,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2018,6,27]],"date-time":"2018-06-27T00:00:00Z","timestamp":1530057600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"US National Science Foundation","doi-asserted-by":"crossref","award":["DEB-1655683"],"award-info":[{"award-number":["DEB-1655683"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The relative rates of amino acid interchanges over evolutionary time are likely to vary among proteins. Variation in those rates has the potential to reveal information about constraints on proteins. However, the most straightforward model that could be used to estimate relative rates of amino acid substitution is parameter-rich and it is therefore impractical to use for this purpose.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>A six-parameter model of amino acid substitution that incorporates information about the physicochemical properties of amino acids was developed. It showed that amino acid side chain volume, polarity and aromaticity have major impacts on protein evolution. It also revealed variation among proteins in the relative importance of those properties. The same general approach can be used to improve the fit of empirical models such as the commonly used PAM and LG models.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Perl code and test data are available from https:\/\/github.com\/ebraun68\/sixparam.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty261","type":"journal-article","created":{"date-parts":[[2018,4,12]],"date-time":"2018-04-12T19:32:51Z","timestamp":1523561571000},"page":"i350-i356","source":"Crossref","is-referenced-by-count":15,"title":["An evolutionary model motivated by physicochemical properties of amino acids reveals variation among proteins"],"prefix":"10.1093","volume":"34","author":[{"given":"Edward L","family":"Braun","sequence":"first","affiliation":[{"name":"Department of Biology and Genetics Institute, University of Florida, Gainesville, FL, USA"}]}],"member":"286","published-online":{"date-parts":[[2018,6,27]]},"reference":[{"key":"2023051604232619000_bty261-B1","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1534\/genetics.112.140178","article-title":"Weak selection and protein evolution","volume":"192","author":"Akashi","year":"2012","journal-title":"Genetics"},{"key":"2023051604232619000_bty261-B2","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1038\/nrg1770","article-title":"Hearing silence: non-neutral evolution at synonymous sites in mammals","volume":"7","author":"Chamary","year":"2006","journal-title":"Nat. Rev. Genet"},{"key":"2023051604232619000_bty261-B3","doi-asserted-by":"crossref","first-page":"1104","DOI":"10.1093\/sysbio\/syv059","article-title":"Selecting question-specific genes to reduce incongruence in phylogenomics: a case study of jawed vertebrate backbone phylogeny","volume":"64","author":"Chen","year":"2015","journal-title":"Syst. Biol"},{"key":"2023051604232619000_bty261-B4","first-page":"345","volume-title":"Atlas of Protein Sequence and Structure. National Biomedical Research Foundation","author":"Dayhoff","year":"1978"},{"key":"2023051604232619000_bty261-B5","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1007\/s00239-001-2304-y","article-title":"rtREV: an amino acid substitution matrix for inference of retrovirus and reverse transcriptase phylogeny","volume":"55","author":"Dimmic","year":"2002","journal-title":"J. Mol. Evol"},{"key":"2023051604232619000_bty261-B6","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1038\/nature06614","article-title":"Broad phylogenomic sampling improves resolution of the animal tree of life","volume":"452","author":"Dunn","year":"2008","journal-title":"Nature"},{"key":"2023051604232619000_bty261-B7","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1038\/nrg.2015.18","article-title":"Causes of evolutionary rate variation among protein sites","volume":"17","author":"Echave","year":"2016","journal-title":"Nat. Rev. Genet"},{"key":"2023051604232619000_bty261-B8","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1007\/BF01734359","article-title":"Evolutionary trees from DNA sequences: a maximum likelihood approach","volume":"17","author":"Felsenstein","year":"1981","journal-title":"J. Mol. Evol"},{"volume-title":"Inferring Phylogenies","year":"2004","author":"Felsenstein","key":"2023051604232619000_bty261-B9"},{"key":"2023051604232619000_bty261-B10","doi-asserted-by":"crossref","first-page":"1458","DOI":"10.1105\/tpc.106.049676","article-title":"The two AGPase subunits evolve at different rates in angiosperms, yet they are equally sensitive to activity-altering amino acid changes when expressed in bacteria","volume":"19","author":"Georgelis","year":"2007","journal-title":"Plant Cell"},{"key":"2023051604232619000_bty261-B11","doi-asserted-by":"crossref","first-page":"862","DOI":"10.1126\/science.185.4154.862","article-title":"Amino acid difference formula to help explain protein evolution","volume":"185","author":"Grantham","year":"1974","journal-title":"Science"},{"key":"2023051604232619000_bty261-B12","doi-asserted-by":"crossref","first-page":"2235","DOI":"10.1093\/molbev\/msm152","article-title":"The nonsynonymous\/synonymous substitution rate ratio versus the radical\/conservative replacement rate ratio in the evolution of mammalian genes","volume":"24","author":"Hanada","year":"2007","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B13","doi-asserted-by":"crossref","first-page":"e1000781","DOI":"10.1371\/journal.pgen.1000781","article-title":"Increased expression and protein divergence in duplicate genes is associated with morphological diversification","volume":"5","author":"Hanada","year":"2009","journal-title":"PLoS Genet"},{"key":"2023051604232619000_bty261-B14","doi-asserted-by":"crossref","first-page":"4261","DOI":"10.1098\/rspb.2009.0896","article-title":"Assessing the root of bilaterian animals with scalable phylogenomic methods","volume":"276","author":"Hejnol","year":"2009","journal-title":"Proc. Roy. Soc. B"},{"key":"2023051604232619000_bty261-B15","doi-asserted-by":"crossref","first-page":"143.","DOI":"10.1186\/s12859-016-0985-x","article-title":"Does the choice of nucleotide substitution models matter topologically?","volume":"17","author":"Hoff","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2023051604232619000_bty261-B16","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1016\/j.gene.2009.03.012","article-title":"More radical amino acid replacements in primates than in rodents: support for the evolutionary role of effective population size","volume":"440","author":"Hughes","year":"2009","journal-title":"Gene"},{"key":"2023051604232619000_bty261-B17","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1093\/biomet\/76.2.297","article-title":"Regression and time series model selection in small samples","volume":"76","author":"Hurvich","year":"1989","journal-title":"Biometrika"},{"key":"2023051604232619000_bty261-B18","doi-asserted-by":"crossref","first-page":"13994","DOI":"10.1073\/pnas.0404142101","article-title":"Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution","volume":"101","author":"Hwang","year":"2004","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051604232619000_bty261-B19","doi-asserted-by":"crossref","first-page":"1320","DOI":"10.1126\/science.1253451","article-title":"Whole-genome analyses resolve early branches in the tree of life of modern birds","volume":"346","author":"Jarvis","year":"2014","journal-title":"Science"},{"key":"2023051604232619000_bty261-B51","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1186\/s13742-014-0038-1","article-title":"Phylogenomic analyses data of the avian phylogenomics project","volume":"4","author":"Jarvis","year":"2015","journal-title":"GigaScience"},{"key":"2023051604232619000_bty261-B20","first-page":"275","article-title":"The rapid generation of mutation data matrices from protein sequences","volume":"8","author":"Jones","year":"1992","journal-title":"CABIOS"},{"key":"2023051604232619000_bty261-B21","doi-asserted-by":"crossref","first-page":"29.","DOI":"10.1186\/1471-2148-6-29","article-title":"Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified","volume":"6","author":"Keane","year":"2006","journal-title":"BMC Evol. Biol"},{"key":"2023051604232619000_bty261-B22","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1007\/BF02109483","article-title":"Maximum likelihood inference of protein phylogeny and the origin of chloroplasts","volume":"31","author":"Kishino","year":"1990","journal-title":"J. Mol. Evol"},{"key":"2023051604232619000_bty261-B23","doi-asserted-by":"crossref","first-page":"e1003527","DOI":"10.1371\/journal.pgen.1003527","article-title":"Strong purifying selection at synonymous sites in D.melanogaster","volume":"9","author":"Lawrie","year":"2013","journal-title":"PLoS Genet"},{"key":"2023051604232619000_bty261-B24","doi-asserted-by":"crossref","first-page":"1095","DOI":"10.1093\/molbev\/msh112","article-title":"A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process","volume":"21","author":"Lartillot","year":"2004","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B25","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1098\/rstb.2008.0180","article-title":"Phylogenetic mixture models for proteins","volume":"363","author":"Le","year":"2008","journal-title":"Philos. Trans. Roy. Soc. B"},{"key":"2023051604232619000_bty261-B26","doi-asserted-by":"crossref","first-page":"1307","DOI":"10.1093\/molbev\/msn067","article-title":"An improved general amino acid replacement matrix","volume":"25","author":"Le","year":"2008","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B27","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1093\/sysbio\/46.3.523","article-title":"Gene trees in species trees","volume":"46","author":"Maddison","year":"1997","journal-title":"Syst. Biol"},{"key":"2023051604232619000_bty261-B28","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1089\/10665270050514918","article-title":"Modeling amino acid replacement","volume":"7","author":"M\u00fcller","year":"2000","journal-title":"J. Comput. Biol"},{"key":"2023051604232619000_bty261-B29","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1093\/molbev\/msu300","article-title":"IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies","volume":"32","author":"Nguyen","year":"2015","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B30","doi-asserted-by":"crossref","first-page":"e503.","DOI":"10.1371\/journal.pone.0000503","article-title":"HIV-specific probabilistic models of protein evolution","volume":"2","author":"Nickle","year":"2007","journal-title":"PLoS ONE"},{"key":"2023051604232619000_bty261-B31","doi-asserted-by":"crossref","first-page":"1216","DOI":"10.1101\/gr.076570.108","article-title":"Transcription induces strand-specific mutations at the 5\u2032 end of human genes","volume":"18","author":"Polak","year":"2008","journal-title":"Genome Res"},{"key":"2023051604232619000_bty261-B32","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1093\/sysbio\/syx041","article-title":"Why do phylogenomic data sets yield conflicting trees? Data type influences the avian tree of life more than taxon sampling","volume":"66","author":"Reddy","year":"2017","journal-title":"Syst. Biol"},{"key":"2023051604232619000_bty261-B33","doi-asserted-by":"crossref","first-page":"1337","DOI":"10.1093\/molbev\/msi121","article-title":"More genes or more taxa? The relative contribution of gene number and taxon number to phylogenetic accuracy","volume":"22","author":"Rokas","year":"2005","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B34","doi-asserted-by":"crossref","first-page":"467","DOI":"10.1007\/s00239-003-2500-z","article-title":"Are radical and conservative substitution rates useful statistics in molecular evolution?","volume":"57","author":"Smith","year":"2003","journal-title":"J. Mol. Evol"},{"key":"2023051604232619000_bty261-B35","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1080\/14772000.2017.1401016","article-title":"On the importance of homology in the age of phylogenomics","volume":"16","author":"Springer","year":"2018","journal-title":"Syst. Biodivers"},{"key":"2023051604232619000_bty261-B36","first-page":"407","volume-title":"Molecular Systematics","author":"Swofford","year":"1996"},{"key":"2023051604232619000_bty261-B37","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1186\/1756-0500-5-85","article-title":"Mathematical modeling and comparison of protein size distribution in different plant, animal, fungal and microbial species reveals a negative correlation between protein size and protein number, thus providing insight into the evolution of proteomes","volume":"5","author":"Tiessen","year":"2012","journal-title":"BMC Res. Notes"},{"key":"2023051604232619000_bty261-B38","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1089\/106652703322756195","article-title":"A transition probability model for amino acid substitutions from blocks","volume":"10","author":"Veerassamy","year":"2003","journal-title":"J. Comput. Biol"},{"volume-title":"Computational Phylogenetics: An Introduction to Designing Methods for Phylogeny Estimation","year":"2018","author":"Warnow","key":"2023051604232619000_bty261-B39"},{"key":"2023051604232619000_bty261-B40","doi-asserted-by":"crossref","first-page":"542","DOI":"10.1186\/s13059-014-0542-8","article-title":"Kr\/Kc but not dN\/dS correlates positively with body mass in birds, raising implications for inferring lineage-specific selection","volume":"15","author":"Weber","year":"2014","journal-title":"Genome Biol"},{"key":"2023051604232619000_bty261-B41","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1093\/oxfordjournals.molbev.a003851","article-title":"A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach","volume":"18","author":"Whelan","year":"2001","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B42","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1101\/gr.1347404","article-title":"Coelomata and not Ecdysozoa: evidence from genome-wide phylogenetic analysis","volume":"14","author":"Wolf","year":"2004","journal-title":"Genome Res"},{"key":"2023051604232619000_bty261-B43","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1007\/PL00006412","article-title":"What amino acid properties affect protein evolution?","volume":"47","author":"Xia","year":"1998","journal-title":"J. Mol. Evol"},{"key":"2023051604232619000_bty261-B44","first-page":"433","article-title":"Untangling the effects of codon mutation and amino acid exchangeability","volume":"10","author":"Yampolsky","year":"2005","journal-title":"Pac. Symp. Biocomp"},{"key":"2023051604232619000_bty261-B45","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1007\/BF00178256","article-title":"Estimating the pattern of nucleotide substitution","volume":"39","author":"Yang","year":"1994","journal-title":"J. Mol. Evol"},{"key":"2023051604232619000_bty261-B46","doi-asserted-by":"crossref","first-page":"568","DOI":"10.1093\/oxfordjournals.molbev.a025957","article-title":"Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution","volume":"15","author":"Yang","year":"1998","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B47","doi-asserted-by":"crossref","DOI":"10.1093\/acprof:oso\/9780198567028.001.0001","volume-title":"Computational Molecular Evolution","author":"Yang","year":"2006"},{"key":"2023051604232619000_bty261-B48","doi-asserted-by":"crossref","first-page":"908","DOI":"10.1093\/oxfordjournals.molbev.a004148","article-title":"Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages","volume":"19","author":"Yang","year":"2002","journal-title":"Mol. Biol. Evol"},{"key":"2023051604232619000_bty261-B49","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1007\/s002399910007","article-title":"Rates of conservative and radical nonsynonymous nucleotide substitutions in mammalian nuclear genes","volume":"50","author":"Zhang","year":"2000","journal-title":"J. Mol. Evol"},{"key":"2023051604232619000_bty261-B50","doi-asserted-by":"crossref","first-page":"1311","DOI":"10.1126\/science.1251385","article-title":"Comparative genomics reveals insights into avian genome evolution and adaptation","volume":"346","author":"Zhang","year":"2014","journal-title":"Science"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/i350\/50315882\/bioinformatics_34_13_i350.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/i350\/50315882\/bioinformatics_34_13_i350.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T04:25:10Z","timestamp":1684211110000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/13\/i350\/5045789"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,27]]},"references-count":51,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2018,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty261","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2018,7,1]]},"published":{"date-parts":[[2018,6,27]]}}}