{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T02:05:50Z","timestamp":1772244350490,"version":"3.50.1"},"reference-count":28,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,3,18]],"date-time":"2025-03-18T00:00:00Z","timestamp":1742256000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:p>Predicting phenotypic properties of a virus directly from its sequence data is an attractive goal for viral epidemiology. Here, we focus narrowly on the Orthocoronavirinae clade and demonstrate models that are powerfully predictive for a human-pathogen phenotype with 76.74% average precision and 85.96% average recall on the withheld test set groups, using only Orf1ab codon frequencies. We show alternative examples for other viral coding sequences and feature representations that do not perform well and discuss what distinguishes the models that are performant. These models point to a small subset of features, specifically 5 codons, that are critical to the success of the models. We discuss and contextualize how this observation may fit within a larger model for the role of translation in virus-host agreement.<\/jats:p>","DOI":"10.3389\/fbinf.2025.1562668","type":"journal-article","created":{"date-parts":[[2025,3,18]],"date-time":"2025-03-18T03:04:21Z","timestamp":1742267061000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["ORF1ab codon frequency model predicts host-pathogen relationship in orthocoronavirinae"],"prefix":"10.3389","volume":"5","author":[{"given":"Phillip E.","family":"Davis","sequence":"first","affiliation":[]},{"given":"Joseph A.","family":"Russell","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2025,3,18]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"577","DOI":"10.1126\/science.aap9072","article-title":"Predicting reservoir hosts and arthropod vectors from evolutionary signatures in RNA virus genomes","volume":"362","author":"Babayan","year":"2018","journal-title":"Science"},{"key":"B2","doi-asserted-by":"publisher","first-page":"2646","DOI":"10.1016\/j.csbj.2021.04.042","article-title":"Codon-based indices for modeling gene expression and transcript evolution","volume":"19","author":"Bahiri-Elitzur","year":"2021","journal-title":"Comput. Struct. Biotechnol. J."},{"key":"B3","doi-asserted-by":"publisher","first-page":"e56642","DOI":"10.1371\/journal.pone.0056642","article-title":"Causes and implications of codon usage bias in RNA viruses","volume":"8","author":"Belalov","year":"2013","journal-title":"PLOS ONE"},{"key":"B4","doi-asserted-by":"publisher","first-page":"e1009149","DOI":"10.1371\/journal.ppat.1009149","article-title":"Predicting the animal hosts of coronaviruses from compositional biases of spike protein and whole genome sequences through machine learning","volume":"17","author":"Brierley","year":"2021","journal-title":"PLoS Pathog."},{"key":"B5","doi-asserted-by":"publisher","first-page":"9864","DOI":"10.1073\/pnas.1604472113","article-title":"Link of a ubiquitous human coronavirus to dromedary camels","volume":"113","author":"Corman","year":"2016","journal-title":"Proc. Natl. Acad. Sci. U. S. A."},{"key":"B6","doi-asserted-by":"publisher","first-page":"11614","DOI":"10.3390\/ijms252111614","article-title":"SARS-CoV-2 displays a suboptimal codon usage bias for efficient translation in human cells diverted by hijacking the tRNA epitranscriptome","volume":"25","author":"Eldin","year":"2024","journal-title":"Int. J. Mol. Sci."},{"key":"B7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/978-1-4939-2438-7_1","article-title":"Coronaviruses: an overview of their replication and pathogenesis","volume":"1282","author":"Fehr","year":"2015","journal-title":"Methods Mol. Biol."},{"key":"B8","doi-asserted-by":"publisher","first-page":"e1679","DOI":"10.1002\/wrna.1679","article-title":"Compositional biases in RNA viruses: causes, consequences and applications","volume":"13","author":"Gaunt","year":"2022","journal-title":"Wiley Interdiscip. Rev. RNA"},{"key":"B9","doi-asserted-by":"publisher","first-page":"9319","DOI":"10.1038\/s41598-023-35861-7","article-title":"Using machine learning to detect coronaviruses potentially infectious to humans","volume":"13","author":"Gonzalez-Isunza","year":"2023","journal-title":"Sci. Rep."},{"key":"B10","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1016\/j.virol.2020.12.015","article-title":"Human Schlafen 11 exploits codon preference discrimination to attenuate viral protein synthesis of prototype foamy virus (PFV)","volume":"555","author":"Guo","year":"2021","journal-title":"Virology"},{"key":"B11","doi-asserted-by":"publisher","first-page":"108872","DOI":"10.1016\/j.celrep.2021.108872","article-title":"Translational adaptation of human viruses to the tissues they infect","volume":"34","author":"Hernandez-Alias","year":"2021","journal-title":"Cell Rep."},{"key":"B12","doi-asserted-by":"publisher","first-page":"e1619","DOI":"10.1002\/wrna.1619","article-title":"Emerging translation strategies during virus-host interaction","volume":"12","author":"Hoang","year":"2021","journal-title":"Wiley Interdiscip. Rev. RNA"},{"key":"B13","doi-asserted-by":"publisher","first-page":"e03915","DOI":"10.1016\/j.heliyon.2020.e03915","article-title":"Codon usage similarity between viral and some host genes suggests a codon-specific translational regulation","volume":"6","author":"Jitobaom","year":"2020","journal-title":"Heliyon"},{"key":"B14","doi-asserted-by":"publisher","first-page":"199162","DOI":"10.1016\/j.virusres.2023.199162","article-title":"Human Schlafen 11 inhibits influenza A virus production","volume":"334","author":"Jitobaom","year":"2023","journal-title":"Virus Res."},{"key":"B15","doi-asserted-by":"publisher","first-page":"4725","DOI":"10.1038\/s41467-022-31835-x","article-title":"CHIKV infection reprograms codon optimality to favor viral RNA translation by altering the tRNA epitranscriptome","volume":"13","author":"Jungfleisch","year":"2022","journal-title":"Nat. Commun."},{"key":"B16","doi-asserted-by":"publisher","first-page":"24303","DOI":"10.1073\/pnas.1913232116","article-title":"Structure of the zinc-finger antiviral protein in complex with RNA reveals a mechanism for selective targeting of CG-rich viral sequences","volume":"116","author":"Meagher","year":"2019","journal-title":"Proc. Natl. Acad. Sci. U. S. A."},{"key":"B17","doi-asserted-by":"publisher","first-page":"e3001390","DOI":"10.1371\/journal.pbio.3001390","article-title":"Identifying and prioritizing potential human-infecting viruses from their genome sequences","volume":"19","author":"Mollentze","year":"2021","journal-title":"PLoS Biol."},{"key":"B18","doi-asserted-by":"publisher","first-page":"e80329","DOI":"10.7554\/eLife.80329","article-title":"Variation in the ACE2 receptor has limited utility for SARS-CoV-2 host prediction","volume":"11","author":"Mollentze","year":"2022","journal-title":"Elife"},{"key":"B19","doi-asserted-by":"publisher","first-page":"1914","DOI":"10.1093\/nar\/gks986","article-title":"Vaccinia and influenza A viruses select rather than adjust tRNAs to optimize translation","volume":"41","author":"Pavon-Eternod","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"B20","first-page":"2825","article-title":"Scikit-learn: machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"JMLR"},{"key":"B21","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1186\/s12859-019-2606-y","article-title":"VAPiD: a lightweight cross-platform viral annotation pipeline and identification tool to facilitate virus genome submissions to NCBI GenBank","volume":"20","author":"Shean","year":"2019","journal-title":"BMC Bioinforma."},{"key":"B22","doi-asserted-by":"publisher","first-page":"e13e01222","DOI":"10.1128\/mBio.01222-18","article-title":"Avian influenza virus PB1 gene in H3N2 viruses evolved in humans to reduce interferon inhibition by skewing codon usage toward interferon-altered tRNA pools","volume":"9","author":"Smith","year":"2018","journal-title":"mBio"},{"key":"B23","doi-asserted-by":"publisher","first-page":"2279","DOI":"10.1016\/j.cell.2022.05.019","article-title":"Structure, receptor recognition, and antigenicity of the human coronavirus CCoV-HuPn-2018 spike glycoprotein","volume":"185","author":"Tortorici","year":"2022","journal-title":"Cell"},{"key":"B24","doi-asserted-by":"publisher","first-page":"e00104","DOI":"10.1128\/JVI.00104-19","article-title":"Schlafen 11 restricts flavivirus replication","volume":"93","author":"Valdez","year":"2019","journal-title":"J. Virol."},{"key":"B25","doi-asserted-by":"publisher","first-page":"1827","DOI":"10.1093\/molbev\/msr005","article-title":"HIV-1 modulates the tRNA pool to improve translation efficiency","volume":"28","author":"van Weringh","year":"2011","journal-title":"Mol. Biol. Evol."},{"key":"B26","doi-asserted-by":"publisher","first-page":"e76038","DOI":"10.7554\/eLife.76038","article-title":"Distinct elongation stalls during translation are linked with distinct pathways for mRNA degradation","volume":"11","author":"Veltri","year":"2022","journal-title":"Elife"},{"key":"B27","doi-asserted-by":"publisher","first-page":"e1007894","DOI":"10.1371\/journal.pcbi.1007894","article-title":"Predicting host taxonomic information from viral genomes: a comparison of feature representations","volume":"16","author":"Young","year":"2020","journal-title":"PLOS Comput. Biol."},{"key":"B28","doi-asserted-by":"publisher","first-page":"RP90316","DOI":"10.7554\/elife.90316","article-title":"Proteolytic cleavage and inactivation of the TRMT1 tRNA modification enzyme by SARS-CoV-2 main protease","volume":"12","author":"Zhang","year":"2024","journal-title":"eLife"}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1562668\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,18]],"date-time":"2025-03-18T03:04:24Z","timestamp":1742267064000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1562668\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,18]]},"references-count":28,"alternative-id":["10.3389\/fbinf.2025.1562668"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1562668","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2025.01.17.633649","asserted-by":"object"}]},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,18]]},"article-number":"1562668"}}