{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:18Z","timestamp":1740185118919,"version":"3.37.3"},"reference-count":4,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2019,3,1]],"date-time":"2019-03-01T00:00:00Z","timestamp":1551398400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","award":["2015-05896"],"award-info":[{"award-number":["2015-05896"]}],"id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>When analyzing viral metagenomic sequences, it is often desired to filter the results of a BLAST analysis by the host species of the virus. VHost-Classifier automates this procedure using a natural language processing algorithm written in Python 3, which takes a list of taxonomic identifiers (taxids) returned from a BLAST query using viral sequences as input. The taxid output is binned by the evolutionary lineage of their host, based on string matching the words in their English names. If VHost-Classifier cannot identify a host, it attempts to bin the sequences by the environment from which the sample originated. VHost-Classifier predicts the evolutionary lineage of the host from the virus name and does not rely on referencing taxids against a database; therefore, it is not constrained by the size of a database and can host classify newly characterized viruses.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Benchmarked on a test dataset of 1000 randomly selected viral taxids on the NCBI taxonomy database, VHost-Classifier assigned, with 100% accuracy, a host to the rank of Class for &amp;gt;93% of viruses, and to the rank of Family for &amp;gt;37% of viruses.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>For more information about VHost-Classifier as well as implementation instructions, visit https:\/\/github.com\/Kzra\/VHost-Classifier.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz151","type":"journal-article","created":{"date-parts":[[2019,2,27]],"date-time":"2019-02-27T01:55:11Z","timestamp":1551232511000},"page":"3867-3869","source":"Crossref","is-referenced-by-count":3,"title":["VHost-Classifier: virus-host classification using natural language processing"],"prefix":"10.1093","volume":"35","author":[{"given":"Ezra","family":"Kitson","sequence":"first","affiliation":[{"name":"Department of Microbiology and Immunology, University of British Columbia , Vancouver, BC, Canada"}]},{"given":"Curtis A","family":"Suttle","sequence":"additional","affiliation":[{"name":"Department of Microbiology and Immunology, University of British Columbia , Vancouver, BC, Canada"},{"name":"Department of Earth, Ocean and Atmospheric Sciences, University of British Columbia , Vancouver, BC, Canada"},{"name":"Department of Botany, University of British Columbia , Vancouver, BC, Canada"},{"name":"Institute for the Oceans and Fisheries, University of British Columbia , Vancouver, BC, Canada"}]}],"member":"286","published-online":{"date-parts":[[2019,3,1]]},"reference":[{"key":"2023013108183234500_btz151-B1","doi-asserted-by":"crossref","first-page":"1635","DOI":"10.1093\/molbev\/msw046","article-title":"ETE 3: reconstruction, analysis, and visualization of phylogenomic data","volume":"33","author":"Huerta-Cepas","year":"2016","journal-title":"Mol. Biol. Evol"},{"key":"2023013108183234500_btz151-B2","doi-asserted-by":"crossref","first-page":"66.","DOI":"10.3390\/v8030066","article-title":"Linking virus genomes with host taxonomy","volume":"8","author":"Mihara","year":"2016","journal-title":"Viruses"},{"key":"2023013108183234500_btz151-B3","first-page":"D457","article-title":"IMG\/VR: a database of cultured and uncultured DNA Viruses and retroviruses","volume":"45","author":"Paez-Espino","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023013108183234500_btz151-B4","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/s40168-016-0166-1","article-title":"A comprehensive method for amplicon-based and metagenomic characterization of viruses, bacteria, and eukaryotes in freshwater samples","volume":"4","author":"Uyaguari-Diaz","year":"2016","journal-title":"Microbiome"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/19\/3867\/48975867\/bioinformatics_35_19_3867.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/19\/3867\/48975867\/bioinformatics_35_19_3867.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T15:04:03Z","timestamp":1675177443000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/19\/3867\/5368011"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,3,1]]},"references-count":4,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2019,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz151","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2019,10,1]]},"published":{"date-parts":[[2019,3,1]]}}}