{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T13:52:20Z","timestamp":1773841940382,"version":"3.50.1"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,7,5]],"date-time":"2024-07-05T00:00:00Z","timestamp":1720137600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,7,5]],"date-time":"2024-07-05T00:00:00Z","timestamp":1720137600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003593","name":"Conselho Nacional de Desenvolvimento Cient\u00edfico e Tecnol\u00f3gico","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Purpose<\/jats:title>\n                <jats:p>In this study, we present DeepVirusClassifier, a tool capable of accurately classifying Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) viral sequences among other subtypes of the coronaviridae family. This classification is achieved through a deep neural network model that relies on convolutional neural networks (CNNs). Since viruses within the same family share similar genetic and structural characteristics, the classification process becomes more challenging, necessitating more robust models. With the rapid evolution of viral genomes and the increasing need for timely classification, we aimed to provide a robust and efficient tool that could increase the accuracy of viral identification and classification processes. Contribute to advancing research in viral genomics and assist in surveilling emerging viral strains.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Methods<\/jats:title>\n                <jats:p>Based on a one-dimensional deep CNN, the proposed tool is capable of training and testing on the Coronaviridae family, including SARS-CoV-2. Our model\u2019s performance was assessed using various metrics, including F1-score and AUROC. Additionally, artificial mutation tests were conducted to evaluate the model\u2019s generalization ability across sequence variations. We also used the BLAST algorithm and conducted comprehensive processing time analyses for comparison.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>DeepVirusClassifier demonstrated exceptional performance across several evaluation metrics in the training and testing phases. Indicating its robust learning capacity. Notably, during testing on more than 10,000 viral sequences, the model exhibited a more than 99% sensitivity for sequences with fewer than 2000 mutations. The tool achieves superior accuracy and significantly reduced processing times compared to the Basic Local Alignment Search Tool algorithm. Furthermore, the results appear more reliable than the work discussed in the text, indicating that the tool has great potential to revolutionize viral genomic research.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>DeepVirusClassifier is a powerful tool for accurately classifying viral sequences, specifically focusing on SARS-CoV-2 and other subtypes within the Coronaviridae family. The superiority of our model becomes evident through rigorous evaluation and comparison with existing methods. Introducing artificial mutations into the sequences demonstrates the tool\u2019s ability to identify variations and significantly contributes to viral classification and genomic research. As viral surveillance becomes increasingly critical, our model holds promise in aiding rapid and accurate identification of emerging viral strains.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-024-05754-1","type":"journal-article","created":{"date-parts":[[2024,7,5]],"date-time":"2024-07-05T11:01:59Z","timestamp":1720177319000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Deepvirusclassifier: a deep learning tool for classifying SARS-CoV-2 based on viral subtypes within the coronaviridae family"],"prefix":"10.1186","volume":"25","author":[{"given":"Karolayne S.","family":"Azevedo","sequence":"first","affiliation":[]},{"given":"Lu\u00edsa C.","family":"de Souza","sequence":"additional","affiliation":[]},{"given":"Maria G. F.","family":"Coutinho","sequence":"additional","affiliation":[]},{"given":"Raquel","family":"de M. Barbosa","sequence":"additional","affiliation":[]},{"given":"Marcelo A. C.","family":"Fernandes","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,7,5]]},"reference":[{"key":"5754_CR1","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10096-020-03899-4","volume":"39","author":"H Wang","year":"2020","unstructured":"Wang H, et al. The genetic sequence, origin, and diagnosis of SARS-CoV-2. Eur J Clin Microbiol Infect Dis. 2020;39:1\u20137.","journal-title":"Eur J Clin Microbiol Infect Dis"},{"key":"5754_CR2","doi-asserted-by":"crossref","unstructured":"Maghdid HS, Ghafoor KZ, Sadiq AS, Curran K, Rabie K. A novel AI-enabled framework to diagnose coronavirus COVID 19 using smartphone embedded sensors: design study; 2020. arXiv:2003.07434.","DOI":"10.1109\/IRI49571.2020.00033"},{"key":"5754_CR3","doi-asserted-by":"publisher","first-page":"132665","DOI":"10.1109\/ACCESS.2020.3010287","volume":"8","author":"MEH Chowdhury","year":"2020","unstructured":"Chowdhury MEH, et al. Can AI help in screening viral and COVID-19 pneumonia? IEEE Access. 2020;8:132665\u201376.","journal-title":"IEEE Access"},{"key":"5754_CR4","doi-asserted-by":"publisher","first-page":"1075","DOI":"10.1038\/s10038-020-0808-9","volume":"65","author":"Y Toyoshima","year":"2020","unstructured":"Toyoshima Y, Nemoto K, Matsumoto S, Nakamura Y, Kiyotani K. SARS-CoV-2 genomic variations associated with mortality rate of COVID-19. J Hum Genet. 2020;65:1075\u201382.","journal-title":"J Hum Genet"},{"key":"5754_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-017-1602-3","volume":"18","author":"MA Remita","year":"2017","unstructured":"Remita MA, et al. A machine learning approach for viral genome classification. BMC Bioinform. 2017;18:1\u201311.","journal-title":"BMC Bioinform"},{"key":"5754_CR6","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1016\/j.coviro.2021.09.007","volume":"51","author":"JC Cobbin","year":"2021","unstructured":"Cobbin JC, Charon J, Harvey E, Holmes EC, Mahar JE. Current challenges to virus discovery by meta-transcriptomics. Curr Opin Virol. 2021;51:48\u201355.","journal-title":"Curr Opin Virol"},{"key":"5754_CR7","doi-asserted-by":"publisher","first-page":"519","DOI":"10.1089\/cmb.2018.0239","volume":"26","author":"D Lebatteux","year":"2019","unstructured":"Lebatteux D, Remita AM, Diallo AB. Toward an alignment-free method for feature extraction and accurate classification of viral sequences. J Comput Biol. 2019;26:519\u201335.","journal-title":"J Comput Biol"},{"key":"5754_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-017-1319-7","volume":"18","author":"A Zielezinski","year":"2017","unstructured":"Zielezinski A, Vinga S, Almeida J, Karlowski WM. Alignment-free sequence comparison: benefits, applications, and tools. Genome Biol. 2017;18:1\u201317.","journal-title":"Genome Biol"},{"key":"5754_CR9","doi-asserted-by":"publisher","first-page":"749","DOI":"10.3389\/fmicb.2018.00749","volume":"9","author":"S Nooij","year":"2018","unstructured":"Nooij S, Schmitz D, Vennema H, Kroneman A, Koopmans MP. Overview of virus metagenomic classification methods and their biological applications. Front Microbiol. 2018;9:749.","journal-title":"Front Microbiol"},{"key":"5754_CR10","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, et al. Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389\u2013402.","journal-title":"Nucleic Acids Res"},{"key":"5754_CR11","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1038\/s41559-017-0446-6","volume":"2","author":"\u00c5J V\u00e5gene","year":"2018","unstructured":"V\u00e5gene \u00c5J, et al. Salmonella enterica genomes from victims of a major sixteenth-century epidemic in Mexico. Nat Ecol Evol. 2018;2:520\u20138.","journal-title":"Nat Ecol Evol"},{"key":"5754_CR12","doi-asserted-by":"publisher","first-page":"1435","DOI":"10.1126\/science.2983426","volume":"227","author":"DJ Lipman","year":"1985","unstructured":"Lipman DJ, Pearson WR. Rapid and sensitive protein similarity searches. Science. 1985;227:1435\u201341.","journal-title":"Science"},{"key":"5754_CR13","doi-asserted-by":"publisher","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","volume":"22","author":"JD Thompson","year":"1994","unstructured":"Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673\u201380.","journal-title":"Nucleic Acids Res"},{"key":"5754_CR14","doi-asserted-by":"publisher","first-page":"2460","DOI":"10.1093\/bioinformatics\/btq461","volume":"26","author":"RC Edgar","year":"2010","unstructured":"Edgar RC. Search and clustering orders of magnitude faster than blast. Bioinformatics. 2010;26:2460\u20131.","journal-title":"Bioinformatics"},{"key":"5754_CR15","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-023-05188-1","volume":"24","author":"LC de Souza","year":"2023","unstructured":"de Souza LC, Azevedo KS, de Souza JG, Barbosa RdM, Fernandes MA. New proposal of viral genome representation applied in the classification of SARS-CoV-2 with deep learning. BMC Bioinform. 2023;24:1\u201319.","journal-title":"BMC Bioinform"},{"key":"5754_CR16","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0232391","volume":"15","author":"GS Randhawa","year":"2020","unstructured":"Randhawa GS, et al. Machine learning using intrinsic genomic signatures for rapid classification of novel pathogens: COVID-19 case study. PLoS ONE. 2020;15: e0232391.","journal-title":"PLoS ONE"},{"key":"5754_CR17","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1186\/s12864-019-5571-y","volume":"20","author":"GS Randhawa","year":"2019","unstructured":"Randhawa GS, Hill KA, Kari L. ML-DSP: machine learning with digital signal processing for ultrafast, accurate, and scalable genome classification at all taxonomic levels. BMC Genom. 2019;20:267.","journal-title":"BMC Genom"},{"key":"5754_CR18","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40168-017-0283-5","volume":"5","author":"J Ren","year":"2017","unstructured":"Ren J, Ahlgren NA, Lu YY, Fuhrman JA, Sun F. VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data. Microbiome. 2017;5:1\u201320.","journal-title":"Microbiome"},{"key":"5754_CR19","doi-asserted-by":"publisher","first-page":"284","DOI":"10.1016\/j.csbj.2022.12.007","volume":"21","author":"MG Coutinho","year":"2023","unstructured":"Coutinho MG, C\u00e2mara GB, Barbosa RdM, Fernandes MA. SARS-CoV-2 virus classification based on stacked sparse autoencoder. Comput Struct Biotechnol J. 2023;21:284\u201398.","journal-title":"Comput Struct Biotechnol J"},{"key":"5754_CR20","doi-asserted-by":"crossref","unstructured":"Hu L, Yang Y, Tang Z, He Y, Luo X. FCAN-MOPSO: an improved fuzzy-based graph clustering algorithm for complex networks with multi-objective particle swarm optimization. IEEE Trans Fuzzy Syst; 2023.","DOI":"10.1109\/TFUZZ.2023.3259726"},{"key":"5754_CR21","doi-asserted-by":"publisher","first-page":"94","DOI":"10.7861\/futurehosp.6-2-94","volume":"6","author":"T Davenport","year":"2019","unstructured":"Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future Healthc J. 2019;6:94.","journal-title":"Future Healthc J"},{"key":"5754_CR22","doi-asserted-by":"crossref","unstructured":"Mottaqi MS, Mohammadipanah F, Sajedi H. Contribution of machine learning approaches in response to SARS-CoV-2 infection. Inform Med Unlocked. 2021;100526.","DOI":"10.1016\/j.imu.2021.100526"},{"key":"5754_CR23","doi-asserted-by":"publisher","first-page":"825","DOI":"10.1038\/nbt.3313","volume":"33","author":"Y Park","year":"2015","unstructured":"Park Y, Kellis M. Deep learning for regulatory genomics. Nat Biotechnol. 2015;33:825\u20136.","journal-title":"Nat Biotechnol"},{"key":"5754_CR24","unstructured":"Lopez-Rincon A, et\u00a0al. Accurate identification of SARS-CoV-2 from viral genome sequences using deep learning. bioRxiv; 2020."},{"key":"5754_CR25","doi-asserted-by":"crossref","unstructured":"Lalmuanawma S, Hussain J, Chhakchhuak L. Applications of machine learning and artificial intelligence for COVID-19 (SARS-CoV-2) pandemic: a review. Chaos Solitons Fractals. 2020;110059.","DOI":"10.1016\/j.chaos.2020.110059"},{"key":"5754_CR26","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1038\/s41576-019-0122-6","volume":"20","author":"G Eraslan","year":"2019","unstructured":"Eraslan G, Avsec \u017d, Gagneur J, Theis FJ. Deep learning: new computational modelling techniques for genomics. Nat Rev Genet. 2019;20:389\u2013403.","journal-title":"Nat Rev Genet"},{"key":"5754_CR27","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1038\/s41588-018-0295-5","volume":"51","author":"J Zou","year":"2019","unstructured":"Zou J, et al. A primer on deep learning in genomics. Nat Genet. 2019;51:12\u20138.","journal-title":"Nat Genet"},{"key":"5754_CR28","doi-asserted-by":"publisher","first-page":"81297","DOI":"10.1109\/ACCESS.2019.2923687","volume":"7","author":"A Fabija\u0144ska","year":"2019","unstructured":"Fabija\u0144ska A, Grabowski S. Viral genome deep classifier. IEEE Access. 2019;7:81297\u2013307.","journal-title":"IEEE Access"},{"key":"5754_CR29","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0222271","volume":"14","author":"A Tampuu","year":"2019","unstructured":"Tampuu A, Bzhalava Z, Dillner J, Vicente R. Viraminer: deep learning on raw DNA sequences for identifying viral genomes in human samples. PLoS ONE. 2019;14: e0222271.","journal-title":"PLoS ONE"},{"key":"5754_CR30","doi-asserted-by":"publisher","first-page":"59597","DOI":"10.1109\/ACCESS.2021.3073728","volume":"9","author":"A Whata","year":"2021","unstructured":"Whata A, Chimedza C. Deep learning for SARS CoV-2 genome sequences. IEEE Access. 2021;9:59597\u2013611.","journal-title":"IEEE Access"},{"key":"5754_CR31","doi-asserted-by":"publisher","first-page":"2017580","DOI":"10.1080\/23311916.2021.2017580","volume":"9","author":"E Adetiba","year":"2022","unstructured":"Adetiba E, et al. DeepCOVID-19: a model for identification of COVID-19 virus sequences with genomic signal processing and deep learning. Cogent Eng. 2022;9:2017580.","journal-title":"Cogent Eng"},{"key":"5754_CR32","doi-asserted-by":"crossref","unstructured":"Gunasekaran H, et al. Analysis of DNA sequence classification using CNN and hybrid models. Comput Math Methods Med. 2021;2021.","DOI":"10.1155\/2021\/1835056"},{"key":"5754_CR33","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s40484-019-0187-4","volume":"8","author":"J Ren","year":"2020","unstructured":"Ren J, et al. Identifying viruses from metagenomic data using deep learning. Quant Biol. 2020;8:1\u201314.","journal-title":"Quant Biol"},{"key":"5754_CR34","unstructured":"NCBI. GenBank overview; 2020. https:\/\/www.ncbi.nlm.nih.gov\/genbank\/."},{"key":"5754_CR35","unstructured":"Kingma DP, Ba J. Adam: a method for stochastic optimization; 2014. arXiv:1412.6980."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05754-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-024-05754-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05754-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,5]],"date-time":"2024-07-05T11:02:48Z","timestamp":1720177368000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-024-05754-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7,5]]},"references-count":35,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["5754"],"URL":"https:\/\/doi.org\/10.1186\/s12859-024-05754-1","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,7,5]]},"assertion":[{"value":"23 August 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 March 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 July 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"No competing interest is declared.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"231"}}