{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T06:41:00Z","timestamp":1768286460955,"version":"3.49.0"},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,3,11]],"date-time":"2023-03-11T00:00:00Z","timestamp":1678492800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,3,11]],"date-time":"2023-03-11T00:00:00Z","timestamp":1678492800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>In December 2019, the first case of COVID-19 was described in Wuhan, China, and by July 2022, there were already 540 million confirmed cases. Due to the rapid spread of the virus, the scientific community has made efforts to develop techniques for the viral classification of SARS-CoV-2.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In this context, we developed a new proposal for gene sequence representation with Genomic Signal Processing techniques for the work presented in this paper. First, we applied the mapping approach to samples of six viral species of the Coronaviridae family, which belongs SARS-CoV-2 Virus. We then used the sequence downsized obtained by the method proposed in a deep learning architecture for viral classification, achieving an accuracy of 98.35%, 99.08%, and 99.69% for the 64, 128, and 256 sizes of the viral signatures, respectively, and obtaining 99.95% precision for the vectors with size 256.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>The classification results obtained, in comparison to the results produced using other state-of-the-art representation techniques, demonstrate that the proposed mapping can provide a satisfactory performance result with low computational memory and processing time costs.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-023-05188-1","type":"journal-article","created":{"date-parts":[[2023,3,26]],"date-time":"2023-03-26T20:24:42Z","timestamp":1679862282000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["New proposal of viral genome representation applied in the classification of SARS-CoV-2 with deep learning"],"prefix":"10.1186","volume":"24","author":[{"given":"Lu\u00edsa C.","family":"de Souza","sequence":"first","affiliation":[]},{"given":"Karolayne S.","family":"Azevedo","sequence":"additional","affiliation":[]},{"given":"Jackson G.","family":"de Souza","sequence":"additional","affiliation":[]},{"given":"Raquel de M.","family":"Barbosa","sequence":"additional","affiliation":[]},{"given":"Marcelo A. C.","family":"Fernandes","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,3,11]]},"reference":[{"issue":"7","key":"5188_CR1","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1002\/bjs.11627","volume":"107","author":"A Spinelli","year":"2020","unstructured":"Spinelli A, Pellino G. COVID-19 pandemic: perspectives on an unfolding crisis. J Br Surg. 2020;107(7):785\u20137.","journal-title":"J Br Surg"},{"key":"5188_CR2","doi-asserted-by":"crossref","unstructured":"Organization WH. Origin of SARS-CoV-2, 26 March 2020; 2020.","DOI":"10.1038\/s41591-020-0850-3"},{"key":"5188_CR3","doi-asserted-by":"crossref","unstructured":"Grant PR, Turner MA, Shin GY, Nastouli E, Levett LJ. Extraction-free COVID-19 (SARS-CoV-2) diagnosis by RT-PCR to increase capacity for national testing programmes during a pandemic. BioRxiv. 2020.","DOI":"10.1101\/2020.04.06.028316"},{"key":"5188_CR4","unstructured":"Yang Y, Yang M, Shen C, Wang F, Yuan J, Li J, et\u00a0al. Laboratory diagnosis and monitoring the viral shedding of 2019-nCoV infections. MedRxiv. 2020."},{"issue":"10224","key":"5188_CR5","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1016\/S0140-6736(20)30251-8","volume":"395","author":"R Lu","year":"2020","unstructured":"Lu R, Zhao X, Li J, Niu P, Yang B, Wu H, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. The Lancet. 2020;395(10224):565\u201374.","journal-title":"The Lancet."},{"issue":"1","key":"5188_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-017-1602-3","volume":"18","author":"MA Remita","year":"2017","unstructured":"Remita MA, Halioui A, Malick Diouara AA, Daigle B, Kiani G, Diallo AB. A machine learning approach for viral genome classification. BMC Bioinform. 2017;18(1):1\u201311.","journal-title":"BMC Bioinform"},{"issue":"1","key":"5188_CR7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-020-80363-5","volume":"11","author":"A Lopez-Rincon","year":"2021","unstructured":"Lopez-Rincon A, Tonda A, Mendoza-Maldonado L, Mulders DG, Molenkamp R, Perez-Romero CA, et al. Classification and specific primer design for accurate detection of SARS-CoV-2 using deep learning. Sci Rep. 2021;11(1):1\u201311.","journal-title":"Sci Rep"},{"issue":"4","key":"5188_CR8","first-page":"656","volume":"12","author":"WJ Kent","year":"2002","unstructured":"Kent WJ. BLAT\u2013the BLAST-like alignment tool. Genome Res. 2002;12(4):656\u201364.","journal-title":"Genome Res"},{"key":"5188_CR9","doi-asserted-by":"publisher","first-page":"982","DOI":"10.1016\/j.csbj.2019.07.003","volume":"17","author":"S Pei","year":"2019","unstructured":"Pei S, Dong R, He RL, Yau SST. Large-scale genome comparison based on cumulative Fourier power and phase spectra: central moment and covariance vector. Comput Struct Biotechnol J. 2019;17:982\u201394.","journal-title":"Comput Struct Biotechnol J"},{"issue":"1","key":"5188_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-019-1755-7","volume":"20","author":"A Zielezinski","year":"2019","unstructured":"Zielezinski A, Girgis HZ, Bernard G, Leimeister CA, Tang K, Dencker T, et al. Benchmarking of alignment-free sequence comparison methods. Genome Biol. 2019;20(1):1\u201318.","journal-title":"Genome Biol"},{"issue":"3","key":"5188_CR11","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1093\/bib\/bbt067","volume":"15","author":"K Song","year":"2014","unstructured":"Song K, Ren J, Reinert G, Deng M, Waterman MS, Sun F. New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing. Brief Bioinform. 2014;15(3):343\u201353.","journal-title":"Brief Bioinform"},{"key":"5188_CR12","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1146\/annurev-biodatasci-080917-013431","volume":"1","author":"J Ren","year":"2018","unstructured":"Ren J, Bai X, Lu YY, Tang K, Wang Y, Reinert G, et al. Alignment-free sequence analysis and applications. Annu Rev Biomed Data Sci. 2018;1:93.","journal-title":"Annu Rev Biomed Data Sci"},{"issue":"1","key":"5188_CR13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-017-1319-7","volume":"18","author":"A Zielezinski","year":"2017","unstructured":"Zielezinski A, Vinga S, Almeida J, Karlowski WM. Alignment-free sequence comparison: benefits, applications, and tools. Genome Biol. 2017;18(1):1\u201317.","journal-title":"Genome Biol"},{"issue":"4","key":"5188_CR14","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1093\/bioinformatics\/btg005","volume":"19","author":"S Vinga","year":"2003","unstructured":"Vinga S, Almeida J. Alignment-free sequence comparison\u2014a review. Bioinformatics. 2003;19(4):513\u201323.","journal-title":"Bioinformatics"},{"key":"5188_CR15","doi-asserted-by":"crossref","unstructured":"Morales JA, Salda\u00f1a R, Santana-Castolo MH, Torres-Cerna CE, Borrayo E, Mendizabal-Ruiz AP, et\u00a0al. Deep Learning for the Classification of Genomic Signals. Mathematical Problems in Engineering. 2020;2020.","DOI":"10.1155\/2020\/7698590"},{"key":"5188_CR16","doi-asserted-by":"publisher","first-page":"81297","DOI":"10.1109\/ACCESS.2019.2923687","volume":"7","author":"A Fabija\u0144ska","year":"2019","unstructured":"Fabija\u0144ska A, Grabowski S. Viral genome deep classifier. IEEE Access. 2019;7:81297\u2013307.","journal-title":"IEEE Access"},{"key":"5188_CR17","doi-asserted-by":"crossref","unstructured":"Kwan HK, Arniker SB. Numerical representation of DNA sequences. In: 2009 IEEE International Conference on Electro\/Information Technology. IEEE; 2009. p. 307\u2013310.","DOI":"10.1109\/EIT.2009.5189632"},{"issue":"3","key":"5188_CR18","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0173288","volume":"12","author":"G Mendizabal-Ruiz","year":"2017","unstructured":"Mendizabal-Ruiz G, Rom\u00e1n-God\u00ednez I, Torres-Ramos S, Salido-Ruiz RA, Morales JA. On DNA numerical representations for genomic similarity computation. PLoS ONE. 2017;12(3): e0173288.","journal-title":"PLoS ONE"},{"issue":"4","key":"5188_CR19","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1109\/79.939833","volume":"18","author":"D Anastassiou","year":"2001","unstructured":"Anastassiou D. Genomic signal processing. IEEE Signal Process Mag. 2001;18(4):8\u201320.","journal-title":"IEEE Signal Process Mag"},{"issue":"3\u20134","key":"5188_CR20","doi-asserted-by":"publisher","first-page":"134","DOI":"10.1016\/j.ygeno.2016.08.002","volume":"108","author":"T Hoang","year":"2016","unstructured":"Hoang T, Yin C, Yau SST. Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison. Genomics. 2016;108(3\u20134):134\u201342.","journal-title":"Genomics"},{"issue":"2","key":"5188_CR21","doi-asserted-by":"publisher","first-page":"1197","DOI":"10.1093\/bib\/bbaa170","volume":"22","author":"SM Naeem","year":"2021","unstructured":"Naeem SM, Mabrouk MS, Marzouk SY, Eldosoky MA. A diagnostic genomic signal processing (GSP)-based system for automatic feature analysis and detection of COVID-19. Brief Bioinform. 2021;22(2):1197\u2013205.","journal-title":"Brief Bioinform"},{"issue":"05","key":"5188_CR22","doi-asserted-by":"publisher","first-page":"280","DOI":"10.4236\/jbise.2016.95021","volume":"9","author":"NG Nguyen","year":"2016","unstructured":"Nguyen NG, Tran VA, Ngo DL, Phan D, Lumbanraja FR, Faisal MR, et al. DNA sequence classification by convolutional neural network. J Biomed Sci Eng. 2016;9(05):280.","journal-title":"J Biomed Sci Eng"},{"issue":"9","key":"5188_CR23","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0222271","volume":"14","author":"A Tampuu","year":"2019","unstructured":"Tampuu A, Bzhalava Z, Dillner J, Vicente R. ViraMiner: deep learning on raw DNA sequences for identifying viral genomes in human samples. PLoS ONE. 2019;14(9): e0222271.","journal-title":"PLoS ONE"},{"issue":"7","key":"5188_CR24","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1038\/s41576-019-0122-6","volume":"20","author":"G Eraslan","year":"2019","unstructured":"Eraslan G, Avsec \u017d, Gagneur J, Theis FJ. Deep learning: new computational modelling techniques for genomics. Nat Rev Genet. 2019;20(7):389\u2013403.","journal-title":"Nat Rev Genet"},{"key":"5188_CR25","doi-asserted-by":"publisher","DOI":"10.1016\/j.dib.2020.105618","volume":"30","author":"RDM Barbosa","year":"2020","unstructured":"Barbosa RDM, Fernandes MA. Chaos game representation dataset of SARS-CoV-2 genome. Data Brief. 2020;30: 105618.","journal-title":"Data Brief."},{"key":"5188_CR26","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1016\/j.jtbi.2015.06.033","volume":"382","author":"C Yin","year":"2015","unstructured":"Yin C, Yau SST. An improved model for whole genome phylogenetic analysis by Fourier transform. J Theor Biol. 2015;382:99\u2013110.","journal-title":"J Theor Biol"},{"issue":"8","key":"5188_CR27","doi-asserted-by":"publisher","first-page":"2163","DOI":"10.1093\/nar\/18.8.2163","volume":"18","author":"HJ Jeffrey","year":"1990","unstructured":"Jeffrey HJ. Chaos game representation of gene structure. Nucleic Acids Res. 1990;18(8):2163\u201370.","journal-title":"Nucleic Acids Res"},{"key":"5188_CR28","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1016\/j.compbiomed.2015.05.022","volume":"69","author":"K Sedlar","year":"2016","unstructured":"Sedlar K, Skutkova H, Vitek M, Provaznik I. Set of rules for genomic signal downsampling. Comput Biol Med. 2016;69:308\u201314.","journal-title":"Comput Biol Med"},{"key":"5188_CR29","volume-title":"Signals and systems","author":"A Oppenheim","year":"1983","unstructured":"Oppenheim A, Willsky A, Young I. Signals and systems. Englewood Cliffs: Prentice-Hall Inc.; 1983."},{"key":"5188_CR30","doi-asserted-by":"crossref","unstructured":"Marcal AR. Evaluation of chaos game representation for comparison of DNA sequences. In: International workshop on combinatorial image analysis. Springer; 2018. p. 179\u2013188.","DOI":"10.1007\/978-3-030-05288-1_14"},{"issue":"1","key":"5188_CR31","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1186\/s12864-019-5571-y","volume":"20","author":"GS Randhawa","year":"2019","unstructured":"Randhawa GS, Hill KA, Kari L. ML-DSP: Machine Learning with Digital Signal Processing for ultrafast, accurate, and scalable genome classification at all taxonomic levels. BMC Genomics. 2019;20(1):267.","journal-title":"BMC Genomics"},{"issue":"5","key":"5188_CR32","doi-asserted-by":"publisher","first-page":"863","DOI":"10.1109\/TCBB.2014.2315991","volume":"11","author":"I Messaoudi","year":"2014","unstructured":"Messaoudi I, Elloumi-Oueslati A, Lachiri Z. Building specific signals from frequency chaos game and revealing periodicities using a smoothed Fourier analysis. IEEE\/ACM Trans Comput Biol Bioinf. 2014;11(5):863\u201377.","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"key":"5188_CR33","unstructured":"Lopez-Rincon A, Tonda A, Mendoza-Maldonado L, Claassen E, Garssen J, Kraneveld AD. Accurate identification of sars-cov-2 from viral genome sequences using deep learning. bioRxiv. 2020."},{"key":"5188_CR34","doi-asserted-by":"publisher","first-page":"133314","DOI":"10.1109\/ACCESS.2019.2940470","volume":"7","author":"K Zheng","year":"2019","unstructured":"Zheng K, Wang L, You ZH. CGMDA: an approach to predict and validate MicroRNA-disease associations by utilizing chaos game representation and LightGBM. IEEE Access. 2019;7:133314\u201323.","journal-title":"IEEE Access."},{"key":"5188_CR35","doi-asserted-by":"crossref","unstructured":"Hoo ZH, Candlish J, Teare D. What is an ROC curve? British Association for Accident and Emergency Medicine; 2017.","DOI":"10.1136\/emermed-2017-206735"},{"key":"5188_CR36","unstructured":"Goel S, A compression algorithm for DNA that uses ASCII values. In, et al. IEEE International Advance Computing Conference (IACC). IEEE. 2014;2014:739\u201343."},{"issue":"6","key":"5188_CR37","first-page":"197","volume":"1","author":"AS Nair","year":"2006","unstructured":"Nair AS, Sreenadhan SP. A coding measure scheme employing electron-ion interaction pseudopotential (EIIP). Bioinformation. 2006;1(6):197.","journal-title":"Bioinformation"},{"issue":"1","key":"5188_CR38","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-020-00305-w","volume":"7","author":"JT Hancock","year":"2020","unstructured":"Hancock JT, Khoshgoftaar TM. Survey on categorical data for neural networks. J Big Data. 2020;7(1):1\u201341.","journal-title":"J Big Data."},{"issue":"4","key":"5188_CR39","first-page":"839","volume":"24","author":"H Arslan","year":"2021","unstructured":"Arslan H, Arslan H. A new covid-19 detection method from human genome sequences using cpg island features and knn classifier. Eng Sci Technol Int J. 2021;24(4):839\u201347.","journal-title":"Eng Sci Technol Int J."},{"key":"5188_CR40","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiomed.2021.104650","volume":"136","author":"OP Singh","year":"2021","unstructured":"Singh OP, Vallejo M, El-Badawy IM, Aysha A, Madhanagopal J, Faudzi AAM. Classification of SARS-CoV-2 and non-SARS-CoV-2 using machine learning algorithms. Comput Biol Med. 2021;136: 104650.","journal-title":"Comput Biol Med"},{"issue":"4","key":"5188_CR41","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0232391","volume":"15","author":"GS Randhawa","year":"2020","unstructured":"Randhawa GS, Soltysiak MP, El Roz H, de Souza CP, Hill KA, Kari L. Machine learning using intrinsic genomic signatures for rapid classification of novel pathogens: COVID-19 case study. PLoS ONE. 2020;15(4): e0232391.","journal-title":"PLoS ONE"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05188-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-023-05188-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05188-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,9]],"date-time":"2023-12-09T16:52:35Z","timestamp":1702140755000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-023-05188-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,11]]},"references-count":41,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["5188"],"URL":"https:\/\/doi.org\/10.1186\/s12859-023-05188-1","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,11]]},"assertion":[{"value":"11 October 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 February 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 March 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"92"}}