{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T07:59:08Z","timestamp":1772524748774,"version":"3.50.1"},"reference-count":95,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,4,27]],"date-time":"2021-04-27T00:00:00Z","timestamp":1619481600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,4,27]],"date-time":"2021-04-27T00:00:00Z","timestamp":1619481600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Laserinstitut Hochschule Mittweida"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Comput &amp; Applic"],"published-print":{"date-parts":[[2022,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We present an approach to discriminate SARS-CoV-2 virus types based on their RNA sequence descriptions avoiding a sequence alignment. For that purpose, sequences are preprocessed by feature extraction and the resulting feature vectors are analyzed by prototype-based classification to remain interpretable. In particular, we propose to use variants of learning vector quantization (LVQ) based on dissimilarity measures for RNA sequence data. The respective matrix LVQ provides additional knowledge about the classification decisions like discriminant feature correlations and, additionally, can be equipped with easy to realize reject options for uncertain data. Those options provide self-controlled evidence, i.e., the model refuses to make a classification decision if the model evidence for the presented data is not sufficient. This model is first trained using a GISAID dataset with given virus types detected according to the molecular differences in coronavirus populations by phylogenetic tree clustering. In a second step, we apply the trained model to another but unlabeled SARS-CoV-2 virus dataset. For these data, we can either assign a virus type to the sequences or reject atypical samples. Those rejected sequences allow to speculate about new virus types with respect to nucleotide base mutations in the viral sequences. Moreover, this rejection analysis improves model robustness. Last but not least, the presented approach has lower computational complexity compared to methods based on (multiple) sequence alignment.<\/jats:p>","DOI":"10.1007\/s00521-021-06018-2","type":"journal-article","created":{"date-parts":[[2021,4,27]],"date-time":"2021-04-27T10:05:12Z","timestamp":1619517912000},"page":"67-78","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Learning vector quantization as an interpretable classifier for the detection of SARS-CoV-2 types based on their RNA sequences"],"prefix":"10.1007","volume":"34","author":[{"given":"Marika","family":"Kaden","sequence":"first","affiliation":[]},{"given":"Katrin Sophie","family":"Bohnsack","sequence":"additional","affiliation":[]},{"given":"Mirko","family":"Weber","sequence":"additional","affiliation":[]},{"given":"Mateusz","family":"Kud\u0142a","sequence":"additional","affiliation":[]},{"given":"Kaja","family":"Gutowska","sequence":"additional","affiliation":[]},{"given":"Jacek","family":"Blazewicz","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6725-0141","authenticated-orcid":false,"given":"Thomas","family":"Villmann","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,4,27]]},"reference":[{"key":"6018_CR1","doi-asserted-by":"crossref","first-page":"450","DOI":"10.1038\/s41591-020-0820-9","volume":"26","author":"KG Andersen","year":"2020","unstructured":"Andersen KG, Rambaut A, Lipkin WI, Holmes EC, Garry RF (2020) The proximal origin of SARS-CoV-2. Nat Med 26:450\u2013452","journal-title":"Nat Med"},{"key":"6018_CR2","unstructured":"Bai Y, Jiang D, Lon J, Chen X, Hu M, Lin S, Chen Z, Meng Y, Du H (2020) Evolution and molecular characteristics of SARS-CoV-2 genome. bioRXiv, (2020.04.24.058933)"},{"issue":"4\u20135","key":"6018_CR3","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1016\/S0893-6080(99)00027-1","volume":"12","author":"H-U Bauer","year":"1999","unstructured":"Bauer H-U, Herrmann M, Villmann T (1999) Neural maps and topographic vector quantization. Neural Netw 12(4\u20135):659\u2013676","journal-title":"Neural Netw"},{"key":"6018_CR4","unstructured":"Bhanot G, Biehl M, Vilmann T, Z\u00fchlke D (2017) Biomedical data analysis in translational research: Integration of expert knowledge and interpretable models. In M.\u00a0Verleysen, editor, Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN\u20192017), pages 177\u2013186, Louvain-La-Neuve, Belgium. i6doc.com"},{"key":"6018_CR5","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1002\/wcs.1378","volume":"2","author":"M Biehl","year":"2016","unstructured":"Biehl M, Hammer B, Villmann T (2016) Prototype-based models in machine learning. Wiley Interdisciplinary Rev Cogn Sci 2:92\u2013111","journal-title":"Wiley Interdisciplinary Rev Cogn Sci"},{"issue":"1","key":"6018_CR6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13040-018-0188-2","volume":"12","author":"S Bittrich","year":"2019","unstructured":"Bittrich S, Kaden M, Leberecht C, Kaiser F, Villmann T, Labudde D (2019) Application of an interpretable classification model on early folding residues during protein folding. BioData Min 12(1):1\u201316","journal-title":"BioData Min"},{"key":"6018_CR7","doi-asserted-by":"crossref","first-page":"5155","DOI":"10.1073\/pnas.83.14.5155","volume":"83","author":"B Blaisdell","year":"1986","unstructured":"Blaisdell B (1986) A measure of the similarity of sets of sequences not requiring sequence alignment. Proc Natl Acad Sci USA 83:5155\u20135159","journal-title":"Proc Natl Acad Sci USA"},{"key":"6018_CR8","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1007\/BF02602925","volume":"29","author":"B Blaisdell","year":"1989","unstructured":"Blaisdell B (1989) Average values of a dissimilarity measure not requiring sequence alignment are twice the averages of conventional mismatch counts requiring sequence alignment for a computer-generated model system. J Mol Evol 29:538\u2013547","journal-title":"J Mol Evol"},{"key":"6018_CR9","doi-asserted-by":"crossref","unstructured":"Bosco G, diGangi M (2016) Deep learning architectures for DNA sequence classification. In A.\u00a0Petrosino, V.\u00a0Loia, and W.\u00a0Pedrycz, editors, Fuzzy Logic and Soft Computing Applications: Proceedings of the International Workshop on Fuzzy Logic and Applications (WILF 2016), volume 10147 of LNCS, pages 162\u2013171, Cham. Springer","DOI":"10.1007\/978-3-319-52962-2_14"},{"issue":"D1","key":"6018_CR10","doi-asserted-by":"crossref","first-page":"D571","DOI":"10.1093\/nar\/gku1207","volume":"43","author":"JR Brister","year":"2014","unstructured":"Brister JR, Ako-adjei D, Bao Y, Blinkova O (2014) NCBI viral genomes resource. Nucleic Acids Res 43(D1):D571\u2013D577","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"6018_CR11","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1080\/00401706.1974.10489150","volume":"16","author":"M Bryson","year":"1974","unstructured":"Bryson M (1974) Heavy-tailed distributions: properties and tests. Technometrics 16(1):61\u201368","journal-title":"Technometrics"},{"issue":"1","key":"6018_CR12","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1016\/j.neunet.2011.10.001","volume":"26","author":"K Bunte","year":"2012","unstructured":"Bunte K, Schneider P, Hammer B, Schleif F-M, Villmann T, Biehl M (2012) Limited rank matrix learning, discriminative dimension reduction and visualization. Neural Netw 26(1):159\u2013173","journal-title":"Neural Netw"},{"issue":"4","key":"6018_CR13","doi-asserted-by":"crossref","first-page":"1523","DOI":"10.1109\/TIT.2005.844059","volume":"51","author":"R Cilibrasi","year":"2005","unstructured":"Cilibrasi R, Vit\u00e1nyi P (2005) Clustering by compression. IEEE Trans Inf Theory 51(4):1523\u20131545","journal-title":"IEEE Trans Inf Theory"},{"issue":"D1","key":"6018_CR14","doi-asserted-by":"crossref","first-page":"D67","DOI":"10.1093\/nar\/gkv1276","volume":"44","author":"K Clark","year":"2015","unstructured":"Clark K, Karsch-Mizrachi I, Lipman D, Ostell J, Sayers EW (2015) GenBank. Nucleic Acids Res 44(D1):D67\u2013D72","journal-title":"Nucleic Acids Res"},{"issue":"9","key":"6018_CR15","doi-asserted-by":"crossref","first-page":"3021","DOI":"10.1093\/nar\/13.9.3021","volume":"13","author":"A Cornish-Bowden","year":"1985","unstructured":"Cornish-Bowden A (1985) Nomenclature for incompletely specified bases in nucleic acid sequences: recommendations 1984. Nucleic Acids Res 13(9):3021\u20133030","journal-title":"Nucleic Acids Res"},{"key":"6018_CR16","doi-asserted-by":"crossref","first-page":"762","DOI":"10.1016\/j.neunet.2006.05.018","volume":"19","author":"M Cottrell","year":"2006","unstructured":"Cottrell M, Hammer B, Hasenfu\u00df A, Villmann T (2006) Batch and median neural gas. Neural Netw 19:762\u2013771","journal-title":"Neural Netw"},{"issue":"3","key":"6018_CR17","doi-asserted-by":"crossref","first-page":"e17293","DOI":"10.1371\/journal.pone.0017293","volume":"6","author":"M Deng","year":"2011","unstructured":"Deng M, Yu C, Liang Q, He R, Yau S-T (2011) A novel method of characterizing sequences: genome space with biological distance and applications. PLoS One 6(3):e17293","journal-title":"PLoS One"},{"issue":"1","key":"6018_CR18","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1186\/1471-2105-5-113","volume":"5","author":"RC Edgar","year":"2004","unstructured":"Edgar RC (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform 5(1):113","journal-title":"BMC Bioinform"},{"key":"6018_CR19","doi-asserted-by":"crossref","first-page":"C1","DOI":"10.1111\/ectj.12061","volume":"19","author":"J Fan","year":"2016","unstructured":"Fan J, Liao Y, Liu H (2016) An overview of the estimation of large covariance and precision matrices. Econom J 19:C1\u2013C32","journal-title":"Econom J"},{"issue":"4","key":"6018_CR20","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1007\/BF02603120","volume":"25","author":"D-F Feng","year":"1987","unstructured":"Feng D-F, Doolittle RF (1987) Progressive sequence alignment as a prerequisite to correct phylogenetic trees. J Mol Evol 25(4):351\u2013360","journal-title":"J Mol Evol"},{"issue":"Suppl. 7","key":"6018_CR21","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1186\/s12859-018-2182-6","volume":"19","author":"A Fianacca","year":"2018","unstructured":"Fianacca A, LaPaglia L, LaRosa M, LoBosco G, Renda G, Rizzo R, Galio S, Urso A (2018) Deep learning models for bacteria taxonomic classification of metagenomic data. BMC Bioinform 19(Suppl. 7):198","journal-title":"BMC Bioinform"},{"key":"6018_CR22","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1016\/j.neucom.2014.10.092","volume":"169","author":"L Fischer","year":"2015","unstructured":"Fischer L, Hammer B, Wersing H (2015) Efficient rejection strategies for prototype-based classification. Neurocomputing 169:334\u2013342","journal-title":"Neurocomputing"},{"key":"6018_CR23","doi-asserted-by":"crossref","unstructured":"Foster P, Foster L, Renfrew C, Forster M (2020) Phylogenetic network analysis of SARS-CoV-2 genomes. Proc. of the National Academy of Science of the United States of America (PNAS)","DOI":"10.1073\/pnas.2004999117"},{"key":"6018_CR24","doi-asserted-by":"crossref","first-page":"972","DOI":"10.1126\/science.1136800","volume":"315","author":"B Frey","year":"2007","unstructured":"Frey B, Dueck D (2007) Clustering by message passing between data points. Science 315:972\u2013976","journal-title":"Science"},{"key":"6018_CR25","volume-title":"Deep learning","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press, Cambridge"},{"key":"6018_CR26","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1016\/0022-2836(82)90398-9","volume":"162","author":"O Gotoh","year":"1982","unstructured":"Gotoh O (1982) An improved algorithm for matching biological sequences. J Mol Biol 162:705\u2013708","journal-title":"J Mol Biol"},{"key":"6018_CR27","doi-asserted-by":"crossref","DOI":"10.1007\/BFb0103945","volume-title":"Foundations of quantization for probability distributions","author":"S Graf","year":"2000","unstructured":"Graf S, Lushgy H (2000) Foundations of quantization for probability distributions, vol 1730. Lect. Notes in Mathematics, Springer, Berlin"},{"key":"6018_CR28","doi-asserted-by":"crossref","unstructured":"Guan Q, Sadykov M, Nugmanova R, Carr M, Arold S, Pain A (2020) The genomic variation landscape of globally-circulating clades of SARS-CoV-2 defines a genetic barcoding scheme. bioRXiv, (2020.04.21.054221)","DOI":"10.1101\/2020.04.21.054221"},{"issue":"4","key":"6018_CR29","doi-asserted-by":"crossref","first-page":"709","DOI":"10.1002\/cjs.5550340410","volume":"34","author":"R Herbei","year":"2006","unstructured":"Herbei R, Wegkamp M (2006) Classification with reject option. Can J Stat 34(4):709\u2013721","journal-title":"Can J Stat"},{"key":"6018_CR30","unstructured":"Hinton G, Roweis S (2002) Stochastic neighbor embedding. Advances in Neural Information Processing Systems. volume 15. The MIT Press, Cambridge, MA, USA, pp 833\u2013840"},{"issue":"6","key":"6018_CR31","doi-asserted-by":"crossref","first-page":"615","DOI":"10.1089\/106652701753307511","volume":"8","author":"W Just","year":"2001","unstructured":"Just W (2001) Computational complexity of multiple sequence alignment with SP-score. J Comput Biol 8(6):615\u2013623","journal-title":"J Comput Biol"},{"issue":"2","key":"6018_CR32","doi-asserted-by":"crossref","first-page":"79","DOI":"10.2478\/fcds-2014-0006","volume":"39","author":"M Kaden","year":"2014","unstructured":"Kaden M, Lange M, Nebel D, Riedel M, Geweniger T, Villmann T (2014) Aspects in classification learning\u2014review of recent developments in learning vector quantization. Found Comput Decis Sci 39(2):79\u2013105","journal-title":"Found Comput Decis Sci"},{"issue":"4","key":"6018_CR33","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1093\/molbev\/mst010","volume":"30","author":"K Katoh","year":"2013","unstructured":"Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30(4):772\u2013780","journal-title":"Mol Biol Evol"},{"issue":"Supplement 1","key":"6018_CR34","first-page":"303","volume":"1","author":"T Kohonen","year":"1988","unstructured":"Kohonen T (1988) Learning vector quantization. Neural Netw 1(Supplement 1):303","journal-title":"Neural Netw"},{"issue":"1","key":"6018_CR35","first-page":"1","volume":"1","author":"A Kolmogorov","year":"1965","unstructured":"Kolmogorov A (1965) Three approaches to the quantitative definition of information. Probl Inf Transm 1(1):1\u20137","journal-title":"Probl Inf Transm"},{"key":"6018_CR36","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1214\/aoms\/1177729694","volume":"22","author":"S Kullback","year":"1951","unstructured":"Kullback S, Leibler R (1951) On information and sufficiency. Annal Math Stat 22:79\u201386","journal-title":"Annal Math Stat"},{"key":"6018_CR37","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-020-2180-5)","author":"J Lan","year":"2020","unstructured":"Lan J, Ge J, Yu J, Shan S, Fan HZS, Zhang Q, Shi X, Wang Q, Zhang L, Wang X (2020) Structure of the SARS-CoV-2 spike receptorbinding domain bound to the ACE2 receptor. Nature. https:\/\/doi.org\/10.1038\/s41586-020-2180-5)","journal-title":"Nature"},{"issue":"4","key":"6018_CR38","first-page":"845","volume":"163","author":"V Levenshtein","year":"1965","unstructured":"Levenshtein V (1965) Binary codes capable of correcting deletions, insertions, and reversals. Doklady Akademii Nauk SSSR 163(4):845\u2013848","journal-title":"Doklady Akademii Nauk SSSR"},{"key":"6018_CR39","doi-asserted-by":"crossref","unstructured":"Li J, Song S, Zhang Y, Zhou Z (2016) Robust $$k$$-median and $$k$$-means clustering algorithms for incomplete data. Mathematical Problems in Engineering, 2016(Article ID 4321928):1\u20138","DOI":"10.1155\/2016\/4321928"},{"issue":"12","key":"6018_CR40","doi-asserted-by":"crossref","first-page":"3250","DOI":"10.1109\/TIT.2004.838101","volume":"50","author":"M Li","year":"2004","unstructured":"Li M, Chen X, Li X, Ma B, Vit\u00e1nyi P (2004) The similarity metric. IEEE Trans Inf Theory 50(12):3250\u20133264","journal-title":"IEEE Trans Inf Theory"},{"key":"6018_CR41","unstructured":"Li Y, Liu B, Cui J, Wang Z, Shen Y, Xu Y, Yao K, Guan Y (2020) Similarities and evolutionary relationships of COVID-19 and related viruses. arXiv, (2003.05580)"},{"key":"6018_CR42","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/j.ympev.2016.03.009","volume":"99","author":"Y Li","year":"2016","unstructured":"Li Y, Tian K, Yin C, He R, Yau S-T (2016) Virus classification in 60-dimensional protein space. Mol Phylogenetics Evol 99:53\u201362","journal-title":"Mol Phylogenetics Evol"},{"issue":"10","key":"6018_CR43","doi-asserted-by":"crossref","first-page":"1682","DOI":"10.1093\/bioinformatics\/btx809","volume":"34","author":"J Lin","year":"2018","unstructured":"Lin J, Adjeroh D, Jiang B-H, Jiang Y (2018) $$k_2$$ and $$k_2^*$$?: efficient alignment-free sequence similarity measurement based on Kendall statistics. Bioinformatics 34(10):1682\u20131689","journal-title":"Bioinformatics"},{"issue":"10224","key":"6018_CR44","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1016\/S0140-6736(20)30251-8","volume":"395","author":"R Lu","year":"2020","unstructured":"...Lu R, Zhao X, Juan L, Niu P, Yang B, Wu H, Wang W, Song H, Huang B, Zhu N, Bi Y, Ma X, Zhan F, Wang L, Hu T, Zhou H, Hu Z, Zhou W, Zhao L, Chen J, Meng Y, Wang J, Yuan YL, Xie Z, Ma J, Liu W, Wang D, Xu W, Holmes E, Gao G, Wu G, Chen W, Shi W, Tan W (2020) Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet 395(10224):565\u2013574","journal-title":"Lancet"},{"issue":"4","key":"6018_CR45","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","volume":"17","author":"UV Luxburg","year":"2007","unstructured":"Luxburg UV (2007) A tutorial on spectral clustering. Stat Comput 17(4):395\u2013416","journal-title":"Stat Comput"},{"key":"6018_CR46","unstructured":"Mackay D (2003) Information Theory. Cambridge University Press, Inference and Learning Algorithms"},{"issue":"1","key":"6018_CR47","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1186\/s12859-018-2357-1","volume":"19","author":"M Maiolo","year":"2018","unstructured":"Maiolo M, Zhang X, Gil M, Anisimova M (2018) Progressive multiple sequence alignment with indel evolution. BMC Bioinform 19(1):331","journal-title":"BMC Bioinform"},{"issue":"4","key":"6018_CR48","doi-asserted-by":"crossref","first-page":"558","DOI":"10.1109\/72.238311","volume":"4","author":"TM Martinetz","year":"1993","unstructured":"Martinetz TM, Berkovich SG, Schulten KJ (1993) Neural-gas network for vector quantization and its application to time-series prediction. IEEE Trans Neural Netw 4(4):558\u2013569","journal-title":"IEEE Trans Neural Netw"},{"key":"6018_CR49","unstructured":"Miyamoto S, Ichihashi H, Honda K (2008) Algorithms for Fuzzy Clustering, volume 229 of Studies in Fuzziness and Soft Computing. Springer"},{"issue":"9","key":"6018_CR50","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1016\/j.neucom.2010.10.016","volume":"74","author":"E Mwebaze","year":"2011","unstructured":"Mwebaze E, Schneider P, Schleif F-M, Aduwo J, Quinn J, Haase S, Villmann T, Biehl M (2011) Divergence based classification in learning vector quantization. Neurocomputing 74(9):1429\u20131435","journal-title":"Neurocomputing"},{"issue":"3","key":"6018_CR51","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1016\/0022-2836(70)90057-4","volume":"48","author":"S Needleman","year":"1970","unstructured":"Needleman S, Wunsch C (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48(3):443\u2013453","journal-title":"J Mol Biol"},{"key":"6018_CR52","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1109\/34.391396","volume":"17","author":"KL Oehler","year":"1995","unstructured":"Oehler KL, Gray RM (1995) Combining image compression and classification using vector quantization. IEEE Trans Pattern Anal Mach Intell 17:461\u2013473","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"6018_CR53","doi-asserted-by":"crossref","unstructured":"Paden C, Tao Y, Queen K, Zhang J, Li Y, Uehara A, Tong S (2020) Rapid, sensitive, full genome sequencing of severe acute respiratory syndrome virus coronavirus 2 (SARS-CoV-2). bioRXiv, (2020.04.22.055897)","DOI":"10.1101\/2020.04.22.055897"},{"key":"6018_CR54","volume-title":"Probabilistic reasoning in intelligent system","author":"J Pearl","year":"1988","unstructured":"Pearl J (1988) Probabilistic reasoning in intelligent system. Morgan Kaufmann, Burlington"},{"key":"6018_CR55","first-page":"81","volume":"1","author":"J Quinlan","year":"1986","unstructured":"Quinlan J (1986) Induction of decision trees. Mach Learn 1:81\u2013106","journal-title":"Mach Learn"},{"key":"6018_CR56","unstructured":"Quinlan J (1993) C4.5: Programs for Machine Learning. Morgan Kaufmann"},{"key":"6018_CR57","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1214\/aoms\/1177729586","volume":"22","author":"H Robbins","year":"1951","unstructured":"Robbins H, Monro S (1951) A stochastic approximation method. Ann Math Stat 22:400\u2013407","journal-title":"Ann Math Stat"},{"issue":"5","key":"6018_CR58","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1038\/s42256-019-0048-x","volume":"1","author":"C Rudin","year":"2019","unstructured":"Rudin C (2019) Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell 1(5):206\u2013215","journal-title":"Nat Mach Intell"},{"key":"6018_CR59","doi-asserted-by":"crossref","unstructured":"Samek W, Monatvon G, Vedaldi A, Hansen L, M\u00fcller K-R (eds) (2019) Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, number 11700 in LNAI. Springer","DOI":"10.1007\/978-3-030-28954-6"},{"key":"6018_CR60","doi-asserted-by":"crossref","unstructured":"Saralajew S, Holdijk L, Rees M, Villmann T (2019) Robustness of generalized learning vector quantization models against adversarial attacks. In: Vellido A, Gibert K, Angulo C, Guerrero J (Eds) Advances in Self-Organizing Maps. Learning Vector Quantization, Clustering and Data Visualization. In: Proceedings of the 13th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization, WSOM+2019, Barcelona, volume 976 of Advances in Intelligent Systems and Computing. Springer, Berlin-Heidelberg, pp 189\u2013199","DOI":"10.1007\/978-3-030-19642-4_19"},{"key":"6018_CR61","unstructured":"Saralajew S, Holdijk L, Villmann T (2020) Fast adversarial robustness certification of nearest prototype classifiers for arbitrary seminorms. In: Proceedings of the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), page in press. MIT Press"},{"key":"6018_CR62","unstructured":"Sato A, Yamada K (1996) Generalized learning vector quantization. In: Touretzky DS, Mozer MC, Hasselmo ME (Eds) Advances in Neural Information Processing Systems 8. In: Proceedings of the (1995) Conference. MIT Press, Cambridge, MA, USA, pp 423\u20139"},{"issue":"6","key":"6018_CR63","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1142\/S012906571100295X","volume":"21","author":"F-M Schleif","year":"2011","unstructured":"Schleif F-M, Villmann T, Hammer B, Schneider P (2011) Efficient kernelized prototype based classification. Int J Neural Syst 21(6):443\u2013457","journal-title":"Int J Neural Syst"},{"issue":"5","key":"6018_CR64","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1109\/TNN.2010.2042729","volume":"21","author":"P Schneider","year":"2010","unstructured":"Schneider P, Bunte K, Stiekema H, Hammer B, Villmann T, Biehl M (2010) Regularization in matrix relevance learning. IEEE Trans Neural Netw 21(5):831\u2013840","journal-title":"IEEE Trans Neural Netw"},{"key":"6018_CR65","doi-asserted-by":"crossref","first-page":"3532","DOI":"10.1162\/neco.2009.11-08-908","volume":"21","author":"P Schneider","year":"2009","unstructured":"Schneider P, Hammer B, Biehl M (2009) Adaptive relevance matrices in learning vector quantization. Neural Comput 21:3532\u20133561","journal-title":"Neural Comput"},{"key":"6018_CR66","doi-asserted-by":"crossref","first-page":"2942","DOI":"10.1162\/neco.2009.10-08-892","volume":"21","author":"P Schneider","year":"2009","unstructured":"Schneider P, Hammer B, Biehl M (2009) Distance learning in discriminative vector quantization. Neural Comput 21:2942\u20132969","journal-title":"Neural Comput"},{"issue":"122","key":"6018_CR67","first-page":"1","volume":"8","author":"A Sievers","year":"2017","unstructured":"Sievers A, Bosiek K, Bisch M, Dreessen C, Riedel J, Fro\u00df P, Hausmann M, Hildenbrand G (2017) $$k$$-mer content, correlation, and position analysis of genome DNA sequences for the identification of function and evolutionary features. Genes 8(122):1\u201318","journal-title":"Genes"},{"issue":"1","key":"6018_CR68","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","volume":"147","author":"T Smith","year":"1981","unstructured":"Smith T, Watermann M (1981) Identification of common molecular subsequences. J Mol Biol 147(1):195\u2013197","journal-title":"J Mol Biol"},{"issue":"1","key":"6018_CR69","first-page":"1","volume":"9","author":"Y Sun","year":"2019","unstructured":"Sun Y, Zhu S, Ma K, Liu W, Yue Y, Hu G, Lu H, Chen W (2019) Identification of 12 cancer types through genome deep learning. Nat Sci Rep 9(1):1\u20139","journal-title":"Nat Sci Rep"},{"issue":"7","key":"6018_CR70","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pone.0180827","volume":"12","author":"N Szostak","year":"2017","unstructured":"Szostak N, Synak J, Borowski M, Wasik S, Blazewicz J (2017) Simulating the origins of life: the dual role of RNA replicases as an obstacle to evolution. PLoS ONE 12(7):1\u201328","journal-title":"PLoS ONE"},{"issue":"4","key":"6018_CR71","doi-asserted-by":"crossref","first-page":"e1004853","DOI":"10.1371\/journal.pcbi.1004853","volume":"12","author":"N Szostak","year":"2016","unstructured":"Szostak N, Wasik S, Blazewicz J (2016) Hypercycle. PLOS Comput Biol 12(4):e1004853","journal-title":"PLOS Comput Biol"},{"issue":"9","key":"6018_CR72","doi-asserted-by":"crossref","first-page":"e0222271","DOI":"10.1371\/journal.pone.0222271","volume":"14","author":"A Tampuu","year":"2019","unstructured":"Tampuu A, Bzhalava Z, Dillner J, Vicente R (2019) ViraMiner: deep learning on raw DNA sequences for identifying viral genomes in human samples. Plose One 14(9):e0222271","journal-title":"Plose One"},{"issue":"22","key":"6018_CR73","doi-asserted-by":"crossref","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","volume":"22","author":"JD Thompson","year":"1994","unstructured":"Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22(22):4673\u20134680","journal-title":"Nucleic Acids Res"},{"key":"6018_CR74","first-page":"2579","volume":"9","author":"L van der Maaten","year":"2008","unstructured":"van der Maaten L, Hinton G (2008) Visualizing high-dimensional data using $$t$$-SNE. J Mach Learn Res 9:2579\u20132605","journal-title":"J Mach Learn Res"},{"key":"6018_CR75","doi-asserted-by":"crossref","unstructured":"Vasilarou M, Alachiotis N, Garefalaki J, Beloukas A, Pavlidis P (2020) Population genomics insights into the recent evolution of SARS-CoV-2. bioRXiv, (2020.04.21.054122)","DOI":"10.1101\/2020.04.21.054122"},{"key":"6018_CR76","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-019-04051-w","author":"A Vellido","year":"2019","unstructured":"Vellido A (2019) The importance of interpretability and visualization in machine learning for applications in medicine and health care. Neural Netw Appl. https:\/\/doi.org\/10.1007\/s00521-019-04051-w","journal-title":"Neural Netw Appl"},{"issue":"1","key":"6018_CR77","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1515\/jaiscr-2017-0005","volume":"7","author":"T Villmann","year":"2017","unstructured":"Villmann T, Bohnsack A, Kaden M (2017) Can learning vector quantization be an alternative to SVM and deep learning? J Artif Intell Soft Comput Res 7(1):65\u201381","journal-title":"J Artif Intell Soft Comput Res"},{"issue":"2","key":"6018_CR78","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1162\/089976606775093918","volume":"18","author":"T Villmann","year":"2006","unstructured":"Villmann T, Claussen J-C (2006) Magnification control in self-organizing maps and neural gas. Neural Comput 18(2):446\u2013469","journal-title":"Neural Comput"},{"issue":"5","key":"6018_CR79","doi-asserted-by":"crossref","first-page":"1343","DOI":"10.1162\/NECO_a_00110","volume":"23","author":"T Villmann","year":"2011","unstructured":"Villmann T, Haase S (2011) Divergence based vector quantization. Neural Comput 23(5):1343\u20131392","journal-title":"Neural Comput"},{"key":"6018_CR80","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.neucom.2013.11.048","volume":"147","author":"T Villmann","year":"2015","unstructured":"Villmann T, Haase S, Kaden M (2015) Kernelized vector quantization in gradient-descent learning. Neurocomputing 147:83\u201395","journal-title":"Neurocomputing"},{"key":"6018_CR81","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-020-05517-y","author":"T Villmann","year":"2020","unstructured":"Villmann T, Ravichandran J, Engelsberger A, Villmann A, Kaden M (2020) Quantum-inspired learning vector quantizers for prototype-based classification. Neural Comput Appl. https:\/\/doi.org\/10.1007\/s00521-020-05517-y","journal-title":"Neural Comput Appl"},{"key":"6018_CR82","unstructured":"Villmann T, Saralajew S, Villmann A, Kaden M (2018) Learning vector quantization methods for interpretable classification learning and multilayer networks. In C.\u00a0Sabourin, J.\u00a0Merelo, A.\u00a0Barranco, K.\u00a0Madani, and K.\u00a0Warwick (Eds). Proceedings of the 10th International Joint Conference on Computational Intelligence (IJCCI), Sevilla"},{"issue":"3","key":"6018_CR83","first-page":"376","volume":"15","author":"S Vinga","year":"2004","unstructured":"Vinga S (2004) Information theory applications for biological sequence analysis. Bioinformatics 15(3):376\u2013389","journal-title":"Bioinformatics"},{"issue":"2","key":"6018_CR84","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1093\/bioinformatics\/btg392","volume":"20","author":"S Vinga","year":"2004","unstructured":"Vinga S, Almeida J (2004) Alignment-free sequence comparison\u2014a review. Bioinformatics 20(2):206\u2013215","journal-title":"Bioinformatics"},{"key":"6018_CR85","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1016\/j.jtbi.2004.06.030","volume":"231","author":"S Vinga","year":"2004","unstructured":"Vinga S, Almeida J (2004) R\u00e9nyi continuous entropy of DNA sequences. J Theor Biol 231:377\u2013388","journal-title":"J Theor Biol"},{"key":"6018_CR86","doi-asserted-by":"crossref","DOI":"10.1017\/9781316882313","volume-title":"Computational phylogenetics: an introduction to designing methods for phylogeny estimation","author":"T Warrow","year":"2017","unstructured":"Warrow T (2017) Computational phylogenetics: an introduction to designing methods for phylogeny estimation. Cambridge University Press, Cambridge"},{"key":"6018_CR87","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1016\/j.jtbi.2018.12.018","volume":"463","author":"S Wasik","year":"2019","unstructured":"Wasik S, Szostak N, Kudla M, Wachowiak M, Krawiec K, Blazewicz J (2019) Detecting life signatures with RNA sequence similarity measures. J Theor Biol 463:110\u2013120","journal-title":"J Theor Biol"},{"key":"6018_CR88","doi-asserted-by":"crossref","first-page":"689","DOI":"10.1016\/S0140-6736(20)30260-9","volume":"395","author":"J Wu","year":"2020","unstructured":"Wu J, Leung K, Leung G (2020) Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet 395:689\u2013697","journal-title":"Lancet"},{"key":"6018_CR89","doi-asserted-by":"crossref","unstructured":"Yang H-C, Chen C-H, Wang J-H, Liao H-C, Yang C-T, Chen C-W, Lin Y-C, Kao C-H, Liao J (2020) Genomic, geographic and temporal distributions of SARS-CoV-2 mutations. bioRXiv, (2020.04.22.055863)","DOI":"10.1101\/2020.04.22.055863"},{"key":"6018_CR90","doi-asserted-by":"crossref","unstructured":"Yin C (2020) Genotyping coronavirus SARS-CoV-2: methods and implications. arXiv, (2003.10965v1)","DOI":"10.1016\/j.ygeno.2020.04.016"},{"key":"6018_CR91","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1016\/j.jtbi.2014.05.043","volume":"359","author":"C Yin","year":"2014","unstructured":"Yin C, Chen Y, Yau S-T (2014) A measure of DNA sequence similarity by Fourier Transform with applications on hierarchical clustering. J Theor Biol 359:18\u201328","journal-title":"J Theor Biol"},{"issue":"5","key":"6018_CR92","doi-asserted-by":"crossref","first-page":"e64328","DOI":"10.1371\/journal.pone.0064328","volume":"8","author":"C Yu","year":"2013","unstructured":"Yu C, Hernandez T, Zheng H, Yau S-C, Huang H-H, He R, Yang J, Yau S-T (2013) Real time classification of viruses in $$12$$ dimensions. Plos One 8(5):e64328","journal-title":"Plos One"},{"key":"6018_CR93","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/rssa.12227","volume":"180","author":"J Zeng","year":"2017","unstructured":"Zeng J, Ustun B, Rudin C (2017) Interpretable classification models for recidivism prediction. J R Stat Soc Series A 180:1\u201334","journal-title":"J R Stat Soc Series A"},{"issue":"144","key":"6018_CR94","first-page":"1","volume":"20","author":"A Zielezinski","year":"2019","unstructured":"Zielezinski A, Girgis H, Bernard G, Leimeister C-A, Tang K, Dencker T, Lau A, R\u00f6hling S, Choi J, Waterman M, Comin M, Kim S-H, Vinga S, Almeida J, Chan C, James B, Sun F, Morgenstern B, Karlowski W (2019) Benchmarking of alignment-free sequence comparison methods. Genome Biol 20(144):1\u201318","journal-title":"Genome Biol"},{"issue":"186","key":"6018_CR95","first-page":"1","volume":"18","author":"A Zielezinski","year":"2017","unstructured":"Zielezinski A, Vinga S, Almeida J, Karlowski W (2017) Alignment-free sequence comparison: benefits, applications, and tools. Genome Biol 18(186):1\u201317","journal-title":"Genome Biol"}],"container-title":["Neural Computing and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-021-06018-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00521-021-06018-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-021-06018-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,1,10]],"date-time":"2022-01-10T11:12:57Z","timestamp":1641813177000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00521-021-06018-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,27]]},"references-count":95,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,1]]}},"alternative-id":["6018"],"URL":"https:\/\/doi.org\/10.1007\/s00521-021-06018-2","relation":{},"ISSN":["0941-0643","1433-3058"],"issn-type":[{"value":"0941-0643","type":"print"},{"value":"1433-3058","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,4,27]]},"assertion":[{"value":"13 July 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 April 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 April 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}