{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T21:57:41Z","timestamp":1781560661333,"version":"3.54.5"},"reference-count":125,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,5,6]],"date-time":"2024-05-06T00:00:00Z","timestamp":1714953600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,5,6]],"date-time":"2024-05-06T00:00:00Z","timestamp":1714953600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100018996","name":"Hellenic Academic Libraries Link","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100018996","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Dioni: Computing Infrastructure for Big-Data Processing and Analysis"},{"DOI":"10.13039\/501100021855","name":"University of Ioannina","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100021855","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Netw Model Anal Health Inform Bioinforma"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The decoding of the human genome, completed two decades ago, marked a revolutionary moment in biology by introducing a vast amount of data. This avalanche of information presented several computational challenges. Machine Learning has become the dominant method to address these challenges, with Natural Language Processing playing a significant role and offering promising results. In this systematic review, we will explore the application of Machine Learning and Natural Language Processing to the study of biological data. On the one hand, Machine Learning is widely used in Artificial Intelligence to improve automation, carry out tasks that require no human interaction, and perform analytical and physical activities. It helps advance our understanding of biology and improve healthcare and drug development processes in bioinformatics. On the other hand, improved machine-human language interaction is the aim of Natural Language Processing. Its three main goals are character sequence processing, pattern recognition, and algorithm development. The use of Natural Language Processing is becoming increasingly important for the analysis of omics data using both modern and conventional Machine Learning models, underscoring the necessity for a systematic review. In this work, 82 studies were included following the PRISMA guidelines, sourced from PubMed, Scopus and IEEE Xplore on April 4th, 2023. The evaluation of the publications was based on the type of the studied biological data and the employed NLP techniques. Through our in-depth exploration of NLP approaches, we highlight their significance and potential in advancing the field of bioinformatics.<\/jats:p>","DOI":"10.1007\/s13721-024-00458-1","type":"journal-article","created":{"date-parts":[[2024,5,6]],"date-time":"2024-05-06T12:01:59Z","timestamp":1714996919000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["How natural language processing derived techniques are used on biological data: a systematic review"],"prefix":"10.1007","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-9227-1365","authenticated-orcid":false,"given":"Emmanouil D.","family":"Oikonomou","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0483-4868","authenticated-orcid":false,"given":"Petros","family":"Karvelis","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0615-783X","authenticated-orcid":false,"given":"Nikolaos","family":"Giannakeas","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1892-0000","authenticated-orcid":false,"given":"Aristidis","family":"Vrachatis","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5604-3507","authenticated-orcid":false,"given":"Evripidis","family":"Glavas","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9043-1290","authenticated-orcid":false,"given":"Alexandros T.","family":"Tzallas","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,5,6]]},"reference":[{"key":"458_CR1","doi-asserted-by":"publisher","unstructured":"Albawi S, Mohammed TA, Al-Zawi S (2018) Understanding of a convolutional neural network. Proceedings of 2017 International Conference on Engineering and Technology, ICET 2017, 2018-January, 1\u20136. https:\/\/doi.org\/10.1109\/ICENGTECHNOL.2017.8308186","DOI":"10.1109\/ICENGTECHNOL.2017.8308186"},{"issue":"4","key":"458_CR2","doi-asserted-by":"publisher","first-page":"2525","DOI":"10.1007\/S11277-023-10244-3\/TABLES\/4","volume":"129","author":"MJ Al-Dujaili","year":"2023","unstructured":"Al-Dujaili MJ, Ebrahimi-Moghadam A (2023) Speech emotion recognition: a comprehensive survey. Wireless Pers Commun 129(4):2525\u20132561. https:\/\/doi.org\/10.1007\/S11277-023-10244-3\/TABLES\/4","journal-title":"Wireless Pers Commun"},{"key":"458_CR3","doi-asserted-by":"publisher","unstructured":"Ali S, Patterson M (2021) Spike2Vec: An Efficient and Scalable Embedding Approach for COVID-19 Spike Sequences. Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021, 1533\u20131540. https:\/\/doi.org\/10.1109\/BIGDATA52589.2021.9671848","DOI":"10.1109\/BIGDATA52589.2021.9671848"},{"issue":"3","key":"458_CR4","doi-asserted-by":"publisher","first-page":"418","DOI":"10.3390\/BIOLOGY11030418","volume":"11","author":"S Ali","year":"2022","unstructured":"Ali S, Bello B, Chourasia P, Punathil RT, Zhou Y, Patterson M (2022) PWM2Vec: an efficient embedding approach for viral host specification from coronavirus spike sequences. Biology 11(3):418. https:\/\/doi.org\/10.3390\/BIOLOGY11030418","journal-title":"Biology"},{"key":"458_CR5","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1007\/978-0-387-72430-0_6","volume-title":"Molecular Pathology of Lung Diseases","author":"TC Allen","year":"2008","unstructured":"Allen TC, Cagle PT (2008) Bioinformatics and omics. In: Zander DS, Popper HH, Jagirdar J, Haque AK, Cagle PT, Barrios R (eds) Molecular Pathology of Lung Diseases. Springer, New York, pp 65\u201369. https:\/\/doi.org\/10.1007\/978-0-387-72430-0_6"},{"issue":"1","key":"458_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/S42600-022-00245-9","volume":"39","author":"LM Alves","year":"2023","unstructured":"Alves LM, C\u00f4co KF, de Souza ML, Ciarelli PM (2023) Contextual microstates: an approach based on word embedding of microstates sequence to identify ADHD patients. Rec Biom Eng 39(1):1\u201313. https:\/\/doi.org\/10.1007\/S42600-022-00245-9","journal-title":"Rec Biom Eng"},{"issue":"1","key":"458_CR7","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1186\/S12859-022-04650-W","volume":"23","author":"E Amiri Souri","year":"2022","unstructured":"Amiri Souri E, Laddach R, Karagiannis SN, Papageorgiou LG, Tsoka S (2022) Novel drug-target interactions via link prediction and network embedding. BMC Bioinformatics 23(1):121. https:\/\/doi.org\/10.1186\/S12859-022-04650-W","journal-title":"BMC Bioinformatics"},{"issue":"2","key":"458_CR8","doi-asserted-by":"publisher","first-page":"1621","DOI":"10.1007\/S11277-022-09812-W\/TABLES\/1","volume":"126","author":"AA Anthony","year":"2022","unstructured":"Anthony AA, Patil CM, Basavaiah J (2022) A Review on speech disorders and processing of disordered speech. Wireless Pers Commun 126(2):1621\u20131631. https:\/\/doi.org\/10.1007\/S11277-022-09812-W\/TABLES\/1","journal-title":"Wireless Pers Commun"},{"issue":"1","key":"458_CR9","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1007\/S40747-022-00802-W","volume":"9","author":"MN Asim","year":"2023","unstructured":"Asim MN, Ibrahim MA, Malik MI, Razzak I, Dengel A, Ahmed S (2023) Histone-net: a multi-paradigm computational framework for histone occupancy and modification prediction. Com Intel Syst 9(1):399\u2013419. https:\/\/doi.org\/10.1007\/S40747-022-00802-W","journal-title":"Com Intel Syst"},{"issue":"12","key":"458_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3390\/GENES11121475","volume":"11","author":"MN Asim","year":"2020","unstructured":"Asim MN, Malik MI, Zehe C, Trygg J, Dengel A, Ahmed S (2020) Mirlocpredictor: a convnet-based multi-label microrna subcellular localization predictor by incorporating k-mer positional information. Genes 11(12):1\u201323. https:\/\/doi.org\/10.3390\/GENES11121475","journal-title":"Genes"},{"issue":"6","key":"458_CR11","doi-asserted-by":"publisher","first-page":"822","DOI":"10.1093\/BIOINFORMATICS\/BTAA906","volume":"37","author":"ARMA Basher","year":"2021","unstructured":"Basher ARMA, Hallam SJ (2021) Leveraging heterogeneous network embedding for metabolic pathway prediction. Bioinformatics 37(6):822\u2013829. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTAA906","journal-title":"Bioinformatics"},{"issue":"D1","key":"458_CR12","doi-asserted-by":"publisher","first-page":"S36","DOI":"10.1093\/NAR\/GKS1195","volume":"41","author":"DA Benson","year":"2013","unstructured":"Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW (2013) GenBank. Nucleic Acids Res 41(D1):S36-D42. https:\/\/doi.org\/10.1093\/NAR\/GKS1195","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"458_CR13","doi-asserted-by":"publisher","first-page":"616","DOI":"10.1002\/PROT.25842","volume":"88","author":"DWA Buchan","year":"2020","unstructured":"Buchan DWA, Jones DT (2020) Learning a functional grammar of protein domains using natural language word embedding techniques. Proteins: Struct, Funct, Bioinf 88(4):616\u2013624. https:\/\/doi.org\/10.1002\/PROT.25842","journal-title":"Proteins: Struct, Funct, Bioinf"},{"key":"458_CR14","doi-asserted-by":"publisher","first-page":"893520","DOI":"10.3389\/FONC.2022.893520\/FULL","volume":"12","author":"L Cao","year":"2022","unstructured":"Cao L, Liu P, Chen J, Deng L (2022) Prediction of transcription factor binding sites using a combined deep learning approach. Front Oncol 12:893520. https:\/\/doi.org\/10.3389\/FONC.2022.893520\/FULL","journal-title":"Front Oncol"},{"key":"458_CR15","doi-asserted-by":"publisher","first-page":"4378413","DOI":"10.1155\/2022\/4378413","volume":"202","author":"CT Chao","year":"2022","unstructured":"Chao CT, Tsai YT, Lee WT, Yeh HY, Chiang CK (2022) Deep learning-assisted repurposing of plant compounds for treating vascular calcification: an in silico study with experimental validation. Oxid Med Cell Longev 202:4378413. https:\/\/doi.org\/10.1155\/2022\/4378413","journal-title":"Oxid Med Cell Longev"},{"key":"458_CR16","doi-asserted-by":"publisher","unstructured":"Chen T, Guestrin C (n.d.). XGBoost: A Scalable Tree Boosting System. https:\/\/doi.org\/10.1145\/2939672.2939785","DOI":"10.1145\/2939672.2939785"},{"issue":"5","key":"458_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/BIB\/BBAC319","volume":"23","author":"S Chen","year":"2022","unstructured":"Chen S, Li Q, Zhao J, Bin Y, Zheng C (2022) NeuroPred-CLQ: incorporating deep temporal convolutional networks and multi-head attention mechanism to predict neuropeptides. Brief Bioinform 23(5):1\u201312. https:\/\/doi.org\/10.1093\/BIB\/BBAC319","journal-title":"Brief Bioinform"},{"key":"458_CR18","doi-asserted-by":"publisher","unstructured":"Cheng J, Wang Z, Liu Y, Huang W (2021) CapBind: Prediction of transcription factor binding sites based on capsule network. Proceedings - 2021 6th International Conference on Computational Intelligence and Applications, ICCIA 2021, 31\u201335. https:\/\/doi.org\/10.1109\/ICCIA52886.2021.00014","DOI":"10.1109\/ICCIA52886.2021.00014"},{"key":"458_CR19","unstructured":"Chung J, Gulcehre C, Cho K (n.d.). Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling."},{"key":"458_CR20","doi-asserted-by":"publisher","first-page":"1546","DOI":"10.3389\/FMED.2022.911861\/BIBTEX","volume":"9","author":"X Dai","year":"2022","unstructured":"Dai X, Shen L (2022) Advances and trends in omics technology development. Front Med 9:1546. https:\/\/doi.org\/10.3389\/FMED.2022.911861\/BIBTEX","journal-title":"Front Med"},{"issue":"4","key":"458_CR21","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1089\/BIG.2020.0383\/ASSET\/IMAGES\/LARGE\/BIG.2020.0383_FIGURE1.JPEG","volume":"10","author":"G Dallalba","year":"2022","unstructured":"Dallalba G, Casa PL, De Abreu FP, Notari DL, De Avila E, Silva S (2022) A survey of biological data in a big data perspective. Big Data 10(4):279\u2013297. https:\/\/doi.org\/10.1089\/BIG.2020.0383\/ASSET\/IMAGES\/LARGE\/BIG.2020.0383_FIGURE1.JPEG","journal-title":"Big Data"},{"key":"458_CR22","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1186\/S12864-018-5370-X","volume":"20","author":"J Du","year":"2019","unstructured":"Du J, Jia P, Dai Y, Tao C, Zhao Z, Zhi D (2019) Gene2vec: distributed representation of genes based on co-expression. BMC Genomics 20:82. https:\/\/doi.org\/10.1186\/S12864-018-5370-X","journal-title":"BMC Genomics"},{"issue":"2","key":"458_CR23","doi-asserted-by":"publisher","first-page":"bbac003","DOI":"10.1093\/BIB\/BBAC003","volume":"23","author":"AA Edera","year":"2022","unstructured":"Edera AA, Milone DH, Stegmayer G (2022) Anc2vec: embedding gene ontology terms by preserving ancestors relationships. Brief Bioinform 23(2):bbac003. https:\/\/doi.org\/10.1093\/BIB\/BBAC003","journal-title":"Brief Bioinform"},{"key":"458_CR24","doi-asserted-by":"publisher","unstructured":"Forghani M, Khachay M, Firstkov A, Ramsay E (2022) An artificial neural network based ensemble model for predicting antigenic variants: application of reduced amino acid alphabets and word2Vec. Proceedings - 2022 8th International Iranian Conference on Signal Processing and Intelligent Systems, ICSPIS 2022. https:\/\/doi.org\/10.1109\/ICSPIS56952.2022.10044061","DOI":"10.1109\/ICSPIS56952.2022.10044061"},{"issue":"1","key":"458_CR25","doi-asserted-by":"publisher","first-page":"240","DOI":"10.1109\/TCBB.2019.2937771","volume":"18","author":"J Gao","year":"2021","unstructured":"Gao J, Tian L, Lv T, Wang J, Song B, Hu X (2021) Protein2Vec: aligning multiple ppi networks with representation learning. IEEE\/ACM Trans Comput Biol Bioinf 18(1):240\u2013249. https:\/\/doi.org\/10.1109\/TCBB.2019.2937771","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"1","key":"458_CR26","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1186\/S12859-019-2914-2","volume":"20","author":"Z Gao","year":"2019","unstructured":"Gao Z, Fu G, Ouyang C, Tsutsui S, Liu X, Yang J, Gessner C, Foote B, Wild D, Ding Y, Yu Q (2019) Edge2vec: Representation learning using edge semantics for biomedical knowledge discovery. BMC Bioinformatics 20(1):306. https:\/\/doi.org\/10.1186\/S12859-019-2914-2","journal-title":"BMC Bioinformatics"},{"key":"458_CR27","unstructured":"G\u00f6nen M, Suleiman Khan, aaltofi A, Kaski S (2013) Kernelized Bayesian Matrix Factorization (Vol. 28, pp. 864\u2013872). PMLR. https:\/\/proceedings.mlr.press\/v28\/gonen13a.html"},{"key":"458_CR28","doi-asserted-by":"publisher","unstructured":"Grover A, Leskovec J (2018) node2vec: scalable feature learning for networks. KDD\u202f: Proceedings. International Conference on Knowledge Discovery & Data Mining, 2016, 855. https:\/\/doi.org\/10.1145\/2939672.2939754","DOI":"10.1145\/2939672.2939754"},{"issue":"5","key":"458_CR29","doi-asserted-by":"publisher","first-page":"bbac391","DOI":"10.1093\/BIB\/BBAC391","volume":"23","author":"LX Guo","year":"2022","unstructured":"Guo LX, You ZH, Wang L, Yu CQ, Zhao BW, Ren ZH, Pan J (2022) A novel circRNA-miRNA association prediction model based on structural deep neural network embedding. Brief Bioinform 23(5):bbac391. https:\/\/doi.org\/10.1093\/BIB\/BBAC391","journal-title":"Brief Bioinform"},{"key":"458_CR30","doi-asserted-by":"publisher","first-page":"498","DOI":"10.1016\/J.OMTN.2019.10.046","volume":"19","author":"ZH Guo","year":"2020","unstructured":"Guo ZH, You ZH, Yi HC (2020) Integrative construction and analysis of molecular association network in human cells by fusing node attribute and behavior information. Mol Ther Nucleic Acids 19:498\u2013506. https:\/\/doi.org\/10.1016\/J.OMTN.2019.10.046","journal-title":"Mol Ther Nucleic Acids"},{"issue":"3","key":"458_CR31","doi-asserted-by":"publisher","first-page":"582","DOI":"10.3390\/GENES14030582","volume":"14","author":"RK Halder","year":"2023","unstructured":"Halder RK, Uddin MN, Uddin MdA, Aryal S, Islam MdA, Hossain F, Jahan N, Khraisat A, Alazab A (2023) A grid search-based multilayer dynamic ensemble system to identify dna n4\u2014methylcytosine using deep learning approach. Genes 14(3):582. https:\/\/doi.org\/10.3390\/GENES14030582","journal-title":"Genes"},{"key":"458_CR32","doi-asserted-by":"publisher","first-page":"117972","DOI":"10.1016\/J.ESWA.2022.117972","volume":"208","author":"MA Helaly","year":"2022","unstructured":"Helaly MA, Rady S, Aref MM (2022) BERT contextual embeddings for taxonomic classification of bacterial DNA sequences. Expert Syst Appl 208:117972. https:\/\/doi.org\/10.1016\/J.ESWA.2022.117972","journal-title":"Expert Syst Appl"},{"key":"458_CR33","doi-asserted-by":"publisher","unstructured":"Hussein R, Yang D, Cudr\u00e9-Mauroux P, Cudr\u00e9 P (n.d.). Are Meta-Paths Necessary? Revisiting Heterogeneous Graph Embeddings. 10. https:\/\/doi.org\/10.1145\/3269206.3271777","DOI":"10.1145\/3269206.3271777"},{"issue":"8","key":"458_CR34","doi-asserted-by":"publisher","first-page":"2269","DOI":"10.1093\/BIOINFORMATICS\/BTAC104","volume":"38","author":"I Ieremie","year":"2022","unstructured":"Ieremie I, Ewing RM, Niranjan M (2022) TransformerGO: predicting protein-protein interactions by modelling the attention between sets of gene ontology terms. Bioinformatics 38(8):2269\u20132277. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTAC104","journal-title":"Bioinformatics"},{"issue":"1","key":"458_CR35","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1021\/ACS.JCIM.7B00616","volume":"58","author":"S Jaeger","year":"2018","unstructured":"Jaeger S, Fulle S, Turk S (2018) Mol2vec: unsupervised machine learning approach with chemical intuition. J Chem Inf Model 58(1):27\u201335. https:\/\/doi.org\/10.1021\/ACS.JCIM.7B00616","journal-title":"J Chem Inf Model"},{"issue":"6","key":"458_CR36","doi-asserted-by":"publisher","first-page":"4052","DOI":"10.1016\/J.YGENO.2021.10.007","volume":"113","author":"QY Ji","year":"2021","unstructured":"Ji QY, Gong XJ, Li HM, Du PF (2021) DeepSE: detecting super-enhancers among typical enhancers using only sequence feature embeddings. Genomics 113(6):4052\u20134060. https:\/\/doi.org\/10.1016\/J.YGENO.2021.10.007","journal-title":"Genomics"},{"issue":"2065","key":"458_CR37","doi-asserted-by":"publisher","first-page":"20150202","DOI":"10.1098\/rsta.2015.0202","volume":"374","author":"IT Jolliffe","year":"2016","unstructured":"Jolliffe IT, Cadima J (2016) Principal component analysis: a review and recent developments. Philos Trans A Math Phys Eng Sci 374(2065):20150202. https:\/\/doi.org\/10.1098\/rsta.2015.0202","journal-title":"Philos Trans A Math Phys Eng Sci"},{"key":"458_CR38","doi-asserted-by":"publisher","first-page":"104122","DOI":"10.1016\/J.JBI.2022.104122","volume":"132","author":"PVM Joshi","year":"2022","unstructured":"Joshi PVM, Mukherjee A (2022) A knowledge graph embedding based approach to predict the adverse drug reactions using a deep neural network. J Biomed Inform 132:104122. https:\/\/doi.org\/10.1016\/J.JBI.2022.104122","journal-title":"J Biomed Inform"},{"issue":"1","key":"458_CR39","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/19768354.2020.1721321","volume":"24","author":"GT Jung","year":"2020","unstructured":"Jung GT, Kim KP, Kim K (2020) How to interpret and integrate multi-omics data at systems level. Anim Cells Syst 24(1):1. https:\/\/doi.org\/10.1080\/19768354.2020.1721321","journal-title":"Anim Cells Syst"},{"key":"458_CR40","doi-asserted-by":"publisher","first-page":"58369","DOI":"10.1109\/ACCESS.2020.2982666","volume":"8","author":"J Khanal","year":"2020","unstructured":"Khanal J, Tayara H, Chong KT (2020) Identifying enhancers and their strength by the integration of word embedding and convolution neural network. IEEE Access 8:58369\u201358376. https:\/\/doi.org\/10.1109\/ACCESS.2020.2982666","journal-title":"IEEE Access"},{"key":"458_CR126","doi-asserted-by":"publisher","unstructured":"Khanal J, Tayara H, Zou Q, Chong KT (2021) Identifying DNA N4-methylcytosine sites in the rosaceae genome with a deep learning model relying on distributed feature representation. Comput Struct Biotechnol J 19:1612\u20131619. https:\/\/doi.org\/10.1016\/j.csbj.2021.03.015","DOI":"10.1016\/j.csbj.2021.03.015"},{"key":"458_CR41","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1186\/S12920-018-0349-7","volume":"11","author":"S Kim","year":"2018","unstructured":"Kim S, Lee H, Kim K, Kang J (2018) Mut2Vec: distributed representation of cancerous mutations. BMC Med Genomics 11:33. https:\/\/doi.org\/10.1186\/S12920-018-0349-7","journal-title":"BMC Med Genomics"},{"key":"458_CR42","doi-asserted-by":"publisher","first-page":"107755","DOI":"10.1016\/J.COMPBIOLCHEM.2022.107755","volume":"101","author":"MB Koca","year":"2022","unstructured":"Koca MB, Nourani E, Abbaso\u011flu F, Karadeniz \u0130, Sevilgen FE (2022) Graph convolutional network based virus-human protein-protein interaction prediction for novel viruses. Comput Biol Chem 101:107755. https:\/\/doi.org\/10.1016\/J.COMPBIOLCHEM.2022.107755","journal-title":"Comput Biol Chem"},{"issue":"4","key":"458_CR43","doi-asserted-by":"publisher","first-page":"2869","DOI":"10.1007\/S11277-022-10076-7\/TABLES\/3","volume":"128","author":"N Kumari","year":"2023","unstructured":"Kumari N, Anwar S, Bhattacharjee V (2023) A comparative analysis of machine and deep learning techniques for EEG evoked emotion classification. Wireless Pers Commun 128(4):2869\u20132890. https:\/\/doi.org\/10.1007\/S11277-022-10076-7\/TABLES\/3","journal-title":"Wireless Pers Commun"},{"key":"458_CR44","doi-asserted-by":"crossref","unstructured":"Lau J H, Baldwin T (2016) An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation. 78\u201386. https:\/\/groups.google.com\/forum\/#!topic\/","DOI":"10.18653\/v1\/W16-1609"},{"issue":"5","key":"458_CR45","doi-asserted-by":"publisher","first-page":"e15708","DOI":"10.2196\/15708","volume":"23","author":"A Le Glaz","year":"2021","unstructured":"Le Glaz A, Haralambous Y, Kim-Dufor DH, Lenca P, Billot R, Ryan TC, Marsh J, DeVylder J, Walter M, Berrouiguet S, Lemey C (2021) Machine learning and natural language processing in mental health: systematic review. J Med Internet Res 23(5):e15708. https:\/\/doi.org\/10.2196\/15708","journal-title":"J Med Internet Res"},{"issue":"1","key":"458_CR46","doi-asserted-by":"publisher","first-page":"258","DOI":"10.1186\/S12859-022-04789-6","volume":"23","author":"Y Liang","year":"2022","unstructured":"Liang Y, Wu Y, Zhang Z, Liu N, Peng J, Tang J (2022) Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction. BMC Bioinformatics 23(1):258. https:\/\/doi.org\/10.1186\/S12859-022-04789-6","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"458_CR47","doi-asserted-by":"publisher","first-page":"480","DOI":"10.1186\/S12859-022-05033-X","volume":"23","author":"M Liao","year":"2022","unstructured":"Liao M, Jian-ping Z, Tian J, Zheng CH (2022) iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework. BMC Bioinformatics 23(1):480. https:\/\/doi.org\/10.1186\/S12859-022-05033-X","journal-title":"BMC Bioinformatics"},{"key":"458_CR48","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1016\/J.JTBI.2015.08.025","volume":"385","author":"B Liu","year":"2015","unstructured":"Liu B, Fang L, Wang S, Wang X, Li H, Chou KC (2015) Identification of microRNA precursor with the degenerate K-tuple or Kmer strategy. J Theor Biol 385:153\u2013159. https:\/\/doi.org\/10.1016\/J.JTBI.2015.08.025","journal-title":"J Theor Biol"},{"key":"458_CR49","doi-asserted-by":"publisher","first-page":"896925","DOI":"10.3389\/FGENE.2022.896925\/FULL","volume":"13","author":"HY Liu","year":"2022","unstructured":"Liu HY, Du PF (2022) i5hmCVec: Identifying 5-hydroxymethylcytosine sites of drosophila RNA using sequence feature embeddings. Front Genet 13:896925. https:\/\/doi.org\/10.3389\/FGENE.2022.896925\/FULL","journal-title":"Front Genet"},{"key":"458_CR50","doi-asserted-by":"publisher","first-page":"728333","DOI":"10.3389\/FGENE.2021.728333\/FULL","volume":"12","author":"H Liu","year":"2021","unstructured":"Liu H, Hou L, Xu S, Li H, Chen X, Gao J, Wang Z, Han B, Liu X, Wan S (2021) Discovering cerebral ischemic stroke associated genes based on network representation learning. Front Genet 12:728333. https:\/\/doi.org\/10.3389\/FGENE.2021.728333\/FULL","journal-title":"Front Genet"},{"issue":"1","key":"458_CR51","doi-asserted-by":"publisher","first-page":"btad047","DOI":"10.1093\/BIOINFORMATICS\/BTAD047","volume":"39","author":"R Liu","year":"2023","unstructured":"Liu R, Hirn M, Krishnan A (2023) Accurately modeling biased random walks on weighted networks using node2vec. Bioinformatics 39(1):btad047. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTAD047","journal-title":"Bioinformatics"},{"issue":"1","key":"458_CR52","doi-asserted-by":"publisher","first-page":"266","DOI":"10.1109\/JBHI.2020.2998906","volume":"25","author":"Y Long","year":"2021","unstructured":"Long Y, Luo J (2021) Association mining to identify microbe drug interactions based on heterogeneous network embedding representation. IEEE J Biomed Health Inform 25(1):266\u2013275. https:\/\/doi.org\/10.1109\/JBHI.2020.2998906","journal-title":"IEEE J Biomed Health Inform"},{"issue":"2","key":"458_CR53","doi-asserted-by":"publisher","first-page":"286","DOI":"10.1093\/BIB\/BBW114","volume":"19","author":"C Manzoni","year":"2018","unstructured":"Manzoni C, Kia DA, Vandrovcova J, Hardy J, Wood NW, Lewis PA, Ferrari R (2018) Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences. Brief Bioinform 19(2):286. https:\/\/doi.org\/10.1093\/BIB\/BBW114","journal-title":"Brief Bioinform"},{"issue":"1","key":"458_CR54","doi-asserted-by":"publisher","first-page":"371","DOI":"10.1186\/S12859-022-04916-3","volume":"23","author":"O Maruyama","year":"2022","unstructured":"Maruyama O, Li Y, Narita H, Toh H, Au Yeung WK, Sasaki H (2022) CMIC: predicting DNA methylation inheritance of CpG islands with embedding vectors of variable-length k-mers. BMC Bioinformatics 23(1):371. https:\/\/doi.org\/10.1186\/S12859-022-04916-3","journal-title":"BMC Bioinformatics"},{"issue":"7","key":"458_CR55","doi-asserted-by":"publisher","first-page":"992","DOI":"10.2174\/1574893616666210621101150","volume":"16","author":"B Matougui","year":"2021","unstructured":"Matougui B, Boukelia A, Belhadef H, Galiez C, Batouche M (2021) NLP-metaxa: a natural language processing approach for metagenomic taxonomic binning based on deep learning. Curr Bioinform 16(7):992\u20131003. https:\/\/doi.org\/10.2174\/1574893616666210621101150","journal-title":"Curr Bioinform"},{"issue":"5","key":"458_CR56","doi-asserted-by":"publisher","first-page":"1216","DOI":"10.1093\/BIOINFORMATICS\/BTAB845","volume":"38","author":"Y Miao","year":"2022","unstructured":"Miao Y, Liu F, Hou T, Liu Y (2022) Virtifier: a deep learning-based identifier for viral sequences from metagenomes. Bioinformatics 38(5):1216\u20131222. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTAB845","journal-title":"Bioinformatics"},{"key":"458_CR57","unstructured":"Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient Estimation of Word Representations in Vector Space. http:\/\/ronan.collobert.com\/senna\/"},{"key":"458_CR58","doi-asserted-by":"publisher","first-page":"71840","DOI":"10.1109\/ACCESS.2023.3294618","volume":"11","author":"A Miltiadous","year":"2023","unstructured":"Miltiadous A, Gionanidis E, Tzimourta KD, Giannakeas N, Tzallas AT (2023) DICE-Net: a novel convolution-transformer architecture for alzheimer detection in eeg signals. IEEE Access 11:71840\u201371858. https:\/\/doi.org\/10.1109\/ACCESS.2023.3294618","journal-title":"IEEE Access"},{"key":"458_CR59","doi-asserted-by":"publisher","unstructured":"Mostavi M, Salekin S, Huang Y (2018) Deep-2\u2019-O-Me: predicting 2\u2032-O-methylation sites by convolutional neural networks. Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS, 2018-July, 2394\u20132397. https:\/\/doi.org\/10.1109\/EMBC.2018.8512780","DOI":"10.1109\/EMBC.2018.8512780"},{"key":"458_CR60","doi-asserted-by":"publisher","first-page":"2913","DOI":"10.1007\/s11277-022-10079-4","volume":"128","author":"J Naskath","year":"2023","unstructured":"Naskath J, Sivakamasundari G, Alif A, Begum S (2023) A study on different deep learning algorithms used in deep neural nets: MLP SOM and DBN. Wireless Pers Commun 128:2913\u20132936. https:\/\/doi.org\/10.1007\/s11277-022-10079-4","journal-title":"Wireless Pers Commun"},{"key":"458_CR61","unstructured":"Ng P (2017) dna2vec: Consistent vector representations of variable-length k-mers. https:\/\/arxiv.org\/abs\/1701.06279v1"},{"key":"458_CR62","doi-asserted-by":"publisher","first-page":"680687","DOI":"10.3389\/FIMMU.2021.680687\/FULL","volume":"12","author":"M Ostrovsky-Berman","year":"2021","unstructured":"Ostrovsky-Berman M, Frankel B, Polak P, Yaari G (2021) Immune2vec: embedding B\/T cell receptor sequences in \u211d N using natural language processing. Front Immunol 12:680687. https:\/\/doi.org\/10.3389\/FIMMU.2021.680687\/FULL","journal-title":"Front Immunol"},{"issue":"1","key":"458_CR63","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/S13643-016-0384-4\/FIGURES\/6","volume":"5","author":"M Ouzzani","year":"2016","unstructured":"Ouzzani M, Hammady H, Fedorowicz Z, Elmagarmid A (2016) Rayyan-a web and mobile app for systematic reviews. Syst Rev 5(1):1\u201310. https:\/\/doi.org\/10.1186\/S13643-016-0384-4\/FIGURES\/6","journal-title":"Syst Rev"},{"issue":"1","key":"458_CR64","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1186\/S12859-021-04055-1","volume":"22","author":"K Ovens","year":"2021","unstructured":"Ovens K, Maleki F, Eames BF, McQuillan I (2021) Juxtapose: a gene-embedding approach for comparing co-expression networks. BMC Bioinformatics 22(1):125. https:\/\/doi.org\/10.1186\/S12859-021-04055-1","journal-title":"BMC Bioinformatics"},{"key":"458_CR65","doi-asserted-by":"publisher","first-page":"n71","DOI":"10.1136\/BMJ.N71","volume":"372","author":"MJ Page","year":"2021","unstructured":"Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, Chou R, Glanville J, Grimshaw JM, Hr\u00f3bjartsson A, Lalu MM, Li T, Loder EW, Mayo-Wilson E, McDonald S, Moher D (2021) The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372:n71. https:\/\/doi.org\/10.1136\/BMJ.N71","journal-title":"BMJ"},{"issue":"2","key":"458_CR66","doi-asserted-by":"publisher","first-page":"666","DOI":"10.1109\/TCBB.2021.3080386","volume":"19","author":"X Pan","year":"2022","unstructured":"Pan X, Chen L, Liu M, Niu Z, Huang T, Cai YD (2022) Identifying protein subcellular locations with embeddings-based node2loc. IEEE\/ACM Trans Comput Biol Bioinf 19(2):666\u2013675. https:\/\/doi.org\/10.1109\/TCBB.2021.3080386","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"21","key":"458_CR67","doi-asserted-by":"publisher","first-page":"5159","DOI":"10.1093\/BIOINFORMATICS\/BTAA656","volume":"36","author":"X Pan","year":"2020","unstructured":"Pan X, Zuallaert J, Wang X, Shen HB, Campos EP, Marushchak DO, De Neve W (2020) ToxDL: deep learning using primary structure and domain embeddings for assessing protein toxicity. Bioinformatics 36(21):5159\u20135168. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTAA656","journal-title":"Bioinformatics"},{"issue":"1","key":"458_CR68","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1080\/00220670209598786","volume":"96","author":"CYJ Peng","year":"2002","unstructured":"Peng CYJ, Lee KL, Ingersoll GM (2002) An introduction to logistic regression analysis and reporting. J Educ Res 96(1):3\u201314. https:\/\/doi.org\/10.1080\/00220670209598786","journal-title":"J Educ Res"},{"issue":"13","key":"458_CR69","doi-asserted-by":"publisher","first-page":"1607","DOI":"10.1093\/BIOINFORMATICS\/BTM158","volume":"23","author":"V Pihur","year":"2007","unstructured":"Pihur V, Datta S, Datta S (2007) Weighted rank aggregation of cluster validation measures: a monte carlo cross-entropy approach. Bioinformatics 23(13):1607\u20131615. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTM158","journal-title":"Bioinformatics"},{"key":"458_CR70","doi-asserted-by":"publisher","first-page":"107035","DOI":"10.1016\/J.CMPB.2022.107035","volume":"225","author":"V Pipoli","year":"2022","unstructured":"Pipoli V, Cappelli M, Palladini A, Peluso C, Lovino M, Ficarra E (2022) Predicting gene expression levels from DNA sequences and post-transcriptional information with transformers: predicting gene expression levels from DNA sequences. Comput Methods Programs Biomed 225:107035. https:\/\/doi.org\/10.1016\/J.CMPB.2022.107035","journal-title":"Comput Methods Programs Biomed"},{"issue":"1","key":"458_CR71","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1109\/TCBB.2021.3084397","volume":"19","author":"Y Qian","year":"2022","unstructured":"Qian Y, Li X, Zhang Q, Zhang J (2022) SPP-CPI: predicting compound-protein interactions based on neural networks. IEEE\/ACM Trans Comput Biol Bioinf 19(1):40\u201347. https:\/\/doi.org\/10.1109\/TCBB.2021.3084397","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"8","key":"458_CR72","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/S12032-022-01711-1","volume":"39","author":"S Quazi","year":"2022","unstructured":"Quazi S (2022) Artificial intelligence and machine learning in precision and genomic medicine. Med Oncol 39(8):1\u201318. https:\/\/doi.org\/10.1007\/S12032-022-01711-1","journal-title":"Med Oncol"},{"issue":"6","key":"458_CR73","doi-asserted-by":"publisher","first-page":"1463","DOI":"10.1109\/TETCI.2022.3182354","volume":"6","author":"S Ray","year":"2022","unstructured":"Ray S, Lall S, Bandyopadhyay S (2022) A deep integrated framework for predicting SARS-CoV2-human protein-protein interaction. IEEE Trans Emerg Topics Comput Intell 6(6):1463\u20131472. https:\/\/doi.org\/10.1109\/TETCI.2022.3182354","journal-title":"IEEE Trans Emerg Topics Comput Intell"},{"key":"458_CR74","doi-asserted-by":"publisher","first-page":"57967","DOI":"10.1109\/ACCESS.2020.2982416","volume":"8","author":"NM Rezk","year":"2020","unstructured":"Rezk NM, Purnaprajna M, Nordstrom T, Ul-Abdin Z (2020) Recurrent neural networks: an embedded computing perspective. IEEE Access 8:57967\u201357996. https:\/\/doi.org\/10.1109\/ACCESS.2020.2982416","journal-title":"IEEE Access"},{"key":"458_CR75","unstructured":"Rish I (n.d.). An empirical study of the naive Bayes classifier."},{"key":"458_CR76","doi-asserted-by":"publisher","DOI":"10.1021\/ci100050t","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M (2010). Extended-Connectivity Fingerprints. https:\/\/doi.org\/10.1021\/ci100050t","journal-title":"Extended-Connectivity Fingerprints"},{"key":"458_CR77","doi-asserted-by":"publisher","unstructured":"Rokach L, Maimon O (2006) Decision Trees. Data Mining and Knowledge Discovery Handbook, 165\u2013192. https:\/\/doi.org\/10.1007\/0-387-25465-X_9","DOI":"10.1007\/0-387-25465-X_9"},{"key":"458_CR78","unstructured":"Sabour S, Frosst N, Hinton GE (n.d.). Dynamic Routing Between Capsules."},{"key":"458_CR79","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1501\/0003168","volume":"1","author":"MH Sazli","year":"2006","unstructured":"Sazli MH (2006) A brief review of feed-forward neural networks. Commun Fac Sci Univ Ank Series 1:11\u201317","journal-title":"Commun Fac Sci Univ Ank Series"},{"issue":"4","key":"458_CR80","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1109\/5254.708428","volume":"13","author":"B Sch\u00f6lkopf","year":"1998","unstructured":"Sch\u00f6lkopf B (1998) SVMs - a practical consequence of learning theory. IEEE Intel Syst Their Appl 13(4):18\u201321. https:\/\/doi.org\/10.1109\/5254.708428","journal-title":"IEEE Intel Syst Their Appl"},{"issue":"1","key":"458_CR81","doi-asserted-by":"publisher","first-page":"bbab422","DOI":"10.1093\/BIB\/BBAB422","volume":"23","author":"R Sharma","year":"2022","unstructured":"Sharma R, Shrivastava S, Singh SK, Kumar A, Saxena S, Singh RK (2022) Deep-AFPpred: Identifying novel antifungal peptides using pretrained embeddings from seq2vec with 1DCNN-BiLSTM. Brief Bioinform 23(1):bbab422. https:\/\/doi.org\/10.1093\/BIB\/BBAB422","journal-title":"Brief Bioinform"},{"issue":"1","key":"458_CR82","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1007\/S11277-021-09152-1\/TABLES\/8","volume":"123","author":"S Sharma","year":"2022","unstructured":"Sharma S, Singh S (2022) Recognition of indian sign language (ISL) using deep learning model. Wireless Pers Commun 123(1):671\u2013692. https:\/\/doi.org\/10.1007\/S11277-021-09152-1\/TABLES\/8","journal-title":"Wireless Pers Commun"},{"key":"458_CR83","doi-asserted-by":"publisher","first-page":"103246","DOI":"10.1016\/J.JBI.2019.103246","volume":"96","author":"F Shen","year":"2019","unstructured":"Shen F, Peng S, Fan Y, Wen A, Liu S, Wang Y, Wang L, Liu H (2019) HPO2Vec+: leveraging heterogeneous knowledge resources to enrich node embeddings for the human phenotype ontology. J Biomed Inform 96:103246. https:\/\/doi.org\/10.1016\/J.JBI.2019.103246","journal-title":"J Biomed Inform"},{"issue":"1","key":"458_CR84","doi-asserted-by":"publisher","first-page":"15270","DOI":"10.1038\/S41598-018-33321-1","volume":"8","author":"Z Shen","year":"2018","unstructured":"Shen Z, Bao W, Huang DS (2018) Recurrent neural network for predicting transcription factor binding sites. Sci Rep 8(1):15270. https:\/\/doi.org\/10.1038\/S41598-018-33321-1","journal-title":"Sci Rep"},{"key":"458_CR85","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1128\/microbe.4.485.2","volume":"4","author":"R Sitaraman","year":"2009","unstructured":"Sitaraman R (2009) The first paper in bioinformatics? Microbe (washington, D.c.) 4:485\u2013486. https:\/\/doi.org\/10.1128\/microbe.4.485.2","journal-title":"Microbe (washington, D.c.)"},{"issue":"13","key":"458_CR86","doi-asserted-by":"publisher","first-page":"i52","DOI":"10.1093\/BIOINFORMATICS\/BTY259","volume":"34","author":"FZ Smaili","year":"2018","unstructured":"Smaili FZ, Gao X, Hoehndorf R (2018) Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations. Bioinformatics 34(13):i52\u2013i60. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTY259","journal-title":"Bioinformatics"},{"key":"458_CR87","doi-asserted-by":"publisher","first-page":"4490154","DOI":"10.1155\/2022\/4490154","volume":"2022","author":"Y Sun","year":"2022","unstructured":"Sun Y, Xiong F, Sun Y, Zhao Y, Cao Y (2022) A miRNA target prediction model based on distributed representation learning and deep learning. Comput Math Methods Med 2022:4490154. https:\/\/doi.org\/10.1155\/2022\/4490154","journal-title":"Comput Math Methods Med"},{"key":"458_CR88","doi-asserted-by":"publisher","first-page":"D638","DOI":"10.1093\/nar\/gkac1000","volume":"51","author":"D Szklarczyk","year":"2023","unstructured":"Szklarczyk D, Kirsch R, Koutrouli M, Nastou K, Mehryary F, Hachilif R, Gable AL, Fang T, Doncheva NT, Pyysalo S, Jensen LJ, Von Mering C (2023) The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Database Issue Published Online 51:D638\u2013D646. https:\/\/doi.org\/10.1093\/nar\/gkac1000","journal-title":"Database Issue Published Online"},{"key":"458_CR89","doi-asserted-by":"publisher","unstructured":"Thafar MA, Albaradie S, Olayan RS, AshoorH, Essack M, Bajic VB (2020) Computational drug-target interaction prediction based on graph embedding and graph mining. ACM International Conference Proceeding Series, 14\u201321. https:\/\/doi.org\/10.1145\/3386052.3386062","DOI":"10.1145\/3386052.3386062"},{"issue":"8","key":"458_CR90","doi-asserted-by":"publisher","first-page":"e43","DOI":"10.1093\/NAR\/GKZ087","volume":"47","author":"X Tong","year":"2019","unstructured":"Tong X, Liu S (2019) CPPred: coding potential prediction based on the global description of RNA sequence. Nucleic Acids Res 47(8):e43. https:\/\/doi.org\/10.1093\/NAR\/GKZ087","journal-title":"Nucleic Acids Res"},{"key":"458_CR91","doi-asserted-by":"publisher","DOI":"10.1007\/S10489-022-04387-2","author":"HN Tran","year":"2023","unstructured":"Tran HN, Xuan QNP, Nguyen TT (2023) DeepCF-PPI: improved prediction of protein-protein interactions by combining learned and handcrafted features based on attention mechanisms. Appl Intell. https:\/\/doi.org\/10.1007\/S10489-022-04387-2","journal-title":"Appl Intell"},{"key":"458_CR92","doi-asserted-by":"publisher","first-page":"5564","DOI":"10.1016\/J.CSBJ.2022.10.012","volume":"20","author":"S Tsukiyama","year":"2022","unstructured":"Tsukiyama S, Kurata H (2022) Cross-attention PHV: prediction of human and virus protein-protein interactions using cross-attention\u2013based neural networks. Comput Struct Biotechnol J 20:5564\u20135573. https:\/\/doi.org\/10.1016\/J.CSBJ.2022.10.012","journal-title":"Comput Struct Biotechnol J"},{"key":"458_CR93","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser \u0141, Polosukhin I (2017) Attention Is All You Need. Advances in Neural Information Processing Systems, 2017-December, 5999\u20136009. https:\/\/arxiv.org\/abs\/1706.03762v5"},{"issue":"1","key":"458_CR94","doi-asserted-by":"publisher","first-page":"212","DOI":"10.1038\/S41598-020-80430-X","volume":"11","author":"A Wahab","year":"2021","unstructured":"Wahab A, Tayara H, Xuan Z, Chong KT (2021) DNA sequences performs as natural language processing by exploiting deep learning algorithm for the identification of N4-methylcytosine. Sci Rep 11(1):212. https:\/\/doi.org\/10.1038\/S41598-020-80430-X","journal-title":"Sci Rep"},{"key":"458_CR95","doi-asserted-by":"publisher","first-page":"2468789","DOI":"10.1155\/2020\/2468789","volume":"2020","author":"C Wang","year":"2020","unstructured":"Wang C, Zhang Y, Han S (2020) Its2vec: fungal species identification using sequence embedding and random forest classification. Biomed Res Int 2020:2468789. https:\/\/doi.org\/10.1155\/2020\/2468789","journal-title":"Biomed Res Int"},{"key":"458_CR96","doi-asserted-by":"publisher","first-page":"700036","DOI":"10.3389\/FGENE.2021.700036\/FULL","volume":"12","author":"L Wang","year":"2021","unstructured":"Wang L, Liu F, Du L, Qin G (2021) Single-cell transcriptome analysis in melanoma using network embedding. Front Genet 12:700036. https:\/\/doi.org\/10.3389\/FGENE.2021.700036\/FULL","journal-title":"Front Genet"},{"key":"458_CR97","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1007\/978-3-030-57821-3_9\/FIGURES\/3","volume":"12304 LNBI","author":"N Wang","year":"2020","unstructured":"Wang N, Zeng M, Zhang J, Li Y, Li M (2020) Ess-NEXG: predict essential proteins by constructing a weighted protein interaction network based on node embedding and XGBoost. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 12304 LNBI:95\u2013104. https:\/\/doi.org\/10.1007\/978-3-030-57821-3_9\/FIGURES\/3","journal-title":"Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)"},{"issue":"2","key":"458_CR98","doi-asserted-by":"publisher","first-page":"122","DOI":"10.3390\/CELLS8020122","volume":"8","author":"Y Wang","year":"2019","unstructured":"Wang Y, You ZH, Yang S, Li X, Jiang TH, Zhou X (2019) A high efficient biological language model for predicting protein-protein interactions. Cells 8(2):122. https:\/\/doi.org\/10.3390\/CELLS8020122","journal-title":"Cells"},{"issue":"6","key":"458_CR99","doi-asserted-by":"publisher","first-page":"bbab342","DOI":"10.1093\/BIB\/BBAB342","volume":"22","author":"Z Wang","year":"2021","unstructured":"Wang Z, Lei X (2021) Prediction of RBP binding sites on circRNAs using an LSTM-based deep sequence learning architecture. Brief Bioinform 22(6):bbab342. https:\/\/doi.org\/10.1093\/BIB\/BBAB342","journal-title":"Brief Bioinform"},{"issue":"2","key":"458_CR100","doi-asserted-by":"publisher","first-page":"e1006721","DOI":"10.1371\/JOURNAL.PCBI.1006721","volume":"15","author":"S Woloszynek","year":"2019","unstructured":"Woloszynek S, Zhao Z, Chen J, Rosen GL (2019) 16S rRNA sequence embeddings: meaningful numeric feature representations of nucleotide sequences that are convenient for downstream analyses. PLoS Comput Biol 15(2):e1006721. https:\/\/doi.org\/10.1371\/JOURNAL.PCBI.1006721","journal-title":"PLoS Comput Biol"},{"issue":"1","key":"458_CR101","doi-asserted-by":"publisher","first-page":"456","DOI":"10.1186\/S12859-019-3006-Z","volume":"20","author":"C Wu","year":"2019","unstructured":"Wu C, Gao R, Zhang Y, De Marinis Y (2019) PTPD: predicting therapeutic peptides by deep learning and word2vec. BMC Bioinformatics 20(1):456. https:\/\/doi.org\/10.1186\/S12859-019-3006-Z","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"458_CR102","doi-asserted-by":"publisher","first-page":"844","DOI":"10.1038\/S41598-020-80670-X","volume":"11","author":"F Wu","year":"2021","unstructured":"Wu F, Yang R, Zhang C, Zhang L (2021) A deep learning framework combined with word embedding to identify DNA replication origins. Sci Rep 11(1):844. https:\/\/doi.org\/10.1038\/S41598-020-80670-X","journal-title":"Sci Rep"},{"key":"458_CR103","doi-asserted-by":"publisher","first-page":"560","DOI":"10.1186\/S12859-022-04771-2","volume":"23","author":"X Wu","year":"2022","unstructured":"Wu X, Zeng W, Lin F (2022) GCNCPR-ACPs: a novel graph convolution network method for ACPs prediction. BMC Bioinformatics 23:560. https:\/\/doi.org\/10.1186\/S12859-022-04771-2","journal-title":"BMC Bioinformatics"},{"issue":"6","key":"458_CR104","doi-asserted-by":"publisher","first-page":"3395","DOI":"10.1109\/TCBB.2021.3113326","volume":"19","author":"F Xie","year":"2022","unstructured":"Xie F, Yang Z, Song J, Dai Q, Duan X (2022) DHNLDA: a novel deep hierarchical network based method for predicting lncRNA-disease associations. IEEE\/ACM Trans Comput Biol Bioinf 19(6):3395\u20133403. https:\/\/doi.org\/10.1109\/TCBB.2021.3113326","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"8","key":"458_CR105","doi-asserted-by":"publisher","first-page":"4335","DOI":"10.1109\/JBHI.2022.3169542","volume":"26","author":"W Xie","year":"2022","unstructured":"Xie W, Zheng Z, Zhang W, Huang L, Lin Q, Wong KC (2022) SRG-vote: predicting mirna-gene relationships via embedding and LSTM ensemble. IEEE J Biomed Health Inform 26(8):4335\u20134344. https:\/\/doi.org\/10.1109\/JBHI.2022.3169542","journal-title":"IEEE J Biomed Health Inform"},{"issue":"19","key":"458_CR106","doi-asserted-by":"publisher","first-page":"4372","DOI":"10.3390\/MOLECULES25194372","volume":"25","author":"S Yang","year":"2020","unstructured":"Yang S, Wang Y, Lin Y, Shao D, He K, Huang L (2020) LncMirNet: predicting LncRNA\u2013miRNA interaction based on deep learning of ribonucleic acid sequences. Molecules 25(19):4372. https:\/\/doi.org\/10.3390\/MOLECULES25194372","journal-title":"Molecules"},{"key":"458_CR107","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1016\/J.CSBJ.2019.12.005","volume":"18","author":"X Yang","year":"2020","unstructured":"Yang X, Yang S, Li Q, Wuchty S, Zhang Z (2020) Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method. Comput Struct Biotechnol J 18:153\u2013161. https:\/\/doi.org\/10.1016\/J.CSBJ.2019.12.005","journal-title":"Comput Struct Biotechnol J"},{"issue":"6","key":"458_CR108","doi-asserted-by":"publisher","first-page":"e7126","DOI":"10.7717\/PEERJ.7126","volume":"7","author":"Y Yao","year":"2019","unstructured":"Yao Y, Du X, Diao Y, Zhu H (2019) An integration of deep learning with feature embedding for protein\u2013protein interaction prediction. PeerJ 7(6):e7126. https:\/\/doi.org\/10.7717\/PEERJ.7126","journal-title":"PeerJ"},{"key":"458_CR109","doi-asserted-by":"publisher","unstructured":"Yuan Y, Xun G, Suo Q, Jia K, Zhang A (2017) Wave2Vec: learning deep representations for biosignals. Proceedings - IEEE International Conference on Data Mining, ICDM, 2017-November, 1159\u20131164. https:\/\/doi.org\/10.1109\/ICDM.2017.155","DOI":"10.1109\/ICDM.2017.155"},{"key":"458_CR110","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1016\/J.NEUCOM.2018.03.074","volume":"324","author":"Y Yuan","year":"2019","unstructured":"Yuan Y, Xun G, Suo Q, Jia K, Zhang A (2019) Wave2Vec: deep representation learning for clinical temporal data. Neurocomputing 324:31\u201342. https:\/\/doi.org\/10.1016\/J.NEUCOM.2018.03.074","journal-title":"Neurocomputing"},{"issue":"1","key":"458_CR111","doi-asserted-by":"publisher","first-page":"296","DOI":"10.1109\/TCBB.2019.2897679","volume":"18","author":"M Zeng","year":"2021","unstructured":"Zeng M, Li M, Fei Z, Wu FX, Li Y, Pan Y, Wang J (2021) A deep learning framework for identifying essential proteins by integrating multiple types of biological information. IEEE\/ACM Trans Comput Biol Bioinf 18(1):296\u2013305. https:\/\/doi.org\/10.1109\/TCBB.2019.2897679","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"key":"458_CR112","doi-asserted-by":"publisher","first-page":"506","DOI":"10.1186\/S12859-019-3076-Y","volume":"20","author":"M Zeng","year":"2019","unstructured":"Zeng M, Li M, Wu FX, Li Y, Pan Y (2019) DeepEP: a deep learning framework for identifying essential proteins. BMC Bioinformatics 20:506. https:\/\/doi.org\/10.1186\/S12859-019-3076-Y","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"458_CR113","doi-asserted-by":"publisher","first-page":"bbab360","DOI":"10.1093\/BIB\/BBAB360","volume":"23","author":"M Zeng","year":"2022","unstructured":"Zeng M, Wu Y, Lu C, Zhang F, Wu FX, Li M (2022) DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding. Brief Bioinform 23(1):bbab360. https:\/\/doi.org\/10.1093\/BIB\/BBAB360","journal-title":"Brief Bioinform"},{"key":"458_CR114","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1186\/S12864-018-4459-6","volume":"19","author":"W Zeng","year":"2018","unstructured":"Zeng W, Wu M, Jiang R (2018) Prediction of enhancer-promoter interactions via natural language processing. BMC Genomics 19:84. https:\/\/doi.org\/10.1186\/S12864-018-4459-6","journal-title":"BMC Genomics"},{"issue":"6","key":"458_CR115","doi-asserted-by":"publisher","first-page":"2208","DOI":"10.1109\/TCBB.2020.2968882","volume":"18","author":"F Zhang","year":"2021","unstructured":"Zhang F, Song H, Zeng M, Wu FX, Li Y, Pan Y, Li M (2021) A deep learning framework for gene ontology annotations with sequence- and network-based information. IEEE\/ACM Trans Comput Biol Bioinf 18(6):2208\u20132217. https:\/\/doi.org\/10.1109\/TCBB.2020.2968882","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"3","key":"458_CR116","doi-asserted-by":"publisher","first-page":"1257","DOI":"10.1109\/TCBB.2020.3003941","volume":"19","author":"J Zhang","year":"2022","unstructured":"Zhang J, Zhu M, Qian Y (2022) Protein2vec: predicting protein-protein interactions based on LSTM. IEEE\/ACM Trans Comput Biol Bioinf 19(3):1257\u20131266. https:\/\/doi.org\/10.1109\/TCBB.2020.3003941","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"issue":"1","key":"458_CR117","doi-asserted-by":"publisher","first-page":"288","DOI":"10.1186\/S12859-021-04206-4","volume":"22","author":"L Zhang","year":"2021","unstructured":"Zhang L, Li G, Li X, Wang H, Chen S, Liu H (2021) EDLm6APred: ensemble deep learning approach for mRNA m6A site prediction. BMC Bioinformatics 22(1):288. https:\/\/doi.org\/10.1186\/S12859-021-04206-4","journal-title":"BMC Bioinformatics"},{"key":"458_CR118","doi-asserted-by":"publisher","first-page":"895","DOI":"10.3389\/FCHEM.2019.00895\/FULL","volume":"7","author":"YF Zhang","year":"2020","unstructured":"Zhang YF, Wang X, Kaushik AC, Chu Y, Shan X, Zhao MZ, Xu Q, Wei DQ (2020) SPVec: a word2vec-inspired feature representation method for drug-target interaction prediction. Front Chem 7:895. https:\/\/doi.org\/10.3389\/FCHEM.2019.00895\/FULL","journal-title":"Front Chem"},{"key":"458_CR119","doi-asserted-by":"publisher","unstructured":"Zhao L, Wang J, Cheng L, Wang C (2020) Ontosem: an ontology semantic representation methodology for biomedical domain. Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020, 523\u2013527. https:\/\/doi.org\/10.1109\/BIBM49941.2020.9313128","DOI":"10.1109\/BIBM49941.2020.9313128"},{"issue":"5","key":"458_CR120","doi-asserted-by":"publisher","first-page":"bbac124","DOI":"10.1093\/BIB\/BBAC318","volume":"23","author":"L Zhao","year":"2022","unstructured":"Zhao L, Sun H, Cao X, Wen N, Wang J, Wang C (2022) Learning representations for gene ontology terms by jointly encoding graph structure and textual node descriptors. Brief Bioinform 23(5):bbac124. https:\/\/doi.org\/10.1093\/BIB\/BBAC318","journal-title":"Brief Bioinform"},{"key":"458_CR121","doi-asserted-by":"publisher","unstructured":"Zhou S, Yue X, Xu X, Liu S, Zhang W, Niu Y (2019) LncRNA-miRNA interaction prediction from the heterogeneous network through graph embedding ensemble learning. Proceedings - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019, 622\u2013627. https:\/\/doi.org\/10.1109\/BIBM47256.2019.8983044","DOI":"10.1109\/BIBM47256.2019.8983044"},{"key":"458_CR122","doi-asserted-by":"publisher","first-page":"248","DOI":"10.3389\/FGENE.2018.00248\/FULL","volume":"9","author":"S Zhu","year":"2018","unstructured":"Zhu S, Bing J, Min X, Lin C, Zeng X (2018) Prediction of drug-gene interaction by using metapath2vec. Front Genet 9:248. https:\/\/doi.org\/10.3389\/FGENE.2018.00248\/FULL","journal-title":"Front Genet"},{"issue":"2","key":"458_CR123","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1261\/RNA.069112.118","volume":"25","author":"Q Zou","year":"2019","unstructured":"Zou Q, Xing P, Wei L, Liu B (2019) Gene2vec: gene subsequence embedding for prediction of mammalian N 6 -methyladenosine sites from mRNA. RNA 25(2):205\u2013218. https:\/\/doi.org\/10.1261\/RNA.069112.118","journal-title":"RNA"},{"key":"458_CR124","doi-asserted-by":"publisher","first-page":"558","DOI":"10.1016\/J.YMETH.2021.07.011","volume":"203","author":"H Zulfiqar","year":"2022","unstructured":"Zulfiqar H, Sun ZJ, Huang QL, Yuan SS, Lv H, Dao FY, Lin H, Li YW (2022) Deep-4mCW2V: a sequence-based predictor to identify N4-methylcytosine sites in escherichia coli. Methods 203:558\u2013563. https:\/\/doi.org\/10.1016\/J.YMETH.2021.07.011","journal-title":"Methods"}],"container-title":["Network Modeling Analysis in Health Informatics and Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13721-024-00458-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s13721-024-00458-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13721-024-00458-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,11]],"date-time":"2024-12-11T20:31:32Z","timestamp":1733949092000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s13721-024-00458-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,6]]},"references-count":125,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["458"],"URL":"https:\/\/doi.org\/10.1007\/s13721-024-00458-1","relation":{},"ISSN":["2192-6670"],"issn-type":[{"value":"2192-6670","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,6]]},"assertion":[{"value":"24 December 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 April 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 April 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 May 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have declared no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interests"}}],"article-number":"23"}}