{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T21:35:48Z","timestamp":1772055348729,"version":"3.50.1"},"reference-count":84,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2020,6,29]],"date-time":"2020-06-29T00:00:00Z","timestamp":1593388800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Collaborative Research Program of Institute for Chemical Research","award":["2018-28"],"award-info":[{"award-number":["2018-28"]}]},{"name":"Collaborative Research Program of Institute for Chemical Research","award":["2019-32"],"award-info":[{"award-number":["2019-32"]}]},{"DOI":"10.13039\/100000060","name":"National Institute of Allergy and Infectious Diseases","doi-asserted-by":"publisher","award":["R01 AI111965"],"award-info":[{"award-number":["R01 AI111965"]}],"id":[{"id":"10.13039\/100000060","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["DP120104460"],"award-info":[{"award-number":["DP120104460"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["LP110200333"],"award-info":[{"award-number":["LP110200333"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000925","name":"National Health and Medical Research Council","doi-asserted-by":"publisher","award":["1127948"],"award-info":[{"award-number":["1127948"]}],"id":[{"id":"10.13039\/501100000925","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000925","name":"National Health and Medical Research Council","doi-asserted-by":"publisher","award":["1144652"],"award-info":[{"award-number":["1144652"]}],"id":[{"id":"10.13039\/501100000925","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000925","name":"National Health and Medical Research Council","doi-asserted-by":"publisher","award":["1092262"],"award-info":[{"award-number":["1092262"]}],"id":[{"id":"10.13039\/501100000925","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Natural Science Foundation of Guangxi","award":["2016GXNSFCA380005"],"award-info":[{"award-number":["2016GXNSFCA380005"]}]},{"name":"Natural Science Foundation of Guangxi","award":["2018GXNSFAA138117"],"award-info":[{"award-number":["2018GXNSFAA138117"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61862017"],"award-info":[{"award-number":["61862017"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,5,20]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Virulence factors (VFs) enable pathogens to infect their hosts. A wealth of individual, disease-focused studies has identified a wide variety of VFs, and the growing mass of bacterial genome sequence data provides an opportunity for computational methods aimed at predicting VFs. Despite their attractive advantages and performance improvements, the existing methods have some limitations and drawbacks. Firstly, as the characteristics and mechanisms of VFs are continually evolving with the emergence of antibiotic resistance, it is more and more difficult to identify novel VFs using existing tools that were previously developed based on the outdated data sets; secondly, few systematic feature engineering efforts have been made to examine the utility of different types of features for model performances, as the majority of tools only focused on extracting very few types of features. By addressing the aforementioned issues, the accuracy of VF predictors can likely be significantly improved. This, in turn, would be particularly useful in the context of genome wide predictions of VFs. In this work, we present a deep learning (DL)-based hybrid framework (termed DeepVF) that is utilizing the stacking strategy to achieve more accurate identification of VFs. Using an enlarged, up-to-date dataset, DeepVF comprehensively explores a wide range of heterogeneous features with popular machine learning algorithms. Specifically, four classical algorithms, including random forest, support vector machines, extreme gradient boosting and multilayer perceptron, and three DL algorithms, including convolutional neural networks, long short-term memory networks and deep neural networks are employed to train 62 baseline models using these features. In order to integrate their individual strengths, DeepVF effectively combines these baseline models to construct the final meta model using the stacking strategy. Extensive benchmarking experiments demonstrate the effectiveness of DeepVF: it achieves a more accurate and stable performance compared with baseline models on the benchmark dataset and clearly outperforms state-of-the-art VF predictors on the independent test. Using the proposed hybrid ensemble model, a user-friendly online predictor of DeepVF (http:\/\/deepvf.erc.monash.edu\/) is implemented. Furthermore, its utility, from the user\u2019s viewpoint, is compared with that of existing toolkits. We believe that DeepVF will be exploited as a useful tool for screening and identifying potential VFs from protein-coding gene sequences in bacterial genomes.<\/jats:p>","DOI":"10.1093\/bib\/bbaa125","type":"journal-article","created":{"date-parts":[[2020,5,25]],"date-time":"2020-05-25T11:06:44Z","timestamp":1590404804000},"source":"Crossref","is-referenced-by-count":72,"title":["DeepVF: a deep learning-based hybrid framework for identifying virulence factors using the stacking strategy"],"prefix":"10.1093","volume":"22","author":[{"given":"Ruopeng","family":"Xie","sequence":"first","affiliation":[{"name":"Bioinformatics Lab at Guilin University of Electronic Technology"}]},{"given":"Jiahui","family":"Li","sequence":"additional","affiliation":[{"name":"Bioinformatics Lab at Guilin University of Electronic Technology"}]},{"given":"Jiawei","family":"Wang","sequence":"additional","affiliation":[{"name":"Biomedicine Discovery Institute and the Department of Microbiology at Monash University, Australia"}]},{"given":"Wei","family":"Dai","sequence":"additional","affiliation":[{"name":"School of Computer Science and Information Security, Guilin University of Electronic Technology, China"}]},{"given":"Andr\u00e9","family":"Leier","sequence":"additional","affiliation":[{"name":"Department of Genetics and the Department of Cell, Developmental and Integrative Biology, University of Alabama at Birmingham (UAB) School of Medicine, USA"}]},{"given":"Tatiana T","family":"Marquez-Lago","sequence":"additional","affiliation":[{"name":"Department of Genetics and the Department of Cell, Developmental and Integrative Biology, University of Alabama at Birmingham (UAB) School of Medicine, USA"}]},{"given":"Tatsuya","family":"Akutsu","sequence":"additional","affiliation":[{"name":"University of Tokyo, Japan"}]},{"given":"Trevor","family":"Lithgow","sequence":"additional","affiliation":[{"name":"Biomedicine Discovery Institute and the Director of the Centre to Impact AMR at Monash University, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8031-9086","authenticated-orcid":false,"given":"Jiangning","family":"Song","sequence":"additional","affiliation":[{"name":"Group Leader in the Biomedicine Discovery Institute and the Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8629-258X","authenticated-orcid":false,"given":"Yanju","family":"Zhang","sequence":"additional","affiliation":[{"name":"Leiden Institute of Advanced Computer Science, Leiden University"}]}],"member":"286","published-online":{"date-parts":[[2020,6,29]]},"reference":[{"key":"2021052110084388600_ref1","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1016\/j.ijmm.2005.12.015","article-title":"Infectious diseases - a global challenge","volume":"296","author":"Becker","year":"2006","journal-title":"Int J Med Microbiol"},{"key":"2021052110084388600_ref2","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.prevetmed.2012.11.021","article-title":"Diseases at the livestock-wildlife interface: status, challenges, and opportunities in the United States","volume":"110","author":"Miller","year":"2013","journal-title":"Prev Vet Med"},{"key":"2021052110084388600_ref3","doi-asserted-by":"crossref","first-page":"D693","DOI":"10.1093\/nar\/gky999","article-title":"Victors: a web-based knowledge base of virulence factors in human and animal pathogens","volume":"47","author":"Sayers","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2021052110084388600_ref4","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1017\/ice.2018.304","article-title":"Re-estimating annual deaths due to multidrug-resistant organism infections","volume":"40","author":"Burnham","year":"2019","journal-title":"Infect Control Hosp Epidemiol"},{"key":"2021052110084388600_ref5","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1086\/322044","article-title":"Host-pathogen interactions: the attributes of virulence","volume":"184","author":"Casadevall","year":"2001","journal-title":"J Infect Dis"},{"key":"2021052110084388600_ref6","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1186\/cc7127","article-title":"What is a virulence factor?","volume":"12","author":"Cross","year":"2008","journal-title":"Crit Care"},{"key":"2021052110084388600_ref7","doi-asserted-by":"crossref","first-page":"2627","DOI":"10.1128\/AEM.66.6.2627-2630.2000","article-title":"Bacillus anthracis, Bacillus cereus, and bacillus thuringiensis--one species on the basis of genetic evidence","volume":"66","author":"Helgason","year":"2000","journal-title":"Appl Environ Microbiol"},{"key":"2021052110084388600_ref8","doi-asserted-by":"crossref","first-page":"560","DOI":"10.1128\/MMBR.68.3.560-602.2004","article-title":"Phages and the evolution of bacterial pathogens: from genomic rearrangements to lysogenic conversion","volume":"68","author":"Brussow","year":"2004","journal-title":"Microbiol Mol Biol Rev"},{"key":"2021052110084388600_ref9","doi-asserted-by":"crossref","first-page":"20142","DOI":"10.1073\/pnas.1107176108","article-title":"Genomic anatomy of Escherichia coli O157:H7 outbreaks","volume":"108","author":"Eppinger","year":"2011","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2021052110084388600_ref10","doi-asserted-by":"publisher","DOI":"10.1016\/j.vaccine.2019.06.034","article-title":"CTX phage of Vibrio cholerae: genomics and applications","author":"Pant","year":"2019","journal-title":"Vaccine"},{"key":"2021052110084388600_ref11","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1186\/1471-2105-9-62","article-title":"VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens","volume":"9","author":"Garg","year":"2008","journal-title":"BMC Bioinformat"},{"key":"2021052110084388600_ref12","doi-asserted-by":"crossref","first-page":"314","DOI":"10.1016\/S0966-842X(02)02391-0","article-title":"Virulence and pathogenesis","volume":"10","author":"Weiss","year":"2002","journal-title":"Trends Microbiol"},{"key":"2021052110084388600_ref13","doi-asserted-by":"crossref","first-page":"161","DOI":"10.3389\/fcimb.2012.00161","article-title":"Paradigms of pathogenesis: targeting the mobile genetic elements of disease","volume":"2","author":"Keen","year":"2012","journal-title":"Front Cell Infect Microbiol"},{"key":"2021052110084388600_ref14","doi-asserted-by":"crossref","first-page":"7458","DOI":"10.1016\/j.eswa.2008.09.036","article-title":"An ensemble of support vector machines for predicting virulent proteins","volume":"36","author":"Nanni","year":"2009","journal-title":"Expert Syst Appl"},{"key":"2021052110084388600_ref15","doi-asserted-by":"crossref","first-page":"467","DOI":"10.1109\/TCBB.2011.117","article-title":"Identifying bacterial virulent proteins by fusing a set of classifiers based on variants of Chou's pseudo amino acid composition and on evolutionary information","volume":"9","author":"Nanni","year":"2012","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2021052110084388600_ref16","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1093\/bioinformatics\/bti028","article-title":"SPAAN: a software program for prediction of adhesins and adhesin-like proteins using neural networks","volume":"21","author":"Sachdeva","year":"2005","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref17","first-page":"3","article-title":"Virulent-GO: prediction of virulent proteins in bacterial pathogens utilizing gene ontology terms","volume":"1","author":"Tsai","year":"2009","journal-title":"Development"},{"key":"2021052110084388600_ref18","doi-asserted-by":"crossref","first-page":"e42517","DOI":"10.1371\/journal.pone.0042517","article-title":"A comparison of computational methods for identifying virulence factors","volume":"7","author":"Zheng","year":"2012","journal-title":"PLoS One"},{"key":"2021052110084388600_ref19","doi-asserted-by":"crossref","first-page":"e93907","DOI":"10.1371\/journal.pone.0093907","article-title":"MP3: a software tool for the prediction of pathogenic proteins in genomic and metagenomic data","volume":"9","author":"Gupta","year":"2014","journal-title":"PLoS One"},{"key":"2021052110084388600_ref20","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbz076","article-title":"Predicting bacterial virulence factors \u2013 evaluation of machine learning and negative data strategies","author":"Rentzsch","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021052110084388600_ref21","doi-asserted-by":"crossref","first-page":"1447","DOI":"10.1039\/c3mb70024k","article-title":"Computationally identifying virulence factors based on KEGG pathways","volume":"9","author":"Cui","year":"2013","journal-title":"Mol Biosyst"},{"key":"2021052110084388600_ref22","doi-asserted-by":"crossref","first-page":"D687","DOI":"10.1093\/nar\/gky1080","article-title":"VFDB 2019: a comparative pathogenomic platform with an interactive web interface","volume":"47","author":"Liu","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2021052110084388600_ref23","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1093\/bioinformatics\/btu631","article-title":"Curation, integration and visualization of bacterial virulence factors in PATRIC","volume":"31","author":"Mao","year":"2015","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref24","doi-asserted-by":"crossref","first-page":"D535","DOI":"10.1093\/nar\/gkw1017","article-title":"Improvements to PATRIC, the all-bacterial bioinformatics database and analysis resource center","volume":"45","author":"Wattam","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2021052110084388600_ref25","volume-title":"PATRIC v2 FTP Download Site"},{"key":"2021052110084388600_ref26","doi-asserted-by":"crossref","first-page":"2185","DOI":"10.1093\/bib\/bby079","article-title":"Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework","volume":"20","author":"Zhang","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021052110084388600_ref27","doi-asserted-by":"crossref","first-page":"2749","DOI":"10.1093\/bioinformatics\/bty1043","article-title":"PredGly: predicting lysine glycation sites for Homo sapiens based on XGboost feature optimization","volume":"35","author":"Yu","year":"2019","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref28","doi-asserted-by":"crossref","first-page":"2546","DOI":"10.1093\/bioinformatics\/bty155","article-title":"Bastion6: a bioinformatics approach for accurate prediction of type VI secreted effectors","volume":"34","author":"Wang","year":"2018","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref29","doi-asserted-by":"crossref","first-page":"931","DOI":"10.1093\/bib\/bbx164","article-title":"Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches","volume":"20","author":"Wang","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021052110084388600_ref30","doi-asserted-by":"crossref","first-page":"680","DOI":"10.1093\/bioinformatics\/btq003","article-title":"CD-HIT suite: a web server for clustering and comparing biological sequences","volume":"26","author":"Huang","year":"2010","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref31","doi-asserted-by":"crossref","first-page":"2017","DOI":"10.1093\/bioinformatics\/bty914","article-title":"Bastion3: a two-layer ensemble predictor of type III secreted effectors","volume":"35","author":"Wang","year":"2019","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref32","doi-asserted-by":"crossref","first-page":"704","DOI":"10.1093\/bioinformatics\/btz629","article-title":"PeNGaRoo, a combined gradient boosting and ensemble learning framework for predicting non-classical secreted proteins","volume":"36","author":"Zhang","year":"2020","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref33","doi-asserted-by":"crossref","first-page":"648","DOI":"10.1089\/omi.2015.0095","article-title":"Harnessing computational biology for exact linear B-cell epitope prediction: a novel amino acid composition-based feature descriptor","volume":"19","author":"Saravanan","year":"2015","journal-title":"OMICS"},{"key":"2021052110084388600_ref34","first-page":"270","article-title":"Prediction and identification of the effectors of heterotrimeric G proteins in rice (Oryza sativa L.)","volume":"18","author":"Li","year":"2017","journal-title":"Brief Bioinform"},{"key":"2021052110084388600_ref35","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1016\/j.ab.2007.10.012","article-title":"PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition","volume":"373","author":"Shen","year":"2008","journal-title":"Anal Biochem"},{"key":"2021052110084388600_ref36","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1002\/prot.1035","article-title":"Prediction of protein cellular attributes using pseudo-amino acid composition","volume":"43","author":"Chou","year":"2001","journal-title":"Proteins"},{"key":"2021052110084388600_ref37","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1006\/bbrc.2000.3815","article-title":"Prediction of protein subcellular locations by incorporating quasi-sequence-order effect","volume":"278","author":"Chou","year":"2000","journal-title":"Biochem Biophys Res Commun"},{"key":"2021052110084388600_ref38","first-page":"148","article-title":"Comprehensive assessment and performance improvement of effector protein predictors for bacterial secretion systems III, IV and VI","volume":"19","author":"An","year":"2018","journal-title":"Brief Bioinform"},{"key":"2021052110084388600_ref39","doi-asserted-by":"crossref","first-page":"2756","DOI":"10.1093\/bioinformatics\/btx302","article-title":"POSSUM: a bioinformatics toolkit for generating numerical sequence feature descriptors based on PSSM profiles","volume":"33","author":"Wang","year":"2017","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref40","doi-asserted-by":"crossref","first-page":"3135","DOI":"10.1093\/bioinformatics\/btt554","article-title":"Accurate prediction of bacterial type IV secreted effectors using amino acid composition and PSSM profiles","volume":"29","author":"Zou","year":"2013","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref41","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1016\/j.ygeno.2013.05.006","article-title":"PPIevo: protein-protein interaction prediction from PSSM based evolutionary information","volume":"102","author":"Zahiri","year":"2013","journal-title":"Genomics"},{"key":"2021052110084388600_ref42","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1109\/TCBB.2010.93","article-title":"On position-specific scoring matrix for protein function prediction","volume":"8","author":"Jeong","year":"2011","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2021052110084388600_ref43","doi-asserted-by":"crossref","first-page":"2740","DOI":"10.1093\/bioinformatics\/bty179","article-title":"Deep learning improves antimicrobial peptide recognition","volume":"34","author":"Veltri","year":"2018","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref44","doi-asserted-by":"crossref","first-page":"2605","DOI":"10.1093\/bioinformatics\/bty166","article-title":"DeepSol: a deep learning framework for sequence-based protein solubility prediction","volume":"34","author":"Khurana","year":"2018","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref45","doi-asserted-by":"crossref","first-page":"3685","DOI":"10.1093\/bioinformatics\/btx531","article-title":"An introduction to deep learning on biological sequence data: examples and solutions","volume":"33","author":"Jurtz","year":"2017","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref46","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach Learn"},{"key":"2021052110084388600_ref47","doi-asserted-by":"crossref","first-page":"2267","DOI":"10.1093\/bib\/bby089","article-title":"Large-scale comparative assessment of computational predictors for lysine post-translational modification sites","volume":"20","author":"Chen","year":"2019","journal-title":"Brief Bioinform"},{"key":"2021052110084388600_ref48","doi-asserted-by":"crossref","first-page":"i79","DOI":"10.1093\/bioinformatics\/bty260","article-title":"Random forest based similarity learning for single cell RNA sequencing data","volume":"34","author":"Pouyan","year":"2018","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref49","first-page":"18","article-title":"Classification and regression by RandomForest","volume":"2","author":"Liaw","year":"2002","journal-title":"R News"},{"key":"2021052110084388600_ref50","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1145\/2939672.2939785","volume-title":"Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining","author":"Chen","year":"2016"},{"key":"2021052110084388600_ref51","doi-asserted-by":"crossref","first-page":"2118","DOI":"10.1038\/s41598-017-02365-0","article-title":"CarcinoPred-EL: novel models for predicting the carcinogenicity of chemicals using molecular fingerprints and ensemble learning methods","volume":"7","author":"Zhang","year":"2017","journal-title":"Sci Rep"},{"key":"2021052110084388600_ref52","doi-asserted-by":"crossref","first-page":"983","DOI":"10.3390\/molecules21080983","article-title":"Bioactive molecule prediction using extreme gradient boosting","volume":"21","author":"Babajide Mustapha","year":"2016","journal-title":"Molecules"},{"key":"2021052110084388600_ref53","first-page":"281","article-title":"Random search for hyper-parameter optimization","volume":"13","author":"Bergstra","year":"2012","journal-title":"J Mach Learn Res"},{"key":"2021052110084388600_ref54","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1093\/bioinformatics\/btk036","article-title":"Optimized multilayer perceptrons for molecular classification and diagnosis using genomic data","volume":"22","author":"Wang","year":"2006","journal-title":"Bioinformatics"},{"issue":"Suppl 2","key":"2021052110084388600_ref55","doi-asserted-by":"crossref","first-page":"ii7","DOI":"10.1093\/bioinformatics\/bti1100","article-title":"Augmented cell-graphs for automated cancer diagnosis","volume":"21","author":"Demir","year":"2005","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref56","doi-asserted-by":"crossref","first-page":"878","DOI":"10.15252\/msb.20156651","article-title":"Deep learning for computational biology","volume":"12","author":"Angermueller","year":"2016","journal-title":"Mol Syst Biol"},{"key":"2021052110084388600_ref57","volume-title":"Next-Step Conditioned Deep Convolutional Neural Networks Improve Protein Secondary Structure PredictionarXiv preprint arXiv:1702.03865","author":"Busia","year":"2017"},{"key":"2021052110084388600_ref58","doi-asserted-by":"crossref","first-page":"2449","DOI":"10.1093\/bioinformatics\/bts475","article-title":"Deep architectures for protein contact map prediction","volume":"28","author":"Di Lena","year":"2012","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref59","doi-asserted-by":"crossref","first-page":"i639","DOI":"10.1093\/bioinformatics\/btw427","article-title":"DeepChrome: deep-learning for predicting gene expression from histone modifications","volume":"32","author":"Singh","year":"2016","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref60","doi-asserted-by":"crossref","first-page":"931","DOI":"10.1038\/nmeth.3547","article-title":"Predicting effects of noncoding variants with deep learning-based sequence model","volume":"12","author":"Zhou","year":"2015","journal-title":"Nat Methods"},{"key":"2021052110084388600_ref61","doi-asserted-by":"crossref","first-page":"3600","DOI":"10.1093\/bioinformatics\/btv371","article-title":"High-order neural networks and kernel methods for peptide-MHC binding prediction","volume":"31","author":"Kuksa","year":"2015","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref62","first-page":"851","article-title":"Deep learning in bioinformatics","volume":"18","author":"Min","year":"2017","journal-title":"Brief Bioinform"},{"key":"2021052110084388600_ref63","doi-asserted-by":"crossref","first-page":"i121","DOI":"10.1093\/bioinformatics\/btw255","article-title":"Convolutional neural network architectures for predicting DNA-protein binding","volume":"32","author":"Zeng","year":"2016","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref64","doi-asserted-by":"crossref","first-page":"1041","DOI":"10.1038\/s41467-019-09027-x","article-title":"Deep convolutional neural networks for accurate somatic mutation detection","volume":"10","author":"Sahraeian","year":"2019","journal-title":"Nat Commun"},{"key":"2021052110084388600_ref65","doi-asserted-by":"crossref","first-page":"1559","DOI":"10.1038\/s41591-018-0177-5","article-title":"Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning","volume":"24","author":"Coudray","year":"2018","journal-title":"Nat Med"},{"key":"2021052110084388600_ref66","doi-asserted-by":"crossref","first-page":"1054","DOI":"10.1038\/s41591-019-0462-y","article-title":"Deep learning can predict microsatellite instability directly from histology in gastrointestinal cancer","volume":"25","author":"Kather","year":"2019","journal-title":"Nat Med"},{"key":"2021052110084388600_ref67","doi-asserted-by":"crossref","first-page":"2009","DOI":"10.1093\/bioinformatics\/bty937","article-title":"Identifying antimicrobial peptides using word embedding with deep recurrent neural networks","volume":"35","author":"Hamid","year":"2019","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref68","doi-asserted-by":"crossref","first-page":"1728","DOI":"10.1093\/bioinformatics\/btm247","article-title":"Fast model-based protein homology detection without alignment","volume":"23","author":"Hochreiter","year":"2007","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref69","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1007\/978-3-319-21233-3_6","volume-title":"International Conference on Algorithms for Computational Biology","author":"S\u00f8nderby","year":"2015"},{"key":"2021052110084388600_ref70","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","article-title":"Deep learning in neural networks: an overview","volume":"61","author":"Schmidhuber","year":"2015","journal-title":"Neural Netw"},{"key":"2021052110084388600_ref71","doi-asserted-by":"crossref","first-page":"i121","DOI":"10.1093\/bioinformatics\/btu277","article-title":"Deep learning of the tissue-regulated splicing code","volume":"30","author":"Leung","year":"2014","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref72","doi-asserted-by":"crossref","first-page":"e0141287","DOI":"10.1371\/journal.pone.0141287","article-title":"Continuous distributed representation of biological sequences for deep proteomics and genomics","volume":"10","author":"Asgari","year":"2015","journal-title":"PLoS One"},{"key":"2021052110084388600_ref73","doi-asserted-by":"crossref","first-page":"5128","DOI":"10.1093\/bioinformatics\/btz464","article-title":"DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network","volume":"35","author":"Shi","year":"2019","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref74","volume-title":"Network in networkarXiv preprint arXiv:1312.4400","author":"Lin","year":"2013"},{"key":"2021052110084388600_ref75","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1093\/bioinformatics\/btp039","article-title":"Sequence-based prediction of protein interaction sites with an integrative method","volume":"25","author":"Chen","year":"2009","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref76","doi-asserted-by":"crossref","first-page":"40242","DOI":"10.1038\/srep40242","article-title":"Detecting N(6)-methyladenosine sites from RNA transcriptomes using ensemble support vector machines","volume":"7","author":"Chen","year":"2017","journal-title":"Sci Rep"},{"key":"2021052110084388600_ref77","doi-asserted-by":"crossref","first-page":"1700262","DOI":"10.1002\/pmic.201700262","article-title":"HPSLPred: An ensemble multi-label classifier for human protein subcellular location prediction with imbalanced source","volume":"17","author":"Wan","year":"2017","journal-title":"Proteomics"},{"key":"2021052110084388600_ref78","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1002\/minf.201500031","article-title":"Improving tRNAscan-SE annotation results via ensemble classifiers","volume":"34","author":"Zou","year":"2015","journal-title":"Mol Inform"},{"key":"2021052110084388600_ref79","doi-asserted-by":"crossref","first-page":"4007","DOI":"10.1093\/bioinformatics\/bty451","article-title":"ACPred-FL: a sequence-based predictor based on effective feature representation to improve the prediction of anti-cancer peptides","volume":"34","author":"Wei","year":"2018","journal-title":"Bioinformatics"},{"key":"2021052110084388600_ref80","doi-asserted-by":"crossref","first-page":"2571","DOI":"10.3389\/fmicb.2018.02571","article-title":"PredT4SE-stack: prediction of bacterial type IV secreted effectors from protein sequences using a stacked ensemble method","volume":"9","author":"Xiong","year":"2018","journal-title":"Front Microbiol"},{"key":"2021052110084388600_ref81","doi-asserted-by":"crossref","first-page":"21734","DOI":"10.3390\/ijms160921734","article-title":"An ensemble method to distinguish bacteriophage Virion from non-Virion proteins based on protein sequence characteristics","volume":"16","author":"Zhang","year":"2015","journal-title":"Int J Mol Sci"},{"key":"2021052110084388600_ref82","doi-asserted-by":"crossref","first-page":"EL140","DOI":"10.1121\/1.4865840","article-title":"Estimating confidence intervals for information transfer analysis of confusion matrices","volume":"135","author":"Azadpour","year":"2014","journal-title":"J Acoust Soc Am"},{"key":"2021052110084388600_ref83","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"Camacho","year":"2009","journal-title":"BMC Bioinformat"},{"key":"2021052110084388600_ref84","doi-asserted-by":"crossref","first-page":"272","DOI":"10.1093\/bioinformatics\/btz493","article-title":"Deep learning on chaos game representation for proteins","volume":"36","author":"Lochel","year":"2020","journal-title":"Bioinformatics"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa125\/37965864\/bbaa125.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/3\/bbaa125\/37965864\/bbaa125.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,1]],"date-time":"2023-10-01T12:32:54Z","timestamp":1696163574000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbaa125\/5864586"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,29]]},"references-count":84,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2021,5,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaa125","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,5]]},"published":{"date-parts":[[2020,6,29]]},"article-number":"bbaa125"}}