{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T07:24:05Z","timestamp":1769585045462,"version":"3.49.0"},"reference-count":55,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2021,7,19]],"date-time":"2021-07-19T00:00:00Z","timestamp":1626652800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["32070659"],"award-info":[{"award-number":["32070659"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Guangdong Province Basic and Applied Basic Research Fund","award":["2021A1515012447"],"award-info":[{"award-number":["2021A1515012447"]}]},{"name":"Ganghong Young Scholar Development Fund","award":["2021E007"],"award-info":[{"award-number":["2021E007"]}]},{"name":"Warshel Institute for Computational Biology"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,11,5]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Antiviral peptide (AVP) is a kind of antimicrobial peptide (AMP) that has the potential ability to fight against virus infection. Machine learning-based prediction with a computational biology approach can facilitate the development of the novel therapeutic agents. In this study, we proposed a double-stage classification scheme, named AVPIden, for predicting the AVPs and their functional activities against different viruses. The first stage is to distinguish the AVP from a broad-spectrum peptide collection, including not only the regular peptides (non-AMP) but also the AMPs without antiviral functions (non-AVP). The second stage is responsible for characterizing one or more virus families or species that the AVP targets. Imbalanced learning is utilized to improve the performance of prediction. The AVPIden uses multiple descriptors to precisely demonstrate the peptide properties and adopts explainable machine learning strategies based on Shapley value to exploit how the descriptors impact the antiviral activities. Finally, the evaluation performance of the proposed model suggests its ability to predict the antivirus activities and their potential functions against six virus families (Coronaviridae, Retroviridae, Herpesviridae, Paramyxoviridae, Orthomyxoviridae, Flaviviridae) and eight kinds of virus (FIV, HCV, HIV, HPIV3, HSV1, INFVA, RSV, SARS-CoV). The AVPIden gives an option for reinforcing the development of AVPs with the computer-aided method and has been deployed at http:\/\/awi.cuhk.edu.cn\/AVPIden\/.<\/jats:p>","DOI":"10.1093\/bib\/bbab263","type":"journal-article","created":{"date-parts":[[2021,6,23]],"date-time":"2021-06-23T11:17:08Z","timestamp":1624447028000},"source":"Crossref","is-referenced-by-count":81,"title":["AVPIden: a new scheme for identification and functional prediction of antiviral peptides based on machine learning approaches"],"prefix":"10.1093","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6778-7034","authenticated-orcid":false,"given":"Yuxuan","family":"Pang","sequence":"first","affiliation":[{"name":"Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4554-6827","authenticated-orcid":false,"given":"Lantian","family":"Yao","sequence":"additional","affiliation":[{"name":"Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6158-5207","authenticated-orcid":false,"given":"Jhih-Hua","family":"Jhong","sequence":"additional","affiliation":[{"name":"Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7076-8432","authenticated-orcid":false,"given":"Zhuo","family":"Wang","sequence":"additional","affiliation":[{"name":"Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8475-7868","authenticated-orcid":false,"given":"Tzong-Yi","family":"Lee","sequence":"additional","affiliation":[{"name":"Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China"}]}],"member":"286","published-online":{"date-parts":[[2021,7,19]]},"reference":[{"key":"2021110815064751500_ref1","doi-asserted-by":"crossref","first-page":"3525","DOI":"10.1007\/s00018-019-03138-w","article-title":"Antiviral peptides as promising therapeutic drugs","volume":"76","author":"Vilas Boas","year":"2019","journal-title":"Cell Mol Life Sci"},{"key":"2021110815064751500_ref2","doi-asserted-by":"crossref","first-page":"811","DOI":"10.1007\/s10989-019-09888-2","article-title":"Smp76, a Scorpine-like peptide isolated from the venom of the scorpion Scorpio maurus palmatus, with a potent antiviral activity against hepatitis C virus and dengue virus","volume":"26","author":"El-Bitar","year":"2020","journal-title":"Int J Pept Res Ther"},{"key":"2021110815064751500_ref3","doi-asserted-by":"crossref","first-page":"1518","DOI":"10.1016\/j.peptides.2011.05.015","article-title":"Virucidal activity of a scorpion venom peptide variant mucroporin-M1 against measles, SARS-CoV and influenza H5N1 viruses","volume":"32","author":"Qiaoli","year":"2011","journal-title":"Peptides"},{"key":"2021110815064751500_ref4","doi-asserted-by":"crossref","first-page":"567","DOI":"10.3390\/md17100567","article-title":"Griffithsin, a highly potent broad-Spectrum antiviral lectin from red algae: from discovery to clinical application","volume":"17","author":"Lee","year":"2019","journal-title":"Mar Drugs"},{"key":"2021110815064751500_ref5","doi-asserted-by":"crossref","first-page":"W199","DOI":"10.1093\/nar\/gks450","article-title":"AVPpred: collection and prediction of highly effective antiviral peptides","volume":"40","author":"Nishant","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2021110815064751500_ref6","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1016\/j.compbiomed.2019.02.011","article-title":"AntiVPP 1.0: a portable tool for prediction of antiviral peptides","volume":"107","author":"Beltr\u00e1n Lissabet","year":"2019","journal-title":"Comput Biol Med"},{"key":"2021110815064751500_ref7","doi-asserted-by":"crossref","first-page":"5743","DOI":"10.3390\/ijms20225743","article-title":"Meta-iAVP: a sequence-based meta-predictor for improving the prediction of antiviral peptides using effective feature representation","volume":"20","author":"Nalini","year":"2019","journal-title":"Int J Mol Sci"},{"key":"2021110815064751500_ref8","doi-asserted-by":"crossref","first-page":"753","DOI":"10.1002\/bip.22703","article-title":"AVP-IC50Pred: multiple machine learning techniques-based prediction of peptide antiviral activity in terms of half maximal inhibitory concentration (IC50)","volume":"104","author":"Qureshi","year":"2015","journal-title":"Pept Sci"},{"key":"2021110815064751500_ref9","doi-asserted-by":"crossref","first-page":"1085","DOI":"10.1093\/bib\/bbaa423","article-title":"Identifying anti-coronavirus peptides by incorporating different negative datasets and imbalanced learning strategies","volume":"22","author":"Pang","year":"2021","journal-title":"Brief Bioinform"},{"key":"2021110815064751500_ref10","doi-asserted-by":"crossref","first-page":"986","DOI":"10.3390\/ijms21030986","article-title":"Characterization and identification of natural antimicrobial peptides on different organisms","volume":"21","author":"Chung","year":"2020","journal-title":"Int J Mol Sci"},{"key":"2021110815064751500_ref11","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1016\/j.ab.2013.01.019","article-title":"iAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types","volume":"436","author":"Xiao","year":"2013","journal-title":"Anal Biochem"},{"key":"2021110815064751500_ref12","doi-asserted-by":"crossref","first-page":"42362","DOI":"10.1038\/srep42362","article-title":"Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou\u2019s general PseAAC","volume":"7","author":"Meher","year":"2017","journal-title":"Sci Rep"},{"key":"2021110815064751500_ref13","doi-asserted-by":"crossref","first-page":"1098","DOI":"10.1093\/bib\/bbz043","article-title":"Characterization and identification of antimicrobial peptides with different functional activities","volume":"21","author":"Chung","year":"2020","journal-title":"Brief Bioinform"},{"key":"2021110815064751500_ref14","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1016\/j.eswa.2016.12.035","article-title":"Learning from class-imbalanced data: review of methods and applications","volume":"73","author":"Haixiang","year":"2017","journal-title":"Expert Syst Appl"},{"key":"2021110815064751500_ref15","first-page":"4765","article-title":"Unified approach to interpreting model predictions","volume":"30","author":"Lundberg","year":"2017","journal-title":"Adv Neural Inf Process Syst"},{"key":"2021110815064751500_ref16","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1080\/10408347.2017.1361314","article-title":"Multiple versus single set validation of multivariate models to avoid mistakes","volume":"48","author":"Harrington P de","year":"2018","journal-title":"Crit Rev Anal Chem"},{"key":"2021110815064751500_ref17","doi-asserted-by":"crossref","first-page":"e0119490","DOI":"10.1371\/journal.pone.0119490","article-title":"Analysis and prediction of the critical regions of antimicrobial peptides based on conditional random fields","volume":"10","author":"Chang","year":"2015","journal-title":"PLoS One"},{"key":"2021110815064751500_ref18","doi-asserted-by":"crossref","first-page":"4779","DOI":"10.1021\/bi300090x","article-title":"Glycines: role in \u03b1-helical membrane protein structures and a potential indicator of native conformation","volume":"51","author":"Dong","year":"2012","journal-title":"Biochemistry"},{"key":"2021110815064751500_ref19","doi-asserted-by":"crossref","first-page":"12043","DOI":"10.1038\/s41598-019-48541-2","article-title":"A comprehensive computational study of amino acid interactions in membrane proteins","volume":"9","author":"Mbaye","year":"2019","journal-title":"Sci Rep"},{"key":"2021110815064751500_ref20","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1016\/j.bbamem.2004.11.014","article-title":"Methionine-rich repeat proteins: a family of membrane-associated proteins which contain unusual repeat regions","volume":"1668","author":"Weiss","year":"2005","journal-title":"Biochim Biophys Acta - Biomembr"},{"key":"2021110815064751500_ref21","first-page":"194","article-title":"Antimicrobial peptides: an emerging category of therapeutic agents","volume":"6","author":"Margit Mahlapuu Joakim H\u00e5kansson LRCB","year":"2016","journal-title":"Front Cell Infect Microbiol"},{"key":"2021110815064751500_ref22","doi-asserted-by":"crossref","first-page":"388","DOI":"10.1111\/j.1365-2672.2010.04663.x","article-title":"Isoelectric points of viruses","volume":"109","author":"Michen","year":"2010","journal-title":"J Appl Microbiol"},{"key":"2021110815064751500_ref23","doi-asserted-by":"crossref","first-page":"515 LP","DOI":"10.4049\/jimmunol.173.1.515","article-title":"Activity of \u03b1- and \u03b8-Defensins against primary isolates of HIV-1","volume":"173","author":"Wang","year":"2004","journal-title":"J Immunol"},{"key":"2021110815064751500_ref24","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1111\/j.1365-2958.2008.06288.x","article-title":"A predatory mechanism dramatically increases the efficiency of lateral gene transfer in Streptococcus pneumoniae and related commensal species","volume":"69","author":"Johnsborg","year":"2008","journal-title":"Mol Microbiol"},{"key":"2021110815064751500_ref25","first-page":"285","article-title":"dbAMP: an integrated resource for exploring antimicrobial peptides with functional activities and physicochemical properties on transcriptome and proteome data","volume":"47","author":"Jhih-Hua","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2021110815064751500_ref26","doi-asserted-by":"crossref","first-page":"D1147","DOI":"10.1093\/nar\/gkt1191","article-title":"AVPdb: a database of experimentally validated antiviral peptides targeting medically important viruses","volume":"42","author":"Abid","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2021110815064751500_ref27","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1038\/s41597-019-0154-y","article-title":"DRAMP 2.0, an updated data repository of antimicrobial peptides","volume":"6","author":"Kang","year":"2019","journal-title":"Sci Data"},{"key":"2021110815064751500_ref28","doi-asserted-by":"crossref","first-page":"D288","DOI":"10.1093\/nar\/gkaa991","article-title":"DBAASP v3: database of antimicrobial\/cytotoxic activity and structure of peptides as a resource for development of new therapeutics","volume":"49","author":"Pirtskhalava","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2021110815064751500_ref29","doi-asserted-by":"crossref","first-page":"e54908","DOI":"10.1371\/journal.pone.0054908","article-title":"HIPdb: a database of experimentally validated HIV inhibiting peptides","volume":"8","author":"Qureshi","year":"2013","journal-title":"PLoS One"},{"key":"2021110815064751500_ref30","doi-asserted-by":"crossref","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"UniProt: a worldwide hub of protein knowledge","volume":"47","author":"Consortium","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2021110815064751500_ref31","first-page":"1658","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"ioinformatics"},{"key":"2021110815064751500_ref32","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1016\/j.gpb.2018.10.010","article-title":"SuccSite: incorporating amino acid composition and informative k-spaced amino acid pairs to identify protein Succinylation sites","volume":"18","author":"Kao","year":"2020","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2021110815064751500_ref33","doi-asserted-by":"crossref","first-page":"859","DOI":"10.1016\/j.ygeno.2019.05.027","article-title":"Prediction of lysine formylation sites using the composition of k-spaced amino acid pairs via Chou\u2019s 5-steps rule and general pseudo components","volume":"112","author":"Ju","year":"2020","journal-title":"Genomics"},{"key":"2021110815064751500_ref34","doi-asserted-by":"crossref","first-page":"23510","DOI":"10.1038\/srep23510","article-title":"DephosSite: a machine learning approach for discovering phosphotase-specific dephosphorylation sites","volume":"6","author":"Wang","year":"2016","journal-title":"Sci Rep"},{"key":"2021110815064751500_ref35","doi-asserted-by":"crossref","first-page":"1887","DOI":"10.1016\/j.patrec.2008.06.007","article-title":"Using Chou\u2019s pseudo amino acid composition to predict subcellular localization of apoptosis proteins: an approach with immune genetic algorithm-based ensemble classifier","volume":"29","author":"Ding","year":"2008","journal-title":"Pattern Recognit Lett"},{"key":"2021110815064751500_ref36","doi-asserted-by":"crossref","first-page":"e18476","DOI":"10.1371\/journal.pone.0018476","article-title":"Prediction of antimicrobial peptides based on sequence alignment and feature selection methods","volume":"6","author":"Wang","year":"2011","journal-title":"PLoS One"},{"key":"2021110815064751500_ref37","doi-asserted-by":"crossref","first-page":"262","DOI":"10.2174\/157016409789973707","article-title":"Pseudo amino acid composition and its applications in bioinformatics, proteomics and system biology","volume":"6","author":"Kuo-Chen","year":"2009","journal-title":"Curr Proteomics"},{"key":"2021110815064751500_ref38","first-page":"1895","article-title":"Thermostability and aliphatic index of globular proteins","volume":"88","author":"Ikai","year":"1980","journal-title":"J Biochem"},{"key":"2021110815064751500_ref39","doi-asserted-by":"crossref","first-page":"4277","DOI":"10.1021\/bi00613a026","article-title":"Conformational preferences of amino acids in globular proteins","volume":"17","author":"Levitt","year":"1978","journal-title":"Biochemistry"},{"key":"2021110815064751500_ref40","doi-asserted-by":"crossref","first-page":"1987","DOI":"10.1110\/ps.062286306","article-title":"An amino acid \u201ctransmembrane tendency\u201d scale that approaches the theoretical limit to accuracy for prediction of transmembrane helices: relationship to biological hydrophobicity","volume":"15","author":"Zhao","year":"2006","journal-title":"Protein Sci"},{"key":"2021110815064751500_ref41","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1038\/299371a0","article-title":"The helical hydrophobic moment: a measure of the amphiphilicity of a helix","volume":"299","author":"Eisenberg","year":"1982","journal-title":"Nature"},{"key":"2021110815064751500_ref42","doi-asserted-by":"crossref","first-page":"3824 LP","DOI":"10.1073\/pnas.78.6.3824","article-title":"Prediction of protein antigenic determinants from amino acid sequences","volume":"78","author":"Hopp","year":"1981","journal-title":"Proc Natl Acad Sci"},{"key":"2021110815064751500_ref43","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/0014-5793(89)81505-4","article-title":"Antibacterial and antimalarial properties of peptides that are cecropin-melittin hybrids","volume":"259","author":"Boman","year":"1989","journal-title":"FEBS Lett"},{"key":"2021110815064751500_ref44","doi-asserted-by":"crossref","first-page":"1023","DOI":"10.1002\/elps.11501401163","article-title":"The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences","volume":"14","author":"Bjellqvist","year":"1993","journal-title":"Electrophoresis"},{"key":"2021110815064751500_ref45","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1002\/elps.1150150171","article-title":"Reference points for comparisons of two-dimensional maps of proteins from different human cell types defined in a pH scale where isoelectric points correlate with polypeptide compositions","volume":"15","author":"Bjellqvist","year":"1994","journal-title":"Electrophoresis"},{"key":"2021110815064751500_ref46","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1007\/s13748-016-0094-0","article-title":"Learning from imbalanced data: open challenges and future directions","volume":"5","author":"Krawczyk","year":"2016","journal-title":"Prog Artif Intell"},{"key":"2021110815064751500_ref47","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach Learn"},{"key":"2021110815064751500_ref48","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1148\/radiology.143.1.7063747","article-title":"The meaning and use of the area under a receiver operating characteristic (ROC) curve","volume":"143","author":"Hanley","year":"1982","journal-title":"Radiology"},{"key":"2021110815064751500_ref49","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1111\/j.1541-0420.2007.00781_1.x","article-title":"The skill plot: a graphical technique for evaluating continuous diagnostic tests","volume":"64","author":"Briggs","year":"2008","journal-title":"Biometrics"},{"key":"2021110815064751500_ref50","first-page":"27","article-title":"Evaluation measures for models assessment over imbalanced data sets","volume":"3","author":"Bekkar","year":"2013","journal-title":"J Inf Eng Appl"},{"key":"2021110815064751500_ref51","doi-asserted-by":"crossref","first-page":"2499","DOI":"10.1093\/bioinformatics\/bty140","article-title":"iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2021110815064751500_ref52","doi-asserted-by":"crossref","first-page":"1422","DOI":"10.1093\/bioinformatics\/btp163","article-title":"Biopython: freely available Python tools for computational molecular biology and bioinformatics","volume":"25","author":"Cock","year":"2009","journal-title":"Bioinformatics"},{"key":"2021110815064751500_ref53","doi-asserted-by":"crossref","first-page":"2753","DOI":"10.1093\/bioinformatics\/btx285","article-title":"modlAMP: Python for antimicrobial peptides","volume":"33","author":"M\u00fcller","year":"2017","journal-title":"Bioinformatics"},{"key":"2021110815064751500_ref54","first-page":"2825","article-title":"Scikit-learn: machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J Mach Learn Res"},{"key":"2021110815064751500_ref55","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/s42256-019-0138-9","article-title":"From local explanations to global understanding with explainable AI for trees","volume":"2","author":"Lundberg","year":"2020","journal-title":"Nat Mach Intell"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/22\/6\/bbab263\/41088925\/bbab263.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/22\/6\/bbab263\/41088925\/bbab263.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,11,8]],"date-time":"2021-11-08T15:11:48Z","timestamp":1636384308000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbab263\/6323205"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,19]]},"references-count":55,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2021,11,5]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbab263","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,11]]},"published":{"date-parts":[[2021,7,19]]},"article-number":"bbab263"}}