{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T20:14:48Z","timestamp":1769458488807,"version":"3.49.0"},"reference-count":64,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2023,4,3]],"date-time":"2023-04-03T00:00:00Z","timestamp":1680480000000},"content-version":"vor","delay-in-days":2,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62102030"],"award-info":[{"award-number":["62102030"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U22A2039"],"award-info":[{"award-number":["U22A2039"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,4,3]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Therapeutic peptides play an important role in immune regulation. Recently various therapeutic peptides have been used in the field of medical research, and have great potential in the design of therapeutic schedules. Therefore, it is essential to utilize the computational methods to predict the therapeutic peptides. However, the therapeutic peptides cannot be accurately predicted by the existing predictors. Furthermore, chaotic datasets are also an important obstacle of the development of this important field. Therefore, it is still challenging to develop a multi-classification model for identification of therapeutic peptides and their types.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>In this work, we constructed a general therapeutic peptide dataset. An ensemble-learning method named PreTP-2L was developed for predicting various therapeutic peptide types. PreTP-2L consists of two layers. The first layer predicts whether a peptide sequence belongs to therapeutic peptide, and the second layer predicts if a therapeutic peptide belongs to a particular species.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>A user-friendly webserver PreTP-2L can be accessed at http:\/\/bliulab.net\/PreTP-2L.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad125","type":"journal-article","created":{"date-parts":[[2023,4,3]],"date-time":"2023-04-03T14:51:55Z","timestamp":1680533515000},"source":"Crossref","is-referenced-by-count":21,"title":["PreTP-2L: identification of therapeutic peptides and their types using two-layer ensemble learning framework"],"prefix":"10.1093","volume":"39","author":[{"given":"Ke","family":"Yan","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology , Beijing 100081, China"}]},{"given":"Yichen","family":"Guo","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology , Beijing 100081, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3685-9469","authenticated-orcid":false,"given":"Bin","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Beijing Institute of Technology , Beijing 100081, China"},{"name":"Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology , Beijing 100081, China"}]}],"member":"286","published-online":{"date-parts":[[2023,4,3]]},"reference":[{"key":"2023040521011083700_","volume-title":"Deep Learning using Rectified Linear Units (ReLU)","author":"Agarap","year":"2018"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"bbaa153","DOI":"10.1093\/bib\/bbaa153","article-title":"AntiCP 2.0: an updated model for predicting anticancer peptides","volume":"22","author":"Agrawal","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","first-page":"2767","author":"Albardi","year":"2021"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1186\/s40537-021-00444-8","article-title":"Review of deep learning: concepts, CNN architectures, challenges, applications, future directions","volume":"8","author":"Alzubaidi","year":"2021","journal-title":"J Big Data"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"D154","DOI":"10.1093\/nar\/gki070","article-title":"The Universal Protein Resource (UniProt)","volume":"33","author":"Bairoch","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"bbab252","DOI":"10.1093\/bib\/bbab252","article-title":"Integrative machine learning framework for the identification of cell-specific enhancers from the human genome","volume":"22","author":"Basith","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"bbab376","DOI":"10.1093\/bib\/bbab376","article-title":"STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction","volume":"23","author":"Basith","year":"2022","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"1276","DOI":"10.1002\/med.21658","article-title":"Machine intelligence in peptide therapeutics: a next-generation tool for rapid disease screening","volume":"40","author":"Basith","year":"2020","journal-title":"Med Res Rev"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1002\/psc.717","article-title":"Current strategies for the development of peptide-based anti-cancer therapeutics","volume":"11","author":"Borghouts","year":"2005","journal-title":"J Peptide Sci"},{"key":"2023040521011083700_","first-page":"111","author":"Boureau","year":"2010"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach Learn"},{"key":"2023040521011083700_","volume-title":"Convolution Algorithms","author":"Burrus","year":"1985"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"bbab172","DOI":"10.1093\/bib\/bbab172","article-title":"StackIL6: a stacking ensemble model for improving the prediction of IL-6 inducing peptides","volume":"22","author":"Charoenkwan","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","first-page":"73","article-title":"SGD: Saccharomyces Genome Database","volume-title":"Nucleic Acids Res","author":"Cherry","year":"1998"},{"key":"2023040521011083700_","author":"Dondoshansky","year":"2002"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"e0136990","DOI":"10.1371\/journal.pone.0136990","article-title":"AntiAngioPred: a server for prediction of anti-angiogenic peptides","volume":"10","author":"Ettayapuram Ramaprasad","year":"2015","journal-title":"PLoS One"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"168956","DOI":"10.1109\/ACCESS.2019.2952621","article-title":"iRBP-Motif-PSSM: identification of RNA-binding proteins based on collaborative learning","volume":"7","author":"Gao","year":"2019","journal-title":"IEEE Access"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"bbab358","DOI":"10.1093\/bib\/bbab358","article-title":"PreTP-EL: prediction of therapeutic peptides based on ensemble learning","volume":"22","author":"Guo","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1186\/s12967-016-1103-6","article-title":"Prediction of anti-inflammatory proteins\/peptides: an insilico approach","volume":"15","author":"Gupta","year":"2017","journal-title":"J Transl Med"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"bbab167","DOI":"10.1093\/bib\/bbab167","article-title":"NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning","volume":"22","author":"Hasan","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"2856","DOI":"10.1016\/j.ymthe.2022.05.001","article-title":"Deepm5C: a deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy","volume":"30","author":"Hasan","year":"2022","journal-title":"Mol Ther"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"4806","DOI":"10.1109\/ACCESS.2019.2962617","article-title":"The real-world-weight cross-entropy loss function: Modeling the costs of mislabeling","volume":"8","author":"Ho","journal-title":"IEEE Access"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1093\/bioinformatics\/14.5.423","article-title":"Removing near-neighbour redundancy from large protein sequence collections","volume":"14","author":"Holm","year":"1998","journal-title":"Bioinformatics"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"D38","DOI":"10.1093\/nar\/gkv1116","article-title":"Tools and data services registry: a community effort to document bioinformatics resources","volume":"44","author":"Ison","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023040521011083700_","volume-title":"Categorical Reparameterization with Gumbel-Softmax","author":"Jang","year":"2016"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1038\/s41597-019-0154-y","article-title":"DRAMP 2.0, an updated data repository of antimicrobial peptides","volume":"6","author":"Kang","year":"2019","journal-title":"Sci Data"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1186\/1471-2105-8-263","article-title":"Analysis and prediction of antibacterial peptides","volume":"8","author":"Lata","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023040521011083700_","first-page":"1","article-title":"PSBinder: a web service for predicting polystyrene surface-binding peptides","volume":"2017","author":"Li","year":"2017","journal-title":"Biomed Res Int"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"e129","DOI":"10.1093\/nar\/gkab829","article-title":"BioSeq-BLM: a platform for analyzing DNA, RNA, and protein sequences based on biological language models","volume":"49","author":"Li","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"1203","DOI":"10.1109\/TCBB.2018.2789880","article-title":"ProtDet-CCH: protein remote homology detection by combining long short-term memory and ranking methods","volume":"16","author":"Liu","year":"2019","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"e106691","DOI":"10.1371\/journal.pone.0106691","article-title":"iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition","volume":"9","author":"Liu","year":"2014","journal-title":"PLoS One"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"S3","DOI":"10.1186\/1471-2105-15-S16-S3","article-title":"Using distances between top-n-gram and residue pairs for protein remote homology detection","volume":"15","author":"Liu","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"2185","DOI":"10.1093\/bib\/bbz139","article-title":"Fold-LTR-TCP: protein fold recognition based on triadic closure principle","volume":"21","author":"Liu","year":"2020","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"276","DOI":"10.3389\/fphar.2018.00276","article-title":"AIPpred: sequence-based prediction of anti-inflammatory peptides using random forest","volume":"9","author":"Manavalan","year":"2018","journal-title":"Front Pharmacol"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"2136","DOI":"10.1109\/TCOMM.2002.806518","article-title":"Cyclic prefixing or zero padding for wireless multicarrier transmissions?","volume":"50","author":"Muquet","year":"2002","journal-title":"IEEE Trans Commun"},{"key":"2023040521011083700_","author":"O'Shea","year":"2015"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"D288","DOI":"10.1093\/nar\/gkaa991","article-title":"DBAASP v3: database of antimicrobial\/cytotoxic activity and structure of peptides as a resource for development of new therapeutics","volume":"49","author":"Pirtskhalava","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023040521011083700_","author":"Powers","year":"2008"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"e0120066","DOI":"10.1371\/journal.pone.0120066","article-title":"Prediction and analysis of quorum sensing peptides based on sequence features","volume":"10","author":"Rajput","year":"2015","journal-title":"PLoS One"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"1846","DOI":"10.1093\/bib\/bbz088","article-title":"ACPred-Fuse: fusing multi-view information improves the prediction of anticancer peptides","volume":"21","author":"Rao","year":"2020","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","first-page":"95","article-title":"Going deeper in spiking neural networks: VGG and residual architectures","author":"Sengupta","year":"2019"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1016\/j.ab.2007.10.012","article-title":"PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition","volume":"373","author":"Shen","year":"2008","journal-title":"Anal Biochem"},{"key":"2023040521011083700_","author":"Simonyan","year":"2014"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"D1119","DOI":"10.1093\/nar\/gkv1114","article-title":"SATPdb: a database of structurally annotated therapeutic peptides","volume":"44","author":"Singh","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"W199","DOI":"10.1093\/nar\/gks450","article-title":"AVPpred: collection and prediction of highly effective antiviral peptides","volume":"40","author":"Thakur","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023040521011083700_","author":"Tolias","year":"2015"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"951","DOI":"10.1038\/s41551-021-00698-w","article-title":"The evolution of commercial drug delivery technologies","volume":"5","author":"Vargason","year":"2021","journal-title":"Nat Biomed Eng"},{"key":"2023040521011083700_","first-page":"3249","article-title":"Complex network study of the immune epitope database for parasitic organisms","volume":"18","author":"Vazquez-Prieto","year":"2017","journal-title":"Curr Top Med Chem"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1007\/s11030-017-9749-4","article-title":"A study of the immune epitope database for some fungi species using network topological indices","volume":"21","author":"V\u00e1zquez-Prieto","year":"2017","journal-title":"Mol Divers"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1002\/pro.3714","article-title":"Collection of antimicrobial peptides database and its derivatives: applications and beyond","volume":"29","author":"Waghu","year":"2020","journal-title":"Protein Sci"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"2044","DOI":"10.1021\/acs.jproteome.7b00019","article-title":"CPPred-RF: a sequence-based predictor for identifying cell-penetrating peptides and their uptake efficiency","volume":"16","author":"Wei","year":"2017","journal-title":"J Proteome Res"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"4007","DOI":"10.1093\/bioinformatics\/bty451","article-title":"ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides","volume":"34","author":"Wei","year":"2018","journal-title":"Bioinformatics"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"4272","DOI":"10.1093\/bioinformatics\/btz246","article-title":"PEPred-Suite: improved and robust prediction of therapeutic peptides using adaptive feature representation learning","volume":"35","author":"Wei","year":"2019","journal-title":"Bioinformatics"},{"key":"2023040521011083700_","first-page":"11","article-title":"CPPred-FL: a sequence-based predictor for large-scale identification of cell-penetrating peptides by feature representation learning","volume":"21","author":"Qiang","year":"2018","journal-title":"Brief Bioinform"},{"key":"2023040521011083700_","article-title":"PreTP-Stack: prediction of therapeutic peptide based on the stacked ensemble learning","author":"Yan","year":"2022","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"btac715","DOI":"10.1093\/bioinformatics\/btac715","article-title":"sAMPpred-GAT: prediction of antimicrobial peptide by graph attention network and predicted peptide structure","volume":"39","author":"Yan","year":"2023","journal-title":"Bioinformatics"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"2712","DOI":"10.1093\/bioinformatics\/btac200","article-title":"TPpred-ATMV: therapeutic peptides prediction by adaptive multi-view tensor learning model","volume":"38","author":"Yan","year":"2022","journal-title":"Bioinformatics"},{"key":"2023040521011083700_","author":"Zeiler","year":"2012"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"5860","DOI":"10.1016\/j.jmb.2020.09.008","article-title":"iDRBP_MMC: identifying DNA-binding proteins and RNA-binding proteins based on multi-label learning model and motif-based convolutional neural network","volume":"432","author":"Zhang","year":"2020","journal-title":"J Mol Biol"},{"key":"2023040521011083700_","doi-asserted-by":"crossref","first-page":"3982","DOI":"10.1093\/bioinformatics\/btaa275","article-title":"PPTPP: a novel therapeutic peptide prediction method using physicochemical property encoding and adaptive feature representation learning","volume":"36","author":"Zhang","year":"2020","journal-title":"Bioinformatics"},{"key":"2023040521011083700_","first-page":"31","author":"Zhang","year":"2018"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad125\/49732024\/btad125.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/4\/btad125\/49772096\/btad125.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/4\/btad125\/49772096\/btad125.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,10]],"date-time":"2023-12-10T04:32:31Z","timestamp":1702182751000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad125\/7100341"}},"subtitle":[],"editor":[{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,4,1]]},"references-count":64,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,4,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad125","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,4,1]]},"published":{"date-parts":[[2023,4,1]]},"article-number":"btad125"}}