{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T05:31:33Z","timestamp":1777527093970,"version":"3.51.4"},"reference-count":54,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2023,3,10]],"date-time":"2023-03-10T00:00:00Z","timestamp":1678406400000},"content-version":"vor","delay-in-days":9,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62071278"],"award-info":[{"award-number":["62071278"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,3,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Plant Small Secreted Peptides (SSPs) play an important role in plant growth, development, and plant\u2013microbe interactions. Therefore, the identification of SSPs is essential for revealing the functional mechanisms. Over the last few decades, machine learning-based methods have been developed, accelerating the discovery of SSPs to some extent. However, existing methods highly depend on handcrafted feature engineering, which easily ignores the latent feature representations and impacts the predictive performance.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Here, we propose ExamPle, a novel deep learning model using Siamese network and multi-view representation for the explainable prediction of the plant SSPs. Benchmarking comparison results show that our ExamPle performs significantly better than existing methods in the prediction of plant SSPs. Also, our model shows excellent feature extraction ability. Importantly, by utilizing in silicomutagenesis experiment, ExamPle can discover sequential characteristics and identify the contribution of each amino acid for the predictions. The key novel principle learned by our model is that the head region of the peptide and some specific sequential patterns are strongly associated with the SSPs\u2019 functions. Thus, ExamPle is expected to be a useful tool for predicting plant SSPs and designing effective plant SSPs.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>Our codes and datasets are available at https:\/\/github.com\/Johnsunnn\/ExamPle.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad108","type":"journal-article","created":{"date-parts":[[2023,3,9]],"date-time":"2023-03-09T22:21:45Z","timestamp":1678400505000},"source":"Crossref","is-referenced-by-count":25,"title":["ExamPle: explainable deep learning framework for the prediction of plant small secreted peptides"],"prefix":"10.1093","volume":"39","author":[{"given":"Zhongshen","family":"Li","sequence":"first","affiliation":[{"name":"School of Software, Shandong University , Jinan 250101, China"},{"name":"Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University , Jinan 250101, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Junru","family":"Jin","sequence":"additional","affiliation":[{"name":"School of Software, Shandong University , Jinan 250101, China"},{"name":"Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University , Jinan 250101, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Software, Shandong University , Jinan 250101, China"},{"name":"Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University , Jinan 250101, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wentao","family":"Long","sequence":"additional","affiliation":[{"name":"School of Software, Shandong University , Jinan 250101, China"},{"name":"Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University , Jinan 250101, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuanhao","family":"Ding","sequence":"additional","affiliation":[{"name":"Hainan Key Laboratory for Sustainable Utilization of Tropical Bioresource, College of Tropical Crops, Hainan University , Haikou 570228, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haiyan","family":"Hu","sequence":"additional","affiliation":[{"name":"Hainan Key Laboratory for Sustainable Utilization of Tropical Bioresource, College of Tropical Crops, Hainan University , Haikou 570228, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1444-190X","authenticated-orcid":false,"given":"Leyi","family":"Wei","sequence":"additional","affiliation":[{"name":"School of Software, Shandong University , Jinan 250101, China"},{"name":"Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University , Jinan 250101, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2023,3,10]]},"reference":[{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1002\/wics.101","article-title":"Principal component analysis","volume":"2","author":"Abdi","year":"2010","journal-title":"Wiley Interdiscip Rev Comput Stat"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"W202","DOI":"10.1093\/nar\/gkp335","article-title":"MEME SUITE: tools for motif discovery and searching","volume":"37","author":"Bailey","year":"2009","journal-title":"Nucleic Acids Res"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"2834","DOI":"10.1093\/bioinformatics\/btab203","article-title":"STREME: accurate and versatile sequence motif discovery","volume":"37","author":"Bailey","year":"2021","journal-title":"Bioinformatics"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1104\/pp.19.01088","article-title":"MtSSPdb: the Medicago truncatula small secreted peptide database","volume":"183","author":"Boschiero","year":"2020","journal-title":"Plant Physiol"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12915-016-0280-3","article-title":"Q&A: how does peptide signaling direct plant development?","volume":"14","author":"Breiden","year":"2016","journal-title":"BMC Biol"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach Learn"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1016\/j.tplants.2009.02.002","article-title":"Plant peptides in signalling: looking for new partners","volume":"14","author":"Butenko","year":"2009","journal-title":"Trends Plant Sci"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"e60","DOI":"10.1093\/nar\/gkab122","article-title":"iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization","volume":"49","author":"Chen","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"2499","DOI":"10.1093\/bioinformatics\/bty140","article-title":"iFeature: a python package and web server for features extraction and selection from protein and peptide sequences","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1047","DOI":"10.1093\/bib\/bbz041","article-title":"iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data","volume":"21","author":"Chen","year":"2020","journal-title":"Brief Bioinform"},{"key":"2023032018461826100_","first-page":"1724","author":"Cho"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1023\/A:1005986004615","article-title":"Prosystemin from potato, black nightshade, and bell pepper: primary structure and biological activity of predicted systemin polypeptides","volume":"36","author":"Constabel","year":"1998","journal-title":"Plant Mol Biol"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach Learn"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"5281","DOI":"10.1093\/jxb\/ert283","article-title":"Message in a bottle: small signalling peptide outputs during growth and development","volume":"64","author":"Czyzewicz","year":"2013","journal-title":"J Exp Bot"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1669","DOI":"10.1104\/pp.17.01096","article-title":"Genome-wide identification of Medicago peptides involved in macronutrient responses and nodulation","volume":"175","author":"de Bang","year":"2017","journal-title":"Plant Physiol"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"594","DOI":"10.1126\/science.1160158","article-title":"Receptor-like kinase ACR4 restricts formative cell divisions in the Arabidopsis root","volume":"322","author":"De Smet","year":"2008","journal-title":"Science"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1002\/(SICI)1097-0134(19990601)35:4<401::AID-PROT3>3.0.CO;2-K","article-title":"Recognition of a protein fold in the context of the SCOP classification","volume":"35","author":"Dubchak","year":"1999","journal-title":"Proteins"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1016\/bs.ctdb.2018.10.005","article-title":"Vascular tissue development in plants","volume":"131","author":"Fukuda","year":"2019","journal-title":"Curr Top Dev Biol"},{"key":"2023032018461826100_","first-page":"1735","author":"Hadsell","year":"2006"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","DOI":"10.1002\/9781118548387","volume-title":"Applied Logistic Regression","author":"Hosmer","year":"2013"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"2206151","DOI":"10.1002\/advs.202206151","article-title":"Explainable deep hypergraph learning modeling the peptide secondary structure prediction","volume":"10","author":"Jiang","year":"2023","journal-title":"Adv Sci"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"12205","DOI":"10.1073\/pnas.0700344104","article-title":"Tomato MAPKs LeMPK1, LeMPK2, and LeMPK3 function in the systemin-mediated defense response against herbivorous insects","volume":"104","author":"Kandoth","year":"2007","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1104\/pp.106.086041","article-title":"The Arabidopsis unannotated secreted peptide database, a resource for plant peptidomics","volume":"142","author":"Lease","year":"2006","journal-title":"Plant Physiol"},{"key":"2023032018461826100_","doi-asserted-by":"publisher","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023032018461826100_","first-page":"10","article-title":"Supervised graph co-contrastive learning for drug-target interaction prediction","author":"Li","year":"2022","journal-title":"Bioinformatics"},{"key":"2023032018461826100_","article-title":"Drug\u2013target interaction predication via multi-channel graph neural networks","author":"Li","year":"2021","journal-title":"Brief Bioinform"},{"key":"2023032018461826100_","article-title":"Evaluating disease similarity based on gene network reconstruction and representation","author":"Li","year":"2021","journal-title":"Bioinformatics"},{"key":"2023032018461826100_","author":"Li","year":"2019"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1080\/18756891.2011.9727765","article-title":"An improved method for protein similarity searching by alignment of fuzzy energy signatures","volume":"4","author":"Malysiak-Mrozek","year":"2011","journal-title":"Int J Comput Intell Syst"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1146\/annurev-arplant-050312-120122","article-title":"Posttranslationally modified small-peptide signals in plants","volume":"65","author":"Matsubayashi","year":"2014","journal-title":"Annu Rev Plant Biol"},{"key":"2023032018461826100_","first-page":"109","volume-title":"Psychology of Learning and Motivation","author":"McCloskey","year":"1989"},{"key":"2023032018461826100_","first-page":"378","author":"Melekhov","year":"2016"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1007\/s00425-010-1236-4","article-title":"CLE14\/CLE20 peptides may interact with CLAVATA2\/CORYNE receptor-like kinases to irreversibly inhibit cell division in the root meristem of Arabidopsis","volume":"232","author":"Meng","year":"2010","journal-title":"Planta"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"D412","DOI":"10.1093\/nar\/gkaa913","article-title":"Pfam: the protein families database in 2021","volume":"49","author":"Mistry","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.dsp.2017.10.011","article-title":"Methods for interpreting and understanding deep neural networks","volume":"73","author":"Montavon","year":"2018","journal-title":"Digit Signal Process"},{"key":"2023032018461826100_","first-page":"1","author":"Mrozek","year":"2007"},{"key":"2023032018461826100_","first-page":"1","author":"Mrozek","year":"2009"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"107365","DOI":"10.1016\/j.compbiolchem.2020.107365","article-title":"A review of cloud computing technologies for comprehensive microRNA analyses","volume":"88","author":"Mrozek","year":"2020","journal-title":"Computat Biol Chem"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"3198","DOI":"10.1105\/tpc.112.099010","article-title":"Small signaling peptides in Arabidopsis development: how cells communicate over a short distance","volume":"24","author":"Murphy","year":"2012","journal-title":"Plant Cell"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"5810","DOI":"10.1073\/pnas.1719491115","article-title":"AtPep3 is a hormone-like peptide that plays a role in the salinity stress tolerance of plants","volume":"115","author":"Nakaminami","year":"2018","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1111\/j.1365-313X.2008.03464.x","article-title":"Identification of a biologically active, small, secreted peptide in Arabidopsis by in silico gene screening, followed by LC-MS-based structure analysis","volume":"55","author":"Ohyama","year":"2008","journal-title":"Plant J"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"895","DOI":"10.1126\/science.253.5022.895","article-title":"A polypeptide from tomato leaves induces wound-inducible proteinase inhibitor proteins","volume":"253","author":"Pearce","year":"1991","journal-title":"Science"},{"key":"2023032018461826100_","first-page":"41","author":"Rish","year":"2001"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"4337","DOI":"10.1073\/pnas.0607879104","article-title":"Predicting protein\u2013protein interactions based only on sequences information","volume":"104","author":"Shen","year":"2007","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1190","DOI":"10.1021\/ac60139a006","article-title":"Automatic recording apparatus for use in chromatography of amino acids","volume":"30","author":"Spackman","year":"1958","journal-title":"Anal Chem"},{"key":"2023032018461826100_","doi-asserted-by":"publisher","first-page":"1023","DOI":"10.1038\/s41587-021-01156-3","article-title":"SignalP 6.0 predicts all five types of signal peptides using protein language models","volume":"40","author":"Teufel","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"1260419","DOI":"10.1126\/science.1260419","article-title":"Tissue-based map of the human proteome","volume":"347","author":"Uhl\u00e9n","year":"2015","journal-title":"Science"},{"key":"2023032018461826100_","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Van der Maaten","year":"2008","journal-title":"J Mach Learn Res"},{"key":"2023032018461826100_","article-title":"Attention is all you need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Adv Neural Inf Process Syst"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1038\/s41477-018-0106-0","article-title":"The systemin receptor SYR1 enhances resistance of tomato against herbivorous insects","volume":"4","author":"Wang","year":"2018","journal-title":"Nat Plants"},{"key":"2023032018461826100_","author":"Wang","year":"2022"},{"key":"2023032018461826100_","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1093\/pcp\/pcx202","article-title":"CYSTM, a novel non-secreted cysteine-rich peptide family, involved in environmental stresses in Arabidopsis thaliana","volume":"59","author":"Xu","year":"2018","journal-title":"Plant Cell Physiol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad108\/49501932\/btad108.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/3\/btad108\/49571399\/btad108.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/3\/btad108\/49571399\/btad108.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,27]],"date-time":"2023-03-27T23:46:04Z","timestamp":1679960764000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad108\/7075544"}},"subtitle":[],"editor":[{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2023,3,1]]},"references-count":54,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,3,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad108","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,3,1]]},"published":{"date-parts":[[2023,3,1]]},"article-number":"btad108"}}