{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T15:35:05Z","timestamp":1779204905145,"version":"3.51.4"},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2023,9,22]],"date-time":"2023-09-22T00:00:00Z","timestamp":1695340800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/pages\/standard-publication-reuse-rights"}],"funder":[{"DOI":"10.13039\/501100001809","name":"Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62071278"],"award-info":[{"award-number":["62071278"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,9,22]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>NcRNA-encoded small peptides (ncPEPs) have recently emerged as promising targets and biomarkers for cancer immunotherapy. Therefore, identifying cancer-associated ncPEPs is crucial for cancer research. In this work, we propose CoraL, a novel supervised contrastive meta-learning framework for predicting cancer-associated ncPEPs. Specifically, the proposed meta-learning strategy enables our model to learn meta-knowledge from different types of peptides and train a promising predictive model even with few labeled samples. The results show that our model is capable of making high-confidence predictions on unseen cancer biomarkers with only five samples, potentially accelerating the discovery of novel cancer biomarkers for immunotherapy. Moreover, our approach remarkably outperforms existing deep learning models on 15 cancer-associated ncPEPs datasets, demonstrating its effectiveness and robustness. Interestingly, our model exhibits outstanding performance when extended for the identification of short open reading frames derived from ncPEPs, demonstrating the strong prediction ability of CoraL at the transcriptome level. Importantly, our feature interpretation analysis discovers unique sequential patterns as the fingerprint for each cancer-associated ncPEPs, revealing the relationship among certain cancer biomarkers that are validated by relevant literature and motif comparison. Overall, we expect CoraL to be a useful tool to decipher the pathogenesis of cancer and provide valuable information for cancer research. The dataset and source code of our proposed method can be found at https:\/\/github.com\/Johnsunnn\/CoraL.<\/jats:p>","DOI":"10.1093\/bib\/bbad352","type":"journal-article","created":{"date-parts":[[2023,10,20]],"date-time":"2023-10-20T11:01:37Z","timestamp":1697799697000},"source":"Crossref","is-referenced-by-count":11,"title":["CoraL: interpretable contrastive meta-learning for the prediction of cancer-associated ncRNA-encoded small peptides"],"prefix":"10.1093","volume":"24","author":[{"given":"Zhongshen","family":"Li","sequence":"first","affiliation":[{"name":"Shandong University School of Software, , Jinan 250101 , China"},{"name":"Shandong University Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), , Jinan 250101 , China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Junru","family":"Jin","sequence":"additional","affiliation":[{"name":"Shandong University School of Software, , Jinan 250101 , China"},{"name":"Shandong University Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), , Jinan 250101 , China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wenjia","family":"He","sequence":"additional","affiliation":[{"name":"King Abdullah University of Science and Technology (KAUST) Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), , Thuwal , Saudi Arabia"},{"name":"King Abdullah University of Science and Technology (KAUST) Computational Bioscience Research Center (CBRC), , Thuwal , Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wentao","family":"Long","sequence":"additional","affiliation":[{"name":"Shandong University School of Software, , Jinan 250101 , China"},{"name":"Shandong University Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), , Jinan 250101 , China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haoqing","family":"Yu","sequence":"additional","affiliation":[{"name":"Shandong University School of Software, , Jinan 250101 , China"},{"name":"Shandong University Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), , Jinan 250101 , China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xin","family":"Gao","sequence":"additional","affiliation":[{"name":"King Abdullah University of Science and Technology (KAUST) Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), , Thuwal , Saudi Arabia"},{"name":"King Abdullah University of Science and Technology (KAUST) Computational Bioscience Research Center (CBRC), , Thuwal , Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kenta","family":"Nakai","sequence":"additional","affiliation":[{"name":"The University of Tokyo Department of Computational Biology and Medical Sciences, , 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba 277-8562 , Japan"},{"name":"The Institute of Medical Science, The University of Tokyo Human Genome Center, , 4-6-1 Shirokanedai Minato-ku, Tokyo 108-8639 , Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Quan","family":"Zou","sequence":"additional","affiliation":[{"name":"University of Electronic Science and Technology of China Institute of Fundamental and Frontier Sciences, , Chengdu, 610054 , China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Leyi","family":"Wei","sequence":"additional","affiliation":[{"name":"Shandong University School of Software, , Jinan 250101 , China"},{"name":"Shandong University Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), , Jinan 250101 , China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2023,10,19]]},"reference":[{"key":"2023102011013019400_ref1","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","author":"ENCODE Project Consortium","year":"2012","journal-title":"Nature"},{"key":"2023102011013019400_ref2","doi-asserted-by":"crossref","first-page":"720","DOI":"10.2174\/0929866525666180809142326","article-title":"Insights into the noncoding RNA-encoded peptides","volume":"25","author":"Pan","year":"2018","journal-title":"Protein Pept Lett"},{"key":"2023102011013019400_ref3","doi-asserted-by":"crossref","first-page":"3364","DOI":"10.1016\/j.jmb.2020.02.022","article-title":"ncEP: a manually curated database for experimentally validated ncRNA-encoded proteins or peptides","volume":"432","author":"Liu","year":"2020","journal-title":"J Mol Biol"},{"key":"2023102011013019400_ref4","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1016\/j.molcel.2008.09.027","article-title":"A ncRNA modulates histone modification and mRNA induction in the yeast GAL gene cluster","volume":"32","author":"Houseley","year":"2008","journal-title":"Mol Cell"},{"key":"2023102011013019400_ref5","doi-asserted-by":"crossref","first-page":"1401","DOI":"10.1016\/j.cell.2007.04.040","article-title":"A mammalian microRNA expression atlas based on small RNA library sequencing","volume":"129","author":"Landgraf","year":"2007","journal-title":"Cell"},{"key":"2023102011013019400_ref6","doi-asserted-by":"crossref","first-page":"180","DOI":"10.1080\/10590501.2019.1639481","article-title":"Regulation of cytochrome P450 expression by microRNAs and long noncoding RNAs: epigenetic mechanisms in environmental toxicology and carcinogenesis","volume":"37","author":"Li","year":"2019","journal-title":"J Environ Sci Health C"},{"key":"2023102011013019400_ref7","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1093\/carcin\/bgy143","article-title":"Long non-coding RNA LOC284454 promotes migration and invasion of nasopharyngeal carcinoma via modulating the rho\/Rac signaling pathway","volume":"40","author":"Fan","year":"2019","journal-title":"Carcinogenesis"},{"key":"2023102011013019400_ref8","doi-asserted-by":"crossref","first-page":"582","DOI":"10.5732\/cjc.013.10170","article-title":"Noncoding RNAs in cancer and cancer stem cells","volume":"32","author":"Huang","year":"2013","journal-title":"Chin J Cancer"},{"key":"2023102011013019400_ref9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13045-019-0748-z","article-title":"Noncoding RNAs in cancer therapy resistance and targeted drug development","volume":"12","author":"Wang","year":"2019","journal-title":"J Hematol Oncol"},{"key":"2023102011013019400_ref10","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1038\/nature14346","article-title":"Primary transcripts of microRNAs encode regulatory peptides","volume":"520","author":"Lauressergues","year":"2015","journal-title":"Nature"},{"key":"2023102011013019400_ref11","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1038\/nature21034","article-title":"mTORC1 and muscle regeneration are regulated by the LINC00961-encoded SPAR polypeptide","volume":"541","author":"Matsumoto","year":"2017","journal-title":"Nature"},{"key":"2023102011013019400_ref12","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nchembio.1120","article-title":"Peptidomic discovery of short open reading frame\u2013encoded peptides in human cells","volume":"9","author":"Slavoff","year":"2013","journal-title":"Nat Chem Biol"},{"key":"2023102011013019400_ref13","doi-asserted-by":"crossref","first-page":"1853","DOI":"10.1093\/bib\/bby055","article-title":"The small peptide world in long noncoding RNAs","volume":"20","author":"Choi","year":"2019","journal-title":"Brief Bioinform"},{"key":"2023102011013019400_ref14","doi-asserted-by":"crossref","first-page":"1295","DOI":"10.3389\/fphar.2018.01295","article-title":"Peptides\/proteins encoded by non-coding RNA: a novel resource bank for drug targets and biomarkers","volume":"9","author":"Zhu","year":"2018","journal-title":"Front Pharmacol"},{"key":"2023102011013019400_ref15","doi-asserted-by":"crossref","first-page":"E10702","DOI":"10.1073\/pnas.1810653115","article-title":"Isolation and characterization of NY-ESO-1\u2013specific T cell receptors restricted on various MHC molecules","volume":"115","author":"Bethune","year":"2018","journal-title":"Proc Natl Acad Sci"},{"key":"2023102011013019400_ref16","doi-asserted-by":"crossref","first-page":"2180","DOI":"10.1111\/cas.14034","article-title":"Circ MAN 1A2 could serve as a novel serum biomarker for malignant tumors","volume":"110","author":"Fan","year":"2019","journal-title":"Cancer Sci"},{"key":"2023102011013019400_ref17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-019-48774-1","article-title":"Harnessing the tissue and plasma lncRNA-peptidome to discover peptide-based cancer biomarkers","volume":"9","author":"Chakraborty","year":"2019","journal-title":"Sci Rep"},{"key":"2023102011013019400_ref18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12943-019-1010-6","article-title":"A novel protein encoded by a circular RNA circPPP1R12A promotes tumor pathogenesis and metastasis of colon cancer via hippo-YAP signaling","volume":"18","author":"Zheng","year":"2019","journal-title":"Mol Cancer"},{"key":"2023102011013019400_ref19","doi-asserted-by":"crossref","first-page":"4750","DOI":"10.1038\/s41388-018-0281-5","article-title":"The cancer-associated microprotein CASIMO1 controls cell proliferation and interacts with squalene epoxidase modulating lipid droplet formation","volume":"37","author":"Polycarpou-Schwarz","year":"2018","journal-title":"Oncogene"},{"key":"2023102011013019400_ref20","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1093\/jnci\/djx166","article-title":"Novel role of FBXW7 circular RNA in repressing glioma tumorigenesis","volume":"110","author":"Yang","year":"2018","journal-title":"J Natl Cancer Inst"},{"key":"2023102011013019400_ref21","doi-asserted-by":"crossref","first-page":"1805","DOI":"10.1038\/s41388-017-0019-9","article-title":"A novel protein encoded by the circular form of the SHPRH gene suppresses glioma tumorigenesis","volume":"37","author":"Zhang","year":"2018","journal-title":"Oncogene"},{"key":"2023102011013019400_ref22","doi-asserted-by":"crossref","first-page":"2342","DOI":"10.7150\/jca.30454","article-title":"Proteomic analysis of the molecular mechanism of lovastatin inhibiting the growth of nasopharyngeal carcinoma cells","volume":"10","author":"Mo","year":"2019","journal-title":"J Cancer"},{"key":"2023102011013019400_ref23","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1093\/bioinformatics\/btp688","article-title":"sORF finder: a program package to identify small open reading frames with high coding potential","volume":"26","author":"Hanada","year":"2010","journal-title":"Bioinformatics"},{"key":"2023102011013019400_ref24","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-019-3033-9","article-title":"MiPepid: MicroPeptide identification tool using machine learning","volume":"20","author":"Zhu","year":"2019","journal-title":"BMC Bioinform"},{"key":"2023102011013019400_ref25","doi-asserted-by":"crossref","first-page":"bbab499","DOI":"10.1093\/bib\/bbab499","article-title":"Accelerating bioactive peptide discovery via mutual information-based meta-learning","volume":"23","author":"He","year":"2022","journal-title":"Brief Bioinform"},{"key":"2023102011013019400_ref26","doi-asserted-by":"crossref","first-page":"4739","DOI":"10.1093\/bioinformatics\/btz260","article-title":"Graph-based data integration from bioactive peptide databases of pharmaceutical interest: toward an organized collection enabling visual network analysis","volume":"35","author":"Aguilera-Mendoza","year":"2019","journal-title":"Bioinformatics"},{"key":"2023102011013019400_ref27","doi-asserted-by":"crossref","first-page":"5978","DOI":"10.3390\/ijms20235978","article-title":"BIOPEP-UWM database of bioactive peptides: current opportunities","volume":"20","author":"Minkiewicz","year":"2019","journal-title":"Int J Mol Sci"},{"key":"2023102011013019400_ref28","doi-asserted-by":"crossref","first-page":"D1373","DOI":"10.1093\/nar\/gkab822","article-title":"SPENCER: a comprehensive database for small peptides encoded by noncoding RNAs in cancer patients","volume":"50","author":"Luo","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2023102011013019400_ref29","doi-asserted-by":"crossref","DOI":"10.3115\/v1\/D14-1181","article-title":"Convolutional neural networks for sentence classification","volume-title":"Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing","author":"Kim","year":"2014"},{"key":"2023102011013019400_ref30","article-title":"An introduction to convolutional neural networks","author":"O'Shea"},{"key":"2023102011013019400_ref31","first-page":"18661","article-title":"Supervised contrastive learning","volume":"33","author":"Khosla","year":"2020","journal-title":"Adv Neural Inform Process Systems"},{"key":"2023102011013019400_ref32","article-title":"Generalized cross entropy loss for training deep neural networks with noisy labels","volume":"31","author":"Zhang","journal-title":"Advances in neural information processing systems"},{"key":"2023102011013019400_ref33","article-title":"Training convolutional networks with noisy labels","author":"Sukhbaatar"},{"key":"2023102011013019400_ref34","article-title":"Large margin deep networks for classification","volume":"31","author":"Elsayed","journal-title":"Advances in neural information processing systems"},{"key":"2023102011013019400_ref35","article-title":"Large-margin softmax loss for convolutional neural networks","author":"Liu"},{"key":"2023102011013019400_ref36","first-page":"776","volume-title":"European Conference on Computer Vision","author":"Tian","year":"2020"},{"key":"2023102011013019400_ref37","first-page":"1597","volume-title":"International Conference on Machine Learning","author":"Chen","year":"2020"},{"key":"2023102011013019400_ref38","article-title":"Adam: a method for stochastic optimization","author":"Kingma"},{"key":"2023102011013019400_ref39","article-title":"Decoupled weight decay regularization","author":"Loshchilov"},{"key":"2023102011013019400_ref40","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1002\/wics.101","article-title":"Principal component analysis","volume":"2","author":"Abdi","year":"2010","journal-title":"Wiley interdisciplinary reviews: computational statistics"},{"key":"2023102011013019400_ref41","first-page":"11","article-title":"Visualizing data using t-SNE","volume":"9","author":"Van der Maaten","year":"2008","journal-title":"J Mach Learn Res"},{"key":"2023102011013019400_ref42","first-page":"1","volume-title":"Pearson Correlation Coefficient. Noise Reduction in Speech Processing","author":"Benesty","year":"2009"},{"key":"2023102011013019400_ref43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1056\/NEJM199001043220101","article-title":"Leukemia following chemotherapy for ovarian cancer","volume":"322","author":"Kaldor","year":"1990","journal-title":"N Engl J Med"},{"key":"2023102011013019400_ref44","doi-asserted-by":"crossref","first-page":"1422","DOI":"10.1093\/jnci\/84.18.1422","article-title":"Second cancers in patients with chronic lymphocytic leukemia","volume":"84","author":"Travis","year":"1992","journal-title":"J Natnl Cancer Inst"},{"key":"2023102011013019400_ref45","doi-asserted-by":"crossref","first-page":"2834","DOI":"10.1093\/bioinformatics\/btab203","article-title":"STREME: accurate and versatile sequence motif discovery","volume":"37","author":"Bailey","year":"2021","journal-title":"Bioinformatics"},{"key":"2023102011013019400_ref46","first-page":"37358","volume-title":"International Conference on Machine Learning","author":"Wu","year":"2023"},{"key":"2023102011013019400_ref47","article-title":"GAME: GAussian mixture error-based meta-learning architecture","volume":"35","author":"Dong","journal-title":"Neural Comput Appl"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/6\/bbad352\/52268620\/bbad352.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/6\/bbad352\/52268620\/bbad352.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,20]],"date-time":"2023-10-20T11:02:11Z","timestamp":1697799731000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbad352\/7323684"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,22]]},"references-count":47,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2023,9,22]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbad352","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,11,1]]},"published":{"date-parts":[[2023,9,22]]},"article-number":"bbad352"}}