{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T22:17:43Z","timestamp":1777069063189,"version":"3.51.4"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2023,3,1]],"date-time":"2023-03-01T00:00:00Z","timestamp":1677628800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100007567","name":"City University of Hong Kong","doi-asserted-by":"publisher","award":["7005215"],"award-info":[{"award-number":["7005215"]}],"id":[{"id":"10.13039\/100007567","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,3,19]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>The adaptive immune response to foreign antigens is initiated by T-cell receptor (TCR) recognition on the antigens. Recent experimental advances have enabled the generation of a large amount of TCR data and their cognate antigenic targets, allowing machine learning models to predict the binding specificity of TCRs. In this work, we present TEINet, a deep learning framework that utilizes transfer learning to address this prediction problem. TEINet employs two separately pretrained encoders to transform TCR and epitope sequences into numerical vectors, which are subsequently fed into a fully connected neural network to predict their binding specificities. A major challenge for binding specificity prediction is the lack of a unified approach to sampling negative data. Here, we first assess the current negative sampling approaches comprehensively and suggest that the Unified Epitope is the most suitable one. Subsequently, we compare TEINet with three baseline methods and observe that TEINet achieves an average AUROC of 0.760, which outperforms baseline methods by 6.4\u201326%. Furthermore, we investigate the impacts of the pretraining step and notice that excessive pretraining may lower its transferability to the final prediction task. Our results and analysis show that TEINet can make an accurate prediction using only the TCR sequence (CDR3$\\beta $) and the epitope sequence, providing novel insights to understand the interactions between TCRs and epitopes.<\/jats:p>","DOI":"10.1093\/bib\/bbad086","type":"journal-article","created":{"date-parts":[[2023,2,15]],"date-time":"2023-02-15T10:12:58Z","timestamp":1676455978000},"source":"Crossref","is-referenced-by-count":72,"title":["TEINet: a deep learning framework for prediction of TCR\u2013epitope binding specificity"],"prefix":"10.1093","volume":"24","author":[{"given":"Yuepeng","family":"Jiang","sequence":"first","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong"}]},{"given":"Miaozhe","family":"Huo","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong"}]},{"given":"Shuai","family":"Cheng Li","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong"}]}],"member":"286","published-online":{"date-parts":[[2023,3,11]]},"reference":[{"issue":"1675","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"20140291","DOI":"10.1098\/rstb.2014.0291","article-title":"Estimating t-cell repertoire diversity: limitations of classical estimators and a new approach","volume":"370","author":"Laydon","year":"2015","journal-title":"Philos Trans R Soc B: Biol Sci"},{"issue":"12","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"1156","DOI":"10.1038\/nbt.4282","article-title":"High-throughput determination of the antigen specificities of t cell receptors in single cells","volume":"36","author":"Zhang","year":"2018","journal-title":"Nat Biotechnol"},{"issue":"5284","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1126\/science.274.5284.94","article-title":"Phenotypic analysis of antigen-specific t lymphocytes","volume":"274","author":"Altman","year":"1996","journal-title":"Science"},{"issue":"4","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"1016","DOI":"10.1016\/j.cell.2019.07.009","article-title":"T-scan: a genome-wide method for the systematic discovery of t cell epitopes","volume":"178","author":"Kula","year":"2019","journal-title":"Cell"},{"issue":"D1","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"D419","DOI":"10.1093\/nar\/gkx760","article-title":"Vdjdb: a curated database of t-cell receptor sequences with known antigen specificity","volume":"46","author":"Shugay","year":"2018","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"D339","DOI":"10.1093\/nar\/gky1006","article-title":"The immune epitope database (iedb): 2018 update","volume":"47","author":"Vita","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"18","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"2924","DOI":"10.1093\/bioinformatics\/btx286","article-title":"Mcpas-tcr: a manually curated catalogue of pathology-associated t cell receptor sequences","volume":"33","author":"Tickotsky","year":"2017","journal-title":"Bioinformatics"},{"key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"640725","DOI":"10.3389\/fimmu.2021.640725","article-title":"Tcrmatch: predicting t-cell receptor specificity based on sequence similarity to previously characterized receptors","volume":"12","author":"Chronister","year":"2021","journal-title":"Front Immunol"},{"key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"2820","DOI":"10.3389\/fimmu.2019.02820","article-title":"Detection of enriched t cell epitope specificity in full t cell receptor sequence repertoires","volume":"10","author":"Gielis","year":"2019","journal-title":"Front Immunol"},{"issue":"3","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"e1008814","DOI":"10.1371\/journal.pcbi.1008814","article-title":"Predicting recognition between t cell receptors and epitopes with tcrgp","volume":"17","author":"Jokinen","year":"2021","journal-title":"PLoS Comput Biol"},{"issue":"4","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"bbaa318","DOI":"10.1093\/bib\/bbaa318","article-title":"Current challenges for unseen-epitope tcr interaction prediction and a new perspective derived from image classification","volume":"22","author":"Moris","year":"2021","journal-title":"Brief Bioinform"},{"issue":"Supplement_1","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"i237","DOI":"10.1093\/bioinformatics\/btab294","article-title":"Titan: T-cell receptor specificity prediction with bimodal attention networks","volume":"37","author":"Weber","year":"2021","journal-title":"Bioinformatics"},{"key":"2023032004561312600_","article-title":"A framework for highly multiplexed dextramer mapping and prediction of t cell receptor sequences to antigen specificity","volume":"7","author":"Zhang","year":"2021","journal-title":"Sci Adv"},{"issue":"10","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"864","DOI":"10.1038\/s42256-021-00383-2","article-title":"Deep learning-based prediction of the t cell receptor\u2013antigen binding specificity. Nature","volume":"3","author":"Tianshi","year":"2021","journal-title":"Mach Intell"},{"issue":"1","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s42003-021-02610-3","article-title":"Nettcr-2.0 enables accurate prediction of tcr-peptide binding by using paired tcr$\\alpha $ and $\\beta $ sequence data","volume":"4","author":"Montemurro","year":"2021","journal-title":"Commun Biol"},{"key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"1803","DOI":"10.3389\/fimmu.2020.01803","article-title":"Prediction of specific tcr-peptide binding from large dictionaries of tcr-peptide pairs","author":"Springer","year":"2020","journal-title":"Front Immunol"},{"issue":"7661","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1038\/nature22383","article-title":"Quantifiable predictive features define epitope-specific t cell receptor repertoires","volume":"547","author":"Dash","year":"2017","journal-title":"Nature"},{"key":"2023032004561312600_","first-page":"433706","article-title":"Nettcr: sequence-based prediction of tcr binding to peptide-mhc complexes using convolutional neural networks","author":"Jurtz","year":"2018","journal-title":"BioRxiv"},{"key":"2023032004561312600_","article-title":"Attention-aware contrastive learning for predicting t cell receptor-antigen binding specificity","author":"Fang","year":"2022","journal-title":"bioRxiv"},{"key":"2023032004561312600_","doi-asserted-by":"crossref","article-title":"Tcr-epitope binding affinity prediction using multi-head self attention model","author":"Cai","DOI":"10.3389\/fimmu.2022.893247"},{"issue":"2","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1159\/000445656","article-title":"Analysis of the repertoire features of tcr beta chain cdr3 in human by high-throughput sequencing","volume":"39","author":"Hou","year":"2016","journal-title":"Cell Physiol Biochem"},{"key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"2080","DOI":"10.3389\/fimmu.2019.02080","article-title":"T-cell receptor cognate target prediction based on paired $\\alpha $ and $\\beta $ chain sequence and structural cdr loop similarities","volume":"10","author":"Lanzarotti","year":"2019","journal-title":"Front Immunol"},{"key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"664514","DOI":"10.3389\/fimmu.2021.664514","article-title":"Contribution of t cell receptor alpha and beta cdr3, mhc typing, v and j genes to peptide binding prediction","volume":"12","author":"Springer","year":"2021","journal-title":"Front Immunol"},{"issue":"4","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1007\/s10994-020-05877-5","article-title":"Learning from positive and unlabeled data: a survey","volume":"109","author":"Bekker","year":"2020","journal-title":"Mach Learn"},{"key":"2023032004561312600_","article-title":"Revisiting negative sampling vs. non-sampling in implicit recommendation","author":"Chen","journal-title":"ACM Trans Inf Syst"},{"key":"2023032004561312600_","article-title":"Deep autoregressive generative models capture the intrinsics embedded in t-cell receptor repertoires","author":"Jiang","year":"2022","journal-title":"bioRxiv"},{"issue":"5","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1038\/ng.3822","article-title":"Immunosequencing identifies signatures of cytomegalovirus exposure history and hla-mediated effects on the t cell repertoire","volume":"49","author":"Emerson","year":"2017","journal-title":"Nat Genet"},{"issue":"5","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"bbaa415","DOI":"10.1093\/bib\/bbaa415","article-title":"Anthem: a user customised tool for fast and accurate prediction of binding between peptides and hla class i molecules","volume":"22","author":"Mei","year":"2021","journal-title":"Brief Bioinform"},{"issue":"1","key":"2023032004561312600_","first-page":"1929","article-title":"Dropout: a simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J Mach Learn Res"},{"key":"2023032004561312600_","article-title":"Layer normalization","author":"Ba","year":"2016","journal-title":"arXiv preprint arXiv:160706450"},{"key":"2023032004561312600_","article-title":"Self-normalizing neural networks","volume":"30","author":"Klambauer","year":"2017","journal-title":"Adv Neural InfProcess Syst"},{"key":"2023032004561312600_","article-title":"Pytorch: an imperative style, high-performance deep learning library","volume":"32","author":"Paszke","year":"2019","journal-title":"Adv Neural Inf Process Syst"},{"key":"2023032004561312600_","article-title":"Adam: a method for stochastic optimization","author":"Kingma","year":"2014","journal-title":"arXiv preprint arXiv:14126980"},{"key":"2023032004561312600_","article-title":"Interpretable deep learning to uncover the molecular binding patterns determining tcr\u2013epitope interactions","author":"Dens","year":"2022","journal-title":"bioRxiv"},{"issue":"1","key":"2023032004561312600_","first-page":"1","article-title":"Deeptcr is a deep learning framework for revealing sequence concepts within t-cell repertoires","volume":"12","author":"John-William Sidhom","year":"2021","journal-title":"Nat Commun"},{"issue":"6","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"1078","DOI":"10.1107\/S0907444998009378","article-title":"Protein data bank (pdb): database of three-dimensional structural information of biological macromolecules","volume":"54","author":"Sussman","year":"1998","journal-title":"Acta Crystallogr D Biol Crystallogr"},{"key":"2023032004561312600_","first-page":"8950","article-title":"Rapid mapping of protein functional epitopes by combinatorial alanine scanning","volume-title":"Proc Natl Acad Sci","author":"Weiss","year":"2000"},{"issue":"3","key":"2023032004561312600_","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1093\/bioinformatics\/btz614","article-title":"Pird: pan immune repertoire database","volume":"36","author":"Zhang","year":"2020","journal-title":"Bioinformatics"},{"key":"2023032004561312600_","first-page":"E1754","article-title":"Tcr contact residue hydrophobicity is a hallmark of immunogenic cd8+ t cell epitopes","volume-title":"Proc Natl Acad Sci","author":"Chowell","year":"2015"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/2\/bbad086\/49560698\/bbad086.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/2\/bbad086\/49560698\/bbad086.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,25]],"date-time":"2023-03-25T06:38:40Z","timestamp":1679726320000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbad086\/7076118"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3]]},"references-count":39,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,3,19]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbad086","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2022.10.20.513029","asserted-by":"object"}]},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,3]]},"published":{"date-parts":[[2023,3]]},"article-number":"bbad086"}}