{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T05:09:16Z","timestamp":1777439356042,"version":"3.51.4"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The development of epitope-based vaccines crucially relies on the ability to classify Human Leukocyte Antigen (HLA) molecules into sets that have similar peptide binding specificities, termed supertypes. In their seminal work, Sette and Sidney defined nine HLA class I supertypes and claimed that these provide an almost perfect coverage of the entire repertoire of HLA class I molecules.<\/jats:p><jats:p>HLA alleles are highly polymorphic and polygenic and therefore experimentally classifying each of these molecules to supertypes is at present an impossible task. Recently, a number of computational methods have been proposed for this task. These methods are based on defining protein similarity measures, derived from analysis of binding peptides or from analysis of the proteins themselves.<\/jats:p><jats:p>Results: In this paper we define both peptide derived and protein derived similarity measures, which are based on learning distance functions. The peptide derived measure is defined using a peptide\u2013peptide distance function, which is learned using information about known binding and non-binding peptides. The protein derived similarity measure is defined using a protein\u2013protein distance function, which is learned using information about alleles previously classified to supertypes by Sette and Sidney (1999). We compare the classification obtained by these two complementary methods to previously suggested classification methods. 
In general, our results are in excellent agreement with the classifications suggested by Sette and Sidney (1999) and with those reported by Buus et al. (2004).<\/jats:p><jats:p>The main advantage of our proposed distance-based approach is that it makes use of two different and important immunological sources of information\u2014HLA alleles and peptides that are known to bind or not bind to these alleles. Since each of our distance measures is trained using a different source of information, their combination can provide a more confident classification of alleles to supertypes.<\/jats:p><jats:p>Contact: tomboy@cs.huji.ac.il; cheny@cs.huji.ac.il<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl324","type":"journal-article","created":{"date-parts":[[2007,1,19]],"date-time":"2007-01-19T18:51:12Z","timestamp":1169232672000},"page":"e148-e155","source":"Crossref","is-referenced-by-count":41,"title":["Identifying HLA supertypes by learning distance functions"],"prefix":"10.1093","volume":"23","author":[{"given":"Tomer","family":"Hertz","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Israel"},{"name":"Interdisciplinary Center for Neural Computation, The Hebrew University of Jerusalem, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chen","family":"Yanover","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2007,1,15]]},"reference":[{"key":"2023041107132920700_","article-title":"Learning distance functions using equivalence relations","author":"Bar-Hilel","year":"2003"},{"key":"2023041107132920700_","article-title":"Integrating constraints and metric learning in semisupervised 
clustering","author":"Bilenko","year":"2004"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1093\/nar\/26.1.368","article-title":"MHCPEP, a database of MHC-binding peptides: update 1997","volume":"26","author":"Brusic","year":"1998","journal-title":"Nucleic Acids Res."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1034\/j.1399-0039.2003.00112.x","article-title":"Sensitive quantitative predictions of peptide-MHC binding by a \u2018query by committee\u2019 artificial neural network approach","volume":"62","author":"Buus","year":"2003","journal-title":"Tissue Antigens"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"685","DOI":"10.4049\/jimmunol.154.2.685","article-title":"Binding of a peptide antigen to multiple HLA alleles allows definition of an A2-like supertype","volume":"154","author":"del Guercio","year":"1995","journal-title":"J. Immunol."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-3-25","article-title":"Prediction of MHC class I binding","volume":"3","author":"Donnes","year":"2002","journal-title":"BMC Bioinformatics"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"4314","DOI":"10.4049\/jimmunol.172.7.4314","article-title":"Identifiying human MHC supertypes using bioinformatic methods","volume":"172","author":"Doytchinova","year":"2004","journal-title":"J. 
Immunol."},{"key":"2023041107132920700_","article-title":"PHYLIP (phylogeny inference package) version 3.5c","author":"Felsenstein","year":"1993"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","DOI":"10.1016\/j.it.2003.10.006","article-title":"Towards in silico prediction of immunogenic epitopes","volume":"24","author":"Flower","year":"2003","journal-title":"Trends Immunol."},{"key":"2023041107132920700_","first-page":"296","article-title":"Leveraging information across HLA alleles\/supertypes improves epitope prediction","author":"Heckerman","year":"2006","journal-title":"RECOMB"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-7-S1-S3","article-title":"Pepdist: a new framework for protein-peptide binding prediction based on learning peptide distance functions","volume":"7","author":"Hertz","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","DOI":"10.1145\/1015330.1015389","article-title":"Boosting margin based distance functions for clustering","author":"Hertz","year":"2004"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","DOI":"10.1109\/CVPR.2004.1315215","article-title":"Learning distance functions for image retrieval","author":"Hertz","year":"2004"},{"key":"2023041107132920700_","volume-title":"Immunobiology","author":"Janeway","year":"2001","edition":"5th edn"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1007\/s00251-004-0647-4","article-title":"Definition of supertypes for HLA molecules using clustering of specificity matrices","volume":"55","author":"Lund","year":"2004","journal-title":"Immunogenetics"},{"key":"2023041107132920700_","first-page":"357","article-title":"Treeview: an application to display phylogenetic trees on personal computers","volume":"12","author":"Page","year":"1996","journal-title":"Comp. Appl. 
Biosci."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/s002510050595","article-title":"SYFPEITHI: database for MHC ligands and peptide motifs","volume":"50","author":"Rammensee","year":"1999","journal-title":"Immunogenetics"},{"key":"2023041107132920700_","first-page":"189","article-title":"Definition of MHC supertypes through clustering of MHC peptide binding repertoires","author":"Reche","year":"2004"},{"key":"2023041107132920700_","first-page":"405","article-title":"Enhancement to the RANKPEP resource for the prediction of peptide binding to MHC molecules using profiles","volume":"26","author":"Reche","year":"2004","journal-title":"Immunogenetics"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1093\/nar\/gkg070","article-title":"IMGT\/HLA and IMGT\/MHC: sequence databases for the study of the major histocompatibility complex","volume":"31","author":"Robinson","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1023\/A:1007614523901","article-title":"Improved boosting using confidence-rated predictions","volume":"37","author":"Schapire","year":"1999","journal-title":"Mach. Learn."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1016\/S0952-7915(03)00083-9","article-title":"Epitope-based vaccines: an update on epitope identification, vaccine design and delivery","volume":"15","author":"Sette","year":"2003","journal-title":"Curr. Opin. 
Immunol."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1007\/s002510050594","article-title":"Nine major HLA class I supertypes account for the vast preponderance of HLA-A and -B polymorphism","volume":"50","author":"Sette","year":"1999","journal-title":"Immunogenetics"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1034\/j.1399-0039.2002.590601.x","article-title":"Optimizing vaccine design for cellular processing, MHC binding and TCR recognition","volume":"59","author":"Sette","year":"2002","journal-title":"Tissue Antigens"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1016\/0198-8859(95)00173-5","article-title":"Definition of an HLA-A3-like supermotif demonstrates the overlapping peptide-binding repertoires of common HLA molecules","volume":"45","author":"Sidney","year":"1996","journal-title":"Hum. Immunol."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"3480","DOI":"10.4049\/jimmunol.157.8.3480","article-title":"Specificity and degeneracy in peptide binding to HLA-B7-like class I molecules","volume":"157","author":"Sidney","year":"1996","journal-title":"J. Immunol."},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1007\/s00894-001-0058-5","article-title":"New quantitative descriptors of amino acids based on multidimensional scaling of a large number of physical-chemical properties","volume":"7","author":"Venkatarajan","year":"2001","journal-title":"J. Mol. 
Model."},{"key":"2023041107132920700_","first-page":"1433","article-title":"Supervised graph inference","volume-title":"NIPS 17","author":"Vert","year":"2005"},{"key":"2023041107132920700_","article-title":"Distance metric learning with application to clustering with side-information","author":"Xing","year":"2002"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","DOI":"10.1007\/11415770_34","article-title":"Predicting protein-peptide binding affinity by learning peptide-peptide distance functions","author":"Yanover","year":"2005"},{"key":"2023041107132920700_","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1146\/annurev.immunol.17.1.51","article-title":"Immunodominance in major histocompatibility complex class I-restricted T-lymphocyte responses","volume":"17","author":"Yewdell","year":"1999","journal-title":"Ann. Rev. Immunol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/2\/e148\/49820171\/bioinformatics_23_2_e148.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/2\/e148\/49820171\/bioinformatics_23_2_e148.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,10]],"date-time":"2023-05-10T13:04:04Z","timestamp":1683723844000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/2\/e148\/204023"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,1,15]]},"references-count":31,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2007,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl324","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-
parts":[[2007,1,15]]},"published":{"date-parts":[[2007,1,15]]}}}