{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T02:05:31Z","timestamp":1773885931959,"version":"3.50.1"},"reference-count":36,"publisher":"Public Library of Science (PLoS)","issue":"12","license":[{"start":{"date-parts":[[2020,12,9]],"date-time":"2020-12-09T00:00:00Z","timestamp":1607472000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>The diversity of T-cell receptor (TCR) repertoires is achieved by a combination of two intrinsically stochastic steps: random receptor generation by VDJ recombination, and selection based on the recognition of random self-peptides presented on the major histocompatibility complex. These processes lead to a large receptor variability within and between individuals. However, the characterization of the variability is hampered by the limited size of the sampled repertoires. We introduce a new software tool SONIA to facilitate inference of individual-specific computational models for the generation and selection of the TCR beta chain (TRB) from sequenced repertoires of 651 individuals, separating and quantifying the variability of the two processes of generation and selection in the population. We find not only that most of the variability is driven by the VDJ generation process, but there is a large degree of consistency between individuals with the inter-individual variance of repertoires being about \u223c2% of the intra-individual variance. Known viral-specific TCRs follow the same generation and selection statistics as all TCRs.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1008394","type":"journal-article","created":{"date-parts":[[2020,12,9]],"date-time":"2020-12-09T17:53:13Z","timestamp":1607536393000},"page":"e1008394","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":53,"title":["Population variability in the generation and selection of T-cell repertoires"],"prefix":"10.1371","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8859-1084","authenticated-orcid":true,"given":"Zachary","family":"Sethna","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2040-1058","authenticated-orcid":true,"given":"Giulio","family":"Isacchini","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1803-1617","authenticated-orcid":true,"given":"Thomas","family":"Dupic","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5456-9361","authenticated-orcid":true,"given":"Thierry","family":"Mora","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2686-5702","authenticated-orcid":true,"given":"Aleksandra M.","family":"Walczak","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4354-8864","authenticated-orcid":true,"given":"Yuval","family":"Elhanati","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2020,12,9]]},"reference":[{"key":"pcbi.1008394.ref001","volume-title":"Immunobiology: The Immune System in Health and Disease","author":"CA Janeway","year":"2001"},{"issue":"10","key":"pcbi.1008394.ref002","doi-asserted-by":"crossref","first-page":"3628","DOI":"10.1073\/pnas.73.10.3628","article-title":"Evidence for somatic rearrangement of immunoglobulin genes coding for variable and constant regions","volume":"73","author":"N Hozumi","year":"1976","journal-title":"Proc Natl Acad Sci"},{"issue":"49","key":"pcbi.1008394.ref003","doi-asserted-by":"crossref","first-page":"18691","DOI":"10.1073\/pnas.0608907103","article-title":"Sharing of T cell receptors in antigen-specific responses is driven by convergent recombination","volume":"103","author":"V Venturi","year":"2006","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"40","key":"pcbi.1008394.ref004","doi-asserted-by":"crossref","first-page":"16161","DOI":"10.1073\/pnas.1212755109","article-title":"Statistical inference of the generation probability of T-cell receptors from sequence repertoires","volume":"109","author":"A Murugan","year":"2012","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"1","key":"pcbi.1008394.ref005","doi-asserted-by":"crossref","first-page":"571","DOI":"10.1146\/annurev.immunol.23.021704.115601","article-title":"A CENTRAL ROLE FOR CENTRAL TOLERANCE","volume":"24","author":"B Kyewski","year":"2006","journal-title":"Annual Review of Immunology"},{"issue":"1","key":"pcbi.1008394.ref006","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1146\/annurev.immunol.21.120601.141107","article-title":"Positive and Negative Selection of T Cells","volume":"21","author":"TK Starr","year":"2003","journal-title":"Annual Review of Immunology"},{"issue":"7","key":"pcbi.1008394.ref007","doi-asserted-by":"crossref","first-page":"735","DOI":"10.1007\/s00109-014-1145-2","article-title":"Molecules in medicine mini review: the \u03b1\u03b2 T cell receptor","volume":"92","author":"ET Clambey","year":"2014","journal-title":"Journal of Molecular Medicine"},{"issue":"4","key":"pcbi.1008394.ref008","first-page":"554","article-title":"High-throughput sequencing of the T-cell receptor repertoire: pitfalls and opportunities","volume":"19","author":"JM Heather","year":"2017","journal-title":"Briefings in Bioinformatics"},{"key":"pcbi.1008394.ref009","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1016\/j.coisb.2016.12.009","article-title":"Advances and applications of immune receptor sequencing in systems immunology","volume":"1","author":"P Lindau","year":"2017","journal-title":"Current Opinion in Systems Biology"},{"issue":"413","key":"pcbi.1008394.ref010","first-page":"413","article-title":"The past, present and future of immune repertoire biology\u2014the rise of next-generation repertoire analysis","volume":"4","author":"A Six","year":"2013","journal-title":"Front Immunol"},{"issue":"10","key":"pcbi.1008394.ref011","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1186\/gm502","article-title":"Sequence analysis of T-cell repertoires in health and disease","volume":"5","author":"DJ Woodsworth","year":"2013","journal-title":"Genome Med"},{"key":"pcbi.1008394.ref012","doi-asserted-by":"crossref","first-page":"e38358","DOI":"10.7554\/eLife.38358","article-title":"Human T cell receptor occurrence patterns encode immune history, genetic background, and receptor specificity","volume":"7","author":"S DeWitt I William","year":"2018","journal-title":"eLife"},{"issue":"1","key":"pcbi.1008394.ref013","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1111\/imr.12665","article-title":"Predicting the spectrum of TCR repertoire sharing with a data-driven model of recombination","volume":"284","author":"Y Elhanati","year":"2018","journal-title":"Immunological Reviews"},{"issue":"5","key":"pcbi.1008394.ref014","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1038\/ng.3822","article-title":"Immunosequencing identifies signatures of cytomegalovirus exposure history and HLA-mediated effects on the T cell repertoire","volume":"49","author":"RO Emerson","year":"2017","journal-title":"Nature Genetics"},{"issue":"1","key":"pcbi.1008394.ref015","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1038\/s41467-018-02832-w","article-title":"High-throughput immune repertoire analysis with IGoR","volume":"9","author":"Q Marcou","year":"2018","journal-title":"Nature Communications"},{"issue":"17","key":"pcbi.1008394.ref016","doi-asserted-by":"crossref","first-page":"2974","DOI":"10.1093\/bioinformatics\/btz035","article-title":"OLGA: fast computation of generation probabilities of B- and T-cell receptor amino acid sequences and motifs","volume":"35","author":"Z Sethna","year":"2019","journal-title":"Bioinformatics"},{"issue":"27","key":"pcbi.1008394.ref017","doi-asserted-by":"crossref","first-page":"9875","DOI":"10.1073\/pnas.1409572111","article-title":"Quantifying selection in immune receptor repertoires","volume":"111","author":"Y Elhanati","year":"2014","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"22","key":"pcbi.1008394.ref018","doi-asserted-by":"crossref","first-page":"11011","DOI":"10.1073\/pnas.89.22.11011","article-title":"Regulation of N-region diversity in antigen receptors through thymocyte differentiation and thymus ontogeny","volume":"89","author":"M Bogue","year":"1992","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"13","key":"pcbi.1008394.ref019","first-page":"1","article-title":"VDJdb in 2019: database extension, new analysis infrastructure and a T-cell receptor motif compendium","author":"DV Bagaev","year":"2019","journal-title":"Nucleic Acids Research"},{"issue":"D","key":"pcbi.1008394.ref020","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1146\/annurev-genet-110410-132552","article-title":"V(D)J recombination: mechanisms of initiation","volume":"45","author":"DG Schatz","year":"2011","journal-title":"Annual review of genetics"},{"issue":"9","key":"pcbi.1008394.ref021","first-page":"1","article-title":"Estimating Copy Number and Allelic Variation at the Immunoglobulin Heavy Chain Locus Using Short Reads","volume":"12","author":"S Luo","year":"2016","journal-title":"PLoS Computational Biology"},{"issue":"1","key":"pcbi.1008394.ref022","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1146\/annurev.biochem.052308.093131","article-title":"The Mechanism of Double-Strand DNA Break Repair by the Nonhomologous DNA End-Joining Pathway","volume":"79","author":"MR Lieber","year":"2010","journal-title":"Annual review of biochemistry"},{"key":"pcbi.1008394.ref023","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/j.coisb.2019.10.007","article-title":"Selected before selection: A case for inherent antigen bias in the T cell receptor repertoire","volume":"18","author":"PG Thomas","year":"2019","journal-title":"Current Opinion in Systems Biology"},{"issue":"22","key":"pcbi.1008394.ref024","doi-asserted-by":"crossref","first-page":"3181","DOI":"10.1093\/bioinformatics\/btu523","article-title":"Tracking global changes induced in the CD4 T-cell receptor repertoire by immunization with a complex antigen using short stretches of CDR3 protein sequence","volume":"30","author":"N Thomas","year":"2014","journal-title":"Bioinformatics"},{"issue":"50","key":"pcbi.1008394.ref025","doi-asserted-by":"crossref","first-page":"12704","DOI":"10.1073\/pnas.1809642115","article-title":"Precise tracking of vaccine-responding T cell clones reveals convergent and personalized response in identical twins","volume":"115","author":"MV Pogorelyy","year":"2018","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1008394.ref026","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1182\/blood-2006-11-056168","article-title":"Activation-induced expression of CD137 permits detection, isolation, and expansion of the full repertoire of CD8+ T cells responding to antigen without requiring knowledge of epitope specificities","volume":"110","author":"M Wolfl","year":"2007","journal-title":"Blood"},{"issue":"5284","key":"pcbi.1008394.ref027","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1126\/science.274.5284.94","article-title":"Phenotypic Analysis of Antigen-Specific T Lymphocytes","volume":"274","author":"JD Altman","year":"1996","journal-title":"Science"},{"key":"pcbi.1008394.ref028","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1038\/nature22976","article-title":"Identifying specificity groups in the T cell receptor repertoire","author":"J Glanville","year":"2017","journal-title":"Nature"},{"issue":"7661","key":"pcbi.1008394.ref029","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1038\/nature22383","article-title":"Quantifiable predictive features define epitope-specific T cell receptor repertoires","volume":"547","author":"P Dash","year":"2017","journal-title":"Nature"},{"issue":"9","key":"pcbi.1008394.ref030","doi-asserted-by":"crossref","first-page":"734","DOI":"10.1158\/2326-6066.CIR-16-0001","article-title":"Tumor- and Neoantigen-Reactive T-cell Receptors Can Be Identified Based on Their Frequency in Fresh Tumor","volume":"4","author":"A Pasetto","year":"2016","journal-title":"Cancer Immunol Res"},{"issue":"5","key":"pcbi.1008394.ref031","first-page":"433706","article-title":"NetTCR: sequence-based prediction of TCR binding to peptide-MHC complexes using convolutional neural networks","author":"VI Jurtz","year":"2018","journal-title":"bioRxiv"},{"key":"pcbi.1008394.ref032","first-page":"464107","article-title":"DeepTCR: a deep learning framework for revealing structural concepts within TCR Repertoire","author":"JW Sidhom","year":"2018","journal-title":"bioRxiv"},{"key":"pcbi.1008394.ref033","first-page":"650861","article-title":"Prediction of specific TCR-peptide binding from large dictionaries of TCR-peptide pairs","author":"I Springer","year":"2019","journal-title":"bioRxiv"},{"issue":"2017","key":"pcbi.1008394.ref034","first-page":"4","author":"E Jokinen","year":"2019","journal-title":"TCRGP: Determining epitope specificity of T cell receptors"},{"key":"pcbi.1008394.ref035","unstructured":"Chollet F, et al. Keras; 2015. https:\/\/keras.io."},{"key":"pcbi.1008394.ref036","unstructured":"Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems; 2015. Available from: https:\/\/www.tensorflow.org\/."}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1008394","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,12,9]],"date-time":"2020-12-09T17:53:45Z","timestamp":1607536425000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1008394"}},"subtitle":[],"editor":[{"given":"Benny","family":"Chain","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,12,9]]},"references-count":36,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2020,12,9]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1008394","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.01.08.899682","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12,9]]}}}