{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:42:46Z","timestamp":1753875766933,"version":"3.41.2"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2024,7,1]],"date-time":"2024-07-01T00:00:00Z","timestamp":1719792000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100013748","name":"The Hope Foundation","doi-asserted-by":"publisher","award":["U10CA180819"],"award-info":[{"award-number":["U10CA180819"]}],"id":[{"id":"10.13039\/100013748","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>T cell receptors (TCRs) constitute a major component of our adaptive immune system, governing the recognition and response to internal and external antigens. Studying the TCR diversity via sequencing technology is critical for a deeper understanding of immune dynamics. However, library sizes differ substantially across samples, hindering the accurate estimation\/comparisons of alpha diversities. To address this, researchers frequently use an overall rarefying approach in which all samples are sub-sampled to an even depth. Despite its pervasive application, its efficacy has never been rigorously assessed.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this paper, we develop an innovative \u201cmulti-bin\u201d rarefying approach that partitions samples into multiple bins according to their library sizes, conducts rarefying within each bin for alpha diversity calculations, and performs meta-analysis across bins. Extensive simulations using real-world data highlight the inadequacy of the overall rarefying approach in controlling the confounding effect of library size. Our method proves robust in addressing library size confounding, outperforming competing normalization strategies by achieving better-controlled type-I error rates and enhanced statistical power in association tests.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The code is available at https:\/\/github.com\/mli171\/MultibinAlpha. The datasets are freely available at https:\/\/doi.org\/10.21417\/B7001Z and https:\/\/doi.org\/10.21417\/AR2019NC.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae431","type":"journal-article","created":{"date-parts":[[2024,7,1]],"date-time":"2024-07-01T19:06:42Z","timestamp":1719860802000},"source":"Crossref","is-referenced-by-count":0,"title":["A multi-bin rarefying method for evaluating alpha diversities in TCR sequencing data"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5188-9346","authenticated-orcid":false,"given":"Mo","family":"Li","sequence":"first","affiliation":[{"name":"Department of Mathematics, University of Louisiana at Lafayette , Lafayette, LA, 70504, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xing","family":"Hua","sequence":"additional","affiliation":[{"name":"Public Health Sciences Division, Fred Hutchinson Cancer Center , Seattle, WA, 98109, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shuai","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Johns Hopkins University , Baltimore, MD, 21205, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael C","family":"Wu","sequence":"additional","affiliation":[{"name":"Public Health Sciences Division, Fred Hutchinson Cancer Center , Seattle, WA, 98109, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7762-3949","authenticated-orcid":false,"given":"Ni","family":"Zhao","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Johns Hopkins University , Baltimore, MD, 21205, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2024,7,1]]},"reference":[{"key":"2024071321260116900_btae431-B1","doi-asserted-by":"crossref","first-page":"1446","DOI":"10.1093\/bioinformatics\/bts155","article-title":"Loess correction for length variation in gene set-based genomic sequence analysis","volume":"28","author":"Aboukhalil","year":"2012","journal-title":"Bioinformatics"},{"key":"2024071321260116900_btae431-B2","doi-asserted-by":"crossref","first-page":"1293","DOI":"10.1016\/j.cell.2018.05.060","article-title":"Single-cell map of diverse immune phenotypes in the breast tumor microenvironment","volume":"174","author":"Azizi","year":"2018","journal-title":"Cell"},{"key":"2024071321260116900_btae431-B3","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1158\/2326-6066.CIR-20-0398","article-title":"Improved t-cell receptor diversity estimates associate with survival and response to anti\u2013pd-1 therapy","volume":"9","author":"Bortone","year":"2021","journal-title":"Cancer Immunol Res"},{"key":"2024071321260116900_btae431-B4","doi-asserted-by":"crossref","first-page":"22302","DOI":"10.1038\/s41598-021-01636-1","article-title":"Enhancing diversity analysis by repeatedly rarefying next generation sequencing data describing microbial communities","volume":"11","author":"Cameron","year":"2021","journal-title":"Sci Rep"},{"key":"2024071321260116900_btae431-B5","doi-asserted-by":"crossref","first-page":"16983","DOI":"10.1038\/srep16983","article-title":"Goodness-of-fit test for meta-analysis","volume":"5","author":"Chen","year":"2015","journal-title":"Sci Rep"},{"key":"2024071321260116900_btae431-B6","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1158\/2326-6066.CIR-21-0119","article-title":"Expansion of candidate HPV-specific t cells in the tumor microenvironment during chemoradiotherapy is prognostic in HPV16+ cancerschemoradiation promotes antigen-specific t-cell expansion","volume":"10","author":"Colbert","year":"2022","journal-title":"Cancer Immunol Res"},{"key":"2024071321260116900_btae431-B7","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1038\/334395a0","article-title":"T-cell antigen receptor genes and t-cell recognition","volume":"334","author":"Davis","year":"1988","journal-title":"Nature"},{"key":"2024071321260116900_btae431-B8","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1016\/0197-2456(86)90046-2","article-title":"Meta-analysis in clinical trials","volume":"7","author":"DerSimonian","year":"1986","journal-title":"Control Clin Trials"},{"key":"2024071321260116900_btae431-B9","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1038\/ng.3822","article-title":"Immunosequencing identifies signatures of cytomegalovirus exposure history and hla-mediated effects on the t cell repertoire","volume":"49","author":"Emerson","year":"2017","journal-title":"Nat Genet"},{"key":"2024071321260116900_btae431-B10","doi-asserted-by":"crossref","first-page":"586","DOI":"10.1016\/j.ajhg.2011.04.014","article-title":"Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies","volume":"88","author":"Han","year":"2011","journal-title":"Am J Hum Genet"},{"volume-title":"Statistical Methods for Meta-Analysis","year":"2014","author":"Hedges","key":"2024071321260116900_btae431-B11"},{"key":"2024071321260116900_btae431-B12","doi-asserted-by":"crossref","first-page":"2389","DOI":"10.1093\/bioinformatics\/btac127","article-title":"To rarefy or not to rarefy: robustness and efficiency trade-offs of rarefying microbiome data","volume":"38","author":"Hong","year":"2022","journal-title":"Bioinformatics"},{"key":"2024071321260116900_btae431-B13","doi-asserted-by":"crossref","first-page":"3689","DOI":"10.1093\/bioinformatics\/btac399","article-title":"A rarefaction-without-resampling extension of permanova for testing presence\u2013absence associations in the microbiome","volume":"38","author":"Hu","year":"2022","journal-title":"Bioinformatics"},{"key":"2024071321260116900_btae431-B14","doi-asserted-by":"crossref","first-page":"577","DOI":"10.2307\/1934145","article-title":"The nonconcept of species diversity: a critique and alternative parameters","volume":"52","author":"Hurlbert","year":"1971","journal-title":"Ecology"},{"key":"2024071321260116900_btae431-B15","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1385\/IR:23:2-3:215","article-title":"Immune recognition, response, and regulation: how t lymphocytes do it","volume":"23","author":"Joyce","year":"2001","journal-title":"Immunol Res"},{"key":"2024071321260116900_btae431-B16","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1016\/0092-8674(87)90568-X","article-title":"T cell tolerance by clonal elimination in the thymus","volume":"49","author":"Kappler","year":"1987","journal-title":"Cell"},{"key":"2024071321260116900_btae431-B17","doi-asserted-by":"crossref","first-page":"2859","DOI":"10.1002\/sim.8940","article-title":"Meta-analysis methods for multiple related markers: applications to microbiome studies with the results on multiple \u03b1-diversity indices","volume":"40","author":"Koh","year":"2021","journal-title":"Stat Med"},{"key":"2024071321260116900_btae431-B18","doi-asserted-by":"crossref","first-page":"5","DOI":"10.2307\/3545743","article-title":"Statistics and partitioning of species diversity, and similarity among multiple communities","volume":"76","author":"Lande","year":"1996","journal-title":"Oikos"},{"key":"2024071321260116900_btae431-B19","doi-asserted-by":"crossref","first-page":"e0178167","DOI":"10.1371\/journal.pone.0178167","article-title":"T cell receptor repertoire among women who cleared and failed to clear cervical human papillomavirus infection: an exploratory proof-of-principle study","volume":"13","author":"Lang Kuhs","year":"2018","journal-title":"PLoS One"},{"key":"2024071321260116900_btae431-B20","doi-asserted-by":"crossref","first-page":"20140291","DOI":"10.1098\/rstb.2014.0291","article-title":"Estimating t-cell repertoire diversity: limitations of classical estimators and a new approach","volume":"370","author":"Laydon","year":"2015","journal-title":"Phil Trans R Soc B"},{"key":"2024071321260116900_btae431-B21","doi-asserted-by":"crossref","first-page":"e1003531","DOI":"10.1371\/journal.pcbi.1003531","article-title":"Waste not, want not: why rarefying microbiome data is inadmissible","volume":"10","author":"McMurdie","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2024071321260116900_btae431-B22","doi-asserted-by":"crossref","first-page":"958","DOI":"10.1126\/science.286.5441.958","article-title":"A direct estimate of the human \u03b1\u03b2 t cell receptor diversity","volume":"286","author":"Petteri Arstila","year":"1999","journal-title":"Science"},{"key":"2024071321260116900_btae431-B23","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1038\/s41467-019-14273-0","article-title":"Comprehensive t cell repertoire characterization of non-small cell lung cancer","volume":"11","author":"Reuben","year":"2020","journal-title":"Nat Commun"},{"key":"2024071321260116900_btae431-B24","doi-asserted-by":"crossref","first-page":"4099","DOI":"10.1182\/blood-2009-04-217604","article-title":"Comprehensive assessment of T-cell receptor \u03b2-chain diversity in \u03b1\u03b2 T cells","volume":"114","author":"Robins","year":"2009","journal-title":"Blood"},{"key":"2024071321260116900_btae431-B25","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1186\/s12896-017-0379-9","article-title":"Overview of methodologies for T-cell receptor repertoire analysis","volume":"17","author":"Rosati","year":"2017","journal-title":"BMC Biotechnol"},{"key":"2024071321260116900_btae431-B26","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1086\/282541","article-title":"Marine benthic diversity: a comparative study","volume":"102","author":"Sanders","year":"1968","journal-title":"Am Nat"},{"volume-title":"Methods of Meta-Analysis: Correcting Error and Bias in Research Findings","year":"2014","author":"Schmidt","key":"2024071321260116900_btae431-B27"},{"key":"2024071321260116900_btae431-B28","doi-asserted-by":"crossref","first-page":"414","DOI":"10.1086\/282781","article-title":"Properties of the rarefaction diversity measurement","volume":"106","author":"Simberloff","year":"1972","journal-title":"Am Nat"},{"key":"2024071321260116900_btae431-B29","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1016\/j.jim.2007.01.019","article-title":"Methods for comparing the diversity of samples of the t cell receptor repertoire","volume":"321","author":"Venturi","year":"2007","journal-title":"J Immunol Methods"},{"key":"2024071321260116900_btae431-B30","doi-asserted-by":"crossref","first-page":"2407","DOI":"10.3389\/fmicb.2019.02407","article-title":"Rarefaction, alpha diversity, and statistics","volume":"10","author":"Willis","year":"2019","journal-title":"Front Microbiol"},{"key":"2024071321260116900_btae431-B31","doi-asserted-by":"crossref","DOI":"10.1007\/978-981-13-1534-3","volume-title":"Statistical Analysis of Microbiome Data with R","author":"Xia","year":"2018"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae431\/58382298\/btae431.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/7\/btae431\/58536930\/btae431.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/7\/btae431\/58536930\/btae431.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,13]],"date-time":"2024-07-13T22:18:28Z","timestamp":1720909108000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae431\/7702327"}},"subtitle":[],"editor":[{"given":"Christina","family":"Kendziorski","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2024,7,1]]},"references-count":31,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2024,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae431","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,7,1]]},"article-number":"btae431"}}