{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T20:04:30Z","timestamp":1776456270009,"version":"3.51.2"},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2018,6,27]],"date-time":"2018-06-27T00:00:00Z","timestamp":1530057600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"HPC"},{"name":"Yale Center for Research Computing"},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","award":["R01AI104739"],"award-info":[{"award-number":["R01AI104739"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>B cells derive their antigen-specificity through the expression of Immunoglobulin (Ig) receptors on their surface. These receptors are initially generated stochastically by somatic re-arrangement of the DNA and further diversified following antigen-activation by a process of somatic hypermutation, which introduces mainly point substitutions into the receptor DNA at a high rate. Recent advances in next-generation sequencing have enabled large-scale profiling of the B cell Ig repertoire from blood and tissue samples. A key computational challenge in the analysis of these data is partitioning the sequences to identify descendants of a common B cell (i.e. a clone). Current methods group sequences using a fixed distance threshold, or a likelihood calculation that is computationally-intensive. Here, we propose a new method based on spectral clustering with an adaptive threshold to determine the local sequence neighborhood. Validation using simulated and experimental datasets demonstrates that this method has high sensitivity and specificity compared to a fixed threshold that is optimized for these measures. In addition, this method works on datasets where choosing an optimal fixed threshold is difficult and is more computationally efficient in all cases. The ability to quickly and accurately identify members of a clone from repertoire sequencing data will greatly improve downstream analyses. Clonally-related sequences cannot be treated independently in statistical models, and clonal partitions are used as the basis for the calculation of diversity metrics, lineage reconstruction and selection analysis. Thus, the spectral clustering-based method here represents an important contribution to repertoire analysis.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Source code for this method is freely available in the SCOPe (Spectral Clustering for clOne Partitioning) R package in the Immcantation framework: www.immcantation.org under the CC BY-SA 4.0 license.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty235","type":"journal-article","created":{"date-parts":[[2018,4,13]],"date-time":"2018-04-13T06:58:15Z","timestamp":1523602695000},"page":"i341-i349","source":"Crossref","is-referenced-by-count":88,"title":["A spectral clustering-based method for identifying clones from high-throughput B cell repertoire sequencing data"],"prefix":"10.1093","volume":"34","author":[{"given":"Nima","family":"Nouri","sequence":"first","affiliation":[{"name":"Department of Pathology, Yale School of Medicine, New Haven, CT, USA"}]},{"given":"Steven H","family":"Kleinstein","sequence":"additional","affiliation":[{"name":"Department of Pathology, Yale School of Medicine, New Haven, CT, USA"},{"name":"Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA"}]}],"member":"286","published-online":{"date-parts":[[2018,6,27]]},"reference":[{"key":"2023051604234006900_bty235-B1","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.coi.2016.12.004","article-title":"Germinal centers: programmed for affinity maturation and antibody diversification","volume":"45","author":"Bannard","year":"2017","journal-title":"Curr. Opin. Immunol"},{"key":"2023051604234006900_bty235-B2","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1128\/9781555817411.ch20","volume-title":"Antibodies for Infectious Diseases.","author":"Boyd","year":"2015"},{"key":"2023051604234006900_bty235-B3","doi-asserted-by":"crossref","first-page":"12ra23","DOI":"10.1126\/scitranslmed.3000540","article-title":"Measurement and clinical monitoring of human lymphocyte clonality by massively parallel VDJ pyrosequencing","volume":"1","author":"Boyd","year":"2009","journal-title":"Sci. Trans. Med"},{"key":"2023051604234006900_bty235-B4","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1038\/gene.2012.28","article-title":"Location and length distribution of somatic hypermutation-associated dna insertions and deletions reveals regions of antibody structural plasticity","volume":"13","author":"Briney","year":"2012","journal-title":"Genes Immun"},{"key":"2023051604234006900_bty235-B5","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1073\/pnas.1617959114","article-title":"Phylogenetic analysis of the human antibody repertoire reveals quantitative signatures of immune senescence and aging","volume":"114","author":"de Bourcy","year":"2017","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023051604234006900_bty235-B6","doi-asserted-by":"crossref","first-page":"20066","DOI":"10.1073\/pnas.1107498108","article-title":"Naive antibody gene-segment frequencies are heritable and unaltered by chronic lymphocyte ablation","volume":"108","author":"Glanville","year":"2011","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023051604234006900_bty235-B7","doi-asserted-by":"crossref","first-page":"3356","DOI":"10.1093\/bioinformatics\/btv359","article-title":"Change-o: a toolkit for analyzing large-scale b cell immunoglobulin repertoire sequencing data","volume":"31","author":"Gupta","year":"2015","journal-title":"Bioinformatics"},{"key":"2023051604234006900_bty235-B8","doi-asserted-by":"crossref","first-page":"2489","DOI":"10.4049\/jimmunol.1601850","article-title":"Hierarchical clustering can identify B cell clones with high confidence in ig repertoire sequencing data","volume":"198","author":"Gupta","year":"2017","journal-title":"J. Immunol"},{"key":"2023051604234006900_bty235-B9","doi-asserted-by":"crossref","first-page":"20140239","DOI":"10.1098\/rstb.2014.0239","article-title":"The analysis of clonal expansions in normal and autoimmune B cell repertoires","volume":"370","author":"Hershberg","year":"2015","journal-title":"Phil. Trans. R. Soc. B"},{"key":"2023051604234006900_bty235-B10","doi-asserted-by":"crossref","first-page":"R51","DOI":"10.1186\/ar4481","article-title":"Persistence and selection of an expanded b-cell clone in the setting of rituximab therapy for Sj\u00f6gren\u2019s syndrome","volume":"16","author":"Hershberg","year":"2014","journal-title":"Arthritis Res. Therapy"},{"key":"2023051604234006900_bty235-B11","doi-asserted-by":"crossref","first-page":"8614","DOI":"10.1073\/pnas.1709203114","article-title":"Sequence intrinsic somatic mutation mechanisms contribute to affinity maturation of VRC01-class HIV-1 broadly neutralizing antibodies","volume":"114","author":"Hwang","year":"2017","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023051604234006900_bty235-B12","doi-asserted-by":"crossref","first-page":"171ra19","DOI":"10.1126\/scitranslmed.3004794","article-title":"Lineage structure of the human antibody repertoire in response to influenza vaccination","volume":"5","author":"Jiang","year":"2013","journal-title":"Sci. Trans. Med"},{"key":"2023051604234006900_bty235-B13","doi-asserted-by":"crossref","first-page":"103","DOI":"10.12688\/f1000research.2-103.v1","article-title":"Reconstructing a B-cell clonal lineage. I. statistical inference of unobserved ancestors","volume":"2","author":"Kepler","year":"2013","journal-title":"F1000Research"},{"key":"2023051604234006900_bty235-B14","doi-asserted-by":"crossref","first-page":"4639","DOI":"10.4049\/jimmunol.171.9.4639","article-title":"Estimating hypermutation rates from clonal tree data","volume":"171","author":"Kleinstein","year":"2003","journal-title":"J. Immunol"},{"key":"2023051604234006900_bty235-B15","doi-asserted-by":"crossref","first-page":"21194","DOI":"10.1073\/pnas.1118357109","article-title":"High-throughput vdj sequencing for quantification of minimal residual disease in chronic lymphocytic leukemia and immune reconstitution assessment","volume":"108","author":"Logan","year":"2011","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023051604234006900_bty235-B16","doi-asserted-by":"crossref","first-page":"3180","DOI":"10.1073\/pnas.81.10.3180","article-title":"Generation of antibody diversity in the immune response of balb\/c mice to influenza virus hemagglutinin","volume":"81","author":"McKean","year":"1984","journal-title":"Proc. Natl. Acad. Sci"},{"key":"2023051604234006900_bty235-B17","doi-asserted-by":"crossref","first-page":"879","DOI":"10.1038\/nbt.3942","article-title":"An atlas of B-cell clonal distribution in the human body","volume":"35","author":"Meng","year":"2017","journal-title":"Nat. Biotechnol"},{"key":"2023051604234006900_bty235-B18","volume-title":"Graph Symmetry. NATO ASI Series (Series C: Mathematical and Physical Sciences)","author":"Mohar","year":"1997"},{"key":"2023051604234006900_bty235-B19","first-page":"871","article-title":"The laplacian spectrum of graphs","volume":"2","author":"Mohar","year":"1991","journal-title":"Graph Theor. Combinatorics Appl"},{"key":"2023051604234006900_bty235-B20","first-page":"1","article-title":"An introduction to immunobiology and innate immunity","volume-title":"Janeway's Immunobiology","author":"Murphy","year":"2011"},{"key":"2023051604234006900_bty235-B21","author":"Nouri","year":"2017"},{"key":"2023051604234006900_bty235-B22","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1016\/j.chom.2013.05.008","article-title":"Convergent antibody signatures in human dengue","volume":"13","author":"Parameswaran","year":"2013","journal-title":"Cell Host Microbe"},{"key":"2023051604234006900_bty235-B23","doi-asserted-by":"crossref","first-page":"e1005086","DOI":"10.1371\/journal.pcbi.1005086","article-title":"Likelihood-based inference of B cell clonal families","volume":"12","author":"Ralph","year":"2016","journal-title":"PLoS Comput. Biol"},{"key":"2023051604234006900_bty235-B24","doi-asserted-by":"crossref","first-page":"1274","DOI":"10.1038\/ni.3873","article-title":"Adaptive immune receptor repertoire community recommendations for sharing immune-repertoire sequencing data","volume":"18","author":"Rubelt","year":"2017","journal-title":"Nat. Immunol"},{"key":"2023051604234006900_bty235-B25","doi-asserted-by":"crossref","first-page":"2642","DOI":"10.4049\/jimmunol.156.7.2642","article-title":"Di-and trinucleotide target preferences of somatic mutagenesis in normal and autoreactive B cells","volume":"156","author":"Smith","year":"1996","journal-title":"J. Immunol"},{"key":"2023051604234006900_bty235-B26","doi-asserted-by":"crossref","first-page":"248ra107","DOI":"10.1126\/scitranslmed.3008879","article-title":"B cells populating the multiple sclerosis brain mature in the draining cervical lymph nodes","volume":"6","author":"Stern","year":"2014","journal-title":"Sci. Trans. Med"},{"key":"2023051604234006900_bty235-B27","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1038\/302575a0","article-title":"Somatic generation of antibody diversity","volume":"302","author":"Tonegawa","year":"1983","journal-title":"Nature"},{"key":"2023051604234006900_bty235-B28","doi-asserted-by":"crossref","first-page":"1587","DOI":"10.1039\/C5IB00169B","article-title":"Neutralizing antibodies against west nile virus identified directly from human B cells by single-cell analysis and next generation sequencing","volume":"7","author":"Tsioris","year":"2015","journal-title":"Integrative Biol"},{"key":"2023051604234006900_bty235-B29","doi-asserted-by":"crossref","first-page":"1460","DOI":"10.4049\/jimmunol.1601415","article-title":"Dysregulation of B cell repertoire formation in myasthenia gravis patients revealed through deep sequencing","volume":"198","author":"Vander Heiden","year":"2017","journal-title":"J. Immunol"},{"key":"2023051604234006900_bty235-B30","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1146\/annurev-immunol-020711-075032","article-title":"Germinal centers","volume":"30","author":"Victora","year":"2012","journal-title":"Annu. Rev. Immunol"},{"key":"2023051604234006900_bty235-B31","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","article-title":"A tutorial on spectral clustering","volume":"17","author":"Von Luxburg","year":"2007","journal-title":"Stat. Comput"},{"key":"2023051604234006900_bty235-B32","doi-asserted-by":"crossref","first-page":"603","DOI":"10.4049\/jimmunol.1301384","article-title":"Effects of aging, cytomegalovirus infection, and ebv infection on human b cell repertoires","volume":"192","author":"Wang","year":"2014","journal-title":"J. Immunol"},{"key":"2023051604234006900_bty235-B33","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1186\/s13073-015-0243-2","article-title":"Practical guidelines for B-cell receptor repertoire sequencing analysis","volume":"7","author":"Yaari","year":"2015","journal-title":"Genome Med"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/i341\/50315907\/bioinformatics_34_13_i341.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/i341\/50315907\/bioinformatics_34_13_i341.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T04:25:55Z","timestamp":1684211155000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/13\/i341\/5045726"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,27]]},"references-count":33,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2018,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty235","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,7,1]]},"published":{"date-parts":[[2018,6,27]]}}}