{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,23]],"date-time":"2026-06-23T17:00:53Z","timestamp":1782234053974,"version":"3.54.5"},"reference-count":23,"publisher":"Oxford University Press (OUP)","issue":"18","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Recombined T- and B-cell receptor repertoires are increasingly being studied using next generation sequencing (NGS) in order to interrogate the repertoire composition as well as changes in the distribution of receptor clones under different physiological and disease states. This type of analysis requires efficient and unambiguous clonotype assignment to a large number of NGS read sequences, including the identification of the incorporated V and J gene segments and the CDR3 sequence. Current tools have deficits with respect to performance, accuracy and documentation of their underlying algorithms and usage.<\/jats:p>\n               <jats:p>Results: We present IMSEQ, a method to derive clonotype repertoires from NGS data with sophisticated routines for handling errors stemming from PCR and sequencing artefacts. The application can handle different kinds of input data originating from single- or paired-end sequencing in different configurations and is generic regarding the species and gene of interest. We have carefully evaluated our method with simulated and real world data and show that IMSEQ is superior to other tools with respect to its clonotyping as well as standalone error correction and runtime performance.<\/jats:p>\n               <jats:p>Availability and implementation: IMSEQ was implemented in C++ using the SeqAn library for efficient sequence analysis. It is freely available under the GPLv2 open source license and can be downloaded at www.imtools.org.<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <jats:p>Contact: \u00a0lkuchenb@inf.fu-berlin.de or peter.robinson@charite.de<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv309","type":"journal-article","created":{"date-parts":[[2015,5,19]],"date-time":"2015-05-19T00:50:48Z","timestamp":1431996648000},"page":"2963-2971","source":"Crossref","is-referenced-by-count":93,"title":["IMSEQ\u2014a fast and error aware approach to immunogenetic sequence analysis"],"prefix":"10.1093","volume":"31","author":[{"given":"Leon","family":"Kuchenbecker","sequence":"first","affiliation":[{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mikalai","family":"Nienen","sequence":"additional","affiliation":[{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jochen","family":"Hecht","sequence":"additional","affiliation":[{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Avidan U.","family":"Neumann","sequence":"additional","affiliation":[{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nina","family":"Babel","sequence":"additional","affiliation":[{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Knut","family":"Reinert","sequence":"additional","affiliation":[{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Peter N.","family":"Robinson","sequence":"additional","affiliation":[{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"},{"name":"1 Berlin-Brandenburg Center for Regenerative Therapies, Charit\u00e9 Universit\u00e4tsmedizin, Berlin, 2Department of Computer Science, Freie Universit\u00e4t, Berlin, 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany, 4Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel, 5Marien Hospital Herne, Ruhr University Bochum, Bochum and 6Institute of Medical Genetics and Human Genetics, Charit\u00e9 Universit\u00e4tsmedizin Berlin, Berlin, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2015,5,18]]},"reference":[{"key":"2023020202231524400_btv309-B1","doi-asserted-by":"crossref","first-page":"958","DOI":"10.1126\/science.286.5441.958","article-title":"A direct estimate of the human\u03b1\u03b2 T cell receptor diversity","volume":"286","author":"Arstila","year":"1999","journal-title":"Science"},{"key":"2023020202231524400_btv309-B2","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1038\/nmeth.2555","article-title":"MiTCR: software for T-cell receptor sequencing data analysis","volume":"10","author":"Bolotin","year":"2013","journal-title":"Nature Methods"},{"key":"2023020202231524400_btv309-B3","doi-asserted-by":"crossref","first-page":"W503","DOI":"10.1093\/nar\/gkn316","article-title":"IMGT\/V-QUEST: the highly customized and integrated system for IG and TR standardized VJ and VDJ sequence analysis","volume":"36","author":"Brochet","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023020202231524400_btv309-B4","first-page":"481","article-title":"Aligning two sequences within a specified diagonal band","volume":"8","author":"Chao","year":"1992","journal-title":"Comput. Appl. Biosci."},{"key":"2023020202231524400_btv309-B5","doi-asserted-by":"crossref","first-page":"e105","DOI":"10.1093\/nar\/gkn425","article-title":"Substantial biases in ultra-short read data sets from high-throughput DNA sequencing","volume":"36","author":"Dohm","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023020202231524400_btv309-B6","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/1471-2105-9-11","article-title":"SeqAn an efficient, generic C++ library for sequence analysis","volume":"9","author":"D\u00f6ring","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020202231524400_btv309-B7","doi-asserted-by":"crossref","first-page":"2842","DOI":"10.1111\/ajt.12431","article-title":"TCR repertoire analysis by next generation sequencing allows complex differential diagnosis of T cell\u2013related pathology","volume":"13","author":"Dziubianau","year":"2013","journal-title":"Am. J. Transplant."},{"key":"2023020202231524400_btv309-B8","doi-asserted-by":"crossref","first-page":"D256","DOI":"10.1093\/nar\/gki010","article-title":"IMGT\/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes","volume":"33","author":"Giudicelli","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023020202231524400_btv309-B9","article-title":"Mason\u2013a read simulator for second generation sequencing data","author":"Holtgrewe","year":"2010","journal-title":"Technical report FU Berlin"},{"key":"2023020202231524400_btv309-B10","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/j.chom.2014.05.013","article-title":"Human responses to influenza vaccination show seroconversion signatures and convergent antibody rearrangements","volume":"16","author":"Jackson","year":"2014","journal-title":"Cell Host and Microbe"},{"key":"2023020202231524400_btv309-B11","volume-title":"Immunobiology: The Immune System in Health and Disease","author":"Janeway","year":"1999"},{"key":"2023020202231524400_btv309-B12","doi-asserted-by":"crossref","first-page":"2333","DOI":"10.1038\/ncomms3333","article-title":"IMGT\/HighV QUEST paradigm for T cell receptor IMGT clonotype diversity and next generation repertoire immunoprofiling","volume":"4","author":"Li","year":"2013","journal-title":"Nat. Commun."},{"key":"2023020202231524400_btv309-B13","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1145\/316542.316550","article-title":"A fast bit-vector algorithm for approximate string matching based on dynamic programming","volume":"46","author":"Myers","year":"1999","journal-title":"J. ACM"},{"key":"2023020202231524400_btv309-B14","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1089\/cmb.2006.13.296","article-title":"Efficient q-gram filters for finding all \u03b5-matches over a given length","volume":"13","author":"Rasmussen","year":"2006","journal-title":"J. Comput. Biol."},{"key":"2023020202231524400_btv309-B15","doi-asserted-by":"crossref","first-page":"653","DOI":"10.1038\/nmeth.2960","article-title":"Towards error-free profiling of immune repertoires","volume":"11","author":"Shugay","year":"2014","journal-title":"Nat. Methods"},{"key":"2023020202231524400_btv309-B16","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1038\/nrg3642","article-title":"Sequencing depth and coverage: key considerations in genomic analyses","volume":"15","author":"Sims","year":"2014","journal-title":"Nat. Rev. Genet."},{"key":"2023020202231524400_btv309-B17","doi-asserted-by":"crossref","first-page":"542","DOI":"10.1093\/bioinformatics\/btt004","article-title":"Decombinator: a tool for fast, efficient gene assignment in T-cell receptor sequences using a finite state machine","volume":"29","author":"Thomas","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020202231524400_btv309-B18","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1038\/302575a0","article-title":"Somatic generation of antibody diversity","volume":"302","author":"Tonegawa","year":"1983","journal-title":"Nature"},{"key":"2023020202231524400_btv309-B19","doi-asserted-by":"crossref","first-page":"13463","DOI":"10.1073\/pnas.1312146110","article-title":"Genetic measurement of memory B-cell recall using antibody repertoire sequencing","volume":"110","author":"Vollmers","year":"2013","journal-title":"Proc. Natl. Acad. Sci."},{"key":"2023020202231524400_btv309-B20","doi-asserted-by":"crossref","first-page":"134ra63","DOI":"10.1126\/scitranslmed.3003656","article-title":"High-throughput sequencing detects minimal residual disease in acute T lymphoblastic leukemia","volume":"4","author":"Wu","year":"2012","journal-title":"Sci. Trans. Med."},{"key":"2023020202231524400_btv309-B21","doi-asserted-by":"crossref","first-page":"446","DOI":"10.4049\/jimmunol.1400711","article-title":"TCRklass: a new k-string\u2013based algorithm for human and mouse TCR repertoire characterization","volume":"194","author":"Yang","year":"2015","journal-title":"J. Immunol."},{"key":"2023020202231524400_btv309-B22","doi-asserted-by":"crossref","first-page":"W34","DOI":"10.1093\/nar\/gkt382","article-title":"IgBLAST: an immunoglobulin variable domain sequence analysis tool","volume":"41","author":"Ye","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"2023020202231524400_btv309-B23","doi-asserted-by":"crossref","first-page":"614","DOI":"10.1093\/bioinformatics\/btt593","article-title":"PEAR: a fast and accurate Illumina Paired-End reAd mergeR","volume":"30","author":"Zhang","year":"2014","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/18\/2963\/49035140\/bioinformatics_31_18_2963.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/18\/2963\/49035140\/bioinformatics_31_18_2963.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T03:47:35Z","timestamp":1675309655000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/18\/2963\/240876"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,5,18]]},"references-count":23,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2015,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv309","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,9,15]]},"published":{"date-parts":[[2015,5,18]]}}}