{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T12:00:20Z","timestamp":1758628820748,"version":"3.41.2"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2023,3,1]],"date-time":"2023-03-01T00:00:00Z","timestamp":1677628800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"CityU\/UGC Research Matching Grant Scheme","award":["9229012","9229013"],"award-info":[{"award-number":["9229012","9229013"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,3,19]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The adaptive immune receptor repertoire (AIRR), consisting of T- and B-cell receptors, is the core component of the immune system. The AIRR sequencing is commonly used in cancer immunotherapy and minimal residual disease (MRD) detection of leukemia and lymphoma. The AIRR is captured by primers and sequenced to yield paired-end (PE) reads. The PE reads could be merged into one sequence by the overlapped region between them. However, the wide range of AIRR data raises the difficulty, so a special tool is required. We developed a software package for IMmune PE reads merger of sequencing data, named IMperm. We used the k-mer-and-vote strategy to pin down the overlapped region rapidly. IMperm could handle all types of PE reads, eliminate adapter contamination and successfully merge low-quality and minor\/non-overlapping reads. Compared with existing tools, IMperm performed better in both simulated and sequencing data. Notably, IMperm was well suited to processing the data of MRD detection in leukemia and lymphoma and detected 19 novel MRD clones in 14 patients with leukemia from previously published data. Additionally, IMperm can handle PE reads from other sources, and we demonstrated its effectiveness on two genomic and one cell-free deoxyribonucleic acid datasets. IMperm is implemented in the C programming language and consumes little runtime and memory. It is freely available at https:\/\/github.com\/zhangwei2015\/IMperm.<\/jats:p>","DOI":"10.1093\/bib\/bbad080","type":"journal-article","created":{"date-parts":[[2023,3,9]],"date-time":"2023-03-09T14:46:09Z","timestamp":1678373169000},"source":"Crossref","is-referenced-by-count":1,"title":["IMperm: a fast and comprehensive IMmune Paired-End Reads Merger for sequencing data"],"prefix":"10.1093","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6968-6974","authenticated-orcid":false,"given":"Wei","family":"Zhang","sequence":"first","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong 999077 , China"}]},{"given":"Jia","family":"Ju","sequence":"additional","affiliation":[{"name":"Institute of Cancer Research, Shenzhen Bay Laboratory , Shenzhen 518132 , China"}]},{"given":"Yong","family":"Zhou","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong 999077 , China"}]},{"given":"Teng","family":"Xiong","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong 999077 , China"}]},{"given":"Mengyao","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong 999077 , China"}]},{"given":"Chaohui","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong 999077 , China"}]},{"given":"Shixin","family":"Lu","sequence":"additional","affiliation":[{"name":"NeoImmune Inc. , Shenzhen 518055 , China"}]},{"given":"Zefeng","family":"Lu","sequence":"additional","affiliation":[{"name":"NeoImmune Inc. , Shenzhen 518055 , China"}]},{"given":"Liya","family":"Lin","sequence":"additional","affiliation":[{"name":"NeoImmune Inc. , Shenzhen 518055 , China"}]},{"given":"Xiao","family":"Liu","sequence":"additional","affiliation":[{"name":"Shenzhen International Graduate School, Tsinghua University , Shenzhen 518055 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6246-6349","authenticated-orcid":false,"given":"Shuai Cheng","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Computer Science, City University of Hong Kong , Hong Kong 999077 , China"}]}],"member":"286","published-online":{"date-parts":[[2023,3,9]]},"reference":[{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"1251","DOI":"10.1038\/s41591-019-0522-3","article-title":"Clonal replacement of tumor-specific T cells following PD-1 blockade","volume":"25","author":"Yost","year":"2019","journal-title":"Nat Med"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.coi.2021.07.006","article-title":"Application of T cell receptor (TCR) repertoire analysis for the advancement of cancer immunotherapy","volume":"74","author":"Joshi","year":"2022","journal-title":"Curr Opin Immunol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J Mol Biol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"437","DOI":"10.1038\/s41586-020-2456-9","article-title":"Convergent antibody responses to SARS-CoV-2 in convalescent individuals","volume":"584","author":"Robbiani","year":"2020","journal-title":"Nature"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"1458","DOI":"10.1126\/science.aad9195","article-title":"HIV-1 broadly neutralizing antibody precursor B cells revealed by germline-targeting immunogen","volume":"351","author":"Jardine","year":"2016","journal-title":"Science"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1016\/j.jmoldx.2018.10.008","article-title":"Establishment of immunoglobulin heavy (IGH) chain clonality testing by next-generation sequencing for routine characterization of B-cell and plasma cell neoplasms","volume":"21","author":"Arcila","year":"2019","journal-title":"J Mol Diagn"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1016\/j.jmoldx.2020.10.015","article-title":"Routine evaluation of minimal residual disease in myeloma using next-generation sequencing clonality testing: feasibility, challenges, and direct comparison with high-sensitivity flow cytometry","volume":"23","author":"Ho","year":"2021","journal-title":"J Mol Diagn"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1111\/j.1365-2141.2011.08803.x","article-title":"BIOMED-2 PCR assays for IGK gene rearrangements are essential for B-cell clonality analysis in follicular lymphoma","volume":"155","author":"Payne","year":"2011","journal-title":"Br J Haematol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.anndiagpath.2017.04.008","article-title":"Molecular pathology diagnosis of diffuse large B cell lymphoma using BIOMED-2 clonal gene rearrangements","volume":"29","author":"Ghorbian","year":"2017","journal-title":"Ann Diagn Pathol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"2870","DOI":"10.1093\/bioinformatics\/bts563","article-title":"COPE: an accurate k-mer-based pair-end reads connection tool to facilitate genome assembly","volume":"28","author":"Liu","year":"2012","journal-title":"Bioinformatics"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"2957","DOI":"10.1093\/bioinformatics\/btr507","article-title":"FLASH: fast length adjustment of short reads to improve genome assemblies","volume":"27","author":"Magoc","year":"2011","journal-title":"Bioinformatics"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"614","DOI":"10.1093\/bioinformatics\/btt593","article-title":"PEAR: a fast and accurate Illumina paired-end reAd mergeR","volume":"30","author":"Zhang","year":"2014","journal-title":"Bioinformatics"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1186\/s12859-018-2307-y","article-title":"PIPEBAR and OverlapPER: tools for a fast and accurate DNA barcoding analysis and paired-end assembly","volume":"19","author":"Oliveira","year":"2018","journal-title":"BMC Bioinform"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"e141","DOI":"10.1093\/nar\/gku699","article-title":"leeHom: adaptor trimming and merging for Illumina sequencing reads","volume":"42","author":"Renaud","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"S1","DOI":"10.1186\/1755-8794-8-S3-S1","article-title":"Konnector v2.0: pseudo-long reads from paired-end sequencing data","volume":"8","author":"Vandervalk","year":"2015","journal-title":"BMC Med Genomics"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1038\/nmeth.2555","article-title":"MiTCR: software for T-cell receptor sequencing data analysis","volume":"10","author":"Bolotin","year":"2013","journal-title":"Nat Methods"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkv1016","article-title":"LymAnalyzer: a tool for comprehensive analysis of next generation sequencing data of T cell receptors and immunoglobulins","volume":"44","author":"Yu","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1038\/nmeth.3364","article-title":"MiXCR: software for comprehensive adaptive immunity profiling","volume":"12","author":"Bolotin","year":"2015","journal-title":"Nat Methods"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"1930","DOI":"10.1093\/bioinformatics\/btu138","article-title":"pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires","volume":"30","author":"Vander Heiden","year":"2014","journal-title":"Bioinformatics"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1038\/nri2260","article-title":"The molecular basis for public T-cell responses?","volume":"8","author":"Venturi","year":"2008","journal-title":"Nat Rev Immunol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1186\/s13073-015-0243-2","article-title":"Practical guidelines for B-cell receptor repertoire sequencing analysis","volume":"7","author":"Yaari","year":"2015","journal-title":"Genome Med"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"e0152464","DOI":"10.1371\/journal.pone.0152464","article-title":"Systematic comparative evaluation of methods for investigating the TCRbeta repertoire","volume":"11","author":"Liu","year":"2016","journal-title":"PloS One"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"1817","DOI":"10.1101\/gr.092924.109","article-title":"Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencing","volume":"19","author":"Freeman","year":"2009","journal-title":"Genome Res"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"3719","DOI":"10.4049\/jimmunol.1602039","article-title":"Characterization of the B cell receptor repertoire in the intestinal mucosa and of tumor-infiltrating lymphocytes in colorectal adenoma and carcinoma","volume":"198","author":"Zhang","year":"2017","journal-title":"J Immunol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1186\/s13073-015-0169-8","article-title":"A bioinformatic framework for immune repertoire diversity profiling enables detection of immunological status","volume":"7","author":"Greiff","year":"2015","journal-title":"Genome Med"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1158\/2326-6066.CIR-16-0107","article-title":"The different T-cell receptor repertoires in breast cancer tumors, draining lymph nodes, and adjacent tissues","volume":"5","author":"Wang","year":"2017","journal-title":"Cancer Immunol Res"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"615669","DOI":"10.3389\/fmicb.2020.615669","article-title":"Decrease of clone diversity in IgM repertoires of HBV chronically infected individuals with high level of viral replication","volume":"11","author":"Hong","year":"2020","journal-title":"Front Microbiol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1534\/genetics.115.176735","article-title":"IMonitor: a robust pipeline for TCR and BCR repertoire analysis","volume":"201","author":"Zhang","year":"2015","journal-title":"Genetics"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1053\/j.seminhematol.2018.03.003","article-title":"MRD testing in multiple myeloma: from a surrogate marker of clinical outcomes to an every-day clinical tool","volume":"55","author":"Landgren","year":"2018","journal-title":"Semin Hematol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"S19","DOI":"10.5045\/br.2020.S004","article-title":"Minimal residual disease in acute lymphoblastic leukemia: technical aspects and implications for clinical interpretation","volume":"55","author":"Kim","year":"2020","journal-title":"Blood Res"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"2456","DOI":"10.1182\/blood-2018-06-858613","article-title":"Minimal residual disease negativity using deep sequencing is a major prognostic factor in multiple myeloma","volume":"132","author":"Perrot","year":"2018","journal-title":"Blood"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"5173","DOI":"10.1182\/blood-2012-07-444042","article-title":"Deep-sequencing approach for minimal residual disease detection in acute lymphoblastic leukemia","volume":"120","author":"Faham","year":"2012","journal-title":"Blood"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"403","DOI":"10.3389\/fimmu.2016.00403","article-title":"Minimal residual disease detection and evolved IGH clones analysis in acute B lymphoblastic leukemia using IGH deep sequencing","volume":"7","author":"Wu","year":"2016","journal-title":"Front Immunol"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1038\/s41586-020-2008-3","article-title":"A new coronavirus associated with human respiratory disease in China","volume":"579","author":"Wu","year":"2020","journal-title":"Nature"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"e0083721","DOI":"10.1128\/mSystems.00837-21","article-title":"Genomic epidemiology and characterization of methicillin-resistant Staphylococcus aureus from bloodstream infections in China","volume":"6","author":"Jin","year":"2021","journal-title":"mSystems"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"61ra91","DOI":"10.1126\/scitranslmed.3001720","article-title":"Maternal plasma DNA sequencing reveals the genome-wide genetic and mutational profile of the fetus","volume":"2","author":"Lo","year":"2010","journal-title":"Sci Transl Med"},{"key":"2023032004560572300_","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1093\/bioinformatics\/btp698","article-title":"Fast and accurate long-read alignment with Burrows-Wheeler transform","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/2\/bbad080\/49560506\/bbad080.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/2\/bbad080\/49560506\/bbad080.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,3,25]],"date-time":"2023-03-25T02:20:39Z","timestamp":1679710839000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbad080\/7073966"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3]]},"references-count":37,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,3,19]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbad080","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"type":"print","value":"1467-5463"},{"type":"electronic","value":"1477-4054"}],"subject":[],"published-other":{"date-parts":[[2023,3]]},"published":{"date-parts":[[2023,3]]},"article-number":"bbad080"}}