{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T21:02:28Z","timestamp":1769115748624,"version":"3.49.0"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"11","license":[{"start":{"date-parts":[[2025,10,22]],"date-time":"2025-10-22T00:00:00Z","timestamp":1761091200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"European Research Council (ERC) Advanced","award":["788016"],"award-info":[{"award-number":["788016"]}]},{"name":"National Institutes of Health National Institute of Allergy and Infectious Diseases","award":["R01 AI157854"],"award-info":[{"award-number":["R01 AI157854"]}]},{"DOI":"10.13039\/501100004359","name":"Swedish Research Council","doi-asserted-by":"publisher","award":["2022-05034"],"award-info":[{"award-number":["2022-05034"]}],"id":[{"id":"10.13039\/501100004359","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) has emerged as a central approach for studying T cell and B cell receptor populations, and is now an important component of studies of autoimmunity, immune responses to pathogens, vaccines, allergens, and cancers, and for antibody discovery. When amplifying the rearranged V(D)J genes encoding antigen receptors, each cycle of the Polymerase Chain Reaction (PCR) can produce spurious \u201cchimeric\u201d hybrids of two or more different template sequences. While the generation of chimeras is well understood in bacterial and viral sequencing, and there are dedicated tools to detect such sequences in bacterial and viral datasets, this is not the case for AIRR-seq. Further, the process that results in immune receptor sequences has domain-specific challenges, such as somatic hypermutation (SHM), and domain-specific opportunities, such as relatively well-known germline gene \u201creference\u201d sequences.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here, we describe CHMMAIRRa, a hidden Markov model for detecting chimeric sequences in AIRR-seq data, that specifically models SHM and incorporates germline reference sequences. We use simulations to characterize the performance of CHMMAIRRa and compare it to existing methods from other domains, we test the effect of PCR conditions on chimerism using IgM libraries generated in this study, and we apply CHMMAIRRa to four published AIRR-seq datasets to show the extent and impact of artifactual chimerism.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>CHMMAIRRa is published on the Julia package registry and is available at https:\/\/github.com\/MurrellGroup\/CHMMAIRRa.jl (DOI: 10.5281\/zenodo.17279881). The core HMM implementation is available at https:\/\/github.com\/MurrellGroup\/CHMMera.jl (DOI: 10.5281\/zenodo.17279998), and the scripts used to generate the results in this paper at https:\/\/github.com\/MurrellGroup\/CHMMAIRRaAnalyses (DOI: 10.5281\/zenodo.17281446).<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf576","type":"journal-article","created":{"date-parts":[[2025,10,15]],"date-time":"2025-10-15T12:26:21Z","timestamp":1760531181000},"source":"Crossref","is-referenced-by-count":1,"title":["Detection of PCR chimeras in adaptive immune receptor repertoire sequences"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1622-9240","authenticated-orcid":false,"given":"Mark","family":"Chernyshev","sequence":"first","affiliation":[{"name":"Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet , Stockholm 171 77,","place":["Sweden"]}]},{"given":"Aron","family":"St\u00e5lmarck","sequence":"additional","affiliation":[{"name":"Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet , Stockholm 171 77,","place":["Sweden"]}]},{"given":"Martin","family":"Corcoran","sequence":"additional","affiliation":[{"name":"Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet , Stockholm 171 77,","place":["Sweden"]}]},{"given":"Gunilla B","family":"Karlsson Hedestam","sequence":"additional","affiliation":[{"name":"Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet , Stockholm 171 77,","place":["Sweden"]}]},{"given":"Ben","family":"Murrell","sequence":"additional","affiliation":[{"name":"Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet , Stockholm 171 77,","place":["Sweden"]}]}],"member":"286","published-online":{"date-parts":[[2025,10,22]]},"reference":[{"key":"2025111511265633300_btaf576-B1","doi-asserted-by":"publisher","first-page":"164","DOI":"10.1214\/aoms\/1177697196","article-title":"A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains","volume":"41","author":"Baum","year":"1970","journal-title":"Ann Math Statist"},{"key":"2025111511265633300_btaf576-B2","doi-asserted-by":"publisher","first-page":"W503","DOI":"10.1093\/nar\/gkn316","article-title":"IMGT\/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis","volume":"36","author":"Brochet","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2025111511265633300_btaf576-B3","doi-asserted-by":"publisher","first-page":"2249","DOI":"10.1038\/s41467-023-37972-1","article-title":"Vaccination of SARS-CoV-2-infected individuals expands a broad range of clonally diverse affinity-matured B cell lineages","volume":"14","author":"Chernyshev","year":"2023","journal-title":"Nat Commun"},{"key":"2025111511265633300_btaf576-B4","doi-asserted-by":"publisher","first-page":"1153","DOI":"10.1016\/j.cell.2019.04.012","article-title":"Slow delivery immunization enhances HIV neutralizing antibody and germinal center responses via modulation of immunodominance","volume":"177","author":"Cirelli","year":"2019","journal-title":"Cell"},{"key":"2025111511265633300_btaf576-B5","doi-asserted-by":"publisher","first-page":"635","DOI":"10.1016\/j.immuni.2023.01.026","article-title":"Archaic humans have contributed to large-scale variation in modern human T cell receptor genes","volume":"56","author":"Corcoran","year":"2023","journal-title":"Immunity"},{"key":"2025111511265633300_btaf576-B6","doi-asserted-by":"publisher","first-page":"13642","DOI":"10.1038\/ncomms13642","article-title":"Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity","volume":"7","author":"Corcoran","year":"2016","journal-title":"Nat Commun"},{"key":"2025111511265633300_btaf576-B7","doi-asserted-by":"publisher","author":"Edgar","year":"2016","DOI":"10.1101\/074252"},{"key":"2025111511265633300_btaf576-B8","doi-asserted-by":"publisher","first-page":"6565","DOI":"10.1038\/ncomms7565","article-title":"Analysis of immunoglobulin transcripts and hypermutation following shivad8 infection and protein-plus-adjuvant immunization","volume":"6","author":"Francica","year":"2015","journal-title":"Nat Commun"},{"key":"2025111511265633300_btaf576-B9","doi-asserted-by":"publisher","first-page":"E862","DOI":"10.1073\/pnas.1417683112","article-title":"Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin v gene segment alleles","volume":"112","author":"Gadala-Maria","year":"2015","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025111511265633300_btaf576-B10","doi-asserted-by":"publisher","first-page":"2070","DOI":"10.1016\/j.ebiom.2015.11.034","article-title":"Analysis of B cell repertoire dynamics following hepatitis B vaccination in humans, and enrichment of vaccine-specific antibody sequences","volume":"2","author":"Galson","year":"2015","journal-title":"EBioMedicine"},{"key":"2025111511265633300_btaf576-B11","doi-asserted-by":"publisher","first-page":"612","DOI":"10.1158\/2326-6066.CIR-20-0817","article-title":"\u03b3\u03b4 T cells in merkel cell carcinomas have a proinflammatory profile prognostic of patient survival","volume":"9","author":"Gherardin","year":"2021","journal-title":"Cancer Immunol Res"},{"key":"2025111511265633300_btaf576-B12","doi-asserted-by":"publisher","first-page":"3356","DOI":"10.1093\/bioinformatics\/btv359","article-title":"Change-o: a toolkit for analyzing large-scale B cell immunoglobulin repertoire sequencing data","volume":"31","author":"Gupta","year":"2015","journal-title":"Bioinformatics"},{"key":"2025111511265633300_btaf576-B13","doi-asserted-by":"publisher","first-page":"494","DOI":"10.1101\/gr.112730.110","article-title":"Chimeric 16s rRNA sequence formation and detection in sanger and 454-pyrosequenced PCR amplicons","volume":"21","author":"Haas","year":"2011","journal-title":"Genome Res"},{"key":"2025111511265633300_btaf576-B14","doi-asserted-by":"publisher","first-page":"593","DOI":"10.1093\/bioinformatics\/btr708","article-title":"Art: a next-generation sequencing read simulator","volume":"28","author":"Huang","year":"2012","journal-title":"Bioinformatics"},{"key":"2025111511265633300_btaf576-B15","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1093\/molbev\/msx263","article-title":"Improved algorithmic complexity for the 3seq recombination detection algorithm","volume":"35","author":"Lam","year":"2018","journal-title":"Mol Biol Evol"},{"key":"2025111511265633300_btaf576-B16","doi-asserted-by":"publisher","first-page":"D964","DOI":"10.1093\/nar\/gkz822","article-title":"OGRDB: A reference database of inferred immune receptor genes","volume":"48","author":"Lees","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2025111511265633300_btaf576-B17","doi-asserted-by":"publisher","first-page":"6338","DOI":"10.1038\/s41467-024-50286-0","article-title":"Multi-compartmental diversification of neutralizing antibody lineages dissected in sars-cov-2 spike-immunized macaques","volume":"15","author":"Mandolesi","year":"2024","journal-title":"Nat Commun"},{"key":"2025111511265633300_btaf576-B18","doi-asserted-by":"publisher","first-page":"veaa087","DOI":"10.1093\/ve\/veaa087","article-title":"RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets","volume":"7","author":"Martin","year":"2021","journal-title":"Virus Evol"},{"key":"2025111511265633300_btaf576-B19","doi-asserted-by":"publisher","first-page":"590","DOI":"10.1093\/oxfordjournals.molbev.a025960","article-title":"Detecting recombination from gene trees","volume":"15","author":"Maynard Smith","year":"1998","journal-title":"Mol Biol Evol"},{"key":"2025111511265633300_btaf576-B20","doi-asserted-by":"publisher","first-page":"409","DOI":"10.1038\/s41592-019-0392-0","article-title":"Multiplexed detection of proteins, transcriptomes, clonotypes and crispr perturbations in single cells","volume":"16","author":"Mimitou","year":"2019","journal-title":"Nat Methods"},{"key":"2025111511265633300_btaf576-B21","doi-asserted-by":"publisher","first-page":"e20191155","DOI":"10.1084\/jem.20191155","article-title":"Extensive dissemination and intraclonal maturation of HIV env vaccine-induced B cell responses","volume":"217","author":"Phad","year":"2020","journal-title":"J Exp Med"},{"key":"2025111511265633300_btaf576-B22","doi-asserted-by":"publisher","first-page":"13757","DOI":"10.1073\/pnas.241370698","article-title":"Evaluation of methods for detecting recombination from DNA sequences: computer simulations","volume":"98","author":"Posada","year":"2001","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025111511265633300_btaf576-B23","doi-asserted-by":"publisher","first-page":"880","DOI":"10.1128\/aem.67.2.880-887.2001","article-title":"Evaluation of PCR-generated chimeras, mutations, and heteroduplexes with 16s rRNA gene-based cloning","volume":"67","author":"Qiu","year":"2001","journal-title":"Appl Environ Microbiol"},{"key":"2025111511265633300_btaf576-B24","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1109\/5.18626","article-title":"A tutorial on hidden markov models and selected applications in speech recognition","volume":"77","author":"Rabiner","year":"1989","journal-title":"Proc IEEE"},{"key":"2025111511265633300_btaf576-B25","doi-asserted-by":"publisher","first-page":"e1007133","DOI":"10.1371\/journal.pcbi.1007133","article-title":"Per-sample immunoglobulin germline inference from B cell receptor deep sequencing data","volume":"15","author":"Ralph","year":"2019","journal-title":"PLoS Comput Biol"},{"key":"2025111511265633300_btaf576-B26","doi-asserted-by":"publisher","first-page":"2136","DOI":"10.3389\/fimmu.2020.02136","article-title":"A novel framework for characterizing genomic haplotype diversity in the human immunoglobulin heavy chain locus","volume":"11","author":"Rodriguez","year":"2020","journal-title":"Front Immunol"},{"key":"2025111511265633300_btaf576-B27","doi-asserted-by":"publisher","first-page":"11112","DOI":"10.1038\/ncomms11112","article-title":"Individual heritable differences result in unique cell lymphocyte receptor repertoires of na\u00efve and antigen-experienced cells","volume":"7","author":"Rubelt","year":"2016","journal-title":"Nat Commun"},{"key":"2025111511265633300_btaf576-B28","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1016\/j.gene.2010.08.009","article-title":"Reducing chimera formation during PCR amplification to ensure accurate genotyping","volume":"469","author":"Smyth","year":"2010","journal-title":"Gene"},{"key":"2025111511265633300_btaf576-B29","doi-asserted-by":"publisher","first-page":"660","DOI":"10.3389\/fimmu.2019.00660","article-title":"High-quality library preparation for NGS-based immunoglobulin germline gene inference and repertoire expression analysis","volume":"10","author":"V\u00e1zquez Bernat","year":"2019","journal-title":"Front Immunol"},{"key":"2025111511265633300_btaf576-B30","doi-asserted-by":"publisher","first-page":"4645","DOI":"10.1128\/aem.63.12.4645-4650.1997","article-title":"Frequency of formation of chimeric molecules as a consequence of PCR coamplification of 16s rRNA genes from mixed bacterial genomes","volume":"63","author":"Wang","year":"1997","journal-title":"Appl Environ Microbiol"},{"key":"2025111511265633300_btaf576-B31","doi-asserted-by":"publisher","first-page":"358","DOI":"10.3389\/fimmu.2013.00358","article-title":"Models of somatic hypermutation targeting and substitution based on synonymous mutations from high-throughput immunoglobulin sequencing data","volume":"4","author":"Yaari","year":"2013","journal-title":"Front Immunol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf576\/64857513\/btaf576.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/11\/btaf576\/64857513\/btaf576.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/11\/btaf576\/64857513\/btaf576.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,15]],"date-time":"2025-11-15T16:27:04Z","timestamp":1763224024000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf576\/8297098"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,10,22]]},"references-count":31,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2025,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf576","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,11]]},"published":{"date-parts":[[2025,10,22]]},"article-number":"btaf576"}}