{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:21Z","timestamp":1772138061068,"version":"3.50.1"},"reference-count":20,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2023,6,5]],"date-time":"2023-06-05T00:00:00Z","timestamp":1685923200000},"content-version":"vor","delay-in-days":4,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100010269","name":"Wellcome Trust","doi-asserted-by":"publisher","award":["220096\/Z\/20\/Z"],"award-info":[{"award-number":["220096\/Z\/20\/Z"]}],"id":[{"id":"10.13039\/100010269","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Summary<\/jats:title>\n                    <jats:p>Adaptive Immune Receptor Repertoire Sequencing is a rapidly developing field that has advanced understanding of the role of the adaptive immune system in health and disease. Numerous tools have been developed to analyse the complex data produced by this technique but work to compare their accuracy and reliability has been limited. Thorough, systematic assessment of their performance is dependent on the ability to produce high quality simulated datasets with known ground truth. We have developed AIRRSHIP, a flexible and fast Python package that produces synthetic human B cell receptor sequences. AIRRSHIP uses a comprehensive set of reference data to replicate key mechanisms in the immunoglobulin recombination process, with a particular focus on junctional complexity. Repertoires generated by AIRRSHIP are highly similar to published data and all steps in the sequence generation process are recorded. These data can be used to not only determine the accuracy of repertoire analysis tools but can also, by tuning of the large number of user-controllable parameters, give insight into factors that contribute to inaccuracies in results.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>AIRRSHIP is implemented in Python. It is available via https:\/\/github.com\/Cowanlab\/airrship and on PyPI at https:\/\/pypi.org\/project\/airrship\/. Documentation can be found at https:\/\/airrship.readthedocs.io\/.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad365","type":"journal-article","created":{"date-parts":[[2023,6,3]],"date-time":"2023-06-03T07:37:39Z","timestamp":1685777859000},"source":"Crossref","is-referenced-by-count":6,"title":["AIRRSHIP: simulating human B cell receptor repertoire sequences"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5862-4014","authenticated-orcid":false,"given":"Catherine","family":"Sutherland","sequence":"first","affiliation":[{"name":"Institute of Immunology and Infection Research, School of Biological Sciences, University of Edinburgh , Edinburgh, EH9 3FL, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1949-4253","authenticated-orcid":false,"given":"Graeme J M","family":"Cowan","sequence":"additional","affiliation":[{"name":"Institute of Immunology and Infection Research, School of Biological Sciences, University of Edinburgh , Edinburgh, EH9 3FL, United Kingdom"}]}],"member":"286","published-online":{"date-parts":[[2023,6,5]]},"reference":[{"key":"2023061606245015000_btad365-B1","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1038\/nmeth.3364","article-title":"MiXCR: software for comprehensive adaptive immunity profiling","volume":"12","author":"Bolotin","year":"2015","journal-title":"Nat Methods"},{"key":"2023061606245015000_btad365-B2","doi-asserted-by":"crossref","first-page":"W503","DOI":"10.1093\/nar\/gkn316","article-title":"IMGT\/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis","volume":"36","author":"Brochet","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2023061606245015000_btad365-B3","doi-asserted-by":"crossref","first-page":"e0160853","DOI":"10.1371\/journal.pone.0160853","article-title":"A public database of memory and naive B-cell receptor sequences","volume":"11","author":"DeWitt","year":"2016","journal-title":"PLoS One"},{"key":"2023061606245015000_btad365-B4","doi-asserted-by":"crossref","first-page":"E862","DOI":"10.1073\/pnas.1417683112","article-title":"Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles","volume":"112","author":"Gadala-Maria","year":"2015","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023061606245015000_btad365-B5","first-page":"vbac062","article-title":"Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes","volume":"2","author":"Han","year":"2022","journal-title":"Bioinf Adv"},{"key":"2023061606245015000_btad365-B6","doi-asserted-by":"crossref","first-page":"899","DOI":"10.1007\/s00251-007-0260-4","article-title":"WHO-IUIS nomenclature subcommittee for immunoglobulins and T cell receptors report","volume":"59","author":"Lefranc","year":"2007","journal-title":"Immunogenetics"},{"key":"2023061606245015000_btad365-B7","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1038\/s41467-018-02832-w","article-title":"High-throughput immune repertoire analysis with IGoR","volume":"9","author":"Marcou","year":"2018","journal-title":"Nat Commun"},{"key":"2023061606245015000_btad365-B8","doi-asserted-by":"crossref","first-page":"2533","DOI":"10.3389\/fimmu.2019.02533","article-title":"sumrep: a summary statistic framework for immune receptor repertoire comparison and model validation","volume":"10","author":"Olson","year":"2019","journal-title":"Front Immunol"},{"key":"2023061606245015000_btad365-B9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1004409","article-title":"Consistency of VDJ rearrangement and substitution parameters enables accurate B cell receptor sequence annotation","volume":"12","author":"Ralph","year":"2016","journal-title":"PLoS Comput Biol"},{"key":"2023061606245015000_btad365-B10","doi-asserted-by":"crossref","first-page":"3213","DOI":"10.1093\/bioinformatics\/btv326","article-title":"IgSimulator: a versatile immunosequencing simulator","volume":"31","author":"Safonova","year":"2015","journal-title":"Bioinformatics"},{"key":"2023061606245015000_btad365-B11","doi-asserted-by":"crossref","first-page":"1731","DOI":"10.1093\/bioinformatics\/btz845","article-title":"Benchmarking immunoinformatic tools for the analysis of antibody repertoire sequences","volume":"36","author":"Smakaj","year":"2020","journal-title":"Bioinformatics"},{"key":"2023061606245015000_btad365-B12","doi-asserted-by":"crossref","first-page":"2206","DOI":"10.3389\/fimmu.2018.02206","article-title":"AIRR community standardized representations for annotated immune repertoires","volume":"9","author":"Vander Heiden","year":"2018","journal-title":"Front Immunol"},{"key":"2023061606245015000_btad365-B13","doi-asserted-by":"crossref","first-page":"3594","DOI":"10.1093\/bioinformatics\/btaa158","article-title":"ImmuneSIM: tunable multi-feature simulation of B- and T-cell receptor repertoires for immunoinformatics benchmarking","volume":"36","author":"Weber","year":"2020","journal-title":"Bioinformatics"},{"key":"2023061606245015000_btad365-B14","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1186\/s13073-015-0243-2","article-title":"Practical guidelines for B-cell receptor repertoire sequencing analysis","volume":"7","author":"Yaari","year":"2015","journal-title":"Genome Med"},{"key":"2023061606245015000_btad365-B15","doi-asserted-by":"crossref","first-page":"358","DOI":"10.3389\/fimmu.2013.00358","article-title":"Models of somatic hypermutation targeting and substitution based on synonymous mutations from high-throughput immunoglobulin sequencing data","volume":"4","author":"Yaari","year":"2013","journal-title":"Front Immunol"},{"key":"2023061606245015000_btad365-B16","doi-asserted-by":"crossref","first-page":"109110","DOI":"10.1016\/j.celrep.2021.109110","article-title":"Large-scale analysis of 2,152 Ig-seq datasets reveals key features of B cell biology and the antibody repertoire","volume":"35","author":"Yang","year":"2021","journal-title":"Cell Rep"},{"key":"2023061606245015000_btad365-B17","doi-asserted-by":"crossref","first-page":"739179","DOI":"10.3389\/fimmu.2021.739179","article-title":"Novel allele detection tool benchmark and application with antibody repertoire sequencing dataset","volume":"12","author":"Yang","year":"2021","journal-title":"Front Immunol"},{"key":"2023061606245015000_btad365-B18","doi-asserted-by":"crossref","first-page":"W34","DOI":"10.1093\/nar\/gkt382","article-title":"IgBLAST: an immunoglobulin variable domain sequence analysis tool","volume":"41","author":"Ye","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023061606245015000_btad365-B19","doi-asserted-by":"crossref","first-page":"3938","DOI":"10.1093\/bioinformatics\/btx533","article-title":"Comparison of methods for phylogenetic B-cell lineage inference using time-resolved antibody repertoire simulations (AbSim)","volume":"33","author":"Yermanos","year":"2017","journal-title":"Bioinformatics"},{"key":"2023061606245015000_btad365-B20","doi-asserted-by":"crossref","first-page":"105002","DOI":"10.1016\/j.isci.2022.105002","article-title":"B-cell receptor repertoire sequencing: deeper digging into the mechanisms and clinical aspects of immune-mediated diseases","volume":"25","author":"Zheng","year":"2022","journal-title":"iScience"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad365\/50518959\/btad365.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/6\/btad365\/50625342\/btad365.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/6\/btad365\/50625342\/btad365.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,16]],"date-time":"2023-06-16T02:25:30Z","timestamp":1686882330000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad365\/7190367"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,6,1]]},"references-count":20,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2023,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad365","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2022.12.20.521228","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,6,1]]},"published":{"date-parts":[[2023,6,1]]},"article-number":"btad365"}}