{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:15:47Z","timestamp":1772172947376,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009254","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2021,8,13]],"date-time":"2021-08-13T00:00:00Z","timestamp":1628812800000}}],"reference-count":69,"publisher":"Public Library of Science (PLoS)","issue":"8","license":[{"start":{"date-parts":[[2021,8,3]],"date-time":"2021-08-03T00:00:00Z","timestamp":1627948800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100008664","name":"Juvenile Diabetes Research Foundation United Kingdom","doi-asserted-by":"publisher","award":["5-SRA-2015-130-A-N"],"award-info":[{"award-number":["5-SRA-2015-130-A-N"]}],"id":[{"id":"10.13039\/100008664","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100008664","name":"Juvenile Diabetes Research Foundation United Kingdom","doi-asserted-by":"publisher","award":["4-SRA-2017-473-A-N"],"award-info":[{"award-number":["4-SRA-2017-473-A-N"]}],"id":[{"id":"10.13039\/100008664","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004440","name":"Wellcome Trust","doi-asserted-by":"publisher","award":["107212\/Z\/15\/Z"],"award-info":[{"award-number":["107212\/Z\/15\/Z"]}],"id":[{"id":"10.13039\/100004440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004440","name":"Wellcome Trust","doi-asserted-by":"publisher","award":["203141\/Z\/16\/Z"],"award-info":[{"award-number":["203141\/Z\/16\/Z"]}],"id":[{"id":"10.13039\/100004440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004543","name":"China Scholarship Council","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004543","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100004440","name":"Wellcome 
Trust","doi-asserted-by":"publisher","award":["204911\/Z\/16\/Z"],"award-info":[{"award-number":["204911\/Z\/16\/Z"]}],"id":[{"id":"10.13039\/100004440","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>\n                    Driven by the necessity to survive environmental pathogens, the human immune system has evolved exceptional diversity and plasticity, to which several factors contribute including inheritable structural polymorphism of the underlying genes. Characterizing this variation is challenging due to the complexity of these loci, which contain extensive regions of paralogy, segmental duplication and high copy-number repeats, but recent progress in long-read sequencing and optical mapping techniques suggests this problem may now be tractable. Here we assess this by using long-read sequencing platforms from PacBio and Oxford Nanopore, supplemented with short-read sequencing and Bionano optical mapping, to sequence DNA extracted from CD14\n                    <jats:sup>+<\/jats:sup>\n                    monocytes and peripheral blood mononuclear cells from a single European individual identified as HV31. We use this data to build a\n                    <jats:italic>de novo<\/jats:italic>\n                    assembly of eight genomic regions encoding four key components of the immune system, namely the human leukocyte antigen, immunoglobulins, T cell receptors, and killer-cell immunoglobulin-like receptors. Validation of our assembly using k-mer based and alignment approaches suggests that it has high accuracy, with estimated base-level error rates below 1 in 10 kb, although we identify a small number of remaining structural errors. We use the assembly to identify heterozygous and homozygous structural variation in comparison to GRCh38. 
Despite analyzing only a single individual, we find multiple large structural variants affecting core genes at all three immunoglobulin regions and at two of the three T cell receptor regions. Several of these variants are not accurately callable using current algorithms, implying that further methodological improvements are needed. Our results demonstrate that assessing haplotype variation in these regions is possible given sufficiently accurate long-read and associated data. Continued reductions in the cost of these technologies will enable application of these methods to larger samples and provide a broader catalogue of germline structural variation at these loci, an important step toward making these regions accessible to large-scale genetic association studies.\n                  <\/jats:p>","DOI":"10.1371\/journal.pcbi.1009254","type":"journal-article","created":{"date-parts":[[2021,8,3]],"date-time":"2021-08-03T13:35:44Z","timestamp":1627997744000},"page":"e1009254","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":30,"title":["Using de novo assembly to identify structural variation of eight complex immune system gene regions"],"prefix":"10.1371","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1321-0384","authenticated-orcid":true,"given":"Jia-Yuan","family":"Zhang","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5738-4111","authenticated-orcid":true,"given":"Hannah","family":"Roberts","sequence":"additional","affiliation":[]},{"given":"David S. 
C.","family":"Flores","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9617-9994","authenticated-orcid":true,"given":"Antony J.","family":"Cutler","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4951-3056","authenticated-orcid":true,"given":"Andrew C.","family":"Brown","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0403-3091","authenticated-orcid":true,"given":"Justin P.","family":"Whalley","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9806-2186","authenticated-orcid":true,"given":"Olga","family":"Mielczarek","sequence":"additional","affiliation":[]},{"given":"David","family":"Buck","sequence":"additional","affiliation":[]},{"given":"Helen","family":"Lockstone","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5990-3739","authenticated-orcid":true,"given":"Barbara","family":"Xella","sequence":"additional","affiliation":[]},{"given":"Karen","family":"Oliver","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7942-2246","authenticated-orcid":true,"given":"Craig","family":"Corton","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6713-6346","authenticated-orcid":true,"given":"Emma","family":"Betteridge","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6838-0711","authenticated-orcid":true,"given":"Rachael","family":"Bashford-Rogers","sequence":"additional","affiliation":[]},{"given":"Julian C.","family":"Knight","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2740-8148","authenticated-orcid":true,"given":"John 
A.","family":"Todd","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1710-9024","authenticated-orcid":true,"given":"Gavin","family":"Band","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2021,8,3]]},"reference":[{"key":"pcbi.1009254.ref001","article-title":"Pervasive additive and non-additive effects within the HLA region contribute to disease risk in the UK Biobank.","author":"GR Venkataraman","year":"2020","journal-title":"BioRxiv."},{"key":"pcbi.1009254.ref002","doi-asserted-by":"crossref","first-page":"R29","DOI":"10.1093\/hmg\/dds384","article-title":"Interrogating the major histocompatibility complex with high-throughput genomics","volume":"21","author":"PI de Bakker","year":"2012","journal-title":"Hum Mol Genet"},{"key":"pcbi.1009254.ref003","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-017-00257-5","article-title":"Genome-wide association and HLA region fine-mapping studies identify susceptibility loci for multiple common infections.","volume":"8","author":"C Tian","year":"2017","journal-title":"Nat Commun"},{"key":"pcbi.1009254.ref004","doi-asserted-by":"crossref","first-page":"e1000791","DOI":"10.1371\/journal.pgen.1000791","article-title":"Common genetic variation and the control of HIV-1 in humans.","volume":"5","author":"J Fellay","year":"2009","journal-title":"PLoS Genet"},{"key":"pcbi.1009254.ref005","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1038\/nri.2017.143","article-title":"HLA variation and disease","volume":"18","author":"CA Dendrou","year":"2018","journal-title":"Nat Rev Immunol"},{"key":"pcbi.1009254.ref006","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1038\/nature16549","article-title":"Schizophrenia risk from complex variation of complement component 4","volume":"530","author":"A 
Sekar","year":"2016","journal-title":"Nature"},{"key":"pcbi.1009254.ref007","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1038\/gene.2012.12","article-title":"The immunoglobulin heavy chain locus: genetic variation, missing data, and implications for human disease","volume":"13","author":"CT Watson","year":"2012","journal-title":"Genes Immun"},{"key":"pcbi.1009254.ref008","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1038\/s41586-019-1595-3","article-title":"Analysis of the B cell receptor repertoire in six immune-mediated diseases","volume":"574","author":"R Bashford-Rogers","year":"2019","journal-title":"Nature"},{"key":"pcbi.1009254.ref009","doi-asserted-by":"crossref","first-page":"8","DOI":"10.3389\/fimmu.2013.00008","article-title":"Killer cell immunoglobulin-like receptor gene associations with autoimmune and allergic diseases, recurrent spontaneous abortion, and neoplasms.","volume":"4","author":"PK Kusnierczyk","year":"2013","journal-title":"Front Immunol."},{"key":"pcbi.1009254.ref010","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"The 1000 Genomes Project Consortium","year":"2015","journal-title":"Nature"},{"key":"pcbi.1009254.ref011","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/s41586-020-2287-8","article-title":"A structural variation reference for medical and population genetics","volume":"581","author":"Genome Aggregation Database Production Team","year":"2020","journal-title":"Nature"},{"key":"pcbi.1009254.ref012","first-page":"563866","article-title":"Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.","author":"D Taliun","year":"2019","journal-title":"BioRxiv"},{"key":"pcbi.1009254.ref013","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1186\/s13059-019-1707-2","article-title":"Systematic analysis of dark and camouflaged genes reveals disease-relevant genes 
hiding in plain sight","volume":"20","author":"MTW Ebbert","year":"2019","journal-title":"Genome Biol"},{"key":"pcbi.1009254.ref014","article-title":"Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans","volume":"2","author":"S Luo","year":"2019","journal-title":"Life Sci Alliance"},{"key":"pcbi.1009254.ref015","doi-asserted-by":"crossref","DOI":"10.3389\/fimmu.2020.02136","article-title":"A novel framework for characterizing genomic haplotype diversity in the human immunoglobulin heavy chain locus.","volume":"11","author":"OL Rodriguez","year":"2020","journal-title":"Front Immunol."},{"key":"pcbi.1009254.ref016","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1038\/s41587-020-0711-0","article-title":"Chromosome-scale, haplotype-resolved assembly of human genomes","volume":"39","author":"S Garg","year":"2021","journal-title":"Nat Biotechnol"},{"key":"pcbi.1009254.ref017","doi-asserted-by":"crossref","first-page":"eabf7117","DOI":"10.1126\/science.abf7117","article-title":"Haplotype-resolved diverse human genomes and integrated analysis of structural variation","volume":"372","author":"P Ebert","year":"2021","journal-title":"Science"},{"key":"pcbi.1009254.ref018","doi-asserted-by":"crossref","first-page":"D413","DOI":"10.1093\/nar\/gku1056","article-title":"IMGT, the international ImMunoGeneTics information system 25 years on","volume":"43","author":"M-P Lefranc","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009254.ref019","doi-asserted-by":"crossref","first-page":"D733","DOI":"10.1093\/nar\/gkv1189","article-title":"Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.","volume":"44","author":"NA O\u2019Leary","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009254.ref020","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1038\/jhg.2008.5","article-title":"The HLA genomic loci map: expression, interaction, diversity and 
disease","volume":"54","author":"T Shiina","year":"2009","journal-title":"J Hum Genet"},{"key":"pcbi.1009254.ref021","article-title":"The KIR gene cluster","author":"M Carrington","year":"2003","journal-title":"Natl Cent Biotechnol Inf US"},{"key":"pcbi.1009254.ref022","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1016\/j.ajhg.2013.03.004","article-title":"Complete Haplotype Sequence of the Human Immunoglobulin Heavy-Chain Variable, Diversity, and Joining Genes and Characterization of Allelic and Copy-Number Variation","volume":"92","author":"CT Watson","year":"2013","journal-title":"Am J Hum Genet"},{"key":"pcbi.1009254.ref023","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1038\/gene.2014.56","article-title":"Sequencing of the human IG light chain loci from a hydatidiform mole BAC library reveals locus-specific signatures of genetic diversity","volume":"16","author":"CT Watson","year":"2015","journal-title":"Genes Immun"},{"key":"pcbi.1009254.ref024","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1186\/s13059-020-02134-9","article-title":"Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies","volume":"21","author":"A Rhie","year":"2020","journal-title":"Genome Biol"},{"key":"pcbi.1009254.ref025","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1101\/gr.215087.116","article-title":"Canu: scalable and accurate long-read assembly via adaptive k -mer weighting and repeat separation","volume":"27","author":"S Koren","year":"2017","journal-title":"Genome Res"},{"key":"pcbi.1009254.ref026","article-title":"BiSCoT: Improving large eukaryotic genome assemblies with optical maps","author":"B Istace","year":"2019","journal-title":"Bioinformatics"},{"key":"pcbi.1009254.ref027","article-title":"TGS-GapCloser: fast and accurately passing through the Bermuda in large genome using error-prone third-generation long reads","author":"M 
Xu","year":"2019","journal-title":"Bioinformatics"},{"key":"pcbi.1009254.ref028","doi-asserted-by":"crossref","first-page":"e112963","DOI":"10.1371\/journal.pone.0112963","article-title":"Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement","volume":"9","author":"BJ Walker","year":"2014","journal-title":"PLoS ONE."},{"key":"pcbi.1009254.ref029","article-title":"Telomere-to-telomere assembly of a complete human X chromosome","author":"KH Miga","year":"2020","journal-title":"Nature"},{"key":"pcbi.1009254.ref030","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1038\/s41587-020-0538-8","article-title":"A robust benchmark for detection of germline large deletions and insertions","volume":"38","author":"JM Zook","year":"2020","journal-title":"Nat Biotechnol"},{"key":"pcbi.1009254.ref031","doi-asserted-by":"crossref","first-page":"1291","DOI":"10.1101\/gr.263566.120","article-title":"HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads","volume":"30","author":"S Nurk","year":"2020","journal-title":"Genome Res"},{"key":"pcbi.1009254.ref032","first-page":"2021","article-title":"The complete sequence of a human genome","author":"S Nurk","year":"2021","journal-title":"bioRxiv"},{"key":"pcbi.1009254.ref033","doi-asserted-by":"crossref","first-page":"R12","DOI":"10.1186\/gb-2004-5-2-r12","article-title":"Versatile and open software for comparing large genomes","volume":"5","author":"S Kurtz","year":"2004","journal-title":"Genome Biol"},{"key":"pcbi.1009254.ref034","doi-asserted-by":"crossref","first-page":"3021","DOI":"10.1093\/bioinformatics\/btw369","article-title":"Assemblytics: a web analytics tool for the detection of variants from an assembly","volume":"32","author":"M 
Nattestad","year":"2016","journal-title":"Bioinformatics"},{"key":"pcbi.1009254.ref035","doi-asserted-by":"crossref","first-page":"2202","DOI":"10.1093\/bioinformatics\/btx153","article-title":"GenomeScope: fast reference-free genome profiling from short reads","volume":"33","author":"GW Vurture","year":"2017","journal-title":"Bioinformatics"},{"key":"pcbi.1009254.ref036","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1089\/cmb.1995.2.275","article-title":"Toward Simplifying and Accurately Formulating Fragment Assembly","volume":"2","author":"EW Myers","year":"1995","journal-title":"J Comput Biol"},{"key":"pcbi.1009254.ref037","doi-asserted-by":"crossref","first-page":"e1003628","DOI":"10.1371\/journal.pcbi.1003628","article-title":"Genomic Characterization of Large Heterochromatic Gaps in the Human Genome Assembly","volume":"10","author":"N Altemose","year":"2014","journal-title":"PLoS Comput Biol."},{"key":"pcbi.1009254.ref038","doi-asserted-by":"crossref","first-page":"0069","DOI":"10.1038\/s41559-016-0069","article-title":"The evolution and population diversity of human-specific segmental duplications","volume":"1","author":"MY Dennis","year":"2017","journal-title":"Nat Ecol Evol"},{"key":"pcbi.1009254.ref039","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1038\/s41467-020-20146-8","article-title":"Construction and integration of three de novo Japanese human genome assemblies toward a population-specific reference","volume":"12","author":"J Takayama","year":"2021","journal-title":"Nat Commun"},{"key":"pcbi.1009254.ref040","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1101\/gr.3302705","article-title":"Interchromosomal segmental duplications of the pericentromeric region on the human Y chromosome","volume":"15","author":"S Kirsch","year":"2005","journal-title":"Genome Res"},{"key":"pcbi.1009254.ref041","doi-asserted-by":"crossref","first-page":"1690","DOI":"10.1101\/gr.6675307","article-title":"Islands of euchromatin-like 
sequence and expressed polymorphic sequences within the short arm of human chromosome 21","volume":"17","author":"R Lyle","year":"2007","journal-title":"Genome Res"},{"key":"pcbi.1009254.ref042","doi-asserted-by":"crossref","first-page":"682","DOI":"10.1038\/ng.3257","article-title":"Improved genome inference in the MHC using a population reference graph","volume":"47","author":"A Dilthey","year":"2015","journal-title":"Nat Genet"},{"key":"pcbi.1009254.ref043","doi-asserted-by":"crossref","first-page":"e1005151","DOI":"10.1371\/journal.pcbi.1005151","article-title":"High-Accuracy HLA Type Inference from Whole-Genome Sequencing Data Using Population Reference Graphs.","volume":"12","author":"AT Dilthey","year":"2016","journal-title":"PLOS Comput Biol."},{"key":"pcbi.1009254.ref044","article-title":"Practical use of methods for imputation of HLA alleles from SNP genotype data.","author":"A Motyer","year":"2016","journal-title":"bioRxiv"},{"key":"pcbi.1009254.ref045","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1002\/gepi.22334","article-title":"SNP-HLA Reference Consortium (SHLARC): HLA and SNP data sharing for promoting MHC-centric analyses in genomics.","volume":"44","author":"N Vince","year":"2020","journal-title":"Genet Epidemiol"},{"key":"pcbi.1009254.ref046","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1016\/j.ajhg.2015.09.005","article-title":"Imputation of KIR types from SNP variation data","volume":"97","author":"D Vukcevic","year":"2015","journal-title":"Am J Hum Genet"},{"key":"pcbi.1009254.ref047","doi-asserted-by":"crossref","first-page":"849","DOI":"10.1101\/gr.213611.116","article-title":"Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly","volume":"27","author":"VA Schneider","year":"2017","journal-title":"Genome Res"},{"key":"pcbi.1009254.ref048","doi-asserted-by":"crossref","first-page":"4794","DOI":"10.1038\/s41467-020-18564-9","article-title":"A diploid 
assembly-based benchmark for variants in the major histocompatibility complex.","volume":"11","author":"C-S Chin","year":"2020","journal-title":"Nat Commun"},{"key":"pcbi.1009254.ref049","doi-asserted-by":"crossref","first-page":"1050","DOI":"10.1038\/nmeth.4035","article-title":"Phased diploid genome assembly with single-molecule real-time sequencing","volume":"13","author":"C-S Chin","year":"2016","journal-title":"Nat Methods"},{"key":"pcbi.1009254.ref050","doi-asserted-by":"crossref","first-page":"498","DOI":"10.1089\/cmb.2014.0157","article-title":"WhatsHap: Weighted Haplotype Assembly for Future-Generation Sequencing Reads","volume":"22","author":"M Patterson","year":"2015","journal-title":"J Comput Biol"},{"key":"pcbi.1009254.ref051","doi-asserted-by":"crossref","first-page":"1293","DOI":"10.1038\/s41467-017-01389-4","article-title":"Dense and accurate whole-chromosome haplotyping of individual genomes.","volume":"8","author":"D Porubsky","year":"2017","journal-title":"Nat Commun."},{"key":"pcbi.1009254.ref052","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1038\/s41587-019-0217-9","article-title":"Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome","volume":"37","author":"AM Wenger","year":"2019","journal-title":"Nat Biotechnol"},{"key":"pcbi.1009254.ref053","article-title":"SDip: A novel graph-based approach to haplotype-aware assembly based structural variant calling in targeted segmental duplications sequencing","author":"D Heller","year":"2020","journal-title":"Bioinformatics"},{"key":"pcbi.1009254.ref054","doi-asserted-by":"crossref","first-page":"D493","DOI":"10.1093\/nar\/gkh103","article-title":"The UCSC Table Browser data retrieval tool","volume":"32","author":"D Karolchik","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009254.ref055","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise 
alignment for nucleotide sequences","volume":"34","author":"H. Li","year":"2018","journal-title":"Bioinformatics"},{"key":"pcbi.1009254.ref056","article-title":"Human Genome Assembly in 100 Minutes.","author":"C-S Chin","year":"2019","journal-title":"BioRxiv"},{"key":"pcbi.1009254.ref057","doi-asserted-by":"crossref","first-page":"764","DOI":"10.1093\/bioinformatics\/btr011","article-title":"A fast, lock-free approach for efficient parallel counting of occurrences of k-mers","volume":"27","author":"G Mar\u00e7ais","year":"2011","journal-title":"Bioinformatics"},{"key":"pcbi.1009254.ref058","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1038\/s41592-018-0001-7","article-title":"Accurate detection of complex structural variations using single-molecule sequencing.","volume":"15","author":"FJ Sedlazeck","year":"2018","journal-title":"Nat Methods"},{"key":"pcbi.1009254.ref059","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data","volume":"20","author":"A McKenna","year":"2010","journal-title":"Genome Res"},{"key":"pcbi.1009254.ref060","article-title":"IMGT\/V-QUEST: IMGT standardized analysis of the immunoglobulin (IG) and T cell receptor (TR) nucleotide sequences.","volume":"2011","author":"V Giudicelli","year":"2011","journal-title":"Cold Spring Harb Protoc."},{"key":"pcbi.1009254.ref061","first-page":"D948","article-title":"IPD-IMGT\/HLA Database.","volume":"48","author":"J Robinson","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009254.ref062","doi-asserted-by":"crossref","first-page":"D1234","DOI":"10.1093\/nar\/gks1140","article-title":"IPD\u2014the immuno polymorphism database","volume":"41","author":"J Robinson","year":"2012","journal-title":"Nucleic Acids 
Res"},{"key":"pcbi.1009254.ref063","doi-asserted-by":"crossref","first-page":"W34","DOI":"10.1093\/nar\/gkt382","article-title":"IgBLAST: an immunoglobulin variable domain sequence analysis tool","volume":"41","author":"J Ye","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1009254.ref064","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"C Camacho","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1009254.ref065","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1016\/0161-5890(92)90001-E","article-title":"The human \u03b3\/\u03b4+ and \u03b1\/\u03b2+ T cells: a branched pathway of differentiation","volume":"29","author":"D Alexandre","year":"1992","journal-title":"Mol Immunol"},{"key":"pcbi.1009254.ref066","article-title":"T-cell receptor gene rearrangement. Immunobiology: The Immune System in Health and Disease 5th edition","author":"CA Janeway","year":"2001","journal-title":"Garland Science"},{"key":"pcbi.1009254.ref067","doi-asserted-by":"crossref","first-page":"1174","DOI":"10.1038\/nbt.4277","article-title":"De novo assembly of haplotype-resolved genomes with trio binning","volume":"36","author":"S Koren","year":"2018","journal-title":"Nat Biotechnol"},{"key":"pcbi.1009254.ref068","doi-asserted-by":"crossref","first-page":"1044","DOI":"10.1038\/s41587-020-0503-6","article-title":"Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes","volume":"38","author":"K Shafin","year":"2020","journal-title":"Nat Biotechnol"},{"key":"pcbi.1009254.ref069","doi-asserted-by":"crossref","first-page":"540","DOI":"10.1038\/s41587-019-0072-8","article-title":"Assembly of long, error-prone reads using repeat graphs","volume":"37","author":"M Kolmogorov","year":"2019","journal-title":"Nat 
Biotechnol"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009254","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2021,8,13]],"date-time":"2021-08-13T00:00:00Z","timestamp":1628812800000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009254","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,13]],"date-time":"2021-08-13T13:45:56Z","timestamp":1628862356000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009254"}},"subtitle":[],"editor":[{"given":"Aakrosh","family":"Ratan","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,8,3]]},"references-count":69,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2021,8,3]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009254","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.02.03.429586","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,3]]}}}