{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:21Z","timestamp":1772138061530,"version":"3.50.1"},"reference-count":22,"publisher":"Oxford University Press (OUP)","issue":"10","funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1705077"],"award-info":[{"award-number":["1705077"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["2007714"],"award-info":[{"award-number":["2007714"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["2146838"],"award-info":[{"award-number":["2146838"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["1R01HG008164"],"award-info":[{"award-number":["1R01HG008164"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1651236"],"award-info":[{"award-number":["1651236"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1703403"],"award-info":[{"award-number":["1703403"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01HG011649"],"award-info":[{"award-number":["R01HG011649"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,10,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Detection of structural variants (SVs) from the alignment of sample DNA reads to the reference genome is an important problem in understanding human diseases. Long reads that can span repeat regions, along with an accurate alignment of these long reads play an important role in identifying novel SVs. Long-read sequencers, such as nanopore sequencing, can address this problem by providing very long reads but with high error rates, making accurate alignment challenging. Many errors induced by nanopore sequencing have a bias because of the physics of the sequencing process and proper utilization of these error characteristics can play an important role in designing a robust aligner for SV detection problems. In this article, we design and evaluate HQAlign, an aligner for SV detection using nanopore sequenced reads. The key ideas of HQAlign include (i) using base-called nanopore reads along with the nanopore physics to improve alignments for SVs, (ii) incorporating SV-specific changes to the alignment pipeline, and (iii) adapting these into existing state-of-the-art long-read aligner pipeline, minimap2 (v2.24), for efficient alignments.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We show that HQAlign captures about 4%\u20136% complementary SVs across different datasets, which are missed by minimap2 alignments while having a standalone performance at par with minimap2 for real nanopore reads data. For the common SV calls between HQAlign and minimap2, HQAlign improves the start and the end breakpoint accuracy by about 10%\u201350% for SVs across different datasets. Moreover, HQAlign improves the alignment rate to 89.35% from minimap2 85.64% for nanopore reads alignment to recent telomere-to-telomere CHM13 assembly, and it improves to 86.65% from 83.48% for nanopore reads alignment to GRCh37 human genome.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>https:\/\/github.com\/joshidhaivat\/HQAlign.git.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad580","type":"journal-article","created":{"date-parts":[[2023,9,19]],"date-time":"2023-09-19T17:44:28Z","timestamp":1695145468000},"source":"Crossref","is-referenced-by-count":3,"title":["HQAlign: aligning nanopore reads for SV detection using current-level modeling"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9050-5931","authenticated-orcid":false,"given":"Dhaivat","family":"Joshi","sequence":"first","affiliation":[{"name":"Electrical & Computer Engineering, University of California, Los Angeles , CA, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Suhas","family":"Diggavi","sequence":"additional","affiliation":[{"name":"Electrical & Computer Engineering, University of California, Los Angeles , CA, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5395-1457","authenticated-orcid":false,"given":"Mark J P","family":"Chaisson","sequence":"additional","affiliation":[{"name":"Department of Quantitative and Computational Biology, University of Southern California , Los Angeles, CA, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sreeram","family":"Kannan","sequence":"additional","affiliation":[{"name":"Electrical & Computer Engineering, University of Washington , Seattle, WA, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2023,9,21]]},"reference":[{"key":"2023101423111015100_btad580-B1","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1038\/nrg2958","article-title":"Genome structural variation discovery and genotyping","volume":"12","author":"Alkan","year":"2011","journal-title":"Nat Rev Genet"},{"key":"2023101423111015100_btad580-B2","doi-asserted-by":"crossref","first-page":"1784","DOI":"10.1038\/s41467-018-08148-z","article-title":"Multi-platform discovery of haplotype-resolved structural variation in human genomes","volume":"10","author":"Chaisson","year":"2019","journal-title":"Nat Commun"},{"key":"2023101423111015100_btad580-B3","doi-asserted-by":"crossref","first-page":"518","DOI":"10.1038\/nbt.3423","article-title":"Three decades of nanopore sequencing","volume":"34","author":"Deamer","year":"2016","journal-title":"Nat Biotechnol"},{"key":"2023101423111015100_btad580-B4","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.806","article-title":"A framework for variation discovery and genotyping using next-generation DNA sequencing data","volume":"43","author":"DePristo","year":"2011","journal-title":"Nat Genet"},{"key":"2023101423111015100_btad580-B5","doi-asserted-by":"crossref","first-page":"eabf7117","DOI":"10.1126\/science.abf7117","article-title":"Haplotype-resolved diverse human genomes and integrated analysis of structural variation","volume":"372","author":"Ebert","year":"2021","journal-title":"Science"},{"key":"2023101423111015100_btad580-B6","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1186\/s13059-022-02840-6","article-title":"Truvari: refined structural variant comparison preserves allelic diversity","volume":"23","author":"English","year":"2022","journal-title":"Genome Biol"},{"key":"2023101423111015100_btad580-B7","doi-asserted-by":"crossref","first-page":"14061","DOI":"10.1038\/ncomms14061","article-title":"Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast","volume":"8","author":"Jeffares","year":"2017","journal-title":"Nat Commun"},{"key":"2023101423111015100_btad580-B8","doi-asserted-by":"crossref","first-page":"625","DOI":"10.1093\/bioinformatics\/btaa875","article-title":"QAlign: aligning nanopore reads accurately using current-level modeling","volume":"37","author":"Joshi","year":"2021","journal-title":"Bioinformatics"},{"key":"2023101423111015100_btad580-B9","doi-asserted-by":"crossref","first-page":"748","DOI":"10.1093\/bioinformatics\/btx668","article-title":"Evaluation of tools for long read RNA-seq splice-aware alignment","volume":"34","author":"Kri\u017eanovi\u0107","year":"2018","journal-title":"Bioinformatics"},{"key":"2023101423111015100_btad580-B10","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1038\/nbt.2950","article-title":"Decoding long nanopore sequencing reads of natural DNA","volume":"32","author":"Laszlo","year":"2014","journal-title":"Nat Biotechnol"},{"key":"2023101423111015100_btad580-B11","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2023101423111015100_btad580-B12","doi-asserted-by":"crossref","first-page":"4572","DOI":"10.1093\/bioinformatics\/btab705","article-title":"New strategies to improve minimap2 alignment accuracy","volume":"37","author":"Li","year":"2021","journal-title":"Bioinformatics"},{"key":"2023101423111015100_btad580-B13","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1038\/s41592-018-0054-7","article-title":"A synthetic-diploid benchmark for accurate variant-calling evaluation","volume":"15","author":"Li","year":"2018","journal-title":"Nat Methods"},{"key":"2023101423111015100_btad580-B14","doi-asserted-by":"crossref","first-page":"3216","DOI":"10.1109\/TIT.2018.2809001","article-title":"Models and information-theoretic bounds for nanopore sequencing","volume":"64","author":"Mao","year":"2018","journal-title":"IEEE Trans Inform Theory"},{"key":"2023101423111015100_btad580-B15","doi-asserted-by":"crossref","first-page":"1097","DOI":"10.1111\/1755-0998.12324","article-title":"A first look at the oxford nanopore minion sequencer","volume":"14","author":"Mikheyev","year":"2014","journal-title":"Mol Ecol Resour"},{"key":"2023101423111015100_btad580-B16","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1038\/s41586-020-1969-6","article-title":"Pan-cancer analysis of whole genomes","volume":"578","author":"The ICGC\/TCGA Pan-Cancer Analysis of Whole Genomes Consortium","year":"2020","journal-title":"Nature"},{"key":"2023101423111015100_btad580-B17","doi-asserted-by":"crossref","first-page":"e1009078","DOI":"10.1371\/journal.pcbi.1009078","article-title":"Lra: a long read aligner for sequences and contigs","volume":"17","author":"Ren","year":"2021","journal-title":"PLoS Comput Biol"},{"key":"2023101423111015100_btad580-B18","first-page":"344","author":"Rhie"},{"key":"2023101423111015100_btad580-B19","first-page":"1723","article-title":"Comprehensive variant detection in a human genome with highly accurate long reads","volume":"27","author":"Rowell","year":"2019","journal-title":"Eur J Hum Genet"},{"key":"2023101423111015100_btad580-B20","author":"Smolka","year":"2022"},{"key":"2023101423111015100_btad580-B21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/gix010","article-title":"NanoSim: nanopore sequence read simulator based on statistical characterization","volume":"6","author":"Yang","year":"2017","journal-title":"Gigascience"},{"key":"2023101423111015100_btad580-B22","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1038\/s41587-020-0538-8","article-title":"A robust benchmark for detection of germline large deletions and insertions","volume":"38","author":"Zook","year":"2020","journal-title":"Nat Biotechnol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad580\/51721470\/btad580.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/10\/btad580\/52147189\/btad580.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/10\/btad580\/52147189\/btad580.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,14]],"date-time":"2023-10-14T19:11:40Z","timestamp":1697310700000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad580\/7280145"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2023,9,21]]},"references-count":22,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2023,10,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad580","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.01.08.523172","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,10,1]]},"published":{"date-parts":[[2023,9,21]]},"article-number":"btad580"}}