{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T09:09:58Z","timestamp":1773738598816,"version":"3.50.1"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2020,10,14]],"date-time":"2020-10-14T00:00:00Z","timestamp":1602633600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1705077"],"award-info":[{"award-number":["1705077"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["1R01HG008164"],"award-info":[{"award-number":["1R01HG008164"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1651236"],"award-info":[{"award-number":["1651236"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1703403"],"award-info":[{"award-number":["1703403"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,5,5]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Efficient and accurate alignment of DNA\/RNA sequence reads to each other or to a reference genome\/transcriptome is an important problem in genomic analysis. Nanopore sequencing has emerged as a major sequencing technology and many long-read aligners have been designed for aligning nanopore reads. However, the high error rate makes accurate and efficient alignment difficult. Utilizing the noise and error characteristics inherent in the sequencing process properly can play a vital role in constructing a robust aligner. In this article, we design QAlign, a pre-processor that can be used with any long-read aligner for aligning long reads to a genome\/transcriptome or to other long reads. The key idea in QAlign is to convert the nucleotide reads into discretized current levels that capture the error modes of the nanopore sequencer before running it through a sequence aligner.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We show that QAlign is able to improve alignment rates from around 80% up to 90% with nanopore reads when aligning to the genome. We also show that QAlign improves the average overlap quality by 9.2,\u20092.5 and 10.8% in three real datasets for read-to-read alignment. Read-to-transcriptome alignment rates are improved from 51.6% to 75.4% and 82.6% to 90% in two real datasets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>https:\/\/github.com\/joshidhaivat\/QAlign.git.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa875","type":"journal-article","created":{"date-parts":[[2020,9,29]],"date-time":"2020-09-29T17:35:55Z","timestamp":1601400955000},"page":"625-633","source":"Crossref","is-referenced-by-count":10,"title":["QAlign: aligning nanopore reads accurately using current-level modeling"],"prefix":"10.1093","volume":"37","author":[{"given":"Dhaivat","family":"Joshi","sequence":"first","affiliation":[{"name":"Electrical & Computer Engineering, University of California , Los Angeles, CA 90095, USA"}]},{"given":"Shunfu","family":"Mao","sequence":"additional","affiliation":[{"name":"Electrical & Computer Engineering, University of Washington , Seattle, WA 98195, USA"}]},{"given":"Sreeram","family":"Kannan","sequence":"additional","affiliation":[{"name":"Electrical & Computer Engineering, University of Washington , Seattle, WA 98195, USA"}]},{"given":"Suhas","family":"Diggavi","sequence":"additional","affiliation":[{"name":"Electrical & Computer Engineering, University of California , Los Angeles, CA 90095, USA"}]}],"member":"286","published-online":{"date-parts":[[2020,12,11]]},"reference":[{"key":"2023051800332296400_btaa875-B1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/ncomms16027","article-title":"Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells","volume":"8","author":"Byrne","year":"2017","journal-title":"Nat. Commun"},{"key":"2023051800332296400_btaa875-B2","first-page":"117","volume-title":"International Conference on Research in Computational Molecular Biology,\u00a0","author":"Chaisson","year":"2017"},{"key":"2023051800332296400_btaa875-B3","doi-asserted-by":"crossref","first-page":"1178","DOI":"10.1101\/gr.244939.118","article-title":"Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome","volume":"29","author":"De Coster","year":"2019","journal-title":"Genome Res"},{"key":"2023051800332296400_btaa875-B4","doi-asserted-by":"crossref","first-page":"518","DOI":"10.1038\/nbt.3423","article-title":"Three decades of nanopore sequencing","volume":"34","author":"Deamer","year":"2016","journal-title":"Nat. Biotechnol"},{"key":"2023051800332296400_btaa875-B5","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.806","article-title":"A framework for variation discovery and genotyping using next-generation DNA sequencing data","volume":"43","author":"DePristo","year":"2011","journal-title":"Nat. Genet"},{"key":"2023051800332296400_btaa875-B6","doi-asserted-by":"crossref","first-page":"1494","DOI":"10.1038\/nprot.2013.084","article-title":"De novo transcript sequence reconstruction from RNA-seq using the trinity platform for reference generation and analysis","volume":"8","author":"Haas","year":"2013","journal-title":"Nat. Protoc"},{"key":"2023051800332296400_btaa875-B7","doi-asserted-by":"crossref","first-page":"D853","DOI":"10.1093\/nar\/gky1095","article-title":"The ucsc genome browser database: 2019 update","volume":"47","author":"Haeussler","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023051800332296400_btaa875-B8","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1038\/nbt.4060","article-title":"Nanopore sequencing and assembly of a human genome with ultra-long reads","volume":"36","author":"Jain","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023051800332296400_btaa875-B9","doi-asserted-by":"crossref","first-page":"748","DOI":"10.1093\/bioinformatics\/btx668","article-title":"Evaluation of tools for long read RNA-seq splice-aware alignment","volume":"34","author":"Kri\u017eanovi\u0107","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051800332296400_btaa875-B10","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1038\/nbt.2950","article-title":"Decoding long nanopore sequencing reads of natural DNA","volume":"32","author":"Laszlo","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023051800332296400_btaa875-B11","doi-asserted-by":"crossref","first-page":"2987","DOI":"10.1093\/bioinformatics\/btr509","article-title":"A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data","volume":"27","author":"Li","year":"2011","journal-title":"Bioinformatics"},{"key":"2023051800332296400_btaa875-B12","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"MiniMap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051800332296400_btaa875-B13","doi-asserted-by":"crossref","first-page":"2899","DOI":"10.1093\/bioinformatics\/bty223","article-title":"DeepSimulator: a deep simulator for nanopore sequencing","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051800332296400_btaa875-B14","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1038\/nmeth.3444","article-title":"A complete bacterial genome assembled de novo using only nanopore sequencing data","volume":"12","author":"Loman","year":"2015","journal-title":"Nat. Methods"},{"key":"2023051800332296400_btaa875-B15","volume-title":"17th International Workshop on Algorithms in Bioinformatics (WABI 2017),\u00a0","author":"Mao","year":"2017"},{"key":"2023051800332296400_btaa875-B16","doi-asserted-by":"crossref","first-page":"3216","DOI":"10.1109\/TIT.2018.2809001","article-title":"Models and information-theoretic bounds for nanopore sequencing","volume":"64","author":"Mao","year":"2018","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2023051800332296400_btaa875-B17","doi-asserted-by":"crossref","first-page":"1097","DOI":"10.1111\/1755-0998.12324","article-title":"A first look at the oxford nanopore minion sequencer","volume":"14","author":"Mikheyev","year":"2014","journal-title":"Mol. Ecol. Resources"},{"key":"2023051800332296400_btaa875-B18","doi-asserted-by":"crossref","first-page":"9748","DOI":"10.1073\/pnas.171285098","article-title":"An eulerian path approach to DNA fragment assembly","volume":"98","author":"Pevzner","year":"2001","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051800332296400_btaa875-B19","doi-asserted-by":"crossref","first-page":"1786","DOI":"10.1101\/gr.2395204","article-title":"De novo repeat classification and fragment assembly","volume":"14","author":"Pevzner","year":"2004","journal-title":"Genome Res"},{"key":"2023051800332296400_btaa875-B20","first-page":"1","article-title":"Mapping and phasing of structural variation in patient genomes using nanopore sequencing","volume":"8","author":"Stancu","year":"2017","journal-title":"Nat. Commun"},{"key":"2023051800332296400_btaa875-B21","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1038\/nmeth.1315","article-title":"mrna-seq whole-transcriptome analysis of a single cell","volume":"6","author":"Tang","year":"2009","journal-title":"Nat. Methods"},{"key":"2023051800332296400_btaa875-B22","doi-asserted-by":"crossref","first-page":"8247","DOI":"10.1073\/pnas.1705691114","article-title":"De novo peptide sequencing by deep learning","volume":"114","author":"Tran","year":"2017","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051800332296400_btaa875-B23","doi-asserted-by":"crossref","DOI":"10.1099\/mgen.0.000132","article-title":"Completing bacterial genome assemblies with multiplex minion sequencing","volume":"3","author":"Wick","year":"2017","journal-title":"Microb. Genomics"},{"key":"2023051800332296400_btaa875-B24","doi-asserted-by":"crossref","first-page":"1859","DOI":"10.1093\/bioinformatics\/bti310","article-title":"Gmap: a genomic mapping and alignment program for mrna and est sequences","volume":"21","author":"Wu","year":"2005","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa875\/34874371\/btaa875.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/5\/625\/50356514\/btaa875.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/5\/625\/50356514\/btaa875.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,17]],"date-time":"2023-05-17T20:34:51Z","timestamp":1684355691000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/5\/625\/5922809"}},"subtitle":[],"editor":[{"given":"Inanc","family":"Birol","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,12,11]]},"references-count":24,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2021,5,5]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa875","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/862813","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,3,1]]},"published":{"date-parts":[[2020,12,11]]}}}