{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T11:13:59Z","timestamp":1774955639984,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T00:00:00Z","timestamp":1688083200000},"content-version":"vor","delay-in-days":29,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CNS-1910193"],"award-info":[{"award-number":["CNS-1910193"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,6,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The introduction of portable DNA sequencers such as the Oxford Nanopore Technologies MinION has enabled real-time and in the field DNA sequencing. However, in the field sequencing is actionable only when coupled with in the field DNA classification. This poses new challenges for metagenomic software since mobile deployments are typically in remote locations with limited network connectivity and without access to capable computing devices.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We propose new strategies to enable in the field metagenomic classification on mobile devices. We first introduce a programming model for expressing metagenomic classifiers that decomposes the classification process into well-defined and manageable abstractions. The model simplifies resource management in mobile setups and enables rapid prototyping of classification algorithms. Next, we introduce the compact string B-tree, a practical data structure for indexing text in external storage, and we demonstrate its viability as a strategy to deploy massive DNA databases on memory-constrained devices. Finally, we combine both solutions into Coriolis, a metagenomic classifier designed specifically to operate on lightweight mobile devices. Through experiments with actual MinION metagenomic reads and a portable supercomputer-on-a-chip, we show that compared with the state-of-the-art solutions Coriolis offers higher throughput and lower resource consumption without sacrificing quality of classification.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Source code and test data are available from http:\/\/score-group.org\/?id=smarten.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad243","type":"journal-article","created":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T08:14:25Z","timestamp":1688112865000},"page":"i66-i75","source":"Crossref","is-referenced-by-count":10,"title":["Coriolis: enabling metagenomic classification on lightweight mobile devices"],"prefix":"10.1093","volume":"39","author":[{"given":"Andrew J","family":"Mikalsen","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, University at Buffalo , Buffalo, NY 14260, United States"}]},{"given":"Jaroslaw","family":"Zola","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University at Buffalo , Buffalo, NY 14260, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,6,30]]},"reference":[{"key":"2023063008142075200_btad243-B1","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1186\/s13059-018-1568-0","article-title":"KrakenUniq: confident and fast metagenomics classification using unique k-mer counts","volume":"19","author":"Breitwieser","year":"2018","journal-title":"Genome Biol"},{"key":"2023063008142075200_btad243-B2","doi-asserted-by":"crossref","first-page":"18022","DOI":"10.1038\/s41598-017-18364-0","article-title":"Nanopore DNA sequencing and genome assembly on the international space station","volume":"7","author":"Castro-Wallace","year":"2017","journal-title":"Sci Rep"},{"key":"2023063008142075200_btad243-B3","author":"Collet","year":"2022"},{"key":"2023063008142075200_btad243-B4","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1016\/j.future.2018.07.036","article-title":"Combining edge and cloud computing for low-power, cost-effective metagenomics analysis","volume":"90","author":"D\u2019Agostino","year":"2019","journal-title":"Future Generation Computer Systems"},{"key":"2023063008142075200_btad243-B5","first-page":"137","author":"Dean","year":"2004"},{"key":"2023063008142075200_btad243-B6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13073-016-0356-2","article-title":"Mobile real-time surveillance of Zika virus in Brazil","volume":"8","author":"Faria","year":"2016","journal-title":"Genome Med"},{"key":"2023063008142075200_btad243-B7","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1145\/129888.129896","article-title":"Bit-Tree: a data structure for fast file processing","volume":"35","author":"Ferguson","year":"1992","journal-title":"Commun ACM"},{"key":"2023063008142075200_btad243-B8","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1145\/301970.301973","article-title":"The string B-tree: a new data structure for string search in external memory and its applications","volume":"46","author":"Ferragina","year":"1999","journal-title":"J ACM"},{"key":"2023063008142075200_btad243-B9","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1038\/nrg.2017.88","article-title":"Towards a genomics-informed, real-time, global pathogen surveillance system","volume":"19","author":"Gardy","year":"2018","journal-title":"Nat Rev Genet"},{"key":"2023063008142075200_btad243-B10","first-page":"1","author":"Gog","year":"2013"},{"key":"2023063008142075200_btad243-B11","first-page":"500","author":"Grzesik","year":"2021"},{"key":"2023063008142075200_btad243-B12","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1186\/2049-2618-2-33","article-title":"PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples","volume":"2","author":"Hong","year":"2014","journal-title":"Microbiome"},{"key":"2023063008142075200_btad243-B13","doi-asserted-by":"crossref","first-page":"2","DOI":"10.7171\/jbt.17-2801-009","article-title":"Real-time DNA sequencing in the Antarctic dry valleys using the oxford nanopore sequencer","volume":"28","author":"Johnson","year":"2017","journal-title":"J Biomol Tech"},{"key":"2023063008142075200_btad243-B14","doi-asserted-by":"crossref","first-page":"1721","DOI":"10.1101\/gr.210641.116","article-title":"Centrifuge: rapid and sensitive classification of metagenomic sequences","volume":"26","author":"Kim","year":"2016","journal-title":"Genome Res"},{"key":"2023063008142075200_btad243-B15","first-page":"1","author":"Ko","year":"2018"},{"key":"2023063008142075200_btad243-B16","doi-asserted-by":"crossref","first-page":"5125","DOI":"10.1038\/s41598-020-61989-x","article-title":"Benchmarking the MinION","volume":"10","author":"Leidenfrost","year":"2020","journal-title":"Sci Rep"},{"key":"2023063008142075200_btad243-B17","author":"Lu","year":"2022"},{"key":"2023063008142075200_btad243-B18","author":"Massie","year":"2013"},{"key":"2023063008142075200_btad243-B19","doi-asserted-by":"crossref","first-page":"467","DOI":"10.1016\/j.future.2018.05.010","article-title":"Low-power portable devices for metagenomics analysis: fog computing makes bioinformatics ready for the internet of things","volume":"88","author":"Merelli","year":"2018","journal-title":"Fut Generat Comput Syst"},{"key":"2023063008142075200_btad243-B20","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1016\/j.procs.2018.07.164","article-title":"Third-generation sequencing data analytics on mobile devices: cache oblivious and out-of-core approaches as a proof-of-concept","volume":"134","author":"Milicchio","year":"2018","journal-title":"Proc Comput Sci"},{"key":"2023063008142075200_btad243-B21","author":"Mori","year":"2022"},{"key":"2023063008142075200_btad243-B22","doi-asserted-by":"crossref","first-page":"4399","DOI":"10.1093\/bioinformatics\/btaa237","article-title":"Portable nanopore analytics: are we there yet?","volume":"36","author":"Oliva","year":"2020","journal-title":"Bioinformatics"},{"key":"2023063008142075200_btad243-B23","author":"Oxford Nanopore Technologies","year":"2022"},{"key":"2023063008142075200_btad243-B24","doi-asserted-by":"crossref","first-page":"442","DOI":"10.1038\/s41587-020-00746-x","article-title":"Readfish enables targeted nanopore sequencing of gigabase-sized genomes","volume":"39","author":"Payne","year":"2021","journal-title":"Nat Biotechnol"},{"key":"2023063008142075200_btad243-B25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/giy033","article-title":"Real-time DNA barcoding in a rainforest using nanopore sequencing","volume":"7","author":"Pomerantz","year":"2018","journal-title":"GigaScience"},{"key":"2023063008142075200_btad243-B26","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1038\/nature16996","article-title":"Real-time, portable genome sequencing for Ebola surveillance","volume":"530","author":"Quick","year":"2016","journal-title":"Nature"},{"key":"2023063008142075200_btad243-B27","doi-asserted-by":"crossref","first-page":"833","DOI":"10.1038\/nbt.3935","article-title":"Shotgun metagenomics, from sampling to analysis","volume":"35","author":"Quince","year":"2017","journal-title":"Nat Biotechnol"},{"key":"2023063008142075200_btad243-B28","doi-asserted-by":"crossref","first-page":"1063","DOI":"10.1038\/nmeth.4458","article-title":"Critical assessment of metagenome interpretation-a benchmark of metagenomics software","volume":"14","author":"Sczyrba","year":"2017","journal-title":"Nat Methods"},{"key":"2023063008142075200_btad243-B29","author":"SMARTEn","year":"2023"},{"key":"2023063008142075200_btad243-B30","doi-asserted-by":"crossref","first-page":"e1002195","DOI":"10.1371\/journal.pbio.1002195","article-title":"Big data: astronomical or genomical?","volume":"13","author":"Stephens","year":"2015","journal-title":"PLoS Biol"},{"key":"2023063008142075200_btad243-B31","author":"Vitter","year":"2008"},{"key":"2023063008142075200_btad243-B32","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.jbiotec.2016.12.006","article-title":"MinION as part of a biomedical rapidly deployable laboratory","volume":"250","author":"Walter","year":"2017","journal-title":"J Biotechnol"},{"key":"2023063008142075200_btad243-B33","article-title":"Portable DNA sequencer MinION helps build the internet of living things","author":"Waltz","year":"2017","journal-title":"IEEE Spectrum"},{"key":"2023063008142075200_btad243-B34","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1186\/s13059-019-1891-0","article-title":"Improved metagenomic analysis with kraken 2","volume":"20","author":"Wood","year":"2019","journal-title":"Genome Biol"},{"key":"2023063008142075200_btad243-B35","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/gix010","article-title":"NanoSim: nanopore sequence read simulator based on statistical characterization","volume":"6","author":"Yang","year":"2017","journal-title":"Gigascience"},{"key":"2023063008142075200_btad243-B36","doi-asserted-by":"crossref","first-page":"i477","DOI":"10.1093\/bioinformatics\/btab264","article-title":"Real-time mapping of nanopore raw signals","volume":"37","author":"Zhang","year":"2021","journal-title":"Bioinformatics"},{"key":"2023063008142075200_btad243-B37","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1089\/10665270050081478","article-title":"A greedy algorithm for aligning DNA sequences","volume":"7","author":"Zhang","year":"2000","journal-title":"J Comput Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/Supplement_1\/i66\/50741350\/btad243.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/Supplement_1\/i66\/50741350\/btad243.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T08:14:47Z","timestamp":1688112887000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/39\/Supplement_1\/i66\/7210430"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,1]]},"references-count":37,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2023,6,30]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad243","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,6,1]]},"published":{"date-parts":[[2023,6,1]]}}}