{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T07:20:51Z","timestamp":1760080851374,"version":"3.41.2"},"reference-count":13,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2023,7,27]],"date-time":"2023-07-27T00:00:00Z","timestamp":1690416000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Institute of Health","award":["U24CA209999"],"award-info":[{"award-number":["U24CA209999"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>In time-critical clinical settings, such as precision medicine, genomic data needs to be processed as fast as possible to arrive at data-informed treatment decisions in a timely fashion. While sequencing throughput has dramatically increased over the past decade, bioinformatics analysis throughput has not been able to keep up with the pace of computer hardware improvement, and consequently has now turned into the primary bottleneck. Modern computer hardware today is capable of much higher performance than current genomic informatics algorithms can typically utilize, therefore presenting opportunities for significant improvement of performance. Accessing the raw sequencing data from BAM files, e.g. is a necessary and time-consuming step in nearly all sequence analysis tools, however existing programming libraries for BAM access do not take full advantage of the parallel input\/output capabilities of storage devices.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In an effort to stimulate the development of a new generation of faster sequence analysis tools, we developed quickBAM, a software library to accelerate sequencing data access by exploiting the parallelism in commodity storage hardware currently widely available. We demonstrate that analysis software ported to quickBAM consistently outperforms their current versions, in some cases finishing an analysis in under 3\u2009min while the original version took 1.5\u2009h, using the same storage solution.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Open source and freely available at https:\/\/gitlab.com\/yiq\/quickbam\/, we envision that quickBAM will enable a new generation of high-performance informatics tools, either directly boosting their performance if they are currently data-access bottlenecked, or allow data-access to keep up with further optimizations in algorithms and compute techniques.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad463","type":"journal-article","created":{"date-parts":[[2023,7,26]],"date-time":"2023-07-26T21:10:00Z","timestamp":1690405800000},"source":"Crossref","is-referenced-by-count":2,"title":["quickBAM: a parallelized BAM file access API for high-throughput sequence analysis informatics"],"prefix":"10.1093","volume":"39","author":[{"given":"Anders","family":"Pitman","sequence":"first","affiliation":[{"name":"UTAH Center for Genetic Discovery, Department of Human Genetics, University of Utah School of Medicine , 15 N 2030 E , Salt Lake City, UT 84112, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaomeng","family":"Huang","sequence":"additional","affiliation":[{"name":"UTAH Center for Genetic Discovery, Department of Human Genetics, University of Utah School of Medicine , 15 N 2030 E , Salt Lake City, UT 84112, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gabor T","family":"Marth","sequence":"additional","affiliation":[{"name":"UTAH Center for Genetic Discovery, Department of Human Genetics, University of Utah School of Medicine , 15 N 2030 E , Salt Lake City, UT 84112, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8612-9598","authenticated-orcid":false,"given":"Yi","family":"Qiao","sequence":"additional","affiliation":[{"name":"UTAH Center for Genetic Discovery, Department of Human Genetics, University of Utah School of Medicine , 15 N 2030 E , Salt Lake City, UT 84112, United States"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2023,7,27]]},"reference":[{"key":"2023081001051022300_btad463-B1","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"1000 Genomes Project Consortium","year":"2015","journal-title":"Nature"},{"key":"2023081001051022300_btad463-B2","doi-asserted-by":"crossref","first-page":"1691","DOI":"10.1093\/bioinformatics\/btr174","article-title":"BamTools: a C++ API and toolkit for analyzing and managing BAM files","volume":"27","author":"Barnett","year":"2011","journal-title":"Bioinformatics"},{"key":"2023081001051022300_btad463-B3","doi-asserted-by":"crossref","first-page":"giab007","DOI":"10.1093\/gigascience\/giab007","article-title":"HTSlib: C library for reading\/writing high-throughput sequencing data","volume":"10","author":"Bonfield","year":"2021","journal-title":"Gigascience"},{"key":"2023081001051022300_btad463-B4","doi-asserted-by":"crossref","first-page":"1207","DOI":"10.1007\/s00431-019-03399-4","article-title":"RAPIDOMICS: rapid genome-wide sequencing in a neonatal intensive care unit-successes and challenges","volume":"178","author":"Elliott","year":"2019","journal-title":"Eur J Pediatr"},{"key":"2023081001051022300_btad463-B5","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1007\/978-3-540-30218-6_19","volume-title":"Recent Advances in Parallel Virtual Machine and Message Passing Interface","author":"Gabriel","year":"2004"},{"key":"2023081001051022300_btad463-B6","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1186\/s13073-021-00989-6","article-title":"Novel temporal and spatial patterns of metastatic colonization from breast cancer rapid-autopsy tumor biopsies","volume":"13","author":"Huang","year":"2021","journal-title":"Genome Med"},{"key":"2023081001051022300_btad463-B7","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023081001051022300_btad463-B8","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1053\/j.semperi.2015.09.009","article-title":"Rapid whole genome sequencing and precision neonatology","volume":"39","author":"Petrikin","year":"2015","journal-title":"Semin Perinatol"},{"key":"2023081001051022300_btad463-B9","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1200\/EDBK_174176","article-title":"Precision oncology: who, how, what, when, and when not?","volume":"37","author":"Schwartzberg","year":"2017","journal-title":"Am Soc Clin Oncol Educ Book"},{"key":"2023081001051022300_btad463-B10","doi-asserted-by":"crossref","first-page":"e131","DOI":"10.1093\/nar\/gkw520","article-title":"FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing","volume":"44","author":"Shen","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023081001051022300_btad463-B11","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1101\/gr.9.8.677","article-title":"dbSNP-database for single nucleotide polymorphisms and other classes of minor genetic variation","volume":"9","author":"Sherry","year":"1999","journal-title":"Genome Res"},{"key":"2023081001051022300_btad463-B12","doi-asserted-by":"crossref","first-page":"751","DOI":"10.1093\/bioinformatics\/btw741","article-title":"SeqLib: a C ++ API for rapid BAM manipulation, sequence alignment and sequence assembly","volume":"33","author":"Wala","year":"2017","journal-title":"Bioinformatics"},{"key":"2023081001051022300_btad463-B13","doi-asserted-by":"crossref","first-page":"160025","DOI":"10.1038\/sdata.2016.25","article-title":"Extensive sequencing of seven human genomes to characterize benchmark reference materials","volume":"3","author":"Zook","year":"2016","journal-title":"Sci Data"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad463\/50975964\/btad463.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad463\/51078469\/btad463.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad463\/51078469\/btad463.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,10]],"date-time":"2023-08-10T01:05:41Z","timestamp":1691629541000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad463\/7232227"}},"subtitle":[],"editor":[{"given":"Janet","family":"Kelso","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2023,7,27]]},"references-count":13,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2023,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad463","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2023,8,1]]},"published":{"date-parts":[[2023,7,27]]},"article-number":"btad463"}}