{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,19]],"date-time":"2026-01-19T10:18:22Z","timestamp":1768817902080,"version":"3.49.0"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"14","license":[{"start":{"date-parts":[[2019,7,8]],"date-time":"2019-07-08T00:00:00Z","timestamp":1562544000000},"content-version":"vor","delay-in-days":7,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100006769","name":"Russian Science Foundation","doi-asserted-by":"publisher","award":["19\u201314\u201300172"],"award-info":[{"award-number":["19\u201314\u201300172"]}],"id":[{"id":"10.13039\/501100006769","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The recently developed barcoding-based synthetic long read (SLR) technologies have already found many applications in genome assembly and analysis. However, although some new barcoding protocols are emerging and the range of SLR applications is being expanded, the existing SLR assemblers are optimized for a narrow range of parameters and are not easily extendable to new barcoding technologies and new applications such as metagenomics or hybrid assembly.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We describe the algorithmic challenge of the SLR assembly and present a cloudSPAdes algorithm for SLR assembly that is based on analyzing the de Bruijn graph of SLRs. We benchmarked cloudSPAdes across various barcoding technologies\/applications and demonstrated that it improves on the state-of-the-art SLR assemblers in accuracy and speed.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Source code and installation manual for cloudSPAdes are available at https:\/\/github.com\/ablab\/spades\/releases\/tag\/cloudspades-paper.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary Information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz349","type":"journal-article","created":{"date-parts":[[2019,5,15]],"date-time":"2019-05-15T11:11:23Z","timestamp":1557918683000},"page":"i61-i70","source":"Crossref","is-referenced-by-count":33,"title":["cloudSPAdes: assembly of synthetic long reads using de Bruijn graphs"],"prefix":"10.1093","volume":"35","author":[{"given":"Ivan","family":"Tolstoganov","sequence":"first","affiliation":[{"name":"Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia"}]},{"given":"Anton","family":"Bankevich","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University of California at San Diego, La Jolla, CA, USA"}]},{"given":"Zhoutao","family":"Chen","sequence":"additional","affiliation":[{"name":"Universal Sequencing Technology Corporation, Carlsbad, CA, USA"}]},{"given":"Pavel A","family":"Pevzner","sequence":"additional","affiliation":[{"name":"Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia"},{"name":"Department of Computer Science and Engineering, University of California at San Diego, La Jolla, CA, USA"}]}],"member":"286","published-online":{"date-parts":[[2019,7,5]]},"reference":[{"key":"2023062712441068200_btz349-B1","doi-asserted-by":"crossref","first-page":"2041","DOI":"10.1101\/gr.178319.114","article-title":"In vitro, long-range sequence information for De Novo genome assembly via transposase contiguity","volume":"24","author":"Adey","year":"2014","journal-title":"Genome Res"},{"key":"2023062712441068200_btz349-B2","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1007\/BF01188581","article-title":"Physical mapping of chromosomes: a combinatorial problem in molecular biology","volume":"13","author":"Alizadeh","year":"1995","journal-title":"Algorithmica"},{"key":"2023062712441068200_btz349-B3","doi-asserted-by":"crossref","first-page":"26775.","DOI":"10.1038\/srep26775","article-title":"An improved method for high quality metagenomics DNA extraction from human and environmental samples","volume":"6","author":"Bag","year":"2016","journal-title":"Sci. Rep"},{"key":"2023062712441068200_btz349-B4","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1038\/nmeth.3737","article-title":"TruSPAdes: barcode assembly of TruSeq synthetic long reads","volume":"13","author":"Bankevich","year":"2016","journal-title":"Nat. Methods"},{"key":"2023062712441068200_btz349-B5","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1016\/j.cels.2018.06.009","article-title":"Joint analysis of long and short reads enables accurate estimates of microbiome complexity","volume":"7","author":"Bankevich","year":"2018","journal-title":"Cell Syst"},{"key":"2023062712441068200_btz349-B6","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1089\/cmb.2012.0021","article-title":"SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing","volume":"19","author":"Bankevich","year":"2012","journal-title":"J. Comput. Biol"},{"key":"2023062712441068200_btz349-B7","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1007\/3-540-48452-3_5","volume-title":"Combinatorial Pattern Matching","author":"Batzoglou","year":"1999"},{"key":"2023062712441068200_btz349-B9","doi-asserted-by":"crossref","first-page":"1067","DOI":"10.1038\/nbt.4266","article-title":"High-quality genome sequences of uncultured microbes by assembly of read clouds","volume":"36","author":"Bishara","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023062712441068200_btz349-B10","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1101\/gr.235499.118","article-title":"Minerva: an alignment- and reference-free approach to deconvolve linked-reads for metagenomics","volume":"29","author":"Danko","year":"2019","journal-title":"Genome Res"},{"key":"2023062712441068200_btz349-B11","volume-title":"Eulerian Graphs and Related Topics","author":"Fleischner","year":"1990"},{"key":"2023062712441068200_btz349-B12","doi-asserted-by":"crossref","first-page":"e0190853.","DOI":"10.1371\/journal.pone.0190853","article-title":"Culture-independent analysis of liver abscess using nanopore sequencing","volume":"13","author":"Gong","year":"2018","journal-title":"PLoS One"},{"key":"2023062712441068200_btz349-B13","doi-asserted-by":"crossref","first-page":"2594.","DOI":"10.3389\/fmicb.2017.02594","article-title":"In situ field sequencing and life detection in remote (79\u201326\u2019N) Canadian high arctic permafrost ice wedge microbial communities","volume":"8","author":"Goordial","year":"2017","journal-title":"Front. Microbiol"},{"key":"2023062712441068200_btz349-B14","doi-asserted-by":"crossref","first-page":"1072","DOI":"10.1093\/bioinformatics\/btt086","article-title":"QUAST: quality assessment tool for genome assemblies","volume":"29","author":"Gurevich","year":"2013","journal-title":"Bioinformatics"},{"key":"2023062712441068200_btz349-B15","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1101\/gr.215087.116","article-title":"Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation","volume":"27","author":"Koren","year":"2017","journal-title":"Genome Res"},{"key":"2023062712441068200_btz349-B16","author":"Koslicki","year":"2017"},{"key":"2023062712441068200_btz349-B17","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.mimet.2017.10.007","article-title":"Comparison of ten different DNA extraction procedures with respect to their suitability for environmental samples","volume":"143","author":"Kuhn","year":"2017","journal-title":"J. Microbiol. Methods"},{"key":"2023062712441068200_btz349-B19","doi-asserted-by":"crossref","first-page":"i216","DOI":"10.1093\/bioinformatics\/btw267","article-title":"Genome assembly from synthetic long read clouds","volume":"32","author":"Kuleshov","year":"2016","journal-title":"Bioinformatics"},{"key":"2023062712441068200_btz349-B20","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1101\/gr.234443.118","article-title":"Resolving the full spectrum of human genome variation using linked-reads","volume":"29","author":"Marks","year":"2018","journal-title":"Genome Res"},{"key":"2023062712441068200_btz349-B21","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1089\/cmb.1999.6.237","article-title":"Construction of physical maps from oligonucleotide fingerprints data","volume":"6","author":"Mayraz","year":"1999","journal-title":"J. Comput. Biol"},{"key":"2023062712441068200_btz349-B23","doi-asserted-by":"crossref","first-page":"1088","DOI":"10.1093\/bioinformatics\/btv697","article-title":"MetaQUAST: evaluation of metagenome assemblies","volume":"32","author":"Mikheenko","year":"2016","journal-title":"Bioinformatics"},{"key":"2023062712441068200_btz349-B25","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1101\/gr.213959.116","article-title":"metaSPAdes: a new versatile metagenomic assembler","volume":"27","author":"Nurk","year":"2017","journal-title":"Genome Res"},{"key":"2023062712441068200_btz349-B26","doi-asserted-by":"crossref","first-page":"D733","DOI":"10.1093\/nar\/gkv1189","article-title":"Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation","volume":"44","author":"O\u2019Leary","year":"2016","journal-title":"Nucleic Acids Res"},{"key":"2023062712441068200_btz349-B27","doi-asserted-by":"crossref","first-page":"132.","DOI":"10.1186\/s13059-016-0997-x","article-title":"Mash: fast genome and metagenome distance estimation using MinHash","volume":"17","author":"Ondov","year":"2016","journal-title":"Genome Biol"},{"key":"2023062712441068200_btz349-B28","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/2022.001.0001","volume-title":"Computational Molecular Biology: An Algorithmic Approach","author":"Pevzner","year":"2000"},{"key":"2023062712441068200_btz349-B30","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1109\/TCBB.2016.2528239","article-title":"Algorithms and complexity results for genome mapping problems","volume":"14","author":"Rajaraman","year":"2017","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinformatics"},{"key":"2023062712441068200_btz349-B31","doi-asserted-by":"crossref","first-page":"4893","DOI":"10.1002\/j.1460-2075.1995.tb00170.x","article-title":"The phage Mu transpososome core: DNA requirements for assembly and function","volume":"14","author":"Savilahti","year":"1995","journal-title":"EMBO J"},{"key":"2023062712441068200_btz349-B32","doi-asserted-by":"crossref","first-page":"534","DOI":"10.1101\/gr.183012.114","article-title":"Accurate, multi-kb reads resolve complex populations and detect rare microorganisms","volume":"25","author":"Sharon","year":"2015","journal-title":"Genome Res"},{"key":"2023062712441068200_btz349-B33","doi-asserted-by":"crossref","first-page":"e00569.","DOI":"10.7554\/eLife.00569","article-title":"The genome sequence of the colonial chordate, Botryllus schlosseri","volume":"2","author":"Voskoboynik","year":"2013","journal-title":"Elife"},{"key":"2023062712441068200_btz349-B34","doi-asserted-by":"crossref","first-page":"757","DOI":"10.1101\/gr.214874.116","article-title":"Direct determination of diploid genome sequences","volume":"27","author":"Weisenfeld","year":"2017","journal-title":"Genome Res"},{"key":"2023062712441068200_btz349-B35","doi-asserted-by":"crossref","first-page":"725","DOI":"10.1093\/bioinformatics\/btx675","article-title":"ARCS: scaffolding genome drafts with linked reads","volume":"34","author":"Yeo","year":"2018","journal-title":"Bioinformatics"},{"key":"2023062712441068200_btz349-B36","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1038\/nbt.3432","article-title":"Haplotyping germline and cancer genomes with high-throughput linked-read sequencing","volume":"34","author":"Zheng","year":"2016","journal-title":"Nat. Biotechnol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/14\/i61\/50721647\/bioinformatics_35_14_i61.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/14\/i61\/50721647\/bioinformatics_35_14_i61.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T12:45:15Z","timestamp":1687869915000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/14\/i61\/5529115"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,7]]},"references-count":31,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2019,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz349","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,7]]},"published":{"date-parts":[[2019,7]]}}}