{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,10]],"date-time":"2025-11-10T20:59:55Z","timestamp":1762808395563},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"9","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: String and de Bruijn graphs are two graph models used by most genome assemblers. At present, none of the existing assemblers clearly outperforms the others across all datasets. We found that although a string graph can make use of entire reads for resolving repeats, de Bruijn graphs can naturally assemble through regions that are error-prone due to sequencing bias.<\/jats:p>\n               <jats:p>Results: We developed a novel assembler called StriDe that has advantages of both string and de Bruijn graphs. First, the reads are decomposed adaptively only in error-prone regions. Second, each paired-end read is extended into a long read directly using an FM-index. The decomposed and extended reads are used to build an assembly graph. In addition, several essential components of an assembler were designed or improved. The resulting assembler was fully parallelized, tested and compared with state-of-the-art assemblers using benchmark datasets. The results indicate that contiguity of StriDe is comparable with top assemblers on both short-read and long-read datasets, and the assembly accuracy is high in comparison with the others.<\/jats:p>\n               <jats:p>Availability and implementation: \u00a0https:\/\/github.com\/ythuang0522\/StriDe<\/jats:p>\n               <jats:p>Contact: ythuang@cs.ccu.edu.tw<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw011","type":"journal-article","created":{"date-parts":[[2016,2,15]],"date-time":"2016-02-15T01:09:07Z","timestamp":1455498547000},"page":"1301-1307","source":"Crossref","is-referenced-by-count":12,"title":["Integration of string and de Bruijn graphs for genome assembly"],"prefix":"10.1093","volume":"32","author":[{"given":"Yao-Ting","family":"Huang","sequence":"first","affiliation":[{"name":"Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan"}]},{"given":"Chen-Fu","family":"Liao","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan"}]}],"member":"286","published-online":{"date-parts":[[2016,1,10]]},"reference":[{"key":"2023020112225638700_btw011-B1","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1089\/cmb.2012.0021","article-title":"Spades: a new genome assembly algorithm and its applications to single-cell sequencing","volume":"19","author":"Bankevich","year":"2012","journal-title":"J. Comput. Biol"},{"key":"2023020112225638700_btw011-B2","first-page":"1","article-title":"Assemblathon 2 assemblies","volume":"2","author":"Bradnam","year":"2013","journal-title":"GigaScience Datab"},{"key":"2023020112225638700_btw011-B3","doi-asserted-by":"crossref","first-page":"810","DOI":"10.1101\/gr.7337908","article-title":"Allpaths: de novo assembly of whole-genome shotgun microreads","volume":"18","author":"Butler","year":"2008","journal-title":"Genome Res"},{"key":"2023020112225638700_btw011-B4","doi-asserted-by":"crossref","first-page":"2224","DOI":"10.1101\/gr.126599.111","article-title":"Assemblathon 1: a competitive assessment of de novo short read assembly methods","volume":"21","author":"Earl","year":"2011","journal-title":"Genome Res"},{"key":"2023020112225638700_btw011-B5","first-page":"390","author":"Ferragina","year":"2000"},{"key":"2023020112225638700_btw011-B6","doi-asserted-by":"crossref","first-page":"1072","DOI":"10.1093\/bioinformatics\/btt086","article-title":"QUAST: quality assessment tool for genome assemblies","volume":"29","author":"Gurevich","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020112225638700_btw011-B7","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1093\/jhered\/esp086","article-title":"Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species","volume":"100","author":"Haussler","year":"2009","journal-title":"J. Hered"},{"key":"2023020112225638700_btw011-B8","doi-asserted-by":"crossref","first-page":"3274","DOI":"10.1093\/bioinformatics\/btu541","article-title":"Fast construction of fm-index for long sequence reads","volume":"30","author":"Li","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020112225638700_btw011-B9","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1186\/2047-217X-1-18","article-title":"Soapdenovo2: an empirically improved memory-efficient short-read de novo assembler","volume":"1","author":"Luo","year":"2012","journal-title":"GigaScience"},{"key":"2023020112225638700_btw011-B10","doi-asserted-by":"crossref","first-page":"1718","DOI":"10.1093\/bioinformatics\/btt273","article-title":"Gage-b: an evaluation of genome assemblers for bacterial organisms","volume":"29","author":"Magoc","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020112225638700_btw011-B11","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1038\/nrg2626","article-title":"Sequencing technologies \u2013 the next generation","volume":"11","author":"Metzker","year":"2010","journal-title":"Nat. Rev. Genet"},{"key":"2023020112225638700_btw011-B12","doi-asserted-by":"crossref","first-page":"2818","DOI":"10.1093\/bioinformatics\/btn548","article-title":"Aggressive assembly of pyrosequencing reads with mates","volume":"24","author":"Miller","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020112225638700_btw011-B13","doi-asserted-by":"crossref","first-page":"ii79","DOI":"10.1093\/bioinformatics\/bti1114","article-title":"The fragment assembly string graph","volume":"21","author":"Myers","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020112225638700_btw011-B14","doi-asserted-by":"crossref","first-page":"R55","DOI":"10.1186\/gb-2008-9-3-r55","article-title":"Genome assembly forensics: finding the elusive mis-assembly","volume":"9","author":"Phillippy","year":"2008","journal-title":"Genome Biol"},{"key":"2023020112225638700_btw011-B15","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1101\/gr.131383.111","article-title":"Gage: a critical evaluation of genome assemblies and assembly algorithms","volume":"22","author":"Salzberg","year":"2012","journal-title":"Genome Res"},{"key":"2023020112225638700_btw011-B16","doi-asserted-by":"crossref","first-page":"e37","DOI":"10.1093\/nar\/gku1341","article-title":"Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform","volume":"43","author":"Schirmer","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020112225638700_btw011-B17","doi-asserted-by":"crossref","first-page":"1117","DOI":"10.1101\/gr.089532.108","article-title":"Abyss: a parallel assembler for short read sequence data","volume":"19","author":"Simpson","year":"2009","journal-title":"Genome Res"},{"key":"2023020112225638700_btw011-B18","doi-asserted-by":"crossref","first-page":"i367","DOI":"10.1093\/bioinformatics\/btq217","article-title":"Efficient construction of an assembly string graph using the fm-index","volume":"26","author":"Simpson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020112225638700_btw011-B19","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1101\/gr.126953.111","article-title":"Efficient de novo assembly of large genomes using compressed data structures","volume":"22","author":"Simpson","year":"2012","journal-title":"Genome Res"},{"key":"2023020112225638700_btw011-B20","doi-asserted-by":"crossref","first-page":"821","DOI":"10.1101\/gr.074492.107","article-title":"Velvet: algorithms for de novo short read assembly using de Bruijn graphs","volume":"18","author":"Zerbino","year":"2008","journal-title":"Genome Res"},{"key":"2023020112225638700_btw011-B21","doi-asserted-by":"crossref","first-page":"2669","DOI":"10.1093\/bioinformatics\/btt476","article-title":"The MaSuRCA genome assembler","volume":"29","author":"Zimin","year":"2013","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/9\/1301\/49019521\/bioinformatics_32_9_1301.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/9\/1301\/49019521\/bioinformatics_32_9_1301.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T22:28:37Z","timestamp":1675290517000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/9\/1301\/1744507"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,1,10]]},"references-count":21,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2016,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw011","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,5,1]]},"published":{"date-parts":[[2016,1,10]]}}}