{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T01:15:37Z","timestamp":1773278137986,"version":"3.50.1"},"reference-count":17,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":772,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/3.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2014,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Automatic error correction of high-throughput sequencing data can have a dramatic impact on the amount of usable base pairs and their quality. It has been shown that the performance of tasks such as de novo genome assembly and SNP calling can be dramatically improved after read error correction. While a large number of methods specialized for correcting substitution errors as found in Illumina data exist, few methods for the correction of indel errors, common to technologies like 454 or Ion Torrent, have been proposed.<\/jats:p>\n               <jats:p>Results: We present Fiona, a new stand-alone read error\u2013correction method. Fiona provides a new statistical approach for sequencing error detection and optimal error correction and estimates its parameters automatically. Fiona is able to correct substitution, insertion and deletion errors and can be applied to any sequencing technology. It uses an efficient implementation of the partial suffix array to detect read overlaps with different seed lengths in parallel. We tested Fiona on several real datasets from a variety of organisms with different read lengths and compared its performance with state-of-the-art methods. Fiona shows a constantly higher correction accuracy over a broad range of datasets from 454 and Ion Torrent sequencers, without compromise in speed.<\/jats:p>\n               <jats:p>Conclusion: Fiona is an accurate parameter-free read error\u2013correction method that can be run on inexpensive hardware and can make use of multicore parallelization whenever available. Fiona was implemented using the SeqAn library for sequence analysis and is publicly available for download at http:\/\/www.seqan.de\/projects\/fiona.<\/jats:p>\n               <jats:p>Contact: mschulz@mmci.uni-saarland.de or hugues.richard@upmc.fr<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu440","type":"journal-article","created":{"date-parts":[[2014,8,26]],"date-time":"2014-08-26T11:23:57Z","timestamp":1409052237000},"page":"i356-i363","source":"Crossref","is-referenced-by-count":59,"title":["Fiona: a parallel and automatic strategy for read error correction"],"prefix":"10.1093","volume":"30","author":[{"given":"Marcel H.","family":"Schulz","sequence":"first","affiliation":[{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"},{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"}]},{"given":"David","family":"Weese","sequence":"additional","affiliation":[{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"}]},{"given":"Manuel","family":"Holtgrewe","sequence":"additional","affiliation":[{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"}]},{"given":"Viktoria","family":"Dimitrova","sequence":"additional","affiliation":[{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"},{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"}]},{"given":"Sijia","family":"Niu","sequence":"additional","affiliation":[{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"},{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"}]},{"given":"Knut","family":"Reinert","sequence":"additional","affiliation":[{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"}]},{"given":"Hugues","family":"Richard","sequence":"additional","affiliation":[{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"},{"name":"1 \u2018Multimodal Computing and Interaction\u2019, Saarland University & Department for Computational Biology and Applied Computing, Max Planck Institute for Informatics, Saarbr\u00fccken, 66123 Saarland, Germany, 2Ray and Stephanie Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, 15206 PA, USA, 3Department of Mathematics and Computer Science, Freie Universit\u00e4t Berlin, 14195 Berlin, Germany, 4Universit\u00e9 Pierre et Marie Curie, UMR7238, CNRS-UPMC, Paris, France and 5CNRS, UMR7238, Laboratory of Computational and Quantitative Biology, Paris, France"}]}],"member":"286","published-online":{"date-parts":[[2014,8,22]]},"reference":[{"key":"2023012711525634100_btu440-B1","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/S1570-8667(03)00065-0","article-title":"Replacing suffix trees with enhanced suffix arrays","volume":"2","author":"Abouelhoda","year":"2004","journal-title":"J. Discrete Alg."},{"key":"2023012711525634100_btu440-B2","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1101\/gr.7088808","article-title":"Short read fragment assembly of bacterial genomes","volume":"18","author":"Chaisson","year":"2008","journal-title":"Genome Res"},{"key":"2023012711525634100_btu440-B3","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/1471-2105-9-11","article-title":"SeqAn an efficient, generic C++ library for sequence analysis","volume":"9","author":"D\u00f6ring","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012711525634100_btu440-B4","doi-asserted-by":"crossref","first-page":"1513","DOI":"10.1073\/pnas.1017351108","article-title":"High-quality draft assemblies of mammalian genomes from massively parallel sequence data","volume":"108","author":"Gnerre","year":"2010","journal-title":"PNAS"},{"key":"2023012711525634100_btu440-B5","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1093\/bioinformatics\/btq653","article-title":"HiTEC: accurate error correction in high-throughput sequencing data","volume":"27","author":"Ilie","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012711525634100_btu440-B6","doi-asserted-by":"crossref","first-page":"1181","DOI":"10.1101\/gr.111351.110","article-title":"ECHO: a reference-free short-read error correction algorithm","volume":"21","author":"Kao","year":"2011","journal-title":"Genome Res."},{"key":"2023012711525634100_btu440-B7","doi-asserted-by":"crossref","first-page":"R116","DOI":"10.1186\/gb-2010-11-11-r116","article-title":"Quake: quality-aware detection and correction of sequencing errors","volume":"11","author":"Kelley","year":"2010","journal-title":"Genome Biol."},{"key":"2023012711525634100_btu440-B8","doi-asserted-by":"crossref","first-page":"R112","DOI":"10.1186\/gb-2011-12-11-r112","article-title":"Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems","volume":"12","author":"Minoche","year":"2011","journal-title":"Genome Biol."},{"key":"2023012711525634100_btu440-B9","doi-asserted-by":"crossref","first-page":"9748","DOI":"10.1073\/pnas.171285098","article-title":"An eulerian path approach to dna fragment assembly","volume":"98","author":"Pevzner","year":"2001","journal-title":"PNAS"},{"key":"2023012711525634100_btu440-B10","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1186\/1471-2164-13-341","article-title":"A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers","volume":"13","author":"Quail","year":"2012","journal-title":"BMC Genomics"},{"key":"2023012711525634100_btu440-B11","doi-asserted-by":"crossref","first-page":"1284","DOI":"10.1093\/bioinformatics\/btq151","article-title":"Correction of sequencing errors in a mixed set of reads","volume":"26","author":"Salmela","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012711525634100_btu440-B12","doi-asserted-by":"crossref","first-page":"1455","DOI":"10.1093\/bioinformatics\/btr170","article-title":"Correcting errors in short reads by multiple alignments","volume":"27","author":"Salmela","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012711525634100_btu440-B13","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1101\/gr.131383.111","article-title":"GAGE: a critical evaluation of genome assemblies and assembly algorithms","volume":"22","author":"Salzberg","year":"2012","journal-title":"Genome Res."},{"key":"2023012711525634100_btu440-B14","doi-asserted-by":"crossref","first-page":"2157","DOI":"10.1093\/bioinformatics\/btp379","article-title":"SHREC: a short-read error correction method","volume":"25","author":"Schr\u00f6der","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012711525634100_btu440-B15","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1186\/1471-2105-13-185","article-title":"Estimation of sequencing error rates in short reads","volume":"13","author":"Wang","year":"2012","journal-title":"BMC Bioinformatics"},{"key":"2023012711525634100_btu440-B16","doi-asserted-by":"crossref","first-page":"2526","DOI":"10.1093\/bioinformatics\/btq468","article-title":"Reptile: representative tiling for short read error correction","volume":"26","author":"Yang","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012711525634100_btu440-B17","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1093\/bib\/bbs015","article-title":"A survey of error-correction methods for next-generation sequencing","volume":"14","author":"Yang","year":"2013","journal-title":"Brief. Bioinform."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/17\/i356\/48927319\/bioinformatics_30_17_i356.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/30\/17\/i356\/48927319\/bioinformatics_30_17_i356.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T12:16:22Z","timestamp":1674821782000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/30\/17\/i356\/199558"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,8,22]]},"references-count":17,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2014,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu440","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2014,9,1]]},"published":{"date-parts":[[2014,8,22]]}}}