{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,13]],"date-time":"2026-03-13T07:18:18Z","timestamp":1773386298566,"version":"3.50.1"},"reference-count":21,"publisher":"Oxford University Press (OUP)","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,3,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Several recent studies have demonstrated the effectiveness of resequencing and single nucleotide variant (SNV) detection by deep short-read sequencing platforms. While several reliable algorithms are available for automated SNV detection, the automated detection of microindels in deep short-read data presents a new bioinformatics challenge.<\/jats:p>\n               <jats:p>Results: We systematically analyzed how the short-read mapping tools MAQ, Bowtie, Burrows-Wheeler alignment tool (BWA), Novoalign and RazerS perform on simulated datasets that contain indels and evaluated how indels affect error rates in SNV detection. We implemented a simple algorithm to compute the equivalent indel region eir, which can be used to process the alignments produced by the mapping tools in order to perform indel calling. Using simulated data that contains indels, we demonstrate that indel detection works well on short-read data: the detection rate for microindels (&amp;lt;4 bp) is &amp;gt;90%. Our study provides insights into systematic errors in SNV detection that is based on ungapped short sequence read alignments. Gapped alignments of short sequence reads can be used to reduce this error and to detect microindels in simulated short-read data. A comparison with microindels automatically identified on the ABI Sanger and Roche 454 platform indicates that microindel detection from short sequence reads identifies both overlapping and distinct indels.<\/jats:p>\n               <jats:p>Contact: \u00a0peter.krawitz@googlemail.com; peter.robinson@charite.de<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq027","type":"journal-article","created":{"date-parts":[[2010,2,10]],"date-time":"2010-02-10T01:33:54Z","timestamp":1265765634000},"page":"722-729","source":"Crossref","is-referenced-by-count":88,"title":["Microindel detection in short-read sequence data"],"prefix":"10.1093","volume":"26","author":[{"given":"Peter","family":"Krawitz","sequence":"first","affiliation":[{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"}]},{"given":"Christian","family":"R\u00f6delsperger","sequence":"additional","affiliation":[{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"}]},{"given":"Marten","family":"J\u00e4ger","sequence":"additional","affiliation":[{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"}]},{"given":"Luke","family":"Jostins","sequence":"additional","affiliation":[{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"}]},{"given":"Sebastian","family":"Bauer","sequence":"additional","affiliation":[{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"}]},{"given":"Peter N.","family":"Robinson","sequence":"additional","affiliation":[{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"},{"name":"1 Institute for Medical Genetics, Charit\u00e9-Universit\u00e4tsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, 2 Berlin-Brandenburg Center for Regenerative Therapies, Augustenburger Platz 1, 13353 Berlin, 3 Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin and 4 Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK"}]}],"member":"286","published-online":{"date-parts":[[2010,2,9]]},"reference":[{"key":"2023012508015684400_B1","doi-asserted-by":"crossref","first-page":"1622","DOI":"10.1101\/gr.092197.109","article-title":"The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group","volume":"19","author":"Ahn","year":"2009","journal-title":"Genome Res."},{"key":"2023012508015684400_B2","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1002\/humu.20212","article-title":"Microdeletions and microinsertions causing human genetic disease: common mechanisms of mutagenesis and the role of local DNA sequence complexity","volume":"26","author":"Ball","year":"2005","journal-title":"Hum. Mutat."},{"key":"2023012508015684400_B3","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1038\/nature07517","article-title":"Accurate whole human genome sequencing using reversible terminator chemistry","volume":"456","author":"Bentley","year":"2008","journal-title":"Nature"},{"key":"2023012508015684400_B4","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1093\/hmg\/ddi006","article-title":"Comprehensive identification and characterization of diallelic insertion-deletion polymorphisms in 330 human candidate genes","volume":"14","author":"Bhangale","year":"2005","journal-title":"Hum. Mol. Genet."},{"key":"2023012508015684400_B5","volume-title":"Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids.","author":"Durbin","year":"1999"},{"key":"2023012508015684400_B6","doi-asserted-by":"crossref","first-page":"R32","DOI":"10.1186\/gb-2009-10-3-r32","article-title":"Evaluation of next generation sequencing platforms for population targeted sequencing studies","volume":"10","author":"Harismendy","year":"2009","journal-title":"Genome Biol."},{"key":"2023012508015684400_B7","author":"Hercus","year":"2009"},{"key":"2023012508015684400_B8","doi-asserted-by":"crossref","first-page":"3672","DOI":"10.1093\/nar\/gkg617","article-title":"mreps: efficient and flexible detection of tandem repeats in DNA","volume":"31","author":"Kolpakov","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012508015684400_B9","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1126\/science.1149504","article-title":"Paired-end mapping reveals extensive structural variation in the human genome","volume":"318","author":"Korbel","year":"2007","journal-title":"Science"},{"key":"2023012508015684400_B10","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1089\/106652703321825937","article-title":"The mutation process of microsatellites during the polymerase chain reaction","volume":"10","author":"Lai","year":"2003","journal-title":"J. Comput. Biol."},{"key":"2023012508015684400_B11","doi-asserted-by":"crossref","first-page":"R25","DOI":"10.1186\/gb-2009-10-3-r25","article-title":"Ultrafast and memory-efficient alignment of short DNA sequences to the human genome","volume":"10","author":"Langmead","year":"2009","journal-title":"Genome Biol."},{"key":"2023012508015684400_B12","doi-asserted-by":"crossref","first-page":"e254","DOI":"10.1371\/journal.pbio.0050254","article-title":"The diploid genome sequence of an individual human","volume":"5","author":"Levy","year":"2007","journal-title":"PLoS Biol."},{"key":"2023012508015684400_B13","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btp698","article-title":"Fast and accurate long read alignment with Burrows-Wheeler transform","author":"Li","year":"2010","journal-title":"Bioinformatics."},{"key":"2023012508015684400_B14","doi-asserted-by":"crossref","first-page":"1851","DOI":"10.1101\/gr.078212.108","article-title":"Mapping short DNA sequencing reads and calling variants using mapping quality scores","volume":"18","author":"Li","year":"2008","journal-title":"Genome Res."},{"key":"2023012508015684400_B15","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1146\/annurev.genom.9.081307.164359","article-title":"Next-generation DNA sequencing methods","volume":"9","author":"Mardis","year":"2008","journal-title":"Annu. Rev. Genomics Hum. Genet."},{"key":"2023012508015684400_B16","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature03959","article-title":"Genome sequencing in microfabricated high-density picolitre reactors","volume":"437","author":"Margulies","year":"2005","journal-title":"Nature"},{"key":"2023012508015684400_B17","doi-asserted-by":"crossref","first-page":"2024","DOI":"10.1101\/gr.080200.108","article-title":"Sequencing of natural strains of Arabidopsis thaliana with short reads","volume":"18","author":"Ossowski","year":"2008","journal-title":"Genome Res."},{"key":"2023012508015684400_B18","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/nature05329","article-title":"Global variation in copy number in the human genome","volume":"444","author":"Redon","year":"2006","journal-title":"Nature"},{"key":"2023012508015684400_B19","doi-asserted-by":"crossref","first-page":"974","DOI":"10.1093\/nar\/gkg178","article-title":"Taq DNA polymerase slippage mutation rates measured by PCR and quasi-likelihood analysis: (CA\/GT)n and (A\/T)n microsatellites","volume":"31","author":"Shinde","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012508015684400_B20","doi-asserted-by":"crossref","first-page":"1646","DOI":"10.1101\/gr.088823.108","article-title":"RazerS\u2013fast read mapping with sensitivity control","volume":"19","author":"Weese","year":"2009","journal-title":"Genome Res."},{"key":"2023012508015684400_B21","doi-asserted-by":"crossref","first-page":"872","DOI":"10.1038\/nature06884","article-title":"The complete genome of an individual by massively parallel DNA sequencing","volume":"452","author":"Wheeler","year":"2008","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/6\/722\/48855505\/bioinformatics_26_6_722.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/6\/722\/48855505\/bioinformatics_26_6_722.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:02:15Z","timestamp":1674633735000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/6\/722\/244455"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,2,9]]},"references-count":21,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2010,3,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq027","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,3,15]]},"published":{"date-parts":[[2010,2,9]]}}}