{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T04:27:13Z","timestamp":1776054433745,"version":"3.50.1"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2019,1,21]],"date-time":"2019-01-21T00:00:00Z","timestamp":1548028800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"International Max Planck Research School for Computational Biology and Scientific Computing"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Structural variants are defined as genomic variants larger than 50\u00a0bp. They have been shown to affect more bases in any given genome than single-nucleotide polymorphisms or small insertions and deletions. Additionally, they have great impact on human phenotype and diversity and have been linked to numerous diseases. Due to their size and association with repeats, they are difficult to detect by shotgun sequencing, especially when based on short reads. Long read, single-molecule sequencing technologies like those offered by Pacific Biosciences or Oxford Nanopore Technologies produce reads with a length of several thousand base pairs. Despite the higher error rate and sequencing cost, long-read sequencing offers many advantages for the detection of structural variants. Yet, available software tools still do not fully exploit the possibilities.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We present SVIM, a tool for the sensitive detection and precise characterization of structural variants from long-read data. SVIM consists of three components for the collection, clustering and combination of structural variant signatures from read alignments. It discriminates five different variant classes including similar types, such as tandem and interspersed duplications and novel element insertions. SVIM is unique in its capability of extracting both the genomic origin and destination of duplications. It compares favorably with existing tools in evaluations on simulated data and real datasets from Pacific Biosciences and Nanopore sequencing machines.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The source code and executables of SVIM are available on Github: github.com\/eldariont\/svim. SVIM has been implemented in Python 3 and published on bioconda and the Python Package Index.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz041","type":"journal-article","created":{"date-parts":[[2019,1,16]],"date-time":"2019-01-16T23:38:58Z","timestamp":1547681938000},"page":"2907-2915","source":"Crossref","is-referenced-by-count":302,"title":["SVIM: structural variant identification using mapped long reads"],"prefix":"10.1093","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8346-9565","authenticated-orcid":false,"given":"David","family":"Heller","sequence":"first","affiliation":[{"name":"Max Planck Institute for Molecular Genetics Department of Computational Molecular Biology, , Berlin, Germany"}]},{"given":"Martin","family":"Vingron","sequence":"additional","affiliation":[{"name":"Max Planck Institute for Molecular Genetics Department of Computational Molecular Biology, , Berlin, Germany"}]}],"member":"286","published-online":{"date-parts":[[2019,1,21]]},"reference":[{"key":"2023062711302887200_btz041-B1","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","year":"2015","journal-title":"Nature"},{"key":"2023062711302887200_btz041-B2","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1038\/nrg2958","article-title":"Genome structural variation discovery and genotyping","volume":"12","author":"Alkan","year":"2011","journal-title":"Nat. Rev. Genet"},{"key":"2023062711302887200_btz041-B3","doi-asserted-by":"crossref","first-page":"1679","DOI":"10.1093\/bioinformatics\/btt198","article-title":"Rsvsim: an R\/Bioconductor package for the simulation of structural variations","volume":"29","author":"Bartenhagen","year":"2013","journal-title":"Bioinformatics"},{"key":"2023062711302887200_btz041-B4","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1145\/362342.362367","article-title":"Algorithm 457: finding all cliques of an undirected graph","volume":"16","author":"Bron","year":"1973","journal-title":"Commun. ACM"},{"key":"2023062711302887200_btz041-B5","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1038\/nrg.2015.25","article-title":"Mechanisms underlying structural variant formation in genomic disorders","volume":"17","author":"Carvalho","year":"2016","journal-title":"Nat. Rev. Genet"},{"key":"2023062711302887200_btz041-B6","doi-asserted-by":"crossref","first-page":"238.","DOI":"10.1186\/1471-2105-13-238","article-title":"Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory","volume":"13","author":"Chaisson","year":"2012","journal-title":"BMC Bioinformatics"},{"key":"2023062711302887200_btz041-B7","doi-asserted-by":"crossref","first-page":"608","DOI":"10.1038\/nature13907","article-title":"Resolving the complexity of the human genome using single-molecule sequencing","volume":"517","author":"Chaisson","year":"2015","journal-title":"Nature"},{"key":"2023062711302887200_btz041-B8","article-title":"Multi-platform discovery of haplotype-resolved structural variation in human genomes","author":"Chaisson","year":"2018"},{"key":"2023062711302887200_btz041-B9","doi-asserted-by":"crossref","first-page":"3423","DOI":"10.1093\/bioinformatics\/btr539","article-title":"Pybedtools: a flexible python library for manipulating genomic datasets and annotations","volume":"27","author":"Dale","year":"2011","journal-title":"Bioinformatics"},{"key":"2023062711302887200_btz041-B10","doi-asserted-by":"crossref","first-page":"180.","DOI":"10.1186\/1471-2105-15-180","article-title":"PBHoney: identifying genomic variants via long-read discordance and interrupted mapping","volume":"15","author":"English","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023062711302887200_btz041-B11","doi-asserted-by":"crossref","first-page":"286.","DOI":"10.1186\/s12864-015-1479-3","article-title":"Assessing structural variation in a personal genome\u2013towards a human reference diploid genome","volume":"16","author":"English","year":"2015","journal-title":"BMC Genomics"},{"key":"2023062711302887200_btz041-B12","doi-asserted-by":"crossref","first-page":"523","DOI":"10.2217\/pme.14.34","article-title":"The road from next-generation sequencing to personalized medicine","volume":"11","author":"Gonzalez-Garay","year":"2014","journal-title":"Pers. Med"},{"key":"2023062711302887200_btz041-B13","doi-asserted-by":"crossref","first-page":"567","DOI":"10.1016\/0031-3203(91)90022-W","article-title":"Symbolic clustering using a new dissimilarity measure","volume":"24","author":"Gowda","year":"1991","journal-title":"Pattern Recogn"},{"key":"2023062711302887200_btz041-B14","author":"Hagberg","year":"2008"},{"key":"2023062711302887200_btz041-B15","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1146\/annurev-genet-110711-155616","article-title":"Active transposition in genomes","volume":"46","author":"Huang","year":"2012","journal-title":"Annu. Rev. Genet"},{"key":"2023062711302887200_btz041-B16","doi-asserted-by":"crossref","first-page":"1251","DOI":"10.1534\/genetics.115.180539","article-title":"An incomplete understanding of human genetic variation","volume":"202","author":"Huddleston","year":"2016","journal-title":"Genetics"},{"key":"2023062711302887200_btz041-B17","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1101\/gr.214007.116","article-title":"Discovery and genotyping of structural variation from long-read haploid genome sequence data","volume":"27","author":"Huddleston","year":"2016","journal-title":"Genome Res"},{"key":"2023062711302887200_btz041-B18","doi-asserted-by":"crossref","first-page":"351.","DOI":"10.1038\/nmeth.3290","article-title":"Improved data analysis for the MinION nanopore sequencer","volume":"12","author":"Jain","year":"2015","journal-title":"Nat. Methods"},{"key":"2023062711302887200_btz041-B19","doi-asserted-by":"crossref","first-page":"338.","DOI":"10.1038\/nbt.4060","article-title":"Nanopore sequencing and assembly of a human genome with ultra-long reads","volume":"36","author":"Jain","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023062711302887200_btz041-B20","first-page":"7.","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"1","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2023062711302887200_btz041-B21","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023062711302887200_btz041-B22","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1101\/gr.141705.112","article-title":"Sequencing the unsequenceable: expanded CGG-repeat alleles of the fragile X gene","volume":"23","author":"Loomis","year":"2013","journal-title":"Genome Res"},{"key":"2023062711302887200_btz041-B23","doi-asserted-by":"crossref","first-page":"2875","DOI":"10.1093\/bioinformatics\/bts566","article-title":"CLEVER: clique-enumerating variant finder","volume":"28","author":"Marschall","year":"2012","journal-title":"Bioinformatics"},{"key":"2023062711302887200_btz041-B24","doi-asserted-by":"crossref","first-page":"159.","DOI":"10.1038\/gim.2017.86","article-title":"Long-read genome sequencing identifies causal structural variation in a mendelian disease","volume":"20","author":"Merker","year":"2018","journal-title":"Genet. Med"},{"key":"2023062711302887200_btz041-B25","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1093\/bib\/bbs086","article-title":"A survey of tools for variant analysis of next-generation genome sequencing data","volume":"15","author":"Pabinger","year":"2014","journal-title":"Brief. Bioinform"},{"key":"2023062711302887200_btz041-B26","doi-asserted-by":"crossref","first-page":"64.","DOI":"10.1186\/s12864-016-2366-2","article-title":"svclassify: a method to establish benchmark structural variant calls","volume":"17","author":"Parikh","year":"2016","journal-title":"BMC Genomics"},{"key":"2023062711302887200_btz041-B27","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/nature05329","article-title":"Global variation in copy number in the human genome","volume":"444","author":"Redon","year":"2006","journal-title":"Nature"},{"key":"2023062711302887200_btz041-B28","doi-asserted-by":"crossref","first-page":"R51.","DOI":"10.1186\/gb-2013-14-5-r51","article-title":"Characterizing and measuring bias in sequence data","volume":"14","author":"Ross","year":"2013","journal-title":"Genome Biol"},{"key":"2023062711302887200_btz041-B29","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1093\/bfgp\/elu016","article-title":"Structural variations in plant genomes","volume":"13","author":"Saxena","year":"2014","journal-title":"Brief. Funct. Genomics"},{"key":"2023062711302887200_btz041-B30","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1126\/science.1138659","article-title":"Strong association of de novo copy number mutations with autism","volume":"316","author":"Sebat","year":"2007","journal-title":"Science"},{"key":"2023062711302887200_btz041-B31","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1038\/s41592-018-0001-7","article-title":"Accurate detection of complex structural variations using single-molecule sequencing","volume":"15","author":"Sedlazeck","year":"2018","journal-title":"Nat. Methods"},{"key":"2023062711302887200_btz041-B32","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1038\/s41576-018-0003-4","article-title":"Piercing the dark matter: bioinformatics of long-range sequencing and mapping","volume":"19","author":"Sedlazeck","year":"2018","journal-title":"Nat. Rev. Genet"},{"key":"2023062711302887200_btz041-B33","doi-asserted-by":"crossref","first-page":"e1002236.","DOI":"10.1371\/journal.pgen.1002236","article-title":"A comprehensive map of mobile element insertion polymorphisms in humans","volume":"7","author":"Stewart","year":"2011","journal-title":"PLoS Genet"},{"key":"2023062711302887200_btz041-B34","doi-asserted-by":"crossref","first-page":"2704","DOI":"10.1093\/bioinformatics\/btw286","article-title":"Simlord: simulation of long read data","volume":"32","author":"St\u00f6cker","year":"2016","journal-title":"Bioinformatics"},{"key":"2023062711302887200_btz041-B35","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1038\/nature15394","article-title":"An integrated map of structural variation in 2, 504 human genomes","volume":"526","author":"Sudmant","year":"2015","journal-title":"Nature"},{"key":"2023062711302887200_btz041-B36","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1038\/nrg3373","article-title":"Phenotypic impact of genomic structural variation: insights from and for human disease","volume":"14","author":"Weischenfeldt","year":"2013","journal-title":"Nat. Rev. Genet"},{"key":"2023062711302887200_btz041-B37","doi-asserted-by":"crossref","first-page":"1894","DOI":"10.1101\/gr.177774.114","article-title":"The landscape of human STR variation","volume":"24","author":"Willems","year":"2014","journal-title":"Genome Res"},{"key":"2023062711302887200_btz041-B38","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1038\/nbt.2835","article-title":"Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls","volume":"32","author":"Zook","year":"2014","journal-title":"Nat. Biotechnol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/17\/2907\/50720009\/bioinformatics_35_17_2907.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/17\/2907\/50720009\/bioinformatics_35_17_2907.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T07:32:54Z","timestamp":1687851174000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/17\/2907\/5298305"}},"subtitle":[],"editor":[{"given":"Inanc","family":"Birol","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,1,21]]},"references-count":38,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2019,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz041","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/494096","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,9,1]]},"published":{"date-parts":[[2019,1,21]]}}}