{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T12:29:24Z","timestamp":1763036964888},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Advances of next generation sequencing technologies and availability of short read data enable the detection of structural variations (SVs). Deletions, an important type of SVs, have been suggested in association with genetic diseases. There are three types of deletions: blunt deletions, deletions with microhomologies and deletions with microsinsertions. The last two types are very common in the human genome, but they pose difficulty for the detection. Furthermore, finding deletions from sequencing data remains challenging. It is highly appealing to develop sensitive and accurate methods to detect deletions from sequencing data, especially deletions with microhomology and deletions with microinsertion.<\/jats:p><jats:p>Results: We present a novel method called Sprites (SPlit Read re-alIgnment To dEtect Structural variants) which finds deletions from sequencing data. It aligns a whole soft-clipping read rather than its clipped part to the target sequence, a segment of the reference which is determined by spanning reads, in order to find the longest prefix or suffix of the read that has a match in the target sequence. This alignment aims to solve the problem of deletions with microhomologies and deletions with microinsertions. Using both simulated and real data we show that Sprites performs better on detecting deletions compared with other current methods in terms of F-score.<\/jats:p><jats:p>Availability and implementation: Sprites is open source software and freely available at https:\/\/github.com\/zhangzhen\/sprites.<\/jats:p><jats:p>Contact: \u00a0jxwang@mail.csu.edu.cn<\/jats:p><jats:p>Supplementary data: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw053","type":"journal-article","created":{"date-parts":[[2016,2,15]],"date-time":"2016-02-15T01:09:07Z","timestamp":1455498547000},"page":"1788-1796","source":"Crossref","is-referenced-by-count":17,"title":["Sprites: detection of deletions from sequencing data by re-aligning split reads"],"prefix":"10.1093","volume":"32","author":[{"given":"Zhen","family":"Zhang","sequence":"first","affiliation":[{"name":"1 School of Information Science and Engineering, Central South University, Changsha, 410083, China,"},{"name":"2 College of Information and Communication Engineering, Hunan Institute of Science and Technology, Yueyang, 414006, China,"}]},{"given":"Jianxin","family":"Wang","sequence":"additional","affiliation":[{"name":"1 School of Information Science and Engineering, Central South University, Changsha, 410083, China,"}]},{"given":"Junwei","family":"Luo","sequence":"additional","affiliation":[{"name":"1 School of Information Science and Engineering, Central South University, Changsha, 410083, China,"}]},{"given":"Xiaojun","family":"Ding","sequence":"additional","affiliation":[{"name":"1 School of Information Science and Engineering, Central South University, Changsha, 410083, China,"}]},{"given":"Jiancheng","family":"Zhong","sequence":"additional","affiliation":[{"name":"1 School of Information Science and Engineering, Central South University, Changsha, 410083, China,"}]},{"given":"Jun","family":"Wang","sequence":"additional","affiliation":[{"name":"3 Department of Molecular Physiology & Biophysics, Baylor College of Medicine, Houston, TX 77030, USA,"}]},{"given":"Fang-Xiang","family":"Wu","sequence":"additional","affiliation":[{"name":"4 Department of Mechanical Engineering and Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada and"}]},{"given":"Yi","family":"Pan","sequence":"additional","affiliation":[{"name":"5 Department of Computer Science, Georgia State University, Atlanta, GA 30302-4110, USA"}]}],"member":"286","published-online":{"date-parts":[[2016,2,1]]},"reference":[{"key":"2023020112295123700_btw053-B1","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1093\/bioinformatics\/btq713","article-title":"Age: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision","volume":"27","author":"Abyzov","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B2","doi-asserted-by":"crossref","first-page":"974","DOI":"10.1101\/gr.114876.110","article-title":"Cnvnator: an approach to discover, genotype, and characterize typical and atypical cnvs from family and population genome sequencing","volume":"21","author":"Abyzov","year":"2011","journal-title":"Genome Res"},{"key":"2023020112295123700_btw053-B3","doi-asserted-by":"crossref","first-page":"7256","DOI":"10.1038\/ncomms8256","article-title":"Analysis of deletion breakpoints from 1,092 humans reveals details of mutation mechanisms","volume":"6","author":"Abyzov","year":"2015","journal-title":"Nat. Commun"},{"key":"2023020112295123700_btw053-B4","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1038\/nrg2958","article-title":"Genome structural variation discovery and genotyping","volume":"12","author":"Alkan","year":"2011","journal-title":"Nat. Rev. Genet"},{"key":"2023020112295123700_btw053-B5","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1038\/nmeth.1858","article-title":"Structural variation: the genome\u2019s hidden architecture","volume":"9","author":"Baker","year":"2012","journal-title":"Nat. Methods"},{"key":"2023020112295123700_btw053-B6","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1038\/nmeth.1363","article-title":"Breakdancer: an algorithm for high-resolution mapping of genomic structural variation","volume":"6","author":"Chen","year":"2009","journal-title":"Nat. Methods"},{"key":"2023020112295123700_btw053-B7","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1038\/nmeth.1276","article-title":"High-resolution mapping of copy-number alterations with massively parallel sequencing","volume":"6","author":"Chiang","year":"2009","journal-title":"Nat. Methods"},{"key":"2023020112295123700_btw053-B8","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1038\/ng.564","article-title":"Mutation spectrum revealed by breakpoint sequencing of human germline cnvs","volume":"42","author":"Conrad","year":"2010","journal-title":"Nat. Genet"},{"key":"2023020112295123700_btw053-B9","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature11632","article-title":"An integrated map of genetic variation from 1,092 human genomes","volume":"491","author":"Consortium","year":"2012","journal-title":"Nature"},{"key":"2023020112295123700_btw053-B10","doi-asserted-by":"crossref","first-page":"2417","DOI":"10.1093\/bioinformatics\/bts456","article-title":"Yaha: fast and flexible long-read alignment with optimal breakpoint detection","volume":"28","author":"Faust","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B11","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1038\/nrg1767","article-title":"Structural variation in the human genome","volume":"7","author":"Feuk","year":"2006","journal-title":"Nat. Rev. Genet"},{"key":"2023020112295123700_btw053-B12","doi-asserted-by":"crossref","first-page":"1270","DOI":"10.1101\/gr.088633.108","article-title":"Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes","volume":"19","author":"Hormozdiari","year":"2009","journal-title":"Genome Res"},{"key":"2023020112295123700_btw053-B13","doi-asserted-by":"crossref","first-page":"i350","DOI":"10.1093\/bioinformatics\/btq216","article-title":"Next-generation variationhunter: combinatorial algorithms for transposon insertion discovery","volume":"26","author":"Hormozdiari","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B14","doi-asserted-by":"crossref","first-page":"2576","DOI":"10.1093\/bioinformatics\/bts484","article-title":"Prism: pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants","volume":"28","author":"Jiang","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B15","doi-asserted-by":"crossref","first-page":"R23","DOI":"10.1186\/gb-2009-10-2-r23","article-title":"Pemer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data","volume":"10","author":"Korbel","year":"2009","journal-title":"Genome Biol"},{"key":"2023020112295123700_btw053-B16","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with bowtie 2","volume":"9","author":"Langmead","year":"2012","journal-title":"Nat. Methods"},{"key":"2023020112295123700_btw053-B17","doi-asserted-by":"crossref","first-page":"R84","DOI":"10.1186\/gb-2014-15-6-r84","article-title":"Lumpy: a probabilistic framework for structural variant discovery","volume":"15","author":"Layer","year":"2014","journal-title":"Genome Biol"},{"key":"2023020112295123700_btw053-B18","article-title":"Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM","author":"Li","year":"2013","journal-title":"ArXiv e-Prints"},{"key":"2023020112295123700_btw053-B19","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with burrows-wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B20","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and samtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B21","doi-asserted-by":"crossref","first-page":"825","DOI":"10.1093\/bioinformatics\/btu762","article-title":"Epga: de novo assembly using the distributions of reads and insert size","volume":"31","author":"Luo","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B22","doi-asserted-by":"crossref","first-page":"3988","DOI":"10.1093\/bioinformatics\/btv487","article-title":"Epga2: memory-efficient de novo assembler","volume":"31","author":"Luo","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B23","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature09708","article-title":"Mapping copy number variation by population-scale genome sequencing","volume":"470","author":"Mills","year":"2011","journal-title":"Nature"},{"key":"2023020112295123700_btw053-B24","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1093\/bioinformatics\/btq033","article-title":"Bedtools: a flexible suite of utilities for comparing genomic features","volume":"26","author":"Quinlan","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B25","doi-asserted-by":"crossref","first-page":"i333","DOI":"10.1093\/bioinformatics\/bts378","article-title":"Delly: structural variant discovery by integrated paired-end and split-read analysis","volume":"28","author":"Rausch","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B26","doi-asserted-by":"crossref","first-page":"3451","DOI":"10.1093\/bioinformatics\/btu545","article-title":"Mindthegap: integrated detection and assembly of short and long insertions","volume":"30","author":"Rizk","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B27","doi-asserted-by":"crossref","first-page":"1064","DOI":"10.1093\/bioinformatics\/btt767","article-title":"Socrates: identification of genomic rearrangements in tumour genomes by re-aligning soft clipped reads","volume":"30","author":"Schr\u00f6der","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B28","doi-asserted-by":"crossref","first-page":"i222","DOI":"10.1093\/bioinformatics\/btp208","article-title":"A geometric approach for classification and comparison of structural variants","volume":"25","author":"Sindi","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B29","doi-asserted-by":"crossref","first-page":"R22","DOI":"10.1186\/gb-2012-13-3-r22","article-title":"An integrative probabilistic model for identification of structural variation in sequencing data","volume":"13","author":"Sindi","year":"2012","journal-title":"Genome Biol"},{"key":"2023020112295123700_btw053-B30","doi-asserted-by":"crossref","first-page":"S7","DOI":"10.1186\/1471-2105-12-S14-S7","article-title":"Clipcrop: a tool for detecting structural variations with single-base resolution using soft-clipping information","volume":"12(Suppl 14)","author":"Suzuki","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020112295123700_btw053-B31","doi-asserted-by":"crossref","first-page":"652","DOI":"10.1038\/nmeth.1628","article-title":"Crest maps somatic structural variation in cancer genomes with base-pair resolution","volume":"8","author":"Wang","year":"2011","journal-title":"Nat. Methods"},{"key":"2023020112295123700_btw053-B32","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1038\/nrg3373","article-title":"Phenotypic impact of genomic structural variation: insights from and for human disease","volume":"14","author":"Weischenfeldt","year":"2013","journal-title":"Nat. Rev. Genet"},{"key":"2023020112295123700_btw053-B33","doi-asserted-by":"crossref","first-page":"2865","DOI":"10.1093\/bioinformatics\/btp394","article-title":"Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads","volume":"25","author":"Ye","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020112295123700_btw053-B34","doi-asserted-by":"crossref","first-page":"1586","DOI":"10.1101\/gr.092981.109","article-title":"Sensitive and accurate detection of copy number variants using read depth of coverage","volume":"19","author":"Yoon","year":"2009","journal-title":"Genome Res"},{"key":"2023020112295123700_btw053-B35","doi-asserted-by":"crossref","first-page":"S6","DOI":"10.1186\/1471-2105-13-S6-S6","article-title":"An improved approach for accurate and efficient calling of structural variations with low-coverage sequence data","volume":"13(Suppl 6)","author":"Zhang","year":"2012","journal-title":"BMC Bioinformatics"},{"key":"2023020112295123700_btw053-B36","doi-asserted-by":"crossref","first-page":"3228","DOI":"10.1093\/bioinformatics\/btr563","article-title":"Svseq: an approach for detecting exact breakpoints of deletions with low-coverage sequence data","volume":"27","author":"Zhang","year":"2011","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/12\/1788\/49021004\/bioinformatics_32_12_1788.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/12\/1788\/49021004\/bioinformatics_32_12_1788.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,16]],"date-time":"2023-08-16T21:40:06Z","timestamp":1692222006000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/12\/1788\/1743630"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,2,1]]},"references-count":36,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2016,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw053","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,6,15]]},"published":{"date-parts":[[2016,2,1]]}}}