{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,1]],"date-time":"2026-03-01T11:56:12Z","timestamp":1772366172474,"version":"3.50.1"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"10","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,5,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Structural variation including deletions, duplications and rearrangements of DNA sequence are an important contributor to genome variation in many organisms. In human, many structural variants are found in complex and highly repetitive regions of the genome making their identification difficult. A new sequencing technology called strobe sequencing generates strobe reads containing multiple subreads from a single contiguous fragment of DNA. Strobe reads thus generalize the concept of paired reads, or mate pairs, that have been routinely used for structural variant detection. Strobe sequencing holds promise for unraveling complex variants that have been difficult to characterize with current sequencing technologies.<\/jats:p><jats:p>Results: We introduce an algorithm for identification of structural variants using strobe sequencing data. We consider strobe reads from a test genome that have multiple possible alignments to a reference genome due to sequencing errors and\/or repetitive sequences in the reference. We formulate the combinatorial optimization problem of finding the minimum number of structural variants in the test genome that are consistent with these alignments. We solve this problem using an integer linear program. Using simulated strobe sequencing data, we show that our algorithm has better sensitivity and specificity than paired read approaches for structural variation identification.<\/jats:p><jats:p>Contact: \u00a0braphael@brown.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq153","type":"journal-article","created":{"date-parts":[[2010,4,9]],"date-time":"2010-04-09T00:19:08Z","timestamp":1270772348000},"page":"1291-1298","source":"Crossref","is-referenced-by-count":28,"title":["Structural variation analysis with strobe reads"],"prefix":"10.1093","volume":"26","author":[{"given":"Anna","family":"Ritz","sequence":"first","affiliation":[{"name":"1 Department of Computer Science, Brown University, Providence, RI 02912, 2 Pacific Biosciences, 1505 Adams Drive, Menlo Park, CA 94025 and 3 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"}]},{"given":"Ali","family":"Bashir","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, Brown University, Providence, RI 02912, 2 Pacific Biosciences, 1505 Adams Drive, Menlo Park, CA 94025 and 3 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"}]},{"given":"Benjamin J.","family":"Raphael","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science, Brown University, Providence, RI 02912, 2 Pacific Biosciences, 1505 Adams Drive, Menlo Park, CA 94025 and 3 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"},{"name":"1 Department of Computer Science, Brown University, Providence, RI 02912, 2 Pacific Biosciences, 1505 Adams Drive, Menlo Park, CA 94025 and 3 Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,4,8]]},"reference":[{"key":"2023012507514788500_B1","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1038\/ng1215","article-title":"Chromosome aberrations in solid tumors","volume":"34","author":"Albertson","year":"2003","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B2","doi-asserted-by":"crossref","first-page":"e1000051","DOI":"10.1371\/journal.pcbi.1000051","article-title":"Evaluation of paired-end sequencing strategies for detection of genome rearrangements in cancer","volume":"4","author":"Bashir","year":"2008","journal-title":"PLoS Comput. Biol."},{"key":"2023012507514788500_B3","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1038\/nature07517","article-title":"Accurate whole human genome sequencing using reversible terminator chemistry","volume":"456","author":"Bentley","year":"2008","journal-title":"Nature"},{"key":"2023012507514788500_B4","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1038\/nmeth.1363","article-title":"BreakDancer: an algorithm for high-resolution mapping of genomic structural variation","volume":"6","author":"Chen","year":"2009","journal-title":"Nat. Methods"},{"key":"2023012507514788500_B5","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1016\/S0166-218X(00)00310-3","article-title":"Bundle-based relaxation methods for multicommodity capacitated fixed charge network design","volume":"112","author":"Crainic","year":"2001","journal-title":"Discrete Appl. Math."},{"key":"2023012507514788500_B6","doi-asserted-by":"crossref","first-page":"19920","DOI":"10.1073\/pnas.0709888104","article-title":"A portrait of copy-number polymorphism in Drosophila melanogaster","volume":"104","author":"Dopman","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012507514788500_B7","doi-asserted-by":"crossref","first-page":"1384","DOI":"10.1038\/ng.2007.19","article-title":"Recurrent DNA copy number variation in the laboratory mouse","volume":"39","author":"Egan","year":"2007","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B8","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1126\/science.1162986","article-title":"Real-time DNA sequencing from single polymerase molecules","volume":"323","author":"Eid","year":"2009","journal-title":"Science"},{"key":"2023012507514788500_B9","doi-asserted-by":"crossref","first-page":"e1000502","DOI":"10.1371\/journal.pgen.1000502","article-title":"Systematic identification of balanced transposition polymorphisms in Saccharomyces cerevisiae","volume":"5","author":"Faddah","year":"2009","journal-title":"PLoS Genet."},{"key":"2023012507514788500_B10","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/ng.534","article-title":"A recurrent 16p12.1 microdeletion supports a two-hit model for severe developmental delay","volume":"42","author":"Girirajan","year":"2010","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B11","doi-asserted-by":"crossref","first-page":"931","DOI":"10.1038\/ng.415","article-title":"De novo copy number variants identify new genes and loci in isolated sporadic tetralogy of Fallot","volume":"41","author":"Greenway","year":"2009","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B12","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1002\/net.3230190304","article-title":"Analysis of a flow problem with fixed charges","volume":"19","author":"Hochbaum","year":"1989","journal-title":"Networks"},{"key":"2023012507514788500_B13","doi-asserted-by":"crossref","first-page":"1270","DOI":"10.1101\/gr.088633.108","article-title":"Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes","volume":"19","author":"Hormozdiari","year":"2009","journal-title":"Genome Res."},{"key":"2023012507514788500_B14","doi-asserted-by":"crossref","first-page":"949","DOI":"10.1038\/ng1416","article-title":"Detection of large-scale variation in the human genome","volume":"36","author":"Iafrate","year":"2004","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B15","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature06862","article-title":"Mapping and sequencing of structural variation from eight human genomes","volume":"453","author":"Kidd","year":"2008","journal-title":"Nature"},{"key":"2023012507514788500_B16","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1126\/science.1149504","article-title":"Paired-end mapping reveals extensive structural variation in the human genome","volume":"318","author":"Korbel","year":"2007","journal-title":"Science"},{"key":"2023012507514788500_B17","doi-asserted-by":"crossref","first-page":"R23","DOI":"10.1186\/gb-2009-10-2-r23","article-title":"PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data","volume":"10","author":"Korbel","year":"2009","journal-title":"Genome Biol."},{"key":"2023012507514788500_B18","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1093\/bioinformatics\/btq027","article-title":"Microindel detection in short-read sequence data","volume":"26","author":"Krawitz","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012507514788500_B19","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1093\/bioinformatics\/btn176","article-title":"A robust framework for detecting structural variations in a genome","volume":"24","author":"Lee","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012507514788500_B20","doi-asserted-by":"crossref","first-page":"e254","DOI":"10.1371\/journal.pbio.0050254","article-title":"The diploid genome sequence of an individual human","volume":"5","author":"Levy","year":"2007","journal-title":"PLoS Biol."},{"key":"2023012507514788500_B21","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows-Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012507514788500_B22","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1093\/bioinformatics\/btp698","article-title":"Fast and accurate long-read alignment with Burrows-Wheeler transform","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012507514788500_B23","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The Sequence Alignment\/Map format and SAMtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012507514788500_B24","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1016\/j.ajhg.2007.12.009","article-title":"Structural variation of chromosomes in autism spectrum disorder","volume":"82","author":"Marshall","year":"2008","journal-title":"Am. J. Hum. Genet."},{"key":"2023012507514788500_B25","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1038\/ng1335","article-title":"Fusion genes and rearranged genes as a linear function of chromosome aberrations in cancer","volume":"36","author":"Mitelman","year":"2004","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B26","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1016\/j.ajhg.2007.12.010","article-title":"The fine-scale and complex architecture of human copy-number variation","volume":"82","author":"Perry","year":"2008","journal-title":"Am. J. Hum. Genet."},{"key":"2023012507514788500_B27","doi-asserted-by":"crossref","DOI":"10.1101\/gr.102970.109","article-title":"Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome","author":"Quinlan","year":"2010","journal-title":"Genome Res."},{"issue":"Suppl. 2","key":"2023012507514788500_B28","doi-asserted-by":"crossref","first-page":"i162","DOI":"10.1093\/bioinformatics\/btg1074","article-title":"Reconstructing tumor genome architectures","volume":"19","author":"Raphael","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012507514788500_B29","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/nature05329","article-title":"Global variation in copy number in the human genome","volume":"444","author":"Redon","year":"2006","journal-title":"Nature"},{"key":"2023012507514788500_B30","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1038\/ng2093","article-title":"Challenges and standards in integrating surveys of structural variation","volume":"39","author":"Scherer","year":"2007","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B31","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1146\/annurev.genom.7.080505.115618","article-title":"Structural variation of the human genome","volume":"7","author":"Sharp","year":"2006","journal-title":"Annu. Rev. Genomics Hum. Genet."},{"key":"2023012507514788500_B32","doi-asserted-by":"crossref","first-page":"i222","DOI":"10.1093\/bioinformatics\/btp208","article-title":"A geometric approach for classification and comparison of structural variants","volume":"25","author":"Sindi","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012507514788500_B33","volume-title":"Personal Genomes (conference talk).","author":"Turner","year":"2009"},{"key":"2023012507514788500_B34","doi-asserted-by":"crossref","first-page":"727","DOI":"10.1038\/ng1562","article-title":"Fine-scale structural variation of the human genome","volume":"37","author":"Tuzun","year":"2005","journal-title":"Nat. Genet."},{"key":"2023012507514788500_B35","doi-asserted-by":"crossref","first-page":"7696","DOI":"10.1073\/pnas.1232418100","article-title":"End-sequence profiling: sequence-based analysis of aberrant genomes","volume":"100","author":"Volik","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/10\/1291\/48851768\/bioinformatics_26_10_1291.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/10\/1291\/48851768\/bioinformatics_26_10_1291.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,31]],"date-time":"2023-05-31T19:05:16Z","timestamp":1685559916000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/10\/1291\/194131"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,4,8]]},"references-count":35,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2010,5,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq153","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,5,15]]},"published":{"date-parts":[[2010,4,8]]}}}