{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,4]],"date-time":"2024-06-04T04:36:06Z","timestamp":1717475766322},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"14","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Discovering variation among high-throughput sequenced genomes relies on efficient and effective mapping of sequence reads. The speed, sensitivity and accuracy of read mapping are crucial to determining the full spectrum of single nucleotide variants (SNVs) as well as structural variants (SVs) in the donor genomes analyzed.<\/jats:p>\n               <jats:p>Results: We present drFAST, a read mapper designed for di-base encoded \u2018color-space\u2019 sequences generated with the AB SOLiD platform. drFAST is specially designed for better delineation of structural variants, including segmental duplications, and is able to return all possible map locations and underlying sequence variation of short reads within a user-specified distance threshold. We show that drFAST is more sensitive in comparison to all commonly used aligners such as Bowtie, BFAST and SHRiMP. drFAST is also faster than both BFAST and SHRiMP and achieves a mapping speed comparable to Bowtie.<\/jats:p>\n               <jats:p>Availability: The source code for drFAST is available at http:\/\/drfast.sourceforge.net<\/jats:p>\n               <jats:p>Contact: \u00a0calkan@u.washington.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr303","type":"journal-article","created":{"date-parts":[[2011,5,18]],"date-time":"2011-05-18T01:09:25Z","timestamp":1305680965000},"page":"1915-1921","source":"Crossref","is-referenced-by-count":15,"title":["Sensitive and fast mapping of di-base encoded reads"],"prefix":"10.1093","volume":"27","author":[{"given":"Farhad","family":"Hormozdiari","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Faraz","family":"Hach","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"S. Cenk","family":"Sahinalp","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Evan E.","family":"Eichler","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2011,5,17]]},"reference":[{"key":"2023012712454428600_B1","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/ng.437","article-title":"Personalized copy number and segmental duplication maps using next-generation sequencing","volume":"41","author":"Alkan","year":"2009","journal-title":"Nat. Genet."},{"key":"2023012712454428600_B2","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1038\/nrg2958","article-title":"Genome structural variation discovery and genotyping","volume":"12","author":"Alkan","year":"2011","journal-title":"Nat. Rev. Genet."},{"key":"2023012712454428600_B3","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1007\/978-1-4757-4034-9","volume-title":"Groups and Symmetry.","author":"Armstrong","year":"1988"},{"key":"2023012712454428600_B4","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1038\/nature07517","article-title":"Accurate whole human genome sequencing using reversible terminator chemistry","volume":"456","author":"Bentley","year":"2008","journal-title":"Nature"},{"key":"2023012712454428600_B5","first-page":"124","article-title":"A block sorting lossless data compression algorithm","author":"Burrows","year":"1994","journal-title":"Digital Equipment Corporation Technical Report"},{"key":"2023012712454428600_B6","doi-asserted-by":"crossref","first-page":"2514","DOI":"10.1093\/bioinformatics\/btp486","article-title":"PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds","volume":"25","author":"Chen","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012712454428600_B7","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1126\/science.1162986","article-title":"Real-time DNA sequencing from single polymerase molecules","volume":"323","author":"Eid","year":"2009","journal-title":"Science"},{"key":"2023012712454428600_B8","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1109\/SFCS.2000.892127","article-title":"Opportunistic data structures with applications","volume-title":"Proceedings of the 41st Annual Symposium on Foundations of Computer Science (FOCS 2000)","author":"Ferragina","year":"2000"},{"key":"2023012712454428600_B9","first-page":"285","article-title":"Cache-oblivious algorithms","volume-title":"40th Annual Symposium on Foundations of Computer Science","author":"Frigo","year":"1999"},{"key":"2023012712454428600_B10","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1093\/jhered\/esp086","article-title":"Genome 10K: a proposal to obtain whole-genome sequence for 10 000 vertebrate species","volume":"100","author":"Genome 10K Community of Scientists","year":"2009","journal-title":"J. Hered."},{"key":"2023012712454428600_B11","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature09534","article-title":"A map of human genome variation from population-scale sequencing","volume":"467","author":"1000 Genomes Project Consortium","year":"2010","journal-title":"Nature"},{"key":"2023012712454428600_B12","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1038\/nmeth0810-576","article-title":"mrsFAST: a cache-oblivious algorithm for short-read mapping","volume":"7","author":"Hach","year":"2010","journal-title":"Nat. Methods"},{"key":"2023012712454428600_B13","doi-asserted-by":"crossref","first-page":"1277","DOI":"10.1093\/bioinformatics\/btq152","article-title":"Detection and characterization of novel sequence insertions using paired-end next-generation sequencing","volume":"26","author":"Hajirasouliha","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012712454428600_B14","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1371\/journal.pone.0007767","article-title":"BFAST: an alignment tool for large scale genome resequencing","volume":"4","author":"Homer","year":"2009","journal-title":"PLoS One"},{"key":"2023012712454428600_B15","doi-asserted-by":"crossref","first-page":"1270","DOI":"10.1101\/gr.088633.108","article-title":"Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes","volume":"19","author":"Hormozdiari","year":"2009","journal-title":"Genome Res."},{"key":"2023012712454428600_B16","doi-asserted-by":"crossref","first-page":"350","DOI":"10.1093\/bioinformatics\/btq216","article-title":"Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery","volume":"26","author":"Hormozdiari","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012712454428600_B17","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature06862","article-title":"Mapping and sequencing of structural variation from eight human genomes","volume":"453","author":"Kidd","year":"2008","journal-title":"Nature"},{"key":"2023012712454428600_B18","doi-asserted-by":"crossref","first-page":"R25","DOI":"10.1186\/gb-2009-10-3-r25","article-title":"Ultrafast and memory-efficient alignment of short DNA sequences to the human genome","volume":"10","author":"Langmead","year":"2009","journal-title":"Genome Biol."},{"key":"2023012712454428600_B19","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1038\/nmeth.f.256","article-title":"MoDIL: detecting small indels from clone-end sequencing with mixtures of distributions","volume":"6","author":"Lee","year":"2009","journal-title":"Nat. Methods"},{"key":"2023012712454428600_B20","doi-asserted-by":"crossref","first-page":"1851","DOI":"10.1101\/gr.078212.108","article-title":"Mapping short DNA sequencing reads and calling variants using mapping quality scores","volume":"18","author":"Li","year":"2008","journal-title":"Genome Res."},{"key":"2023012712454428600_B21","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1093\/bioinformatics\/btn025","article-title":"SOAP: short oligonucleotide alignment program","volume":"24","author":"Li","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012712454428600_B22","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows-Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012712454428600_B23","doi-asserted-by":"crossref","first-page":"1181","DOI":"10.1056\/NEJMoa0908094","article-title":"Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy","volume":"362","author":"Lupski","year":"2010","journal-title":"N. Engl. J. Med."},{"key":"2023012712454428600_B24","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature03959","article-title":"Genome sequencing in microfabricated high-density picolitre reactors","volume":"437","author":"Margulies","year":"2005","journal-title":"Nature"},{"key":"2023012712454428600_B25","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1101\/gr.091868.109","article-title":"Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding","volume":"19","author":"McKernan","year":"2009","journal-title":"Genome Res."},{"key":"2023012712454428600_B26","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature09708","article-title":"Mapping copy number variation by population-scale genome sequencing","volume":"470","author":"Mills","year":"2011","journal-title":"Nature"},{"key":"2023012712454428600_B27","doi-asserted-by":"crossref","first-page":"790","DOI":"10.1038\/ng.646","article-title":"Exome sequencing identifies mll2 mutations as a cause of Kabuki syndrome","volume":"42","author":"Ng","year":"2010","journal-title":"Nat. Genet."},{"key":"2023012712454428600_B28","doi-asserted-by":"crossref","DOI":"10.1007\/s00439-011-0971-3","article-title":"Technology-specific error signatures in the 1000 Genomes Project data","author":"Nothnagel","year":"2011","journal-title":"Hum. Genet"},{"key":"2023012712454428600_B29","doi-asserted-by":"crossref","first-page":"2776","DOI":"10.1093\/bioinformatics\/btn512","article-title":"Efficient mapping of Applied Biosystems SOLiD sequence data to a reference genome for functional genomic applications","volume":"24","author":"Ondov","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012712454428600_B30","doi-asserted-by":"crossref","first-page":"847","DOI":"10.1038\/nbt.1561","article-title":"Single-molecule sequencing of an individual human genome","volume":"27","author":"Pushkarev","year":"2009","journal-title":"Nat. Biotechnol."},{"key":"2023012712454428600_B31","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1371\/journal.pcbi.1000386","article-title":"SHRiMP: accurate mapping of short color-space reads","volume":"5","author":"Rumble","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023012712454428600_B32","doi-asserted-by":"crossref","first-page":"1638","DOI":"10.1101\/gr.077776.108","article-title":"Rapid whole-genome mutational profiling using next-generation sequencing technologies","volume":"18","author":"Smith","year":"2008","journal-title":"Genome Res."},{"key":"2023012712454428600_B33","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1126\/science.1197005","article-title":"Diversity of human copy number variation and multicopy genes","volume":"330","author":"Sudmant","year":"2010","journal-title":"Science"},{"key":"2023012712454428600_B34","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1038\/ng.712","article-title":"A de novo paradigm for mental retardation","volume":"42","author":"Vissers","year":"2010","journal-title":"Nat. Genet."},{"key":"2023012712454428600_B35","doi-asserted-by":"crossref","first-page":"872","DOI":"10.1038\/nature06884","article-title":"The complete genome of an individual by massively parallel DNA sequencing","volume":"452","author":"Wheeler","year":"2008","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/14\/1915\/48933156\/bioinformatics_27_14_1915.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/14\/1915\/48933156\/bioinformatics_27_14_1915.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T13:38:27Z","timestamp":1674826707000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/14\/1915\/194452"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,5,17]]},"references-count":35,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2011,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr303","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2011,7]]},"published":{"date-parts":[[2011,5,17]]}}}