{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,12]],"date-time":"2026-03-12T00:12:13Z","timestamp":1773274333784,"version":"3.50.1"},"reference-count":15,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2008,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Second-generation sequencing has the potential to revolutionize genomics and impact all areas of biomedical science. New technologies will make re-sequencing widely available for such applications as identifying genome variations or interrogating the oligonucleotide content of a large sample (<jats:italic>e.g<\/jats:italic>. ChIP-sequencing). The increase in speed, sensitivity and availability of sequencing technology brings demand for advances in computational technology to perform associated analysis tasks. The Solexa\/Illumina 1G sequencer can produce tens of millions of reads, ranging in length from ~25\u201350 nt, in a single experiment. Accurately mapping the reads back to a reference genome is a critical task in almost all applications. Two sources of information that are often ignored when mapping reads from the Solexa technology are the 3' ends of longer reads, which contain a much higher frequency of sequencing errors, and the base-call quality scores.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>To investigate whether these sources of information can be used to improve accuracy when mapping reads, we developed the RMAP tool, which can map reads having a wide range of lengths and allows base-call quality scores to determine which positions in each read are more important when mapping. We applied RMAP to analyze data re-sequenced from two human BAC regions for varying read lengths, and varying criteria for use of quality scores. RMAP is freely available for downloading at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/rulai.cshl.edu\/rmap\/\" ext-link-type=\"uri\">http:\/\/rulai.cshl.edu\/rmap\/<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>Our results indicate that significant gains in Solexa read mapping performance can be achieved by considering the information in 3' ends of longer reads, and appropriately using the base-call quality scores. The RMAP tool we have developed will enable researchers to effectively exploit this information in targeted re-sequencing projects.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-9-128","type":"journal-article","created":{"date-parts":[[2008,2,29]],"date-time":"2008-02-29T07:14:45Z","timestamp":1204269285000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":236,"title":["Using quality scores and longer reads improves accuracy of Solexa read mapping"],"prefix":"10.1186","volume":"9","author":[{"given":"Andrew D","family":"Smith","sequence":"first","affiliation":[]},{"given":"Zhenyu","family":"Xuan","sequence":"additional","affiliation":[]},{"given":"Michael Q","family":"Zhang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,2,28]]},"reference":[{"issue":"7057","key":"2113_CR1","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature03959","volume":"437","author":"M Margulies","year":"2005","unstructured":"Margulies M, Egholm M, Altman W, Attiya S, Bader J, Bemben L, Berka J, Braverman M, Chen Y, Chen Z, Dewell S, Du L, Fierro J, Gomes X, Godwin B, He W, Helgesen S, Ho C, Ho C, Irzyk G, Jando S, Alenquer M, Jarvie T, Jirage K, Kim J, Knight J, Lanza J, Leamon J, Lefkowitz S, Lei M, Li J, Lohman K, Lu H, Makhijani V, McDade K, McKenna M, Myers E, Nickerson E, Nobile J, Plant R, Puc B, Ronan M, Roth G, Sarkis G, Simons J, Simpson J, Srinivasan M, Tartaro K, Tomasz A, Vogt K, Volkmer G, Wang S, Wang Y, Weiner M, Yu P, Begley R, Rothberg J: Genome sequencing in microfabricated high-density picolitre reactors. Nature 2005, 437(7057):376\u201380.","journal-title":"Nature"},{"issue":"6: Genomes and","key":"2113_CR2","doi-asserted-by":"publisher","first-page":"545","DOI":"10.1016\/j.gde.2006.10.009","volume":"16","author":"DR Bentley","year":"2006","unstructured":"Bentley DR: Whole-genome re-sequencing. Current Opinion in Genetics & Development 2006, 16(6: Genomes and evolution):545\u2013552. 10.1016\/j.gde.2006.10.009","journal-title":"Current Opinion in Genetics & Development"},{"issue":"4","key":"2113_CR3","doi-asserted-by":"publisher","first-page":"823","DOI":"10.1016\/j.cell.2007.05.009","volume":"129","author":"A Barski","year":"2007","unstructured":"Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K: High-Resolution Profiling of Histone Methylations in the Human Genome. Cell 2007, 129(4):823\u2013837. 10.1016\/j.cell.2007.05.009","journal-title":"Cell"},{"issue":"7153","key":"2113_CR4","doi-asserted-by":"publisher","first-page":"553","DOI":"10.1038\/nature06008","volume":"448","author":"TS Mikkelsen","year":"2007","unstructured":"Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P, Brockman W, Kim TK, Koche RP, Lee W, Mendenhall E, O'Donovan A, Presser A, Russ C, Xie X, Meissner A, Wernig M, Jaenisch R, Nusbaum C, Lander ES, Bernstein BE: Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 2007, 448(7153):553\u2013560. 10.1038\/nature06008","journal-title":"Nature"},{"issue":"8","key":"2113_CR5","doi-asserted-by":"publisher","first-page":"651","DOI":"10.1038\/nmeth1068","volume":"4","author":"G Robertson","year":"2007","unstructured":"Robertson G, Hirst M, Bainbridge M, Bilenky M, Zhao Y, Zeng T, Euskirchen G, Bernier B, Varhol R, Delaney A, Thiessen N, Griffith OL, He A, Marra M, Snyder M, Jones S: Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nature Methods 2007, 4(8):651\u2013657. 10.1038\/nmeth1068","journal-title":"Nature Methods"},{"issue":"3","key":"2113_CR6","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1101\/gr.8.3.175","volume":"8","author":"B Ewing","year":"1998","unstructured":"Ewing B, Hillier L, Wendl MC, Green P: Base-Calling of Automated Sequencer Traces Using Phred. I. Accuracy Assessment. Genome Res 1998, 8(3):175\u2013185.","journal-title":"Genome Res"},{"issue":"3","key":"2113_CR7","doi-asserted-by":"publisher","first-page":"186","DOI":"10.1101\/gr.8.3.186","volume":"8","author":"B Ewing","year":"1998","unstructured":"Ewing B, Green P: Base-Calling of Automated Sequencer Traces Using Phred. II. Error Probabilities. Genome Res 1998, 8(3):186\u2013194.","journal-title":"Genome Res"},{"issue":"6","key":"2113_CR8","doi-asserted-by":"publisher","first-page":"1176","DOI":"10.1101\/gr.2188104","volume":"14","author":"CA Stewart","year":"2004","unstructured":"Stewart CA, Horton R, Allcock RJ, Ashurst JL, Atrazhev AM, Coggill P, Dunham I, Forbes S, Halls K, Howson JM, Humphray SJ, Hunt S, Mungall AJ, Osoegawa K, Palmer S, Roberts AN, Rogers J, Sims S, Wang Y, Wilming LG, Elliott JF, de Jong PJ, Sawcer S, Todd JA, Trowsdale J, Beck S: Complete MHC Haplotype Sequencing for Common Disease Gene Mapping. Genome Res 2004, 14(6):1176\u20131187. 10.1101\/gr.2188104","journal-title":"Genome Res"},{"key":"2113_CR9","volume-title":"Version 0.6.3","author":"H Li","year":"2008","unstructured":"Li H: Maq: Mapping and Assembly with Qualities. Version 0.6.3 2008. [http:\/\/maq.sourceforge.net\/index.shtml]"},{"issue":"3","key":"2113_CR10","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"S Altschul","year":"1990","unstructured":"Altschul S, Gish W, Miller W, Myers E, Lipman D: Basic local alignment search tool. J Mol Biol 1990, 215(3):403\u2013410.","journal-title":"J Mol Biol"},{"issue":"3","key":"2113_CR11","doi-asserted-by":"publisher","first-page":"440","DOI":"10.1093\/bioinformatics\/18.3.440","volume":"18","author":"B Ma","year":"2002","unstructured":"Ma B, Tromp J, Li M: PatternHunter: faster and more sensitive homology search. Bioinformatics 2002, 18(3):440\u2013445. 10.1093\/bioinformatics\/18.3.440","journal-title":"Bioinformatics"},{"issue":"1\/2","key":"2113_CR12","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1007\/BF01188584","volume":"13","author":"PA Pevzner","year":"1995","unstructured":"Pevzner PA, Waterman MS: Multiple Filtration and Approximate Pattern Matching. Algorithmica 1995, 13(1\/2):135\u2013154. 10.1007\/BF01188584","journal-title":"Algorithmica"},{"key":"2113_CR13","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511574931","volume-title":"Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology","author":"D Gusfield","year":"1997","unstructured":"Gusfield D: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press; 1997."},{"key":"2113_CR14","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1016\/0020-0190(96)00083-X","volume":"59","author":"R Baeza-Yates","year":"1996","unstructured":"Baeza-Yates R, Perleberg C: Fast and practical approximate pattern matching. Information Processing Letters 1996, 59: 21\u201327. 10.1016\/0020-0190(96)00083-X","journal-title":"Information Processing Letters"},{"key":"2113_CR15","volume-title":"Hacker's Delight","author":"HS Warren","year":"2002","unstructured":"Warren HS: Hacker's Delight. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc; 2002."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-9-128.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T03:26:12Z","timestamp":1630466772000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-9-128"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,2,28]]},"references-count":15,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2008,12]]}},"alternative-id":["2113"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-9-128","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,2,28]]},"assertion":[{"value":"5 October 2007","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 February 2008","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 February 2008","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"128"}}