{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,18]],"date-time":"2026-05-18T23:12:59Z","timestamp":1779145979144,"version":"3.51.4"},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2018,4,6]],"date-time":"2018-04-06T00:00:00Z","timestamp":1522972800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"KAUST","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Office of Sponsored Research"},{"name":"OSR","award":["FCC\/1\/1976-04"],"award-info":[{"award-number":["FCC\/1\/1976-04"]}]},{"name":"OSR","award":["URF\/1\/2602-01"],"award-info":[{"award-number":["URF\/1\/2602-01"]}]},{"name":"OSR","award":["URF\/1\/3007-01"],"award-info":[{"award-number":["URF\/1\/3007-01"]}]},{"name":"OSR","award":["URF\/1\/3412-01"],"award-info":[{"award-number":["URF\/1\/3412-01"]}]},{"name":"OSR","award":["URF\/1\/3450-01"],"award-info":[{"award-number":["URF\/1\/3450-01"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Oxford Nanopore sequencing is a rapidly developed sequencing technology in recent years. To keep pace with the explosion of the downstream data analytical tools, a versatile Nanopore sequencing simulator is needed to complement the experimental data as well as to benchmark those newly developed tools. However, all the currently available simulators are based on simple statistics of the produced reads, which have difficulty in capturing the complex nature of the Nanopore sequencing procedure, the main task of which is the generation of raw electrical current signals.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here we propose a deep learning based simulator, DeepSimulator, to mimic the entire pipeline of Nanopore sequencing. Starting from a given reference genome or assembled contigs, we simulate the electrical current signals by a context-dependent deep learning model, followed by a base-calling procedure to yield simulated reads. This workflow mimics the sequencing procedure more naturally. The thorough experiments performed across four species show that the signals generated by our context-dependent model are more similar to the experimentally obtained signals than the ones generated by the official context-independent pore model. In terms of the simulated reads, we provide a parameter interface to users so that they can obtain the reads with different accuracies ranging from 83 to 97%. The reads generated by the default parameter have almost the same properties as the real data. Two case studies demonstrate the application of DeepSimulator to benefit the development of tools in de novo assembly and in low coverage SNP detection.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The software can be accessed freely at: https:\/\/github.com\/lykaust15\/DeepSimulator.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty223","type":"journal-article","created":{"date-parts":[[2018,4,4]],"date-time":"2018-04-04T15:12:36Z","timestamp":1522854756000},"page":"2899-2908","source":"Crossref","is-referenced-by-count":74,"title":["DeepSimulator: a deep simulator for Nanopore sequencing"],"prefix":"10.1093","volume":"34","author":[{"given":"Yu","family":"Li","sequence":"first","affiliation":[{"name":"Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Renmin","family":"Han","sequence":"additional","affiliation":[{"name":"Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chongwei","family":"Bi","sequence":"additional","affiliation":[{"name":"Biological and Environmental Sciences and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mo","family":"Li","sequence":"additional","affiliation":[{"name":"Biological and Environmental Sciences and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sheng","family":"Wang","sequence":"additional","affiliation":[{"name":"Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xin","family":"Gao","sequence":"additional","affiliation":[{"name":"Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2018,4,6]]},"reference":[{"key":"2023061313372862500_bty223-B1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3022670.2976746","article-title":"Tensorflow: learning functions at scale","volume":"51","author":"Abadi","year":"2016","journal-title":"ACM Sigplan Notices"},{"key":"2023061313372862500_bty223-B2","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1016\/S0076-5392(08)60869-3","article-title":"Canonical correlation analysis of time series and the use of an information criterion","volume":"126","author":"Akaike","year":"1976","journal-title":"Math. Sci. Eng"},{"key":"2023061313372862500_bty223-B3","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped blast and psi-blast: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023061313372862500_bty223-B4","author":"Baker","year":"2016"},{"key":"2023061313372862500_bty223-B5","doi-asserted-by":"crossref","first-page":"e0178751.","DOI":"10.1371\/journal.pone.0178751","article-title":"Deepnano: deep recurrent neural networks for base calling in minion nanopore reads","volume":"12","author":"Bo\u017ea","year":"2017","journal-title":"PloS One"},{"key":"2023061313372862500_bty223-B6","first-page":"16027","author":"Byrne","year":"2017"},{"key":"2023061313372862500_bty223-B7","doi-asserted-by":"crossref","first-page":"14515.","DOI":"10.1038\/ncomms14515","article-title":"Scaffolding and completing genome assemblies in real-time with nanopore sequencing","volume":"8","author":"Cao","year":"2017","journal-title":"Nat. Commun"},{"key":"2023061313372862500_bty223-B8","doi-asserted-by":"crossref","first-page":"3575","DOI":"10.1093\/bioinformatics\/btx480","article-title":"Sequence2vec: a novel embedding approach for modeling transcription factor binding affinity landscape","volume":"33","author":"Dai","year":"2017","journal-title":"Bioinformatics"},{"key":"2023061313372862500_bty223-B9","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1093\/bioinformatics\/btw569","article-title":"Nanocall: an open source basecaller for oxford nanopore sequencing data","volume":"33","author":"David","year":"2017","journal-title":"Bioinformatics"},{"key":"2023061313372862500_bty223-B10","doi-asserted-by":"crossref","first-page":"518","DOI":"10.1038\/nbt.3423","article-title":"Three decades of nanopore sequencing","volume":"34","author":"Deamer","year":"2016","journal-title":"Nat. Biotechnol"},{"key":"2023061313372862500_bty223-B11","doi-asserted-by":"crossref","first-page":"2369","DOI":"10.1093\/nar\/27.11.2369","article-title":"Alignment of whole genomes","volume":"27","author":"Delcher","year":"1999","journal-title":"Nucleic Acids Res"},{"key":"2023061313372862500_bty223-B12","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1038\/nrg.2016.57","article-title":"A comparison of tools for the simulation of genomic next-generation sequencing data","volume":"17","author":"Escalona","year":"2016","journal-title":"Nat. Rev. Genet"},{"key":"2023061313372862500_bty223-B13","author":"Ester","year":"1996"},{"key":"2023061313372862500_bty223-B14","author":"Gehring","year":"2017"},{"key":"2023061313372862500_bty223-B15","author":"Graves","year":"2013"},{"key":"2023061313372862500_bty223-B16","doi-asserted-by":"crossref","first-page":"602","DOI":"10.1016\/j.neunet.2005.06.042","article-title":"Framewise phoneme classification with bidirectional lstm and other neural network architectures","volume":"18","author":"Graves","year":"2005","journal-title":"Neural Netw"},{"key":"2023061313372862500_bty223-B17","author":"Ioffe","year":"2015"},{"key":"2023061313372862500_bty223-B18","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1007\/978-3-319-56970-3_5","volume-title":"Research in Computational Molecular Biology","author":"Jain","year":"2017"},{"key":"2023061313372862500_bty223-B19","first-page":"338","author":"Jain","year":"2018"},{"key":"2023061313372862500_bty223-B20","author":"Kingma","year":"2014"},{"key":"2023061313372862500_bty223-B21","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1101\/gr.215087.116","article-title":"Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation","volume":"27","author":"Koren","year":"2017","journal-title":"Genome Res"},{"key":"2023061313372862500_bty223-B22","first-page":"6395","author":"Lee","year":"2014"},{"key":"2023061313372862500_bty223-B23","doi-asserted-by":"crossref","first-page":"2987","DOI":"10.1093\/bioinformatics\/btr509","article-title":"A statistical framework for snp calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data","volume":"27","author":"Li","year":"2011","journal-title":"Bioinformatics"},{"key":"2023061313372862500_bty223-B24","doi-asserted-by":"crossref","first-page":"2103","DOI":"10.1093\/bioinformatics\/btw152","article-title":"Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences","volume":"32","author":"Li","year":"2016","journal-title":"Bioinformatics"},{"key":"2023061313372862500_bty223-B25","author":"Li","year":"2017"},{"key":"2023061313372862500_bty223-B26","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.1093\/bioinformatics\/btp352","article-title":"The sequence alignment\/map format and samtools","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023061313372862500_bty223-B27","doi-asserted-by":"crossref","first-page":"760","DOI":"10.1093\/bioinformatics\/btx680","article-title":"Deepre: sequence-based enzyme ec number prediction by deep learning","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2023061313372862500_bty223-B28","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1016\/j.gpb.2016.05.004","article-title":"Oxford nanopore minion sequencing and genome assembly","volume":"14","author":"Lu","year":"2016","journal-title":"Genomics Proteomics Bioinf"},{"key":"2023061313372862500_bty223-B29","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1038\/nrmicro2088","article-title":"Application of \u2019next-generation\u2019 sequencing technologies to microbial genetics","volume":"7","author":"MacLean","year":"2009","journal-title":"Nat. Rev. Microbiol"},{"key":"2023061313372862500_bty223-B30","doi-asserted-by":"crossref","first-page":"31.","DOI":"10.1038\/nrg2626","article-title":"Sequencing technologies\u2013the next generation","volume":"11","author":"Metzker","year":"2010","journal-title":"Nat. Rev. Genet"},{"key":"2023061313372862500_bty223-B31","doi-asserted-by":"crossref","first-page":"1719.","DOI":"10.1016\/j.cell.2016.11.052","article-title":"In vivo amelioration of age-associated hallmarks by partial reprogramming","volume":"167","author":"Ocampo","year":"2016","journal-title":"Cell"},{"key":"2023061313372862500_bty223-B32","author":"Rajeswar","year":"2017"},{"key":"2023061313372862500_bty223-B33","doi-asserted-by":"crossref","first-page":"561","DOI":"10.3233\/IDA-2007-11508","article-title":"Toward accurate dynamic time warping in linear time and space","volume":"11","author":"Salvador","year":"2007","journal-title":"Intell. Data Anal"},{"key":"2023061313372862500_bty223-B34","doi-asserted-by":"crossref","first-page":"12065","DOI":"10.1038\/ncomms12065","article-title":"Long-read sequencing and de novo assembly of a chinese genome","volume":"7","author":"Shi","year":"2016","journal-title":"Nat. Commun"},{"key":"2023061313372862500_bty223-B35","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1038\/nmeth.4184","article-title":"Detecting dna cytosine methylation using nanopore sequencing","volume":"14","author":"Simpson","year":"2017","journal-title":"Nat. Methods"},{"key":"2023061313372862500_bty223-B36","doi-asserted-by":"crossref","first-page":"11307.","DOI":"10.1038\/ncomms11307","article-title":"Fast and sensitive mapping of nanopore sequencing reads with graphmap","volume":"7","author":"Sovi\u0107","year":"2016","journal-title":"Nat. Commun"},{"key":"2023061313372862500_bty223-B37","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/nrg3966","article-title":"The dynamics of mitochondrial dna heteroplasmy: implications for human health and disease","volume":"16","author":"Stewart","year":"2015","journal-title":"Nat. Rev. Genet"},{"key":"2023061313372862500_bty223-B38","author":"Stoiber","year":"2017"},{"key":"2023061313372862500_bty223-B39","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/BF00130487","article-title":"Color indexing","volume":"7","author":"Swain","year":"1991","journal-title":"Int. J. Comput. Vis"},{"key":"2023061313372862500_bty223-B40","author":"Teng","year":"2018"},{"key":"2023061313372862500_bty223-B41","author":"Trigeorgis","year":"2016"},{"key":"2023061313372862500_bty223-B42","doi-asserted-by":"crossref","first-page":"737","DOI":"10.1101\/gr.214270.116","article-title":"Fast and accurate de novo genome assembly from long uncorrected reads","volume":"27","author":"Vaser","year":"2017","journal-title":"Genome Res"},{"key":"2023061313372862500_bty223-B43","first-page":"6000","author":"Vaswani","year":"2017"},{"key":"2023061313372862500_bty223-B44","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1146\/annurev-anchem-061516-045228","article-title":"Single-cell transcriptional analysis","volume":"10","author":"Wu","year":"2017","journal-title":"Annu. Rev. Anal. Chem"},{"key":"2023061313372862500_bty223-B45","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/gix010","article-title":"Nanosim: nanopore sequence read simulator based on statistical characterization","volume":"6","author":"Yang","year":"2017","journal-title":"GigaScience"},{"key":"2023061313372862500_bty223-B46","doi-asserted-by":"crossref","first-page":"2859","DOI":"10.1093\/bioinformatics\/btt512","article-title":"Pyrohmmvar: a sensitive and accurate method to call short indels and snps for ion torrent and 454 data","volume":"29","author":"Zeng","year":"2013","journal-title":"Bioinformatics"},{"key":"2023061313372862500_bty223-B47","author":"Zhang","year":"2017"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/17\/2899\/50582124\/bioinformatics_34_17_2899.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/17\/2899\/50582124\/bioinformatics_34_17_2899.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,13]],"date-time":"2023-06-13T09:38:34Z","timestamp":1686649114000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/17\/2899\/4962495"}},"subtitle":[],"editor":[{"given":"Bonnie","family":"Berger","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2018,4,6]]},"references-count":47,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2018,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty223","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/238683","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,9,1]]},"published":{"date-parts":[[2018,4,6]]}}}