{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,24]],"date-time":"2026-01-24T08:54:59Z","timestamp":1769244899457,"version":"3.49.0"},"reference-count":87,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2018,8,20]],"date-time":"2018-08-20T00:00:00Z","timestamp":1534723200000},"content-version":"vor","delay-in-days":3,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100002322","name":"Coordination for the Improvement of Higher Education Personnel","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100002322","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100012637","name":"Vale","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100012637","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003593","name":"National Council for Scientific and Technological Development","doi-asserted-by":"publisher","award":["307479\/2016-1"],"award-info":[{"award-number":["307479\/2016-1"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003593","name":"CNPq","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,11,27]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: With the recent advances in DNA sequencing technologies, the study of the genetic composition of living organisms has become more accessible for researchers. Several advances have been achieved because of it, especially in the health sciences. However, many challenges which emerge from the complexity of sequencing projects remain unsolved. Among them is the task of assembling DNA fragments from previously unsequenced organisms, which is classified as an NP-hard (nondeterministic polynomial time hard) problem, for which no efficient computational solution with reasonable execution time exists. However, several tools that produce approximate solutions have been used with results that have facilitated scientific discoveries, although there is ample room for improvement. As with other NP-hard problems, machine learning algorithms have been one of the approaches used in recent years in an attempt to find better solutions to the DNA fragment assembly problem, although still at a low scale.<\/jats:p><jats:p>Results: This paper presents a broad review of pioneering literature comprising artificial intelligence-based DNA assemblers\u2014particularly the ones that use machine learning\u2014to provide an overview of state-of-the-art approaches and to serve as a starting point for further study in this field.<\/jats:p>","DOI":"10.1093\/bib\/bby072","type":"journal-article","created":{"date-parts":[[2018,7,27]],"date-time":"2018-07-27T03:40:57Z","timestamp":1532662857000},"page":"2116-2129","source":"Crossref","is-referenced-by-count":22,"title":["Machine learning meets genome assembly"],"prefix":"10.1093","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8276-2305","authenticated-orcid":false,"given":"Kleber","family":"Padovani de Souza","sequence":"first","affiliation":[{"name":"Federal University of Par\u00e1, Brazil"}]},{"given":"Jo\u00e3o Carlos","family":"Setubal","sequence":"additional","affiliation":[{"name":"University of S\u00e3o Paulo, Brazil"},{"name":"Department of Computer Science, University of S\u00e3o Paulo, Brazil"}]},{"given":"Andr\u00e9 Carlos","family":"Ponce de Leon F. de Carvalho","sequence":"additional","affiliation":[{"name":"Vale Technology Institute\u2014Sustainable Development, Brazil"}]},{"given":"Guilherme","family":"Oliveira","sequence":"additional","affiliation":[{"name":"University of Montpellier, LIRMM, France"}]},{"given":"Annie","family":"Chateau","sequence":"additional","affiliation":[{"name":"Vale Technology Institute\u2014Sustainable Development, Brazil"}]},{"given":"Ronnie","family":"Alves","sequence":"additional","affiliation":[{"name":"Federal University of Par\u00e1, Brazil"},{"name":"University of Montpellier, LIRMM, France"}]}],"member":"286","published-online":{"date-parts":[[2018,8,17]]},"reference":[{"issue":"6","key":"2020011102320767500_C1","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1016\/S1470-2045(02)00775-1","article-title":"Impact of the human genome project on the clinical management of sporadic cancers","volume":"3","author":"Fey","year":"2002","journal-title":"Lancet Oncol"},{"issue":"3","key":"2020011102320767500_C2","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1590\/1516-73132014000300004","article-title":"Projeto Genoma Humano: um retrato da constru\u00e7\u00e3o do conhecimento cient\u00edfico sob a \u00f3tica da revista Ci\u00eancia Hoje","volume":"20","author":"de Souza G\u00f3es","year":"2014","journal-title":"Ci\u00eancia & Educa\u00e7tilde\u00e3o (Bauru)"},{"key":"2020011102320767500_C3","article-title":"A Machine Learning Approach to DNA Shotgun Sequence Assembly","volume-title":"Dissertation, University of the Witwatersrand,","author":"Constantinescu","year":"2015"},{"issue":"4","key":"2020011102320767500_C4","doi-asserted-by":"crossref","first-page":"354","DOI":"10.1093\/bib\/bbp026","article-title":"Genome assembly reborn: recent computational challenges","volume":"10","author":"Pop","year":"2009","journal-title":"Brief Bioinform"},{"issue":"9","key":"2020011102320767500_C5","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1186\/gm483","article-title":"The human genome project: big science transforms biology and medicine","volume":"5","author":"Hood","year":"2013","journal-title":"Genome Med"},{"issue":"5","key":"2020011102320767500_C6","doi-asserted-by":"crossref","first-page":"523","DOI":"10.2217\/pme.14.34","article-title":"The road from next-generation sequencing to personalized medicine","volume":"11","author":"Gonzalez-Garay","year":"2014","journal-title":"Per Med"},{"issue":"1","key":"2020011102320767500_C7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.ygeno.2015.11.003","article-title":"The sequence of sequencers: the history of sequencing DNA","volume":"107","author":"Heather","year":"2016","journal-title":"Genomics"},{"issue":"3","key":"2020011102320767500_C8","first-page":"353","article-title":"Metagenomic assembly: overview, challenges and applications","volume":"89","author":"Ghurye","year":"2016","journal-title":"Yale J Biol Med"},{"key":"2020011102320767500_C9","first-page":"142","article-title":"Bioinformatics challenges of new sequencing technology","volume-title":"Trends in Genetics","author":"Pop","year":"2008"},{"key":"2020011102320767500_C10","first-page":"O111.014902","article-title":"De novo sequencing and homology searching","volume-title":"Mol Cell Proteomics","author":"Ma","year":"2011"},{"issue":"12","key":"2020011102320767500_C11","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0114253","article-title":"PERGA: a paired-end read guided de novo assembler for extending contigs using SVM and look ahead approach","volume":"9","author":"Zhu","year":"2014","journal-title":"PLoS ONE"},{"key":"2020011102320767500_C12","first-page":"340","article-title":"Graph mining for next generation sequencing: leveraging the assembly graph for biological insights","volume-title":"BMC Genomics","author":"Warnke-Sommer","year":"2016"},{"key":"2020011102320767500_C13","article-title":"A reinforcement learning approach for solving the fragment assembly problem. In: 2011 13th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing","author":"Bocicor","year":"2011","journal-title":"IEEE,"},{"issue":"1","key":"2020011102320767500_C14","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1186\/s12864-017-3679-5","article-title":"Utilization of defined microbial communities enables effective evaluation of meta-genomic assemblies","volume":"18","author":"Greenwald","year":"2017","journal-title":"BMC Genomics"},{"issue":"6","key":"2020011102320767500_C15","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1038\/nrg3920","article-title":"Machine learning applications in genetics and genomics","volume":"16","author":"Libbrecht","year":"2015","journal-title":"Nat Rev Gen"},{"issue":"6","key":"2020011102320767500_C16","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1016\/j.ygeno.2010.03.001","article-title":"Assembly algorithms for next-generation sequencing data","volume":"95","author":"Miller","year":"2010","journal-title":"Genomics"},{"issue":"3","key":"2020011102320767500_C17","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1016\/j.jgg.2011.02.003","article-title":"The impact of next-generation sequencing on genomics","volume":"38","author":"Zhang","year":"2011","journal-title":"J Genet Genomics"},{"key":"2020011102320767500_C18","first-page":"1063","article-title":"Stepping stones in DNA sequencing","volume-title":"Biotechnol J","author":"Stranneheim","year":"2012"},{"key":"2020011102320767500_C19","doi-asserted-by":"crossref","DOI":"10.4137\/BBI.S12462","article-title":"Metagenomics: tools and insights for analyzing next-generation sequencing data derived from biodiversity studies","volume-title":"Bioinform Biol Insights","author":"Oulas","year":"2015"},{"key":"2020011102320767500_C20","first-page":"e0157033","article-title":"Pattern recognition on read positioning in next generation sequencing","volume-title":"PLoS One","author":"Byeon","year":"2016"},{"issue":"2","key":"2020011102320767500_C21","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1000667","article-title":"A primer on metagenomics","volume":"6","author":"Wooley","year":"2010","journal-title":"PLoS Comput Biol"},{"issue":"1","key":"2020011102320767500_C22","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1186\/1471-2105-12-493","article-title":"A new strategy for better genome assembly from very short reads","volume":"12","author":"Ji","year":"2011","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2020011102320767500_C23","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1186\/2042-5783-3-2","article-title":"Beginner\u2019s guide to comparative bacterial genome analysis using next-generation sequence data","volume":"3","author":"Edwards","year":"2013","journal-title":"Microb Inform Exp"},{"issue":"1","key":"2020011102320767500_C24","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1186\/s12859-017-1911-6","article-title":"Reference-guided de novo assembly approach improves genome reconstruction for related species","volume":"18","author":"Lischer","year":"2017","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2020011102320767500_C25","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1186\/s13099-018-0242-0","article-title":"A hybrid reference-guided de novo assembly approach for generating cyclospora mitochondrion genomes","volume":"10","author":"Gopinath","year":"2018","journal-title":"Gut Pathog"},{"issue":"11","key":"2020011102320767500_C26","doi-asserted-by":"crossref","first-page":"R116","DOI":"10.1186\/gb-2010-11-11-r116","article-title":"Quake: quality-aware detection and correction of sequencing errors","volume":"11","author":"Kelley","year":"2010","journal-title":"Genome Biol"},{"key":"2020011102320767500_C27","first-page":"115","article-title":"DNA transposons: nature and applications in genomics","volume-title":"Curr Genomics","author":"Munoz-Lopez","year":"2010"},{"issue":"12","key":"2020011102320767500_C28","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1186\/s13059-014-0564-2","article-title":"Tracking down the sources of experimental contamination in microbiome studies","volume":"15","author":"Weiss","year":"2014","journal-title":"Genome Biol"},{"issue":"Suppl 7","key":"2020011102320767500_C29","doi-asserted-by":"crossref","first-page":"S5","DOI":"10.1186\/1471-2105-16-S7-S5","article-title":"Heterozygous genome assembly via binary classification of homologous sequence","volume":"16","author":"Bodily","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2020011102320767500_C30","first-page":"i352","article-title":"Haplotype assembly in polyploid genomes and identical by descent shared tracts","volume-title":"Bioinformatics","author":"Aguiar","year":"2013"},{"issue":"1","key":"2020011102320767500_C31","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1186\/s13059-015-0587-3","article-title":"Extending reference assembly models","volume":"16","author":"Church","year":"2015","journal-title":"Genome Biol"},{"issue":"12","key":"2020011102320767500_C32","doi-asserted-by":"crossref","first-page":"1050","DOI":"10.1038\/nmeth.4035","article-title":"Phased diploid genome assembly with single-molecule real-time sequencing","volume":"13","author":"Chin","year":"2016","journal-title":"Nat Methods"},{"issue":"1","key":"2020011102320767500_C33","doi-asserted-by":"crossref","first-page":"1050","DOI":"10.1186\/s40168-015-0093-6","article-title":"Xander: employing a novel method for efficient gene-targeted metagenomic assembly","volume":"3","author":"Wang","year":"2015","journal-title":"Microbiome"},{"key":"2020011102320767500_C34","volume-title":"Human Genome Assembly GRCh38","author":"NCBI.","year":"2017"},{"key":"2020011102320767500_C35","volume-title":"The New Science of Metagenomics: Revealing the Secrets of Our Microbial Planet.","author":"National Research Council","year":"2007"},{"issue":"1","key":"2020011102320767500_C36","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0169662","article-title":"Comparing and evaluating metagenome assembly tools from a microbiologist\u2019s perspective\u2014not only size matters","volume":"12","author":"Vollmers","year":"2017","journal-title":"PLOS One"},{"key":"2020011102320767500_C37","doi-asserted-by":"crossref","DOI":"10.1038\/ncomms14306","article-title":"MetaSort untangles metagenome assembly by reducing microbial community complexity","volume":"8","author":"Ji","year":"2017","journal-title":"Nat Commun"},{"key":"2020011102320767500_C38","article-title":"Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes","author":"Olson","year":"2017","journal-title":"Brief Bioinformatics"},{"issue":"4","key":"2020011102320767500_C39","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1128\/MMBR.00009-08","article-title":"A bioinformatician\u2019s guide to metagenomics","volume":"72","author":"Kunin","year":"2008","journal-title":"Microbiol Mol Biol Rev"},{"issue":"5","key":"2020011102320767500_C40","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1089\/cmb.2012.0021","article-title":"SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing","volume":"19","author":"Bankevich","year":"2012","journal-title":"J Comput Biol"},{"issue":"1","key":"2020011102320767500_C41","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/2047-217X-1-18","article-title":"SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler","volume":"1","author":"Luo","year":"2012","journal-title":"GigaScience"},{"issue":"5","key":"2020011102320767500_C42","doi-asserted-by":"crossref","first-page":"810","DOI":"10.1101\/gr.7337908","article-title":"ALLPATHS: de novo assembly of whole-genome shotgun microreads","volume":"18","author":"Butler","year":"2008","journal-title":"Genome Res"},{"issue":"21","key":"2020011102320767500_C43","doi-asserted-by":"crossref","first-page":"2669","DOI":"10.1093\/bioinformatics\/btt476","article-title":"The MaSuRCA genome assembler","volume":"29","author":"Zimin","year":"2013","journal-title":"Bioinformatics"},{"issue":"5","key":"2020011102320767500_C44","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1101\/gr.213959.116","article-title":"metaSPAdes: a new versatile metagenomic assembler","volume":"27","author":"Nurk","year":"2017","journal-title":"Genome Res"},{"issue":"20","key":"2020011102320767500_C45","doi-asserted-by":"crossref","first-page":"e155","DOI":"10.1093\/nar\/gks678","article-title":"MetaVelvet: an extension of velvet assembler to de novo metagenome assembly from short sequence reads","volume":"40","author":"Namiki","year":"2012","journal-title":"Nucleic Acids Res"},{"issue":"11","key":"2020011102320767500_C46","doi-asserted-by":"crossref","first-page":"1420","DOI":"10.1093\/bioinformatics\/bts174","article-title":"IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth","volume":"28","author":"Peng","year":"2012","journal-title":"Bioinformatics"},{"issue":"19","key":"2020011102320767500_C47","doi-asserted-by":"crossref","first-page":"2717","DOI":"10.1093\/bioinformatics\/btu395","article-title":"Omega: an overlap-graph de novo assembler for metagenomics","volume":"30","author":"Haider","year":"2014","journal-title":"Bioinformatics"},{"issue":"12","key":"2020011102320767500_C48","doi-asserted-by":"crossref","first-page":"R122","DOI":"10.1186\/gb-2012-13-12-r122","article-title":"Ray meta: scalable de novo metagenome assembly and profiling","volume":"13","author":"Boisvert","year":"2012","journal-title":"Genome Biol"},{"issue":"1","key":"2020011102320767500_C49","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1093\/dnares\/dsu041","article-title":"MetaVelvet-SL: an extension of the velvet assembler to a de novo metagenomic assembler utilizing supervised learning","volume":"22","author":"Afiahayati","year":"2014","journal-title":"DNA Res"},{"key":"2020011102320767500_C50","doi-asserted-by":"crossref","first-page":"209","DOI":"10.3389\/fpls.2014.00209","article-title":"An introduction to the analysis of shotgun metagenomic data","volume":"5","author":"Sharpton","year":"2014","journal-title":"Frontiers in Plant Sci"},{"issue":"11","key":"2020011102320767500_C51","doi-asserted-by":"crossref","first-page":"1533","DOI":"10.1038\/s41564-017-0012-7","article-title":"Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life","volume":"2","author":"Parks","year":"2017","journal-title":"Nat Microbiol"},{"issue":"1","key":"2020011102320767500_C52","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/2047-217X-2-10","article-title":"Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species","volume":"2","author":"Bradnam","year":"2013","journal-title":"GigaScience"},{"issue":"3","key":"2020011102320767500_C53","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1101\/gr.131383.111","article-title":"GAGE: a critical evaluation of genome assemblies and assembly algorithms","volume":"22","author":"Salzberg","year":"2012","journal-title":"Genome Res"},{"key":"2020011102320767500_C54","volume-title":"The Great A.I. Awakening.","author":"Lewis-Kraus","year":"2016"},{"issue":"1","key":"2020011102320767500_C55","doi-asserted-by":"crossref","first-page":"362","DOI":"10.1186\/s12859-015-0788-5","article-title":"Evaluation of shotgun metagenomics sequence classification methods using in silico and in vitro simulated communities","volume":"16","author":"Peabody","year":"2015","journal-title":"BMC Bioinformatics"},{"issue":"11","key":"2020011102320767500_C56","doi-asserted-by":"crossref","first-page":"1063","DOI":"10.1038\/nmeth.4458","article-title":"Critical assessment of metagenome interpretation\u2014a benchmark of metagenomics software","volume":"14","author":"Sczyrba","year":"2017","journal-title":"Nat Methods"},{"issue":"12","key":"2020011102320767500_C57","doi-asserted-by":"crossref","first-page":"2224","DOI":"10.1101\/gr.126599.111","article-title":"John, et al. Assemblathon 1: a competitive assessment of de novo short read assembly methods","volume":"21","author":"Earl","year":"2011","journal-title":"Genome Res"},{"issue":"8","key":"2020011102320767500_C58","doi-asserted-by":"crossref","first-page":"1072","DOI":"10.1093\/bioinformatics\/btt086","article-title":"QUAST: quality assessment tool for genome assemblies","volume":"29","author":"Gurevich","year":"2013","journal-title":"Bioinformatics"},{"issue":"7","key":"2020011102320767500_C59","doi-asserted-by":"crossref","first-page":"1043","DOI":"10.1101\/gr.186072.114","article-title":"Checkm: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes","volume":"25","author":"Parks","year":"2015","journal-title":"Genome Re"},{"issue":"7","key":"2020011102320767500_C60","doi-asserted-by":"crossref","first-page":"1088","DOI":"10.1093\/bioinformatics\/btv697","article-title":"MetaQUAST: evaluation of metagenome assemblies","volume":"32","author":"Mikheenko","year":"2015","journal-title":"Bioinformatics"},{"key":"2020011102320767500_C61","first-page":"612","article-title":"Evaluating the quantitative capabilities of metagenomic analysis software","volume-title":"Curr Microbiol","author":"Kerepesi","year":"2016"},{"issue":"2","key":"2020011102320767500_C62","doi-asserted-by":"publisher","first-page":"538","DOI":"10.1016\/j.gene.2013.10.015","article-title":"AmphoraNet: The webserver implementation of the AMPHORA2 metagenomic workflow suite","volume":"533","author":"Kerepesi","year":"2014","journal-title":"Gene"},{"key":"2020011102320767500_C63","first-page":"1","article-title":"Machine learning for metagenomics: methods and tools","volume-title":"Metagenomics","author":"Soueidan","year":"2017"},{"key":"2020011102320767500_C64","doi-asserted-by":"crossref","first-page":"512","DOI":"10.1007\/978-3-662-44845-8_47","article-title":"Machine learning approaches for metagenomics","author":"Rangwala","year":"2014","journal-title":"Machine Learning and Knowledge Discovery in Databases,"},{"issue":"4","key":"2020011102320767500_C65","first-page":"207","article-title":"Artificial intelligence in biological data","volume":"7","author":"Chakraborty","year":"2017","journal-title":"J Inf Technol Softw Eng"},{"key":"2020011102320767500_C66","first-page":"851","article-title":"Deep learning in bioinformatics","volume-title":"Brief Bioinformatics","author":"Min","year":"2016"},{"issue":"06","key":"2020011102320767500_C67","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1142\/S0129065799000563","article-title":"DNA fragment assembly using neural prediction techniques","volume":"09","author":"Angeleri","year":"1999","journal-title":"Int J Neural Syst"},{"key":"2020011102320767500_C68","first-page":"337","article-title":"A time-efficient, linear-space local similarity algorithm","volume-title":"Adv Appl Math","author":"Huang","year":"1991"},{"key":"2020011102320767500_C69","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1016\/j.procs.2017.05.242","article-title":"Machine learning models in error and variant detection in high-variation high-throughput sequencing datasets","volume":"108","author":"Krachunov","year":"2017","journal-title":"Procedia Comput Sci"},{"issue":"6","key":"2020011102320767500_C70","doi-asserted-by":"crossref","first-page":"744","DOI":"10.1093\/bioinformatics\/btm608","article-title":"A machine-learning approach to combined evidence validation of genome assemblies","volume":"24","author":"Choi","year":"2008","journal-title":"Bioinformatics"},{"key":"2020011102320767500_C71","doi-asserted-by":"crossref","DOI":"10.1109\/ICCABS.2013.6629196","article-title":"An unsupervised learning approach to assembly validation","volume-title":"2013 IEEE 3rd International Conference on Computational Advances in Bio and medical Sciences (ICCABS).","author":"Lanc","year":"2013"},{"key":"2020011102320767500_C72","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1007\/978-1-4939-3578-9_17","article-title":"Introducing machine learning concepts with WEKA","author":"Smith","year":"2016","journal-title":"Methods in Molecular Biology,"},{"issue":"1","key":"2020011102320767500_C73","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1186\/s12859-015-0644-7","article-title":"supervised ranking of contigs in de novo assemblies","volume":"16","author":"Kuhring","year":"2015","journal-title":"BMC Bioinformatics"},{"issue":"3","key":"2020011102320767500_C74","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J Mol Biol"},{"issue":"4","key":"2020011102320767500_C75","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1093\/bioinformatics\/bts723","article-title":"ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies","volume":"29","author":"Clark","year":"2013","journal-title":"Bioinformatics"},{"issue":"1","key":"2020011102320767500_C76","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1186\/1471-2105-11-33","article-title":"Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction","volume":"11","author":"Palmer","year":"2010","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2020011102320767500_C77","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1186\/1471-2105-8-64","article-title":"Minimus: a fast, lightweight genome assembler","volume":"8","author":"Sommer","year":"2007","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"2020011102320767500_C78","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1109\/JPROC.2015.2494198","article-title":"Machine learning in genomic medicine: a review of computational problems and data sets","volume":"104","author":"Leung","year":"2016","journal-title":"Proc IEEE Inst Electr Electron Eng"},{"key":"2020011102320767500_C79","doi-asserted-by":"crossref","first-page":"514","DOI":"10.1109\/ACCESS.2014.2325029","article-title":"Big data deep learning: challenges and perspectives","volume":"2","author":"Chen","year":"2014","journal-title":"IEEE Access"},{"issue":"6218","key":"2020011102320767500_C80","doi-asserted-by":"crossref","first-page":"1254806","DOI":"10.1126\/science.1254806","article-title":"The human splicing code reveals new insights into the genetic determinants of disease","volume":"347","author":"Xiong","year":"2014","journal-title":"Science"},{"key":"2020011102320767500_C81","article-title":"Creating a universal SNP and small indel variant caller with deep","author":"Poplin","year":"2016","journal-title":"neural networks."},{"issue":"6","key":"2020011102320767500_C82","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1109\/MSP.2017.2743240","article-title":"Deep reinforcement learning: A brief survey","volume":"34","author":"Arulkumaran","year":"2017","journal-title":"IEEE Signal Process Mag"},{"key":"2020011102320767500_C83","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1007\/978-3-319-07064-3_6","article-title":"Play Ms. Pac-Man using an advanced reinforcement learning agent","volume-title":"Artificial Intelligence: Methods and Applications,","author":"Tziortziotis","year":"2014"},{"key":"2020011102320767500_C84","first-page":"278","article-title":"PacBio sequencing and its applications","volume-title":"Genomics Proteomics Bioinformatics","author":"Rhoads","year":"2015"},{"key":"2020011102320767500_C85","article-title":"Efficiency of PacBio long read correction by 2nd generation illumina sequencing","author":"Mahmoud","year":"2017","journal-title":"Genomics"},{"key":"2020011102320767500_C86","volume-title":"Smrt sequencing: read lengths,","author":"PacificBioscience","year":"2018"},{"issue":"6","key":"2020011102320767500_C87","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1093\/bioinformatics\/btw321","article-title":"Accurate self-correction of errors in long reads using de bruijn graphs","volume":"33","author":"Salmela","year":"2017","journal-title":"Bioinformatics"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/20\/6\/2116\/31789390\/bby072.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/20\/6\/2116\/31789390\/bby072.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,4]],"date-time":"2023-09-04T00:13:14Z","timestamp":1693786394000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/20\/6\/2116\/5074612"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,8,17]]},"references-count":87,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2018,8,17]]},"published-print":{"date-parts":[[2019,11,27]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bby072","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,11]]},"published":{"date-parts":[[2018,8,17]]}}}