{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T09:55:34Z","timestamp":1775210134693,"version":"3.50.1"},"reference-count":20,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2023,4,3]],"date-time":"2023-04-03T00:00:00Z","timestamp":1680480000000},"content-version":"vor","delay-in-days":2,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"GlaxoSmithKline and the KCL Funded Centre for Doctoral Training"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,4,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Summary<\/jats:title>\n                  <jats:p>The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics for an NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable as it can be deployed and run via either a graphic user interface for non-bioinformaticians and a command line tool for personal computer usage; (iv) is scalable as it can be executed in parallel as a Snakemake workflow, and; (v) is computationally efficient by minimizing RAM and CPU time requirements.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>DNAscan2 is implemented in Python3 and is available at https:\/\/github.com\/KHP-Informatics\/DNAscanv2.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad152","type":"journal-article","created":{"date-parts":[[2023,4,3]],"date-time":"2023-04-03T14:51:41Z","timestamp":1680533501000},"source":"Crossref","is-referenced-by-count":8,"title":["DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1231-2880","authenticated-orcid":false,"given":"Heather","family":"Marriott","sequence":"first","affiliation":[{"name":"Department of Basic and Clinical Neuroscience, King\u2019s College London , London, United Kingdom"},{"name":"Department of Biostatistics and Health Informatics, King\u2019s College London , London, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5183-4844","authenticated-orcid":false,"given":"Renata","family":"Kabiljo","sequence":"additional","affiliation":[{"name":"Department of Basic and Clinical Neuroscience, King\u2019s College London , London, United Kingdom"},{"name":"Department of Biostatistics and Health Informatics, King\u2019s College London , London, United Kingdom"}]},{"given":"Ahmad","family":"Al Khleifat","sequence":"additional","affiliation":[{"name":"Department of Basic and Clinical Neuroscience, King\u2019s College London , London, United Kingdom"}]},{"given":"Richard J","family":"Dobson","sequence":"additional","affiliation":[{"name":"Department of Biostatistics and Health Informatics, King\u2019s College London , London, United Kingdom"},{"name":"NIHR BRC SLAM NHS Foundation Trust , London, United Kingdom"},{"name":"Institute of Health Informatics and NIHR BRC at University College London Hospitals NHS Foundation Trust , London, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4924-7712","authenticated-orcid":false,"given":"Ammar","family":"Al-Chalabi","sequence":"additional","affiliation":[{"name":"Department of Basic and Clinical Neuroscience, King\u2019s College London , London, United Kingdom"},{"name":"King\u2019s College Hospital , London, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5280-5017","authenticated-orcid":false,"given":"Alfredo","family":"Iacoangeli","sequence":"additional","affiliation":[{"name":"Department of Basic and Clinical Neuroscience, King\u2019s College London , London, United Kingdom"},{"name":"Department of Biostatistics and Health Informatics, King\u2019s College London , London, United Kingdom"},{"name":"NIHR BRC SLAM NHS Foundation Trust , London, United Kingdom"}]}],"member":"286","published-online":{"date-parts":[[2023,4,3]]},"reference":[{"key":"2023041820541051600_btad152-B1","doi-asserted-by":"crossref","DOI":"10.1002\/0471142727.mb1910s89","article-title":"Galaxy: a web-based genome analysis tool for experimentalists","author":"Blankenberg","year":"2010","journal-title":"Curr Protoc Mol Biol"},{"key":"2023041820541051600_btad152-B2","doi-asserted-by":"crossref","first-page":"6793","DOI":"10.1038\/s41598-018-25022-6","article-title":"DNAp: a pipeline for DNA-seq data analysis","volume":"8","author":"Causey","year":"2018","journal-title":"Sci Rep"},{"key":"2023041820541051600_btad152-B3","doi-asserted-by":"crossref","first-page":"966","DOI":"10.1038\/nmeth.3505","article-title":"SpeedSeq: ultra-fast personal genome analysis and interpretation","volume":"12","author":"Chiang","year":"2015","journal-title":"Nat Methods"},{"key":"2023041820541051600_btad152-B4","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/s41586-020-2287-8","article-title":"A structural variation reference for medical and population genetics","volume":"581","author":"Collins","year":"2020","journal-title":"Nature"},{"key":"2023041820541051600_btad152-B5","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/ng.806","article-title":"A framework for variation discovery and genotyping using next-generation DNA sequencing data","volume":"43","author":"DePristo","year":"2011","journal-title":"Nat Genet"},{"key":"2023041820541051600_btad152-B6","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1186\/s13059-020-02017-z","article-title":"ExpansionHunter denovo: a computational method for locating known and novel repeat expansions in short-read sequencing data","volume":"21","author":"Dolzhenko","year":"2020","journal-title":"Genome Biol"},{"key":"2023041820541051600_btad152-B7","doi-asserted-by":"crossref","first-page":"1916","DOI":"10.1101\/gr.218032.116","article-title":"The mobile element locator tool (MELT): population-scale mobile element discovery and biology","volume":"27","author":"Gardner","year":"2017","journal-title":"Genome Res"},{"key":"2023041820541051600_btad152-B8","author":"Garrison","year":"2012"},{"key":"2023041820541051600_btad152-B9","doi-asserted-by":"crossref","first-page":"3572","DOI":"10.1093\/bioinformatics\/bty304","article-title":"AnnotSV: an integrated tool for structural variations annotation","volume":"34","author":"Geoffroy","year":"2018","journal-title":"Bioinformatics"},{"key":"2023041820541051600_btad152-B10","doi-asserted-by":"crossref","first-page":"W21","DOI":"10.1093\/nar\/gkab402","article-title":"AnnotSV and knotAnnotSV: a web server for human structural variations annotations, ranking and analysis","volume":"49","author":"Geoffroy","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023041820541051600_btad152-B11","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-019-2791-8","article-title":"DNAscan: personal computer compatible NGS analysis, annotation and visualisation","volume":"20","author":"Iacoangeli","year":"2019","journal-title":"BMC Bioinformatics"},{"key":"2023041820541051600_btad152-B12","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1080\/21678421.2018.1562553","article-title":"ALSgeneScanner: a pipeline for the analysis and interpretation of DNA sequencing data of ALS patients","volume":"20","author":"Iacoangeli","year":"2019","journal-title":"Amyotroph Lateral Scler Frontotemporal Degener"},{"key":"2023041820541051600_btad152-B13","doi-asserted-by":"crossref","first-page":"907","DOI":"10.1038\/s41587-019-0201-4","article-title":"Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype","volume":"37","author":"Kim","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2023041820541051600_btad152-B14","doi-asserted-by":"crossref","first-page":"591","DOI":"10.1038\/s41592-018-0051-x","article-title":"Strelka2: fast and accurate calling of germline and somatic variants","volume":"15","author":"Kim","year":"2018","journal-title":"Nat Methods"},{"key":"2023041820541051600_btad152-B15","author":"Li","year":"2013"},{"key":"2023041820541051600_btad152-B16","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1186\/s13073-015-0221-8","article-title":"A 26-hour system of highly sensitive whole genome sequencing for emergency management of genetic diseases","volume":"7","author":"Miller","year":"2015","journal-title":"Genome Med"},{"key":"2023041820541051600_btad152-B17","author":"Poplin","year":"2018"},{"key":"2023041820541051600_btad152-B18","doi-asserted-by":"crossref","first-page":"i333","DOI":"10.1093\/bioinformatics\/bts378","article-title":"DELLY: structural variant discovery by integrated paired-end and split-read analysis","volume":"28","author":"Rausch","year":"2012","journal-title":"Bioinformatics"},{"key":"2023041820541051600_btad152-B19","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1038\/nature15394","article-title":"An integrated map of structural variation in 2,504 human genomes","volume":"526","author":"Sudmant","year":"2015","journal-title":"Nature"},{"key":"2023041820541051600_btad152-B20","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/giaa145","article-title":"Parliament2: accurate structural variant calling at scale","volume":"9","author":"Zarate","year":"2020","journal-title":"Gigascience"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad152\/49732017\/btad152.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/4\/btad152\/50015840\/btad152.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/4\/btad152\/50015840\/btad152.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,4,18]],"date-time":"2023-04-18T20:54:35Z","timestamp":1681851275000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad152\/7100339"}},"subtitle":[],"editor":[{"given":"Can","family":"Alkan","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,4,1]]},"references-count":20,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,4,3]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad152","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,4,1]]},"published":{"date-parts":[[2023,4,1]]},"article-number":"btad152"}}