{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,24]],"date-time":"2026-01-24T14:15:44Z","timestamp":1769264144523,"version":"3.49.0"},"reference-count":122,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2021,1,8]],"date-time":"2021-01-08T00:00:00Z","timestamp":1610064000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"Key research and development project of Shandong province","award":["2020SFXGFY01"],"award-info":[{"award-number":["2020SFXGFY01"]}]},{"name":"Key research and development project of Shandong province","award":["2020SFXGFY08"],"award-info":[{"award-number":["2020SFXGFY08"]}]},{"name":"National Major Project for Control and Prevention of Infectious Disease in China","award":["2018ZX10101004"],"award-info":[{"award-number":["2018ZX10101004"]}]},{"name":"National Major Project for Control and Prevention of Infectious Disease in China","award":["2017ZX10104001"],"award-info":[{"award-number":["2017ZX10104001"]}]},{"name":"Academic Promotion Programme of Shandong First Medical University","award":["2019QL006"],"award-info":[{"award-number":["2019QL006"]}]},{"name":"National Key Research and Development Programme of China","award":["2020YFC0840800"],"award-info":[{"award-number":["2020YFC0840800"]}]},{"name":"Taishan Scholars Programme of Shandong Province","award":["ts201511056"],"award-info":[{"award-number":["ts201511056"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,3,22]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In early January 2020, the novel coronavirus (SARS-CoV-2) responsible for a pneumonia outbreak in Wuhan, China, was identified using next-generation sequencing (NGS) and readily available bioinformatics pipelines. In addition to virus discovery, these NGS technologies and bioinformatics resources are currently being employed for ongoing genomic surveillance of SARS-CoV-2 worldwide, tracking its spread, evolution and patterns of variation on a global scale. In this review, we summarize the bioinformatics resources used for the discovery and surveillance of SARS-CoV-2. We also discuss the advantages and disadvantages of these bioinformatics resources and highlight areas where additional technical developments are urgently needed. Solutions to these problems will be beneficial not only to the prevention and control of the current COVID-19 pandemic but also to infectious disease outbreaks of the future.<\/jats:p>","DOI":"10.1093\/bib\/bbaa386","type":"journal-article","created":{"date-parts":[[2020,11,30]],"date-time":"2020-11-30T20:15:29Z","timestamp":1606767329000},"page":"631-641","source":"Crossref","is-referenced-by-count":49,"title":["Bioinformatics resources for SARS-CoV-2 discovery and surveillance"],"prefix":"10.1093","volume":"22","author":[{"given":"Tao","family":"Hu","sequence":"first","affiliation":[{"name":"Shandong First Medical University, China"}]},{"given":"Juan","family":"Li","sequence":"additional","affiliation":[{"name":"Shandong First Medical University, China"}]},{"given":"Hong","family":"Zhou","sequence":"additional","affiliation":[{"name":"Shandong First Medical University, China"}]},{"given":"Cixiu","family":"Li","sequence":"additional","affiliation":[{"name":"Shandong First Medical University, China"}]},{"given":"Edward C","family":"Holmes","sequence":"additional","affiliation":[{"name":"University of Sydney, Australia"}]},{"given":"Weifeng","family":"Shi","sequence":"additional","affiliation":[{"name":"Shandong First Medical University, China"}]}],"member":"286","published-online":{"date-parts":[[2021,1,8]]},"reference":[{"key":"2021032314374224700_ref1","doi-asserted-by":"crossref","first-page":"727","DOI":"10.1056\/NEJMoa2001017","article-title":"A novel coronavirus from patients with pneumonia in China. 2019","volume":"382","author":"Zhu","year":"2020","journal-title":"N Engl J Med"},{"key":"2021032314374224700_ref2","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1016\/S0140-6736(20)30251-8","article-title":"Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding","volume":"395","author":"Lu","year":"2020","journal-title":"Lancet"},{"key":"2021032314374224700_ref3","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1038\/s41586-020-2008-3","article-title":"A new coronavirus associated with human respiratory disease in China","volume":"579","author":"Wu","year":"2020","journal-title":"Nature"},{"issue":"11","key":"2021032314374224700_ref4","doi-asserted-by":"crossref","first-page":"1403","DOI":"10.1038\/s41564-020-0770-5","article-title":"A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology","volume":"5","author":"Rambaut","year":"2020","journal-title":"Nat Microbiol"},{"key":"2021032314374224700_ref5","doi-asserted-by":"crossref","first-page":"16151","DOI":"10.1038\/nmicrobiol.2016.151","article-title":"Intra-host dynamics of Ebola virus during 2014","volume":"1","author":"Ni","year":"2016","journal-title":"Nat Microbiol"},{"key":"2021032314374224700_ref6","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1038\/nature20167","article-title":"Redefining the invertebrate RNA virosphere","volume":"540","author":"Shi","year":"2016","journal-title":"Nature"},{"key":"2021032314374224700_ref7","doi-asserted-by":"crossref","first-page":"1168","DOI":"10.1016\/j.cell.2018.02.043","article-title":"Using metagenomics to characterize an expanding virosphere","volume":"172","author":"Zhang","year":"2018","journal-title":"Cell"},{"key":"2021032314374224700_ref8","doi-asserted-by":"crossref","first-page":"991","DOI":"10.1056\/NEJMoa073785","article-title":"A new arenavirus in a cluster of fatal transplant-associated diseases","volume":"358","author":"Palacios","year":"2008","journal-title":"N Engl J Med"},{"key":"2021032314374224700_ref9","doi-asserted-by":"crossref","first-page":"2408","DOI":"10.1056\/NEJMoa1401268","article-title":"Actionable diagnosis of neuroleptospirosis by next-generation sequencing","volume":"370","author":"Wilson","year":"2014","journal-title":"N Engl J Med"},{"key":"2021032314374224700_ref10","doi-asserted-by":"crossref","first-page":"12281","DOI":"10.1038\/s41598-019-48692-2","article-title":"Efficient and specific oligo-based depletion of rRNA","volume":"9","author":"Kraus","year":"2019","journal-title":"Sci Rep"},{"key":"2021032314374224700_ref11","doi-asserted-by":"crossref","first-page":"1261","DOI":"10.1038\/nprot.2017.066","article-title":"Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples","volume":"12","author":"Quick","year":"2017","journal-title":"Nat Protoc"},{"key":"2021032314374224700_ref12","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1186\/s13073-020-00751-4","article-title":"Multiple approaches for massively parallel sequencing of SARS-CoV-2 genomes directly from clinical samples","volume":"12","author":"Xiao","year":"2020","journal-title":"Genome Med"},{"issue":"10","key":"2021032314374224700_ref13","doi-asserted-by":"crossref","first-page":"2401","DOI":"10.3201\/eid2610.201800","article-title":"Rapid, sensitive, full-genome sequencing of severe acute respiratory syndrome coronavirus 2","volume":"26","author":"Paden","year":"2020","journal-title":"Emerg Infect Dis"},{"key":"2021032314374224700_ref14","doi-asserted-by":"crossref","first-page":"990","DOI":"10.1016\/j.cell.2020.04.021","article-title":"Coast-to-coast spread of SARS-CoV-2 during the early epidemic in the United States","volume":"181","author":"Fauver","year":"2020","journal-title":"Cell"},{"key":"2021032314374224700_ref15","article-title":"Nanopore targeted sequencing for the accurate and comprehensive detection of SARS-CoV-2 and other respiratory viruses","author":"Wang","year":"2020","journal-title":"Small"},{"key":"2021032314374224700_ref16","first-page":"241","author":"Sarkozy"},{"key":"2021032314374224700_ref17","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1146\/annurev-pathmechdis-012418-012751","article-title":"Clinical metagenomic next-generation sequencing for pathogen detection","volume":"14","author":"Gu","year":"2019","journal-title":"Annu Rev Pathol"},{"key":"2021032314374224700_ref18","first-page":"449","article-title":"The evolution of nanopore sequencing","volume":"5","author":"Wang","year":"2014","journal-title":"Front Genet"},{"key":"2021032314374224700_ref19","doi-asserted-by":"crossref","first-page":"914","DOI":"10.1016\/j.cell.2020.04.011","article-title":"The architecture of SARS-CoV-2 transcriptome","volume":"181","author":"Kim","year":"2020","journal-title":"Cell"},{"key":"2021032314374224700_ref20","doi-asserted-by":"crossref","first-page":"2114","DOI":"10.1093\/bioinformatics\/btu170","article-title":"Trimmomatic: a flexible trimmer for Illumina sequence data","volume":"30","author":"Bolger","year":"2014","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref21","doi-asserted-by":"crossref","first-page":"10","DOI":"10.14806\/ej.17.1.200","article-title":"CUTADAPT removes adapter sequences from high-throughput sequencing reads","volume":"17","author":"Martin","year":"2011","journal-title":"EMBnet J"},{"key":"2021032314374224700_ref22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/gix120","article-title":"SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data","volume":"7","author":"Chen","year":"2018","journal-title":"Gigascience"},{"key":"2021032314374224700_ref23","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1186\/s12859-017-1469-3","article-title":"AfterQC: automatic filtering, trimming, error removing and quality control for fastq data","volume":"18","author":"Chen","year":"2017","journal-title":"BMC Bioinformatics"},{"key":"2021032314374224700_ref24","doi-asserted-by":"crossref","first-page":"i884","DOI":"10.1093\/bioinformatics\/bty560","article-title":"fastp: an ultra-fast all-in-one FASTQ preprocessor","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref25","doi-asserted-by":"crossref","first-page":"907","DOI":"10.1038\/s41587-019-0201-4","article-title":"Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype","volume":"37","author":"Kim","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2021032314374224700_ref26","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1093\/bioinformatics\/btp698","article-title":"Fast and accurate long-read alignment with burrows-wheeler transform","volume":"26","author":"Li","year":"2010","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref27","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with bowtie 2","volume":"9","author":"Langmead","year":"2012","journal-title":"Nat Methods"},{"key":"2021032314374224700_ref28","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1186\/s12859-018-2336-6","article-title":"Rapid and precise alignment of raw reads against redundant databases with KMA","volume":"19","author":"Clausen","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2021032314374224700_ref29","doi-asserted-by":"crossref","first-page":"3211","DOI":"10.1093\/bioinformatics\/bts611","article-title":"SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data","volume":"28","author":"Kopylova","year":"2012","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref30","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1093\/bfgp\/elr035","article-title":"Comparison of the two major classes of assembly algorithms: overlap-layout-consensus and De Bruijn graph","volume":"11","author":"Li","year":"2012","journal-title":"Brief Funct Genomics"},{"key":"2021032314374224700_ref31","doi-asserted-by":"crossref","first-page":"644","DOI":"10.1038\/nbt.1883","article-title":"Full-length transcriptome assembly from RNA-Seq data without a reference genome","volume":"29","author":"Grabherr","year":"2011","journal-title":"Nat Biotechnol"},{"key":"2021032314374224700_ref32","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.ymeth.2016.02.020","article-title":"MEGAHIT v1.0: a fast and scalable metagenome assembler driven by advanced methodologies and community practices","volume":"102","author":"Li","year":"2016","journal-title":"Methods"},{"key":"2021032314374224700_ref33","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1089\/cmb.2012.0021","article-title":"SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing","volume":"19","author":"Bankevich","year":"2012","journal-title":"J Comput Biol"},{"key":"2021032314374224700_ref34","doi-asserted-by":"crossref","first-page":"909","DOI":"10.1038\/nmeth.1517","article-title":"De novo assembly and analysis of RNA-seq data","volume":"7","author":"Robertson","year":"2010","journal-title":"Nat Methods"},{"key":"2021032314374224700_ref35","doi-asserted-by":"crossref","first-page":"2927","DOI":"10.1093\/bioinformatics\/bty202","article-title":"De novo haplotype reconstruction in viral quasispecies using paired-end read guided path finding","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref36","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1101\/gr.215038.116","article-title":"De novo assembly of viral quasispecies using overlap graphs","volume":"27","author":"Baaijens","year":"2017","journal-title":"Genome Res"},{"key":"2021032314374224700_ref37","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/gigascience\/giz039","article-title":"De novo transcriptome assembly: a comprehensive cross-species comparison of short-read RNA-Seq assemblers","volume":"8","author":"Holzer","year":"2019","journal-title":"Gigascience"},{"key":"2021032314374224700_ref38","article-title":"coronaSPAdes: from biosynthetic gene clusters to coronaviral assemblies","author":"Meleshko","year":"2020","journal-title":"bioRxiv"},{"key":"2021032314374224700_ref39","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nmeth.3176","article-title":"Fast and sensitive protein alignment using DIAMOND","volume":"12","author":"Buchfink","year":"2015","journal-title":"Nat Methods"},{"key":"2021032314374224700_ref40","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"Camacho","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2021032314374224700_ref41","doi-asserted-by":"crossref","first-page":"613","DOI":"10.3390\/v12060613","article-title":"A divergent articulavirus in an Australian gecko identified using meta-transcriptomics and protein structure comparisons","volume":"12","author":"Ortiz-Baez","year":"2020","journal-title":"Viruses"},{"key":"2021032314374224700_ref42","doi-asserted-by":"crossref","first-page":"845","DOI":"10.1038\/nprot.2015.053","article-title":"The Phyre2 web portal for protein modeling, prediction and analysis","volume":"10","author":"Kelley","year":"2015","journal-title":"Nat Protoc"},{"key":"2021032314374224700_ref43","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1038\/nbt.1754","article-title":"Integrative genomics viewer","volume":"29","author":"Robinson","year":"2011","journal-title":"Nat Biotechnol"},{"key":"2021032314374224700_ref44","doi-asserted-by":"crossref","first-page":"1072","DOI":"10.1093\/bioinformatics\/btt086","article-title":"QUAST: quality assessment tool for genome assemblies","volume":"29","author":"Gurevich","year":"2013","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref45","doi-asserted-by":"crossref","first-page":"2666","DOI":"10.1093\/bioinformatics\/bty149","article-title":"NanoPack: visualizing and processing long-read sequencing data","volume":"34","author":"De Coster","year":"2018","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref46","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref47","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1038\/s41592-018-0001-7","article-title":"Accurate detection of complex structural variations using single-molecule sequencing","volume":"15","author":"Sedlazeck","year":"2018","journal-title":"Nat Methods"},{"key":"2021032314374224700_ref48","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1038\/nmeth.3290","article-title":"Improved data analysis for the MinION nanopore sequencer","volume":"12","author":"Jain","year":"2015","journal-title":"Nat Methods"},{"key":"2021032314374224700_ref49","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1101\/gr.215087.116","article-title":"Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation","volume":"27","author":"Koren","year":"2017","journal-title":"Genome Res"},{"key":"2021032314374224700_ref50","doi-asserted-by":"crossref","first-page":"2103","DOI":"10.1093\/bioinformatics\/btw152","article-title":"Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences","volume":"32","author":"Li","year":"2016","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref51","doi-asserted-by":"crossref","first-page":"30494","DOI":"10.2807\/1560-7917.ES.2017.22.13.30494","article-title":"GISAID: global initiative on sharing all influenza data - from vision to reality","volume":"22","author":"Shu","year":"2017","journal-title":"Euro Surveill"},{"key":"2021032314374224700_ref52","doi-asserted-by":"crossref","first-page":"D482","DOI":"10.1093\/nar\/gkw1065","article-title":"Virus variation resource - improved response to emergent viral outbreaks","volume":"45","author":"Hatcher","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref53","first-page":"212","article-title":"The 2019 novel coronavirus resource","volume":"42","author":"Zhao","year":"2020","journal-title":"Yi Chuan"},{"key":"2021032314374224700_ref54","first-page":"47","article-title":"Hepatitis C virus database and bioinformatics analysis tools in the virus pathogen resource (ViPR)","volume":"2019","author":"Zhang","year":"1911","journal-title":"Methods Mol Biol"},{"key":"2021032314374224700_ref55","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1517\/14622416.3.1.131","article-title":"Recent progress in multiple sequence alignment: a survey","volume":"3","author":"Notredame","year":"2002","journal-title":"Pharmacogenomics"},{"key":"2021032314374224700_ref56","doi-asserted-by":"crossref","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","article-title":"CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice","volume":"22","author":"Thompson","year":"1994","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref57","doi-asserted-by":"crossref","first-page":"3059","DOI":"10.1093\/nar\/gkf436","article-title":"MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform","volume":"30","author":"Katoh","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref58","doi-asserted-by":"crossref","first-page":"1792","DOI":"10.1093\/nar\/gkh340","article-title":"MUSCLE: multiple sequence alignment with high accuracy and high throughput","volume":"32","author":"Edgar","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref59","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-Coffee: a novel method for fast and accurate multiple sequence alignment","volume":"302","author":"Notredame","year":"2000","journal-title":"J Mol Biol"},{"key":"2021032314374224700_ref60","doi-asserted-by":"crossref","first-page":"330","DOI":"10.1101\/gr.2821705","article-title":"ProbCons: probabilistic consistency-based multiple sequence alignment","volume":"15","author":"Do","year":"2005","journal-title":"Genome Res"},{"key":"2021032314374224700_ref61","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1093\/bib\/bbn013","article-title":"Recent developments in the MAFFT multiple sequence alignment program","volume":"9","author":"Katoh","year":"2008","journal-title":"Brief Bioinform"},{"key":"2021032314374224700_ref62","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1007\/978-1-62703-646-7_10","article-title":"Phylogeny-aware alignment with PRANK","volume":"1079","author":"Loytynoja","year":"2014","journal-title":"Methods Mol Biol"},{"key":"2021032314374224700_ref63","doi-asserted-by":"crossref","first-page":"2047","DOI":"10.1093\/bioinformatics\/btl175","article-title":"BAli-Phy: simultaneous Bayesian inference of alignment and phylogeny","volume":"22","author":"Suchard","year":"2006","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref64","doi-asserted-by":"crossref","first-page":"2403","DOI":"10.1093\/bioinformatics\/btn457","article-title":"StatAlign: an extendable software package for joint Bayesian estimation of alignments and evolutionary trees","volume":"24","author":"Novak","year":"2008","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref65","doi-asserted-by":"crossref","first-page":"2001","DOI":"10.1093\/bioinformatics\/btr304","article-title":"Java bioinformatics analysis web services for multiple sequence alignment--JABAWS:MSA","volume":"27","author":"Troshin","year":"2011","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref66","doi-asserted-by":"crossref","first-page":"W597","DOI":"10.1093\/nar\/gkt376","article-title":"Analysis tool web services from the EMBL-EBI","volume":"41","author":"McWilliam","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref67","doi-asserted-by":"crossref","first-page":"579","DOI":"10.1186\/1471-2105-11-579","article-title":"webPRANK: a phylogeny-aware multiple sequence aligner with interactive alignment browser","volume":"11","author":"Loytynoja","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2021032314374224700_ref68","doi-asserted-by":"crossref","first-page":"642","DOI":"10.1093\/molbev\/mss256","article-title":"Class of multiple sequence alignment algorithm affects genomic analysis","volume":"30","author":"Blackburne","year":"2013","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref69","doi-asserted-by":"crossref","first-page":"1189","DOI":"10.1093\/bioinformatics\/btp033","article-title":"Jalview version 2--a multiple sequence alignment editor and analysis workbench","volume":"25","author":"Waterhouse","year":"2009","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref70","doi-asserted-by":"crossref","first-page":"3501","DOI":"10.1093\/bioinformatics\/btw474","article-title":"MSAViewer: interactive JavaScript visualization of multiple sequence alignments","volume":"32","author":"Yachdav","year":"2016","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref71","doi-asserted-by":"crossref","first-page":"3276","DOI":"10.1093\/bioinformatics\/btu531","article-title":"AliView: a fast and lightweight alignment viewer and editor for large datasets","volume":"30","author":"Larsson","year":"2014","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref72","first-page":"95","article-title":"BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95\/98\/NT","volume":"41","author":"Hall","year":"1999","journal-title":"Nucleic Acids Symp Ser"},{"key":"2021032314374224700_ref73","first-page":"406","article-title":"The neighbor-joining method: a new method for reconstructing phylogenetic trees","volume":"4","author":"Saitou","year":"1987","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref74","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1093\/sysbio\/20.4.406","article-title":"Toward defining the course of evolution: minimum change for a specific tree topology","volume":"20","author":"Fitch","year":"1971","journal-title":"Syst Biol"},{"key":"2021032314374224700_ref75","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1007\/BF01734359","article-title":"Evolutionary trees from DNA sequences: a maximum likelihood approach","volume":"17","author":"Felsenstein","year":"1981","journal-title":"J Mol Evol"},{"key":"2021032314374224700_ref76","doi-asserted-by":"crossref","first-page":"750","DOI":"10.1093\/oxfordjournals.molbev.a026160","article-title":"Markov Chasin Monte Carlo algorithms for the Bayesian analysis of phylogenetic trees","volume":"16","author":"Larget","year":"1999","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref77","doi-asserted-by":"crossref","first-page":"1253","DOI":"10.1093\/molbev\/msn083","article-title":"jModelTest: phylogenetic model averaging","volume":"25","author":"Posada","year":"2008","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref78","doi-asserted-by":"crossref","first-page":"2104","DOI":"10.1093\/bioinformatics\/bti263","article-title":"ProtTest: selection of best-fit models of protein evolution","volume":"21","author":"Abascal","year":"2005","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref79","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1038\/s41576-020-0233-0","article-title":"Phylogenetic tree building in the genomic age","volume":"21","author":"Kapli","year":"2020","journal-title":"Nat Rev Genet"},{"key":"2021032314374224700_ref80","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1016\/j.dci.2004.07.007","article-title":"Using models of nucleotide evolution to build phylogenetic trees","volume":"29","author":"Bos","year":"2005","journal-title":"Dev Comp Immunol"},{"key":"2021032314374224700_ref81","first-page":"e47","article-title":"Emerging concepts of data integration in pathogen phylodynamics","volume":"66","author":"Baele","year":"2017","journal-title":"Syst Biol"},{"key":"2021032314374224700_ref82","doi-asserted-by":"crossref","first-page":"vew007","DOI":"10.1093\/ve\/vew007","article-title":"Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen)","volume":"2","author":"Rambaut","year":"2016","journal-title":"Virus Evol"},{"key":"2021032314374224700_ref83","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1093\/oxfordjournals.molbev.a025808","article-title":"BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data","volume":"14","author":"Gascuel","year":"1997","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref84","doi-asserted-by":"crossref","first-page":"W557","DOI":"10.1093\/nar\/gki352","article-title":"PHYML online--a web server for fast maximum likelihood-based phylogenetic inference","volume":"33","author":"Guindon","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref85","doi-asserted-by":"crossref","first-page":"1312","DOI":"10.1093\/bioinformatics\/btu033","article-title":"RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies","volume":"30","author":"Stamatakis","year":"2014","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref86","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1093\/molbev\/msu300","article-title":"IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies","volume":"32","author":"Nguyen","year":"2015","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref87","doi-asserted-by":"crossref","first-page":"1572","DOI":"10.1093\/bioinformatics\/btg180","article-title":"MrBayes 3: Bayesian phylogenetic inference under mixed models","volume":"19","author":"Ronquist","year":"2003","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref88","doi-asserted-by":"crossref","first-page":"2286","DOI":"10.1093\/bioinformatics\/btp368","article-title":"PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating","volume":"25","author":"Lartillot","year":"2009","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref89","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1186\/1471-2148-7-214","article-title":"BEAST: Bayesian evolutionary analysis by sampling trees","volume":"7","author":"Drummond","year":"2007","journal-title":"BMC Evol Biol"},{"key":"2021032314374224700_ref90","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1003537","article-title":"BEAST 2: a software platform for Bayesian evolutionary analysis","volume":"10","author":"Bouckaert","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2021032314374224700_ref91","first-page":"Unit 6.4","article-title":"Inferring evolutionary trees with PAUP*","volume":"Chapter 6","author":"Wilgenbusch","year":"2003","journal-title":"Curr Protoc Bioinformatics"},{"key":"2021032314374224700_ref92","doi-asserted-by":"crossref","first-page":"2731","DOI":"10.1093\/molbev\/msr121","article-title":"MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods","volume":"28","author":"Tamura","year":"2011","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref93","doi-asserted-by":"crossref","first-page":"348","DOI":"10.1111\/1755-0998.13096","article-title":"PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies","volume":"20","author":"Zhang","year":"2020","journal-title":"Mol Ecol Resour"},{"key":"2021032314374224700_ref94","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1093\/sysbio\/sys062","article-title":"Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks","volume":"61","author":"Huson","year":"2012","journal-title":"Syst Biol"},{"key":"2021032314374224700_ref95","doi-asserted-by":"crossref","first-page":"3041","DOI":"10.1093\/molbev\/msy194","article-title":"Two methods for mapping and visualizing associated data on phylogeny using Ggtree","volume":"35","author":"Yu","year":"2018","journal-title":"Mol Biol Evol"},{"key":"2021032314374224700_ref96","doi-asserted-by":"crossref","first-page":"W256","DOI":"10.1093\/nar\/gkz239","article-title":"Interactive tree of life (iTOL) v4: recent updates and new developments","volume":"47","author":"Letunic","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref97","doi-asserted-by":"crossref","first-page":"W270","DOI":"10.1093\/nar\/gkz357","article-title":"Evolview v3: a webserver for visualization, annotation, and management of phylogenetic trees","volume":"47","author":"Subramanian","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref98","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1038\/s41586-020-2012-7","article-title":"A pneumonia outbreak associated with a new coronavirus of probable bat origin","volume":"579","author":"Zhou","year":"2020","journal-title":"Nature"},{"issue":"7815","key":"2021032314374224700_ref99","first-page":"282","article-title":"Identification of 2019-nCoV related coronaviruses in Malayan pangolins in southern China","volume":"583","author":"Lam","year":"2020","journal-title":"bioRxiv"},{"issue":"11","key":"2021032314374224700_ref100","doi-asserted-by":"crossref","first-page":"1408","DOI":"10.1038\/s41564-020-0771-4","article-title":"Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic","volume":"5","author":"Boni","year":"2020","journal-title":"Nat Microbiol"},{"key":"2021032314374224700_ref101","doi-asserted-by":"crossref","first-page":"861","DOI":"10.1038\/s41591-020-0877-5","article-title":"Clinical and virologic characteristics of the first 12 patients with coronavirus disease 2019 (COVID-19) in the United States","volume":"26","author":"Covid-Investigation Team","year":"2020","journal-title":"Nat Med"},{"key":"2021032314374224700_ref102","doi-asserted-by":"crossref","first-page":"1547","DOI":"10.1093\/molbev\/msy096","article-title":"MEGA X: molecular evolutionary genetics analysis across computing platforms","volume":"35","author":"Kumar","year":"2018","journal-title":"Mol Biol Evol"},{"issue":"13","key":"2021032314374224700_ref103","doi-asserted-by":"crossref","first-page":"2000305","DOI":"10.2807\/1560-7917.ES.2020.25.13.2000305","article-title":"Whole genome and phylogenetic analysis of two SARS-CoV-2 strains isolated in Italy in January and February 2020: additional clues on multiple introductions and further circulation in Europe","volume":"25","author":"Stefanelli","year":"2020","journal-title":"Euro Surveill"},{"key":"2021032314374224700_ref104","doi-asserted-by":"crossref","first-page":"4121","DOI":"10.1093\/bioinformatics\/bty407","article-title":"Nextstrain: real-time tracking of pathogen evolution","volume":"34","author":"Hadfield","year":"2018","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref105","doi-asserted-by":"crossref","first-page":"336","DOI":"10.1089\/cmb.2006.13.336","article-title":"The average common substring approach to phylogenomic reconstruction","volume":"13","author":"Ulitsky","year":"2006","journal-title":"J Comput Biol"},{"key":"2021032314374224700_ref106","doi-asserted-by":"crossref","first-page":"2323","DOI":"10.1007\/s11434-010-3008-8","article-title":"Whole-genome based Archaea phylogeny and taxonomy: a composition vector approach","volume":"55","author":"Sun","year":"2010","journal-title":"Chin Sci Bull"},{"key":"2021032314374224700_ref107","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1186\/1471-2164-9-517","article-title":"A new method to compute K-mer frequencies and its application to annotate large repetitive plant genomes","volume":"9","author":"Kurtz","year":"2008","journal-title":"BMC Genomics"},{"key":"2021032314374224700_ref108","doi-asserted-by":"crossref","first-page":"D200","DOI":"10.1093\/nar\/gkw1129","article-title":"CDD\/SPARCLE: functional classification of proteins via subfamily domain architectures","volume":"45","author":"Marchler-Bauer","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref109","doi-asserted-by":"crossref","first-page":"743","DOI":"10.1093\/bioinformatics\/16.8.743","article-title":"gff2ps: visualizing genomic annotations","volume":"16","author":"Abril","year":"2000","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref110","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1093\/bib\/5.4.378","article-title":"Vector NTI, a balanced all-in-one sequence analysis suite","volume":"5","author":"Lu","year":"2004","journal-title":"Brief Bioinform"},{"key":"2021032314374224700_ref111","doi-asserted-by":"crossref","first-page":"3359","DOI":"10.1093\/bioinformatics\/btv362","article-title":"IBS: an illustrator for the presentation and visualization of biological sequences","volume":"31","author":"Liu","year":"2015","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref112","author":"Felsenstein","year":"1993"},{"key":"2021032314374224700_ref113","doi-asserted-by":"crossref","first-page":"490","DOI":"10.1016\/j.tim.2016.03.003","article-title":"Epidemiology, genetic recombination, and pathogenesis of coronaviruses","volume":"24","author":"Su","year":"2016","journal-title":"Trends Microbiol"},{"key":"2021032314374224700_ref114","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1128\/JVI.73.1.152-160.1999","article-title":"Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination","volume":"73","author":"Lole","year":"1999","journal-title":"J Virol"},{"key":"2021032314374224700_ref115","doi-asserted-by":"crossref","first-page":"vev003","DOI":"10.1093\/ve\/vev003","article-title":"RDP4: detection and analysis of recombination patterns in virus genomes","volume":"1","author":"Martin","year":"2015","journal-title":"Virus Evol"},{"key":"2021032314374224700_ref116","doi-asserted-by":"crossref","first-page":"W296","DOI":"10.1093\/nar\/gky427","article-title":"SWISS-MODEL: homology modelling of protein structures and complexes","volume":"46","author":"Waterhouse","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2021032314374224700_ref117","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1093\/bioinformatics\/btw638","article-title":"PyMod 2.0: improvements in protein sequence-structure analysis and homology modeling within PyMOL","volume":"33","author":"Janson","year":"2017","journal-title":"Bioinformatics"},{"key":"2021032314374224700_ref118","doi-asserted-by":"crossref","first-page":"2196","DOI":"10.1016\/j.cub.2020.05.023","article-title":"A novel bat coronavirus closely related to SARS-CoV-2 contains natural insertions at the S1\/S2 cleavage site of the spike protein","volume":"30","author":"Zhou","year":"2020","journal-title":"Curr Biol"},{"key":"2021032314374224700_ref119","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1186\/s12864-018-4703-0","article-title":"Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms","volume":"19","author":"Costello","year":"2018","journal-title":"BMC Genomics"},{"key":"2021032314374224700_ref120","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1186\/s12864-019-5569-5","article-title":"Reliable multiplex sequencing with rare index mis-assignment on DNB-based NGS platform","volume":"20","author":"Li","year":"2019","journal-title":"BMC Genomics"},{"issue":"5","key":"2021032314374224700_ref121","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.1111\/1755-0998.13009","article-title":"Index hopping on the Illumina HiseqX platform and its consequences for ancient DNA studies","volume":"20","author":"Valk","year":"2020","journal-title":"Mol Ecol Resour"},{"key":"2021032314374224700_ref122","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.3958883","article-title":"A global phylogeny of SARS-CoV-2 from GISAID data, including sequences deposited up to 31-July-2020. 2020","author":"Lanfear","journal-title":"Zenodo"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/2\/631\/36655618\/bbaa386.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/bib\/article-pdf\/22\/2\/631\/36655618\/bbaa386.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,14]],"date-time":"2023-10-14T08:12:35Z","timestamp":1697271155000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/22\/2\/631\/6067880"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,8]]},"references-count":122,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2021,1,8]]},"published-print":{"date-parts":[[2021,3,22]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbaa386","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,3]]},"published":{"date-parts":[[2021,1,8]]}}}