{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:02Z","timestamp":1772138042310,"version":"3.50.1"},"reference-count":14,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2021,2,3]],"date-time":"2021-02-03T00:00:00Z","timestamp":1612310400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"National Science and Engineering Research Council (NSERC) Discovery grant"},{"DOI":"10.13039\/501100000196","name":"Canada Foundation for Innovation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000196","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Genome Canada\/Genome Alberta"},{"name":"Alberta Children\u2019s Hospital Research Institute Postdoctoral Fellowship"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,9]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Summary<\/jats:title>\n                    <jats:p>Many tools can reconstruct viral sequences based on next-generation sequencing reads. Although existing tools effectively recover local regions, their accuracy suffers when reconstructing the whole viral genomes (strains). Moreover, they consume significant memory when the sequencing coverage is high or when the genome size is large. We present WgLink to meet this challenge. WgLink takes local reconstructions produced by other tools as input and patches the resulting segments together into coherent whole-genome strains. We accomplish this using an L0+L1-regularized regression, synthesizing variant allele frequency data with physical linkage between multiple variants spanning multiple regions simultaneously. WgLink achieves higher accuracy than existing tools both on simulated and on real datasets while using significantly less memory (RAM) and fewer CPU hours.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>Source code and binaries are freely available at https:\/\/github.com\/theLongLab\/wglink.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab076","type":"journal-article","created":{"date-parts":[[2021,1,30]],"date-time":"2021-01-30T07:17:49Z","timestamp":1611991069000},"page":"2744-2746","source":"Crossref","is-referenced-by-count":10,"title":["WgLink: reconstructing whole-genome viral haplotypes using L0+L1-regularization"],"prefix":"10.1093","volume":"37","author":[{"given":"Chen","family":"Cao","sequence":"first","affiliation":[{"name":"Department of Biochemistry and Molecular Biology, Alberta Children\u2019s Hospital Research Institute, Calgary, AB, T2N 4N1 Canada"}]},{"given":"Matthew","family":"Greenberg","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Statistics"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3795-1873","authenticated-orcid":false,"given":"Quan","family":"Long","sequence":"additional","affiliation":[{"name":"Department of Biochemistry and Molecular Biology, Alberta Children\u2019s Hospital Research Institute, Calgary, AB, T2N 4N1 Canada"},{"name":"Department of Mathematics and Statistics"},{"name":"Department of Medical Genetics, Hotchkiss Brain Institute, University of Calgary , Calgary, AB, T2N 4N1 Canada"}]}],"member":"286","published-online":{"date-parts":[[2021,2,3]]},"reference":[{"key":"2023051609214614100_btab076-B1","doi-asserted-by":"crossref","first-page":"i23","DOI":"10.1093\/bioinformatics\/bty291","article-title":"Viral quasispecies reconstruction via tensor factorization with successive read removal","volume":"34","author":"Ahn","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051609214614100_btab076-B2","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1089\/cmb.2017.0249","article-title":"aBayesQR: a Bayesian method for reconstruction of viral populations characterized by low diversity","volume":"25","author":"Ahn","year":"2018","journal-title":"J. Comput. Biol"},{"key":"2023051609214614100_btab076-B3","doi-asserted-by":"crossref","first-page":"2260","DOI":"10.1038\/s41467-017-02209-5","article-title":"Strain profiling and epidemiology of bacterial species from metagenomic sequencing","volume":"8","author":"Albanese","year":"2017","journal-title":"Nat. Commun"},{"key":"2023051609214614100_btab076-B4","doi-asserted-by":"crossref","first-page":"2385","DOI":"10.1093\/bioinformatics\/btz942","article-title":"A haplotype-aware de novo assembly of related individuals using pedigree sequence graph","volume":"36","author":"Garg","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051609214614100_btab076-B5","doi-asserted-by":"crossref","first-page":"e115","DOI":"10.1093\/nar\/gku537","article-title":"Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations","volume":"42","author":"Giallonardo","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023051609214614100_btab076-B6","article-title":"Fast best subset selection: coordinate descent and local combinatorial optimization algorithms","volume":"68","author":"Hazimeh","year":"2018","journal-title":"Oper. Res"},{"key":"2023051609214614100_btab076-B7","doi-asserted-by":"crossref","DOI":"10.1101\/2020.02.25.964445","article-title":"SDip: a novel graph-based approach to haplotype-aware assembly based structural variant calling in targeted segmental duplications sequencing","volume-title":"bioRxiv","author":"Heller","year":"2020"},{"key":"2023051609214614100_btab076-B8","article-title":"CliqueSNV: scalable reconstruction of intrahost viral populations from NGS reads","author":"Knyazev","year":"2018","journal-title":"bioRxiv"},{"key":"2023051609214614100_btab076-B9","doi-asserted-by":"crossref","first-page":"2455","DOI":"10.1093\/bioinformatics\/btx187","article-title":"A penalized regression approach to haplotype reconstruction of viral populations arising in early HIV\/SIV infection","volume":"33","author":"Leviyang","year":"2017","journal-title":"Bioinformatics"},{"key":"2023051609214614100_btab076-B10","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btp324","article-title":"Fast and accurate short read alignment with Burrows-Wheeler transform","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051609214614100_btab076-B11","doi-asserted-by":"crossref","first-page":"e15292","DOI":"10.1371\/journal.pone.0015292","article-title":"PoolHap: inferring haplotype frequencies from pooled samples by next generation sequencing","volume":"6","author":"Long","year":"2011","journal-title":"PLoS One"},{"key":"2023051609214614100_btab076-B12","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1101\/gr.213959.116","article-title":"metaSPAdes: a new versatile metagenomic assembler","volume":"27","author":"Nurk","year":"2017","journal-title":"Genome Res"},{"key":"2023051609214614100_btab076-B13","doi-asserted-by":"crossref","first-page":"24614","DOI":"10.1073\/pnas.2017726117","article-title":"Low genetic diversity may be an Achilles heel of SARS-CoV-2","volume":"117","author":"Rausch","year":"2020","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051609214614100_btab076-B14","doi-asserted-by":"crossref","first-page":"e1003515","DOI":"10.1371\/journal.pcbi.1003515","article-title":"Viral quasispecies assembly via maximal clique enumeration","volume":"10","author":"Topfer","year":"2014","journal-title":"PLoS Comput. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab076\/36254811\/btab076.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/17\/2744\/50339384\/btab076.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/17\/2744\/50339384\/btab076.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T05:27:25Z","timestamp":1684214845000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/17\/2744\/6126800"}},"subtitle":[],"editor":[{"given":"Inanc","family":"Birol","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,2,3]]},"references-count":14,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2021,9,9]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab076","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.08.14.251835","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9,1]]},"published":{"date-parts":[[2021,2,3]]}}}