{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:51Z","timestamp":1772138091813,"version":"3.50.1"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2018,11,8]],"date-time":"2018-11-08T00:00:00Z","timestamp":1541635200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01AI108441"],"award-info":[{"award-number":["R01AI108441"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Providence\/Boston Center for AIDS Research","award":["P30AI042853"],"award-info":[{"award-number":["P30AI042853"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Next-generation deep sequencing of viral genomes, particularly on the Illumina platform, is increasingly applied in HIV research. Yet, there is no standard protocol or method used by the research community to account for measurement errors that arise during sample preparation and sequencing. Correctly calling high and low-frequency variants while controlling for erroneous variants is an important precursor to downstream interpretation, such as studying the emergence of HIV drug-resistance mutations, which in turn has clinical applications and can improve patient care.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We developed a new variant-calling pipeline, hivmmer, for Illumina sequences from HIV viral genomes. First, we validated hivmmer by comparing it to other variant-calling pipelines on real HIV plasmid datasets. We found that hivmmer achieves a lower rate of erroneous variants, and that all methods agree on the frequency of correctly called variants. Next, we compared the methods on an HIV plasmid dataset that was sequenced using Primer ID, an amplicon-tagging protocol, which is designed to reduce errors and amplification bias during library preparation. We show that the Primer ID consensus exhibits fewer erroneous variants compared to the variant-calling pipelines, and that hivmmer more closely approaches this low error rate compared to the other pipelines. The frequency estimates from the Primer ID consensus do not differ significantly from those of the variant-calling pipelines.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>hivmmer is freely available for non-commercial use from https:\/\/github.com\/kantorlab\/hivmmer.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty919","type":"journal-article","created":{"date-parts":[[2018,11,6]],"date-time":"2018-11-06T15:12:19Z","timestamp":1541517139000},"page":"2029-2035","source":"Crossref","is-referenced-by-count":25,"title":["Measurement error and variant-calling in deep Illumina sequencing of HIV"],"prefix":"10.1093","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0764-4090","authenticated-orcid":false,"given":"Mark","family":"Howison","sequence":"first","affiliation":[{"name":"Watson Institute for International and Public Affairs"}]},{"given":"Mia","family":"Coetzer","sequence":"additional","affiliation":[{"name":"Division of Infectious Diseases, The Alpert Medical School, Brown University, Providence, RI, USA"}]},{"given":"Rami","family":"Kantor","sequence":"additional","affiliation":[{"name":"Division of Infectious Diseases, The Alpert Medical School, Brown University, Providence, RI, USA"}]}],"member":"286","published-online":{"date-parts":[[2018,11,8]]},"reference":[{"key":"2023012713073029000_bty919-B1","doi-asserted-by":"crossref","first-page":"e579","DOI":"10.1016\/S2352-3018(16)30119-9","article-title":"Pretreatment HIV-drug resistance in Mexico and its impact on the effectiveness of first-line antiretroviral therapy: a nationally representative 2015 WHO survey","volume":"3","author":"\u00c1vila R\u00edos","year":"2016","journal-title":"Lancet HIV"},{"key":"2023012713073029000_bty919-B2","doi-asserted-by":"crossref","first-page":"329","DOI":"10.3389\/fmicb.2012.00329","article-title":"Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data","volume":"3","author":"Beerenwinkel","year":"2012","journal-title":"Front. Microbiol"},{"key":"2023012713073029000_bty919-B3","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1186\/s12977-016-0321-6","article-title":"Ultrasensitive single-genome sequencing: accurate, targeted, next generation sequencing of HIV-1 RNA","volume":"13","author":"Boltz","year":"2016","journal-title":"Retrovirology"},{"key":"2023012713073029000_bty919-B4","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/j.virusres.2016.12.008","article-title":"Promises and pitfalls of Illumina sequencing for HIV resistance genotyping","volume":"239","author":"Brumme","year":"2016","journal-title":"Virus Res"},{"key":"2023012713073029000_bty919-B5","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1016\/j.virusres.2016.10.019","article-title":"Deep sequencing for HIV-1 clinical management","volume":"239","author":"Casadell\u00e0","year":"2017","journal-title":"Virus Res"},{"key":"2023012713073029000_bty919-B6","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1146\/annurev-genom-091212-153406","article-title":"Deep sequencing of HIV: clinical and research applications","volume":"15","author":"Chabria","year":"2014","journal-title":"Annu. Rev. Genomics Hum. Genet"},{"key":"2023012713073029000_bty919-B7","doi-asserted-by":"crossref","first-page":"96.","DOI":"10.1186\/1471-2164-14-96","article-title":"Ultra-deep mutant spectrum profiling: improving sequencing accuracy using overlapping read pairs","volume":"14","author":"Chen-Harris","year":"2013","journal-title":"BMC Genomics"},{"key":"2023012713073029000_bty919-B8","doi-asserted-by":"crossref","first-page":"e115","DOI":"10.1093\/nar\/gku537","article-title":"Full-length haplotype reconstruction to infer the structure of heterogeneous virus populations","volume":"42","author":"Di Giallonardo","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023012713073029000_bty919-B9","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1186\/s12977-014-0122-8","article-title":"Cross-clade simultaneous HIV drug resistance genotyping for reverse transcriptase, protease, and integrase inhibitor mutations by Illumina MiSeq","volume":"11","author":"Dudley","year":"2014","journal-title":"Retrovirology"},{"key":"2023012713073029000_bty919-B10","doi-asserted-by":"crossref","first-page":"e1002195.","DOI":"10.1371\/journal.pcbi.1002195","article-title":"Accelerated profile HMM searches","volume":"7","author":"Eddy","year":"2011","journal-title":"PLoS Comput. Biol"},{"key":"2023012713073029000_bty919-B11","doi-asserted-by":"crossref","first-page":"3349","DOI":"10.1093\/jac\/dku278","article-title":"Cost-efficient HIV-1 drug resistance surveillance using multiplexed high-throughput amplicon sequencing: implications for use in low- and middle-income countries","volume":"69","author":"Ekici","year":"2014","journal-title":"J. Antimicrob. Chemother"},{"key":"2023012713073029000_bty919-B12","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.jcv.2014.11.014","article-title":"Next generation sequencing improves detection of drug resistance mutations in infants after PMTCT failure","volume":"62","author":"Fisher","year":"2015","journal-title":"J. Clin. Virol"},{"key":"2023012713073029000_bty919-B13","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1038\/nrg.2016.49","article-title":"Coming of age: ten years of next-generation sequencing technologies","volume":"17","author":"Goodwin","year":"2016","journal-title":"Nat. Rev. Genet"},{"key":"2023012713073029000_bty919-B14","doi-asserted-by":"crossref","first-page":"20166","DOI":"10.1073\/pnas.1110064108","article-title":"Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID","volume":"108","author":"Jabara","year":"2011","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023012713073029000_bty919-B15","author":"Ji","year":"2015"},{"key":"2023012713073029000_bty919-B16","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/nmeth.1923","article-title":"Fast gapped-read alignment with Bowtie 2","volume":"9","author":"Langmead","year":"2012","journal-title":"Nat. Methods"},{"key":"2023012713073029000_bty919-B17","doi-asserted-by":"crossref","first-page":"6824","DOI":"10.1128\/AAC.01490-15","article-title":"HIV drug resistance testing by high-multiplex \u201cWide\u201d sequencing on the MiSeq instrument","volume":"59","author":"Lapointe","year":"2015","journal-title":"Antimicrob. Agents Chemother"},{"key":"2023012713073029000_bty919-B18","doi-asserted-by":"crossref","first-page":"2345.","DOI":"10.1097\/QAD.0000000000001619","article-title":"Prevalence and clinical impacts of HIV-1 intersubtype recombinants in Uganda revealed by near-full-genome population and deep sequencing approaches","volume":"31","author":"Lee","year":"2017","journal-title":"AIDS"},{"key":"2023012713073029000_bty919-B19","doi-asserted-by":"crossref","first-page":"S829","DOI":"10.1093\/infdis\/jix397","article-title":"Next-generation human immunodeficiency virus sequencing for patient management and drug resistance surveillance","volume":"216","author":"Noguera-Julian","year":"2017","journal-title":"J. Infect. Dis"},{"key":"2023012713073029000_bty919-B20","doi-asserted-by":"crossref","first-page":"1258","DOI":"10.3389\/fmicb.2015.01258","article-title":"Quasispecies analyses of the HIV-1 near-full-length genome with Illumina MiSeq","volume":"6","author":"Ode","year":"2015","journal-title":"Front. Microbiol"},{"key":"2023012713073029000_bty919-B21","doi-asserted-by":"crossref","first-page":"229.","DOI":"10.1186\/s12864-015-1456-x","article-title":"Distinguishing low frequency mutations from RT-PCR and sequence errors in viral deep sequencing data","volume":"16","author":"Orton","year":"2015","journal-title":"BMC Genomics"},{"key":"2023012713073029000_bty919-B22","doi-asserted-by":"crossref","first-page":"e112674","DOI":"10.1371\/journal.pone.0112674","article-title":"Deep sequencing of HIV-1 near full-length proviral genomes identifies high rates of BF1 recombinants including two novel circulating recombinant forms (CRF) 70_BF1 and a disseminating 71_BF1 among blood donors in Pernambuco, Brazil","volume":"9","author":"Pess\u00f4a","year":"2014","journal-title":"PLoS One"},{"key":"2023012713073029000_bty919-B23","doi-asserted-by":"crossref","first-page":"e0152499.","DOI":"10.1371\/journal.pone.0152499","article-title":"Ultra-deep sequencing of HIV-1 near full-length and partial proviral genomes reveals high genetic diversity among Brazilian blood donors","volume":"11","author":"Pess\u00f4a","year":"2016","journal-title":"PLoS One"},{"key":"2023012713073029000_bty919-B24","doi-asserted-by":"crossref","first-page":"464.","DOI":"10.1186\/s12864-016-2669-3","article-title":"High-specificity detection of rare alleles with paired-end low error sequencing (PELE-Seq)","volume":"17","author":"Preston","year":"2016","journal-title":"BMC Genomics"},{"key":"2023012713073029000_bty919-B25","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/j.jcv.2014.06.013","article-title":"Deep sequencing: becoming a critical tool in clinical virology","volume":"61","author":"Qui\u00f1ones-Mateu","year":"2014","journal-title":"J. Clin. Virol"},{"key":"2023012713073029000_bty919-B26","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1016\/j.jmb.2015.12.012","article-title":"A comprehensive analysis of primer IDs to study heterogeneous HIV-1 populations","volume":"428","author":"Seifert","year":"2016","journal-title":"J. Mol. Biol"},{"key":"2023012713073029000_bty919-B27","doi-asserted-by":"crossref","first-page":"1769","DOI":"10.1172\/JCI4948","article-title":"A 6-basepair insert in the reverse transcriptase gene of human immunodeficiency virus type 1 confers resistance to multiple nucleoside inhibitors","volume":"102","author":"Winters","year":"1998","journal-title":"J. Clin. Invest"},{"key":"2023012713073029000_bty919-B28","doi-asserted-by":"crossref","first-page":"vey007","DOI":"10.1093\/ve\/vey007","article-title":"Easy and accurate reconstruction of whole HIV genomes from short-read sequence data with shiver","volume":"4","author":"Wymant","year":"2018","journal-title":"Virus Evol"},{"key":"2023012713073029000_bty919-B29","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1016\/j.virusres.2016.12.009","article-title":"Error rates, PCR recombination, and sampling depth in HIV-1 whole genome deep sequencing","volume":"239","author":"Zanini","year":"2017","journal-title":"Virus Res"},{"key":"2023012713073029000_bty919-B30","doi-asserted-by":"crossref","first-page":"614","DOI":"10.1093\/bioinformatics\/btt593","article-title":"PEAR: a fast and accurate Illumina Paired-End reAd mergeR","volume":"30","author":"Zhang","year":"2014","journal-title":"Bioinformatics"},{"key":"2023012713073029000_bty919-B31","doi-asserted-by":"crossref","first-page":"8540","DOI":"10.1128\/JVI.00522-15","article-title":"Primer ID validates template sampling depth and greatly reduces the error rate of next-generation sequencing of HIV-1 genomic RNA populations","volume":"89","author":"Zhou","year":"2015","journal-title":"J. Virol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/12\/2029\/48934969\/bioinformatics_35_12_2029.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/12\/2029\/48934969\/bioinformatics_35_12_2029.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T09:12:44Z","timestamp":1674810764000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/12\/2029\/5165375"}},"subtitle":[],"editor":[{"given":"Bonnie","family":"Berger","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,11,8]]},"references-count":31,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2019,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty919","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/276576","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,6]]},"published":{"date-parts":[[2018,11,8]]}}}