{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:18:44Z","timestamp":1772173124973,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010696","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,12,15]],"date-time":"2022-12-15T00:00:00Z","timestamp":1671062400000}}],"reference-count":32,"publisher":"Public Library of Science (PLoS)","issue":"12","license":[{"start":{"date-parts":[[2022,12,5]],"date-time":"2022-12-05T00:00:00Z","timestamp":1670198400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000060","name":"National Institute of Allergy and Infectious Diseases","doi-asserted-by":"publisher","award":["R01AI147336"],"award-info":[{"award-number":["R01AI147336"]}],"id":[{"id":"10.13039\/100000060","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000060","name":"National Institute of Allergy and Infectious Diseases","doi-asserted-by":"publisher","award":["R01AI097045"],"award-info":[{"award-number":["R01AI097045"]}],"id":[{"id":"10.13039\/100000060","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>\n                    Identifying host factors that influence infectious disease transmission is an important step toward developing interventions to reduce disease incidence. Recent advances in methods for reconstructing infectious disease transmission events using pathogen genomic and epidemiological data open the door for investigation of host factors that affect onward transmission. While most transmission reconstruction methods are designed to work with densely sampled outbreaks, these methods are making their way into surveillance studies, where the fraction of sampled cases with sequenced pathogens could be relatively low. Surveillance studies that use transmission event reconstruction then use the reconstructed events as response variables (i.e., infection source status of each sampled case) and use host characteristics as predictors (e.g., presence of HIV infection) in regression models. We use simulations to study estimation of the effect of a host factor on probability of being an infection source via this multi-step inferential procedure. Using\n                    <jats:monospace>TransPhylo<\/jats:monospace>\n                    \u2014a widely-used method for Bayesian estimation of infectious disease transmission events\u2014and logistic regression, we find that low sensitivity of identifying infection sources leads to dilution of the signal, biasing logistic regression coefficients toward zero. We show that increasing the proportion of sampled cases improves sensitivity and some, but not all properties of the logistic regression inference. Application of these approaches to real world data from a population-based TB study in Botswana fails to detect an association between HIV infection and probability of being a TB infection source. We conclude that application of a pipeline, where one first uses\n                    <jats:monospace>TransPhylo<\/jats:monospace>\n                    and sparsely sampled surveillance data to infer transmission events and then estimates effects of host characteristics on probabilities of these events, should be accompanied by a realistic simulation study to better understand biases stemming from imprecise transmission event inference.\n                  <\/jats:p>","DOI":"10.1371\/journal.pcbi.1010696","type":"journal-article","created":{"date-parts":[[2022,12,5]],"date-time":"2022-12-05T13:31:35Z","timestamp":1670247095000},"page":"e1010696","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":11,"title":["Using genetic data to identify transmission risk factors: Statistical assessment and application to tuberculosis transmission"],"prefix":"10.1371","volume":"18","author":[{"given":"Isaac H.","family":"Goldstein","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2819-7645","authenticated-orcid":true,"given":"Damon","family":"Bayer","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4789-2777","authenticated-orcid":true,"given":"Ivan","family":"Barilar","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9905-9322","authenticated-orcid":true,"given":"Balladiah","family":"Kizito","sequence":"additional","affiliation":[]},{"given":"Ogopotse","family":"Matsiri","sequence":"additional","affiliation":[]},{"given":"Chawangwa","family":"Modongo","sequence":"additional","affiliation":[]},{"given":"Nicola M.","family":"Zetola","sequence":"additional","affiliation":[]},{"given":"Stefan","family":"Niemann","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1917-9288","authenticated-orcid":true,"given":"Volodymyr M.","family":"Minin","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9982-4373","authenticated-orcid":true,"given":"Sanghyuk S.","family":"Shin","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,12,5]]},"reference":[{"issue":"3","key":"pcbi.1010696.ref001","doi-asserted-by":"crossref","first-page":"e65","DOI":"10.1016\/S1473-3099(18)30477-8","article-title":"Advances in the understanding of Mycobacterium tuberculosis transmission in HIV-endemic settings","volume":"19","author":"JS Peters","year":"2019","journal-title":"The Lancet Infectious diseases"},{"issue":"9","key":"pcbi.1010696.ref002","doi-asserted-by":"crossref","first-page":"1510","DOI":"10.1093\/infdis\/jiu628","article-title":"HIV infection and the transmission of tuberculosis","volume":"211","author":"TA Yates","year":"2015","journal-title":"J Infect Dis"},{"issue":"8","key":"pcbi.1010696.ref003","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1056\/NEJMoa1003176","article-title":"Whole-genome sequencing and social-network analysis of a tuberculosis outbreak","volume":"364","author":"JL Gardy","year":"2011","journal-title":"New England Journal of Medicine"},{"issue":"1","key":"pcbi.1010696.ref004","first-page":"287","article-title":"Using genomics data to reconstruct transmission trees during disease outbreaks","volume":"35","author":"M Hall","year":"2016","journal-title":"Revue scientifique et technique (International Office of Epizootics)"},{"issue":"4","key":"pcbi.1010696.ref005","first-page":"997","article-title":"Genomic infectious disease epidemiology in partially sampled and ongoing outbreaks","volume":"34","author":"X Didelot","year":"2017","journal-title":"Molecular Biology and Evolution"},{"issue":"3","key":"pcbi.1010696.ref006","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1093\/molbev\/msx304","article-title":"PHYLOSCANNER: inferring transmission from within-and between-host pathogen genetic diversity","volume":"35","author":"C Wymant","year":"2018","journal-title":"Molecular Biology and Evolution"},{"issue":"12","key":"pcbi.1010696.ref007","doi-asserted-by":"crossref","first-page":"e1003397","DOI":"10.1371\/journal.pcbi.1003397","article-title":"Inferring the source of transmission with phylogenetic data","volume":"9","author":"EM Volz","year":"2013","journal-title":"PLoS Computational Biology"},{"issue":"12","key":"pcbi.1010696.ref008","doi-asserted-by":"crossref","first-page":"e1004613","DOI":"10.1371\/journal.pcbi.1004613","article-title":"Epidemic reconstruction in a phylogenetics framework: transmission trees as partitions of the node set","volume":"11","author":"M Hall","year":"2015","journal-title":"PLoS Computational Biology"},{"issue":"3","key":"pcbi.1010696.ref009","doi-asserted-by":"crossref","first-page":"e1006930","DOI":"10.1371\/journal.pcbi.1006930","article-title":"Bayesian inference of transmission chains using timing of symptoms, pathogen genomes and contact data","volume":"15","author":"F Campbell","year":"2019","journal-title":"PLoS Computational Biology"},{"issue":"11","key":"pcbi.1010696.ref010","article-title":"outbreaker2: a modular platform for outbreak reconstruction","volume":"19","author":"F Campbell","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1010696.ref011","doi-asserted-by":"crossref","unstructured":"Sledzieski S, Zhang C, Mandoiu I, Bansal MS. TreeFix-TP: phylogenetic error-correction for infectious disease transmission network inference. In: BIOCOMPUTING 2021: Proceedings of the Pacific Symposium. World Scientific; 2020.","DOI":"10.1142\/9789811232701_0012"},{"key":"pcbi.1010696.ref012","article-title":"Branching process models to identify risk factors for infectious disease transmission","author":"SK Gallagher","year":"2021","journal-title":"Journal of Computational and Graphical Statistics"},{"issue":"10","key":"pcbi.1010696.ref013","doi-asserted-by":"crossref","first-page":"e1002961","DOI":"10.1371\/journal.pmed.1002961","article-title":"High-resolution mapping of tuberculosis transmission: Whole genome sequencing and phylogenetic modelling of a cohort from Valencia Region, Spain","volume":"16","author":"Y Xu","year":"2019","journal-title":"PLoS Medicine"},{"issue":"7","key":"pcbi.1010696.ref014","doi-asserted-by":"crossref","first-page":"788","DOI":"10.1016\/S1473-3099(18)30218-4","article-title":"Internal migration and transmission dynamics of tuberculosis in Shanghai, China: an epidemiological, spatial, genomic analysis","volume":"18","author":"C Yang","year":"2018","journal-title":"The Lancet Infectious Diseases"},{"issue":"4","key":"pcbi.1010696.ref015","doi-asserted-by":"crossref","DOI":"10.1099\/mgen.0.000361","article-title":"Bayesian reconstruction of Mycobacterium tuberculosis transmission networks in a high incidence area over two decades in Malawi reveals associated risk factors and genomic variants","volume":"6","author":"B Sobkowiak","year":"2020","journal-title":"Microbial Genomics"},{"key":"pcbi.1010696.ref016","unstructured":"R Core Team. R: A Language and Environment for Statistical Computing; 2020. Available from: https:\/\/www.R-project.org\/."},{"issue":"8","key":"pcbi.1010696.ref017","doi-asserted-by":"crossref","first-page":"1002","DOI":"10.1111\/2041-210X.13422","article-title":"nosoi: A stochastic agent-based transmission chain simulation framework in R","volume":"11","author":"S Lequime","year":"2020","journal-title":"Methods in Ecology and Evolution"},{"key":"pcbi.1010696.ref018","unstructured":"Organization WH. Global Tuberculosis Report 2020;. https:\/\/www.who.int\/publications-detail-redirect\/9789240013131."},{"issue":"4","key":"pcbi.1010696.ref019","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1093\/biomet\/86.4.843","article-title":"Bias and efficiency loss due to misclassified responses in binary regression","volume":"86","author":"JM Neuhaus","year":"1999","journal-title":"Biometrika"},{"key":"pcbi.1010696.ref020","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1111\/biom.13400","article-title":"Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification","volume":"78","author":"LJ Beesley","year":"2022","journal-title":"Biometrics"},{"key":"pcbi.1010696.ref021","unstructured":"Botswana Country Profile;. https:\/\/www.cdc.gov\/globalhivtb\/where-we-work\/botswana\/botswana.html."},{"issue":"5","key":"pcbi.1010696.ref022","doi-asserted-by":"crossref","first-page":"e010046","DOI":"10.1136\/bmjopen-2015-010046","article-title":"Protocol for a population-based molecular epidemiology study of tuberculosis transmission in a high HIV-burden setting: the Botswana Kopanyo study","volume":"6","author":"N Zetola","year":"2016","journal-title":"BMJ Open"},{"issue":"3","key":"pcbi.1010696.ref023","doi-asserted-by":"crossref","first-page":"835","DOI":"10.3201\/eid2703.203840","article-title":"Population-Based Geospatial and Molecular Epidemiologic Study of Tuberculosis Transmission Dynamics, Botswana, 2012\u20132016","volume":"27","author":"NM Zetola","year":"2021","journal-title":"Emerging Infectious Diseases"},{"key":"pcbi.1010696.ref024","doi-asserted-by":"crossref","first-page":"e5895","DOI":"10.7717\/peerj.5895","article-title":"MTBseq: a comprehensive pipeline for whole genome sequence analysis of Mycobacterium tuberculosis complex isolates","volume":"6","author":"TA Kohl","year":"2018","journal-title":"PeerJ"},{"issue":"4","key":"pcbi.1010696.ref025","doi-asserted-by":"crossref","first-page":"e1006650","DOI":"10.1371\/journal.pcbi.1006650","article-title":"BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis","volume":"15","author":"R Bouckaert","year":"2019","journal-title":"PLoS Computational Biology"},{"key":"pcbi.1010696.ref026","unstructured":"Bouackert, Remco. Ascertainment correction\/Sampling bias; 2019. Available from: https:\/\/www.beast2.org\/2019\/07\/18\/ascertainment-correction.html."},{"issue":"9","key":"pcbi.1010696.ref027","doi-asserted-by":"crossref","first-page":"e1008067","DOI":"10.1371\/journal.ppat.1008067","article-title":"The molecular clock of Mycobacterium tuberculosis","volume":"15","author":"F Menardo","year":"2019","journal-title":"PLoS Pathogens"},{"issue":"1","key":"pcbi.1010696.ref028","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1111\/2041-210X.12628","article-title":"ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data","volume":"8","author":"G Yu","year":"2017","journal-title":"Methods in Ecology and Evolution"},{"key":"pcbi.1010696.ref029","article-title":"Comparing transmission reconstruction models with Mycobacterium tuberculosis whole genome sequence data","author":"B Sobkowiak","year":"2022","journal-title":"bioRxiv"},{"issue":"1","key":"pcbi.1010696.ref030","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1097\/EDE.0000000000001414","article-title":"What Can Genetic Relatedness Tell Us About Risk Factors for Tuberculosis Transmission?","volume":"33","author":"SV Leavitt","year":"2021","journal-title":"Epidemiology"},{"issue":"1","key":"pcbi.1010696.ref031","doi-asserted-by":"crossref","DOI":"10.1186\/s12916-020-01849-7","article-title":"The impact of HIV infection on tuberculosis transmission in a country with low tuberculosis incidence: a national retrospective study using molecular epidemiology","volume":"18","author":"JR Winter","year":"2020","journal-title":"BMC Medicine"},{"issue":"1","key":"pcbi.1010696.ref032","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1093\/infdis\/jiu403","article-title":"Transmission of tuberculosis in a South African community with a high prevalence of HIV infection","volume":"211","author":"K Middelkoop","year":"2015","journal-title":"The Journal of Infectious Diseases"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010696","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,12,15]],"date-time":"2022-12-15T00:00:00Z","timestamp":1671062400000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010696","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,15]],"date-time":"2022-12-15T14:07:48Z","timestamp":1671113268000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010696"}},"subtitle":[],"editor":[{"given":"Nicola","family":"Segata","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,12,5]]},"references-count":32,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2022,12,5]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010696","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.12.12.21267687","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,5]]}}}