{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:16:20Z","timestamp":1772172980648,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009805","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,3,8]],"date-time":"2022-03-08T00:00:00Z","timestamp":1646697600000}}],"reference-count":46,"publisher":"Public Library of Science (PLoS)","issue":"2","license":[{"start":{"date-parts":[[2022,2,11]],"date-time":"2022-02-11T00:00:00Z","timestamp":1644537600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"The Oxford Martin School Programme on Pandemic Genomics"},{"name":"Medical Research Council","award":["MR\/R015600\/1"],"award-info":[{"award-number":["MR\/R015600\/1"]}]},{"name":"UK Department for International Development"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Inferring the dynamics of pathogen transmission during an outbreak is an important problem in infectious disease epidemiology. In mathematical epidemiology, estimates are often informed by time series of confirmed cases, while in phylodynamics genetic sequences of the pathogen, sampled through time, are the primary data source. Each type of data provides different, and potentially complementary, insight. Recent studies have recognised that combining data sources can improve estimates of the transmission rate and the number of infected individuals. However, inference methods are typically highly specialised and field-specific and are either computationally prohibitive or require intensive simulation, limiting their real-time utility. We present a novel birth-death phylogenetic model and derive a tractable analytic approximation of its likelihood, the computational complexity of which is linear in the size of the dataset. This approach combines epidemiological and phylodynamic data to produce estimates of key parameters of transmission dynamics and the unobserved prevalence. Using simulated data, we show (a) that the approximation agrees well with existing methods, (b) validate the claim of linear complexity and (c) explore robustness to model misspecification. This approximation facilitates inference on large datasets, which is increasingly important as large genomic sequence datasets become commonplace.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1009805","type":"journal-article","created":{"date-parts":[[2022,2,11]],"date-time":"2022-02-11T13:33:51Z","timestamp":1644586431000},"page":"e1009805","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":14,"title":["A computationally tractable birth-death model that combines phylogenetic and epidemiological data"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1824-7653","authenticated-orcid":true,"given":"Alexander Eugene","family":"Zarebski","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0352-6289","authenticated-orcid":true,"given":"Louis","family":"du Plessis","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7806-3605","authenticated-orcid":true,"given":"Kris Varun","family":"Parag","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8797-2667","authenticated-orcid":true,"given":"Oliver George","family":"Pybus","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,2,11]]},"reference":[{"key":"pcbi.1009805.ref001","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-540-78911-6","volume-title":"Mathematical Epidemiology","author":"F Brauer","year":"2008"},{"issue":"6","key":"pcbi.1009805.ref002","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1038\/nrmicro1845","article-title":"Mathematical models of infectious disease transmission","volume":"6","author":"NC Grassly","year":"2008","journal-title":"Nature Reviews Microbiology"},{"issue":"8","key":"pcbi.1009805.ref003","doi-asserted-by":"crossref","first-page":"540","DOI":"10.1038\/nrg2583","article-title":"Evolutionary analysis of the dynamics of viral infectious disease","volume":"10","author":"OG Pybus","year":"2009","journal-title":"Nature Reviews Genetics"},{"issue":"1","key":"pcbi.1009805.ref004","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1093\/molbev\/msr217","article-title":"Estimating the Basic Reproductive Number from Viral Sequence Data","volume":"29","author":"T Stadler","year":"2011","journal-title":"Molecular Biology and Evolution"},{"issue":"8","key":"pcbi.1009805.ref005","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1002136","article-title":"Inference for Nonlinear Epidemiological Models Using Genealogies and Time Series","volume":"7","author":"DA Rasmussen","year":"2011","journal-title":"PLOS Computational Biology"},{"issue":"1","key":"pcbi.1009805.ref006","doi-asserted-by":"crossref","first-page":"12","DOI":"10.3390\/tropicalmed4010012","article-title":"Accounting for Healthcare-Seeking Behaviours and Testing Practices in Real-Time Influenza Forecasts","volume":"4","author":"R Moss","year":"2019","journal-title":"Tropical Medicine and Infectious Disease"},{"key":"pcbi.1009805.ref007","article-title":"Infectious disease phylodynamics with occurrence data","author":"LA Featherstone","year":"2020","journal-title":"bioRxiv"},{"key":"pcbi.1009805.ref008","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/j.epidem.2014.09.004","article-title":"Four key challenges in infectious disease modelling using data from multiple sources","volume":"10","author":"DD Angelis","year":"2015","journal-title":"Epidemics"},{"key":"pcbi.1009805.ref009","doi-asserted-by":"crossref","first-page":"100393","DOI":"10.1016\/j.epidem.2020.100393","article-title":"Influencing public health policy with data-informed mathematical models of infectious diseases: Recent developments and new challenges","volume":"32","author":"A Alahmadi","year":"2020","journal-title":"Epidemics"},{"issue":"3","key":"pcbi.1009805.ref010","doi-asserted-by":"crossref","first-page":"1055","DOI":"10.1534\/genetics.113.154856","article-title":"Relating Phylogenetic Trees to Transmission Trees of Infectious Disease Outbreaks","volume":"195","author":"RJF Ypma","year":"2013","journal-title":"Genetics"},{"issue":"6","key":"pcbi.1009805.ref011","doi-asserted-by":"crossref","first-page":"1163","DOI":"10.1093\/sysbio\/syaa035","article-title":"Adaptive Estimation for Epidemic Renewal and Phylogenetic Skyline Models","volume":"69","author":"KV Parag","year":"2020","journal-title":"Systematic Biology"},{"issue":"1309","key":"pcbi.1009805.ref012","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1098\/rstb.1994.0068","article-title":"The reconstructed evolutionary process","volume":"344","author":"S Nee","year":"1994","journal-title":"Philosophical Transactions of the Royal Society of London Series B: Biological Sciences"},{"issue":"1","key":"pcbi.1009805.ref013","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1073\/pnas.1207965110","article-title":"Birth-death skyline plot reveals temporal changes of epidemic spread in HIV and hepatitis C virus (HCV)","volume":"110","author":"T Stadler","year":"2013","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"pcbi.1009805.ref014","doi-asserted-by":"crossref","first-page":"27","DOI":"10.2307\/3213548","article-title":"On the Genealogy of Large Populations","volume":"19","author":"JFC Kingman","year":"1982","journal-title":"Journal of Applied Probability"},{"issue":"3","key":"pcbi.1009805.ref015","doi-asserted-by":"crossref","first-page":"1429","DOI":"10.1093\/genetics\/155.3.1429","article-title":"An Integrated Framework for the Inference of Viral Population History From Reconstructed Genealogies","volume":"155","author":"OG Pybus","year":"2000","journal-title":"Genetics"},{"issue":"5525","key":"pcbi.1009805.ref016","doi-asserted-by":"crossref","first-page":"2323","DOI":"10.1126\/science.1058321","article-title":"The Epidemic Behavior of the Hepatitis C Virus","volume":"292","author":"OG Pybus","year":"2001","journal-title":"Science"},{"issue":"4","key":"pcbi.1009805.ref017","doi-asserted-by":"crossref","first-page":"1421","DOI":"10.1534\/genetics.109.106021","article-title":"Phylodynamics of Infectious Disease Epidemics","volume":"183","author":"EM Volz","year":"2009","journal-title":"Genetics"},{"issue":"2","key":"pcbi.1009805.ref018","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1534\/genetics.114.172791","article-title":"Inferring Epidemiological Dynamics with Bayesian Coalescent Inference: The Merits of Deterministic and Stochastic Models","volume":"199","author":"A Popinga","year":"2015","journal-title":"Genetics"},{"key":"pcbi.1009805.ref019","unstructured":"Tang M, Dudas G, Bedford T, Minin VN. Fitting stochastic epidemic models to gene genealogies using linear noise approximation. arXiv e-prints. 2019;."},{"issue":"6","key":"pcbi.1009805.ref020","doi-asserted-by":"crossref","first-page":"1041","DOI":"10.1093\/sysbio\/syw050","article-title":"Understanding Past Population Dynamics: Bayesian Coalescent-Based Modeling with Covariates","volume":"65","author":"MS Gill","year":"2016","journal-title":"Systematic Biology"},{"issue":"4","key":"pcbi.1009805.ref021","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1003570","article-title":"Phylodynamic Inference for Structured Epidemiological Models","volume":"10","author":"DA Rasmussen","year":"2014","journal-title":"PLOS Computational Biology"},{"issue":"3","key":"pcbi.1009805.ref022","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1004789","article-title":"Quantifying and Mitigating the Effect of Preferential Sampling on Phylodynamic Inference","volume":"12","author":"MD Karcher","year":"2016","journal-title":"PLOS Computational Biology"},{"issue":"8","key":"pcbi.1009805.ref023","doi-asserted-by":"crossref","first-page":"2414","DOI":"10.1093\/molbev\/msaa016","article-title":"Jointly Inferring the Dynamics of Population Size and Sampling Intensity from Molecular Sequences","volume":"37","author":"KV Parag","year":"2020","journal-title":"Molecular Biology and Evolution"},{"issue":"1","key":"pcbi.1009805.ref024","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1214\/aoms\/1177730285","article-title":"On the Generalized \u201cBirth-and-Death\u201d Process","volume":"19","author":"DG Kendall","year":"1948","journal-title":"The Annals of Mathematical Statistics"},{"issue":"3","key":"pcbi.1009805.ref025","doi-asserted-by":"crossref","first-page":"396","DOI":"10.1016\/j.jtbi.2010.09.010","article-title":"Sampling-through-time in birth-death trees","volume":"267","author":"T Stadler","year":"2010","journal-title":"Journal of Theoretical Biology"},{"issue":"94","key":"pcbi.1009805.ref026","doi-asserted-by":"crossref","first-page":"20131106","DOI":"10.1098\/rsif.2013.1106","article-title":"Simultaneous reconstruction of evolutionary history and epidemiological dynamics from viral sequences with the birth-death SIR model","volume":"11","author":"D K\u00fchnert","year":"2014","journal-title":"Journal of The Royal Society Interface"},{"issue":"1","key":"pcbi.1009805.ref027","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1093\/sysbio\/syab049","article-title":"Unifying Phylogenetic Birth\u2013Death Models in Epidemiology and Macroevolution","volume":"71","author":"A MacPherson","year":"2021","journal-title":"Systematic Biology"},{"issue":"3","key":"pcbi.1009805.ref028","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1111\/j.1467-9868.2009.00736.x","article-title":"Particle Markov chain Monte Carlo methods","volume":"72","author":"C Andrieu","year":"2010","journal-title":"Journal of the Royal Statistical Society: Series B (Statistical Methodology)"},{"issue":"8","key":"pcbi.1009805.ref029","doi-asserted-by":"crossref","first-page":"1804","DOI":"10.1093\/molbev\/msz106","article-title":"Estimating Epidemic Incidence and Prevalence from Genomic Data","volume":"36","author":"TG Vaughan","year":"2019","journal-title":"Molecular Biology and Evolution"},{"issue":"11","key":"pcbi.1009805.ref030","doi-asserted-by":"crossref","first-page":"2982","DOI":"10.1093\/molbev\/msx195","article-title":"Quantifying Transmission Heterogeneity Using Both Pathogen Phylogenies and Incidence Time Series","volume":"34","author":"LM Li","year":"2017","journal-title":"Molecular Biology and Evolution"},{"issue":"11","key":"pcbi.1009805.ref031","first-page":"1","article-title":"A Systematic Bayesian Integration of Epidemiological and Genetic Data","volume":"11","author":"MSY Lau","year":"2015","journal-title":"PLOS Computational Biology"},{"key":"pcbi.1009805.ref032","doi-asserted-by":"crossref","first-page":"110400","DOI":"10.1016\/j.jtbi.2020.110400","article-title":"The probability distribution of the ancestral population size conditioned on the reconstructed phylogenetic tree with occurrence data","volume":"509","author":"M Manceau","year":"2021","journal-title":"Journal of Theoretical Biology"},{"key":"pcbi.1009805.ref033","doi-asserted-by":"crossref","first-page":"110115","DOI":"10.1016\/j.jtbi.2019.110115","article-title":"The probability distribution of the reconstructed phylogenetic tree with occurrence data","volume":"488","author":"A Gupta","year":"2020","journal-title":"Journal of Theoretical Biology"},{"issue":"1","key":"pcbi.1009805.ref034","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1137\/S00361445024180","article-title":"Nineteen Dubious Ways to Compute the Exponential of a Matrix, Twenty-Five Years Later","volume":"45","author":"C Moler","year":"2003","journal-title":"SIAM Review"},{"issue":"6530","key":"pcbi.1009805.ref035","doi-asserted-by":"crossref","first-page":"708","DOI":"10.1126\/science.abf2946","article-title":"Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK","volume":"371","author":"L du Plessis","year":"2021","journal-title":"Science"},{"issue":"6","key":"pcbi.1009805.ref036","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1093\/aje\/kwh255","article-title":"Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures","volume":"160","author":"J Wallinga","year":"2004","journal-title":"American Journal of Epidemiology"},{"issue":"8","key":"pcbi.1009805.ref037","doi-asserted-by":"crossref","first-page":"2102","DOI":"10.1093\/molbev\/msw064","article-title":"Phylodynamics with Migration: A Computational Framework to Quantify Population Structure from Genomic Data","volume":"33","author":"D K\u00fchnert","year":"2016","journal-title":"Molecular Biology and Evolution"},{"issue":"7","key":"pcbi.1009805.ref038","doi-asserted-by":"crossref","first-page":"1459","DOI":"10.1093\/molbev\/msn090","article-title":"Smooth Skyride through a Rough Skyline: Bayesian Coalescent-Based Inference of Population Dynamics","volume":"25","author":"VN Minin","year":"2008","journal-title":"Molecular Biology and Evolution"},{"issue":"2","key":"pcbi.1009805.ref039","first-page":"184","article-title":"A Characteristic Property of Linear Growth Birth and Death Processes","volume":"50","author":"B Ycart","year":"1988","journal-title":"Sankhy\u0101: The Indian Journal of Statistics, Series A"},{"issue":"1","key":"pcbi.1009805.ref040","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1017\/S0269964815000297","article-title":"Linear Birth\/Immigration-Death Process with Binomial Catastrophes","volume":"30","author":"S Kapodistria","year":"2016","journal-title":"Probability in the Engineering and Informational Sciences"},{"key":"pcbi.1009805.ref041","article-title":"Estimates of outbreak-specific SARS-CoV-2 epidemiological parameters from genomic data","author":"TG Vaughan","year":"2020","journal-title":"medRxiv"},{"key":"pcbi.1009805.ref042","article-title":"Fundamental identifiability limits in molecular epidemiology","author":"S Louca","year":"2021","journal-title":"bioRxiv"},{"issue":"6540","key":"pcbi.1009805.ref043","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1126\/science.abf8003","article-title":"Timing the SARS-CoV-2 index case in Hubei province","volume":"372","author":"J Pekar","year":"2021","journal-title":"Science"},{"issue":"4","key":"pcbi.1009805.ref044","doi-asserted-by":"crossref","first-page":"694","DOI":"10.1111\/j.1558-5646.1995.tb02306.x","article-title":"Inferring the Rates of Branching and Extinction from Molecular Phylogenies","volume":"49","author":"T Kubo","year":"1995","journal-title":"Evolution"},{"issue":"12","key":"pcbi.1009805.ref045","doi-asserted-by":"crossref","first-page":"729","DOI":"10.1016\/j.tree.2013.09.007","article-title":"Phylogenetic estimates of speciation and extinction rates for testing ecological and evolutionary hypotheses","volume":"28","author":"RA Pyron","year":"2013","journal-title":"Trends in Ecology & Evolution"},{"issue":"12","key":"pcbi.1009805.ref046","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1003919","article-title":"Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration","volume":"10","author":"A Gavryushkina","year":"2014","journal-title":"PLOS Computational Biology"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009805","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,3,8]],"date-time":"2022-03-08T00:00:00Z","timestamp":1646697600000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009805","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,8]],"date-time":"2022-03-08T13:54:36Z","timestamp":1646747676000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009805"}},"subtitle":[],"editor":[{"given":"Alison L.","family":"Hill","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,2,11]]},"references-count":46,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2022,2,11]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009805","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.10.21.349068","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,11]]}}}