Loading…

RECONSTRUCTING TRANSMISSION TREES FOR COMMUNICABLE DISEASES USING DENSELY SAMPLED GENETIC DATA

Whole genome sequencing of pathogens from multiple hosts in an epidemic offers the potential to investigate who infected whom with unparalleled resolution, potentially yielding important insights into disease dynamics and the impact of control measures. We considered disease outbreaks in a setting w...

Full description

Saved in:
Bibliographic Details
Published in:The annals of applied statistics 2016-03, Vol.10 (1), p.395-417
Main Authors: Worby, Colin J., O'Neill, Philip D., Kypraios, Theodore, Robotham, Julie V., De Angelis, Daniela, Cartwright, Edward J. P., Peacock, Sharon J., Cooper, Ben S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Whole genome sequencing of pathogens from multiple hosts in an epidemic offers the potential to investigate who infected whom with unparalleled resolution, potentially yielding important insights into disease dynamics and the impact of control measures. We considered disease outbreaks in a setting with dense genomic sampling, and formulated stochastic epidemic models to investigate person-to-person transmission, based on observed genomic and epidemiological data. We constructed models in which the genetic distance between sampled genotypes depends on the epidemiological relationship between the hosts. A data-augmented Markov chain Monte Carlo algorithm was used to sample over the transmission trees, providing a posterior probability for any given transmission route. We investigated the predictive performance of our methodology using simulated data, demonstrating high sensitivity and specificity, particularly for rapidly mutating pathogens with low transmissibility. We then analyzed data collected during an outbreak of methicillin-resistant Staphylococcus aureus in a hospital, identifying probable transmission routes and estimating epidemiological parameters. Our approach overcomes limitations of previous methods, providing a framework with the flexibility to allow for unobserved infection times, multiple independent introductions of the pathogen and within-host genetic diversity, as well as allowing forward simulation.
ISSN:1932-6157
1941-7330
DOI:10.1214/15-AOAS898