Methicillin-resistant Staphylococcus aureus (MRSA) transmission in the hospital setting has been a frequent subject of investigation using bacterial genomes, but previous approaches have not yet fully utilised the extra deductive power provided when multiple pathogen samples are acquired from each host. Here, we use a large dataset of MRSA sequences from multiply-sampled patients to reconstruct colonisation of individuals in a high-transmission setting in a hospital in Thailand. We reconstructed transmission trees for MRSA. We also investigated transmission between anatomical sites on the same individual, finding that this either occurs repeatedly or involves a wide transmission bottleneck. We examined the between-subject bottleneck, finding a wide range in the amount of diversity transmitted. Finally, we compared our approach to the simpler method of identifying transmission pairs using single nucleotide polymorphism (SNP) counts. This suggested that the optimum threshold for identifying a pair is 39 SNPs, if sensitivities and specificities are equally weighted.
Big Data Institute, University of Oxford, Oxford, United Kingdom.