Evidence of microevolution of Salmonella Typhimurium during a series of egg-associated outbreaks linked to a single chicken farm

Hawkey, Jane; Edwards, David J; Dimovski, Karolina; Hiley, Lester; Billman-Jacobe, Helen; Hogg, Geoff; Holt, Kathryn E

doi:10.1186/1471-2164-14-800

Research article
Open access
Published: 19 November 2013

Evidence of microevolution of Salmonella Typhimurium during a series of egg-associated outbreaks linked to a single chicken farm

Jane Hawkey^1,2,
David J Edwards¹,
Karolina Dimovski³,
Lester Hiley⁴,
Helen Billman-Jacobe²,
Geoff Hogg³ &
…
Kathryn E Holt¹

BMC Genomics volume 14, Article number: 800 (2013) Cite this article

6642 Accesses
49 Citations
2 Altmetric
Metrics details

Abstract

Background

The bacterium Salmonella enterica serovar Typhimurium (S. Typhimurium) is one of the most frequent causes of foodborne outbreaks of gastroenteritis. Between 2005–2008 a series of S. Typhimurium outbreaks occurred in Tasmania, Australia, that were all traced to eggs originating from a single chicken farm. We sequenced the genomes of 12 isolates linked to these outbreaks, in order to investigate the microevolution of a pathogenic S. Typhimurium clone in a natural, spatiotemporally restricted population.

Results

The isolates, which shared a phage type similar to DT135 known locally as 135@ or 135a, formed a clade within the S. Typhimurium population with close similarity to the reference genome SL1334 (160 single nucleotide polymorphisms, or SNPs). Ten of the isolates belonged to a single clone (<23 SNPs between isolate pairs) which likely represents the population of S. Typhimurium circulating at the chicken farm; the other two were from sporadic cases and were genetically distinct from this clone. Divergence dating indicated that all 12 isolates diverged from a common ancestor in the mid 1990s, and the clone began to diversify in 2003–2004. This clone spilled out into the human population several times between 2005–2008, during which time it continued to accumulate SNPs at a constant rate of 3–5 SNPs per year or 1x10^-6 substitutions site^-1 year^-1, faster than the longer-term (~50 year) rates estimated previously for S. Typhimurium. Our data suggest that roughly half of non-synonymous substitutions are rapidly removed from the S. Typhimurium population, after which purifying selection is no longer important and the remaining substitutions become fixed in the population. The S. Typhimurium 135@ isolates were nearly identical to SL1344 in terms of gene content and virulence plasmids. Their phage contents were close to SL1344, except that they carried a different variant of Gifsy-1, lacked the P2 remnant found in SL1344 and carried a novel P2 phage, P2-Hawk, in place SL1344’s P2 phage SopEϕ. DT135 lacks P2 prophage. Two additional plasmids were identified in the S. Typhimurium 135@ isolates, pSTM2 and pSTM7. Both plasmids were IncI1, but phylogenetic analysis of the plasmids and their bacterial hosts shows these plasmids are genetically distinct and result from independent plasmid acquisition events.

Conclusions

This study provides a high-resolution insight into short-term microevolution of the important human pathogen S. Typhimurium. It indicates that purifying selection occurs rapidly in this population (≤6 years) and then declines, and provides an estimate for the short-term substitution rate. The latter is likely to be more relevant for foodborne outbreak investigation than previous estimates based on longer time scales.

Background

Salmonella enterica serovar Typhimurium (S. Typhimurium) is a frequent cause of gastroenteritis in humans [1, 2], including foodborne disease outbreaks [3]. A variant of S. Typhimurium phage type DT135 - sometimes referred to locally as 135@ or 135a but without official phage type designation - is amongst the most common forms of S. Typhimurium in Australia [4, 5] and has been isolated from chickens and eggs in Australia [6]. S. Typhimurium phage type 135 and 135@ have been associated with multiple foodborne outbreaks in Australia, which are generally epidemiologically linked to the consumption of eggs [7–13] or chicken [14–16].

In 2005, 2007 and 2008 a series of seven outbreaks of S. Typhimurium 135@ occurred in Tasmania, Australia [12, 13]. These outbreaks involved 193 microbiologically confirmed cases of S. Typhimurium 135@ infection, and were each linked to the consumption of raw egg-containing foods through epidemiological investigations conducted at the time [12, 13]. For outbreaks 2, 5 and 7, S. Typhimurium 135@ was isolated from a food source implicated during the investigation. While different raw egg-containing foods (bakery goods, mayonnaise) and retail outlets (bakeries, cafés and restaurants) were implicated in the various outbreaks, each was traced back to eggs supplied from the same farm [12, 13]. S. Typhimurium 135@ was isolated from the farm in December 2005 and January 2006, which subsequently ceased to operate.

The series of isolates collected during these outbreaks, linked to a single farm source, provides a unique opportunity to investigate the possible microevolution of a clinically important S. Typhimurium clone in a natural but spatiotemporally contained bacterial population. In order to determine the unique features of S. Typhimurium 135@, consider its microevolution and explore the utility of whole genome sequencing in understanding intermittent foodborne outbreaks, we sequenced 12 isolates associated with the Tasmanian outbreaks and performed phylogenetic and comparative genomic analysis.

Results

Sequencing and phylogenetic analysis

Twelve isolates were selected for sequencing based on their epidemiological link to the Tasmanian outbreaks (including 8/193 isolates from 5 Tasmanian outbreaks, 2/3 farm isolates and 2/100 sporadic isolates) and multi-locus VNTR analysis (MLVA) profiles, such that each outbreak was represented by at least one clinical isolate possessing the dominant MLVA profile for the outbreak; linked food isolates were included where available for that outbreak; and sporadic isolates were those with the closest MLVA profiles to the outbreak isolates. Details of the twelve S. Typhiumurium 135@ isolates, named STm1 to STm12 and sequenced via Illumina HiSeq, are given in Table 1. They include seven isolates from 2005/2006 (4 human cases – 1 sporadic, 1 food isolate and 2 farm isolates); two isolates from 2007 (one from an outbreak case and one sporadic case that occurred six months later); and three isolates from the 2008 outbreak (two human cases and a food isolate). The genome assemblies were first reported in [17]. To investigate the relationships between the outbreak isolates, we mapped the sequenced reads to the reference chromosome sequence for S. Typhiumurium strain SL1344 (phage type DT44, accession NC_016810.1). Using the single nucleotide polymorphisms (SNPs) identified from mapping, we inferred a maximum likelihood phylogenetic tree. We also included all publicly available S. Typhimurium chromosome sequences, including DT135, in the analysis for comparison (Table 2). The tree (Figure 1) confirmed that SL1344 was the closest finished reference to the S. Typhimurium 135@ isolates, and therefore the most suitable reference for read mapping, SNP calling and gene content comparison.

Table 1 S . Typhimurium isolates sequenced for this study

Full size table

Table 2 Publicly available S . Typhimurium genomes included in this study

Full size table

The phylogenetic tree (Figure 1) showed that the S. Typhimurium 135@ isolates formed a clade that was closely related to SL1344 and included a DT12 isolate. This clade (shaded in Figure 1) was separated from SL1344 by just 160 SNPs. Full details of the SNPs that varied within the 135@ clade are provided in Additional file 1: Table S1. Ten of the 12 S. Typhimurium 135@ isolates formed a tight clonal group, displaying ≤23 SNPs between pairs of isolates (red branches in Figures 1, 2). The branch leading to this clone was defined by 32 SNPs that the members of the clone share relative to all other S. Typhimurium analysed (Figures 1, 2). The S. Typhimurium 135@ isolates outside this clone were STm10, isolated during the fourth 2005 outbreak and STm4, isolated from a sporadic case between the 2007 and 2008 outbreaks. The publicly available DT12 genome, isolated from a human infection in the UK in 2009 [26], was equidistant from STm4, STm10 and the S. Typhimurium 135@ clonal group (Figure 1). Interestingly, the publicly available DT135 genome, also isolated from a human infection in the UK in 2009, was not within the S. Typhimurium 135@ group but clustered near the common ancestor of S. Typhimurium 135@ and SL1344.

Dissecting the series of outbreaks

Figure 2 shows a detailed phylogeny for the S. Typhimurium 135@ isolates, linked to a timeline depicting the size and timing of the series of outbreaks as first reported in [12, 13]. S. Typhimurium 135@ isolated from the same outbreak were near-identical to one another (0–5 SNPs), but were differentiated from other outbreaks by >10 SNPs (Figure 2). The two isolates from the second 2005 outbreak (STm3, STm5) shared 6 SNPs that were not detected in other isolates, and differed from one another at just one SNP. STm5 was from a human case, and STm3 was isolated from a piping bag at the bakery to which the outbreak had been traced [13]. The genomic data therefore provides strong independent evidence that the bacteria isolated from the bakery and the human infection were linked by a short chain of transmission. Similarly, the three isolates from the 2008 outbreak (STm6, STm7, STm12) also clustered closely together, sharing 9 SNPs that were not detected in other isolates (Figure 2). This outbreak was traced to a restaurant [12], from which STm6 was isolated from aioli (mayonnaise made with raw egg). Again the genomic data strongly supports the finding that raw egg-containing food at the restaurant was a source of transmission during the outbreak [12].

The clonal group includes single representative cases from three other outbreaks, and two isolates from the farm to which all outbreaks have been linked (STm1, STm2). Interestingly, while isolates from the same outbreak were nearly identical and shared ≥5 SNPs differentiating them from other outbreaks, isolates from different outbreaks were roughly equidistant from each other and from farm isolates (Figure 2). This suggests that the clonal group to which all outbreak isolates belong (red branches in Figures 1, 2) represents a broader population of S. Typhimurium 135@ circulating at the farm, and that this population descends from a single ancestral strain introduced into the farm some time previously. As a corollary, the variation present within the outbreak clone therefore potentially represents the diversification of S. Typhimurium 135@ through microevolutionary processes which occurred in situ at the farm, whose chicken population presumably provided a reservoir host for the bacteria. We therefore refer to this group, highlighted in red in Figures 1 and 2, as the “farm clone”.

Dating the emergence of S. Typhimurium 135@ and the farm clone

In order to estimate the likely date of emergence of the farm clone, we analysed the SNP data using Bayesian and maximum likelihood analyses (Table 3). The maximum likelihood tree provided strong evidence for a constant accumulation of SNPs over time within the farm clone (Pearson R² = 0.71), with a substitution rate of 7 × 10^-7 site^-1 year^-1 or 3 SNPs per chromosome per year. The Bayesian and maximum likelihood methods gave similar results (Table 3), supporting a substitution rate of 5-19 × 10^-7 site^-1 year^-1 or 3–5 SNPs per year and indicating the most recent common ancestor (mrca) for the farm clone existed in 2003–2004 (95% HPD, 2002–2005). Consistent with this, STm8 from the first outbreak in June 2005 had acquired only two SNPs since the mrca, whereas the 2008 isolates had acquired 14–15 SNPs since the mrca (Figure 2).

Table 3 Divergence dating analysis for outbreak-related S. Typhimurium 135@

Full size table

STm4 and STm10 were comparatively very distant from the farm clone, separated from it by 75–100 SNPs and sharing a much older mrca that diverged around 1996–1997 (95% HPD, 1986–2001) (Figure 2). This further supports that STm10 and STm4, while certainly related to the farm clone, cannot be considered part of the same chains of transmission that caused the Tasmanian outbreaks and did not derive from the S. Typhimurium 135@ population circulating at the farm. Consistent with this, STm4 and STm10 were isolated from sporadic cases of S. Typhimurium 135@ in Tasmania with no epidemiological links to the outbreaks, suggesting independent sources of S. Typhimurium 135@ infection.

While we cannot validate the divergence date estimates with independent data, MLVA analysis of our S. Typhimurium 135@ collection showed isolates with MLVA profiles identical to that of the outbreak clone (2-11-10-10-212) were detected in an Australian chicken (backyard, Queensland) and human infection (New South Wales) in 2004. This confirms the presence of the clone in Australia in 2004, consistent with predicted introduction of the ancestor into the Tasmanian chicken farm in 2003–2004. Interestingly, closely related STm135 isolates (MLVA profile 2-11-12-11-212) were collected during a Tasmanian gastroenteritis outbreak in 1994 and from a wild Tasmanian devil in 1996 (STm135, MLVA profile 2-10-7-10-212).

Microevolution and natural selection within S. Typhimurium

Our data set provides a unique opportunity to examine the microevolutionary processes occurring in the S. Typhimurium population, within the spatiotemporal confines of a clonal bacterial population circulating in a host chicken population at a single farm over a 4–6 year period (beginning 2003–2005 and ending in 2008). We mapped each SNP onto the phylogenetic tree (numbers on each branch in Figure 2) and determined its effect on encoded proteins (synonymous SNPs, resulting in no amino acid changes; non-synonymous SNPs, resulting in amino acid changes; or SNPs in non-protein-coding regions), see Additional file 1: Table S1. Across all branches, the mean genome-wide rate of non-synonymous to synonymous substitutions (dN/dS) was 0.52. This is broadly indicative of purifying selection, suggesting that for every 2 non-synonymous SNPs that arise in the S. Typhimurium population, one is deleterious and is removed. As this selective process is expected to take time, a decline in dN/dS is sometimes observed when moving from recent time scales (dN/dS ~ 1, reflecting the underlying substitution rate without selection) through to longer time scales (dN/dS - > 0, as non-synonymous SNPs are removed over time) [27]. Because our data set includes isolates that are separated by varying amounts of evolution (within the farm clone; between the farm clone and other S. Typhimurium 135@; and between more diverse S. Typhimurium), we examined whether the dN/dS rate varied between branches associated with these different evolutionary scales. Among branches separating diverse lineages of Typhimurium (long-term evolution), the average dN/dS was 0.43. On branches separating the three clades of S. Typhimurium 135@ (STm4, STm10 and the farm clone; estimated above to encompass ~12 years of evolution), dN/dS values were 0.45, 0.48 and 0.45. Across all SNPs accumulating within the farm clone (estimated above to encompass 4–6 years of evolution), the dN/dS was 0.46. Hence there is no evidence that dN/dS declines through time in S. Typhimurium; rather, non-synonymous mutations appear to either be removed rapidly (within a few years, ~50% of non-synonymous SNPs) or become fixed within local subpopulations.

We have two outbreaks represented by multiple isolates, from which to examine the accumulation of SNPs during the course of an outbreak (Figure 2). We identified only 6 SNPs that arose during outbreaks; 5 of which were non-synonymous (Additional file 1: Table S1). This number is too small to draw conclusions from, but could be explained by underlying mutation rates without selection (assuming equal mutation rates, we would expect 3.2 non-synonymous mutations for every 1 synonymous mutation observed). For the other outbreaks, for which we have just one representative isolate each, it is impossible to distinguish SNPs that arose during the outbreak from those arising prior to the outbreak (i.e. during circulation at the farm). STm1 and STm2 were isolated at the farm, therefore their differences from the mrca reflect mutations that have arisen during circulation within the farm. The SNPs they have accumulated since the mrca show a dN/dS of 0.6, which is not significantly different from the average within the farm clone (0.46). There were also two internal branches within the clone that represent mutations arising in the farm; the branch leading to the second 2005 outbreak (Figure 2, 1 non-synonymous and 5 synonymous SNPs; dN/dS ~ 0.06) and the branch leading to the 2008 outbreak (Figure 2, 2 non-synonymous and 6 synonymous SNPs plus one intergenic SNP; dN/dS ~ 0.1). Given the short time scale associated with these branches (1–4 years), the lack of non-synonymous SNPs is suggestive of strong purifying selection, which might be expected in a large bacterial population competing within a small, geographically confined host population such as a farm. However since no such paucity of non-synonymous SNPs was evident in the farm isolates STm1 and STm2, and the mean dN/dS within the clone was the same as outside the clone, these low dN/dS values are probably anomalies and there is no evidence for significant difference in the short-term and long-term impacts of purifying selection within this S. Typhimurium population.

Phage content in S. Typhimurium 135@

We also investigated differences in gene content in S. Typhimurium 135@. There were very few differences in chromosomal gene content between S. Typhimurium 135@ and SL1344 (Figure 3). SL1344 contains five prophage sequences as well as prophage remnants, which were mainly conserved in all S. Typhimurium 135@ isolates (Figure 3). The P2 phage remnant of SL1344 (coordinates 2,815,301 - 2,825,986) was missing from S. Typhimurium 135@, and the Gifsy-1 phage of S. Typhimurium 135@ is much closer to that of DT2 than SL1344. The major difference in phage content between SL1344 and S. Typhimurium 135@ was the replacement of SL1344’s SopEϕ prophage with a novel prophage we have designated P2-Hawk (between bases 47,071-94,739 in contig 102, WGS project AMDX02). In SL1344, the P2 SopEϕ prophage is inserted in front of another prophage sequence, a 10 kbp P4 prophage located next to SL1344_2723 (Figure 4). The S. Typhimurium 135@ chromosomes have the same P4 phage located at the same site, but in place of the 34 kbp SopEϕ prophage there is a novel P2 prophage 32 kbp in size (Figure 4). This P2-P4 phage region was completely conserved among all the S. Typhimurium 135@ isolates (Figure 3). The novel P2-Hawk phage in the S. Typhimurium 135@ isolates carries two cargo genes – one is a hypothetical protein with no known function, however the second shares 70% amino acid sequence identity with a phage repressor protein (AbiC; Pfam14355) (dark orange, Figure 4). Nucleotide BLAST searches of the NCBI non-redundant database (accessed July 2013) identified prophage sequences with substantial sequence homology to P2-Hawk in S. Typhimurium T000240 and in ten other S. enterica serovars (Agona, Give, Hadar, Heidelberg, Johannesburg, Newport, Paratyphi A, Paratyphi C, Virchow and Weltevreden) as well as S. enterica subspecies houtenae, often at other insertion sites in the chromosome. These other prophages however lack the two cargo genes. The phage content of the DT135 and DT12 genomes were very similar to S. Typhimurium 135@ (Figure 3), except that they lacked a P2 phage near the P4 integration site (Figure 4) and DT12 harboured an additional P22 phage.

Plasmid content in S. Typhimurium 135@

All S. Typhimurium 135@ isolates carried a copy of the S. Typhimurium virulence plasmid pSLT. We identified only two SNPs that separated the S. Typhimurium 135@ plasmids from SL1344’s pSLT (synonymous C- > T in codon 11 of hypothetical protein SL1344_P1_0060; synonymous G- > A in codon 94 of putative outer membrane protein SL1344_P1_0095) and a single SNP that separated the plasmids of the outbreak clone from those of STm4 and STm10 (non-synonymous G- > T in codon 49 of putative resolvase SL1344_P1_0062). The STm4 and STm12 plasmids also harboured one unique SNP each, both of which were non-synonymous (SL1344_P1_0091, SL1344_P1_0064). No differences in virulence plasmid gene content were identified, amongst the S. Typhimurium 135@ isolates or in comparison with pSLT.

Two of the S. Typhimurium 135@ isolates carried additional plasmids, both of the IncI1 incompatibility group. STm2 (farm isolate, 2005) carried pSTM2 (accession KF290378) and STm7 (human case, 2008) carried pSTM7 (accession KF290377) which included a sul2 gene that confers resistance to sulfonamide antimicrobials. Plasmid pSTM2 did not have any antimicrobial resistance genes but contained a 4.8 kbp region which was not present in pSTM7, encoding aDNA adenine methytransferase (PSTM2_00004), a DNA damage inducible protein I (PSTM2_00006) and several hypothetical proteins (PSTM2_00001, PSTM2_00002, PSTM2_00003, PSTM2_00005). This is consistent with antimicrobial susceptibility typing, which showed that all 12 S. Typhimurium 135@ isolates were susceptible to all antimicrobials tested with the exception of STm7 which was resistant to sulphathiazole. To determine whether pSTM2 and pSTM7 could be related via conjugal transfer between S. Typhimurium 135@ host strains or via microevolution within the farm’s S. Typhimurium 135@ population, we constructed a phylogenetic tree of the IncI1 core genes for these and 23 publicly available IncI1 plasmids (Table 4, Figure 5). This analysis showed that pSTM2 and pSTM7 belong to a subclade including 5 other IncI1 plasmids isolated from Salmonella enterica, three of which were also associated with poultry (Figure 5). However the tree indicates pSTM2 and pSTM7 cannot be directly related to one another via transfer or local microevolution (Figure 5); rather, they represent distinct transfers of IncI1 plasmids into members of the S. Typhimurium 135@ farm clone. This is also supported by plasmid MLST analysis (Table 4), which shows pSTM2 (ST7) and pSTM7 (ST3) belong to different clonal complexes identified via analysis of much larger plasmid collections (clonal complex 7, clonal complex 3). Interestingly STm6, isolated from the restaurant to which the STm7 case was linked, did not carry an IncI1 plasmid, nor did STm12 which was isolated from another human case from the same outbreak. Hence plasmid pSTM7 is likely to have been acquired in vivo during the infection in the human host rather than in the farm environment; this could potentially be related to selection for the sul2 resistance gene in pSTM7.

Table 4 IncI1 plasmid sequences analysed in this study

Full size table

Discussion

Implications for understanding whole genome sequencing (WGS) data in the context of outbreak investigation and source attribution

It is important to note that both the epidemiological data and the phylogenetic approach are crucial for proper interpretation of the data. Firstly, the epidemiological investigation was crucial in linking the infections to specific food sources and tracing these back to a single farm, with a high degree of confidence. This enables us to interpret the clonal group (red branches in Figures 1, 2) as representing a diverse bacterial source population at the farm; without the epidemiological information and farm isolates we could only guess at a common source. Secondly, phylogenetic inference is critical to reaching this understanding. If we were to consider only pairwise differences between isolates, we would be able to identify close relationships between isolates from the same outbreak (0–5 SNPs) and conclude these form part of direct transmission chains; but we would not be able to resolve the nature of the relationships between different outbreaks or between farm isolates. However, the phylogenetic inference shows us that all the outbreaks and the farm isolates share a recent common ancestor, which enables us to understand that these isolates form a single local population resulting from just a few years of clonal expansion. Other studies have also been able to demonstrate the benefits of using WGS and phylogenetic inference in addition to epidemiology in outbreak investigations, finding that WGS methods are stable and consistent with epidemiological results [47, 48].

Our study provides insight into the level of diversity that can be expected during Salmonella outbreaks. It highlights that when considering whether a specific food product, implicated by epidemiological investigation of outbreak cases, is in “immediate source” involved directly in disease transmission (in this case the bakery piping bag and restaurant aioli), we should expect very few mutations (0–1 SNPs) between bacteria isolated from the proposed transmission vehicle and those from cases. However when tracing these food products back to a potential “ultimate source” (in this case a farm), we must understand that we are likely to be sampling from a bacterial source population which has diversified to some extent, and that the transmission chain that led to human infections may represent just one sublineage of the overall diversity present in the ultimate source population. We therefore need to expect more variation between infection isolates and isolates from potential ultimate sources, without ruling out a direct transmission link. Our data also suggests that, if multiple isolates are obtained from a suspected source population, there will likely be added value in generating WGS data on many or all of them rather than sequencing a representative isolate. Even if no source isolates are identical to infection isolates, establishing the diversity range of possible ancestors of source and infection isolates will likely be informative, as it was in this case.

Our data suggests that, as long as epidemiological and phylogenetic approaches were combined as they were here, most of the conclusions from this retrospective analysis could have been made during the outbreak investigation. If WGS had been performed prospectively during the outbreaks, it would have confirmed the existence of a close transmission chain between case (STm5) and food source (STm3) in outbreak 2 (1 SNP), and a close relationship between outbreaks 1 and 2 (10 SNPs, recent common ancestor), see Figure 2. WGS would have confirmed immediately that STm10 (outbreak 3) was not related to the contemporaneous outbreaks 1 and 2 (>75 SNPs) but that outbreak 5 probably was (Figure 2). When the farm isolates were obtained soon after, WGS would have confirmed that these also derived from the common ancestor of outbreaks 1, 2 and 5 (Figure 2), lending further weight to the conclusions of the epidemiological investigation by providing strong phylogenetic evidence that outbreaks 1, 2 and 5 stemmed from a common source population at the farm.

Purifying selection within S. Typhimurium

We found that genome-wide dN/dS was approximately 0.5 across all branches of the S. Typhimurium tree. This is consistent with a previous estimate based on comparison of two S. Typhimurium lineages (0.53) [18]. Our observation that dN/dS rapidly reaches this level and then remains consistent across the phylogeny, is strikingly similar to the pattern observed in S. enterica serovar Typhi, the agent of typhoid fever [49]. In S. Typhi, dN/dS within WGS-defined subclones was roughly equal to one, reflecting the underlying mutation rates and an absence of purifying selection over short time scales; this is very similar to our observation that intra-outbreak SNPs in S. Typhimurium were consistent with underlying mutation rates. Similarly, in S. Typhi SNPs that differentiated WGS-defined lineages (similar to the scale of the farm clone vs STm4 and STm10) or occurred on the oldest internal branches of the WGS tree (similar to the scale of the non-S. Typhimurium 135@ branches in our S. Typhimurium tree) showed a dN/dS 0.46-0.52 [49]. This suggests that in both S. Typhi and S. Typhimurium, approximately half of all non-synonymous SNPs are somewhat deleterious and removed rapidly from the population via purifying selection; however after this there is very little evidence of further selection. A recent study of S. enterica serovar Agona found a similar dN/dS rate of 0.67, and no evidence of adaptive selection within the S. Agona population [50].

Substitution rates in S. Typhimurium

We estimated a substitution rate of 6.7-12 × 10^-7 substitutions site^-1 year^-1 or 3–5 SNPs chromosome^-1 year^-1 among the S. Typhimurium 135@ isolates. We did not perform this analysis for the whole S. Typhimurium data set as there was no relationship between branch lengths and date of isolation across the rest of the tree. Our rate is faster than that estimated within two lineages of S. Typhimurium causing invasive typhoid-like disease in Africa (1.9 × 10^-7 and 3.9 × 10^-7 substitutions site^-1 year^-1 or 1–2 SNPs chromosome^-1 year^-1), which were calculated using methods similar to our whole-genome SNP analysis with BEAST [26]. This may be because our analysis reflects short term evolution (isolates collected over 4 year period, with estimated mrca in 1996 reflecting 12 years of evolution) in a small and spatiotemporally contained population (mostly within a single farm host population), whereas the African population analysis reflects longer term evolution (isolates collected over a 22 year period, mrca in 1960 reflecting 50 years of evolution) in a larger host population. The recent S. Agona population genomics analysis estimated a much lower substitution rate of 5.7 × 10^-8-1.3 × 10^-7 site^-1 year^-1, however this analysis included multiple different lineages and displayed substantial variation in substitution rates across the phylogeny, hence it is not directly comparable [50].

Phage and plasmid variation

The genomes of S. Typhimurium SL1344 and S. Typhimurium 135@ share a P4 prophage sequence, with different P2 prophages adjacent to it (Figure 4). A P2 prophage (Fels-1) exists at the same location in S. Typhimurium LT2, but no P4 phage is present. P4 is a defective phage that lacks its own genes for capsid, tail and lysis functions. Instead it utilises P2 as a helper phage to build structural components and package its DNA [51]. There is no evidence that physical proximity of the P2 and P4 prophages is important for this interaction, so the close physical link between P2 and P4 prophages at this locus in different S. Typhimurium chromosomes may be related to preferred integration sites rather than the functional interaction between phages. Sequenced S. Tyhimurium isolates DT135, DT12 and LT12 [26] have the same P4 prophage as SL1344 and S. Typhimurium 135@, but without accompanying P2 prophage. The P2 phage in SL1344 encodes SopE, a type three secreted effector protein associated with host cell invasion. A recent screen of SL1344 transposon mutants in mouse, chicken, calf and pig colonization models [52] showed that SopE mutants were attenuated in their ability to colonize the gut of chickens, calves and pigs (data available at http://www-tradis.vet.cam.ac.uk). No data on SopE mutants was obtained for the mouse infection model so the importance of SopE in mammalian infection or gut colonization is unclear. However the ability of the SopE-negative S. Typhimurium 135@ to circulate in the chicken farm for many years, and to cause several large outbreaks of gastroenteritis in humans, indicates that it is not critical for colonization of chickens or for virulence in humans. This may be due to the redundancy of secreted effectors in the S. Typhimurium genome - the SopE2 protein, conserved in S. Typhimurium 135@, is 70% identical at the amino acid level to the phage-encoded SopE protein. The P2-Hawk phage in S. Typhimurium 135@ carried two cargo genes, which are not found in homologous phages in other S. enterica. One of these genes has close homology to a protein associated with conferring resistance to specific phages in Lactococcus lactis[53]. It is intriguing to speculate whether these genes may also confer resistance to certain Salmonella phages, or even help the P2-Hawk phage to repress the activity the P4 phage and prevent it from hijacking the P2 machinery to assist with its own dissemination.

The SNPs identified within the S. Typhimurium virulence plasmid pSLT were entirely compatible with the chromosomal SNP phylogeny, indicating no evidence for transfer of the plasmid between members of the host bacterial population. The difference between and farm clone and sporadic isolates in the repeat copy numbers for the STTR10 VNTR, located in the pSLT plasmid, may be an indication that this VNTR is relatively stable and further supports that the pSLT plasmid is stable within the clone. The detection of two independent acquisitions of IncI1 plasmids, pSTM2 and pSTM7, in the S. Typhimurium 135@ population is interesting. According to the IncI1 plasmid MLST scheme, pSTM2 and pSTM7 belong to clonal complexes 7 and 3, respectively. Both are associated with S. enterica and E. coli isolated from humans, poultry and occasionally other food animals (see e.g. [54]), and frequently carry beta-lactamase CTX-M genes, encoding resistance to third generation cephalosporins (http://pubmlst.org/plasmid/, accessed June 2013 [55]). The novel plasmids pSTM2 and pSTM7, which lack CTX-M genes, may be useful comparators to investigate the emergence of CTX-M and other resistance genes in related IncI1 plasmids.

Conclusions

We have shown that S. Typhimurium 135@ is closely related to the well-studied laboratory strain SL1344, with very few differences in terms of SNPs or gene content. However we identified several minor phage differences which could account for the phenotypic differences in phage type, most notably the novel prophage P2-Hawk replacing the SopEϕ prophage of SL1344. We also identified two novel IncI1 plasmids in S. Typhimurium 135@, which belong to plasmid lineages that are associated with Enterobacteriaceae in poultry in many other parts of the world. By analysing the genomes of S. Typhimurium 135@ from a series of outbreaks, we obtained estimates of short-term mutation rates and population structure in this important foodborne pathogen, which will be useful in interpreting genomic data in future outbreak investigations.

Methods

Sequencing and assembly

Genomic DNA was extracted from fresh overnight subcultures of S. Typhimurium 135@ isolates using QIAamp DNA Mini Kit and QIAcube (Qiagen) and transferred to the Australian Genome Research Facility (AGRF) for multiplex sequencing on Illumina HiSeq (11 isolates multiplexed in one lane), generating 100 bp paired-end reads. STm5 was sequenced earlier in a single lane of HiSeq generating 36 bp paired-end reads. Illumina reads were assembled using SPAdes 2.4.0 [56], resulting in a median of 118 contigs per genome (range, 84–458 contigs), covering a median of 4.94 Mbp of sequence (range, 4.93 – 5.04 Mbp), with N50 of 21 kbp - 259 kbp and mean read depth 300× - 1000×. Read mapping to the available finished S. Typhimurium reference chromosome sequences (Table 2) using BWA0.7.5a [57] revealed the closest reference for all S. Typhimurium 135@ isolates was S. Typhimurium SL1344 (phage type DT44, accession NC_016810.1). Each set of contigs was ordered against the S. Typhimurium SL1344 chromosome and plasmid using MUMmer 3.23 [58] and ABACAS (version 1.3.1, http://abacas.sourceforge.net/) and annotated using NCBI’s PGAP (http://www.ncbi.nlm.nih.gov/genome/annotation_prok/). Prophage sequences were identified with the help of PHAST [59] and comparison to the finished genomes of S. Typhimurium SL1344 (NC_016810.1) and LT2 (NC_003197.1) was performed using ACT (version 12.0.0) [60].

Phylogenetic and evolutionary analysis

SNPs were identified by mapping reads to a reference sequence using BWA [57] and calling SNPs with SamTools 0.1.18 [61]. Raw SNP calls were filtered for quality (phred score ≥20), depth (≥10x) and homozygosity as in [62]. References used were: (i) chromosome – S. Typhimurium SL1344 (NC_016810.1), (ii) virulence plasmid - S. Typhimurium SL1344 plasmid pSLT (NC_017720.1), and (iii) IncI1 plasmid – S. Thompson plasmid pNF1358 (NC_019011.1). Other publicly available genome sequences were included in phylogenies by first simulating 1 million 100 bp paired end reads from the finished sequence, and mapping these to the reference sequence in the same manner as the Illumina reads. SNP calls located in repeat regions, insertion sequences or phage sequences were excluded from phylogenetic analysis (391 kbp or 8% of the SL1344 chromosome). Maximum likelihood phylogenetic trees were inferred using RAxML 7.2.8 [63] to analyse the concatenated alignment of SNP alleles (GTR + Γ model of nucleotide substitution, 10 replicate runs and 1,000 bootstraps). The neighbour-joining split network (Figure 1) was inferred from the same alignment using SplitsTree4 [64]. SNPs were mapped onto the trees using the baseml function of the PAML software package (version 4.7) [65]. dN/dS was approximated by dividing the ratio of non-synonymous SNPs to synonymous SNPs by the ratio of possible non-synonymous and synonymous mutations across all protein coding sequences in the SL1344 reference genome, which we calculated to be 3.2 using the codon usage function (cusp) in the EMBOSS package (version 6.6.0) [66].

Divergence dating

Isolation dates were converted to a continuous discrete variable representing days since 1900. Path-O-Gen v1.3 was used to analyse the association between this variable and branch lengths in the rooted maximum likelihood tree for S. Typhimurium 135@ (Figure 2). For BEAST analysis (v1.6), we used the same isolation date variable (expressed in days since 1900) and the same concatenated SNP alignment as used for RAxML analysis. We investigated 4 alternative models resulting from the combination of two demographic models (constant population size and Bayesian Skyline) and two molecular clock models (strict and relaxed lognormal). For each model, we ran 5 replicate runs for 100 million iterations, and combined the results after excluding the first 10 million iterations as burn-in. The relaxed clock models estimated an uncorrelated standard deviation of the mutation rate (ucld.stdev) abutting zero, providing evidence for a strict rather than relaxed molecular clock. The two demographic models gave nearly identical results (log Bayes Factor = 0.16) and the Bayesian Skyline plot indicated a constant population size. Hence the results reported in Table 3 are those from the strict clock, constant population size model only. Since our estimates are based on a SNP alignment with time expressed in days, the raw estimates were in units of substitutions per variable site per day. These were scaled to genome-wide units of substitutions per site per year by multiplying the estimates by constant k = n/N × 365 where n is the number of SNP sites in the alignment (1,871), N is the total positions considered for SNP calling (4,487,272); hence k = 0.1522.

Gene content analysis

The phage sequences of S. Typhimurium 135@, SL1344 and LT2 were identified with the help of PHAST [59] and the genome annotations for SL1344 and LT2. Pairwise comparisons were examined using ACT [60] and multiple genome comparisons were visualised using BRIG (version 0.95) [67]. To identify contigs belonging to the virulence plasmid pSLT (accession NC_017720), contigs which did not map to SL1344 with ABACAS/MUMmer were compared to pSLT using ABACAS/MUMmer and nucleotide BLAST. The comparison of non-chromosomal contigs to pSLT was visualised using ACT. Contigs not mapping to the chromosome or pSLT were identified in STm2 and STm7, these were then used as blastn queries to the NCBI nr database to investigate their origin. This showed that both the STm2 and STm7 contigs had close homology S. enterica IncI1 plasmid pNF1358 (accession NC_019099). We therefore named these contigs pSTM2 and pSTM7 respectively, and compared them to the pNF1358 sequence using ACT to identify regions of difference and confirm that they represent complete IncI1 circular replicons. These were then isolated using Prokka (version 1.5.2, http://vicbioinformatics.com/) and submitted to GenBank. Phylogenetic inference is described above, with core genes defined as genes that were present in all plasmid sequences. Plasmid MLST for all IncI1 plasmids was determined using SRST [68].

Antimicrobial susceptibility and multi-locus VNTR analysis (MLVA)

The sequenced S. Typhimurium 135@ isolates were tested for resistance to the following antimicrobials (MIC cut-off for resistance): ampicillin (>16 μg/ml), streptomycin (>8 μg/ml), tetracycline (>8 μg/ml), chloramphenicol (>16 μg/ml), sulphathiazole (>512 μg/ml), trimethoprim (>8 μg/ml), kanamycin (>32 μg/ml), nalidixic acid (>16 μg/ml), spectinomycin (>50 μg/ml), gentamicin (>8 μg/ml), ciprofloxacin (>0.06 μg/ml) and cefotaxime (>1 μg/ml). MLVA profiles for 1,930 S. Typhimurium 135@ were checked for similarity to the outbreak isolates by the Microbiological Diagnostic Unit Public Health Laboratory (MDU PHL), Victoria. MLVA profiles were generated using a multiplex assay targeting five VNTR loci [69]. The resulting profiles are expressed here in the form of repeat copy numbers for locus STTR9, STTR5, STTR6 and STTR10; and an allele code for STTR3, using the methods and nomenclature of [70]. Hence the profile 2-11-10-10-212 indicates 2 repeats at locus STTR9, 11 repeats at STTR5, 10 repeats at STTR6, 10 repeats at STTR10 and the 212 allele of locus STTR3 (corresponding to 524 bp at this locus).

Sequence data accessions

The annotated S. Typhimurium 135@ whole genome sequences were deposited as Whole Genome Shotgun projects at DDBJ/EMBL/GenBank and Illumina reads are available in the NCBI short read archive, accessions are given in Table 1. The plasmids pSTM2 and pSTM7 were deposited in GenBank under accessions KF290378 and KF290377, respectively.

References

Majowicz SE, Musto J, Scallan E, Angulo FJ, Kirk M, O'Brien SJ, Jones TF, Fazil A, Hoekstra RM: The global burden of nontyphoidal Salmonella gastroenteritis. Clin Infect Dis. 2010, 50 (6): 882-889. 10.1086/650733.
Article PubMed Google Scholar
Hendriksen RS, Vieira AR, Karlsmose S, Lo Fo Wong DM, Jensen AB, Wegener HC, Aarestrup FM: Global monitoring of Salmonella serovar distribution from the World Health Organization Global Foodborne Infections Network Country Data Bank: results of quality assured laboratories from 2001 to 2007. Foodborne Pathog Dis. 2011, 8 (8): 887-900. 10.1089/fpd.2010.0787.
Article PubMed Google Scholar
Harker KS, Lane C, Gormley FJ, Adak GK: National outbreaks of Salmonella infection in the UK, 2000–2011. Epidemiol Infect. in press
Sintchenko V, Wang Q, Howard P, Ha CW, Kardamanidis K, Musto J, Gilbert GL: Improving resolution of public health surveillance for human Salmonella enterica serovar Typhimurium infection: 3 years of prospective multiple-locus variable-number tandem-repeat analysis (MLVA). BMC Infect Dis. 2012, 12: 78-10.1186/1471-2334-12-78.
Article PubMed Central PubMed Google Scholar
OzFoodNet Working Group: Monitoring the incidence and causes of diseases potentially transmitted by food in Australia: annual report of the OzFoodNet Network, 2008. Commun Dis Intell. 2009, 33 (4): 389-413.
Google Scholar
Fearnley E, Raupach J, Lagala F, Cameron S: Salmonella in chicken meat, eggs and humans; Adelaide, South Australia, 2008. Int J Food Microbiol. 2011, 146 (3): 219-227. 10.1016/j.ijfoodmicro.2011.02.004.
Article PubMed Google Scholar
Hall R: Outbreak of gastroenteritis due to Salmonella typhimurium phage type I 35a following consumption of raw egg. Communicable diseases intelligence quarterly report. 2002, 26 (2): 285-287.
PubMed Google Scholar
Sarna M, Dowse G, Evans G, Guest C: An outbreak of Salmonella typhimurium PTI35 gastroenteritis associated with a minimally cooked dessert containing raw eggs. Communicable diseases intelligence quarterly report. 2002, 26 (1): 32-37.
PubMed Google Scholar
Tribe IG, Cowell D, Cameron P, Cameron S: An outbreak of Salmonella typhimurium phage type 135 infection linked to the consumption of raw shell eggs in an aged care facility. Communicable diseases intelligence quarterly report. 2002, 26 (1): 38-39.
PubMed Google Scholar
Department of Health, Victoria: Surveillance of Notifiable Infectious Diseases in Victoria 2008. 2009, Melbourne: Victorian Department of Human Services, 24-
Google Scholar
McCall BJ, Bell RJ, Neill AS, Micalizzi GR, Vakaci GR, Towner CD: An outbreak of Salmonella typhimurium phage type 135a in a child care centre. Communicable diseases intelligence quarterly report. 2003, 27 (2): 257-259.
PubMed Google Scholar
Stephens N, Coleman D, Shaw K: Recurring outbreaks of Salmonella typhimurium phage type 135 associated with the consumption of products containing raw egg in Tasmania. Commun Dis Intell. 2008, 32 (4): 466-468.
Google Scholar
Stephens N, Sault C, Firestone SM, Lightfoot D, Bell C: Large outbreaks of Salmonella Typhimurium phage type 135 infections associated with the consumption of products containing raw egg in Tasmania. Commun Dis Intell. 2007, 31 (1): 118-124.
Google Scholar
Department of Health, Victoria: Surveillance of Notifiable Infectious Diseases in Victoria 1998. 1999, Melbourne: Victorian Department of Human Services, 25-
Google Scholar
Department of Health, Victoria: Surveillance of Notifiable Infectious Diseases in Victoria 2002. 2003, Melbourne: Victorian Department of Human Services, 21-23.
Google Scholar
McPherson ME, Fielding JE, Telfer B, Stephens N, Combs BG, Rice BA, Fitzsimmons GJ, Gregory JE: A multi-jurisdiction outbreak of Salmonella Typhimurium phage type 135 associated with purchasing chicken meat from a supermarket chain. Communicable diseases intelligence quarterly report. 2006, 30 (4): 449-455.
PubMed Google Scholar
Hogg G, Dimovski K, Hiley L, Holt KE: Draft Genome Sequences for Ten Salmonella enterica Serovar Typhimurium Phage Type 135 Variants. Genome Announc. 2013, 1 (3): e00293-13-
Article PubMed Central PubMed Google Scholar
Jarvik T, Smillie C, Groisman EA, Ochman H: Short-term signatures of evolutionary change in the Salmonella enterica serovar typhimurium 14028 genome. J Bacteriol. 2010, 192 (2): 560-567. 10.1128/JB.01233-09.
Article PubMed Central CAS PubMed Google Scholar
Luo Y, Kong Q, Yang J, Golden G, Wanda SY, Jensen RV, Ernst PB, Curtiss R: Complete genome sequence of the universal killer Salmonella enterica Serovar Typhimurium UK-1 (ATCC 68169). J Bacteriol. 2011, 193 (15): 4035-4036. 10.1128/JB.05224-11.
Article PubMed Central CAS PubMed Google Scholar
Kingsley RA, Msefula CL, Thomson NR, Kariuki S, Holt KE, Gordon MA, Harris D, Clarke L, Whitehead S, Sangal V: Epidemic multiple drug resistant Salmonella Typhimurium causing invasive disease in sub-Saharan Africa have a distinct genotype. Genome Res. 2009, 19 (12): 2279-2287. 10.1101/gr.091017.109.
Article PubMed Central CAS PubMed Google Scholar
Patterson SK, Borewicz K, Johnson T, Xu W, Isaacson RE: Characterization and differential gene expression between two phenotypic phase variants in Salmonella enterica serovar Typhimurium. PLoS One. 2012, 7 (8): e43592-10.1371/journal.pone.0043592.
Article PubMed Central CAS PubMed Google Scholar
Richardson EJ, Limaye B, Inamdar H, Datta A, Manjari KS, Pullinger GD, Thomson NR, Joshi RR, Watson M, Stevens MP: Genome sequences of Salmonella enterica serovar typhimurium, Choleraesuis, Dublin, and Gallinarum strains of well- defined virulence in food-producing animals. J Bacteriol. 2011, 193 (12): 3162-3163. 10.1128/JB.00394-11.
Article PubMed Central CAS PubMed Google Scholar
McClelland M, Sanderson KE, Spieth J, Clifton SW, Latreille P, Courtney L, Porwollik S, Ali J, Dante M, Du F: Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature. 2001, 413 (6858): 852-856. 10.1038/35101614.
Article CAS PubMed Google Scholar
Izumiya H, Sekizuka T, Nakaya H, Taguchi M, Oguchi A, Ichikawa N, Nishiko R, Yamazaki S, Fujita N, Watanabe H: Whole-genome analysis of Salmonella enterica serovar Typhimurium T000240 reveals the acquisition of a genomic island involved in multidrug resistance via IS1 derivatives on the chromosome. Antimicrob Agents Chemother. 2011, 55 (2): 623-630. 10.1128/AAC.01215-10.
Article PubMed Central CAS PubMed Google Scholar
Li L, Cheng CK, Cheung MK, Law PT, Ling JM, Kam KM, Cheung WM, Kwan HS: Draft genome sequence of Salmonella enterica serovar Typhimurium ST1660/06, a multidrug-resistant clinical strain isolated from a diarrheic patient. J Bacteriol. 2012, 194 (22): 6319-6320. 10.1128/JB.01593-12.
Article PubMed Central CAS PubMed Google Scholar
Okoro CK, Kingsley RA, Connor TR, Harris SR, Parry CM, Al-Mashhadani MN, Kariuki S, Msefula CL, Gordon MA, de Pinna E: Intracontinental spread of human invasive Salmonella Typhimurium pathovariants in sub-Saharan Africa. Nat Genet. 2012, 44 (11): 1215-1221. 10.1038/ng.2423.
Article PubMed Central CAS PubMed Google Scholar
Rocha EP, Smith JM, Hurst LD, Holden MT, Cooper JE, Smith NH, Feil EJ: Comparisons of dN/dS are time dependent for closely related bacterial genomes. Journal of theoretical biology. 2006, 239 (2): 226-235. 10.1016/j.jtbi.2005.08.037.
Article CAS PubMed Google Scholar
Mankovich JA, Hsu CH, Konisky J: DNA and amino acid sequence analysis of structural and immunity genes of colicins Ia and Ib. J Bacteriol. 1986, 168 (1): 228-236.
PubMed Central CAS PubMed Google Scholar
Meynell E, Datta N: The relation of resistance transfer factors to the F-factor (sex-factor) of Escherichia coli K12. Genetical research. 1966, 7 (1): 134-140. 10.1017/S0016672300009538.
Article CAS PubMed Google Scholar
Oshima K, Toh H, Ogura Y, Sasamoto H, Morita H, Park SH, Ooka T, Iyoda S, Taylor TD, Hayashi T: Complete genome sequence and comparative analysis of the wild-type commensal Escherichia coli strain SE11 isolated from a healthy adult. DNA research: an international journal for rapid publication of reports on genes and genomes. 2008, 15 (6): 375-386. 10.1093/dnares/dsn026.
Article CAS Google Scholar
Takahashi H, Shao M, Furuya N, Komano T: The genome sequence of the incompatibility group Igamma plasmid R621a: evolution of IncI plasmids. Plasmid. 2011, 66 (2): 112-121. 10.1016/j.plasmid.2011.06.004.
Article CAS PubMed Google Scholar
Fricke WF, Mammel MK, McDermott PF, Tartera C, White DG, Leclerc JE, Ravel J, Cebula TA: Comparative genomics of 28 Salmonella enterica isolates: evidence for CRISPR-mediated adaptive sublineage evolution. J Bacteriol. 2011, 193 (14): 3556-3568. 10.1128/JB.00297-11.
Article PubMed Central CAS PubMed Google Scholar
Fricke WF, McDermott PF, Mammel MK, Zhao S, Johnson TJ, Rasko DA, Fedorka-Cray PJ, Pedroso A, Whichard JM, Leclerc JE: Antimicrobial resistance-conferring plasmids with similarity to virulence plasmids from avian pathogenic Escherichia coli strains in Salmonella enterica serovar Kentucky isolates from poultry. Appl Environ Microbiol. 2009, 75 (18): 5963-5971. 10.1128/AEM.00786-09.
Article PubMed Central CAS PubMed Google Scholar
Archer CT, Kim JF, Jeong H, Park JH, Vickers CE, Lee SY, Nielsen LK: The genome sequence of E. coli W (ATCC 9637): comparative genome analysis and an improved genome-scale reconstruction of E. coli. BMC Genomics. 2011, 12: 9-10.1186/1471-2164-12-9.
Article PubMed Central CAS PubMed Google Scholar
Turner PC, Yomano LP, Jarboe LR, York SW, Baggett CL, Moritz BE, Zentz EB, Shanmugam KT, Ingram LO: Optical mapping and sequencing of the Escherichia coli KO11 genome reveal extensive chromosomal rearrangements, and multiple tandem copies of the Zymomonas mobilis pdc and adhB genes. Journal of industrial microbiology & biotechnology. 2012, 39 (4): 629-639. 10.1007/s10295-011-1052-2.
Article CAS Google Scholar
Shepard SM, Danzeisen JL, Isaacson RE, Seemann T, Achtman M, Johnson TJ: Genome sequences and phylogenetic analysis of K88- and F18-positive porcine enterotoxigenic Escherichia coli. J Bacteriol. 2012, 194 (2): 395-405. 10.1128/JB.06225-11.
Article PubMed Central CAS PubMed Google Scholar
Ahmed SA, Awosika J, Baldwin C, Bishop-Lilly KA, Biswas B, Broomall S, Chain PS, Chertkov O, Chokoshvili O, Coyne S: Genomic comparison of Escherichia coli O104:H4 isolates from 2009 and 2011 reveals plasmid, and prophage heterogeneity, including shiga toxin encoding phage stx2. PLoS One. 2012, 7 (11): e48228-10.1371/journal.pone.0048228.
Article PubMed Central CAS PubMed Google Scholar
Woodford N, Carattoli A, Karisik E, Underwood A, Ellington MJ, Livermore DM: Complete nucleotide sequences of plasmids pEK204, pEK499, and pEK516, encoding CTX-M enzymes in three major Escherichia coli lineages from the United Kingdom, all belonging to the international O25:H4-ST131 clone. Antimicrob Agents Chemother. 2009, 53 (10): 4472-4482. 10.1128/AAC.00688-09.
Article PubMed Central CAS PubMed Google Scholar
Crossman LC, Chaudhuri RR, Beatson SA, Wells TJ, Desvaux M, Cunningham AF, Petty NK, Mahon V, Brinkley C, Hobman JL: A commensal gone bad: complete genome sequence of the prototypical enterotoxigenic Escherichia coli strain H10407. J Bacteriol. 2010, 192 (21): 5822-5831. 10.1128/JB.00710-10.
Article PubMed Central CAS PubMed Google Scholar
Smet A, Van Nieuwerburgh F, Vandekerckhove TT, Martel A, Deforce D, Butaye P, Haesebrouck F: Complete nucleotide sequence of CTX-M-15-plasmids from clinical Escherichia coli isolates: insertional events of transposons and insertion sequences. PLoS One. 2010, 5 (6): e11202-10.1371/journal.pone.0011202.
Article PubMed Central PubMed Google Scholar
Holt KE, Thieu Nga TV, Thanh DP, Vinh H, Kim DW, Vu Tra MP, Campbell JI, Hoang NV, Vinh NT, Minh PV: Tracking the establishment of local endemic populations of an emergent enteric pathogen. Proc Natl Acad Sci U S A. 2013, 110 (43): 17522-17527. 10.1073/pnas.1308632110.
Article PubMed Central CAS PubMed Google Scholar
Johnson TJ, Shepard SM, Rivet B, Danzeisen JL, Carattoli A: Comparative genomics and phylogeny of the IncI1 plasmids: a common plasmid type among porcine enterotoxigenic Escherichia coli. Plasmid. 2011, 66 (3): 144-151. 10.1016/j.plasmid.2011.07.003.
Article CAS PubMed Google Scholar
Han J, Lynne AM, David DE, Tang H, Xu J, Nayak R, Kaldhone P, Logue CM, Foley SL: DNA sequence analysis of plasmids from multidrug resistant Salmonella enterica serotype Heidelberg isolates. PLoS One. 2012, 7 (12): e51160-10.1371/journal.pone.0051160.
Article PubMed Central CAS PubMed Google Scholar
Eberhart LJ, Deringer JR, Brayton KA, Sawant AA, Besser TE, Call DR: Characterization of a novel microcin that kills enterohemorrhagic Escherichia coli O157:H7 and O26. Appl Environ Microbiol. 2012, 78 (18): 6592-6599. 10.1128/AEM.01067-12.
Article PubMed Central CAS PubMed Google Scholar
Bleicher A, Schofl G, Rodicio Mdel R, Saluz HP: The plasmidome of a Salmonella enterica serovar Derby isolated from pork meat. Plasmid. 2013, 69 (3): 202-210. 10.1016/j.plasmid.2013.01.001.
Article CAS PubMed Google Scholar
Carattoli A, Tosini F, Giles WP, Rupp ME, Hinrichs SH, Angulo FJ, Barrett TJ, Fey PD: Characterization of plasmids carrying CMY-2 from expanded-spectrum cephalosporin-resistant Salmonella strains isolated in the United States between 1996 and 1998. Antimicrob Agents Chemother. 2002, 46 (5): 1269-1272. 10.1128/AAC.46.5.1269-1272.2002.
Article PubMed Central CAS PubMed Google Scholar
Allard MW, Luo Y, Strain E, Li C, Keys CE, Son I, Stones R, Musser SM, Brown EW: High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach. BMC Genomics. 2012, 13: 32-10.1186/1471-2164-13-32.
Article PubMed Central CAS PubMed Google Scholar
Grad YH, Lipsitch M, Feldgarden M, Arachchi HM, Cerqueira GC, Fitzgerald M, Godfrey P, Haas BJ, Murphy CI, Russ C: Genomic epidemiology of the Escherichia coli O104:H4 outbreaks in Europe, 2011. Proc Natl Acad Sci U S A. 2012, 109 (8): 3065-3070. 10.1073/pnas.1121491109.
Article PubMed Central CAS PubMed Google Scholar
Holt KE, Parkhill J, Mazzoni CJ, Roumagnac P, Weill FX, Goodhead I, Rance R, Baker S, Maskell DJ, Wain J: High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi. Nat Genet. 2008, 40 (8): 987-993. 10.1038/ng.195.
Article PubMed Central CAS PubMed Google Scholar
Zhou Z, McCann A, Litrup E, Murphy R, Cormican M, Fanning S, Brown D, Guttman DS, Brisse S, Achtman M: Neutral genomic microevolution of a recently emerged pathogen, Salmonella enterica serovar Agona. PLoS Genet. 2013, 9 (4): e1003471-10.1371/journal.pgen.1003471.
Article PubMed Central CAS PubMed Google Scholar
Lindqvist BH, Deho G, Calendar R: Mechanisms of genome propagation and helper exploitation by satellite phage P4. Microbiological reviews. 1993, 57 (3): 683-702.
PubMed Central CAS PubMed Google Scholar
Chaudhuri RR, Morgan E, Peters SE, Pleasance SJ, Hudson DL, Davies HM, Wang J, van Diemen PM, Buckley AM, Bowen AJ: Comprehensive assignment of roles for Salmonella typhimurium genes in intestinal colonization of food-producing animals. PLoS Genet. 2013, 9 (4): e1003456-10.1371/journal.pgen.1003456.
Article PubMed Central CAS PubMed Google Scholar
Deng YM, Harvey ML, Liu CQ, Dunn NW: A novel plasmid-encoded phage abortive infection system from Lactococcus lactis biovar. diacetylactis. FEMS Microbiol Lett. 1997, 146 (1): 149-154. 10.1111/j.1574-6968.1997.tb10185.x.
Article CAS PubMed Google Scholar
Leverstein-van Hall MA, Dierikx CM, Cohen Stuart J, Voets GM, van den Munckhof MP, van Essen-Zandbergen A, Platteel T, Fluit AC, van de Sande-Bruinsma N, Scharinga J: Dutch patients, retail chicken meat and poultry share the same ESBL genes, plasmids and strains. Clin Microbiol Infect. 2011, 17 (6): 873-880. 10.1111/j.1469-0691.2011.03497.x.
Article CAS PubMed Google Scholar
Garcia-Fernandez A, Chiaretto G, Bertini A, Villa L, Fortini D, Ricci A, Carattoli A: Multilocus sequence typing of IncI1 plasmids carrying extended-spectrum beta-lactamases in Escherichia coli and Salmonella of human and animal origin. The Journal of antimicrobial chemotherapy. 2008, 61 (6): 1229-1233. 10.1093/jac/dkn131.
Article CAS PubMed Google Scholar
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD: SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. Journal of computational biology: a journal of computational molecular cell biology. 2012, 19 (5): 455-477. 10.1089/cmb.2012.0021.
Article CAS Google Scholar
Li H, Durbin R: Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010, 26 (5): 589-595. 10.1093/bioinformatics/btp698.
Article PubMed Central PubMed Google Scholar
Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL: Versatile and open software for comparing large genomes. Genome Biol. 2004, 5 (2): R12-10.1186/gb-2004-5-2-r12.
Article PubMed Central PubMed Google Scholar
Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS: PHAST: a fast phage search tool. Nucleic Acids Res. 2011, 39: W347-W352. 10.1093/nar/gkr485. Web Server issue
Article PubMed Central CAS PubMed Google Scholar
Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J: ACT: the Artemis Comparison Tool. Bioinformatics. 2005, 21 (16): 3422-3423. 10.1093/bioinformatics/bti553.
Article CAS PubMed Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25 (16): 2078-2079. 10.1093/bioinformatics/btp352.
Article PubMed Central PubMed Google Scholar
Holt K, Baker S, Weill F, Holmes E, Kitchen A, Yu J, Sangal V, Brown D, Coia J, Kim D: Shigella sonnei genome sequencing and phylogenetic analysis indicate recent global dissemination from Europe. Nat Genet. 2012, 44 (9): 1056-1059. 10.1038/ng.2369.
Article PubMed Central CAS PubMed Google Scholar
Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006, 22 (21): 2688-2690. 10.1093/bioinformatics/btl446.
Article CAS PubMed Google Scholar
Kloepper TH, Huson DH: Drawing explicit phylogenetic networks and their integration into SplitsTree. BMC Evol Biol. 2008, 8: 22-10.1186/1471-2148-8-22.
Article PubMed Central PubMed Google Scholar
Yang Z: PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007, 24 (8): 1586-1591. 10.1093/molbev/msm088.
Article CAS PubMed Google Scholar
Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16 (6): 276-277. 10.1016/S0168-9525(00)02024-2.
Article CAS PubMed Google Scholar
Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA: BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011, 12: 402-10.1186/1471-2164-12-402.
Article PubMed Central CAS PubMed Google Scholar
Inouye M, Conway TC, Zobel J, Holt KE: Short read sequence typing (SRST): multi-locus sequence types from short reads. BMC Genomics. 2012, 13: 338-10.1186/1471-2164-13-338.
Article PubMed Central CAS PubMed Google Scholar
Lindstedt BA, Vardund T, Aas L, Kapperud G: Multiple-locus variable-number tandem-repeats analysis of Salmonella enterica subsp. enterica serovar Typhimurium using PCR multiplexing and multicolor capillary electrophoresis. J Microbiol Methods. 2004, 59 (2): 163-172. 10.1016/j.mimet.2004.06.014.
Article CAS PubMed Google Scholar
Larsson JT, Torpdahl M, Petersen RF, Sorensen G, Lindstedt BA, Nielsen EM: Development of a new nomenclature for Salmonella typhimurium multilocus variable number of tandem repeats analysis (MLVA). Euro surveillance: bulletin europeen sur les maladies transmissibles = European communicable disease bulletin. 2009, 14: 15-
Google Scholar

Download references

Acknowledgements

This work was supported by an Early Career Researcher grant from the University of Melbourne and a Victorian Life Sciences Computation Initiative (VLSCI) grant (#VR0082). KEH was supported by a fellowship from the NHMRC of Australia (#628930). MDU acknowledges the support of the Victorian Government. We thank Mary Valcanis and other MDU staff for their contributions, and the Australian Genome Research Facility for kindly sequencing the STm5 isolate.

Author information

Authors and Affiliations

Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Parkville, Victoria, 3010, Australia
Jane Hawkey, David J Edwards & Kathryn E Holt
Department of Agriculture and Food Systems, Melbourne School of Land and Environment, University of Melbourne, Parkville, Victoria, 3010, Australia
Jane Hawkey & Helen Billman-Jacobe
Microbiological Diagnostic Unit Public Health Laboratory, Department of Microbiology and Immunology, University of Melbourne, Parkville, Victoria, 3010, Australia
Karolina Dimovski & Geoff Hogg
Department of Health, Coopers Plains, Forensic and Scientific Services, Queensland, 4108, Australia
Lester Hiley

Authors

Jane Hawkey
View author publications
You can also search for this author in PubMed Google Scholar
David J Edwards
View author publications
You can also search for this author in PubMed Google Scholar
Karolina Dimovski
View author publications
You can also search for this author in PubMed Google Scholar
Lester Hiley
View author publications
You can also search for this author in PubMed Google Scholar
Helen Billman-Jacobe
View author publications
You can also search for this author in PubMed Google Scholar
Geoff Hogg
View author publications
You can also search for this author in PubMed Google Scholar
Kathryn E Holt
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kathryn E Holt.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

JH analysed data and helped to draft the manuscript. DJE performed mapping and SNP calling analysis. KD performed MLVA analysis and extracted DNA for sequencing. LH analysed the prophage regions. HBJ contributed to data interpretation and manuscript writing. GH suggested the cluster be studied, participated in the design and coordination of the study and provided data for analysis. KEH conceived the study, performed data analysis and drafted the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1: Table S1: Details of SNPs that define, or vary within, the S. Typhimurium 135@ group. (XLSX 56 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Hawkey, J., Edwards, D.J., Dimovski, K. et al. Evidence of microevolution of Salmonella Typhimurium during a series of egg-associated outbreaks linked to a single chicken farm. BMC Genomics 14, 800 (2013). https://doi.org/10.1186/1471-2164-14-800

Download citation

Received: 05 August 2013
Accepted: 14 November 2013
Published: 19 November 2013
DOI: https://doi.org/10.1186/1471-2164-14-800

Evidence of microevolution of Salmonella Typhimurium during a series of egg-associated outbreaks linked to a single chicken farm

Abstract

Background

Results

Conclusions

Background

Results

Sequencing and phylogenetic analysis

Dissecting the series of outbreaks

Dating the emergence of S. Typhimurium 135@ and the farm clone

Microevolution and natural selection within S. Typhimurium

Phage content in S. Typhimurium 135@

Plasmid content in S. Typhimurium 135@

Discussion

Implications for understanding whole genome sequencing (WGS) data in the context of outbreak investigation and source attribution

Purifying selection within S. Typhimurium

Substitution rates in S. Typhimurium

Phage and plasmid variation

Conclusions

Methods

Sequencing and assembly

Phylogenetic and evolutionary analysis

Divergence dating

Gene content analysis

Antimicrobial susceptibility and multi-locus VNTR analysis (MLVA)

Sequence data accessions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Electronic supplementary material

Additional file 1: Table S1: Details of SNPs that define, or vary within, the S. Typhimurium 135@ group. (XLSX 56 KB)

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us