Whole genome sequencing reveals the emergence of a Pseudomonas aeruginosa shared strain sub-lineage among patients treated within a single cystic fibrosis centre
BMC Genomics volume 19, Article number: 644 (2018)
Chronic lung infections caused by Pseudomonas aeruginosa are a significant cause of morbidity and mortality in people with cystic fibrosis (CF). Shared P. aeruginosa strains, that can be transmitted between patients, are of concern and in Australia the AUST-02 shared strain is predominant in individuals attending CF centres in Queensland and Western Australia. M3L7 is a multidrug resistant sub-type of AUST-02 that was recently identified in a Queensland CF centre and was shown to be associated with poorer clinical outcomes. The main aim of this study was to resolve the relationship of the emergent M3L7 sub-type within the AUST-02 group of strains using whole genome sequencing.
A whole genome core phylogeny of 63 isolates indicated that M3L7 is a monophyletic sub-lineage within the context of the broader AUST-02 group. Relatively short branch lengths connected all of the M3L7 isolates. A phylogeny based on nucleotide polymorphisms present across the genome showed that the chronological estimation of the most recent common ancestor was around 2001 (± 3 years). SNP differences between sequential non-hypermutator M3L7 isolates collected 3–4 years apart from five patients suggested both continuous infection of the same strain and cross-infection of some M3L7 variants between patients. The majority of polymorphisms that were characteristic of M3L7 (i.e. acquired after divergence from all other AUST-02 isolates sequenced) were found to produce non-synonymous mutations in virulence and antibiotic resistance genes.
M3L7 has recently diverged from a common ancestor, indicating descent from a single carrier at a CF treatment centre in Australia. Both adaptation to the lung and transmission of M3L7 between adults attending this centre may have contributed to its rapid dissemination. Further genomic investigations are required on multiple intra-sample isolates of this sub-type to decipher potential mechanisms which facilitates its epidemiological success.
Cystic fibrosis (CF) is the most common recessively lethal inherited disease in people of European ancestry. The majority of mortality and morbidity in people with CF is caused by chronic lung infection with Pseudomonas aeruginosa . During chronic infections P. aeruginosa adapts to the CF airway microenvironment, which promotes multiple phenotypic and genotypic changes including enhanced resistance to antibiotics, excessive exopolysaccharide production, auxotrophy, auxotrophic metabolism, hypermutability and the loss of motility [2,3,4,5,6,7,8]. This evolution strategy has been found to occur in parallel in different P. aeruginosa strains, suggesting that these pathoadaptive modifications are important in the transition from an opportunistic pathogen to a specialised pathogen of diseased human lungs .
Person-to-person transmission of airway-adapted P. aeruginosa strains has also been reported with the acquisition of some “shared strains” (also referred to as “epidemic” or “transmissible” strains) correlated with adverse clinical outcomes [9,10,11,12,13]. AUST-02 (Sequence Type [ST] 775) is a prevalent shared strain in CF centres around Australia and is the dominant strain in Queensland infecting approximately 40% of patients infected with P. aeruginosa [10, 14, 15]. A recent study further demonstrated that ongoing evolution within this clonally successful AUST-02 strain has led to the emergence and dissemination of a sub-type within a treatment centre (Brisbane, Queensland) which infected approximately 5% of patients . This sub-type could be distinguished from all other AUST-02 sub-types by a unique mexZ (mexZ-3, M3) and lasR (lasR-7, L7) genotype and therefore, was designated M3L7 . The M3L7 sub-type is of particular clinical importance given that it is associated with enhanced multidrug resistance, transmissibility, greater intravenous antibiotic and hospitalisation requirements, and a higher 3-year risk of death/lung transplantation, than other AUST-02 subtypes . Whole genome sequencing (WGS) of 11 longitudinal M3L7 isolates from a single patient revealed the within-host diversity of M3L7 during and after antibiotic treatment of an acute pulmonary exacerbation in 2014 , but the M3L7 sub-type has yet to be characterised at the population level.
The aim of this study was to use WGS to reconstruct the population structure and resolve the relationship of the M3L7 sub-type within the AUST-02 group of strains. The analyses revealed that the M3L7 sub-type has recently diverged from a common ancestor, suggesting descent from a single founder within a rapidly growing CF centre population. Genetic mutations exclusive to the M3L7 sub-type were identified and may have aided adaptation to the CF airway microenvironment prior to its dissemination.
Selection of bacterial isolates
One to three P. aeruginosa isolates were cultured from single sputum specimens (collected annually in 2007, 2008 and 2011) from adults with CF attending The Prince Charles Hospital as per the Australian Clonal P. aeruginosa in Cystic Fibrosis (ACPinCF) study protocol . In brief, sputum samples were prospectively collected and when P. aeruginosa was identified, three colonies representing different morphotypes from each specimen were randomly selected. Cross-sectional molecular surveillance for the M3L7 sub-type was conducted in P. aeruginosa isolates collected from study participants at two time-points (2007–2008 and 2011). The M3L7 sub-type was detected in 28/509 (5.5%) P. aeruginosa isolates collected from 13/170 (7.6%) patients in 2007–2008 and in 21/519 (4.0%) P. aeruginosa isolates from 11/173 (6.4%) patients in 2011 . Twenty-five of these M3L7 isolates (named AUS934 to AUS958) were selected from 19 patients (patients 24 to 42; Additional file 1: Table S1) for further WGS as described below.
Of the 13 patients identified with M3L7 in 2007–2008, six patients had no follow-up isolates available (five underwent lung transplantation and one moved interstate by 2011; Patients 24, 32, 33, 34, 35 and 36). Of the seven patients who had M3L7 infection in 2007–2008 and had samples collected in 2011, five remained infected with M3L7 in 2011 (Patients 28, 30, 31, 38 and 41), one (Patient 37) no longer had the M3L7 sub-type detected, while another (Patient 39) tested positive by the M3 allele-specific PCR, but the lasR sequence could not be determined because of suboptimal sequence quality (this isolate, AUS947, was subsequently confirmed as M3L7 by WGS in this study). A further five patients (Patients 25, 26, 27, 40 and 42) acquired M3L7 in 2011 (they were infected with other strains in 2007) and were identified as incident cases. Finally, one patient (Patient 29) infected with M3L7 in 2011 had no previous strain-typing data available .
One M3L7 isolate was randomly selected per patient at each time-point (2007–2008 and 2011) for WGS and comprised: i) 13 M3L7 isolates from 13 patients in 2007–2008; ii) six M3L7 isolates from six patients with persistent M3L7 infection in 2011; iii) five M3L7 isolates from five incident cases in 2011; and iv) one M3L7 isolate from a patient with no previous P. aeruginosa strain typing data collected in 2011. One further P. aeruginosa isolate (AUS970) from a patient (patient 43; Additional file 1: Table S1) in 2007 which contained the M3 mexZ allele and a non-L7 lasR genotype was included as an outgroup. These patients had chronic infection with P. aeruginosa prior to 2007 [18, 19].
Whole genome sequencing
Preparation of genomic DNA for WGS was undertaken using the UltraClean® Microbial DNA Isolation Kit as described previously . Library preparation (Truseq), qPCR (TapeStation, Agilent Genomics) and WGS using the Illumina HiSeq 2500 platform with 100 bp paired-end read chemistry were carried out by the Australian Genome Research Facility, Melbourne, Australia.
In order to reconstruct the M3L7 population structure, a further 37 AUST-02 genomes that were previously sequenced (as part of an ongoing AUST-02 population genetic diversity study (Hanks et al., unpublished)) and a further 11 M3L7 genomes that were previously sequenced (as part of a longitudinal study of patient 37 isolates ) were included in the analyses (Additional file 1: Table S1).
Genome mapping and assembly
Reads were taxonomically assigned with Kraken (v0.10.4) to check for contamination and trimmed using Nesoni clip (v0.128) to filter out adapter sequences and low-quality regions [20, 21]. Reads were mapped to the P. aeruginosa PAO1 reference genome (NC_002516) using SHRiMP as implemented in Nesoni (v0.128) [20, 22, 23]. The PAO1 genome was chosen as a reference due to its high quality, and expert-curated annotation . SNPs and small insertions or deletions (indels) shorter than the read length were called using Nesoni.
Genomes were assembled using Velvet (v1.2.10) and VelvetOptimiser (v2.2.5) [24, 25]. Assembled contigs were reordered against PAO1 using Mauve (v2.4.0) and annotated with Prokka (v1.10) [26, 27]. Gene annotations from PAO1 were used as the primary reference. M3L7 and M3L1 draft genome assemblies ranged in total size from 6.15 to 6.25 Mb (mean = 6.24 Mb) with N50 values between 226 Kb and 413 Kb (mean = 319Kb).
An alignment of 30,811 core SNPs obtained from mapping against PAO1 was used to reconstruct the phylogeny of the 63 AUST-02 genome sequences. RAxML (v8.1.15) was used to estimate the Maximum Likelihood tree with the rapid bootstrap analysis option (−f a) and GTRGAMMA model of nucleotide substitution with a correction for ascertainment bias (−m ASC_GTRGAMMA --asc-corr lewis) . A resolved phylogeny of all available M3L7 genome sequences (including the 25 generated in this study, AUS22 (Hanks et al., unpublished) and the 11 isolates from patient 37 in 2014 ) was constructed from a SNP matrix of 2573 SNPs with the same settings as above. Phylogenetic trees were viewed and explored using Dendroscope (v3.4.1) and FigTree (v1.4.2) [29, 30]. Minimum spanning trees using the same SNP matrix were generated using the goeBURST Full MST algorithm in Phyloviz using .
To determine the emergence of the M3L7 sub-type, Bayesian inference of the evolutionary rates was conducted using BEAST 1.8.2 . As input a set of 183 SNPs specific for M3L7 was used, excluding two hypermutator isolates (AUS937 and AUS938) and the AUS970 outgroup (non-M3L7 isolate). Regions of clustered SNPs, where at least three SNPs were found within 10 bp of each other, were also removed. Among the different combinations of the molecular clock model (strict and constant relaxed lognormal), substitution model (HKY, GTR) and population size change (coalescent constant and exponential growth) models, the preferred combination of parameters selected based on stepping stone sampling was strict molecular clock, HKY substitution model with four discrete gamma-distributed rate categories, and exponential population size change. Markov Chain Monte Carlo generations were run in triplicate for 50 million steps, sampling every 5000 steps, to ensure convergence and an ESS value > 200 for all parameters. Replicate runs were combined using LogCombiner with a 10% burn-in and maximum credibility trees reporting mean values were created using TreeAnnotator.
Comparative genomic analyses
Comparative genomic analyses were performed using Parsnp, Gingr, BRIG (BLAST Ring Image Generator), Roary (v3.4.2), ACT (Artemis Comparison Tool) and BLAST [33,34,35,36]. Large genomic differences were investigated using PHAST (PHAge Search Tool) and Roary [35, 37]. The effect of amino acid substitutions (functionally important or no change) were predicted in silico using PROVEAN (Protein Variation Effect Analyzer) . ABRicate was used to determine acquired resistance gene content by comparing against the ResFinder and CARD database . Multi locus sequence typing (MLST) was performed using the ariba tool and made use of the PubMLST database [40, 41].
Results and discussion
M3L7 is a distinct sub-lineage of the AUST-02 shared strain
On the basis of the whole-genome core phylogeny, the 63 AUST-02 isolates form two major discrete lineages (clades), M2 (n = 34) and M3 (n = 29), consistent with their possession of mexZ-2 (codon substitution, A38T) or mexZ-3 (codon substitution, T12N) alleles, respectively (Fig. 1 and Additional file 2: Figure S1). The M3L7 isolates (n = 26; including 25 isolates sequenced here and a previously sequenced AUST-02 genome, AUS22 [Patient 32; 2007]) form a monophyletic sub-lineage of AUST-02 within the M3 clade that has diverged from all other AUST-02 isolates sequenced to date (Fig. 1). Three isolates (AUS853 [Patient 45; 2007]; AUS854 [Patient 44; 2007]; AUS970 [Patient 43; 2007]) within the M3 clade form deep-branching relationships at the base of the lineage and do not harbour the lasR-7 allele (L7: 1 bp deletion, 438delG) that defines the M3L7 sub-type. Of the available AUST-02 sequences, isolate AUS970 [Patient 43; 2007], sequenced in this study, represented the AUST-02 genome that was most closely related to the M3L7 sub-lineage. In addition to the mexZ-3 allele (M3), AUS970 carried a wild-type lasR-1 (L1) allele (therefore named M3L1). All 63 isolates are ST-775 according to in silico MLST.
M3L7 diverged recently from other AUST-02 shared strains
The M3L7 sub-lineage expanded recently following a long period of divergence from the M2 sub-lineage according to the relatively short branch lengths connecting the M3L7 isolates and their relative distance from the root (Fig. 1). Using BEAST analysis (Fig. 2), we estimated that the most recent common ancestor (MRCA) of M3L7 emerged around 2001 (± 3 years). This is approximately 6 years prior to the first isolation of an M3L7 sub-type (2007) in people with CF in Brisbane (Additional file 3: Figure S2) . Of note, this time period also corresponds with a relatively high annual increase (approximately 10–15%) in the adult CF population at The Prince Charles Hospital (Additional file 3: Figure S2). This situation, combined with limited capacity to segregate all patients, particularly when admitted to an inpatient ward, may have contributed to shared-strain infections.
The phylogenetic relatedness of M3L7 isolates collected from multiple unrelated patients (including Patients 30, 31, 36, 37, 39, 40, 41) (Additional file 2; Figure S1), also supports the hypothesis of cross-infection. Examination of hospital admission data showed these patients had overlapping periods of hospitalisation to the same inpatient ward at a time when M3L7 was first detected in 2007 (Additional file 4: Table S2). As highlighted previously , molecular surveillance showed a striking absence of M3L7 amongst P. aeruginosa isolates collected from patients admitted to other wards during the same period. This is emphasised by the lack of concurrent methicillin-resistant Staphylococcus aureus or Burkholderia cepacia complex infection in those infected with M3L7, which is most likely due to patients with the former two infections receiving treatment in alternate inpatient wards. Baseline sputum samples of four patients with M3L7 infection, included in this study, had additional pathogens identified: Achromobacter xylosoxidans (n = 3); methicillin-susceptible Staphylococcus aureus (n = 2); Mycobacterium abscessus (n = 1). In the year prior to recruitment, a range of co-pathogens were also identified (Additional file 5: Table S3).
The admission data, combined with WGS data, raises the possibility of cross-infection of M3L7 within the inpatient setting. Taken together, these results support a single founder scenario in which a carrier of the M3L7 sub-type acted as a donor within a CF centre with subsequent rapid dissemination in the resident CF population. Alternatively, dissemination of the M3L7 sub-type may have arisen through exposure to common environmental sources; though this is unlikely as earlier surveys of both hospital and natural environmental sites have failed to reveal any common sources of shared CF strains [42,43,44].
Notably, both sub-lineages of AUST-02 are prevalent in Queensland, suggesting that their MRCA originated in this state; however, we cannot rule out independent introductions of M3L7 and M2 founders into the local CF population from outside the same geographical region (state).
Resolving the within-patient relationships of M3L7 provides evidence for both continuous infection and person-to-person transmission
To investigate genomic evidence for continuous infection within patients and person-to-person transmission, we examined the pattern of SNP differences between isolates in more detail. As described previously, the dataset included six pairs of M3L7 isolates (Additional file 2: Figure S1) collected from people with CF at two time-points in 2007 or 2008 and in 2011 .
A pair of M3L7 isolates (AUS937 and AUS938) from patient 38 had accumulated a much higher number of SNPs compared to other M3L7 isolates, as indicated by their relatively longer branches in the phylogeny (Additional file 2: Figure S1). Further analysis revealed that these two isolates had acquired independent non-synonymous mutations within the mutS gene (AUS937, L341P; AUS938, 1 bp deletion [1076delC]), encoding a DNA mismatch repair protein , and were predicted to be deleterious to protein function based on an in silico analysis . Mutations in mutS are associated with hypermutation, which frequently occurs during chronic P. aeruginosa infection of the CF airway [46, 47]. Notably, hypermutators were also identified and characterised in longitudinal isolates collected in 2014 from the person known as patient 37 in the present study . The 2007 M3L7 isolate (AUS951) is closest to AUS961 from the 2014 M3L7 isolates (Additional file 6: Figure S3), with 14 core SNP differences according to a minimal spanning tree of pairwise distances (Additional file 7: Figure S4). In contrast, the two 2014 hypermutator isolates from patient 37 that are closest to AUS951 (AUS965 and AUS966), differ from the 2007 isolate by 335 SNPs or 582 SNPs, respectively. This observation reveals the relative impact of hypermutation on a within-host population over a 7-year time span (Additional file 7: Figure S4), consistent with the previous report of a ~ 40-fold higher mutation rate amongst hypermutable P. aeruginosa in long-term CF infections .
The minimal spanning tree reveals a maximum of 74 core SNPs separating the most divergent, non-hypermutator isolates, (AUS941 [Patient 27; 2011] and AUS948 [Patient 40; 2011]) isolates (Additional file 7: Figure S4). The closest sequential within-patient isolates (AUS956 and AUS957 from patient 41) differed by only two core SNPs demonstrating a remarkably high degree of genome stability over a 4-year period (Additional file 7: Figure S4). Within-patient isolates from patients 41 (AUS956 and AUS957) and 28 (AUS935 and AUS936) also grouped together with high bootstrap support on the ML phylogeny, which is consistent with continuous M3L7 infection across the two sampling time-points (Additional file 6: Figure S3 and Additional file 7: Figure S4). Our analyses also revealed that a very close relationship existed between the early and late isolates from patients 30 (AUS944 and AUS945, 9 SNPs) and 31 (AUS953 and AUS954, 13 SNPs), which could also be due to continuous infection of the same strain (Additional file 6: Figure S4). Although these four isolates are grouped together with high bootstrap support (Additional file 6: Figure S3), pairwise SNP comparisons reveals only two core genome SNP differences between the 2007 isolates from patients 30 and 31 (AUS944 and AUS953, respectively) (Additional file 7: Figure S4). Notably, the hypothesis of cross-infection between these two patients (or from a common source) is supported by the finding that both patients were admitted to the same inpatient ward, at the same time, approximately 6 months prior to their first positive M3L7 culture (Additional file 4: Table S2).
Based on the Maximum-Likelihood phylogeny, the isolates from patient 39 do not cluster closely (Additional file 6: Figure S3). In fact, AUS947 [patient 39; 2011] is most closely related to AUS961 [patient 37: 2014] and is more closely related to AUS951 [patient 37; 2007] than the earlier 2007 isolate from patient 39 (AUS946) (Additional file 7: Figure S4). This indicates the possibility of multiple M3L7 cross-infection events as has been suggested for other shared strains , which may occur via the airborne route or during socialisation between patients [49, 50]. Pairwise SNP comparisons between M3L7 isolates also shows a possible transmission pathway with isolates AUS946 [patient 39; 2007] and AUS944 [patient 30, 2008] being the most likely source of cross-infections (Additional file 7: Figure S4).
AUS22 (an M3L7 isolate sequenced as part of a different study) and AUS943 were both isolated from patient 32 just 6 days apart in 2007. However, these two isolates were more closely related to isolates from other patients than to each other (Additional file 6: Figure S3). Given that only one isolate per sputum sample was analysed, it is unclear if this represents co-existence of multiple related sub-lineages within the chronically infected lungs as we and others have previously shown [5, 7, 17, 47, 48, 51] or if this is also suggestive of direct or indirect cross-infection between those patients. These possibilities are not mutually exclusive.
When analysed together with 11 previously published isolates isolated from a single individual (Patient 37 in this study) during an episode of acute pulmonary exacerbation over 3 months , M3L7 isolates from patient 36 (AUS948; 2007) and patient 40 (AUS949; 2011) were found to cluster with the the non-hypermutator isolates from Patient 37 (Additional file 6: Figure S3 and Additional file 7: Figure S4). Interestingly, both hypermutator isolates in patient 38 are more closely related to AUS951 [patient 37; 2007] than to each other, suggesting transmission via a route common to patients 36, 37 and 40, followed by the independent acquisition of mutS mutations. These findings highlight that short-term within-patient diversity of shared strains during chronic infection needs to be considered in light of the P. aeruginosa lung diversity of the local CF population as a whole.
The resolution of WGS data of multiple isolates from individual patients within samples and longitudinally, combined with social interaction data will enable future studies to fully characterise long-term within-patient M3L7 diversity and evolution and distinguish between continuous infection or recent acquisition of M3L7 variants amongst the CF population .
M3L7 is characterised by an accumulation of non-synonymous mutations in critical pathways
Clonal lineages are expected to accumulate mutations that enable adaptation of the bacterium to a specific environmental niche of a human host . A total of 44 shared SNPs and nine shared indels were acquired after divergence of the M3L7 sub-lineage from all other AUST-02 isolates of the M2 and M3 clades. Thirty-five SNPs were non-synonymous (80%), resulting in a change of the amino acid sequence including two premature stop codons (Additional file 8: Table S4). Four indels produced in-frame mutations, whilst five indels caused a shift in the reading frame (Additional file 8: Table S4). The full list of SNPs and indels found across M3L7 isolates can be found in (Additional file 9: Table S5).
Genes containing non-synonymous SNPs and indels exclusive to the M3L7 sub-lineage (n = 43) were subsequently categorised according to PseudoCAP (P. aeruginosa community annotation project) functions (Fig. 3) . Fifteen genes were annotated with at least two functional categories and nine genes were part of regulatory networks (Fig. 3), including key global regulators (e.g. rpoN, mexT), which might impact multiple processes . Approximately 50% of the non-synonymous mutations occurred in genes that encode proteins associated with virulence (e.g. PilR involved in surface attachment and twitching motility [54, 55]; MigA involved in swarming ; ZnuA in zinc homeostasis ; PchD for iron acquisition ) or antibiotic resistance mechanisms (e.g. OprD: carbapenem resistance ; GyrB: fluoroquinolone resistance ; FtsI, Mpl: β-lactam resistance [61, 62]; PmrB, ColS: polymyxin resistance [63, 64]; MexA, MexS, MexT: multi-drug efflux pumps [65,66,67,68]). The mutations found in genes correlated with antibiotic resistance were also identified in M3L7 isolates collected in 2014  and might help explain the increased resistance to anti-pseudomonal antibiotics, including meropenem, ceftazidime, ticarcillin-clavulanate, aztreonam, ciprofloxacin and colistin, compared to other AUST-02 subtypes, as described previously . We did not find any acquired antibiotic resistance genes in the M3 clade isolates. Altogether this analysis suggests that the M3L7 sub-lineage is characterised by mutations in genes that might have aided adaptation to the CF airways prior to dissemination in the CF population [4, 8, 47, 69].
Genomic variation due to large deletions
The draft genomes that comprise the M3L7 sub-lineage ranged in size from 6.15 to 6.25 Mbp, whereas the M3L1 isolate (AUS970) genome was substantially smaller (6.05 Mbp). All genomes within the M3 clade had an average GC content of 66.5%. Genome reduction is a typical evolutionary process that occurs during adaptation ; therefore, large genomic variations (> 10 kb) were compared between the AUST-02 isolates.
Major differences in genome size were not due to horizontal gene transfer but attributed to several deletion events, summarised in Table 1. For example, one M3L7 isolate (AUS947; Patient 39; 2011) lost a large 93 Kbp region encoding a number of virulence genes including exoY (a T3SS secreted toxin), a three-part hydrogen cyanide biosynthesis operon, hcnABC and part of the cupA1-A5 chaperone usher fimbrae operon (Table 1). Deletions of varying sizes encompassing exoY and hcnABC (of the same region) were also missing in three other isolates within the M3 clade (AUS853; [Patient 45; 2007]; AUS854 [Patient 44; 2007]; AUS970 [Patient 43; 2007]) suggesting that there was a selective pressure to lose the functionality of genes encoded at this position.
A 40 Kbp putative prophage was present in five AUST-02 isolates from Perth (AUS15 [Patient 18; 2008]; AUS874 [Patient 22; 2008]; AUS17 [Patient 17; 2008]; AUS876 [Patient 21; 2008]; AUS877 [Patient 20; 2008]) and one isolate from Brisbane (AUS24; [Patient 16; 2007]) of the M2 clade. This prophage is present in nearly all M3L7 genomes (absent in AUS958 [Patient 29; 2011]) and a previous study also observed the presence of the prophage in 9/11 M3L7 isolates collected in 2014 . Therefore, it is most likely that this prophage was present in the last common ancestor of the M3L7 sub-lineage and was vertically inherited. The prophage is inserted immediately adjacent to the mutS gene and a search of the putative prophage sequence using the PhaST webtool predicts a complete prophage that is only 9% identical to the P. aeruginosa F10 phage . The prophage is flanked by an 18 bp att sequence (TCTCTCAGCACACGCC) that delineates the deletion in AUS958.
The persistence of the AUST-02 shared strain in the CF population in Australia has led to the emergence of a monophyletic sub-lineage (M3L7) that is distinct from the M2 sub-lineage of AUST-02. Our WGS analysis demonstrated that M3L7 strains are characterised by mutations in genes that are likely to affect antibiotic resistance and virulence phenotypes. The rapid dissemination of this clinically important sub-type is most likely due to a combination of adaptation to the CF airway microenvironment and transmission between people attending the same CF centre. This work provides a framework for future efforts in real-time genomic surveillance to monitor the transmission and pathogenicity of AUST-02 amongst the Australian CF population and to detect newly emergent shared strains. Further genomic investigations are required on multiple intra-sample isolates of this sub-type to decipher potential mechanisms which facilitates its epidemiological success.
Australian shared strain/ epidemic strain/ transmissible strain
Sub-lineage of the AUST-02 defined by mexZ (mexZ-3, M3) and lasR (lasR-1, L1) alleles
Sub-lineage of the AUST-02 defined by mexZ (mexZ-3, M3) and lasR (lasR-7, L7) alleles
Most recent common ancestor
Polymerase chain reaction
Single nucleotide polymorphism
Whole genome sequencing
Cystic Fibrosis Trust. UK Cystic Fibrosis Registry 2014 Annual Report. 2015.
Cullen L, McClean S. Bacterial adaptation during chronic respiratory infections. Pathogens. 2015;4(1):66–89.
Jorth P, Staudinger BJ, Wu X, Hisert KB, Hayden H, Garudathri J, Harding CL, Radey MC, Rezayat A, Bautista G, et al. Regional isolation drives bacterial diversification within cystic fibrosis lungs. Cell Host Microbe. 2015;18(3):307–19.
Smith EE, Buckley DG, Wu Z, Saenphimmachak C, Hoffman LR, D'Argenio DA, Miller SI, Ramsey BW, Speert DP, Moskowitz SM, et al. Genetic adaptation by Pseudomonas aeruginosa to the airways of cystic fibrosis patients. Proc Natl Acad Sci U S A. 2006;103(22):8487–92.
Chung JC, Becq J, Fraser L, Schulz-Trieglaff O, Bond NJ, Foweraker J, Bruce KD, Smith GP, Welch M. Genomic variation among contemporary Pseudomonas aeruginosa isolates from chronically infected cystic fibrosis patients. J Bacteriol. 2012;194(18):4857–66.
Thomas SR, Ray A, Hodson ME, Pitt TL. Increased sputum amino acid concentrations and auxotrophy of Pseudomonas aeruginosa in severe cystic fibrosis lung disease. Thorax. 2000;55(9):795–7.
Mowat E, Paterson S, Fothergill JL, Wright EA, Ledson MJ, Walshaw MJ, Brockhurst MA, Winstanley C. Pseudomonas aeruginosa population diversity and turnover in cystic fibrosis chronic infections. Am J Respir Crit Care Med. 2011;183(12):1674–9.
Marvig RL, Sommer LM, Molin S, Johansen HK. Convergent evolution and adaptation of Pseudomonas aeruginosa within patients with cystic fibrosis. Nat Genet. 2015;47(1):57–64.
Al-Aloul M, Crawley J, Winstanley C, Hart CA, Ledson MJ, Walshaw MJ. Increased morbidity associated with chronic infection by an epidemic Pseudomonas aeruginosa strain in CF patients. Thorax. 2004;59(4):334–6.
Kidd TJ, Ramsay KA, Hu H, Marks GB, Wainwright CE, Bye PT, Elkins MR, Robinson PJ, Rose BR, Wilson JW, et al. Shared Pseudomonas aeruginosa genotypes are common in Australian cystic fibrosis centres. Eur Respir J. 2013;41(5):1091–100.
Jones AM, Dodd ME, Doherty CJ, Govan JR, Webb AK. Increased treatment requirements of patients with cystic fibrosis who harbour a highly transmissible strain of Pseudomonas aeruginosa. Thorax. 2002;57(11):924–5.
Aaron SD, Vandemheen KL, Ramotar K, Giesbrecht-Lewis T, Tullis E, Freitag A, Paterson N, Jackson M, Lougheed MD, Dowson C, et al. Infection with transmissible strains of Pseudomonas aeruginosa and clinical outcomes in adults with cystic fibrosis. JAMA. 2010;304(19):2145–53.
Fothergill JL, Walshaw MJ, Winstanley C. Transmissible strains of Pseudomonas aeruginosa in cystic fibrosis lung infections. Eur Respir J. 2012;40(1):227–38.
Kidd TJ, Soares Magalhaes RJ, Paynter S, Bell SC, Group ACI. The social network of cystic fibrosis Centre care and shared Pseudomonas aeruginosa strain infection: a cross-sectional analysis. Lancet Respir Med. 2015;3(8):640–50.
O'Carroll MR, Syrmis MW, Wainwright CE, Greer RM, Mitchell P, Coulter C, Sloots TP, Nissen MD, Bell SC. Clonal strains of Pseudomonas aeruginosa in paediatric and adult cystic fibrosis units. Eur Respir J. 2004;24(1):101–6.
Tai AS, Bell SC, Kidd TJ, Trembizki E, Buckley C, Ramsay KA, David M, Wainwright CE, Grimwood K, Whiley DM. Genotypic diversity within a single Pseudomonas aeruginosa strain commonly shared by Australian patients with cystic fibrosis. PLoS One. 2015;10(12):e0144022.
Sherrard LJ, Tai AS, Wee BA, Ramsay KA, Kidd TJ, Ben Zakour NL, Whiley DM, Beatson SA, Bell SC. Within-host whole genome analysis of an antibiotic resistant Pseudomonas aeruginosa strain sub-type in cystic fibrosis. PLoS One. 2017;12(3):e0172179.
Lee TW, Brownlee KG, Conway SP, Denton M, Littlewood JM. Evaluation of a new definition for chronic Pseudomonas aeruginosa infection in cystic fibrosis patients. J Cyst Fibros. 2003;2(1):29–34.
Ramsay KA, Sandhu H, Geake JB, Ballard E, O'Rourke P, Wainwright CE, Reid DW, Kidd TJ, Bell SC. The changing prevalence of pulmonary infection in adults with cystic fibrosis: a longitudinal analysis. J Cyst Fibros. 2017;16(1):70–7.
Harrison P. Nesoni. In., 0.128 EDN. 2015. https://github.com/Victorian-Bioinformatics-Consortium/nesoni. Accessed Sept 2015.
Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15(3):R46.
David M, Dzamba M, Lister D, Ilie L, Brudno M. SHRiMP2: sensitive yet practical SHort read mapping. Bioinformatics. 2011;27(7):1011–2.
Stover CK, Pham XQ, Erwin AL, Mizoguchi SD, Warrener P, Hickey MJ, Brinkman FS, Hufnagle WO, Kowalik DJ, Lagrou M, et al. Complete genome sequence of Pseudomonas aeruginosa PAO1, an opportunistic pathogen. Nature. 2000;406(6799):959–64.
Gladman S, Seemann T. VelvetOptimser. In., 2.2.5 EDN. 2012. https://github.com/tseemann/VelvetOptimiser. Accessed Sept 2015.
Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18(5):821–9.
Darling AE, Mau B, Perna NT. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One. 2010;5(6):e11147.
Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9.
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3.
Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, Rupp R. Dendroscope: an interactive viewer for large phylogenetic trees. BMC Bioinformatics. 2007;8:460.
Rambout A. FigTree. In., 1.4.2 EDN. 2014. http://tree.bio.ed.ac.uk/software/figtree. Accessed Sept 2015.
Francisco AP, Vaz C, Monteiro PT, Melo-Cristino J, Ramirez M, Carrico JA. PHYLOViZ: phylogenetic inference and data visualization for sequence based typing methods. BMC Bioinformatics. 2012;13:87.
Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29(8):1969–73.
Treangen TJ, Ondov BD, Koren S, Phillippy AM. The harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol. 2014;15(11):524.
Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA. BLAST ring image generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011;12:402.
Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MT, Fookes M, Falush D, Keane JA, Parkhill J. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31(22):3691–3.
Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J. ACT: the Artemis comparison tool. Bioinformatics. 2005;21(16):3422–3.
Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. PHAST: a fast phage search tool. Nucleic Acids Res. 2011;39(Web Server issue):W347–52.
Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS One. 2012;7(10):e46688.
Seemann T. ABRicate. In., 0.8 EDN. 2014. https://github.com/tseemann/abricate/. Accessed July 2018.
Hunt M, Mather AE, Sanchez-Buso L, Page AJ, Parkhill J, Keane JA, Harris SR. ARIBA: rapid antimicrobial resistance genotyping directly from sequencing reads. Microb Genom. 2017;3(10):e000131.
Jolley KA, Maiden MC. BIGSdb: scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics. 2010;11:595.
Armstrong DS, Nixon GM, Carzino R, Bigham A, Carlin JB, Robins-Browne RM, Grimwood K. Detection of a widespread clone of Pseudomonas aeruginosa in a pediatric cystic fibrosis clinic. Am J Respir Crit Care Med. 2002;166(7):983–7.
Jones AM, Govan JR, Doherty CJ, Dodd ME, Isalska BJ, Stanbridge TN, Webb AK. Spread of a multiresistant strain of Pseudomonas aeruginosa in an adult cystic fibrosis clinic. Lancet. 2001;358(9281):557–8.
Kidd TJ, Ritchie SR, Ramsay KA, Grimwood K, Bell SC, Rainey PB. Pseudomonas aeruginosa exhibits frequent recombination, but only a limited association between genotype and ecological setting. PLoS One. 2012;7(9):e44199.
Oliver A, Mena A. Bacterial hypermutation in cystic fibrosis, not only for antibiotic resistance. Clin Microbiol Infect. 2010;16(7):798–808.
Oliver A, Canton R, Campo P, Baquero F, Blazquez J. High frequency of hypermutable Pseudomonas aeruginosa in cystic fibrosis lung infection. Science. 2000;288(5469):1251–4.
Feliziani S, Marvig RL, Lujan AM, Moyano AJ, Di Rienzo JA, Krogh Johansen H, Molin S, Smania AM. Coexistence and within-host evolution of diversified lineages of hypermutable Pseudomonas aeruginosa in long-term cystic fibrosis infections. PLoS Genet. 2014;10(10):e1004651.
Williams D, Evans B, Haldenby S, Walshaw MJ, Brockhurst MA, Winstanley C, Paterson S. Divergent, coexisting Pseudomonas aeruginosa lineages in chronic cystic fibrosis lung infections. Am J Respir Crit Care Med. 2015;191(7):775–85.
Knibbs LD, Johnson GR, Kidd TJ, Cheney J, Grimwood K, Kattenbelt JA, O'Rourke PK, Ramsay KA, Sly PD, Wainwright CE, et al. Viability of Pseudomonas aeruginosa in cough aerosols generated by persons with cystic fibrosis. Thorax. 2014;69(8):740–5.
Ojeniyi B, Frederiksen B, Hoiby N. Pseudomonas aeruginosa cross-infection among patients with cystic fibrosis during a winter camp. Pediatr Pulmonol. 2000;29(3):177–81.
Diaz Caballero J, Clark ST, Coburn B, Zhang Y, Wang PW, Donaldson SL, Tullis DE, Yau YC, Waters VJ, Hwang DM, et al. Selective sweeps and parallel Pathoadaptation drive Pseudomonas aeruginosa evolution in the cystic fibrosis lung. MBio. 2015;6(5):e00981–15.
Winsor GL, Griffiths EJ, Lo R, Dhillon BK, Shay JA, Brinkman FS. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database. Nucleic Acids Res. 2016;44(D1):D646–53.
Winstanley C, O'Brien S, Brockhurst MA. Pseudomonas aeruginosa evolutionary adaptation and diversification in cystic fibrosis chronic lung infections. Trends Microbiol. 2016;24(5):327–37.
Hobbs M, Collie ES, Free PD, Livingston SP, Mattick JS. PilS and PilR, a two-component transcriptional regulatory system controlling expression of type 4 fimbriae in Pseudomonas aeruginosa. Mol Microbiol. 1993;7(5):669–82.
Kohler T, Curty LK, Barja F, van Delden C, Pechere JC. Swarming of Pseudomonas aeruginosa is dependent on cell-to-cell signaling and requires flagella and pili. J Bacteriol. 2000;182(21):5990–6.
Lindhout T, Lau PC, Brewer D, Lam JS. Truncation in the core oligosaccharide of lipopolysaccharide affects flagella-mediated motility in Pseudomonas aeruginosa PAO1 via modulation of cell surface attachment. Microbiology. 2009;155(Pt 10):3449–60.
D'Orazio M, Mastropasqua MC, Cerasi M, Pacello F, Consalvo A, Chirullo B, Mortensen B, Skaar EP, Ciavardelli D, Pasquali P, et al. The capability of Pseudomonas aeruginosa to recruit zinc under conditions of limited metal availability is affected by inactivation of the ZnuABC transporter. Metallomics. 2015;7(6):1023–35.
Serino L, Reimmann C, Visca P, Beyeler M, Chiesa VD, Haas D. Biosynthesis of pyochelin and dihydroaeruginoic acid requires the iron-regulated pchDCBA operon in Pseudomonas aeruginosa. J Bacteriol. 1997;179(1):248–57.
Sanbongi Y, Shimizu A, Suzuki T, Nagaso H, Ida T, Maebashi K, Gotoh N. Classification of OprD sequence and correlation with antimicrobial activity of carbapenem agents in Pseudomonas aeruginosa clinical isolates collected in Japan. Microbiol Immunol. 2009;53(7):361–7.
Bruchmann S, Dotsch A, Nouri B, Chaberny IF, Haussler S. Quantitative contributions of target alteration and decreased drug accumulation to Pseudomonas aeruginosa fluoroquinolone resistance. Antimicrob Agents Chemother. 2013;57(3):1361–8.
Tsutsumi Y, Tomita H, Tanimoto K. Identification of novel genes responsible for overexpression of ampC in Pseudomonas aeruginosa PAO1. Antimicrob Agents Chemother. 2013;57(12):5987–93.
Sainsbury S, Bird L, Rao V, Shepherd SM, Stuart DI, Hunter WN, Owens RJ, Ren J. Crystal structures of penicillin-binding protein 3 from Pseudomonas aeruginosa: comparison of native and antibiotic-bound forms. J Mol Biol. 2011;405(1):173–84.
Gutu AD, Sgambati N, Strasbourger P, Brannon MK, Jacobs MA, Haugen E, Kaul RK, Johansen HK, Hoiby N, Moskowitz SM. Polymyxin resistance of Pseudomonas aeruginosa phoQ mutants is dependent on additional two-component regulatory systems. Antimicrob Agents Chemother. 2013;57(5):2204–15.
Moskowitz SM, Ernst RK, Miller SI. PmrAB, a two-component regulatory system of Pseudomonas aeruginosa that modulates resistance to cationic antimicrobial peptides and addition of aminoarabinose to lipid a. J Bacteriol. 2004;186(2):575–9.
Poole K, Tetro K, Zhao Q, Neshat S, Heinrichs DE, Bianco N. Expression of the multidrug resistance operon mexA-mexB-oprM in Pseudomonas aeruginosa: mexR encodes a regulator of operon expression. Antimicrob Agents Chemother. 1996;40(9):2021–8.
Vettoretti L, Plesiat P, Muller C, El Garch F, Phan G, Attree I, Ducruix A, Llanes C. Efflux unbalance in Pseudomonas aeruginosa isolates from cystic fibrosis patients. Antimicrob Agents Chemother. 2009;53(5):1987–97.
Sobel ML, Neshat S, Poole K. Mutations in PA2491 (mexS) promote MexT-dependent mexEF-oprN expression and multidrug resistance in a clinical strain of Pseudomonas aeruginosa. J Bacteriol. 2005;187(4):1246–53.
Kohler T, Epp SF, Curty LK, Pechere JC. Characterization of MexT, the regulator of the MexE-MexF-OprN multidrug efflux system of Pseudomonas aeruginosa. J Bacteriol. 1999;181(20):6300–5.
Marvig RL, Johansen HK, Molin S, Jelsbak L. Genome analysis of a transmissible lineage of Pseudomonas aeruginosa reveals pathoadaptive mutations and distinct evolutionary paths of hypermutators. PLoS Genet. 2013;9(9):e1003741.
Rau MH, Marvig RL, Ehrlich GD, Molin S, Jelsbak L. Deletion and acquisition of genomic content during early stage adaptation of Pseudomonas aeruginosa to a human host environment. Environ Microbiol. 2012;14(8):2200–11.
We thank Ms. Rebecca Stockwell (Lung Bacteria Group, QIMR Berghofer Medical Research Institute, Brisbane, Australia) for extracting the genomic DNA for WGS.
This work was supported by project grant funding from the National Health and Medical Research Council (NHMRC: GNT455919), Australian Infectious Diseases Research Centre (QIMRB-UQ seed grant) and TPCH Foundation grant (MS2013–02). AST is the recipient of NHMRC medical and dental Postgraduate Scholarship, Australian Cystic Fibrosis Research Trust Postgraduate Scholarship and Airways Infections, Inflammation & Cystic Fibrosis Group Scholarship. LJS is the recipient of The Shelley Shephard Memorial Scholarship. KAR is the recipient of an Australian Postgraduate Award, PhD Scholarship. TJK is the recipient of an ERS–EU RESPIRE2 Marie Skłodowska-Curie Postdoctoral Research Fellowship (MC RESPIRE2 first round, grant number 4571–2013) and a NHMRC Early Career Fellowship (GNT1088448). ILL receives grant support from the New Zealand Health Research Council, Curekids, Cystic Fibrosis New Zealand and the New Zealand Lotteries Board (Health). SCB is the recipient of a Queensland Health, Health Research Fellowship and receives grant support from the NHMRC, CF Foundation Therapeutics (USA), TPCH Foundation and Children’s Health Foundation, Queensland. SAB is the recipient of a NHMRC Fellowship (GNT1090456). Funding bodies were not involved in the study design, analysis and interpretation of the results or in the writing of the manuscript.
Availability of data and materials
Genome sequence data generated in this study was deposited in the European Nucleotide Archive under study PRJEB14781 with accession identifiers ERS1249679 to ERS1249704. Other AUST-02 isolates are available as part of separate studies (PRJEB21755 and PRJEB14771).
Ethics approval and consent to participate
Ethics approval for this project was granted under HREC/07/QRCH/9 and HREC/13/QPCH/127 by The Prince Charles Hospital Human and Research Ethics Committee, Metro North Hospital and Health Service, Brisbane, Queensland, Australia and all participants provided written, informed consent.
Consent for publication
Not applicable (no individual level patient data contained within this manuscript).
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Details of Pseudomonas aeruginosa isolates and genome assembly metrics used in this study. (XLSX 14 kb)
Figure S1. Phylogeny of sequenced AUST-02 strains showing the M3L7 isolates in relation to other AUST-02 isolates within the major M2 and M3 clades. The Maximum-Likelihood phylogenetic tree was estimated from an alignment of 30,811 core genome SNPs using RAxML. > 70% Bootstrap support (*), 100% bootstrap support (**). Scale indicates branch length representing 10 nucleotide substitutions. M2, mexZ-2 allele (codon substitution, A38T); M3, mexZ-3 allele (codon substitution, T12N). Pairs of isolates (collected in 2007 or 2008 and 2011) are indicated by colour. Dotted lines indicate branches that have been shortened and are not to scale. Geographic location of CF treatment centre where each isolate was obtained is shown on the right of the tree. (PDF 107 kb)
Figure S2. Growth of the adult CF population at The Prince Charles Hospital between 2001 and 2015. (PDF 64 kb)
Table S2. Overlapping admissions to a hospital ward for patients and first detection of M3L7. (XLSX 10 kb)
Table S3. Details of co-pathogens in people with M3L7/M3L1 infection at time of isolation and in the previous 12 months. (XLSX 11 kb)
Figure S3. A Maximum-Likelihood phylogeny inferred from an alignment of 2573 SNPs showing detailed relationships within the M3L7 clade. The maximum likelihood tree was generated using RAxML with 1000 bootstrap replicates. Branches with * indicate bootstrap support of > 70%. Scale indicates branch length representing 5 nucleotide substitutions. Isolates (n = 26) sequenced as part of this study are highlighted in bold font. Isolates that were sequenced as part of other studies are italicised. Isolates originating from the same individual have the same label colour. The year of collection is indicated. The M3L1 (AUS970) outgroup was used to root the tree. M3, mexZ-3 allele (codon substitution, T12N); L1, lasR-1 allele (wild-type); L7, lasR-7 allele (1 bp deletion, 438delG). (PDF 95 kb)
Figure S4. Core SNP based minimum spanning tree depicting the most likely route of transmission. Two isolates (AUS944, AUS946) from this study were predicted to be closest to the source. Edge lengths are not to scale. Details of all SNPs found across M3L7 isolates can be found in Additional file 9: Table S5. Figure generated using the goeBURST Full MST algorithm implemented in Phyloviz . (PDF 63 kb)
Table S4. Position and details of non-synonymous SNPs (n = 35) and indels (n = 9) that characterise the M3L7 sub-lineage. (XLSX 13 kb)
Table S5. Table of variants (SNPs and indels) that vary across 37 M3L7 genomes and 1 M3L1 (AUS970) genome. Table generated with nesoni nway. Only positions with a consensus determined for all strains (−-require-all yes) are shown. Only substitution SNPs were used to build phylogeny. (XLSX 1466 kb)
About this article
Cite this article
Wee, B.A., Tai, A.S., Sherrard, L.J. et al. Whole genome sequencing reveals the emergence of a Pseudomonas aeruginosa shared strain sub-lineage among patients treated within a single cystic fibrosis centre. BMC Genomics 19, 644 (2018). https://doi.org/10.1186/s12864-018-5018-x
- Whole genome sequencing
- Cystic fibrosis
- Pseudomonas aeruginosa
- Chronic lung infections