- Research article
- Open Access
Intraclonal genome diversity of Pseudomonas aeruginosa clones CHA and TB
BMC Genomics volume 14, Article number: 416 (2013)
Adaptation of Pseudomonas aeruginosa to different living conditions is accompanied by microevolution resulting in genomic diversity between strains of the same clonal lineage. In order to detect the impact of colonized habitats on P. aeruginosa microevolution we determined the genomic diversity between the highly virulent cystic fibrosis (CF) isolate CHA and two temporally and geographically unrelated clonal variants. The outcome was compared with the intraclonal genome diversity between three more closely related isolates of another clonal complex.
The three clone CHA isolates differed in their core genome in several dozen strain specific nucleotide exchanges and small deletions from each other. Loss of function mutations and non-conservative amino acid replacements affected several habitat- and lifestyle-associated traits, for example, the key regulator GacS of the switch between acute and chronic disease phenotypes was disrupted in strain CHA. Intraclonal genome diversity manifested in an individual composition of the respective accessory genome whereby the highest number of accessory DNA elements was observed for isolate PT22 from a polluted aquatic habitat. Little intraclonal diversity was observed between three spatiotemporally related outbreak isolates of clone TB. Although phenotypically different, only a few individual SNPs and deletions were detected in the clone TB isolates. Their accessory genome mainly differed in prophage-like DNA elements taken up by one of the strains.
The higher geographical and temporal distance of the clone CHA isolates was associated with an increased intraclonal genome diversity compared to the more closely related clone TB isolates derived from a common source demonstrating the impact of habitat adaptation on the microevolution of P. aeruginosa. However, even short-term habitat differentiation can cause major phenotypic diversification driven by single genomic variation events and uptake of phage DNA.
Pseudomonas aeruginosa is a metabolically versatile gamma-proteobacterium that preferentially thrives in aquatic habitats and the rhizosphere . This opportunistic pathogen is the most dominant bacterium causing chronic airway infections in cystic fibrosis (CF)  and has become one of the most important causative agents of nosocomial infections, particularly in intensive care units .
The 5.2 – 7 Mbp P. aeruginosa genome is a mosaic of a conserved core and variable regions of genome plasticity (RGPs) . The core genome is characterized by a conserved synteny of genes . Clonal complexes differ from each other in clone-typical segments of core and accessory genome  and a nucleotide divergence in the core genome of 0.5 – 0.7% .
Intraclonal whole-genome variation in P. aeruginosa has mainly been studied in isolates from CF lungs that had been collected from the same patient longitudinally or at one time point [8–12]. The paired isolates from one patient typically differed due to a few dozens of single nucleotide substitutions (SNPs) and small insertions/deletions (indels) in the core genome, a few RGPs in the accessory genome and occasionally one large deletion or inversion. Close to 1,000 de novo SNPs and indels, however, were gained in hypermutable strains defective in DNA repair [10, 12].
Whereas genome microevolution of P. aeruginosa in the atypical habitat of the CF lungs has been investigated for several clones, only a single clone has so far been assessed in its genome diversity between strains of unrelated habitat and geographic origin . The two examined clone PA14 strains from California and Germany were found to be of the same genome size and differ from each other in 0.0035% of their nucleotide sequence.
Since these data alone do not allow any general conclusions, we wanted to explore the impact of habitat, history and geographic origin on intraclonal genome diversity of P. aeruginosa in more depth. For that purpose two complementary scenarios of habitat differentiation were chosen. The three selected clone CHA strains were isolated from freshwater or CF patients at geographically distant sites within a 15-year period and represent the distant clone strain set. Conversely, the three selected clone TB strains were isolated during a local outbreak and represent the closely related clone strain set. Clones CHA and TB were chosen because we wanted to include the highly pathogenic strains CHA  and TBCF10839  in the comparative genome analysis. CHA and TBCF10839 are the only known P. aeruginosa strains which can escape killing by leucocytes. TBCF10839 can persist and grow in leucocytes , whereas CHA kills leucocytes by type III secretion-dependent oncosis [17–19]. Genome sequencing was expected to provide an explanation why CHA and TBCF10839, but not the other two clone CHA and two TB strains could undermine the major antipseudomonal defence mechanism in humans.
Genome sequencing revealed higher nucleotide divergence and a more variable composition of the accessory genome amongst the less closely related clone CHA strains than amongst the more highly related clone TB strains. Strain-specific SNPs were preferentially detected in habitat-associated fitness loci. Conservation of small non-coding RNA loci followed clone-specific patterns with about 7% (clone TB) or 11% (clone CHA) not conserved. Clone-specific traits were also found for the accessory genomes of the analysed strains, but especially for clone CHA strains which were equipped with several strain-specific DNA elements, the majority of which appeared to be of phage origin. Phage-like DNA also differentiated the accessory genome of the clone TB wound isolate TB63741 from its relatives of CF-origin, indicating that uptake and integration of phage elements is a major driving force of intraclonal diversification of P. aeruginosa during adaptation to different habitats.
Origin of the P. aeruginosaclone CHA and clone TB strains
The clone CHA strains CHA, 491 and PT22 were isolated from sites in Grenoble, Hannover and Mülheim in 1990, 2005 and 1992, respectively. Strain PT22 was isolated from a river, whereas strains CHA and 491 are CF airway isolates. Strain CHA was recovered from a critically ill CF patient with advanced lung disease and chronic P. aeruginosa infection . Strain 491 was the first clone CHA isolate from respiratory secretions of a female CF patient with normal lung function . The strain was successfully eradicated from the patient’s airways by antipseudomonal chemotherapy and no further clone CHA strain has since been identified in the patient’s respiratory secretions. The three clone TB strains were isolated from a burn wound (strain TB63741) and two unrelated CF patients (strains TBCF10839 and TBCF121838 ) during a local outbreak at Hannover Medical School in summer 1983.
Shotgun genome sequencing
Fragment libraries of CHA, 491, PT22 and TB63741 were sequenced with the Illumina Genome Analyser II generating 36 bp reads as previously reported for strains TBCF10839 and TBCF121838 . Reads passing quality criteria  were mapped to the PAO1 genome sequence (; NCBI sequence NC_002516.2) in order to detect SNPs, indels and PAO1 loci absent in clones CHA and TB. Contigs representing the non-PAO1 loci of the accessory genome were de novo assembled from reads that could not be mapped to the PAO1 reference.
Comparison of the clone CHA genomes with the PAO1 genome
The P. aeruginosa core genome harbours a few loci that are subject to diversifying selection. Clone CHA is equipped with LPS serotype 06, pyoverdine type IIa, a type-a2 flagellin and a novel type I pilin variant.
The three clone CHA genomes shared 24548 nucleotide exchanges (Figure 1, Additional file 1) compared to the PAO1 reference sequence, which were evenly distributed in the genome (Figure 2). 503 of these lead to a non-conservative replacement of an amino acid as defined by a Dayhoff similarity index  of less than 5 (Additional file 2). Table 1 lists these amino acid changes in the 22 proteins whose function have been experimentally demonstrated in P. aeruginosa (annotation class I, ). Besides a few proteins involved in DNA replication or secondary metabolism, the remaining proteins are transcriptional regulators, members of two-component systems, virulence effectors or are directly or indirectly involved in secretion or biofilm formation. Non-conservative amino acid replacements were neither observed in any enzyme of the core or intermediary metabolism nor in any component of the basic transcriptional or translational apparatus. This comparison of the PAO1 and clone CHA genomes suggests that diversifying selection with impact on protein function has preferentially affected P. aeruginosa genes that encode elements for communication with the environment.
Nineteen small indels (< 4 bp) were identified in the coding region of the clone CHA genomes (Table 2), 14 of which were already known from other completely sequenced P. aeruginosa strains. The three frameshifts in the last codons of PA3124 and PA4161 or the stop codon of PA5282 are neutral sequence variations and the three in-frame indels in PA2091, PA2302, and PA3462 should modulate the function of the encoded gene products to only minor extent, but the majority of the other 13 out-of-frame indels are probably loss-of-function mutations.
Five of the 19 indels are as yet undescribed in the Pseudomonas Genome Database (August 2012). Two of these have no functional consequences as mentioned above (PA3124, PA5282) and one destroys the reading frame of a chemotaxis transducer (PA4915). The remaining two mutations are located in the first ORFs of RGP2 and RGP7, both of which are known to carry clone-specific accessory elements and to be hotspots of genome mobility . The frameshifts inactivate transposase/integrase genes and thus should fix these tRNA-associated genomic islands in the clone CHA genomes.
Gain and loss of start and stop codons
The loss of three start and stop codons each and the gain of eight premature stop codons were noted in all three analyzed clone CHA genomes (Table 3). Interestingly another premature stop codon was introduced into ORF PA0977 in all three strains at the same position but by divergent nucleotide exchanges, a transversion in two strains and a transition in the third strain, respectively. Two further nonsense mutations were exclusively identified in strain CHA (Table 3). The mutations affected transcriptional regulators, hypotheticals, glycolate oxidase and Glu-tRNA(Gln) amidotransferase operons. Thus basic bacterial functions of metabolism and translation are impaired or lost in P. aeruginosa clone CHA; i.e. glycolate utilization and the transamidation of misacylated Glu-tRNAGln to correctly charged Gln-tRNAGln.
SNPs shared by two clone CHA strains
Thirty one of 33 SNPs that were found in two, but not in the third CHA strain, are located in two regions of genomic mobility that are prone to horizontal gene transfer  suggesting that these SNPs differentiate variants of phage-related sequences. The only two SNPs sensu stricto were identified in intergenic sequences (see Additional file 3).
Strain specific SNPs
The frequency of SNPs shared by two of the three strains was extremely low, but several dozen unique SNPs were found in each of the individual strains indicating some distinct microevolution in the clonally distant strain set (Figure 3). For instance, 47 strain-specific SNPs were identified in the environmental isolate PT22 (Additional file 4). The 34 SNPs in coding regions target genes encoding enzymes, transporters, transcriptional regulators and hypotheticals.
The genome of the CF isolate 491 carries 60 strain specific SNPs (Additional file 4). The clade of strain 491 acquired non-synonymous SNPs in 31 ORFs including genes that should play a role during the colonization of CF airways. Serine-to-asparagine substitutions were present in the two-component response regulator AlgB which activates the transcription of the algD alginate biosynthesis operon  and the cytoskeleton ATPase MreB which is essential for the maintenance of cell shape, chromosome segregation and polar localization of proteins . The most drastic change was the substitution of arginine by tryptophan R771W in the usher protein CupC3 that is essential for the assembly of CupC1 fimbriae . With 8 of the 60 strain-specific nucleotide exchanges in ORF PA0728, this gene encoding a phage-like integrase was identified as a SNP hotspot in strain 491, and the unique SNPs were not evenly distributed over the whole genome (Figure 2).
Strain CHA carries most unique SNPs among the three sequenced isolates, i.e. 13 intergenic SNPs, 31 synonymous SNPs, 46 non-synonymous SNPs and two SNPs generating a stop codon (Additional file 4). The predicted amino acid sequence was changed in 37 proteins including seven enzymes, six transporters and 15 ones of unknown function. Moreover, the clinically highly virulent strain CHA had acquired missense mutations in seven genes that are key for pathogenicity and adaptation to a habitat such as the CF lungs, i.e. A5G MucA, A651P PelB, R101H ExsA, R156H Tse2, L116F WspA, D514Y PA4036, E721K CbrA. The latter three missense mutations affect the chemotaxis operon WspABCDEF and two sensor kinases of two-component systems. CbrA has been demonstrated to be a global regulator of metabolism, motility, virulence and antibiotic resistance [27–29]. Hence the E721K mutation in CbrA should be a pleiotropic modifier of the bacterial phenotype. Complementation experiments demonstrated that the change of an alanine by glycine in the N-terminus of anti-sigma factor MucA (A5G) leads to the mucoid phenotype, whereas the (by definition) non-conservative exchanges L382R in AlgB and Y50H in KinB of the alginate regulon  were not causative for mucoidy in strain CHA (data not shown). The unique ability of strain CHA among functionally characterized P. aeruginosa to induce oncosis of neutrophils and macrophages is critically dependent on its active type III secretion system . Whether the undescribed arginine-to-histidine substitution R101H in ExsA, the regulator of the type III secretion regulon, has an effect on the regulon’s activity, is unclear. The non-conservative exchanges in PelB and Tse2 are likely without any consequences for strain CHA. The proteins encoded by the pel operon are involved in the biosynthesis of the Pel exopolysaccharide and thus influence biofilm composition and antibiotic tolerance , while Tse2, a recently discovered substrate of type VI secretion system of P. aeruginosa, can inhibit the growth of competing bacterial cells. However, any impact of the mutations in PelB and Tse2 on biofilm stability or competitive fitness, respectively, is uncertain since both are not expressed in the CHA background (data not shown).
Comparable to strain 491, hotspots of strain-specific nucleotide exchanges could also be found in strain CHA, as ORFs PA0982 and PA0977, both located in a region known for genomic instability , had acquired nine and three SNPs, respectively.
Twelve, six and five strain-specific SNPs were identified in intergenic regions of strains CHA, PT22 and 491; one of which in each strain affected different sRNAs. Seven strain CHA - specific SNPs were found in the intergenic regions of PA0977-PA0978 (four SNPs) and PA0983-PA0984 (three SNPs) and thus located in the same region prone to genomic instability as 12 of the strains’ unique intragenic SNPs (in PA0977 and PA0982).
PAO1-DNA absent in clone CHA strains
The clone CHA genome lacks 117 PAO1 ORFs (2.1% of all ORFs) the majority of which encode pyocins, phage elements or functionally yet uncharacterized gene products (see Additional file 5). Twelve PAO1 ORFs only partially aligned with clone CHA sequence reads indicating that sequence variation is unusually high in these ORFs. All three clone CHA genomes also lack the small non-coding RNA gene (sRNA) phrD, that is part of a phage-like insertion in PAO1, and 39 of the 513 intergenic sRNA loci identified recently . Another 21 of these loci were only partially covered by sequence reads of the clone CHA strains (Additional file 6). Intraclonal differences were observed for two sRNA loci. The sRNA pant78 was absent in strain 491 only while pant106 was present in strains PT22 and 491 but absent in strain CHA. Both these pant-sRNAs are located in RGP-insertions in PAO1 (RGP5 or RGP7, respectively) and thus likely contributed to mobile DNA elements.
Strain-specific intragenic deletions of PAO1 coding sequence were observed for two ORFs in strain 491 and one ORF in strain PT22 (Table 4; Additional file 7 Figure S1). Strain CHA showed a 426 bp deletion and, due to that, lacks the last 146 nucleotides of the global regulator gacS (PA0928) and the first 278 nucleotides of the adjacent lactate dehydrogenase ldhA (PA0927). This two-gene spanning deletion generated a double mutant of key genes of lifestyle and metabolism of P. aeruginosa[35, 36].
The clone CHA accessory genome
Accessory DNA elements known from other P. aeruginosaclones
The clone CHA strains share several genomic islands with the transmissible Liverpool epidemic strain LESB58  (Table 5). CHA, PT22 and 491 harbour copies of LES-prophage 1, LESGI-2 and LESGI-4 of the LES strain and a copy of an RGP29-insertion in the completely sequenced strain PACS2. The three strains moreover share a few ORFs known from insertions in RGPs 6, 9, 27, 36 and 62 in other P. aeruginosa genomes  (Table 5), although none of these insertions is completely conserved in the clone CHA genomes. Otherwise interstrain diversity is pronounced among the three sequenced clone CHA strains. Each strain carries its specific set of accessory elements. Individual variants were identified for the partially covered RGP26 (Figure 4A) and RGP77 insertions in strain PA14 or PA7, respectively, and for the mobile PAGI-2/pKLC102-type genomic islands. The clone CHA strains also harbour different sets of phage phiCTX-like genes. Variants of this phage either containing or lacking the cytotoxin gene ctx have been described for P. aeruginosa, and apparently such different variants have been acquired by the clone CHA lineage, as the ctx gene is conserved in PT22 and 491, but not in strain CHA.
The environmental isolate PT22 is endowed with the largest accessory genome. It carries several ORFs of RGP42 and RGP63 and nearly identical copies of the genomic islands LESGI-3 of strain LESB58  and PAGI-2 of strain C  (Figure 4B, Table 5). Strain 491 harbours variants of PAGI-2 and LESGI-3 and phage sequences that are homologous to ORFs in LES-prophages 3 and 6, the latter of which also found in strain CHA.
Novel strain-specific genes
ORFs were designated as ‘novel genes’ if they had yet not been described in completely sequenced P. aeruginosa genomes deposited in databases by June 1st, 2012. The number of novel genes correlated with the genome size of the strain, i.e. least genes were identified in strain CHA and most genes were detected in strain PT22 (see Additional files 8, 9, 10).
The strain CHA genome incorporated a truncated variant of the Pseudomonas phage B3  and an aacC1 gene that confers resistance to aminoglycoside antibiotics. The aacC1 sequence contig probably originated from an enterobacterial integron that has the highest homology to the enterobacterial type I integron harboured by plasmid p1658/97 .
Annotation uncovered 114 strain-specific ORFs in the CF isolate 491 (see Additional file 10). Most ORFs to which a function could be ascribed encode enzymes of DNA metabolism or mobility or elements of conjugation and type IV secretion. The closest ortholog or homolog was identified for all ORFs in beta- or gamma-proteobacteria that have been classified in the pre-16S rDNA taxonomic era as ‘honorary pseudomonads’ because they share lifestyle, habitat and metabolic versatility with the ‘class I’ pseudomonads P. aeruginosa, P. putida, P. fluorescens and P. syringae. Twenty-five ORFs are shared with the metal-resistant Burkholderiales Herminiimonas arsenicoxydans. These genes are part of PAGI-2 like islands harboured by strain 491 (Figure 4B) and the beta-proteobacterium, but none of them is annotated as a metal-resistance contributor.
167 strain-specific ORFs were identified in the aquatic isolate PT22 (see Additional file 9). Like in strain 491, closest orthologs and homologs were detected exclusively among beta- and gamma-proteobacteria, but other genera, namely Acidovorax, Azoarcus, Cupriavidus, Ralstonia (26% of ORFs) and the true pseudomonads (47% of ORFs) were frequent among the closest relatives of PT22 ORFs. The function could be predicted for a larger proportion of ORFs than in the CF isolates, and a greater variety of functions could be addressed which is reflected by a much more diverse spectrum of functional categories/gene ontologies for the PT22-specific ORFs than for those specific for strains CHA or 491 (see Additional file 7 Figure S2). The strain-specific accessory genome of strain PT22 encodes enzymes of lipid and sulphur metabolism, the two-component system armRS, a heme lyase and a cytochrome C oxidase and multiple transporters including an efflux pump and a P-type ATPase for heavy metal ions (Additional file 9). Moreover a paralog of the P. aeruginosa gene mvaT was identified. MvaT belongs to the H-NS family of small DNA-binding proteins that are global regulators of gene expression . Five homologues have been identified in P. putida and two homologues mvaT and mvaU have been identified in the P. aeruginosa core genome . P. aeruginosa PT22 is thus the first known P. aeruginosa strain with three mvaT homologues.
Comparison of the clone TB genomes with the PAO1 genome
In contrast to the analysed clone CHA strains, little intraclonal genomic diversity was observed for the three clone TB strains that were sampled during a local outbreak at Hannover Medical School. As reported earlier, only five individual nucleotide exchanges and one deletion each in a pilus assembly gene could be detected in the two CF airways isolates TBCF10839 and TBCF121838 . Though many phenotypic differences were observed, also the accessory genome differed by only one 81 kb Ralstonia pickettii PAGI-2 like genomic island absent in the first but present in the latter isolate .
Sequencing of a third clone TB isolate, the wound isolate TB63741, revealed some more intraclonal diversity, but still less than observed for the three clone CHA strains. TB63741 lacked six nucleotide exchanges that were detected for both TB CF isolates, but carried 22 individual SNPs not seen in any of the two CF isolates (Figure 1, Additional file 11). TB63741 did not harbour any deletion in a pil gene, but it had acquired a 9-bp in-frame deletion in a two component sensor gene and two frame-shift mutations in a phage gene and in oprD (see Additional file 11). The porin OprD transports basic amino acids and peptides but it also takes up the antipseudomonal agent imipenem. Loss-of-function mutations in oprD as seen in the clinical isolate TB63741 are a common mechanism of imipenem resistance .
Similar to the clone CHA lineage, the conservation of described non-coding sRNA loci does not differ within the clone TB lineage apart from one exception. The sRNA phrD and 30 pant-sRNAs are absent in the three genomes, of another 10 pant-sRNA loci significant parts were lacking (see Additional file 6). The phage DNA-associated sRNA pant78, present in both CF-isolates but absent in TB63741 made up the only intraclonal difference regarding sRNAs in clone TB.
Comparison of the sRNA conservation in clonal lineages CHA and TB revealed clone-specific patterns. While phrD and 20 pant-sRNA loci from PAO1 were completely absent (and four more partially) in both lineages, clone CHA lacked 17 pant-sRNAs which were present in clone TB. Six pant-sRNAs, however, were absent in clone TB but fully conserved in clone CHA. For another 23 pant-sRNA loci conservation patterns were partially divergent in the two clonal lineages (see Additional file 6). According to that, varying spectra of small non-coding RNA genes in P. aeruginosa might contribute significantly to interclonal diversity but only to a small degree to diversity between clonal variants, if sRNA genes are parts of strain-specific acquisition of mobile DNA elements.
Clone TB is endowed with a large accessory genome including the genomic islands PAGI-1, PAGI-2, PAGI-5 and PAGI-6 . The wound isolate TB63741 lacks the 81 kb TBCF121838-specific R. pickettii genomic island and numerous phage-like ORFs of phage Pf1 and of genomic island LESGI-1 which were present in both CF isolates. Conversely, TB63741 has incorporated more than 300 kbp that are absent in the two CF strains. Virtually all this DNA is of phage origin including LES-prophage 2 and 3 sequence , of which 67.3 or 76.2%, respectively, of the DNA were found in TBCF63741 with nucleotide identities ranging from 80 to 100%. The closest homologues of accessory genome ORFs were found in other P. aeruginosa clones, other Pseudomonas taxa or in ‘honorary’ pseudomonads (see Additional file 12). The shuffling of phage DNA apparently was the major driving force of microevolution of clone TB during the outbreak.
Comparison of the sequenced clone CHA and clone TB genomes
This study compared the intraclonal genome diversity of P. aeruginosa isolates derived from common and divergent sources. Consistent with our expectation higher genomic variation was found among the clonal isolates with a more diverse spatiotemporal origin.
Sequence variation was low among the three clone TB strains that had been sampled in summer 1983 during a local outbreak. The two CF isolates belong to a small epidemic that tripled the prevalence of P. aeruginosa – positive patients at the CF clinic . Despite individual profiles of phenotype, strains TBCF10839 and TBCF121838 show only minute differences in their genome sequence . Strain TB63741 was isolated from a patient with severe burns who had been treated at the intensive care unit for burns from which clone TB had initially spread to surgical wards and later to the CF clinic. The ancestors of the TB63741 strain had incorporated numerous phages into the clone TB genome that were absent in the isolates from the CF lungs indicating that highly colonised burn wounds themselves and/or the associated hospital environment had tolerated or favoured the uptake of phages.
The three clone TB isolates had descended from a common source and the individual clades had diverged from each other by at most two years. In contrast, the three sequenced clone CHA isolates were sampled from spatially and temporarily distinct habitats. Correspondingly, the sequence of the core genome and the composition of the accessory genome were significantly more diverse among the three clone CHA than among the three clone TB strains. In particular, the numerous strain specific SNPs in absence of pairwise shared SNPs demonstrate the distinct microevolution of the clone CHA strains (Figure 3). Conversely, shared de novo mutations and comparably very few individual de novo mutations highlight the close relatedness of the two clone TB CF isolates.
The environmental isolate PT22 was endowed with the largest accessory genome of the investigated strains. PT22 was collected from the river Ruhr at a site with substantial anthropogenic pollution and contamination with industrial sewage (Wasserqualität der Ruhr 1992 ). Consistent with its source, the genomic islands of PT22 encoded genes for the detoxification of xenobiotics and the efflux of heavy metal ions. PT22 carried a copy of PAGI-2 which also exists in CF isolates and Cupriavidus metallidurans CH34 that had been sampled from an industrial site polluted with heavy metal ions [48, 49].
The CF airways isolates 491 and CHA were retrieved from patients with the extremes of the general state of health that are feasible with CF as the underlying predisposing condition: The clinically highly pathogenic strain CHA was isolated from a CF patient with end-stage lung disease, whereas strain 491 was recovered from an individual with normal anthropometry and excellent lung function. Strain 491 was eradicated by antipseudomonal chemotherapy and no clone CHA strain has yet been re-isolated from the patient’s respiratory secretions in the last seven years. 491 had gained numerous elements of genomic mobility that may confer some global fitness to the strain, but only a few amino acid substitutions in traits that may facilitate the colonization of CF airways. In other words, the microevolution of the 491 clade does not point to any pronounced selection of the 491 ancestry to accommodate itself to the CF lung habitat.
Conversely, the ancestors of the strain CHA isolate had selected numerous non-conservative amino acid substitutions in elements of chemotaxis, exopolysaccharide biosynthesis, motility and virulence. In addition, the genes gacS and ldhA were destroyed by a deletion. The lactate dehydrogenase LdhA has recently been demonstrated in strains PA14 and PAO1 to be indispensible for microcolony formation in biofilms . Hence deletion of the 3’ end of ldhA could alter biofilm formation although strain CHA displayed mucoid growth on agar plates (data not shown). The GacS/GacA two-component system controls the reciprocal expression of acute and chronic virulence determinants [34, 50]. The deletion of gacS should abrogate this control. Consistent with this interpretation, strain CHA strongly expresses the pathways for alginate biosynthesis, a hallmark of a chronic infection, and the virulence effectors and structural elements of type III secretion, a hallmark of an acute infection (mRNA microarray data from bacteria grown to stationary phase, data not shown). Deletions and point mutations in key determinants of virulence and the control thereof thus established a genetic repertoire in the strain CHA isolate that is distinct from 491 and PT22 and should translate into the observed high pathogenic potential in the predisposed human host. This microevolution towards virulence seems to be quite specific for the inhabited CF lungs because strain CHA was inconspicuous in standard P. aeruginosa worm and fly infection models . Strain CHA apparently acquired signatures of a host-specific pathogen, whereas the 491 and PT22 clades retained the balance between environmental organism and opportunistic pathogen.
The clone CHA and TB genomes share numerous prophages and genomic islands with the virulent and transmissible LES clone, which has caused substantial morbidity in the CF patient population in the UK . The relatedness of their genomes may explain why these clones are prone to nosocomial spread among predisposed human hosts and why virulent clades with uncommon pathogenicity traits have evolved in these clonal complexes. Subsequent evolvement of pathogenicity arising from such genomic predisposition proceeded differently then in the highly virulent examples TBCF10839 and CHA.
In the case of TBCF10839 only few sequence variations clearly differentiated its genome from that of the other two less virulent TB strains, mainly a loss-of-function mutation in TBCF10839 . While lacking of type IV pili on the surface and being impaired in twitching motility, TBCF10839 was metabolically more active , produced more outer membrane transporters and secreted more virulence effectors  than its clonal variants. Apparently the loss of PilQ induced a global response in the TB background that is far beyond pilus biogenesis. Any further mutations that are necessary to generate the unique ability of TBCF10839 to grow in neutrophils must have already existed in the clone TB lineage. Strain CHA, however, exhibits numerous strain-specific gain- or loss-of-function mutations in global regulators or key pathogenicity factors that should be involved in the specific virulence features of strain CHA like its capability to cause oncosis of neutrophils [17–19]. Evolvement of the specific pathogenicity traits likely occurred by a series of microevolution events in this case.
Intraclonal genome diversity in the two investigated strain triplets presented in a low number of strain-specific de novo mutations in the core genome and a variable composition of the accessory genome. Shared SNPs were mainly observed between the two most closely related clone TB isolates from the outbreak. The number of strain-differentiating single nucleotide substitutions ranged from 7 to 154 SNPs for the most and the least related strain pair of clone TB and CHA, respectively. Correspondingly the intraclonal sequence variation of the P. aeruginosa core genome was 200- to 3000-fold lower than the interclonal sequence variation of 0.3 – 0.5%. In contrast to the highly conserved core genome a strain-specific signature was noted for the repertoire of phage-related sequences and genomic islands in the distantly related clone CHA strain trio. Strains shared islands and prophages that have first been reported in the transmissible LES strain, but they were distinct in their PAGI-2/pKLC102-type islands that recruit their cargo from the extensive gene pool of the honorary pseudomonads. According to the annotation this cargo as well as the strain specific SNPs confer individual traits on the respective strains to cope with the demands of their habitat from which they were isolated.
P. aeruginosa strains 491, TBCF10839, TBCF121838 and TB63741 were isolated from patients seen at the Medizinische Hochschule Hannover. Strain PT22 was retrieved from the river Ruhr close to Mülheim. Strain CHA was isolated from a patient seen at the CF clinic in Grenoble. First subcultures were maintained in LB supplemented with 15% (w/v) glycerol at −80°C until use.
P. aeruginosa strains were genotyped by a custom-made microarray following the protocol published previously .
P. aeruginosa genomic DNA was prepared from cells grown in LB medium following a protocol optimized for Gram-negative bacteria .
Illumina genome analyser sequencing
After preparing genomic DNA libraries according to the manufacturer’s instructions, sequencing-by-synthesis was performed at GATC-Biotech (Constance, Germany) for each library with an Illumina Genome Analyser II generating 36 bp sequence reads. Illumina Genome Analyser Pipeline Version 0.2 software was applied to qualify reads passing default signal quality filters. Obviously incorrect reads with homooligomers > 13 bases in length (not present in the P. aeruginosa genome) or an ‘N’-base call in at least three positions were excluded from the analysis . All sequence data from this study have been submitted to the Sequence Read Archive (SRA) of the EBI (strain TB63741: study accession no. ERP001300; clone CHA strains CHA, PT22 and 491: study accession no. ERP001750).
Sequence and read alignment
36 bp reads data of the strains were individually mapped to the PAO1 reference genome (NC_002516.2) using the accurate alignment software Novoalign V2.07.00 (Novocraft Technologies, 2010). The command: novoalign –d Indexed_reference_genome –f Reads.fastq –o SAM > out.sam, was used during the mapping to create “sam” formatted alignment files. Two pools of data consisting of the PAO1 mapped and unmapped reads were then extracted directly from the three alignment files using a custom script. Unmapped reads representing non-PAO1 DNA and the mapped reads representing the PAO1 DNA were assigned to not-in-reference and in-reference read pools, respectively.
Sequence variation sites analysis
Clone CHA strains with genomic positions indicating single nucleotide variants relative to the PAO1 reference were extracted from the novoalign alignment files using SAMtools . The variant call format (vcf) output files generated by SAMtools were further filtered for low quality variants. Variants with minimum coverage of six reads with minimum base calling quality (Q) of 30 at the respective position, a minimum SNP-call quality (QUAL) of 160 (QUAL = −10 log10 (probability of wrong call) ) and with more than 67% of all quality reads calling the SNP were retained. These variants were then compared against each other to identify sets of strain specific SNPs through the use of an in-house SNP filter pipeline.
The SAMtools derived sequence variants output files were further searched for predictions of small indels. The top candidates (QUAL ≥ 160) were verified by manual inspection of the alignment. Predicted indels were removed that did not pass the following criteria: minimum coverage of more than five high quality reads (Q ≥30 at the candidate position) and more than 95% of reads flag the indel. Predicted indels and SNPs were subsequently annotated using SNPeff version 1.9.5  to identify their effect on coding DNA sequences.
The not-in-reference pools of sequence reads characterized as Clone CHA accessory genome were assembled to larger contigs with the de novo assembler Velvet version 1.0.12 . Commands used during the assembly process are as follows: velveth 63741_cov5_23 23 63741_reads.fas; velvetg 63741_cov5_23 -cov_cutoff 5.0 -max_coverage 300. The assembler parameters were set for a minimum read coverage of 5 and kmer size of 23 to construct reliable contigs. These criteria were set for the analysis as they were demonstrated to maximise the tradeoff between base pairs incorporated and average and maximum contig size after thorough empirical testing. Assembled contigs of strain triplets were aligned against one another by blastn (1e-5 E-value threshold) to search for similarity between the sequences. Contigs that lacked similarity with others were designated as strain-specific DNA. These candidates were further validated using alignments of the short read data sets from both other strains using Novoalign. Contigs covered by reads were not considered to be strain-specific.
Validated strain-specific contigs were aligned using blastx against the UniProt database  to identify sets of known (present in other P. aeruginosa) and novel (not present in other P. aeruginosa) genes in their accessory genomes.
Detection of horizontally transferred genomic elements in clone CHA
Assembled contigs of the three clone CHA strains were aligned against all known P. aeruginosa genomic islands and insertions in regions of genome plasticity using blastn (1e-10 E-value threshold). Alignment results for all the searches were then visualized by GenomeGraphs , an integrated genomic data visualization package for R (http://www.r-project.org) to help determine which of the known horizontally transferred genomic elements are completely/partially present in the three clone CHA strains.
Check for conservation of predicted sRNAs
Uncovered regions of the reference were extracted from the alignment results for the individual strains and checked for intersection with the 557 sRNA loci described for the PAO1 reference . Complete or partial absence (> 10% not conserved) was confirmed by visual inspection of alignment/coverage for these loci using the Integrative Genomics Viewer .
Selezska K, Kazmierczak M, Müsken M, Garbe J, Schobert M, Häussler S, Wiehlmann L, Rohde C, Sikorski J: Pseudomonas aeruginosa population structure revisited under environmental focus: impact of water quality and phage pressure. Environ Microbiol. 2012, 14: 1952-1967. 10.1111/j.1462-2920.2012.02719.x.
George AM, Jones PM, Middleton PG: Cystic fibrosis infections: treatment strategies and prospects. FEMS Microbiol Lett. 2009, 300: 153-164. 10.1111/j.1574-6968.2009.01704.x.
de Bentzmann S, Plésiat P: The Pseudomonas aeruginosa opportunistic pathogen and human infections. Environ Microbiol. 2011, 13: 1655-1665. 10.1111/j.1462-2920.2011.02469.x.
Mathee K, Narasimhan G, Valdes C, Qiu X, Matewish JM, Koehrsen M, Rokas A, Yandava CN, Engels R, Zeng E, Olavarietta R, Doud M, Smith RS, Montgomery P, White JR, Godfrey PA, Kodira C, Birren B, Galagan JE, Lory S: Dynamics of Pseudomonas aeruginosa genome evolution. Proc Natl Acad Sci USA. 2008, 105: 3100-3105. 10.1073/pnas.0711982105.
Tümmler B: Clonal variations in Pseudomonas aeruginosa. Pseudomonas. Volume 4. Edited by: Ramos JL, Levesque RC. 2006, New York: Springer, 36-68.
Wiehlmann L, Wagner G, Cramer N, Siebert B, Gudowius P, Morales G, Köhler T, van Delden C, Weinel C, Slickers P, Tümmler B: Population structure of Pseudomonas aeruginosa. Proc Natl Acad Sci USA. 2007, 104: 8101-8106. 10.1073/pnas.0609213104.
Spencer DH, Kas A, Smith EE, Raymond CK, Sims EH, Hastings M, Burns JL, Kaul R, Olson MV: Whole-genome sequence variation among multiple isolates of Pseudomonas aeruginosa. J Bacteriol. 2003, 185: 1316-1325. 10.1128/JB.185.4.1316-1325.2003.
Kresse AU, Dinesh SD, Larbig K, Römling U: Impact of large chromosomal inversions on the adaptation and evolution of Pseudomonas aeruginosa chronically colonizing cystic fibrosis lungs. Mol Microbiol. 2003, 47: 145-158.
Smith EE, Buckley DG, Wu Z, Saenphimmachak C, Hoffman LR, D’Argenio DA, Miller SI, Ramsey BW, Speert DP, Moskowitz SM, Burns JL, Kaul R, Olson MV: Genetic adaptation by Pseudomonas aeruginosa to the airways of cystic fibrosis patients. Proc Natl Acad Sci USA. 2006, 103: 8487-8492. 10.1073/pnas.0602138103.
Cramer N, Klockgether J, Wrasman K, Schmidt M, Davenport CF, Tümmler B: Microevolution of the major common Pseudomonas aeruginosa clones C and PA14 in cystic fibrosis lungs. Environ Microbiol. 2011, 13: 1690-1704. 10.1111/j.1462-2920.2011.02483.x.
Yang L, Jelsbak L, Marvig RL, Damkiær S, Workman CT, Rau MH, Hansen SK, Folkesson A, Johansen HK, Ciofu O, Høiby N, Sommer MO, Molin S: Evolutionary dynamics of bacteria in a human host environment. Proc Natl Acad Sci USA. 2011, 108: 7481-7486. 10.1073/pnas.1018249108.
Chung JC, Becq J, Fraser L, Schulz-Trieglaff O, Bond NJ, Foweraker J, Bruce KD, Smith GP, Welch M: Genomic variation among contemporary Pseudomonas aeruginosa isolates from chronically-infected cystic fibrosis patients. J Bacteriol. 2012, 194: 4857-4866. 10.1128/JB.01050-12.
Klockgether J, Cramer N, Wiehlmann L, Davenport CF, Tümmler B: Pseudomonas aeruginosa genomic structure and diversity. Front Microbiol. 2011, 2: 150-
Toussaint B, Delic-Attree I, Vignais PM: Pseudomonas aeruginosa contains an IHF-like protein that binds to the algD promoter. Biochem Biophys Res Commun. 1993, 196: 416-421. 10.1006/bbrc.1993.2265.
Tümmler B, Koopmann U, Grothues D, Weissbrodt H, Steinkamp G, von der Hardt H: Nosocomial acquisition of Pseudomonas aeruginosa by cystic fibrosis patients. J Clin Microbiol. 1991, 29: 1265-1267.
Klockgether J, Miethke N, Kubesch P, Bohn YS, Brockhausen I, Cramer N, Eberl L, Greipel J, Herrmann C, Herrmann S, Horatzek S, Lingner M, Luciano L, Salunkhe P, Schomburg D, Wehsling M, Wiehlmann L, Davenport CF, Tümmler B: Intraclonal diversity of the Pseudomonas aeruginosa cystic fibrosis airway isolates TBCF10839 and TBCF121838: distinct signatures of transcriptome, proteome, metabolome, adherence and pathogenicity despite an almost identical genome sequence. Environ Microbiol. 2013, 15: 191-210. 10.1111/j.1462-2920.2012.02842.x.
Dacheux D, Attree I, Schneider C, Toussaint B: Cell death of human polymorphonuclear neutrophils induced by a Pseudomonas aeruginosa cystic fibrosis isolate requires a functional type III secretion system. Infect Immun. 1999, 67: 6164-6167.
Dacheux D, Toussaint B, Richard M, Brochier G, Croize J, Attree I: Pseudomonas aeruginosa cystic fibrosis isolates induce rapid, type III secretion-dependent, but ExoU-independent, oncosis of macrophages and polymorphonuclear neutrophils. Infect Immun. 2000, 68: 2916-2924. 10.1128/IAI.68.5.2916-2924.2000.
Dacheux D, Goure J, Chabert J, Usson Y, Attree I: Pore-forming activity of type III system-secreted proteins leads to oncosis of Pseudomonas aeruginosa-infected macrophages. Mol Microbiol. 2001, 40: 76-85. 10.1046/j.1365-2958.2001.02368.x.
Wiehlmann L, Cramer N, Ulrich J, Hedtfeld S, Weissbrodt H, Tümmler B: Effective prevention of Pseudomonas aeruginosa cross-infection at a cystic fibrosis centre - results of a 10-year prospective study. Int J Med Microbiol. 2012, 302: 69-77. 10.1016/j.ijmm.2011.11.001.
Stover CK, Pham XQ, Erwin AL, Mizoguchi SD, Warrener P, Hickey MJ, Brinkman FS, Hufnagle WO, Kowalik DJ, Lagrou M, Garber RL, Goltry L, Tolentino E, Westbrock-Wadman S, Yuan Y, Brody LL, Coulter SN, Folger KR, Kas A, Larbig K, Lim R, Smith K, Spencer D, Wong GK, Wu Z, Paulsen IT, Reizer J, Saier MH, Hancock RE, Lory S, Olson MV: Complete genome sequence of Pseudomonas aeruginosa PAO1, an opportunistic pathogen. Nature. 2000, 406: 959-964. 10.1038/35023079.
Dayhoff MO: Atlas of Protein Sequence and Structure, 5, Suppl. 3. Observed frequencies of amino acid replacements between closely related proteins. 1978, Washington: National Biomedical Research Foundation
Winsor GL, Lam DK, Fleming L, Lo R, Whiteside MD, Yu NY, Hancock RE, Brinkman FS: Pseudomonas Genome Database: improved comparative analysis and population genomics capability for Pseudomonas genomes. Nucleic Acids Res. 2011, 39: D596-600. 10.1093/nar/gkq869.
Leech AJ, Sprinkle A, Wood L, Wozniak DJ, Ohman DE: The NtrC family regulator AlgB, which controls alginate biosynthesis in mucoid Pseudomonas aeruginosa, binds directly to the algD promoter. J Bacteriol. 2008, 190: 581-589. 10.1128/JB.01307-07.
Cowles KN, Gitai Z: Surface association and the MreB cytoskeleton regulate pilus production, localization and function in Pseudomonas aeruginosa. Mol Microbiol. 2010, 76: 1411-1426. 10.1111/j.1365-2958.2010.07132.x.
Ruer S, Stender S, Filloux A, de Bentzmann S: Assembly of fimbrial structures in Pseudomonas aeruginosa: functionality and specificity of chaperone-usher machineries. J Bacteriol. 2007, 189: 3547-3555. 10.1128/JB.00093-07.
Nishijyo T, Haas D, Itoh Y: The CbrA-CbrB two-component regulatory system controls the utilization of multiple carbon and nitrogen sources in Pseudomonas aeruginosa. Mol Microbiol. 2001, 40: 917-931. 10.1046/j.1365-2958.2001.02435.x.
Li W, Lu CD: Regulation of carbon and nitrogen utilization by CbrAB and NtrBC two-component systems in Pseudomonas aeruginosa. J Bacteriol. 2007, 189: 5413-5420. 10.1128/JB.00432-07.
Yeung AT, Bains M, Hancock RE: The sensor kinase CbrA is a global regulator that modulates metabolism, virulence, and antibiotic resistance in Pseudomonas aeruginosa. J Bacteriol. 2011, 193: 918-931. 10.1128/JB.00911-10.
Damron FH, Goldberg JB: Proteolytic regulation of alginate overproduction in Pseudomonas aeruginosa. Mol Microbiol. 2012, 84: 595-607. 10.1111/j.1365-2958.2012.08049.x.
Mann EE, Wozniak DJ: Pseudomonas biofilm matrix composition and niche biology. FEMS Microbiol Rev. 2012, 36: 893-916. 10.1111/j.1574-6976.2011.00322.x.
Hood RD, Singh P, Hsu F, Guvener T, Carl MA, Trinidad RR, Silverman JM, Ohlson BB, Hicks KG, Plemel RL, Li M, Schwarz S, Wang WY, Merz AJ, Goodlett DR, Mougous JD: A type VI secretion system of Pseudomonas aeruginosa targets a toxin to bacteria. Cell Host Microbe. 2010, 7: 25-37. 10.1016/j.chom.2009.12.007.
Klockgether J, Reva O, Larbig K, Tümmler B: Sequence analysis of the mobile genome island pKLC102 of Pseudomonas aeruginosa C. J Bacteriol. 2004, 186: 518-534. 10.1128/JB.186.2.518-534.2004.
Gómez-Lozano M, Marvig RL, Molin S, Long KS: Genome-wide identification of novel small RNAs in Pseudomonas aeruginosa. Environ Microbiol. 2012, 14: 2006-2016. 10.1111/j.1462-2920.2012.02759.x.
Petrova OE, Schurr JR, Schurr MJ, Sauer K: Microcolony formation by the opportunistic pathogen Pseudomonas aeruginosa requires pyruvate and pyruvate fermentation. Mol Microbiol. 2012, 86: 819-835.
Goodman AL, Merighi M, Hyodo M, Ventre I, Filloux A, Lory S: Direct interaction between sensor kinase proteins mediates acute and chronic disease phenotypes in a bacterial pathogen. Genes Dev. 2009, 23: 249-259. 10.1101/gad.1739009.
Winstanley C, Langille MG, Fothergill JL, Kukavica-Ibrulj I, Paradis-Bleau C, Sanschagrin F, Thomson NR, Winsor GL, Quail MA, Lennard N, Bignell A, Clarke L, Seeger K, Saunders D, Harris D, Parkhill J, Hancock RE, Brinkman FS, Levesque RC: Newly introduced genomic prophage islands are critical determinants of in vivo competitiveness in the Liverpool Epidemic Strain of Pseudomonas aeruginosa. Genome Res. 2009, 19: 12-23.
Nakayama K, Kanaya S, Ohnishi M, Terawaki Y, Hayashi T: The complete nucleotide sequence of phi CTX, a cytotoxin-converting phage of Pseudomonas aeruginosa: implications for phage evolution and horizontal gene transfer via bacteriophages. Mol Microbiol. 1999, 31: 399-419. 10.1046/j.1365-2958.1999.01158.x.
Larbig KD, Christmann A, Johann A, Klockgether J, Hartsch T, Merkl R, Wiehlmann L, Fritz HJ, Tümmler B: Gene islands integrated into tRNA(Gly) genes confer genome diversity on a Pseudomonas aeruginosa clone. J Bacteriol. 2002, 184: 6665-6680. 10.1128/JB.184.23.6665-6680.2002.
Braid MD, Silhavy JL, Kitts CL, Cano RJ, Howe MM: Complete genomic sequence of bacteriophage B3, a Mu-like phage of Pseudomonas aeruginosa. J Bacteriol. 2004, 186: 6560-6574. 10.1128/JB.186.19.6560-6574.2004.
Zienkiewicz M, Kern-Zdanowicz I, Golebiewski M, Zylinska J, Mieczkowski P, Gniadkowski M, Bardowski J, Ceglowski P: Mosaic structure of p1658/97, a 125-kilobase plasmid harboring an active amplicon with the extended-spectrum beta-lactamase gene bla(SHV-5). Antimicrob Agents Chemother. 2007, 51: 1164-1171. 10.1128/AAC.00772-06.
Palleroni NJ: Prokaryote taxonomy of the 20th century and the impact of studies on the genus Pseudomonas: a personal view. Microbiology. 2003, 149: 1-7. 10.1099/mic.0.25952-0.
Muller D, Simeonova DD, Riegel P, Mangenot S, Koechler S, Lièvremont D, Bertin PN, Lett MC: Herminiimonas arsenicoxydans sp. nov., a metalloresistant bacterium. Int J Syst Evol Microbiol. 2006, 56: 1765-1769. 10.1099/ijs.0.64308-0.
Castang S, Dove SL: High-order oligomerization is required for the function of the H-NS family member MvaT in Pseudomonas aeruginosa. Mol Microbiol. 2010, 78: 916-931. 10.1111/j.1365-2958.2010.07378.x.
Li C, Wally H, Miller SJ, Lu CD: The multifaceted proteins MvaT and MvaU, members of the H-NS family, control arginine metabolism, pyocyanin synthesis, and prophage activation in Pseudomonas aeruginosa PAO1. J Bacteriol. 2009, 191: 6211-6218. 10.1128/JB.00888-09.
Li H, Luo YF, Williams BJ, Blackwell TS, Xie CM: Structure and function of OprD protein in Pseudomonas aeruginosa: from antibiotic resistance to novel therapies. Int J Med Microbiol. 2012, 302: 63-68. 10.1016/j.ijmm.2011.10.001.
Entwicklung der Wasserqualität der Ruhr. Essen, Germany: Ruhrverband, http://www.ruhrverband.de/wissen/wasserqualitaet/entwicklung/,
Klockgether J, Würdemann D, Reva O, Wiehlmann L, Tümmler B: Diversity of the abundant pKLC102/PAGI-2 family of genomic islands in Pseudomonas aeruginosa. J Bacteriol. 2007, 189: 2443-2459. 10.1128/JB.01688-06.
Diels L, Van Roy S, Taghavi S, Van Houdt R: From industrial sites to environmental applications with Cupriavidus metallidurans. Antonie Van Leeuwenhoek. 2009, 96: 247-258. 10.1007/s10482-009-9361-4.
Goodman AL, Kulasekara B, Rietsch A, Boyd D, Smith RS, Lory S: A signalling network reciprocally regulates genes associated with acute infection and chronic persistence in Pseudomonas aeruginosa. Dev Cell. 2004, 7: 745-754. 10.1016/j.devcel.2004.08.020.
Fauvarque MO, Bergeret E, Chabert J, Dacheux D, Satre M, Attree I: Role and activation of type III secretion system genes in Pseudomonas aeruginosa-induced Drosophila killing. Microb Pathog. 2002, 32: 287-295. 10.1006/mpat.2002.0504.
Chang YS, Klockgether J, Tümmler B: An intragenic deletion in pilQ leads to nonpiliation of a Pseudomonas aeruginosa strain isolated from cystic fibrosis lung. FEMS Microbiol Lett. 2007, 270: 201-206. 10.1111/j.1574-6968.2007.00664.x.
Arevalo-Ferro C, Buschmann J, Reil G, Görg A, Wiehlmann L, Tümmler B, Eberl L, Riedel K: Proteome analysis of intraclonal diversity of two Pseudomonas aeruginosa TB clone isolates. Proteomics. 2004, 4: 1241-1246. 10.1002/pmic.200300709.
Ausubel FM, Brent R, Kingston RE, Moore DD, Seidmann JG, Smith JA, Struhl K: (Eds): Current Protocols in Molecular Biology. 1994, New York: Wiley,
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25: 2078-2079. 10.1093/bioinformatics/btp352.
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R, 1000 Genomes Project Analysis Group: The variant call format and VCFtools. Bioinformatics. 2011, 27: 2156-2158. 10.1093/bioinformatics/btr330.
Cingolani P, Platts A, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM: A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012, 6: 80-92. 10.4161/fly.19695.
Zerbino DR, Birney E: Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008, 18: 821-829. 10.1101/gr.074492.107.
Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O’Donovan C, Redaschi N, Yeh LS: UniProt: the Universal Protein knowledgebase. Nucleic Acids Res. 2004, 32: D115-119. 10.1093/nar/gkh131.
Durinck S, Bullard J, Spellman PT, Dudoit S: GenomeGraphs: integrated genomic data visualization with R. BMC Bioinformatics. 2009, 10: 2-10.1186/1471-2105-10-2.
Thorvaldsdottir H, Robinson JT, Mesirov JP: Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013, 14: 178-192. 10.1093/bib/bbs017.
Huson DH, Bryant D: Application of Phylogenetic Networks in Evolutionary Studies. Mol Biol Evol. 2006, 23: 254-267.
OKIB is grateful to Oleg Reva, University of Pretoria, for supervision and intellectual support. This work was supported by a grant from the Mukoviszidose eV (project: Comparative phenotyping of highly virulent clinical Pseudomonas aeruginosa isolates from cystic fibrosis) during the initial stage of the project and later by grants from the Deutsche Forschungsgemeinschaft (SFB 900, projects A2 and Z1). OKIB received a stipend from the German Academic Exchange Service (DAAD). Publication of this manuscript is supported by the sponsorship ‘Open Access Publication’ of the Deutsche Forschungsgemeinschaft.
The authors declare that they have no competing interests.
JK and BT designed the study. IA, SE and BT provided resources. OKIB and CFD wrote scripts. OKIB, JK and CFD evaluated the sequence data. OKIB, JK, CFD and BT interpreted the sequence data and wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Table S1: “SNPs common in clone CHA strains”, contains descriptions of 24560 SNPs detected in all clone CHA strains. (XLS 3 MB)
Additional file 2: Table S2: “Non-synonymous SNPs in clone CHA strains causing aa-exchanges with Dayhoff-similarity-indices < 5”, contains descriptions of 503 SNPs causing amino acid exchanges. (XLS 382 KB)
Additional file 3: Table S3: “SNPs shared by two of the three clone CHA strains”, contains lists of SNPs present in two of the three clonal variants. (XLS 24 KB)
Additional file 4: Table S4: “SNPs specific for strain CHA/PT22/491”, contains three separate lists with SNPs specific for strain CHA, for strain PT22, and for strain 491, respectively. (XLS 46 KB)
Additional file 5: Table S5: “PAO1 loci absent in clone CHA strains”, describes gene loci from the reference sequence that are absent in the clone CHA strains. (DOC 25 KB)
Additional file 6: Table S6: “PAO1 ncRNA loci not conserved in clonal lineages CHA and TB”, lists small RNA loci from the reference sequence that are absent in the clone CHA and the clone TB strains. (XLS 26 KB)
Additional file 7: Figure S1.”, a visualization of a strain-specific deletion in the genome of the clone CHA isolate PT22, and “Figure S2”, showing gene ontologies / functional categories of strain-specific genes of strains CHA, PT22 and 491. (PDF 201 KB)
Additional file 8: Table S7: “Alignment of strain CHA-specific accessory genome contigs versus UniProt-database”, contains results of database similarity search for proteins encoded in the strain CHA-specific DNA. (XLS 26 KB)
Additional file 9: Table S8: “Alignment of strain PT22-specific accessory genome contigs versus UniProt-database”, contains results of database similarity search for proteins encoded in the strain PT22- specific DNA. (XLS 81 KB)
Additional file 10: Table S9: “Alignment of strain 491-specific accessory genome contigs versus UniProt-database”, contains results of database similarity search for proteins encoded in the strain 491-specific DNA. (XLS 47 KB)
Additional file 11: Table S10: “TB63741-specific features”, describes SNPs and small indels differentiating strain TB63741 from its clonal variants. (XLS 32 KB)
Additional file 12: Table S11: “Alignment of TB63741-specific accessory genome contigs versus UniProt-database”, contains results of database similarity search for proteins encoded in the strain TB63741-specific DNA. (XLS 54 KB)
About this article
Cite this article
Bezuidt, O.K., Klockgether, J., Elsen, S. et al. Intraclonal genome diversity of Pseudomonas aeruginosa clones CHA and TB. BMC Genomics 14, 416 (2013). https://doi.org/10.1186/1471-2164-14-416
- Pseudomonas aeruginosa
- Habitat adaptation
- Genome diversity