Draft genome sequence of Xanthomonas fragariaereveals reductive evolution and distinct virulence-related gene content
BMC Genomics volume 14, Article number: 829 (2013)
Xanthomonas fragariae (Xf) is a bacterial strawberry pathogen and an A2 quarantine organism on strawberry planting stock in the EU. It is taxonomically and metabolically distinct within the genus Xanthomonas, and known for its host specificity. As part of a broader pathogenicity study, the genome of a Belgian, virulent Xf strain (LMG 25863) was assembled to draft status and examined for its pathogenicity related gene content.
The Xf draft genome (4.2 Mb) was considerably smaller than most known Xanthomonas genomes (~5 Mb). Only half of the genes coding for TonB-dependent transporters and cell-wall degrading enzymes that are typically present in other Xanthomonas genomes, were found in Xf. Other missing genes/regions with a possible impact on its plant-host interaction were: i) the three loci for xylan degradation and metabolism, ii) a locus coding for a ß-ketoadipate phenolics catabolism pathway, iii) xcs, one of two Type II Secretion System coding regions in Xanthomonas, and iv) the genes coding for the glyoxylate shunt pathway. Conversely, the Xf genome revealed a high content of externally derived DNA and several uncommon, possibly virulence-related features: a Type VI Secretion System, a second Type IV Secretion System and a distinct Type III Secretion System effector repertoire comprised of multiple rare effectors and several putative new ones.
The draft genome sequence of LMG 25863 confirms the distinct phylogenetic position of Xf within the genus Xanthomonas and reveals a patchwork of both lost and newly acquired genomic features. These features may help explain the specific, mostly endophytic association of Xf with the strawberry plant.
Xanthomonas fragariae (Xf) is a bacterial strawberry pathogen and the cause of angular leaf spot. It was first described in the United States in 1962  and has since spread globally. Under favourable conditions the pathogen may cause significant damage to both plant stock and strawberry production . Xf is a quarantine pest on planting stock within the EU , which may explain why this generally considered mild pathogen has remained at the heart of scientific and legislative debate for decades. Xf is a distinct and homogeneous species within the otherwise complex and highly dynamic genus Xanthomonas[4–7]. A certain degree of infraspecific diversity within Xf has been observed, but in general it is considered as a coherent and stable species [8–10].
Unlike its clear taxonomic position, the disease-related capabilities of Xf are still obscure. One well-established characteristic of Xf is its narrow host range: Fragaria spp. are the only natural hosts, although close relatives of Fragaria, such as Potentilla fruticosa and P. glandulosa, showed symptoms after artificial inoculation and therefore are considered potential hosts . Another, poorly characterized feature of Xf is its symptomless persistence in strawberry crops , which holds significant relevance for Xf as quarantine organism in strawberry planting stock. Molecular testing repeatedly demonstrated Xf presence in symptomless rhizomes of strawberry plants intended for planting [13, 14]. Knowledge on the in planta movement of Xf is limited:so far, only one study presented experimental evidence for the endophytic spread of Xf down from infected strawberry leaves to the rhizome and to newly emerging runners and daughter plants . Xf is a challenging organism to study because of its fastidious nature on most common growth media , and its rapidly declining viability after contact with strawberry leaf extracts . Moreover, Xf appeared insusceptible to genetic manipulation, which hampered our efforts in developing fluorescent and functional mutants (unpublished results).
A whole genome sequence of Xf can provide insight in its life style and help solve some of the technical problems it presents in the laboratory. Recent advances in sequencing technology and bioinformatics, together with emerging commercial whole genome sequencing services, have resulted in rapid and cost-effective means of generating draft genomes fit for most plant-pathology related studies . Also within Xanthomonas, multiple genome sequences are available and already provided interesting insights in the most common pathogenicity determinants of the genus [18, 19]. One of the final technical challenges associated with next-generation sequencing techniques is the presence of repetitive genomic sequences . Multiple paired-read datasets with varying insert sizes are often used to resolve assembly ambiguities associated with these repetitive sequences, or at least to bridge sequence gaps by concatenating related contigs into larger scaffolds. In addition, several software tools providing automatic scaffold gap-closure have recently been released: Gapcloser , IMAGE  and Gapfiller .
The aim of the current study was to generate a draft genome sequence of a Belgian, virulent Xf strain (LMG 25863) and to analyse its virulence-related gene content by comparison to available Xanthomonas whole-genome sequences. Two commercially obtained paired-read datasets were combined, and an automatic gap-closure algorithm was applied, to overcome encountered assembly problems related to repetitive DNA. Here, we present the resulting draft genome sequence of Xf LMG 25863 and the observed virulence-related features.
Results and discussion
Repetitive DNA content complicates genome assembly
Assembling the draft genome sequence of Xf was more challenging than anticipated. A first de novo assembly using a single Paired-End (PE) Illumina short-read dataset (Table 1) did not meet our expectations: although the contig number and N50-values of this initial draft genome sequence were on par with comparable assemblies of other Xanthomonas genomes (e.g. ), it was considerably smaller than anticipated (3,9 Mb instead of 5 Mb) and revealed an exceptionally high Insertion Sequences (IS) related repetitive DNA content. A second, Mate-Paired (MP) Illumina short-read dataset with a larger insert size was generated to avoid incomplete genome assembly caused by read ambiguity. The MP dataset did not improve the de novo assembly (data not shown), and the initial PE-based de novo assembly was used to start a second assembly stage that included scaffolding of the de novo assembly with the MP dataset, and subsequent application of the automatic gap-closing algorithm Gapcloser . There was a clear improvement in both assembly quality and read disambiguation, and the current draft sufficed for our plant pathology aimed research goals. Given the quick evolution of genome assembly algorithms, the public available raw sequence data generated in this study could result in a finished genome assembly in the near future.
In its current form, the draft genome of LMG 25863 consists of 96 contigs with a total contig size of 4.182.545 bp. The final draft genome of LMG 25863 was confirmed to contain an abundant IS-content. During RAST annotation, for example, 420 of the total 3786 recognized Coding DNA Sequences (CDS) in the draft genome were identified as IS-related and represented 5% of the total genome size. Of course, this number may be artificially inflated by partial and frame-shifted ORFs caused by incomplete assembly of the highly repetitive IS. However, the frequent association of the IS with sequence gaps, ambiguous read positions and orphan contigs in the draft genome all seem to confirm their abundance. Truncated CDS, due to incomplete assembly in the Xf draft genome or in the other 25 available Xanthomonas genomes (Table 2), were assumed as complete in our comparative analysis. The frequently encountered IS-families in Xf seemed common for Xanthomonas genomes (Figure 1), although during blast queries two types annotated in RAST as “IS1647” and “tis1421” appeared to be more related to Ralstonia and Burkholderia genomes.
Phylogenetic affiliation of X. fragariae to other Xanthomonasspecies
In this study, we compared the Xf draft genome with available whole genome sequences of 25 other Xanthomonas strains (Table 2). First, we determined the phylogenetic affiliation among these 26 genomes using the concatenated partial sequences of four housekeeping genes (gyrB, atpD, dnaK, and rpoD) and the structural gene fyuA[40–42]. The results of the MLSA (Figure 2) were congruent with earlier phylogenetic studies of the genus Xanthomonas[40, 41]. The subdivision between the “core” of the genus and the two outliers X. albilineans GPE PC73 and X. sacchari NCPPB 4393 was most evident. Also, Xf clearly represented a distinct phylogenetic lineage within the Xanthomonas core group.
Xf reveals genome reduction similar to X. oryzae and X. albilineans
Even after the second genome assembly, the total contig size of the Xf draft genome (4.2 Mb) was still implying a considerable genome reduction. Because genome reduction had already been reported for X. albilineans (Xalb) and X. oryzae pv. oryzae (Xoo) , we compared the CDS content of Xf with that of Xalb GPE PC73 and Xoo KACC 10331 using the EDGAR software framework . We further included X. campestris pv. campestris ATCC 33913 (Xcc) and X. euvesicatoria 85–10 (Xcv), respectively a vascular and a non-vascular pathogen, as references of non-reduced genomes. EDGAR analysis indicated a substantial genome reduction in Xf, which was at least in part similar to Xoo and Xalb (Figure 3). This was most apparent from the 490 CDS shared by Xcc and Xcv but absent in Xalb, Xoo and Xf. Additionally, the 195 CDS exclusively missing in Xf suggested that the genome reduction in Xf was more extensive than in Xoo (84 exclusively missing CDS), but not as extreme as in Xalb (367 exclusively missing CDS). The lists of missing and present CDS, acquired from the EDGAR analysis (Additional file 1: Table S1), were used as a starting point for all further genome comparisons in this study, and were checked by independent protein and nucleotide blast queries in all 26 Xanthomonas genomes before inclusion in this manuscript.
The reduced Xf genome has the major virulence-related gene regions
The lists of CDS from the EDGAR analysis (Additional file 1: Table S1) indicated that Xf did not lose critical pathogenesis-related gene clusters reported for Xanthomonas. Contrary to the more reduced Xalb genome, for example, the Xf genome contained the hrp gene cluster coding for the structural elements of the Type III Secretion System (T3SS) and the gum gene cluster for extracellular polysaccharide (EPS) synthesis (Table 3). It also contained the gene cluster coding for a common Type IV Secretion System (T4SS) in Xanthomonas, which was entirely missing in Xoo and only partly retained in Xcv. Xf did lose the xcs-coded Type II Secretion System (T2SS), similar to Xoo and Xalb, but this gene cluster was reported as less critical for Xanthomonas pathogenicity than the clearly present xps-coded T2SS . Also reported as unessential for virulence and absent in Xf, are the gum-associated genes gumN, gumO and gumP, and the rpf-associated genes rpfD (truncated) and rpfI.
Other noteworthy gene regions that were indicated as absent in Xf by EDGAR analysis and confirmed by blast queries, were i) the glyoxylate shunt pathway coding locus [46, 47], which was also missing in Xoo and Xalb, ii) the three loci of the Carbohydrate Utilization (CUT) system involving TonB-dependent transporters for xylan degradation and metabolism in X. campestris[48, 49], iii) the genes coding for the kdp potassium transport system  and, iv) a gene region coding for a ß-ketoadipate phenolics degradation pathway  that was also found partly absent in Xoo (Table 4).
Although some absent gene-regions in Xf may have virulence-related implications, the genome reduction in Xf seems to weaken nutritional and adaptive flexibility rather than clear virulence functions. For example, the absence of all three xylan degradation loci and the ß-ketoadipate pathway may indicate that Xf is unable to respectively degrade xylan and metabolize the phenolic components of lignin, two important elements of the secondary plant cell wall . Perhaps, the opinion that primarily soil bacteria have been associated with lignin degradation , might suggest that the main role of the ß-ketoadipate pathway lays in saprophytic survival. Likewise, though the glyoxylate shunt pathway has been linked to successful symbiotic and pathogenic plant-bacterial interactions, it does so by increasing metabolic fitness through growth on C2-compounds . Potassium is another important nutritional element, crucial for cell turgor maintenance, activation of cellular enzymes and pH homeostasis. The kdp potassium transport system is widely distributed among bacteria and serves as an emergency K+-scavenging system that is only expressed and activated under extreme environmental stress . Maybe these missing functions are redundant for Xf’s existence in the strawberry leaf apoplast. Among the 26 analysed Xanthomonas genomes, the xylan degradation pathway, the sdk potassium transport system and the entire phenolics degradation pathway were uniquely missing in Xf.
The Xf genome has a reduced TonB-dependent transporter set
The EDGAR analysis suggested a substantial loss of TonB-dependent transporters (TBDT) in Xf, Xoo and Xalb. Therefore, we screened all 26 Xanthomonas genomes for homologs of 100 TBDT references (Additional file 2: Table S2). Overall, the average TBDT gene repertoire amounted to 56 homologs with a standard deviation of 14. Then again only 27 homologs could be found in Xf, which was the second smallest TBDT repertoire after Xalb (26 homologs) (Figure 4). The Xoo and Xoc genomes had 34 to 40 homologs. The extensive TBDT repertoire in Xcc has been linked to niche diversity and carbohydrate scavenging in the oligotrophic conditions encountered during epiphytic survival . In turn, the small TBDT sets in Xf, Xalb and the X. oryzae strains may be yet another adaptation to a stable and restricted niche.
Xf has a reduced plant cell wall degrading enzyme set
Observing the missing xylan degradation and ß-ketoadipate pathways in Xf, triggered us to study its Cell Wall Degrading Enzyme (CWDE) repertoire in greater detail. To this end, all 26 Xanthomonas genomes were screened for homologs of 46 CWDE references (Additional file 3: Table S3). The average number of pectinolytic, cellulolytic and hemicellulolytic enzyme homologs thus retrieved in each genome was 6, 14 and 10 respectively, with standard deviations of 2, 2 and 3. The average total CWDE repertoire consisted of 31 homologs, with a standard deviation of 6. With only 18 homologs, Xf revealed the smallest CWDE repertoire comprised of an apparently unreduced pectinolytic enzyme repertoire but with low numbers of cellulolytic (10 homologs) and hemicellulolytic (3 homologs) enzymes (Figure 5). Small CWDE repertoires were again also apparent in the genomes of Xalb (19 homologs), Xoc (20 homologs), and to a lesser extent in the three Xoo strains (25 homologs). For the 26 Xanthomonas genomes analysed, no clear correlation could be observed between the CWDE repertoires and their infection mode: only the XCC3534-like cellobiosidase appeared unique for vascular pathogens.
Despite the contribution of CWDEs to virulence, the reduced set of Xf does not necessarily make it a lesser pathogen. Smaller CWDE repertoires are typically found in biotrophic pathogens, who rely on precise breaching of the host cell wall during infection instead of extensive tissue destruction observed for necrotrophic pathogens . One potential explanation for the reduced CWDE repertoire of Xf may be found in the concurrent absence of the ß-ketoadipate phenolics degradation pathway: strawberry plant tissue is rich in phenolics  and many contribute to plant defence as phytoanticipins or phytoalexins, which are often released when plant cell integrity is compromised . Therefore, one could hypothesize that one way for Xf to survive long-term residence in its potentially toxic host is to avoid extensive tissue damage.
Xf exhibits a distinct T3SE repertoire and several putatively new effectors
Because several Xf-exclusive CDS in the EDGAR analysis showed similarity with Type III Secretion Effector (T3SE) genes, we compared the T3SE genes of Xf with those of the other 25 Xanthomonas genomes. Classifying the Xf-effectors within the currently defined effector-families , however, was challenging. Some effector families contained clearly distinct subgroups with lower than 70% inter-subgroup pairwise protein sequence similarities, as indicated here for XopA (Figure 6). A more restrictive definition and further subdivision of the current T3SE-families could improve the classification. For now, we retained the classification as defined by White et al. (2009) , except for XopAG, and applied a general cut-off of 60% pairwise protein sequence similarity within a given effector family. For XopAG we suggest the division into two subfamilies: XopAG1 and XopAG2, with [Genbank:ZP_10262207] and [Genbank:ZP_10263166] as respective reference sequences, encountered in the genome of X. axonopodis pv. punicae LMG 859. The Xf-effector gene [Genbank:O1K_00020] was unambiguously classified in XopAG2 (Figure 7). Using the 60% cut-off rule, we identified homologs of 23 known Xanthomonas T3SEs in Xf, with two homologs for XopX (Table 5). Presence of at least one XopAD homolog and multiple XopP homologs was suggested, but could not be confirmed due to incomplete genome assembly.
Comparison of the effector gene repertoire identified in Xf and in the other 25 Xanthomonas genomes is shown (Additional file 4: Table S4). For simplicity, multiple homologs of a certain effector class within a single genome were marked only once. Transcription Activator-Like Effectors (TALEs) were not included in the comparison because Xf did not appear to contain any, and also TALEs are distinct from the other effectors in both coding sequences and function . Among the 23 hrp-positive genomes, the average T3SE repertoire consisted of 23 different families, with a standard deviation of 5 (Figure 8). Explicit small T3SE repertoires were observed in X. campestris pv. raphani 756C (13 effectors) and in X. arboricola LMG 19146 (6 effectors) , while Xcv revealed the largest repertoire with 31 effectors.
Although the T3SE repertoire of Xf seemed average-sized (25 effectors), it distinguished itself by the presence of multiple rare effectors: XopB, XopC1, XopE4, XopZ2, XopAF and XopAS. This distinct T3SE repertoire in Xf was also evident from the isolated position of Xf in a Neighbor Joining split network based on binary representations of the T3SE repertoires found among all 23 hrp-positive genomes (Figure 9). Interestingly, the general topology of the T3SE-based network correlated well with the MLSA based phylogenetic tree shown before, suggesting that at least a fraction of the T3SE repertoire was acquired early and evolved slowly during the formation of the major lineages within the genus.
In addition to this unique “known” T3SE repertoire, the Xf genome also revealed several putative new effectors (Table 6). Three putative T3SEs in Xf showed low homology with the known Xanthomonas effectors XopC, XopE and XopAD: [Genbank:O1K_18811] showed highest pairwise similarity (66.6%) with a T3SE found in the genus Ralstonia, [Genbank:01K_20482] showed 56.4% similarity to an unidentified protein in Mesorhizobium, and [Genbank:O1K_11082] showed up to 78.5% pairwise similarity with an unidentified protein in X. gardneri ATCC 19865. Nine other putative T3SEs in Xf showed distant homology with both XopD and PsvA, a T3SE family in Acidovorax and Pseudomonas contributing to host specificity, plant defence suppression and in planta pathogen proliferation . Although the function of these putative new T3SEs still needs to be confirmed, the likely Plant-Inducible-Promoter (PIP) sequences , frequently found in the vicinity of their coding sequences, suggest virulence-related functions for these proteins. The low%GC value observed in 7 of these coding sequences might suggest that they were acquired through horizontal gene transfer (Table 6).
Xf harbours a Type VI secretion system similar to X. oryzae
Two loci shared between Xf and Xoo in the EDGAR analysis appeared to code for structural elements of a Type VI secretion system (T6SS). Coding sequences for putative Vgr protein-like T6SS effectors were encountered in the Xf genome, although their exact number and sequence could not be established due to incomplete assembly. The T6SS is the most recently found secretion system in gram-negative bacteria, and several distinct types are widely distributed among the Proteobacteria. In contrast to their common structure, which is analogous to the injection apparatus of bacteriophages, the specific roles of each of these distinct T6SSs are still obscure: some systems were shown to contribute to the modification of eukaryotic hosts within both pathogenic and symbiotic relationships, while others were linked to inter-bacterial activity during the struggle for niche dominance [61, 62]. All 26 Xanthomonas genomes were searched for the presence of T6SSs using a widely retained T6SS-related protein class (COG3519) as bait , which revealed the presence of three distinct T6SSs within the genus (Figure 10). The T6SS of Xf appeared highly similar to one of the two T6SSs that were found in the X. oryzae genomes. This distinct, previously thought X. oryzae-exclusive, T6SS has been correlated with plant host specialization because of its similarity with the T6SS found in plant pathogens like Ralstonia solanacearum and Pseudomonas syringae. The same study grouped the other two T6SSs that were observed here, in a widely distributed T6SS class present in both animal and plant pathogens, suggesting a broader functionality. Of course, this in-silico based hypothesis would need further experimental confirmation.
Evidence of considerable horizontal gene transfer and a CRISPR in Xf
In addition to some virulence genes that are possibly acquired by Horizontal Gene Transfer (HGT) and IS elements, other evidence of HGT-exposure could be found in the Xf genome. The IS-content appeared exceptionally abundant in Xf. While IS abundance is a common feature of Xanthomonas genomes, exceptionally high IS content was previously reported for the three Xoo strains and the Xoc strain [32, 36–38]. There, it was interpreted as a result of the consistent association of these strains with rice: the stable environment would have alleviated the selective pressure of many genes, allowing their disruption by IS. A similar process could be envisioned for Xf. Conversely, it has also been described as an important source of genome plasticity in X. oryzae and a cause of the genotypic diversity within the species [32, 38]. This seems to conflict with the restricted genotypic diversity reported for Xf. Perhaps the relatively young practice of formal plant breeding in strawberry cultivation (18th-19th century AD)  compared to the ancient domestication of rice (7000–4000 BC)  has to be considered.
Possible HGT-related CDS in Xf were a set of phage coding genes ([Genbank:O1K_11280] to [Genbank:O1K_11410]), nine toxin-antitoxin (TA) modules (Table 7) , and nine contigs with a total sequence length of 25 Kb exhibiting more than 90% overall DNA sequence identity with the 27 Kb Plasmid II of X. albilineans GPE PC73. The latter nine contigs mainly coded for the structural elements of a second, phylogenetic distinct T4SS, and plasmid replication functions. Although this raised the possibility of an autonomous plasmid in Xf, so far we were unable to identify or isolate any plasmid DNA from the sequenced strain (data not shown).
Another HGT-related CDS in Xf with a potential virulence related function, [Genbank:O1K_06242], coded for a 2485aa putative Repeats-in-toxin (RTX) exoprotein . Its low%GC (54%) and the presence of IS elements directly up and downstream seem to indicate that it was acquired through HGT.
Finally, the Xf genome revealed a Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-region comprised of 6 CRISPR associated (cas) genes of the so-called Ypest (“Yersinia pestis”) subtype ([Genbank:O1K_01919] to [Genbank:O1K_01944]), a 121 bp long AT-rich leader sequence and a CRISPR containing 36 identical repeats and 36 spacers. A second, smaller locus containing an additional 4 repeats and 3 spacers was also found, although the last repeat in this locus was degenerated. Among all other studied Xanthomonas genomes, cas-genes of the Ypest-subtype were also found in Xalb and X. campestris pv. raphani 756C (Xcr). Moreover, the associated CRISPR repeat sequence in Xf (GTTCACTGCCGCGTAGGCAGCTCAGAAA) was identical to that of Xcr, and diverged only one nucleotide from that of Xalb (GTTCACTGCCGTGTAGGCAGCTCAGAAA). CRISPR regions were recently recognized as a prokaryotic adaptive immune system against invading DNA molecules, with functional analogy to the RNA interference (RNAi) pathways in eukaryotes .
The draft genome sequence of Xf provided valuable insight in its general and more specific pathogenicity-related gene content. Although the current total contig size of 4.2 Mb is not definite, the sheer amount of missing gene homologs in Xf is sufficient evidence for a significant genome reduction. A similar “convergent genome erosion” was already reported for X. albilineans and X. oryzae pv. oryzae, and ascribed to their restricted lifestyle within the xylem vessels of their hosts . A similar genome reduction was found here in Xf and also in X. oryzae pv. oryzicola BLS256, which are two non-vascular leaf pathogens. Therefore, it may be more accurate to ascribe the convergent genome reduction of these Xanthomonas species to their endophytic lifestyle and typically to their commitment to a single host. Similar to earlier comparative genomic studies within Xanthomonas[30, 45], we were unable to find clear determinants for host or tissue specificity. Perhaps this specificity is the result of a more complex interplay between different genes or, of a subtle sequence variation within a small set of conserved genes. Alternatively, clear host or tissue determinants may still remain hidden within the substantial group of proteins for which we currently have no clear molecular function, or in uncharacterised functional RNAs.
Based on the data presented here, one could hypothesize an evolutionary process for Xf that is reminiscent of the model that was recently presented for some dangerous epidemic bacteria of humans . During an initial period of intense horizontal gene transfer experienced by the more generalist ancestor of Xf, acquirement of certain heterologous host-specificity factors would have allowed it to colonize the strawberry leaf apoplast and thus escape antagonists and environmental threats. This transition from a dynamic to a stable environment would subsequently have triggered the observed genome reduction: useless or redundant features, especially metabolic, perceptive and regulatory functions, were allowed to degrade and eventually were lost. At some point, the progressing genome erosion resulted in the effective metabolic “entrapment” of Xf within the strawberry plant, excluding it from other hosts or more general epiphytic or saprophytic lifestyles. Because of Xf’s increasing spatial and phylogenetic isolation, the initially intense horizontal gene transfer would have abated, a process that was perhaps hastened by the acquirement of the CRISPR region. Meanwhile, mobile genetic elements which conferred a selective advantage to Xf would have become permanently incorporated in the genome. This evolutionary process would finally have resulted in the genotypical and phenotypical distinct, mainly endophytic, biotrophic and strawberry-specific Xf known today. The eventual necrosis of the typical water-soaked angular leaf spots associated with Xf would not be in conflict with this hypothesis: it could be the manifestation of an eventual breakthrough of the plant defence, or merely the collapse of an overburdened plant cellular system.
Of course, many of the in-silico based hypotheses presented here should be tested and confirmed by further experimental data. In our opinion, the most appealing matters for future research are i) the molecular and functional characterization of the putatively new PsvA-like T3SEs, ii) the exact function of the T6SS in Xf and other Xanthomonas, and iii) the delicate endophytic existence of Xf, sensitive to toxic compounds inside the strawberry plant cells.
Strain selection, culture conditions and DNA preparation
The selected Xf strain, LMG 25863, was isolated as GBBC-Xf 920 in 2002 from clear angular leaf spots on strawberry leaves at the Institute for Agricultural and Fisheries Research (ILVO) in Belgium. Since then, it has been applied in the development of a real-time PCR detection method for Xf , served as Xf reference in a study of X. arboricola pv. fragariae and as parental strain in the development of a green fluorescent Xf mutant (unpublished). It was deposited at the BCCM-LMG culture collection at the time of whole genome sequencing. For genomic DNA preparation, the cryogenically stored Xf strain was resuscitated by incubation on Wilbrinck-N agar medium  at 28°C for 96 h. A single colony of this culture was then transferred to fresh Wilbrink-N agar plates and again incubated at 28°C for 48 h. From these cultures, a total of 30 μg DNA with a concentration of at least 100 ng/μl and an OD260:280 rating between 1.8 and 2.0 was prepared using the Gentra Puregene Cell Kit (QIAGEN Benelux B.V., Venlo, The Netherlands), according to the manufacturer’s instructions. Quantity and quality of the extracted DNA was checked using the NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA).
Sequencing, draft genome assembly and annotation
Custom DNA library preparation and Illumina-sequencing was performed by Baseclear N.V., Leiden, The Netherlands. A first Paired-End (PE) DNA library with an a mean insert size of 375 bp was sequenced with 50 bp reads on an Illumina Genome Analyzer IIx (Illumina Inc., San-Diego, USA). A second, Mate-Paired (MP) DNA library with a mean insert size of 5100 bp was sequenced with 75 bp reads on an Illumina Hiseq2000 (Illumina Inc.), but only the first 50 bp were used to avoid chimeric reads.
The received raw PE and MP read sets were quality trimmed in CLC Bio v4.0 (CLC bio, Aarhus, Denmark) using a Phred quality cut-off score of 20. An initial de novo assembly was performed in CLC Bio v4.0 using only the PE reads, and all contigs shorter than 200 bp were discarded. This assembly was scaffolded in SSPACE Premium v2.0  using MP reads and processed with the Gapcloser v1.12 tool of the SOAP genome assembly software . Gapfiller and IMAGE were not used because the former was not yet freely available and the latter could not be operated as intended. Because Gapcloser did not recognize the Hiseq2000 file parsing of the MP dataset, only the PE data was used. Finally, the draft genome sequence was manually edited with the Editseq tool of DNAStar Lasergene core suite v10.0.1 (DNASTAR Inc., Madison, WI, USA). Remaining N-nucleotides in the scaffolds, introduced during scaffolding and not replaced by gapcloser, were removed from the final sequence by breaking up the scaffolds back into contigs where they were encountered. The quality of the final draft genome sequence was compared to the initial PE-based de novo assembly through comparative read-mapping in CLC Bio v4.0 using both the trimmed read sets. The final draft genome sequence of Xf was putatively annotated with the RAST v4.0 online annotation pipeline  and NCBI’s Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP; http://www.ncbi.nlm.nih.gov/genomes/static/Pipeline.html).
Comparative genome analysis
The presence of possible pathogenicity-related genes in the Xf draft genome was analysed by comparison with 25 other Xanthomonas genomes (Table 2). In a first explorative screening, the gene content of the Xf genome was compared with the chromosomes X. campestris pv. campestris (Xcc) ATCC 33913, X. euvesicatoria (Xcv) 85–10, X. oryzae pv. oryzae (Xoo) KACC 10331 and X. albilineans (Xalb) GPE PC73 in EDGAR . Absence or presence of coding sequences in each genome, as reported by EDGAR, were independently confirmed by performing protein and nucleotide blast queries (as described below) in the target genomes before inclusion in this manuscript.
In a second phase, gene families of interest were examined in all 26 genomes. All genomes except three were screened using the protein and nucleotide blast tools of Genbank. Four genomes not present in Genbank were screened with the blast tool of the SEED Viewer v2.0 web-interface : draft genomes of the present Xf strain LMG 25863, two X. arboricola strains LMG 19145 and LMG 19146 , and the X. arboricola pv. pruni strain LMG 25864 (M Maes, unpublished). Based on the EDGAR results, three protein families were studied in greater detail: the TonB-dependent transporters (TBDT), the cell-wall degrading enzymes (CWDE) and the Type III secretion system effectors (T3SE).
To determine the phylogenetic position of Xf within the Xanthomonas genus, we performed in silico multi-locus sequence analysis using partial sequences of the genes gyrB, rpoD, atpD, dnaK and fyuA, according to Parkinson et al. (2009) , Young et al. (2008) , and Ngoc et al. (2010) . Sequences were retrieved from the Xf and the 25 other Xanthomonas genomes available in GenBank (Table 2). The sizes of the five partial sequences are 530 bp (gyrB), 747 bp (atpD), 940 bp (dnaK), 873 bp (rpoD) and 698 bp (fyuA), giving a total of 3788 bp for the concatenated dataset. Sequences were concatenated and aligned using the CLUSTALW algorithm . Sequence alignment, trimming, and phylogenetic analysis were performed in Mega5 . The phylogenetic tree was generated using the Maximum Composite Likelihood method for calculating distances and the Neighbor Joining algorithm for clustering , bootstrap analysis was performed using 1000 bootstrap replicates.
The TBDT repertoire of Xf was compared to that of the other 25 Xanthomonas genomes. The 72 TBDTs identified in Xcc ATCC 33913  were used as primary references. Initially, the 26 genomes were screened for homologs of these primary references using protein blast queries. In case of a negative blast result, an additional nucleotide blast with the reference coding sequence was performed to exclude false negative results. TBDTs with low protein sequence similarity (<70%) with the primary references were considered as new types and were added to the list of references. TBDTs in the more distantly related X. albilineans GPE PC73 and X. sacchari NCPPB 4393 occasionally showed ambiguous homology with more than one reference. This was resolved by reciprocal blast, after which the TBDT in question was assigned to the reference with the best blast hit score.
Cell-wall degrading enzymes
Reference sequences of different CWDE-families were retrieved from . All 26 genomes were searched for homologs of these references in a similar fashion as applied for the TBDTs.
Type III secretion effectors
References of the presently known T3SEs in Xanthomonas were retrieved from the Xanthomonas Resource website  and all 26 genomes were screened for homologs. Because of their sometimes large diversity, the protein sequences of all encountered (putative) effectors were collected and compared in Bionumerics v6.6 (Applied Maths, Sint-Martens-Latem, Belgium). For each effector class, a protein-sequence based pairwise alignment similarity matrix was calculated using the standard Bionumerics algorithm in its default settings. Candidate effectors of Xf were only considered as part of an effector class when it exhibited at least 60% pairwise similarity with at least one other entry in the matrix. Next, a binary table listing the presence or absence of at least one fully coded homolog of every known T3SE class in each of the 26 genomes was created. Truncated, frame-shifted or otherwise suspected incomplete or inactive coding sequences were interpreted as absent. When the full coding sequence could not be retrieved because of incomplete assembly, the effector was counted as present. Multiple homologs of an effector in a single genome were counted only once. This binary dataset was imported and analyzed in Bionumerics. A binary-data based distance matrix was calculated using the Hamming Distance parameter and a distance-based consensus network was calculated using the Neighbor Joining tree method in its default settings and bootstrap resampling with 1000 replications .
The raw sequence data received from Baseclear N.V. (Leiden, The Netherlands) was deposited at the Short Read Archive (SRA) of Genbank under accession numbers SRR514114 (PE dataset) and SRR514113 (MP dataset). The current draft genome sequence was deposited at Genbank under accession number AJRZ00000000 after automatic annotation by the PGAAP online annotation pipeline.
Kennedy BW, King TH: Angular leaf spot of strawberry caused by Xanthomonas fragariae sp. nov. Phytopathology. 1962, 52: 873-875.
Maas JL: Angular leaf spot. Compendium of Strawberry Diseases. Edited by: Maas JL. 1998, St. Paul, MN-USA: American Phytopathological Society, 16-17. 2
European Commission: EC Council Directive 2000/29/EC of 8 May 2000 on Protective measures against the introduction into the Community of organisms harmful to plants or plant products and against their spread within the Community. OJEU. 2000, Luxembourg: Publications Office of the European Union, Luxembourg, 1-112.
Dye DW, Lelliott RA, Buchanan RE, Gibbons NE: Genus II. Xanthomonas Dowson 1939, 187. Bergey’s Manual of Determinative Bacteriology. Edited by: Bergey DH, Buchanan RE, Gibbons NE. 1974, Baltimore, USA: Williams & Wilkins Co, 243-249. 8
Van den Mooter M, Swings J: Numerical analysis of 295 phenotypic features of 266 Xanthomonas strains and related strains and an improved taxonomy of the genus. Int J Syst Bacteriol. 1990, 40: 348-369. 10.1099/00207713-40-4-348.
Vauterin L, Hoste B, Kersters K, Swings J: Reclassification of Xanthomonas. Int J Syst Bacteriol. 1995, 45: 472-489. 10.1099/00207713-45-3-472.
Vauterin L, Rademaker J, Swings J: Synopsis on the taxonomy of the genus Xanthomonas. Phytopathology. 2000, 90: 677-682. 10.1094/PHYTO.2000.90.7.677.
Pooler MR, Ritchie DF, Hartung JS: Genetic relationships among strains of Xanthomonas fragariae based on random amplified polymorphic DNA PCR, repetitive extragenic palindromic PCR, and enterobacterial repetitive intergenic consensus PCR data and generation of multiplexed PCR primers useful for the identification of this phytopathogen. Appl Environ Microbiol. 1996, 62: 3121-3127.
Roberts PD, Hodge NC, Bouzar H, Jones JB, Stall RE, Berger RD, Chase AR: Relatedness of strains of Xanthomonas fragariae by restriction fragment length polymorphism, DNA-DNA reassociation, and fatty acid analyses. Appl Environ Microbiol. 1998, 64: 3961-3965.
Stöger A, Barionovi D, Calzolari A, Gozzi R, Ruppitsch W, Scortichini M: Genetic variability of Xanthomonas fragariae strains obtained from field outbreaks and culture collections as revealed by repetitive-sequence PCR and AFLP. J Plant Pathol. 2008, 90: 469-473.
Kennedy BW: Infection of Potentilla by Xanthomonas fragariae. Plant Dis Rep. 1965, 49: 491-492.
Roberts PD, Berger RD, Jones JB, Chandler CK, Stall RE: Disease progress, yield loss, and control of Xanthomonas fragariae on strawberry plants. Plant Dis. 1997, 81: 917-921. 10.1094/PDIS.19184.108.40.2067.
Mahuku GS, Goodwin PH: Presence of Xanthomonas fragariae in symptomless strawberry crowns in Ontario detected using a nested polymerase chain reaction (PCR). Can J Plant Pathol. 1997, 19: 366-370. 10.1080/07060669709501061.
Moltmann E, Zimmermann C: Detection of Xanthomonas fragariae in symptomless strawberry plants by nested PCR. EPPO Bull. 2005, 35: 53-54. 10.1111/j.1365-2338.2005.00812.x.
Milholland RD, Ritchie DF, Daykin ME, Gutierrez WA: Multiplication and translocation of Xanthomonas fragariae in strawberry. Adv Strawb Res. 1996, 15: 13-17.
Hildebrand PD, Braun PG, Renderos WE, Jamieson AR, McRae KB, Binns MR: A quantitative method for inoculating strawberry leaves with Xanthomonas fragariae, factors affecting infection, and cultivar reactions. Can J Plant Pathol. 2005, 27: 16-24. 10.1080/07060660509507189.
MacLean D, Jones JDG, Studholme DJ: Application of ‘next-generation’ sequencing technologies to microbial genetics. Nat Rev Microbiol. 2009, 7: 287-296.
Büttner D, Bonas U: Regulation and secretion of Xanthomonas virulence factors. FEMS Microbiol Rev. 2009, 34: 107-133.
Ryan RP, Vorhölter FJ, Potnis N, Jones JB, Van Sluys MA, Bogdanove AJ, Dow JM: Pathogenomics of Xanthomonas: understanding bacterium-plant interactions. Nat Rev Microbiol. 2011, 9: 344-355. 10.1038/nrmicro2558.
Treangen TJ, Salzberg SL: Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet. 2011, 13: 36-46.
Li R, Fan W, Tian G, Zhu H, He L, Cai J, Huang Q, Cai Q, Li B, Bai Y: The sequence and de novo assembly of the giant panda genome. Nature. 2009, 463: 311-317.
Tsai I, Otto T, Berriman M: Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps. Genome Biol. 2010, 11: R41-10.1186/gb-2010-11-4-r41.
Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W: Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011, 27: 578-579. 10.1093/bioinformatics/btq683.
Potnis N, Krasileva K, Chow V, Almeida NF, Patil PB, Ryan RP, Sharlach M, Behlau F, Dow JM, Momol MT: Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper. BMC Genomics. 2011, 12: 146-10.1186/1471-2164-12-146.
Pieretti I, Royer M, Barbe V, Carrere S, Koebnik R, Cociancich S, Couloux A, Darrasse A, Gouzy J, Jacques MA: The complete genome sequence of Xanthomonas albilineans provides new insights into the reductive genome evolution of the xylem-limited Xanthomonadaceae. BMC Genomics. 2009, 10: 616-10.1186/1471-2164-10-616.
Da Silva ACR, Ferro JA, Reinach FC, Farah CS, Furlan LR, Quaggio RB, Monteiro-Vitorello CB, Van Sluys MA, Almeida NF, Alves LMC, Do Amaral AM, Bertolini MC, Camargo LEA, Camarotte G, Cannavan F, Cardozo J, Chambergo F, Ciapina LP, Cicarelli RMB, Coutinho LL, Cursino-Santos JR, El-Dorry H, Faria JB, Ferreira AJS, Ferreira RCC, Ferro MIT, Formighieri EF, Franco MC, Greggio CC, Gruber A, et al: Comparison of the genomes of two Xanthomonas pathogens with differing host specificities. Nature. 2002, 417: 459-463. 10.1038/417459a.
Jalan N, Aritua V, Kumar D, Yu F, Jones JB, Graham JH, Setubal JC, Wang N: Comparative genomic analysis of Xanthomonas axonopodis pv. citrumelo F1, which causes citrus bacterial spot disease, and related strains provides insights into virulence and host specificity. J Bacteriol. 2011, 193: 6342-6357. 10.1128/JB.05777-11.
Sharma V, Midha S, Ranjan M, Pinnaka AK, Patil PB: Genome sequence of Xanthomonas axonopodis pv. punicae strain LMG 859. J Bacteriol. 2012, 194: 2395-10.1128/JB.00181-12.
Studholme DJ, Kemen E, MacLean D, Schornack S, Aritua V, Thwaites R, Grant M, Smith J, Jones JD: Genome-wide sequencing data reveals virulence factors implicated in banana Xanthomonas wilt. FEMS Microbiol Lett. 2010, 310: 182-192. 10.1111/j.1574-6968.2010.02065.x.
Qian W, Jia Y, Ren SX, He YQ, Feng JX, Lu LF, Sun Q, Ying G, Tang DJ, Tang H: Comparative and functional genomic analyses of the pathogenicity of phytopathogen Xanthomonas campestris pv. campestris. Genome Res. 2005, 15: 757-767. 10.1101/gr.3378705.
Vorhölter FJ, Schneiker S, Goesmann A, Krause L, Bekel T, Kaiser O, Linke B, Patschkowski T, Rückert C, Schmid J: The genome of Xanthomonas campestris pv. campestris B100 and its use for the reconstruction of metabolic pathways involved in xanthan biosynthesis. J Biotechnol. 2008, 134: 33-45. 10.1016/j.jbiotec.2007.12.013.
Bogdanove AJ, Koebnik R, Lu H, Furutani A, Angiuoli SV, Patil PB, Van Sluys MA, Ryan RP, Meyer DF, Han SW: Two new complete genome sequences offer insight into host and tissue specificity of plant pathogenic Xanthomonas spp. J Bacteriol. 2011, 193: 5450-5464. 10.1128/JB.05262-11.
Thieme F, Koebnik R, Bekel T, Berger C, Boch J, Büttner D, Caldana C, Gaigalat L, Goesmann A, Kay S: Insights into genome plasticity and pathogenicity of the plant pathogenic bacterium Xanthomonas campestris pv. vesicatoria revealed by the complete genome sequence. J Bacteriol. 2005, 187: 7254-7266. 10.1128/JB.187.21.7254-7266.2005.
Midha S, Ranjan M, Sharma V, Pinnaka AK, Patil PB: Genome sequence of Xanthomonas citri pv. mangiferaeindicae strain LMG 941. J Bacteriol. 2012, 194: 3031-10.1128/JB.00433-12.
Moreira LM, Almeida NF, Potnis N, Digiampietri LA, Adi SS, Bortolossi JC, da Silva AC, Da Silva AM, De Moraes FE, De Oliveira JC: Novel insights into the genomic basis of citrus canker based on the genome sequences of two strains of Xanthomonas fuscans subsp. aurantifolii. BMC Genomics. 2010, 11: 238-10.1186/1471-2164-11-238.
Lee BM, Park YJ, Park DS, Kang HW, Kim JG, Song ES, Park IC, Yoon UH, Hahn JH, Koo BS: The genome sequence of Xanthomonas oryzae pathovar oryzae KACC10331, the bacterial blight pathogen of rice. Nucleic Acids Res. 2005, 33: 577-586. 10.1093/nar/gki206.
Ochiai H, Inoue Y, Takeya M, Sasaki A, Kaku H: Genome sequence of Xanthomonas oryzae pv. oryzae suggests contribution of large numbers of effector genes and insertion sequences to its race diversity. JARQ. 2005, 39: 275-
Salzberg SL, Sommer DD, Schatz MC, Phillippy AM, Rabinowicz PD, Tsuge S, Furutani A, Ochiai H, Delcher AL, Kelley D: Genome sequence and rapid evolution of the rice pathogen Xanthomonas oryzae pv. oryzae PXO99A. BMC Genomics. 2008, 9: 204-10.1186/1471-2164-9-204.
Vandroemme J, Cottyn B, Pothier JF, Pflüger V, Duffy B, Maes M: Xanthomonas arboricola pv. fragariae: what’s in a name?. Plant Pathol. doi:10.1111/ppa.12028
Parkinson N, Cowie C, Heeney J, Stead D: Phylogenetic structure of Xanthomonas determined by comparison of gyrB sequences. Int J Syst Evol Micr. 2009, 59: 264-274. 10.1099/ijs.0.65825-0.
Young JM, Park DC, Shearman HM, Fargier E: A multilocus sequence analysis of the genus Xanthomonas. Syst Appl Microbiol. 2008, 31: 366-377. 10.1016/j.syapm.2008.06.004.
Ngoc LBT, Verniere C, Jouen E, Ah-You N, Lefeuvre P, Chiroleu F, Gagnevin L, Pruvost O: Amplified fragment length polymorphism and multilocus sequence analysis-based genotypic relatedness among pathogenic variants of Xanthomonas citri pv. citri and Xanthomonas campestris pv. bilvae. Int J Syst Evol Micr. 2010, 60: 515-525. 10.1099/ijs.0.009514-0.
Blom J, Albaum SP, Doppmeier D, Pühler A, Vorhölter FJ, Zakrzewski M, Goesmann A: EDGAR: a software framework for the comparative analysis of prokaryotic genomes. BMC Bioinforma. 2009, 10: 154-10.1186/1471-2105-10-154.
Szczesny R, Jordan M, Schramm C, Schulz S, Cogez V, Bonas U, Büttner D: Functional characterization of the Xcs and Xps type II secretion systems from the plant pathogenic bacterium Xanthomonas campestris pv. vesicatoria. New Phytol. 2010, 187: 983-1002. 10.1111/j.1469-8137.2010.03312.x.
Lu H, Patil P, Van Sluys MA, White FF, Ryan RP, Dow JM, Rabinowicz P, Salzberg SL, Leach JE, Sonti R: Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in Xanthomonas. PLoS One. 2008, 3: e3828-10.1371/journal.pone.0003828.
Dunn MF, Ramírez-Trujillo JA, Hernández-Lucas I: Major roles of isocitrate lyase and malate synthase in bacterial and fungal pathogenesis. Microbiology. 2009, 155: 3166-3175. 10.1099/mic.0.030858-0.
Schatschneider S, Vorhölter FJ, Rückert C, Becker A, Eisenreich W, Pühler A, Niehaus K: Genome-enabled determination of amino acid biosynthesis in Xanthomonas campestris pv. campestris and identification of biosynthetic pathways for alanine, glycine, and isoleucine by 13 C-isotopologue profiling. Mol Genet Genomics. 2011, 286: 247-259. 10.1007/s00438-011-0639-7.
Blanvillain S, Meyer D, Boulanger A, Lautier M, Guynet C, Denancé N, Vasse J, Lauber E, Arlat M: Plant carbohydrate scavenging through TonB-dependent receptors: a feature shared by phytopathogenic and aquatic bacteria. PLoS One. 2007, 2: e224-10.1371/journal.pone.0000224.
Dejean G, Blanvillain-Baufume S, Boulanger A, Darrasse A, de Bernonville TD, Girard AL, Carrere S, Jamet S, Zischek C, Lautier M, Sole M, Buttner D, Jacques MA, Lauber E, Arlat M: The xylan utilization system of the plant pathogen Xanthomonas campestris pv campestris controls epiphytic life and reveals common features with oligotrophic bacteria and animal gut symbionts. New Phytol. 2013, 198: 899-915. 10.1111/nph.12187.
Epstein W: The roles and regulation of potassium in bacteria. Prog Nucleic Acid Re. 2003, 75: 293-320.
MacLean AM, MacPherson G, Aneja P, Finan TM: Characterization of the b-ketoadipate pathway in Sinorhizobium meliloti. Appl Environ Microbiol. 2006, 72: 5403-5413. 10.1128/AEM.00580-06.
Bugg TDH, Ahmad M, Hardiman EM, Singh R: The emerging role for bacteria in lignin degradation and bio-product formation. Curr Opin Biotech. 2011, 22: 394-400. 10.1016/j.copbio.2010.10.009.
Cantu D, Vicente AR, Labavitch JM, Bennett AB, Powell ALT: Strangers in the matrix: plant cell walls and pathogen susceptibility. Trends Plant Sci. 2008, 13: 610-617. 10.1016/j.tplants.2008.09.002.
Wang SY, Lin HS: Antioxidant activity in fruits and leaves of blackberry, raspberry, and strawberry varies with cultivar and developmental stage. J Agr Food Chem. 2000, 48: 140-146. 10.1021/jf9908345.
Morrissey JP: Biological activity of defence-related plant secondary metabolites. Plant-Derived Natural Products: Synthesis, Function, and Application. Edited by: Osbourn AE, Lanzotti V. 2009, New York, NY-USA: Springer, 283-299.
White FF, Potnis N, Jones JB, Koebnik R: The type III effectors of Xanthomonas. Mol Plant Pathol. 2009, 10: 749-766. 10.1111/j.1364-3703.2009.00590.x.
Boch J, Bonas U: Xanthomonas AvrBs3 family-type III effectors: discovery and function. Annu Rev Phytopathol. 2010, 48: 419-436. 10.1146/annurev-phyto-080508-081936.
Kim JGUN, Taylor KW, Mudgett MB: Comparative analysis of the XopD type III secretion (T3S) effector family in plant pathogenic bacteria. Mol Plant Pathol. 2011, 12: 715-730. 10.1111/j.1364-3703.2011.00706.x.
Koebnik R, Krüger A, Thieme F, Urban A, Bonas U: Specific binding of the Xanthomonas campestris pv. vesicatoria AraC-type transcriptional activator HrpX to plant-inducible promoter boxes. J Bacteriol. 2006, 188: 7652-7660. 10.1128/JB.00795-06.
Boyer F, Fichant G, Berthod J, Vandenbrouck Y, Attree I: Dissecting the bacterial type VI secretion system by a genome wide in silico analysis: what can be learned from available microbial genomic resources?. BMC Genomics. 2009, 10: 104-10.1186/1471-2164-10-104.
Jani AJ, Cotter PA: Type VI secretion: not just for pathogenesis anymore. Cell Host Microbe. 2010, 8: 2-6. 10.1016/j.chom.2010.06.012.
Schwarz S, Hood RD, Mougous JD: What is type VI secretion doing in all those bugs?. Trends Microbiol. 2010, 18: 531-537. 10.1016/j.tim.2010.09.001.
Zhang W, Xu S, Li J, Shen X, Wang Y, Yuan Z: Modulation of a thermoregulated type VI secretion system by AHL-dependent quorum sensing in Yersinia pseudotuberculosis. Arch Microbiol. 2011, 193: 351-363.
Darrow GM: The strawberry. History, breeding and physiology. 1966, New York, NY-USA: Holt, Rinehart & Winston
Purugganan MD, Fuller DQ: The nature of selection during plant domestication. Nature. 2009, 457: 843-848. 10.1038/nature07895.
Van Melderen L: Toxin-antitoxin systems: why so many, what for?. Curr Opin Microbiol. 2010, 13: 781-785. 10.1016/j.mib.2010.10.006.
Linhartová I, Bumba L, Mašín J, Basler M, Osicka R, Kamanová J, Procházková K, Adkins I, Hejnová-Holubová J, Sadílková L, Morová J, Šebo P: RTX proteins: a highly diverse family secreted by a common mechanism. FEMS Microbiol Rev. 2010, 34: 1076-1112.
Marraffini LA, Sontheimer EJ: CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea. Nat Rev Genet. 2010, 11: 181-190.
Georgiades K, Raoult D: Genomes of the most dangerous epidemic bacteria have a virulence repertoire characterized by fewer genes but more toxin-antitoxin modules. PLoS One. 2011, 6: e17962-10.1371/journal.pone.0017962.
Vandroemme J, Baeyen S, Van Vaerenbergh J, De Vos P, Maes M: Sensitive real-time PCR detection of Xanthomonas fragariae in strawberry plants. Plant Pathol. 2008, 57: 438-444. 10.1111/j.1365-3059.2007.01813.x.
Koike H: The aluminium-cap method for testingsugarcane varieties against leaf scald disease. Phytopathology. 1965, 55: 317-319.
Aziz R, Bartels D, Best A, DeJongh M, Disz T, Edwards R, Formsma K, Gerdes S, Glass E, Kubal M: The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008, 9: 75-10.1186/1471-2164-9-75.
Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R: The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes. Nucleic Acids Res. 2005, 33: 5691-5702. 10.1093/nar/gki866.
Thompson JD, Higgins DG, Gibson TJ: Clustal-W - improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Saitou N, Nei M: The neighbor-joining method - a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4: 406-425.
Pouseele H, Vauterin P, Vauterin L: A resampling strategy for reliable network construction. Mol Phylogenet Evol. 2011, 60: 273-286. 10.1016/j.ympev.2011.04.020.
The authors would like to acknowledge Miss. Cinzia Van Malderghem at ILVO for her technical assistance during the genomic DNA preparation, and Prof. Dr. Tom Coenye of the Laboratory of Pharmaceutical Microbiology at Ghent University for his critical reading of the draft manuscript. This work was made possible through the financial support of the Institute for the Promotion of Innovation by Science and Technology in Flanders (IWT) under project nr. 070597.
The authors declare that they have no competing interests.
MM and PDV conceived the study and participated in its design and coordination. BC and JVD participated in the design of the study. JVD oversaw the genome assembly, automatic genome annotations and sequence depositions, and conducted the genome comparisons. SB conducted in the genome assembly, automatic genome annotations and sequence depositions. BC and JVD drafted the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Table S1: EDGAR output table. Raw output data of the EDGAR genome comparison, listing all CDSs shared among X. fragariae LMG 25863, X. albilineans GPE PC73, X. oryzae pv. oryzae KACC 10331, X. campestris pv. campestris ATCC 33913 and X. euvesicatoria 85–10. (XLSX 143 KB)
Additional file 2: Table S2: TonB-dependent transporters. Occurrence of TonB-dependent transporters among the 26 studied Xanthomonas genomes. The 100 reference protein sequences are listed in the table rows, with the name of the genome they were retrieved from and their Genbank locus tags indicated in the first two columns. The remaining columns show the occurrence of homologs of each reference protein among the 26 genomes. “2”: two homologs present; “1”: one homolog present, “0”: no homolog found, “ΨT”: coding DNA sequence encountered, but protein believed inactive due to truncation;” ΨF”: coding DNA sequence encountered, but protein believed inactive due to frameshift, “Seq”: coding DNA sequence truncated due to incomplete genome-assembly; functional protein assumed present during further processing of data. Underlined entries were not annotated in Genbank, and were retrieved using nucleotide blast queries. (XLSX 21 KB)
Additional file 3: Table S3: Macerating enzymes. Occurrence of macerating enzymes among the 26 studied Xanthomonas genomes. The 46 reference protein sequences are listed in the table rows, with the name of the genome they were retrieved from and their Genbank locus tags indicated in the second and third columns. The remaining columns show the occurrence of homologs of each reference protein among the 26 genomes. “2”: two homologs present; “1”: one homolog present, “0”: no homolog found, “ΨT”: coding DNA sequence encountered, but protein believed inactive due to truncation;” ΨF”: coding DNA sequence encountered, but protein believed inactive due to frameshift, “Seq”: coding DNA sequence truncated due to incomplete genome-assembly; functional protein assumed present during further processing of data. Underlined entries were not annotated in Genbank, and were retrieved using nucleotide blast queries. (XLSX 20 KB)
Additional file 4: Table S4: Type III secretion effectors. Overview of the type III secretion effector repertoire in each of the 26 tested Xanthomonas genomes. “1”: at least one homolog present, “0”: no homolog found, “ΨT”: coding DNA sequence encountered, but protein believed inactive due to truncation;“ ΨF”: coding DNA sequence encountered, but protein believed inactive due to frameshift,“ ΨIS”: coding DNA sequence encountered, but protein believed inactive due to inserted sequence, “Seq”: coding DNA sequence truncated due to incomplete genome-assembly; functional protein assumed present during further processing of data. Underlined entries were not annotated in Genbank, and were retrieved using nucleotide blast queries. (XLSX 15 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Vandroemme, J., Cottyn, B., Baeyen, S. et al. Draft genome sequence of Xanthomonas fragariaereveals reductive evolution and distinct virulence-related gene content. BMC Genomics 14, 829 (2013). https://doi.org/10.1186/1471-2164-14-829
- Draft Genome
- Draft Genome Sequence
- Cluster Regularly Interspaced Short Palindromic Repeat
- Strawberry Plant
- Genome Reduction