- Research article
- Open Access
Genomic variation among closely related Vibrio alginolyticus strains is located on mobile genetic elements
BMC Genomics volume 21, Article number: 354 (2020)
Species of the genus Vibrio, one of the most diverse bacteria genera, have undergone niche adaptation followed by clonal expansion. Niche adaptation and ultimately the formation of ecotypes and speciation in this genus has been suggested to be mainly driven by horizontal gene transfer (HGT) through mobile genetic elements (MGEs). Our knowledge about the diversity and distribution of Vibrio MGEs is heavily biased towards human pathogens and our understanding of the distribution of core genomic signatures and accessory genes encoded on MGEs within specific Vibrio clades is still incomplete. We used nine different strains of the marine bacterium Vibrio alginolyticus isolated from pipefish in the Kiel-Fjord to perform a multiscale-comparative genomic approach that allowed us to investigate  those genomic signatures that characterize a habitat-specific ecotype and  the source of genomic variation within this ecotype.
We found that the nine isolates from the Kiel-Fjord have a closed-pangenome and did not differ based on core-genomic signatures. Unique genomic regions and a unique repertoire of MGEs within the Kiel-Fjord isolates suggest that the acquisition of gene-blocks by HGT played an important role in the evolution of this ecotype. Additionally, we found that ~ 90% of the genomic variation among the nine isolates is encoded on MGEs, which supports ongoing theory that accessory genes are predominately located on MGEs and shared by HGT. Lastly, we could show that these nine isolates share a unique virulence and resistance profile which clearly separates them from all other investigated V. alginolyticus strains and suggests that these are habitat-specific genes, required for a successful colonization of the pipefish, the niche of this ecotype.
We conclude that all nine V. alginolyticus strains from the Kiel-Fjord belong to a unique ecotype, which we named the Kiel-alginolyticus ecotype. The low sequence variation of the core-genome in combination with the presence of MGE encoded relevant traits, as well as the presence of a suitable niche (here the pipefish), suggest, that this ecotype might have evolved from a clonal expansion following HGT driven niche-adaptation.
The genomic variation within a bacterial species is reflected by accessory genes. Together with strain-specific singletons (genes found in only one strain) and the core-genes that are shared by all strains of a species (such as housekeeping genes and genes required for essential cellular functions) the accessory elements form the pangenome . Accessory genes are usually only found in a subset of genomes of a species and are attributed to habitat-specific functions, such as strain-specific virulence factors , antibiotic resistance , or niche-adaptation . As such, accessory genes are often located on mobile genetic elements (MGEs), including plasmids, bacteriophages, transposons, integrative or conjugative elements (ICEs) and genomic islands (GIs) and are shared between species via horizontal gene transfer (HGT) . Even though HGT between strains of the same species is more frequent, some MGEs have a broad host range enabling them to move genes across species boundaries thereby blurring the definition of a species .
Not only the size of pangenomes differs among species but also the relationship between the proportion of core-genes relative to the pangenome . A large pangenome with a proportionally small core-genome and with high rates of HGT is considered open, whereas a small pangenome with a high proportion of core-genes and low HGT rates is considered to be closed . For a wide range of bacteria, the pangenome size is positively correlated with the effective population size (Ne) . Species with a high Ne are likely to occupy different environmental niches where they experience a larger diversity of selection pressures, which will drive selection for increased genome diversity and ultimately lead to larger pangenomes . However, niche adaptation in species with high Ne can also select for distinct ecotypes (i.e. ecologically and evolutionary distinct subpopulations) and ultimately initiate a speciation process with a niche-specific pangenome . It remains debated when ecotypes are sufficiently diverged to be considered different species .
An excellent model organism to study pan-core genomic signatures and niche adaptation resulting from HGT is the genus Vibrio. These bacteria constitute one of the most diverse bacterial genera with a large pangenome comprising more than 100 species from different clades, that occupy different ecological niches [10, 11], where they can exist as pathogenic, opportunistic, mutualistic or free-living forms . The considerable large pangenome (> 26,500 genes) of the genus Vibrio, which is more than 50 times larger than the Vibrio specific core-genome , indicates a large reservoir of genomic diversity for this genus. In contrast, within a specific Vibrio clade, the pangenome to core-genome ratio decreases considerably. For instance, within the cholerae clade, the pangenome is only 4.5 times larger than the core-genome . The relatively small cholerae-pangenome (6923 genes) compared to the entire Vibrio-pangenome, might reflect a specific set of genes, required to occupy the cholera-specific niche.
Niche adaptation and ultimately speciation in this genus has been suggested to be mainly driven by HGT . Furthermore, MGEs, in particular filamentous phages, contribute significantly to the emergence of pathogenic vibrios from environmental populations  and the prophage repertoire of a given Vibrio species can account for a large fraction of the variation among strains [14, 15]. Our knowledge about the diversity and distribution of Vibrio MGEs is heavily biased towards human pathogens, e.g. V. cholerae and V. parahaemolyticus. While recent studies started to expand our knowledge on Vibrio-specific MGEs to other clades, such as V. anguillarum [16, 17], V. fischerii , or V. alginolyticus , our understanding about the distribution of core-genomic signatures and accessory genes encoded on MGEs within specific Vibrio clades is still incomplete. This might be because most of the available genomes are draft genomes comprising multiple contigs, whereas only a few complete genomes of environmental vibrios are available to date . It is challenging to generate high-quality genomes of vibrios due to repetitive genomic regions such as rRNA operons present in the two chromosomes [8,9,10,11,12] and arrays of multiple integrated prophages. Especially multiple prophage arrays cannot be resolved using draft genome assemblies from short-read sequencing techniques. Due to the incomplete information concerning the genome organization stored in draft genomes, it is not possible to study genome dynamics of mobile genetic elements especially of prophages without long-read sequencing data .
The present study aimed to investigate how genome organization, as well as core-genomic signatures and accessory genes, differ within a group of closely related environmental Vibrio isolates. To do so, we performed a multi-scale comparative genomics approach using Vibrio alginolyticus as a model organism. V. alginolyticus, a ubiquitous marine opportunistic pathogen can cause mass mortalities in shellfish, shrimp, and fish, resulting in severe economic losses worldwide [3, 21, 22]. Vibrio pathogenicity is a complex interaction of abiotic and biotic factors, including high temperatures [23, 24] low salinities , host and bacterial genotypes [26, 27]. To date only six closed alginolyticus genomes are available, while 28 remain fragmented with 2 to 75 contigs (last accessed: December 2019). A detailed comparative genomic characterization of the alginolyticus species, including plasmid content, distribution of MGEs and virulence factors of this species is as yet missing. Here we analysed core-and accessory-genomic signatures of V. alginolyticus by comparing nine closely related V. alginolyticus genomes, isolated from the pipefish Syngnathus typhle in the Kiel-Fjord , (later named Kiel-alginolyticus ecotype) with less closely related V. alginolyticus isolates (Table 1).
Results and discussion
Genome features of Vibrio alginolyticus isolated from the Kiel-Fjord
We sequenced the genomes of nine Vibrio alginolyticus strains, previously isolated from the gut or gills of six different pipefish caught in the Kiel-Fjord (Germany, 54°75′57″N; 9°87′66″E) in May 2010 , using a combination of PacBio long- and Illumina short-read technology. The assembly resulted in eight closed genome sequences and one draft genome (strain K09K1), where both chromosomes have been assembled into a single contig due to multiple copies of an integrated filamentous phage. The replicon boundaries could not be resolved experimentally based on PCR thus strain K09K1 has been assigned a “permanent draft” status. All V. alginolyticus genomes contain a ~ 3.47 Mbp chromosome 1 and a ~ 1.88 Mbp chromosome 2, with a %GC content of around 44% (Table 1), which is typical for the genus Vibrio as well as for the species V. alginolyticus [29, 32]. We found extra-chromosomal replicons including plasmids and filamentous phages in seven isolates (Table 1).
Species definition and phylogenetic relationship of the Kiel-Fjord isolates
All nine strains isolated from the Kiel-Fjord share more than 98% average nucleotide identity (ANI) with other alginolyticus strains, suggesting that these strains belong, as previously suggested based on a multi-locus-sequencing-approach (MLSA) , to the species V. alginolyticus. All alginolyticus strains share ~ 92% ANI with two other closely related Vibrio species, i.e. V. diabolicus and V. antiquarius (Fig. 1), which is below the species threshold of ANI = 95% [33, 34], indicating that V. alginolyticus is a distinct species.
The Vibrio alginolyticus pan/ core genome
To compare the gene repertoire of the nine V. alginolyticus strains from the Kiel-Fjord with the global V. alginolyticus gene repertoire, we calculated their pangenome including the amount of core-and accessory genes as well as strain-specific singletons, i.e. genes unique to single strains . The analysis included in total 73,277 protein-sequences encoded on chromosomes and MGEs, including plasmids and extra-chromosomal phage replicons. Overall, we found that the size of the pangenome of V. alginolyticus increased stronger (from 4997 to 8843 gene clusters) with the sequential addition of each new genome when we included all 15 strains from diverse habitats (Fig. 2a). In contrast, we only observed a slight increase in the pangenome (from 5044 to 5679 gene clusters) when we only included the nine V. alginolyticus isolates from the Kiel-Fjord (Fig. 2b). An increase in the pangenome results from new strain-specific genes, found in each newly analyzed isolate. The much larger increase in the pangenome across all available V. alginolyticus isolates relative to the increase in the pangenome of Kiel-Fjord isolates suggests that the genomic diversity of V. alginolyticus within the Kiel-Fjord is very limited. However, the species V. alginolyticus per se has a vast reservoir of genomic diversity.
We extrapolated both, the within-Kiel and the global V. alginolyticus pangenome by fitting a least-squares curve based on Heaps’ Law  and found that the within-Kiel pangenome is closed (α = 1.12), whereas the global V. alginolyticus pangenome is open (α = 0.58). This means, that within the global V. alginolyticus pangenome each newly sequenced V. alginolyticus isolate will reveal a unique set of genes, irrespective of the number of strains included in the present analysis. This open V. alginolyticus pangenome reflects the diversity in habitats in which this species exists. These distinct environments probably require different adaptations and/ or promote high levels of HGT. In contrast, a closed pangenome, as has been detected for the within-Kiel V. alginolyticus pangenome, suggests that the number of genes that will be obtained from any newly sequenced isolate will converge to zero. Here, we assume, that at least the sequenced pipefish associated V. alginolyticus strains within the Kiel-Fjord contain the major part of the gene equipment that is requested to adapt to their habitat. Thus the genomes can be expected to be highly similar, potentially resulting from niche adaptation and strong selection thereof. Strong positive selection of such adaptive genes might have led to a clonal expansion of the Kiel-alginolyticus ecotype. Indeed, we found no sequence divergence based on core-genomic signature among all nine isolates from Kiel (Fig. 2c), indicating a clonal expansion of this ecotype. It remains to be investigated, whether free-living V. alginolyticus strains or isolates from other eukaryotic hosts from the Kiel-Fjord share the same gene-pool or are more divergent from the pipefish-associated strains.
Core-genome and singletons
We observed a stronger decrease in the core-genome when we included all V. alginolyticus isolates as opposed to when we performed the analysis only within the Kiel-alginolyticus ecotype. Comparative analyses between all 15 V. alginolyticus strains and the Kiel-alginolyticus ecotype only, revealed that the core genome (4708 gene clusters) is four times larger than the accessory genome (971 gene clusters) when we only included the nine Kiel strains. However, when we extended the analysis and included all 15 V. alginolyticus, we found that the core genome (3876 gene clusters) becomes smaller than the accessory gene pool (4967 gene clusters). In other words, within the Kiel-Fjord, different V. alginolyticus isolates have a large core-genome (83% of the pangenome) with a limited accessory gene pool (17% of the pangenome). In contrast, despite all being members of the same species, the global accessory gene-pool is highly variable (56% of the pangenome) and the V. alginolyticus core-genome constitutes only 44% of the pangenome.
Habitat specific chromosomal regions
The acquisition of entire gene-blocks (genomic islands, plasmids, prophages) by HGT can rapidly alter the life-style of a bacterium in quantum leaps . This mechanism seems to be particularly important for bacterial adaptation to new ecological niches but also for how bacteria diverge from each other, forming ecotypes and ultimately new species . Genomic islands, which encode specific functions allowing for niche adaptation and maybe even speciation events are common within the genus Vibrio . For the Kiel-alginolyticus ecotype, we could identify 19 chromosomal genomic regions (GRs) of which most are unique to the strains isolated from the Kiel-Fjord (Fig. 3, Table S2). Overall, these 19 GRs encode a total of 487 genes out of which 305 have only been found within the Kiel-alginolyticus ecotype. Out of these 19 GRs five could be assigned to integrated prophages. GR 13 and GR 14 correspond to Vibrio phage VALGΦ2/2b and Vibrio phage VALGΦ6, which are unique for the isolates from the Kiel-Fjord and have not been found elsewhere . GR 3 corresponds to Vibrio phage VALGΦ1, which has so far only been detected in V. alginolyticus ATCC33787, with a relatively low query cover of 57%, thus making it a unique region for Kiel. In contrast GR 5, which corresponds to the filamentous phage Vibrio phage VALGΦ8 on chromosome 1, is absent in most Kiel strains, except K10K4 and K05K4, but present in some non-Kiel strains, such as ATCC17749 and FDAARGOS_114, suggesting that GR 5 is not unique for the Kiel isolates. Similarly, GR 15, which corresponds to a multiphage-cassette consisting of multiple repeats of a combination of the two filamentous phages Vibrio phage VALGΦ6 and Vibrio phage VALGΦ8 , is absent in most Kiel strains, except K04M3 and K04M5 but in parts present in ATCC17749 and thus not unique for the Kiel system. Of the remaining 14 GRs, which do not correspond to integrated phages, we could identify four genomic islands: GR 2, GR 6, GR 7 and GR 8 (> 10 kb, presence of integrase/ transposase, different GC content). According to functional COG categories, these islands contain a variety of proteins, most of them involved in replication, recombination, and repair [L] and transcription [K] (see Table S3 for functional annotation of all GRs). The high prevalence of those maintenance genes is however not surprising and might result from an identification bias, as they are usually better annotated for MGEs than accessory genes, which would provide a selective advantage to the host. As such, many proteins were classified as phage integrases, transposases or other proteins, involved with viral integration into host DNA or DNA repair, suggesting that HGT might have played an important role during the acquisition of these GIs. GR 6 encodes the multi-drug transporter subunit MdtN and two other loci which were assigned to COG category [V] and predicted to be involved in defense mechanisms. This suggests that GR 6 might play a role in resistance. None of the other genomic islands could be identified as pathogenicity−/ metabolic- or resistance island. All other GRs were either smaller than 10 kb or did not contain integrases or transposases and are thus referred to as genomic regions instead of genomic island. Out of these GRs, GR 16 encoded three proteins associated with the type VI secretion system and GR 18 encoded a beta-lactamase protein suggesting an adaptive role in virulence and resistance, respectively. The acquisition of these unique GRs might have allowed the Kiel-alginolyticus ecotype to invade a new niche, which was then followed by clonal expansion of this ecotype. Clonal expansion has been observed for several pathogens, in particular within the genus Vibrio. The best characterized example of clonal expansion upon acquisition of virulence genes is V. cholerae. But also other Vibrio pathogens, for instance V. parahaemolyticus have been shown to experience similar evolutionary dynamics (for a review see ).
Genomic differences within Kiel-alginolyticus ecotype
To investigate genomic differences between the nine strains from the Kiel-Fjord, we focused on all those gene-clusters from the Roary analysis, which could only be found within the Kiel-alginolyticus ecotype but were absent in all non-Kiel isolates. We found that all Kiel-specific core-genes (n = 412) were located exclusively on one of the two chromosomes (Fig. 4). In contrast, the majority of the Kiel-specific accessory gene clusters (89%) were encoded on mobile genetic elements, in particular plasmids. These results support ongoing theory that accessory genes are predominately located on MGEs and shared by horizontal gene transfer (HGT) . We found 490 gene clusters with no orthologous in any other Kiel strain, i.e. singletons out of which 40% were located on MGEs, in particular, extrachromosomal replicating elements (170 on plasmids and 27 on extrachromosomal phages) and 60% (n = 293) were chromosomal (Fig. 4). All Kiel-specific alginolyticus gene clusters were further assigned to putative functional categories using the Clusters of Orthologous Groups of Proteins (COG) database  (Table S4). Even though a large fraction of the gene clusters could either not be assigned to a COG or was poorly characterized, we found differences in the relative distribution of super-functional COGs between core- and accessory genomes and singletons: The majority of the singletons (37%) was predicted to be dedicated to cellular processes/ signaling, while relatively small proportions of gene clusters (10 and 16%) belong to information storage/ processing and metabolism. In contrast, the largest proportion of gene-clusters encoded on core-genes was predicted to belong to information storage/ processing (24%), and only 16 and 13% of gene-clusters encoded on core genes belong to cellular processes/ signaling and metabolism. Among the gene-clusters encoded on the accessory genome 22% could be assigned to information storage/ processing as well as to cellular processes/ signaling and only 6% to the metabolism (Fig. 4). Within the accessory genome most of the genes are involved in replication, recombination, and repair (COG [L], Table S4). These include mainly proteins involved in HGT, such as transposases, integrases, transferases, recombinases as well as proteins involved in immunity, such as CRISPR associated helicase Cas3 and restriction modification methylases. This extensive representation of proteins involved in HGT on the accessory genome suggests that HGT was potentially one of the driving evolutionary mechanisms underlying the diversification of the nine V. alginolyticus strains from the Kiel-Fjord.
The majority of the accessory gene-pool within the Kiel-alginolyticus ecotype is located on plasmids (Fig. 4). We could identify four different plasmids from all nine Kiel V. alginolyticus isolates. Three plasmid types isolated from the Kiel strains were unique for specific strains with a size of 0.9 to 2.9 kbp (Table 1). The fourth plasmid type was characterized as a mega-plasmid (Fig. 5a), which ranged between 280 and 300 kbp in size, and was present in six out of the nine strains (Fig. 5b). The six mega-plasmids share 296 core-genes, encode 129 accessory genes and between 5 and 26 singletons per plasmid (Fig. 5b). Together with the three small plasmids, plasmid-encoded singletons make up 34.5% of all 486 Kiel-specific singletons. The majority of the plasmid-specific singletons comprise hypothetical proteins. The remaining singletons include AAA proteins (K04M1_pL280), PFAM phosphoadenosine phosphosulfate reductase and ATPases (K04M5_pL294), spore germination and type IV secretion system (K5K4_pL289), endonuclease and site-specific methyl-transferase, potentially forming a restriction-modification system (K06K5_pL291), and DNA polymerase (K08M3_pL300). V. alginolyticus ATCC 33787 contains as well three plasmids including plasmid pMBL287, which is similar in size as the Kiel mega plasmids . However, a comparison of ATCC 33787 plasmids revealed no sequence similarity to any of the plasmids from the Kiel strains.
Only 6.5% of the 870 Kiel-specific accessory genes and singletons are located on extrachromosomal phages. The within-Kiel variation caused by these phages can be explained by the absence of extrachromosomal replicons of Vibrio phage VALGΦ8 in all but two strains (K04M1 and K05K4). However, four strains (K04M3, K04M5, K05K4, and K10K4) had an intra-chromosomal version of this phage, while strains K01M1, K06K5 and K08M3 did neither contain an intra- nor an extrachromosomal version of Vibrio phage VALGΦ8 (for a detailed analysis of Kiel vibriophages see ). Parts of Vibrio phage VALGΦ8 could be identified on both chromosomes of ATCC 17749T, and on chromosome 1 of FDAARGOS_114 but not on any other non-Kiel Vibrio. Genome analyses of Vibrio phage VALGΦ8 revealed no significant virulence-associated genes nor any other genes that could be associated with a habitat-specific adaptation .
Virulence and resistance of the Kiel-Fjord V. alginolyticus isolates
We found an identical virulence and resistance profile among the nine V. alginolyticus isolates from the Kiel-Fjord. The Kiel-Fjord isolates encode a total number of 17 homologues resistance genes out of which the majority (n = 13) is located on chromosome 1 and four resistance genes are encoded on chromosome 2 (Fig. 6, Table S5). In comparison with other V. alginolyticus strains (~ 25–130 resistance genes on Chromosome 1 and 11–12 resistance genes on Chromosome 2), isolates from the Kiel-Fjord have significantly less resistance genes (t-test: t8.85 = 3.51, p = 0.007; Fig. 6) and are missing in particular genes conferring resistance to tetracycline, aminoglycosides, and quinolones.
In contrast to non-pathogenic strains, such as V. fischerii, which is a mutualist containing 112 virulence genes, the Kiel-Fjord isolates encode more virulence genes (150 virulence genes, exception strain K05K4, 149 virulence genes). This number is lower than what has been found for human pathogenic vibrios, such as V. cholerae (~ 165 virulence genes) and V. parahaemolyticus (162 virulence genes), but similar to what has been found in other V. alginolyticus strains, except (strain FDAARGOS_114: 165 virulence genes). Unique for the Kiel-Fjord isolates is the absence of genes involved in Vibriobactin biosynthesis, which are present in almost all other V. alginolyticus strains. Similarly, In contrast to other V. alginolyticus isolates, the strains from the Kiel-Fjord miss the gene flaC, which is involved in the regulation of stringent-response-induced toxin production  and vscP, a gene involved in the type III secretion system. In contrast to other V. alginolyticus strains, the Kiel-alginolyticus ecotype encodes the type IV secretion system effectors, the phytotoxin coronatine and the thermolabile hemolysin (tlh), both toxins, which could not be found in any other V. alginolyticus strain. This unique virulence profile of the Kiel-Fjord isolates separates them from other Vibrio species including other V. alginolyticus isolates abut also another strain from the Kiel-Fjord: V. typhli, K08M4 (Chibani C, Hertel R, Goehlich H, Mertens V, Bunk B, Overmann J, et al. Complete Genome Sequence of Vibrio typhli K08M4, a fish pathogen isolated from the Kiel Fjord. in preparation) (Fig. 6). This clear separation was further supported by a hierarchical cluster analysis (Fig. 6), which indicates, that only based on the presence/absence of virulence genes the Kiel-alginolyticus ecotype can be distinguished from all other V. alginolyticus strains, sequenced to date. The unique resistance and virulence profile within the Kiel-Fjord V. alginolyticus isolates might be a further indication for niche-adaptation followed by clonal expansion of this ecotype.
By using a multi-scale comparative genomic approach, we conclude that the nine V. alginolyticus strains isolated from pipefish in the Kiel-Fjord belong to a unique ecotype, which evolved from a clonal expansion following niche adaptation. We came to this conclusion based on the following observations. First, the nine strains from the Kiel-Fjord have a closed-pangenome reflecting low genomic diversity within its niche. Second, even though these strains have been isolated from different eukaryotic host species, they did not differ based on core-genomic signatures which has been previously suggested based on MLSA . Third, niche adaptation and ultimately speciation in the genus Vibrio have been suggested to be mainly driven by HGT . Likewise, unique GRs and a unique repertoire of MGEs within the Kiel-Fjord isolates (including unique plasmids and prophages) support ongoing theory that the acquisition of gene-blocks by HGT, one of the most important mechanisms driving the evolution of niche adaptation and clonal expansion also played an important role in the evolution of the Kiel-alginolyticus ecotype. Lastly, the unique virulence and resistance profile which clearly separates the Kiel-Fjord isolates from all other investigated V. alginolyticus strains, suggests, that these are essential genes, required for a successful colonization of the pipefish, the niche of the Kiel-alginolyticus ecotype. To ultimately verify this theory, colonization experiments with knock-out mutants lacking the Kiel-specific virulence genes, or alternatively non virulent Vibrio strains equipped with the Kiel-specific virulence genes, would be required.
We are aware that our conclusion is limited by the low number of non-Kiel V. alginolyticus strains available included in the comparative genomic analyses. We could have included draft genomes, to increase the number of sequences available for comparison. However, due to the incomplete information especially for MGEs stored in draft genomes, the number of core-genomes drops considerably by approximately 50% compared to when only complete genomes are included in the analysis . In favor of a more accurate analysis, we thus decided to only include complete genomes.
The present study extends our current knowledge of pan-and core-genomic signatures in the genus Vibrio which is currently heavily biased towards human pathogens, such as V. cholerae and V. parahaemolyticus. Based on the present results we suggest that also non-human pathogens might experience similar evolutionary dynamics, once relevant pathogenic traits (in the present case: Vibrio phage VALGΦ6 , plus three unique virulence genes (Fig. 6)) have been acquired. The availability of a suitable niche, in the present case, presumably the pipefish, might have led to niche adaptation and clonal expansion of the Kiel-alginolyticus ecotype, clearly separating it from other V. alginolyticus species.
Bacterial genome data
Using a combination of PacBio and Illumina sequencing, we generated eight closed genomes and one permanent draft genome of nine Vibrio strains isolated from different pipefish in the Kiel-Fjord  and previously characterized as V. alginolyticus based on multi-locus-sequence typing . For sequencing and annotation information please see . We additionally sequenced prophages induced with mitomycin C from each strain, for details see . We compared all replicons from the nine V. alginolyticus strains to 10 closed Vibrio genome sequences downloaded from the NCBI nucleotide database; date of accession 1.12.2019 (Table S1).
Comparative genome analysis
Average nucleotide identity
The average similarity between the nine V. alginolyticus genomes from the present study, six other complete V. alginolyticus genomes as well as four other closely related vibrios (i.e. three V. diabolicus and one V. antiquarius) was measured as the average nucleotide identity (ANI) of all 19 Vibrio replicons (concatenated Chromosome 1 and Chromosome 2) using the ANIm mode from the PyANI python module  based on MUMmer (https://github.com/widdowquinn/pyani). Briefly, nucleotide sequences were extracted from each GenBank file using Biopython (https://biopython.org/) and subsequently used as input for PyANI for genome sequence alignment.
Pan-Core genome prediction
To determine the total gene-pool of V. alginolyticus within the Kiel-Fjord as well as the global gene repertoire of V. alginolyticus, we performed two pan-core genome analyses. In the first analysis, we only included the nine strains from the Kiel-Fjord. In the second analysis, we included all fully closed V. alginolyticus genomes available from NCBI (accession date 10.10. 2019) which were isolated from various regions (Table 1), in addition to the nine Kiel strains. Genbank files of all closed V. alginolyticus isolates were converted to GFF3 files and a nucleotide alignment was generated using Roary 3.8.0  with standard settings (minimum BLASTP identity of 90% with paralogue splitting). We used concatenated files for each isolate including both chromosomes and if existing, mobile genetic elements (plasmids and episomal phages). To estimate whether we have an open or a closed pangenome we used Heaps’ Law: η = κ ∗ N−α , implemented in the R package ‘micropan’ , where η is the expected number of genes for a given number of Genomes (N), and κ and α are constants to fit the specific curve. The exponent α is an indicator whether the pangenome is open (α < 1) or closed (α > 1). Lastly, a functional characterization of all protein-coding sequences was performed using the eggnog-mapper v2 database v5.0 [45, 46]. We subsequently determined the COG category (Cluster of Orthologous Groups of Proteins ) for Kiel-specific core- and accessory genes as well as singletons identified by the pan-core analysis based on the Roary clustering input table.
To further infer the phylogenetic relationship among all V. alginolyticus isolates we generated a phylogenetic tree based on the core-genome alignment for all 15 strains obtained from the Roary analysis (length for all strains ~ 3.84 Mbp, which corresponds to approximately 72% of the average size of the input genomes). To do so, we used the Bayesian Markov chain Monte Carlo (MCMC) method as implemented in MrBayes version 3.2.5 [47, 48]. The TN93  model plus invariant sites (TN93 + I), as suggested by the Akaike information criterion (AIC) given by jModelTest , was used as a statistical model for nucleotide substitution. The MCMC process was repeated for 106 generations and sampled every 5000 generations. The first 2000 trees were deleted as burn-in processes and the consensus tree was constructed from the remaining trees. Convergence was assured via the standard deviation of split frequencies (< 0.01) and the potential scale reduction factor (PSRF~ 1). The resulting phylogenetic tree and associated posterior probabilities were illustrated using FigTree version 1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/).
We used IslandViewer  to predict putative genomic islands of all nine isolates from the Kiel-Fjord. The selection criterion for genomic islands was based on the presence of mobile-related genes (integrases or transposases), a size > 8 kb, and a different GC content compared to the rest of the genome. To visualize the genomic islands, we used BLAST Ring Image Generator [52, 53] and created a circular map per chromosome of all fully closed V. alginolyticus genomes. Functional characterization of all protein-coding sequences found on the putative genomic islands was performed as described above using the eggNOG-mapper v2 database v5.0 [45, 46] where the resulting COG assignments were used for further analysis.
We used Easyfig  for pairwise plasmid sequence comparisons and synteny comparisons with an E-value cut-off of 1e− 10. Plasmid maps were generated using SnapGene Viewer (version 4.3.10) with annotated genbank files as input files. The output resulting from Roary was used to determine orthologous genes, while the output of eggNOG-mapper [45, 46] was used to assign a functional characterization to the coding protein sequences of the six mega-plasmids detected in the Kiel-specific V. alginolyticus isolates.
Virulence and resistance
The virulence of the nine Kiel-isolates was determined using controlled infection experiments in a previous study , and could in part be explained by the presence and the coverage of a virulence-gene carrying filamentous phage Vibrio phage VALGΦ6. To determine other potential chromosomal encoded virulence factors, we scanned all 15 V. alginolyticus strains and seven other strains from different Vibrio clades using the Virulence Factor Database (https://www.mgc.ac.cn/VFs/main.htm) . We used IslandViewer  to identify homologues of resistance genes in all 15 V. alginolyticus strains.
All statistics and visualization graphs were performed using the ggplot2 library  in R 3.1.2 unless otherwise stated.
Availability of data and materials
The genomic sequences of the nine Kiel V. alginolyticus strains analyzed during the current study are available at GenBank (CP017889-CP017919).
Average nucleotide identity
Clusters of Orthologous Groups of Proteins
Mobile genetic elements
Horizontal gene transfer
Tettelin H, Riley D, Cattuto C, Medini D. Comparative genomics: the bacterial pan-genome. Curr Opin Microbiol. 2008;11(5):472–7.
Rasko DA, Rosovitz MJ, Myers GSA, Mongodin EF, Fricke WF, Gajer P, et al. The pangenome structure of Escherichia coli: comparative genomic analysis of E-coli commensal and pathogenic isolates. J Bacteriol. 2008;190(20):6881–93.
Aagesen AM, Phuvasate S, Su YC, Hase CC. Persistence of Vibrio parahaemolyticus in the Pacific oyster, Crassostrea gigas, is a multifactorial process involving Pili and flagella but not type III secretion systems or phase variation. Appl Environ Microbiol. 2013;79(10):3303–5.
Austin B, Austin DA, Blanch AR, Cerda M, Grimont F, Grimont PAD, et al. A comparison of methods for the typing of fish-pathogenic Vibrio spp. Syst Appl Microbiol. 1997;20(1):89–101.
Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R. The microbial pan-genome. Curr Opin Genet Dev. 2005;15(6):589–94.
Hazen TH, Pan L, Gu JD, Sobecky PA. The contribution of mobile genetic elements to the evolution and ecology of Vibrios. FEMS Microbiol Ecol. 2010;74(3):485–99.
McInerney JO, McNally A, O'Connell MJ. Why prokaryotes have pangenomes. Nat Microbiol. 2017;2(4):17040. https://doi.org/10.1038/nmicrobiol.2017.40.
Brockhurst MA, Harrison E, Hall JPJ, Richards T, McNally A, MacLean C. The ecology and evolution of Pangenomes. Curr Biol. 2019;29(20):R1094–R103.
Cohan FM. Transmission in the Origins of Bacterial Diversity, From Ecotypes to Phyla. Microbiol Spectr. 2017;5(5). https://doi.org/10.1128/microbiolspec.MTBP-0014-2016.
Jones JL. Chapter 11 - Vibrio. In: Christine E.R. Dodd, Tim Aldsworth, Richard A. Stein, Dean O. Cliver, Hans P. Riemann, editors. Foddborne Dieseases. 3rd ed: Academic Press; 2017. p. 535–46. ISBN 9780123850072. https://doi.org/10.1016/B978-0-12-385007-2.18001-5.
Thompson CC, Vicente ACP, Souza RC, Vasconcelos ATR, Vesth T, Alves N, et al. Genomic taxonomy of vibrios. BMC Evol Biol. 2009;9:258. Published 2009 Oct 27. https://doi.org/10.1186/1471-2148-9-258.
Thompson FL, Iida T, Swings J. Biodiversity of Vibrios. Microbiol Mol Biol R. 2004;68(3):403–31.
Hunt DE, David LA, Gevers D, Preheim SP, Alm EJ, Polz MF. Resource partitioning and sympatric differentiation among closely related bacterioplankton. Science. 2008;320(5879):1081–5.
Banks DJ, Beres SB, Musser JM. The fundamental contribution of phages to GAS evolution, genome diversification and strain emergence. Trends Microbiol. 2002;10(11):515–21.
Ohnishi M, Kurokawa K, Hayashi T. Diversification of Escherichia coli genomes: are bacteriophages the major contributors? Trends Microbiol. 2001;9(10):481–5.
Castillo D, Kauffman K, Hussain F, Kalatzis P, Rorbo N, Polz MF, et al. Widespread distribution of prophage-encoded virulence factors in marine Vibrio communities. Sci Rep. 2018;8(1):9973. Published 2018 Jul 2. https://doi.org/10.1038/s41598-018-28326-9.
Di Lorenzo M, Stork M, Tolmasky ME, Actis LA, Farrell D, Welch TJ, et al. Complete sequence of virulence plasmid pJM1 from the marine fish pathogen Vibrio anguillarum strain 775. J Bacteriol. 2003;185(19):5822–30.
Dunn AK, Martin MO, Stabb EV. Characterization of pES213, a small mobilizable plasmid from Vibrio fischeri. Plasmid. 2005;54(2):114–34.
Chibani CM, Hertel R, Hoppert M, Liesegang H, Wendling CC. Closely related Vibrio alginolyticus strains encode an identical repertoire of prophages and filamentous phages. bioRxiv. 2019:859181.
Lukjancenko O, Ussery DW. Vibrio chromosome-specific families. Front Microbiol. 2014;5:73. https://doi.org/10.3389/fmicb.2014.00073.
Lee KK, Yu SR, Yang TI, Liu PC, Chen FR. Isolation and characterization of Vibrio alginolyticus isolated from diseased kuruma prawn, Penaeus japonicus. Lett Appl Microbiol. 1996;22(2):111–4.
Gonzalez-Escalona N, Blackstone GM, DePaola A. Characterization of a Vibrio alginolyticus strain, isolated from Alaskan oysters, carrying a hemolysin gene similar to the thermostable direct hemolysin-related hemolysin gene (trh) of Vibrio parahaemolyticus. Appl Environ Microbiol. 2006;72(12):7925–9.
Wendling CC, Piecyk A, Refardt D, Chibani C, Hertel R, Liesegang H, et al. Tripartite species interaction: eukaryotic hosts suffer more from phage susceptible than from phage resistant bacteria. BMC Evol Biol. 2017;17:98.
Wendling CC, Wegner KM. Relative contribution of reproductive investment, thermal stress and Vibrio infection to summer mortality phenomena in Pacific oysters. Aquaculture. 2013;412–3:88–96.
Poirier M, Listmann L, Roth O. Selection by higher-order effects of salinity and bacteria on early life-stages of Western Baltic spring-spawning herring. Evol Appl. 2017;10(6):603–15.
Le Roux F, Wegner K, Austin C, Vezzulli L, Osorio C, Amaro C, et al. The Emergence of Vibrio pathogens in Europe: Ecology, Evolution and Pathogenesis (Paris, 11–12 March 2015). Front Microbiol. 2015;6:830.
Wendling CC, Fabritzek AG, Wegner KM. Population-specific genotype x genotype x environment interactions in bacterial disease of early life stages of Pacific oyster larvae. Evol Appl. 2017;10(4):338–47. Published 2017 Mar 9. https://doi.org/10.1111/eva.12452.
Roth O, Keller I, Landis SH, Salzburger W, Reusch TB. Hosts are ahead in a marine host-parasite coevolutionary arms race: innate immune system adaptation in pipefish Syngnathus typhle against Vibrio phylotypes. Evolution. 2012;66(8):2528–39.
Wang P, Wen Z, Li B, Zeng Z, Wang X. Complete genome sequence of Vibrio alginolyticus ATCC 33787(T) isolated from seawater with three native megaplasmids. Mar Genomics. 2016;28:45–7.
Deng Y, Chen C, Zhao Z, Huang X, Yang Y, Ding X. Complete Genome Sequence of Vibrio alginolyticus ZJ-T. Genome Announc. 2016;4(5):e00912–16. https://doi.org/10.1128/genomeA.00912-16.
Liu X-F, Cao Y, Zhang H-L, Chen Y-J, Hu C-J. Complete genome sequence of Vibrio alginolyticus ATCC 17749T. Genome Announc. 2015;3(1):e01500–14. https://doi.org/10.1128/genomeA.01500-14.
Okada K, Iida T, Kita-Tsukamoto K, Honda T. Vibrios commonly possess two chromosomes. J Bacteriol. 2005;187(2):752–7.
Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007;57(Pt 1):81–91.
Yoon SH, Ha SM, Lim J, Kwon S, Chun J. A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie Van Leeuwenhoek. 2017;110(10):1281–6.
Land M, Hauser L, Jun SR, Nookaew I, Leuze MR, Ahn TH, et al. Insights from 20 years of bacterial genome sequencing. Funct Integr Genomics. 2015;15(2):141–61.
Groisman EA, Ochman H. Pathogenicity islands: bacterial evolution in quantum leaps. Cell. 1996;87(5):791–4.
Schmidt H, Hensel M. Pathogenicity islands in bacterial pathogenesis. Clin Microbiol Rev. 2004;17(1):14 −+.
Vesth T, Wassenaar TM, Hallin PF, Snipen L, Lagesen K, Ussery DW. On the origins of a Vibrio species. Microb Ecol. 2010;59(1):1–13.
Shapiro BJ. How clonal are bacteria over time? Curr Opin Microbiol. 2016;31:116–23.
Galperin MY, Makarova KS, Wolf YI, Koonin EV. Expanded microbial genome coverage and improved protein family annotation in the COG database. Nucleic Acids Res. 2015;43(D1):D261–D9.
Kim HY, Yu SM, Jeong SC, Yoon SS, Oh YT. Effects of flaC mutation on stringent response-mediated bacterial growth, toxin production, and motility in Vibrio cholerae. J Microbiol Biotechnol. 2018;28(5):816–20.
Pritchard L, Glover RH, Humphris S, Elphinstone JG, Toth IK. Genomics and taxonomy in diagnostics for food security: soft-rotting enterobacterial plant pathogens. Anal Methods-Uk. 2016;8(1):12–24.
Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MT, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31(22):3691–3.
Snipen L, Liland KH. micropan: an R-package for microbial pan-genomics. Bmc Bioinformatics. 2015;16:79.
Huerta-Cepas J, Szklarczyk D, Forslund K, Cook H, Heller D, Walter MC, et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 2016;44(D1):D286–D93.
Huerta-Cepas J, Szklarczyk D, Heller D, Hernandez-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47(D1):D309–D14.
Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Hohna S, et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61(3):539–42.
Huelsenbeck JP, Ronquist F. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics. 2001;17(8):754–5.
Tamura K, Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial-DNA in humans and chimpanzees. Mol Biol Evol. 1993;10(3):512–26.
Posada D, Buckley TR. Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests. Syst Biol. 2004;53(5):793–808.
Dhillon BK, Chiu TA, Laird MR, Langille MG, Brinkman FS. IslandViewer update: Improved genomic island discovery and visualization. Nucleic Acids Res. 2013;41(Web Server issue):W129–32.
Méhats C, Tanguy G, Paris B, Robert B, Pernin N, Ferré F, et al. Pregnancy induces a modulation of the cAMP phosphodiesterase 4-conformers ratio in human myometrium: consequences for the utero-relaxant effect of PDE4-selective inhibitors. J Pharmacol Exp Ther. 2000;292(2):817–23.
Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA. BLAST ring image generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011;12:402.
Sullivan MJ, Petty NK, Beatson SA. Easyfig: a genome comparison visualizer. Bioinformatics. 2011;27(7):1009–10.
Chen L, Yang J, Yu J, Yao Z, Sun L, Shen Y, et al. VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res. 2005;33(Database issue):D325–8.
Wickham H. ggplot2. Wires Comput Stat. 2011;3(2):180–5.
We are grateful to Anja Poehlein for her support in sequencing the isolates.
This project was funded by three consecutive DFG grants [WE 5822/ 1–1, WE5822/1–2, and RO RO462/4–2] within the priority programme SPP1819 given to given to CCW and OR, a grant from the Cluster of Excellence “The Future Ocean”, given to CCW, and a KAAD stipend given to CCM. The funding agency played no part in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 2: Table S2. Genomic regions (GRs) unique for the nine V. alginolyticus strains from the Kiel-Fjord. For chromosome 1 (left) we used K10K4 as a reference, as this strain has more integrated prophages than the other Kiel isolates. For the same reason, we used strain K04M3 as a reference for chromosome 2. For a detailed comparison of GRs among all nine strains, see Fig. 3 main manuscript.
Additional file 3: Table S3. Functional characterization of all protein-coding sequences found on the Genomic Regions (using the eggNOG-mapper v2 software) with the resulting COG assignments. Similar to Table S2 we used strain K10K4 as reference for chromosome 1 and strain K04M3 as reference for chromosome 2. The column Kiel Specific indicates whether the GR is unique for the Kiel alginolyticus ecotype.
About this article
Cite this article
Chibani, C.M., Roth, O., Liesegang, H. et al. Genomic variation among closely related Vibrio alginolyticus strains is located on mobile genetic elements. BMC Genomics 21, 354 (2020). https://doi.org/10.1186/s12864-020-6735-5
- Mobile genetic elements
- Horizontal gene transfer
- Genomic islands