- Research
- Open access
- Published:
Pangenome and genomic taxonomy analyses of Leuconostoc gelidum and Leuconostoc gasicomitatum
BMC Genomics volume 23, Article number: 818 (2022)
Abstract
Background
Leuconostoc gelidum and Leuconostoc gasicomitatum have dual roles in foods. They may spoil cold-stored packaged foods but can also be beneficial in kimchi fermentation. The impact in food science as well as the limited number of publicly available genomes prompted us to create pangenomes and perform genomic taxonomy analyses starting from de novo sequencing of the genomes of 37 L. gelidum/L. gasicomitatum strains from our culture collection. Our aim was also to evaluate the recently proposed change in taxonomy as well as to study the genomes of strains with different lifestyles in foods.
Methods
We selected as diverse a set of strains as possible in terms of sources, previous genotyping results and geographical distribution, and included also 10 publicly available genomes in our analyses. We studied genomic taxonomy using pairwise average nucleotide identity (ANI) and calculation of digital DNA-DNA hybridisation (dDDH) scores. Phylogeny analyses were done using the core gene set of 1141 single-copy genes and a set of housekeeping genes commonly used for lactic acid bacteria. In addition, the pangenome and core genome sizes as well as some properties, such as acquired antimicrobial resistance (AMR), important due to the growth in foods, were analysed.
Results
Genome relatedness indices and phylogenetic analyses supported the recently suggested classification that restores the taxonomic position of L. gelidum subsp. gasicomitatum back to the species level as L. gasicomitatum. Genome properties, such as size and coding potential, revealed limited intraspecies variation and showed no attribution to the source of isolation. The distribution of the unique genes between species and subspecies was not associated with the previously documented lifestyle in foods. None of the strains carried any acquired AMR genes or genes associated with any known form of virulence.
Conclusion
Genome-wide examination of strains confirms that the proposition to restore the taxonomic position of L. gasicomitatum is justified. It further confirms that the distribution and lifestyle of L. gelidum and L. gasicomitatum in foods have not been driven by the evolution of functional and phylogenetic diversification detectable at the genome level.
Background
Leuconostoc gelidum is a psychrotrophic lactic acid bacterium (LAB) detected by Shaw and Harding [1] in cold-stored, vacuum-packaged beef. In 2014, Leuconostoc gasicomitatum [2] was reclassified as Leuconostoc gelidum subsp. gasicomitatum [3] and another subspecies, Leuconostoc gelidum subsp. aenigmaticum was described. Reclassification of L. gasicomitatum was mainly based on the results from the phylogenetic analyses of atpA, pheS, and rpoA housekeeping genes. The sequence analyses of concatenated atpA, pheS, and rpoA genes had demonstrated that the novel strains, designated as subsp. aenigmaticum, as well as the type and reference strains of L. gelidum and L. gasicomitatum, were phylogenetically closely related. Until recently, L. gelidum was thus considered to comprise three subspecies i.e., subsp. gelidum, gasicomitatum and aenigmaticum, but based on the ANI and dDDH values of L. gelidum type strains, Wu and Gu [4] proposed to reject the proposal of Rahkila et al. [3], and to restore the taxonomic position of L. gelidum subsp. gasicomitatum back to the species level as L. gasicomitatum.
L. gelidum and L. gasicomitatum have commonly been associated with the spoilage of perishable food items, such as meat and poultry or minimally processed vegetables [5,6,7]. Metagenomic analyses have detected these species mainly in cold-stored meat and vegetables [8], but rarely in samples from animal microbiomes or environmental sources. Apart from food spoilage, L. gelidum and L. gasicomitatum have often been associated with the fermentation of kimchi [9, 10], a Korean traditional side dish made most commonly of Chinese cabbage and Korean radishes with a variety of seasonings. Kimchi is often fermented at low temperature (2–6 °C) to restrict the growth of spoilage bacteria, but psychrotrophic LAB, L. gelidum and L. gasicomitatum can grow at temperatures below 6 °C [1, 3, 6]. Therefore, they may become abundant in kimchi microbiomes [9,10,11]. Thus, the growth of L. gelidum in refrigerated foods may either be unwanted and lead to spoilage or be endorsed to ensure proper ripening and preservation according to the expectations of kimchi fermentation.
The significance of L. gelidum and L. gasicomitatum in food, prompted our interest to further studying these species at the genome level. Despite the importance of these species in the food system, relatively little is known about them at the genome level, and at the time of this study, only few genomes were available in the public databases. Until now, most studies have applied analysis of molecular fingerprints [6, 7] or multi-locus sequence typing [12] to examine these species. In addition, the recent taxonomic proposal warranted further genotaxonomic studies performed with a larger set of strains than just the type strains that were used by Wu and Gu [4]. Since we have worked with these species during the course of two decades, our strain collection allowed us to select a diverse set of well-characterised strains in terms of sources, previously conducted genotyping results and geographical distribution (Supplementary Table S1) for the present study.
Results
Genome features reveal little intraspecies variation
The 37 L. gelidum/L. gasicomitatum strains (our L. gelium subsp. gasicomitatum strains were renominated as L. gasicomitatum in this paper according to the proposal of Wu and Gu [4]) had an average genome size of 1.95 Mbp (range 1.82–2.12 Mbp) with GC content ranging from 36.4 to 36.8% (Table 1, Supplementary Table S1). The number of contigs ranged from 6 to 49 (av. 24 ± 12; Table 1) with 28 of 37 (75%) of the genomes that were assembled to a maximum of 32 contigs. The N50 values ranged from 102 kbp to 1.45 Mbp (Table 1), with an average of 354 kbp (Supplementary Table S2). On average, the genomes encoded 1,842 predicted proteins (range 1,704–2,034). The number of pseudogenes was estimated to range from 33 to 67 (43 ± 7.6). Furthermore, the number of transfer RNA (tRNA) genes varied from 40 to 47 with an average of 44 tRNA genes per assembled genome. (Supplementary Table S2).
Genomic relatedness indices suggest species status for L. gelidum subsp. gasicomitatum
Pairwise average nucleotide identity (ANI) values were used to assess inter- and intra-subspecies relatedness. The values for L. gelidum subsp. aenigmaticum assemblies against L. gelidum subsp. gelidum assemblies (Table 2;Supplementary Table S4) were above 95%, the cut-off value for species delineation [13], which supports their classification to the same species. In contrast, L. gasicomitatum and L. gelidum subsp. gelidum DSM 5578 T type assembly or L. gelidum subsp. aenigmaticum DSM 19375 T type assembly presented ANI ≤ 94.8% (Table 2; Supplementary Table S4), and were, thus, lower than the proposed species cut-off values for ANI. Similarly, the average values for L. gasicomitatum against to the assemblies of subsp. gelidum and subsp. aenigmaticum were below the cut-off of 95% (Table 2); although 5% (21 of 416) of the pairwise comparisons were between 95.0–95.2% (Supplementary Table S4). The ANI values obtained for all the intra-subspecies comparisons between L. gelidum assemblies were high, i.e. ≥ 98% (Table 2, Supplementary Table S4).
In addition to ANI, we assessed the genome relatedness by calculating the pairwise digital DNA-DNA hybridisation (dDDH) scores for the 47 genomes (Table 2, Supplementary Table S4). For dDDH, similarity values ≥ 70% are considered an argument to classify the strains to the same species [14]. Similarly, with ANI metrics, L. gasicomitatum assemblies showed relatively low dDDH values against L. gelidum subsp. gelidum (57.7 ± 1.7%) and L. gelidum subsp. aenigmaticum (59.2 ± 0.5%) assemblies. The inter-subspecies values obtained for L. gelidum subsp. aenigmaticum against L. gelidum subsp. gelidum genomes varied more. They ranged from 67.1 to 87.4 (71.1 ± 4.6%) with the values obtained for the L. gelidum subsp. gelidum DSM 5578T type assembly (69.0–72.4%) thus being also around the species cut-off of 70% (Table S4). However, the dDDH similarity was 70.6% between the type assemblies of these subspecies (DSM 5578T × DSM 19275T) (Supplementary Table S4).
Phylogenetic analyses classified subsp. gasicomitatum separately from subs. gelidum
To assess the evolutionary relationships between the strains belonging to the former three L. gelidum subspecies, we constructed a maximum likelihood phylogenetic tree (Fig. 1) from concatenated nucleotide sequences of three housekeeping genes atpA, pheS and rpoA used in the previous taxonomy studies to distinguish Leuconostoc species [3, 15]. In the tree topology (Fig. 1), the L. gasicomitatum strains were separated with high bootstrap support from subsp. gelidum and subsp. aenigmaticum strains. For this finding, we obtained even stronger support with a tree constructed using a core gene set of 1,141 single-copy genes (Fig. 2). In both phylogenetic treeing approaches, the three subsp. aenigmaticum strains clustered together and were placed in the same group as the subsp. gelidum strains (Figs. 1 and 2). In addition, we did not notice any clustering based on the geographical locations of the L. gasicomitatum strains. Strains from Belgium (strains coded with R-, HS- and a- or numbers only) and Latvia (Jla4-8) clustered among those isolated in Finland.
Pangenome analyses
We estimated the pangenome and core genome sizes separately for L. gasicomitatum and for L. gelidum subsp. gelidum and subsp. aenigmaticum using the model of Tettelin et al. [16] for curve fitting. Gene accumulation curves (Figs. 3 A and B) showed that the number of new genes added by each assembly continued to increase, indicating open pangenomes for both analyses. However, the expansion of the L. gasicomitatum pangenome was decreasing (Fig. 3B), and after 22 genomes fewer than 15 genes were added to the pool per new genome, suggesting that the pangenome was closing slowly. The core genome analyses (Figs. 3C and D) fitted the exponential decay model. The fitted curve for L. gasicomitatum plateaued at around 1300 genes (Fig. 3D), suggesting relatively stable core genomes.
The pangenome of L. gasicomitatum was estimated to include 3,046 genes of which 1,313 (43.1%) were considered the core genes present in 99% of the assemblies. On average, the core genome consisted of 71% (64–75%) of the coding potential of a gasicomitatum assembly. The open pangenome for L. gelidum subsp. gelidum and aenigmaticum consisted of 2703, gene, and of these 1,341 were included in the core genome. The unique genes detected in L. gelidum/L. gasicomitatum (Supplementary tables S5-S7) did not encode any properties associated with specific lifestyles related to food spoilage. Genes unique to L. gelidum subsp. gelidum and L. gasicomitatum strains were not detected.
Properties associated with lifestyle in food
Regarding to properties typical for leuconostocs, all strains contained genes encoding at least one Glucansucrase and half of them carried also genes encoding a fructansucrase. No genes associated with the formation of biogenic amines through amino acid decarboxylation or deamination were found.
L. gelidum and L. gasicomitatum are known to be able to cause spoilage of food products by producing yellow carotenoids when growing in food that contains fat, e.g. ham or sausage [17, 18]. But the genes responsible for carotenoid production in LAB (including other Leuconostoc species), e.g. crtN and crtM [19], were not detected in L. gelidum/L. gasicomitatum. It is, therefore, not clear to us how L. gelidum and L. gasicomitatum produce carotenoids or what type of carotenoids they produce.
Genetic determinants for fosfomycin and fusidic acid resistance
Next, we analysed the acquired antimicrobial resistance (AMR) determinants by comparing the L. gelidum/ L. gasicomitatum assemblies with the Comprehensive Antibiotic Resistance Database (CARD). The searches did not return any hits with good support (using “Perfect, Strict, complete genes only” as criteria) suggesting that these food-associated L. gelidum/L. gasicomitatum strains do not carry acquired or transferable antimicrobial resistance genes or allelic variants that cause antimicrobial resistance.
Regarding intrinsic resistance, the genus Leuconostoc is known to be resistant to vancomycin and the resistance mechanism is well-characterised [20]. Consistently, all 37 newly sequenced L. gelidum genomes harboured a resistant genotype of ddl gene. In addition to vancomycin, resistance to fosfomycin is common among leuconostocs [21, 22], and so we analysed the genomes for genes associated with fosfomycin resistance [23]. Among the alignments, the only known resistance determinant we identified was an amino acid substation in the fosfomycin target gene murA. A multiple protein sequence alignment of the MurA demonstrated (Supplementary Fig. 1) that all 47 L. gelidum strains had Asp substitution at position 115 (E. coli numbering) at the site encoding the active site of the MurA, a substitution known to confer fosfomycin resistance in E. coli [24].
Furthermore, we assessed the known genetic determinants behind fusidic acid resistance, another resistance characteristics of leuconostocs [25]. Sequence analysis of fusA (Supplementary Fig. 2), a chromosomal gene encoding elongation factor G (EF-G), the target protein of fusidic acid, revealed alterations that cause resistance-associated amino acid substitution in EF-G [26]. Comparison with Staphylococcus aureus showed (Supplementary Fig. 2) that L. gelidum/L. gasicomitatum genomes harboured an amino acid substitution V90I, His457Q, Leu461M and S416T (S. aureus numbering) of which at least V90I and His457Q result in high fusidic acid resistance in S. aureus [26].
Discussion
The genomic features of the 37 L. gelidum/L. gasicomitatum strains sequenced in the present study showed only little variation (Table 1; Supplementary Table S1), even though we selected as heterogenous a set of strains as possible from our database. We used MLST, fingerprinting and source attribution data accumulated over the past 25 years since 1995 (Table 1; Supplementary Table S1) to select the strains for the present study. The genome sizes (1.95 ± 0.06 Mbp), GC contents (36.6 ± 0.1%), numbers of CDS (1842 ± 72) and tRNA genes (44 ± 2) were quite similar (Table 1; Supplementary Table S2) and alike with the features present in the genomes of the 10 L. gelidum/L. gasicomitatum strains (Table 1; Supplementary Table S3) available in the public NCBI database.
The genetic relatedness indices (Table 2; Supplementary Table S4) as well as the phylogenetic analyses (Figs. 1 and 2) revealed that the strains formerly designated as L. gelidum subsp. gasicomitatum formed a genetically coherent group that was evolutionary different from the group containing the strains of the two other L. gelidum subspecies. According to the ANI and dDDH results (Table 2 and Supplementary Table 4), the taxonomic position of subsp. gasicomitatum within L. gelidum is not supported. Based on the ANI and dDDH results between the type strains of L. gelidum subspecies, Gu and Wu [4] have already suggested rejecting the proposal of Rahkila et al. [3] to reclassifiy L. gasicomitatum as L. gelidum subsp. gasicomitatum. Our study conducted with these 37 strains supports their findings and thus, the strains previously considered as subsp. gasicomitatum should be designated as L. gasicomitatum.
Rahkila et al. [3] justified the splitting of L. gelidum between three subspecies mainly because of the results obtained in the phylogenetic and optical DDH analyses. At that time, the phylogenic analyses were targeted at three housekeeping genes atpA, pheS and rpoA that had been reported to distinguish LAB species well [15]. Rahkila et al. [3] used 13 strains consisting of 6, 3 and 4 representatives of L. gelidum subsp. aenigmaticum, L. gelidum subsp. gelidum and L. gasicomitatum, respectively. Since the analysis of the concatenated atpA, pheS and rpoA sequences divided these 13 strains between three separate clusters, they subsequently conducted the optical DDH experiments using representative strains from each three clusters. In these DDH experiments, Rahkila et al. [3], used, among others, the type strains LMG18297T and LMG18811T of L. gelidum and L. gasicomitatum, respectively, and POUF4d, which is the current type strain (DSM 19375T) of subsp. aenigmaticum. The DDH result of 75% between L. gelidum and L. gasicomitatum type strains that they reported contravened the values previously published in two other studies. Björkroth et al. [2] and Kim et al. [27] had reported DDH values below 70%, i.e., 22 and 6%, respectively. In addition, Björkroth et al. (2000) had reported another value below 70%, i.e., 34%, between L. gasicomitatum LMG 18812 and L. gelidum subsp. gelidum LMG 9850T type stain. Thus, the results of the present study are in accordance with the conclusions of Björkroth et al. (2000), Kim et al. (2000) and Gu and Wu (2021), supporting that the taxonomic position of L. gelidum subsp. gasicomitatum should be restored as L. gasicomitatum.
Our analyses also show that subsp. aenigmaticum has been correctly classified as a subspecies of L. gelidum. During the past 25 years, we have isolated and constructed a comprehensive culture collection containing hundreds of L. gelidum and L. gasicomitatum food isolates. However, we have not detected more subsp. aenigmaticum strains than those used in the study of Rahkila et al. [3]. To our knowledge, only one additional scientific study of this subspecies has been published thus far. Mun et al. (2021) found L. gelidum subsp. aenigmaticum LS4 strain to improve the organoleptic qualities of kimchi juice and suggested that LS4 could be used as a functional starter culture for food (vegetable or fruit) fermentation at low temperatures. The genome and functional properties of LS4 are very similar to those detected in this study and by Rahkila et al. [3]. The reason for the low detection rate of subsp. aenigmaticum is not known to us, but the genetic differences between subsp. aenigmaticum and gelidum, and even between L. gasicomitatum, do not explain any specific traits to suggest reasons for the rare detection (Supplementary Table S5).
Figures 2 and 3 present phylogenetic trees generated with the concatenated atpA, pheS and rpoA and WG sequences, respectively. Both approaches resulted in trees containing three separate clusters with either L. gelidum subsp. gelidum, L. gelidum subsp. aenigmaticum or L. gasicomitatum strains. In both trees L. gelidum subsp. gelidum and L. gelidum subsp. aenigmaticum strains clustered adjacently, whereas L. gasicomitatum strains were located more distantly from the other two clusters. This shows that L. gelidum and L. gasicomitatum differ clearly from each other also phylogenically. No clustering to suggest any attribution to specific isolation sources was found as in the previous L. gasicomitatum MLST study by Rahkila et al. [12].
Antimicrobial resistance in Leuconostoc is of interest to the researcher community since leuconoctocs are present in various foods [6, 7, 11, 12, 22] and antimicrobial resistance, particularly, transferable or acquired resistance-related determinants, is considered a food safety concern [28]. Antimicrobial resistance properties can also be used to develop tools for a counterselection marker for recombination [29] or to design differentiating or selecting growth media for LAB [25]. Leuconostocs are Generally Regarded As Safe (GRAS, US Food and Drug Administration); or some have the Qualified Presumption of Safety by the European Food Safety Authority. However, mainly because of their intrinsic antimicrobial vancomycin resistance, they have also been associated with infections in patients receiving vancomycin. According to our analyses of the 47 L. gelidum/L. gasicomitatum genomes, these strains did not carry any acquired antimicrobial resistance genes. In addition to intrinsic vancomycin resistance, leuconostocs are intrinsically resistant to teicoplanin, fosfomycin and fusidic acid [16, 17, 26]. The vancomycin/teicoplanin resistance mechanism is known and it results from the use of D-lactate instead of D-ala in the synthesis of peptidoglycan [20]. Based on our findings, we consider that the intrinsic fosfomycin resistance in Leuconostoc is due to the Asp in MurA position 115 (E. coli numbering) known to confer resistance in E. coli [24]. In addition to leuconostocs, Fructobacillus, Weissella, Oenococcus, Pediococcus and lactobacilli appear to carry the resistant genotype, whereas other LAB genera have the sensitive genotype. Our findings related to fusidic acid resistance suggests that it results from point mutations within the fusA gene [26]. Leuconostoc, Fructobacillus and Convivina have the resistant genotype associated with a Q at the position 457 (Staphylococcus numbering), whereas Weissella and Oenococcus have the sensitive genotype related to a H at position 457. Recently [30], IsaA gene similar to a clindamycin resistant phenotype was detected in Leuconostoc fallax ATCC 700006T and Leuconostoc pseudomesenteroides NCDO 768T. However, we did not detect this gene in the 47 L. gelidum/ L. gasicomitatum genomes analysed.
A form of spoilage manifested by the formation of slime has been associated with L. gelidum/L. gasicomitatum [5]. Since all strains contained genes encoding at least one glucansucrase and half of them carried also genes encoding a fructansucrase, it can be concluded that these species have a general potential to form dextran if sucrose is available as the precursor and favourable growth condition exists. On the other hand, we did not find clear evidence of ability of forming biogenic amines (BAs) that result from decarboxylation or iminase reactions related to amino acids. BAs are unwanted metabolites in foods since they are associated with health hazards and a malodour [31]. Decarboxylation of histidine as histamine leads to a food poisoning resembling a severe allergic reaction after consumption of spoiled histidine-rich food such as tuna and some fish belonging to Scromboidaceae. Tyramine produced from tyrosine is known to trigger migraine, whereas the polyamines putrescine, cadaverine and agmatine are considered as indicators of food spoilage. We searched the genomes but did not detect genes related to agmatine demininase or histidine or tyrosine decarboxylases. We detected a gene that is annotated as an orn/lys/arg-decaboxylase by some annotations (not NCBI) in strains AMKR21, C220d, HS9, JPBL22, Kg1-2 & Vvan9). However, this gene is only half of the length of the characterised orn/lys/arg-decaboxylases. Thus, it is more likely an unknown aminotransferase. Based on the genomic data, these leuconostocs are unlikely to produce any BAs. The ability of leuconostocs to form BAs has been reported time to time but it is probably related to the former classification of “leuconostocs” that are currently considered as Oenococcus, Weissella and Fructobacillus. Due to the many changes in the taxonomy of leuconostocs, one should be cautious while reading older literature and not to mix properties of these genera in these with the current leuconostocs.
Conclusions
Genome features of 47 strains belonging to L. gasicomitatum and L. gelidum subspecies aenigmaticum and gelidum showed only limited variation even though the strains selected for this study originated from different sources, represented variable MLST and genotyping groups, and were isolated in four different countries over a time span of more than 25 years. The genetic relatedness indices as well as the phylogenetic analyses revealed that the strains formerly designated as L. gelidum subsp. gasicomitatum are not a subspecies of L. gelidum. Thus, the proposal of Rahkila et al. [3] to reclassify L. gasicomitatum as L. gelidum subsp. gasicomitatum should be rejected and the species status for L. gasicomitatum restored as also recently suggested by Wu and Gu [4]. Analyses of the genomes did not reveal any source attribution or other clustering associated with the lifestyle of the strains. The pangenome of L. gasicomitatum was estimated to include 3,046 genes of which were 1,313 (43.1%) were considered the core genes present in 99% of the assemblies. After 22 genomes, less than 15 genes were added to the pool per new genome, suggesting that this pangenome was closing slowly, whereas L. gelidum pangenomes (2,703 genes) remained open. According to our analyses on the 47 L. gelidum/ L. gasicomitatum genomes, these strains did not carry any acquired antimicrobial resistance genes or genes associated with production of harmful biogenic amines.
Methods
Selection and sequencing of the strains
Thirty-seven L. gelidum strains isolated during previous research activities from various foods and food-related samples (Table 1) were selected for genome sequencing. The source and county of origin are listed in Table 1.
For sequencing, DNA was extracted from cells collected from MRS broth cultures (25 °C) as described earlier [3]. DNA purity was confirmed using a NanoDrop 2000™ spectrophotometer and quantified using a Qubit® 3.0 fluorometer (Thermo Scientific, USA). Library preparation and sequencing were carried out at the Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Finland. First, a paired-end DNA library for Illumina sequencing was prepared and normalised with ~ 300 bp inserts using a Nextera XT DNA library Preparation Kit (Illumina, CA, USA). The prepared library was sequenced using an Illumina HiSeq 2500 platform for 100 bp paired-end reads.
In addition to 37 sequenced strains, three complete and seven draft genomes of L. gelidum/L. gasicomitatum, including the type assemblies of each subspecies, were retrieved (Supplementary Table S2). Among them are four L. gasicomitatum strains sequenced by Andreevskaya et al. [32] that at the time of writing were incorrectly classified as Leuconostoc inhae by the European Nucleotide Archive (ENA). Furthermore, the genome sequence for Leuconostoc kimchii IMSNU 11154 T (BioProject PRJNA40837) to was used as a root in the phylogenetic analyses. All software were run with default parameters, unless otherwise stated.
Genome assembly
Following sequencing, the quality control of the raw sequence reads was conducted with FastQC v.0.11.9 available at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/, the adaptor removal and trimming were performed with Trimmomatic v. 0.40 [33]. The draft genomes were assembled with Velvet v.1.2.08 [34] together with VelvetOptimiser v.2.2.6 (https://github.com/tseemann/VelvetOptimiser) and their quality for completeness and contamination was evaluated using CheckM v.1.0.10 [35]. The assembled genomes were annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) [36].
The accession numbers and details of the genomes used in this study are provided in Supplementary Tables S1 and S2.
To obtain genome-to-genome relatedness indices, Pairwise average nucleotide identity (ANI) values based on BLAST were determined using pyani.py v.0.2.7 (https://github.com/widdowquinn/pyani) and digital DNA–DNA hybridisation (dDDH) with the genome-to-genome distance calculator 2.1 (GGDC) server (available at: http://ggdc.dsmz.de/ggdc.php) using formula 2 recommended for draft genomes. The species boundary for ANI and dDDH values were set for/at? 95 ~ 96 and 70%, respectively.
Pangenome and core genome
The pangenome and core genome sizes were estimated using BPGA v.1.3.0 [37].
Phylogenetic analysis
The nucleotide sequences of all 1,141 single-copy orthologous genes shared by all 47 of the genomes were retrieved by homologous searches using GET_HOMOLOGUES v.3.2.4 [38] and aligned using Prank v.140603 [39] with the “-codon” option for codon aware alignment. Gblocks v.0.91b [40] was used to select conserved regions for the construction of concatenated nucleotide sequences. An approximative Maximum-Likelihood phylogenetic tree was built using FastTree v.2.1.11 [41], with the generalised time-reversible (GTR) model. Maximum-Likelihood phylogenetic trees were constructed for concatenated housekeeping genes atpA, pheS and rpoA using IQ-TREE v.1.6.12 [42] with the TPM2u + F + I and the HKY + F + G4 nucleotide substitution models, respectively.
Prediction of antibiotic resistance
The Resistance Gene Identifier (RGI) tool of the Comprehensive Antibiotic Resistance Database “CARD” version 3.1.3 [43] was used to predict resistomes by using contigs file with the parameters “Perfect, Strict, complete genes only” as search criteria. Literature information on intrinsic antibiotic resistance in Leuconostoc species was also collected. The protein sequences were aligned using multiple sequence alignment MAFFT v.7.408 [44]. Aligned sequences were visualized with GeneDoc v.2.7 [45].
Abbreviations
- AMR:
-
Antimicrobial resistance
- ANI:
-
Pairwise average nucleotide identity
- dDDH:
-
Digital DNA–DNA hybridization
- MLST:
-
Multilocus sequence typing
- PFGE:
-
Pulsed field gel electrophoresis
- RGI:
-
Resistance Gene Identifier
References
Shaw B, Harding C. Leuconostoc gelidum sp. nov. and Leuconostoc carnosum sp. nov. from chill-stored meats. Int J Syst Evol Microbiol. 1989;39(3):217–23.
Björkroth KJ, Geisen R, Schillinger U, Weiss N, De Vos P, Holzapfel WH, et al. Characterization of Leuconostoc gasicomitatum sp. nov., associated with spoiled raw tomato-marinated broiler meat strips packaged under modified-atmosphere conditions. Appl Environ Microbiol. 2000;66(9):3764–72.
Rahkila R, De Bruyne K, Johansson P, Vandamme P, Björkroth J. Reclassification of Leuconostoc gasicomitatum as Leuconostoc gelidum subsp. gasicomitatum comb. nov., description of Leuconostoc gelidum subsp. aenigmaticum subsp. nov., designation of Leuconostoc gelidum subsp. gelidum subsp. nov. and emended description of Leuconostoc gelidum. Int J Syst Evol microbiol. 2014;64(4):1290–5. https://doi.org/10.1099/ijs.0.058263-0.
Wu Y, Gu CT. Rejection of the reclassification of Leuconostoc gasicomitatum as Leuconostoc gelidum subsp. gasicomitatum based on whole genome analysis. Int J Syst Evol Microbiol. 2021;71(9). https://doi.org/10.1099/ijsem.0.005027.
Lyhs U, Koort JM, Lundström H-S, Björkroth KJ. Leuconostoc gelidum and Leuconostoc gasicomitatum strains dominated the lactic acid bacterium population associated with strong slime formation in an acetic-acid herring preserve. Int J Food Microbiol. 2004;90(2):207–18.
Pothakos V, Snauwaert C, De Vos P, Huys G, Devlieghere F. Psychrotrophic members of Leuconostoc gasicomitatum, Leuconostoc gelidum and Lactococcus piscium dominate at the end of shelf-life in packaged and chilled-stored food products in Belgium. Food Microbiol. 2014;39:61–7.
Vihavainen EJ, Björkroth KJ. Diversity of Leuconostoc gasicomitatum associated with meat spoilage. Int J Food Microbiol. 2009;136(1):32–6.
Kergourlay G, Taminiau B, Daube G, Vergès MCC. Metagenomic insights into the dynamics of microbial communities in food. Int J Food Microbiol. 2015;213:31–9.
Jung JY, Lee SH, Jeon CO. Complete genome sequence of Leuconostoc gelidum strain JB7, isolated from kimchi. J Bacteriol. 2012;194(23):6665. https://doi.org/10.1128/JB.01806-12.
Kim BJ, Lee HJ, Park SY, Kim JH, Han HU. Identification and characterization of Leuconostoc gelidum, isolated from kimchi, a fermented cabbage product. J Microbiol. 2000;38(3):132–6.
Jung JY, Lee SH, Jeon CO. Kimchi microflora: history, current status, and perspectives for industrial kimchi production. Appl Microbiol Biotechnol. 2014;98(6):2385–93.
Rahkila R, Johansson P, Säde E, Paulin L, Auvinen P, Björkroth J. Multilocus sequence typing of Leuconostoc gelidum subsp. gasicomitatum, a psychrotrophic lactic acid bacterium causing spoilage of packaged perishable foods. Appl Environ Microbiol. 2015;81(7):2474–80.
Richter M, Rosselló-Móra R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci. 2009;106(45):19126–31.
Meier-Kolthoff JP, Auch AF, Klenk H-P, Göker M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics. 2013;14(1):1–14.
De Bruyne K, Schillinger U, Caroline L, Boehringer B, Cleenwerck I, Vancanneyt M, et al. Leuconostoc holzapfelii sp. nov., isolated from Ethiopian coffee fermentation and assessment of sequence analysis of housekeeping genes for delineation of Leuconostoc species. Int J Syst Evol Microbiol. 2007;57(12):2952–9.
Tettelin H, Masignani V, Cieslewicz MJ, Donati C, Medini D, Ward NL, et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial “pan-genome.” Proc Natl Acad Sci. 2005;102(39):13950–5.
Cai Y, Benno Y, Takeda A, Yoshida T, Itaya T, Nakase T. Characterization of Leuconostoc species isolated from vacuum-packaged ham. J Gen Appl Microbiol. 1998;44(2):153–9.
Kröckel L. Gelbe Farbabweichungen bei kühl gelagerten, vorverpackten Weißwürsten: Leuconostoc gelidum verursacht Verfärbungen. Fleischwirtschaft. 2006;86(9):129–33.
Hagi T, Kobayashi M, Kawamoto S, Shima J, Nomura M. Expression of novel carotenoid biosynthesis genes from Enterococcus gilvus improves the multistress tolerance of Lactococcus lactis. J Appl Microbiol. 2013;114(6):1763–71.
Kuzin AP, Sun T, Jorczak-Baillass J, Healy VL, Walsh CT, Knox JR. Enzymes of vancomycin resistance: the structure of D-alanine–D-lactate ligase of naturally resistant Leuconostoc mesenteroides. Structure. 2000;8(5):463–70.
Buu-Hoi A, Branger C, Acar J. Vancomycin-resistant streptococci or Leuconostoc sp. Antimicrob Agents Chemother. 1985;28(3):458–60.
Hemme D, Foucaud-Scheunemann C. Leuconostoc, characteristics, use in dairy technology and prospects in functional foods. Int Dairy J. 2004;14(6):467–94.
Silver LL. Fosfomycin: mechanism and resistance. Cold Spring Harb Perspect Med. 2017;7(2): a025262.
Kim DH, Lees WJ, Kempsell KE, Lane WS, Duncan K, Walsh CT. Characterization of a Cys115 to Asp substitution in the Escherichia coli cell wall biosynthetic enzyme UDP-GlcNAc enolpyruvyl transferase (MurA) that confers resistance to inactivation by the antibiotic fosfomycin. Biochemistry. 1996;35(15):4923–8.
Andreevskaya M, Jääskeläinen E, Johansson P, Ylinen A, Paulin L, Björkroth J, et al. Food spoilage-associated Leuconostoc, Lactococcus, and Lactobacillus species display different survival strategies in response to competition. Appl Environ Microbiol. 2018;84(13):e00554-e618.
Besier S, Ludwig A, Brade V, Wichelhaus TA. Molecular analysis of fusidic acid resistance in Staphylococcus aureus. Mol Microbiol. 2003;47(2):463–9.
Kim J, Chun J, Han HU. Leuconostoc kimchii sp. nov., a new species from kimchi. Int J Syst Evol Microbiol. 2000;50(5):1915–9.
Hazards EBPEpoB, Koutsoumanis K, Allende A, Alvarez-Ordóñez A, Bolton D, Bover-Cid S, et al. Scientific opinion on the update of the list of QPS-recommended biological agents intentionally added to food or feed as notified to EFSA (2017–2019). EFSA J. 2020;18(2):e05966.
Zhang S, Oh J-H, Alexander LM, Özçam M, Van Pijkeren J-P. d-Alanyl-d-alanine ligase as a broad-host-range counterselection marker in vancomycin-resistant lactic acid bacteria. J Bacteriol. 2018;200(13):e00607-e617.
Salvetti E, Campedelli I, Larini I, Conedera G, Torriani S. Exploring antibiotic resistance diversity in Leuconostoc spp. by a genome-based approach: focus on the lsaA Gene. Microorganisms. 2021;9(3):491.
Santos MS. Biogenic amines: their importance in foods. Int J Food Microbiol. 1996;29(2–3):213–31.
Andreevskaya M, Hultman J, Johansson P, Laine P, Paulin L, Auvinen P, et al. Complete genome sequence of Leuconostoc gelidum subsp. gasicomitatum KG16–1, isolated from vacuum-packaged vegetable sausages. Stand Genomic Sci. 2016;11(1):1–12.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18(5):821–9.
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25(7):1043–55.
Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, et al. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 2016;44(14):6614–24.
Chaudhari NM, Gupta VK, Dutta C. BPGA—an ultra-fast pan-genome analysis pipeline. Sci Rep. 2016;6:24373.
Contreras-Moreira B, Vinuesa P. GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis. Appl Environ Microbiol. 2013;79(24):7696–701.
Löytynoja A, Goldman N. A model of evolution and structure for multiple sequence alignment. Philos Trans R Soc B Biol Sci. 2008;363(1512):3913–9.
Talavera G, Castresana J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007;56:564.
Price MN, Dehal PS, Arkin AP. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS ONE. 2010;5: e9490.
Trifinopoulos J, Nguyen L-T, von Haeseler A, Minh BQ. W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016;44:W232–5.
Raphenya AR, Lau TT, Tsang KK, Bouchard M, Edalatmand A, et al. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2020;48(D1):D517–25.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecilar Biology and Evolution. 2013;30:772–80.
Nicholas KB, Nicholas HBJ. GeneDoc: a tool for editing and annoting multiple sequence alignments. Distributed by the authors (https://github.com/karlnicholas/GeneDoc). 1997.
Acknowledgements
We thank Henna Niinivirta for skilful technical assistance and Professor Devlieghere for providing the Belgian strains. The CSC-IT Center for Science Ltd. is acknowledged for providing computational resources.
Funding
This study was supported by the Academy of Finland project CODELAB (307855) and the Novo Nordisk Foundation Distinguished investigator grant of Björkroth. Open access was funded by Helsinki University Library.
Author information
Authors and Affiliations
Contributions
PJ, ES, PA & JB designed the experiments, analysed and interpreted data; PJ and JH performed the genome analyses; PJ, ES & JB wrote the paper. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Availability of data and materials
The datasets generated and/or analysed during the current study are available in the NCBI nucleotide database and the accession numbers are listed in the Supplementary Tables S2 and S3.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1:
Supplementary Fig. 1. Multiple alignment ofrepresentative MurA protein sequences from different genera of LAB, Escherichia coli and Staphylococcus aureus. The MurA sequences used in the alignment are: S.aureus Q931H5, E. coli P0A749, Leuconostoc gasicomitatum A0A7H9BCM4, Fructobacillus fructosus A0A3F3I000, Oenococcus oeni A0NKT6, Weissella confusa A0A1T4J435, Pediococcus acidilactici A0A1A5VP62, Dellaglioa algida A0A0R1HHE0, Carnobacterium maltaromaticum K8E679, Enterococcus faecium A0A133CSM8, Lactococcus lactis Q9CIP4 and Streptococcus pyogenes P0DC46. Supplementary Fig. 2. Multiple alignment ofrepresentative FusA protein sequences from different genera of LAB and from Staphylococcus aureus: The FusA sequences used in the alignment are: Leuconostoc gasicomitatum A0A175CRG2, Fructobacillus sp. A0A0J5PAW1, Weissella viridescens A0A0R2H6R8, Ooenococcus oeni Q04ED6, S. aureus P68790, Lactococcus carnosus A0A0D6E076, Carnobacterium divergens A0A4R9CM82, Eenterococcus faecalisQ839G9, Vagococcus fessus A0A430A960and Paussilactobacillus oligofermentansA0A0R1RP22.
Additional file 2:
Table S1. Strains and genomes studied. Table S2. Accession numbers and general genome features of the 37 L. gelium strains sequenced in this study. Table S3. Accession numbers and general genome features of the 10 L. gelium genomes retrieved from NCBI. Table S4. Matrix with genome-to-genome ANI (lower part) and dDDH (upper part) values (%) obtained for 47 L. gelidum/L. gasicomitatum genomes. The values of and above 95% and 70% for ANI and dDDHA, respectively, are highlighted in gray. Table S5. Genes unique to L. gelidum subsp. aenigmaticum strains. Table S6. Genes unique to L. gasicomitatum strains. Table S7. Genes unique to L. gelidum subsp. gelidum/aenigmaticum strains. References to publications cited in the Supplementary Table S1.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Johansson, P., Säde, E., Hultman, J. et al. Pangenome and genomic taxonomy analyses of Leuconostoc gelidum and Leuconostoc gasicomitatum. BMC Genomics 23, 818 (2022). https://doi.org/10.1186/s12864-022-09032-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12864-022-09032-3