- Research article
- Open Access
Exploring the genomic traits of fungus-feeding bacterial genus Collimonas
BMC Genomics volume 16, Article number: 1103 (2015)
Collimonas is a genus belonging to the class of Betaproteobacteria and consists mostly of soil bacteria with the ability to exploit living fungi as food source (mycophagy). Collimonas strains differ in a range of activities, including swimming motility, quorum sensing, extracellular protease activity, siderophore production, and antimicrobial activities.
In order to reveal ecological traits possibly related to Collimonas lifestyle and secondary metabolites production, we performed a comparative genomics analysis based on whole-genome sequencing of six strains representing 3 recognized species. The analysis revealed that the core genome represents 43.1 to 52.7 % of the genomes of the six individual strains. These include genes coding for extracellular enzymes (chitinase, peptidase, phospholipase), iron acquisition and type II secretion systems. In the variable genome, differences were found in genes coding for secondary metabolites (e.g. tripropeptin A and volatile terpenes), several unknown orphan polyketide synthase-nonribosomal peptide synthetase (PKS-NRPS), nonribosomal peptide synthetase (NRPS) gene clusters, a new lipopeptide and type III and type VI secretion systems. Potential roles of the latter genes in the interaction with other organisms were investigated. Mutation of a gene involved in tripropeptin A biosynthesis strongly reduced the antibacterial activity against Staphylococcus aureus, while disruption of a gene involved in the biosynthesis of the new lipopeptide had a large effect on the antifungal/oomycetal activities.
Overall our results indicated that Collimonas genomes harbour many genes encoding for novel enzymes and secondary metabolites (including terpenes) important for interactions with other organisms and revealed genomic plasticity, which reflect the behaviour, antimicrobial activity and lifestylesof Collimonas spp.
The genus Collimonas comprises soil bacteria with the ability to grow at the expense of living fungal hyphae under nutrient-limited conditions [1–3]. Since the first description of Collimonas, more mycophagous bacteria have been detected , but Collimonas species are still highly interesting in view of the interactions between bacteria and fungi in soil and the associated ecosystem functions including suppression of pathogens and the production of novel bioactive compounds.
Collimonas belongs to the family Oxalobacteraceae, class Betaproteobacteria. The first Collimonas isolates were obtained within the framework of a project searching for a naturally occurring biocontrol agent of fungi pathogenic to marram grass (Ammophilia arenaria) and were determined as being dominant among the cultivable chitinolytic bacteria in the acidic Dutch dune soils . Three species have been described so far: C. fungivorans, C. pratensis and C. arenae . All three species display the ability to feed on fungi (mycophagy), to degrade chitin and to dissolve minerals (weathering) . However, Collimonas strains differ in important ecological traits such as colony morphology, the ability to oxidize various carbon sources, in their antibacterial, antifungal and antioomycetal activities [6, 7]. A comparative genomic approach would help to reveal the genetic basis of these ecological differences. Applying a comparative genomic hybridization approach  showed that a gene cluster involved in the production of an antifungal polyyne was only found in the genome of a few Collimonas strains. The study of Mela et al.  was biased in the sense that the hybridization assay used allowed only to screen for absence/presence of genes in other strains as compared to C. fungivorans strain Ter 331, the only strain for which the complete genome was sequenced at that time. To reveal the real plasticity of Collimonas strains more genome sequences are needed. This will demonstrate constant and variable genetic elements, and hence determine the adaptations of Collimonas species and traits important for inter-specific microbial interactions in the soil. To date, only two genome sequences of Collimonas are publicly available [7, 8]. Here we report on full genome sequences of five Collimonas strains across the three recognized species and performed comparative genome analysis including the recently published C. fungivorans Ter331 genome . Gene clusters with potential relevance for interactions of the Collimonas species with fungi and other microorganisms were further investigated by gene knock-out mutations and/or enzymatic characterization.
Results and discussion
A genome sequence analysis was performed for strains Ter6, Ter91, Ter291, Ter10 and Ter282, using a combined strategy of Illumina Hiseq and PacBio sequencing. A summary of the general genomic features (size, GC content, predicted number of coding sequences, and number of rRNAs) of each Collimonas strain is presented in Table 1. Considerable variation in genome size and differences in plasmid content was observed. The six genomes vary in size by approximately one megabase (ranging from 4.7–5.7 Mb) with the number of coding sequences (CDSs) ranging from 4436 to 5424, indicating substantial strain-to-strain variation. The genomes of C. fungivorans and C. pratensis are larger and have higher GC content than the two strains of C. arenae. Only strain C. fungivorans Ter331 has a plasmid, described in detail by Mela et al. . Despite the absence of a plasmid, C. pratensis Ter91 has the largest genome size (5.7 kb) and highest number of encoding genes (5424), which is likely due to the large number of horizontally acquired genes as indicated by the number of genomic islands (see below).
A phylogenetic tree based on 233 protein-coding genes (Fig. 1a) revealed that C. fungivorans and C. pratensis are more closely related to each other than to C. arenae. Similar clustering was observed based on phylogenetic trees generated with whole genome fragments (200 bp fragment sizes) (Fig. 1b) and 16S rRNA (Additional file 1: Figure S1) where each strain falls into its respective species clade.
Core and pan-genome analysis
A core genome containing 2339 predicted orthologous groups was identified for the six Collimonas strains based on the all-vs-all BLASTp search (Fig. 2a). This core genome represents 43.1 to 52.7 % of the predicted ORF’s of each strain (Fig. 2a), illustrating a large degree of genomic diversity between these strains. Each of the six genomes includes 125 to 835 orthologous groups that are unique (Fig. 2a). Species core orthologous clusters and strain-specific unique clusters within the three Collimonas species were examined, respectively (Fig. 2b-d). In the three species, 5868, 5810 and 4546 orthologous clusters were identified and of these, 3859, 4194 and 3829 orthologs were present as the species core genome for C. fungivorans, C. pratensis, C. arenae, respectively (Fig. 2b-d). A core-pan genome evolution plot summarizing the variability in each possible combination of Collimonas species shows that the number of unique (singleton) gene clusters is stable. The variable gene clusters are increasing and the core gene clusters are decreasing (Fig. 2e). In order to determine the differences in functions encoded by the core and variable genome of each strain, the proportion of proteins in each COG (Clusters of Orthologous Groups) was plotted versus the COG function. The relative abundance of almost all the COG categories was higher in the core genome of the six strains than in the variable genome. This was not the case for COG categories N (Cell motility), U (Intracellular trafficking, secretion, and vesicular transport) and proteins that cannot be assigned in COG categories (data not shown) where the proteins were more abundant in variable genomes for C. fungivorans and C. arenae (Fig. 3a). This is most probably due to the fact that a flagellar and chemotaxis-related gene cluster is only present in these two species but not in C. pratensis (Additional file 2: Table S1). This finding is consistent with the observed reduced swimming motility of C. pratensis Ter91 and Ter291 as compared to the other four strains (Fig. 3b).
Whole genome alignments of the six strains were performed to obtain information on the nucleotide level synteny (Additional file 1: Figure S2). These alignments revealed a very high level of synteny when genomes of strains from the same species were compared (Additional file 1: Figure S2A-C), but many rearrangements and inversions were observed between the genomes of all strains (Additional file 1: Figure S2D).
Genomic islands (GIs), bacteriophages and CRISPRs
Genomic islands (GIs) are mobile genetic elements acquired by horizontal transfer, which carry multiple genes that are typically involved in pathogenesis or symbiosis. The Collimonas genomes carry 7 to 47 GIs ranging from 4.0 kb to 64 kb in size (Fig. 3c). All together, the six Collimonas genomes have 139 genomic islands. The large numbers of GIs indicate a complex history of gene recombination and horizontal transfer between bacterial relatives. The genomes of all strains contain one to five possible phages, each ranging in size from 7.0 to 59.9 kb. In total, the six genomes have 18 phages with some of the phages falling into the GIs (Fig. 3c). CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats) are DNA loci that are involved in prokaryotic immunity to phage infection. Putative CRISPRs were identified using the CRISPRsFinder program . Two confirmed CRISPRs were present only in C. fungivorans Ter6 genome. For the other five genomes, no or only questionable CRISPRs were found (data not shown).
In Gram-negative bacteria, type II secretion systems (T2SSs) are the most ubiquitous secretion systems used by bacteria to export many extracellular enzymes. T2SSs are conserved and known as a two-step process: proteins are translocated across the inner membrane by the Sec or Tat pathway, and then transported from the periplasm to the exterior by an outer membrane secretin . SecABDEFY, yajC, yidC, ftsY, and ffh and tatABC encoding genes for Sec and Tat pathways respectively were found to be present in all Collimonas genomes (Additional file 1: Figure S3A; Additional file 2: Table S2). The outer membrane secretion unit of the T2SS in the Collimonas genomes resembles the Gsp system which contains one gene cluster gspD-N, responsible for secretion of protease, lipase, and phospholipase C in Burkholderia  (Additional file 1: Figure S3A; Additional file 2: Table S2). A newly described subtype of T2SS, tad locus (tight adherence)  was identified in all six genomes. The tad locus encodes the machinery required for the assembly of adhesive Flp (fimbrial low-molecule-weight protein) pili and is necessary for bacterial adhesion to surfaces, biofilm formation, and pathogenesis as shown for Aggregatibacter actinomycetemcomitans , Haemophilus , Pseudomonas , Yersinia etc. . For the mycophagous behavior of Collimonas, the adhesion to fungal hyphae might be of prime importance  and the presence of the T2SS tad locus in all strains indicates that it may be an essential trait for the mycophagy lifestyle of this species.
Type III secretion systems (T3SSs) are used by various Gram-negative bacteria to inject effector proteins into host cells, promoting either mutual benefit or pathogenesis [18, 19] and have been described as important for bacterial interaction with fungi . Three Collimonas strains Ter331, Ter6 and Ter91 carry hrp-hrc1 family gene clusters of T3SS and a second T3SS (Additional file 1: Figure S3A; Additional file 2: Table S2). The T3SSs play crucial role in the virulence of plant and human pathogens . However, their functions in non-pathogenic bacteria are still poorly understood; there are indications that mutation of T3SSs in a plant-growth promoting bacteria P. fluorescens SBW25 resulted in a significant reduction in the potential of the bacterium to colonize the root tips of sugar beet seedlings . They might also be involved in bacterial-fungal interactions, facilitating bacteria migration along fungal hyphae . For Collimonas, the T3SSs may be important to inject membrane disturbing compounds into the fungal host to get access to nutrients inside the fungal hyphae .
Type VI secretion systems (T6SSs) are conserved and prevalent in Gram-negative bacteria. They are known to be involved in competition, predation and inter-specific bacterial interactions [23–25]. In this study, we identified a cluster of genes encoding T6SS only in C. fungivorans and C. pratensis strains, but not in the C. arenae species. This may indicate that horizontal gene transfer events or evolutionary genes loss have occurred. Furthermore, in C. fungivorans Ter331, part of the T6SS is located on a genomic island.
Further functional studies are needed to determine the exact role of these secretion systems for Collimonas lifestyle and in particular for the attack of fungi.
Signal transduction systems
Signal transduction systems play important roles for many bacteria enabling them to detect and respond to changes and stresses in the environment . Each Collimonas genome encodes 267 to 365 one-component systems (1CSs) which are the majority of signal transduction systems in prokaryotes  and 90 to 109 two component system (TCSs) (Additional file 2: Table S3). Additionally, 9 to 29 genes involved in chemotaxis systems were found. Extracytoplasmic function (ECF) sigma factors which comprise the largest group among the σ70 family  were also found in all Collimonas genomes with numbers ranging from 9 to 11. Higher numbers of signal transduction system were predicted in C. fungivorans and C. pratensis as compared to C. arenae. The overall high number of genes related to signal transduction systems (8–9 % of predicted sequences in the six Collimonas genomes) suggest that Collimonas possess the ability to sense environmental signals and cues important for their growth, survival and interactions in the heterogeneous and complex soil environment. This is in consistent with the previous report that soil bacteria contain higher number of signal transduction systems as compared to bacteria from a stable environment .
Our genomic analysis revealed that all six Collimonas genomes contain two QS genes: one autoinducer gene and one luxR-type transcriptional regulator. They show 40 % homology to the CepIR system from Burkholderia (Additional file 2: Table S4) which is known to regulate protease, lipases, chitinases and some other exoenzymes production [29, 30]. Quorum sensing assay performed with the indicator strain C. violaceum CV026 revealed clear short chain AHL production in C. fungivorans Te331, Ter6 and C. pratensis Ter91, Ter291 strains, but no or trace amounts in C. arenae Ter10 and Ter282 strains (Additional file 1: Figure S3B). In all strains, AHL production was detected when A. tumefaciens NT1 was used as QS bioreporter (Additional file 1: Figure S3C).
Secondary metabolome of Collimonas strains
Bacteria often produce a set of secondary metabolites with antimicrobial properties important for competition and survival in competitive environments. In Collimonas, the secondary metabolites are thought to play an important role enabling mycophagous growth, namely by disturbing the fungal membrane integrity . Although Collimonas was suggested to represent a valuable resource for the discovery of novel molecules and enzymes , to date only two antimicrobial compounds were described for this genus, namely violacein and collimomycin [31, 32]. Violacein was identified in the Collimonas CT strain isolated from an aquatic environment and revealed antibacterial activity against Micrococcus luteus . However, in the six Collimonas genomes here, no genes encoding violacein were identified.
Collimomycin is a polyacetylenic compound with alternating triple and single carbon-carbon bonds  which is produced by C. fungivorans Ter331 and was shown to have strong antifungal activities . The corresponding biosynthesis cluster K, is only partially present in C. fungivorans Ter6, and completely absent in the other four genomes (Fig. 3c, Additional file 2: Table S5). Moreover, the part of the collimomycin gene cluster that is present only in C. fungivorans Ter331 is located in a genomic island suggesting that its presence is due to a horizontal gene transfer event.
Exoenzymes are extracellular enzymes produced inside the cell, and subsequently released outside the cell to perform extracellular digestion. Exoenzymes may be of importance for Collimonas nutrient acquisition, microbial interactions, mycophagy and weathering.
Chitin is a major component of fungal cell walls and is a homopolymer of N-acetyl-D-glucosamine (GlcNAc). Chitinases are able to hydrolyze the 1,4-beta-linkages of chitin . Two loci A and B of chitinase biosynthesis and transport were already found in the genome of C. fungivorans Ter331 . We observed that the other five Collimonas genomes also contain a complete set of genes in these two loci (Additional file 2: Table S6). This is in line with previous observations based on comparative genomic hybridization study  and indicates that acquisition of these genes occurred before Collimonas speciation. Phenotypic evaluation of chitinase production of these strains was confirmed by halo formation on water-agar plates containing colloidal chitin . Bacterial chitinase activity has often been reported to be linked to antifungal properties [35–38]. Indeed, when adding chitinase inhibitor allosamidin, the growth of Collimonas on fungi was decreased, suggesting potential contribution to its mycophagous ability . However, mutants in the chitinase loci of C. fungivorans Ter331 showed no difference during in vitro antagonism tests . This suggests that chitinases might not contribute solely to the antifungal activities of Collimonas. The antifugal activities might be coupled with the production of other secondary metabolites.
Phospholipases are a group of enzymes that catalyze the cleavage of phospholipids. Two major phospholipase activities can be defined by the site of cleavage, namely in the hydrophobic diacylglycerol moiety (PLA) or in the polar head group of the amphipathic phospholipid (PLC and PLD) (Schmmiel and Miller, 1999). In general, 11 to 15 phospholipases from four different groups: phospholipase A1, phospholipase C, phospholipase D and patatin phospholipase were detected in the six Collimonas genomes (Fig. 4a; Additional file 2: Table S7). Phospholipases are considered virulence factors for pathogenic bacterial species which cause tissue destruction, lung infections, hemolysis etc. . Given the cleavage properties of phospholipids we speculate that they might be involved in nutrient acquisition via fungal membrane disturbing activities as well as in defense against competitors.
In the six Collimonas genomes, 176 to 212 peptidases were predicted and nine families of proteolytic enzymes were identified (Fig. 4b). Among them, serine and metallo peptidases are the two dominant families. The Collimonas strains in our study were tested positive for exoprotease production (Fig. 4c). Serine proteases are one of the most abundant groups of proteolytic enzymes found in all living organisms  and in prokaryotes, serine proteases are involved in several biological processes associated with cell signaling, defense response and development [42–44]. Furthermore, serine protease can be involved in regulating the biosynthesis of lipopeptides, which can play a role in the suppression of other microbes .
Siderophores are low molecular weight, high-affinity iron chelating compounds produced by microorganisms under iron limited conditions and function in solubilization, transport and storage of iron [46, 47]. Siderophore production can act as an antagonistic mechanism by scavenging limited iron from the soil environment, thereby reducing the amount of available iron for other organisms. Our analysis revealed that all six strains encode biosynthesis clusters which resemble ornibactin (Fig. 4e), a siderophore synthesis cluster of Burkholderia cenocepacia . Siderophore production was confirmed for all the six Collimonas strains, albeit with different production efficiency (Fig. 4d).
Next to siderophore production, other mechanisms of iron acquisition were reported. For example, recently, a novel alternative siderophore-independent iron uptake system was identified in Burkholderia, named ftr bcc ABCD locus. This ftrABCD operon was identified in all six Collimonas genomes (Fig. 4f), indicating that there are more strategies for iron uptake besides ornibactin production. Moreover, we found genes coding for the production of bacterioferritin, a type of iron-storage protein . The gene is often adjacent to genes encoding a small [2Fe-2S]-ferredoxin Bfd. Downstream of the bfd gene is the tonB-exbB-exbD cluster which encodes a system to transduce cellular energy to outer-membrane receptors for siderophores and haemin . The bacterioferritin encoding gene bfr and adjacent genes bfd and tonB-exbB-exbD cluster were found in the six Collimonas genomes (Fig. 4g). Furthermore, a fecIR operon was also found in all six genomes (Fig. 4h). It is known that the transcription of genes for ferric-citrate transport in E. coli requires FecI, and FecR, a cytoplasmic membrane protein encoded by the second gene in the Fur-repressed fecIR operon which transmits an external iron signal to the cytoplasmic FecI protein . A haem uptake system similar to that of Burkholderia cenocepacia J2315 was found in the genomes of C. fungivorans Ter6 and C. pratensis Ter91, encoded by the hmu operon . This operon is comprised of five genes (Fig. 4i) and was suggested to be Fur-mediated. Furthermore, fungi are known to produce haem , and, therefore, it is plausible that Collimonas are able to use the fungal haem as source of iron.
NRPS and PKS-NRPS genes encoding metabolites
Lipopeptides (LPs) are compounds composed of a lipid tail with a linear or cyclic oligopeptide . They exhibit surfactant, antimicrobial, anti-predation, and cytotoxic properties [55, 56]. LPs are synthesized in bacteria by large nonribosomal peptide synthetases (NRPSs) via a thiotemplate process. The structural diversity of the LPs is due to differences in the length, composition of the fatty acid tail, and the number, type and configuration of the amino acids in the peptide moiety. Via in silico analysis, we identified gene clusters for LP biosynthesis of tripropeptin A in the genomes of C. fungivorans Ter331 and Ter6 (Fig. 5a). Although the structure of tripropeptin A has been known for more than a decade [57, 58], genetic analysis of its biosynthesis was only recently reported . Our study revealed that three NRPS genes trpA, trpB, trpC are organized in a single-operon (Fig. 5a). The trpA gene, encodes an NRPS with the first five modules, and trpB, trpC encode one and two modular NRPS, respectively. The wild type C. fungivorans Ter331 and Ter6 possess antibacterial activity against Staphylococcus aureus but the other four strains from C. pratensis and C. arenae (Fig. 5b) do not. The site-directed ∆trpA mutant of C. fungivorans Ter331 lacks this antagonism (Fig. 5b) indicating that tripropeptin A is indeed involved in antibacterial activity which is a unique trait for C. fungivorans strains.
Another 16.9 kb NRPS gene was found in C. fungivorans Ter331, Ter6 and C. pratensis Ter91, Ter291 strains (Figs. 3c and 5c). This gene is composed of five amino acids. There are no hits to known lipopeptides indicating a new lipopeptide. Collimonas strains are well known for their ability to suppress a range of fungi and oomycetes [3, 32, 60, 61]. However, our study revealed that the six Collimonas strains differed in the in vitro suppression against fungal and oomycetal pathogens (Fig. 5d; Additional file 2: Table S8). When the gene encoding the new lipopeptide was knocked out in C. fungivorans Ter331 by site-directed mutagenesis, reduced suppression was observed (Fig. 5d). This indicates that the new lipopeptide contributes to antimicrobial activity against both fungal and oomycetal pathogens. For the C. arenae species, no genes coding for the production of the new lipopeptide were found in the genome, but the strains showed suppressing activities against fungal and oomycetal pathogens (Additional file 2: Table S8). This indicates that apart from new lipopeptides, there may be other factor(s) involved in antimicrobial activity or that the suppression activity is due to synergistic effects rather than to a single compound. Thus, the different types of lipopeptide produced by the different Collimonas strains may partly explain the variability in the fungal inhibition behavior of these strains. The variability is also reflected in the unknown orphan gene clusters described below.
Unknown orphan NRPS and PKS-NRPS hybrid gene clusters
Within the genomes, four orphan gene clusters were identified. Two different 2-amino acids (2-aa) NRPS genes were found in Ter6 and Ter291 respectively (Additional file 2: Table S9; Additional file 1: Figure S4B, C). Next to this, a 54.5 kb unknown T3PKS-NRPS gene cluster was discovered in the genome of Ter6 (Additional file 1: Figure S4A). Another 10.6 kb NRPS-T1PKS gene cluster in Ter91 (Additional file 1: Figure S4D) resembles clusters encoding the Heat-stable antifungal factor (HSAF), also referred to as dihydromaltophilin, produced by Lysobacter species [62, 63]. HSAF exhibits inhibitory activities against a wide range of fungal species by disrupting the polarized growth or the biosynthesis of a distinct group of sphingolipids of fungi [62, 63].
Terpenes are a diverse family of primary and secondary metabolites which were mostly studied in plants and fungi . Many terpenes of plant origin are known to be active against a wide variety of microorganisms, including Gram-positive, Gram-negative bacteria and fungi , but to date there are only few reports on antimicrobial activity of terpenes from microbial origin [66, 67]. In a previous study, four monoterpenes (γ-terpinene, 1S-α-pinene, β-pinene and β-myrcene) were detected in the headspace of C. pratensis strains Ter91 . Here, these monoterpenes were tested individually and as a mixture for their antimicrobial activity. The β-pinene exhibited inhibition against Staphylococcus aureus and Rhizoctonia solani and the mixture of all monoterpenes revealed inhibition against these two pathogens and also to E. coli (Fig. 6a).
Biosynthesis of terpenes is mediated by terpene cyclases, and starts from polyprenyl pyrophosphate precursors. The precursor accepted by the cyclase determines which class of terpenes it produces: the C10 precursor geranyl pyrophosphate (GPP) will lead to formation of monoterpenes, while C15 precursorfarnesyl pyrophosphate (FPP) will lead to sesquiterpenes, and the C20 precursor geranylgeranyl pyrophosphate (GGPP) will lead to di-terpenes, or to phytoene (C40). In particular mono- and sesquiterpene synthases can occur in many cyclization patterns, leading to a huge diversity of molecules.
The genomes of the six Collimonas strains were screened for gene clusters possibly involved in terpene biosynthesis. All strains carry genes related to phytoene biosynthesis (Additional file 2: Table S9). Only C. pratensis strains Ter91 and Ter291 harbored an additional cluster comprising terpene synthases genes (Fig. 3c; CPter91_2617 and CPter291_2730). Terpene cyclases are widely distributed in bacteria, but mostly characterised in Streptomyces species [69, 70]. To date only one terpene cyclase from Proteobacteria has been functionally characterised, the 2-methylenebornane synthase from Pseudomonas fluorescens Pf0-1 . CPter91_2617 and CPter291_2730 both encode a 330-amino acid protein that differs only in 2 amino acid residues (Additional file 1: Figure S5). When compared to other functionally characterized terpene cyclases, the Collimonas protein sequences showed maximally 23 % aa-identity to any previously characterized bacterial terpene cyclase (Fig. 6b).
The product specificity of mono- and sesquiterpene cyclases cannot be predicted from their primary sequence. For biochemical characterization, CPter91_2617 and CPter291-2730 genes were expressed in E. coli, partially purified and tested in vitro using FPP, GPP or GGPP as substrates. When the produced terpenes were analyzed by GC-MS, both Collimonas enzymes converted FPP to a mix of sesquiterpenes and sesquiterpene alcohols. The major peak was putatively identified as germacrene D-4-ol by comparison of the mass spectrum to the NIST 2014 spectral library, and several minor sesquiterpene peaks, including δ-cadinene (Fig. 6c). When GPP was applied as a substrate, the production of two monoterpenes identified as β-pinene and β-linalool was observed (Additional file 1: Figure S6A). A small amount of product could be observed upon the incubation of GGPP as substrate, which was putatively identified as 13-epimanool (Additional file 1: Figure S6B). Thus we characterized CPter91_2617 and CPter291_2730 as mixed mono-, sesqui- and diterpene cyclases, with major product germacrene D-4-ol. The sesquiterpene products suggest that they are functionally related to plant and fungal cadinene/cadinol and germacrene D-4-ol synthases, although sequence homology to these enzymes is low [72, 73]. One of the monoterpene products of CPter91_2617 and CPter291_2730, β-pinene, was also observed in the headspace of C. pratensis and showed antibacterial activity, suggesting a role of the Collimonas terpene cyclases in antimicrobial activity. Volatiles terpenes may have synergistic antimicrobial effects in combination with antibiotics. For example a synergistic effect of terpenes and penicillin on multiresistant strains S. aureus and E. coli was reported . Beside antimicrobial activity the production of terpenes by Collimonas may point to another important ecological role, namely chemical communication. Since terpenes volatilize easily and can be produced by all kingdoms of life including plant, fungi and bacteria, we assume that terpenes may play a significant role for Collimonas long-distance inter-kingdom interactions and communication.
The comparative analysis of six completely sequenced Collimonas genomes representing three species revealed a high degree of genomic diversity between strains with a core genome representing 43.1 to 52.7 % of the genome of all strains (Summarized in Fig. 3c). Although the genomes were largely syntenic, genome rearrangements were observed both between and within the species indicating high genomic plasticity. All Collimonas genomes carry large numbers of Genomic Islands pointing to a complex history of gene recombination and horizontal transfer between bacterial relatives. Type two secretion systems were present in all genomes suggesting that it may be important trait for the mycophagous lifestyle of Collimonas.
Genomic analysis of secondary metabolism of Collimonas revealed that genes encoding for exoenzymes such as chitinase, peptidase, phospholipase are well conserved but that there is a high variability of genes encoding for the production of other secondary metabolites such as collimomycin, lipopeptides, (PKS-)NRPS, and terpenes. Genes encoding the production of the polyacetylenic compound collimomycin and lipopeptide tripropeptin A are present only in C. fungivorans while the gene clusters encoding the production of a putative new lipopeptide is present in both C. fungivorans and C. pratensis, but not C. arenae. Moreover, several unknown orphan (PKS-)NRPS gene clusters are present in the genome of C. fungivorans and C. pratensis.
Mutational and phenotypical analyses indicated that tripropeptin A and the designated new lipopeptide have antibacterial and antifungal (oomycetal) activities, respectively. The biochemical characterization of the terpene synthases genes revealed that Collimonas are able to produce a set of sesquiterpenes and/or monoterpens that are considered to be mainly of plant origin. The in vitro assay of pure terpene compounds indicated their contributions to both antibacterial and antifungal activities.
Overall, our exploration of Collimonas genomes revealed that this bacterial group represents a valuable resource for the discovery of novel secondary metabolites and enzymes. The results gained here will be certainly helpful for designing future experimental studies that will lead to comprehensive understanding of the unique ecology of Collimonas species.
Strains and growth conditions
All bacterial strains used in this study are listed in Table 1 and Additional file 2: Table S10. Collimonas strains were cultured in 0.1 Tryptic Soy Broth (0.1 TSB) (5 g/L NaCl, 1 g/L KH2PO4, 3 g/L TSB, 20 gL − 1 CMN-Boom Agar, pH = 6.7) or King’s B medium (20 g/L proteose peptone, 1.5 g/L MgSO4, 1.2 g/L KH2PO4, 10 g/L glycerol, 15 g/L agar). Escherichia coli strain DH5α was used as a host for the plasmids used for site-directed mutagenesis. E. coli strains were grown on Luria-Bertani (LB) plates (10 g/L NaCl, 10 g/L Bacto™ Tryptone, 5 g/L Bacto™ Yeast extract, 20 g/L Merck Agar) or in LB broth amended with the appropriate antibiotics.
Genomic DNA isolation
Genomic DNA from each Collimonas strain was extracted from overnight grown cells using QIAamp® DNA Mini Kit and Qiagen® MagAttract® HMW kit and used for Illumina and PacBio RS II sequencing respectively.
Illumina paired-end sequences were obtained for C. arenae Ter10, C. arenae Ter282, C. pratensis Ter91, C. pratensis Ter291 and C. fungivorans Ter6 from BaseClear B.V. on the Illumina HiSeq2000 platform (2 x 51 bp paired-end reads, except strain Ter91 which was sequenced at 2 x 100 bp paired-end reads) and assembled using the Ray  assembler version 2.3.1 (Additional file 2: Table S11).
PacBio RS II sequences were obtained from 5 SMRT cells, one for each strain. Sequences were filtered using SMRT Analysis server v2.2.0 with default settings. The RS_HGAP Assembly.3 (HGAP3)  protocol was used to assemble the filtered reads, the RS_AH_Scaffolding protocol was used if the initial assembly yielded more than one contig, followed by a final Quiver correction using the RS_Resequencing protocol (See Additional file 2: Table S12).
The singular contigs were checked using a custom script and overlapping ends trimmed. The final circular contig for each chromosome was rearranged to start at the dnaA gene in the forward direction. There was no evidence of plasmids in the sequence data.
All Collimonas sequences including the previously published genome of Ter331 were annotated using the IGS annotation pipeline . The IGS annotation of Ter331 is attached as a Additional file 3: The accession numbers of the other five genomes after submission to NCBI are Ter6 (CP013232); Ter91 (CP013234); Ter291 (CP013236); Ter10 (CP013233) and Ter282 (CP013235).
Core, pan and variable genome analysis
Protein coding genes from the six Collimonas strains were clustered together with the protein coding genes of Burkholderia phytofirmans PsJN (NC_010681.1, NC_010676.1) and Pseudomonas protegens Pf-5 (NC_004129.6) using cd-hit  with word length 3 (−n 3), global identity (−G 1) and a minimal alignment coverage of 60 % for the shortest protein (−aS 0.6). Cd-hit clusters were parsed into an absence-presence matrix from which the core, pan and variable genomes were parsed using custom scripts. COG annotations were determined using kognitor . Core and pan evolution plot is generated based on . At each number of strains (n) out of strain set (s), s!/n!*(s-n)! combinations are possible. The median number of specific, variable and core genes for all combinations are plotted as a function of n. Clusters containing one gene per strain were selected from the core cluster set of the Collimonas, Burkholderia and Pseudomonas strains were aligned with MAFFT and combined in one pseudoalignment. Redundant colums were removed and maximum likelihood phylogenetic trees were calculated with RAxML .
The annotation of the previously-published genome of Ter331 was updated and manually curated as part of this study. Synteny analyses were performed using Progressive MAUVE . Phylogenetic analyses on the whole genome was performed using Gegenees , and 16S rDNA analysis was performed using MEGA6 [84–87]. Whole genome peptidases prediction was conducted by MEROPS . Phospholipases were predicted by searching on the basis of the profile HMM using PFAM domains of phospholipase A1 (PF02253), phospholipase A2 (PF09056), phospholipase C (PF05506) and phospholipase D (PF00614). Secondary metabolite production clusters were examined using the antiSMASH program [89, 90]. The amino acid composition of products from NRPS sequences were predicted using NRPSpredictor 2 . Genomic islands were identified using IslandViewer [92, 93] and phages elements and features were identified using PHAST . CRISPRs were identified based on CRISPRfinder . Whole genome analysis for type VI secretion system was conducted with SecRet6  and circular genome diagrams were visualized using Circos .
Availability of supporting data
Further methodological details are provided in the Additional file 4: Materials and Methods.
Nonribosomal peptide synthetase
Clusters of Orthologous Groups
Clustered Regularly Interspaced Short Palindromic Repeats
de Boer W, Leveau JH, Kowalchuk GA, Klein Gunnewiek PJ, Abeln EC, Figge MJ, et al. Collimonas fungivorans gen. nov., sp. nov., a chitinolytic soil bacterium with the ability to grow on living fungal hyphae. Int J Syst Evol Microbiol. 2004;54(Pt 3):857–64.
Leveau JH, Uroz S, de Boer W. The bacterial genus Collimonas: Mycophagy, weathering and other adaptive solutions to life in oligotrophic soil environments. Environ Microbiol. 2010;12(2):281–92.
De Boer W, Gunnewiek PJAK, Lafeber P, Janse JD, Spit BE, Woldendorp JW. Anti-fungal properties of chitinolytic dune soil bacteria. Soil Biol Biochem. 1998;30(2):193–203.
Rudnick M-B: Mycophagous soil bacteria. PhD thesis. 2015:87–124.
Hoppener-Ogawa S, de Boer W, Leveau JH, van Veen JA, de Brandt E, Vanlaere E, et al. Collimonas arenae sp. nov. and Collimonas pratensis sp. nov., isolated from (semi-)natural grassland soils. Int J Syst Evol Microbiol. 2008;58(Pt 2):414–9.
Mela F, Fritsche K, de Boer W, van Veen JA, de Graaff LH, van den Berg M, et al. Dual transcriptional profiling of a bacterial/fungal confrontation: Collimonas fungivorans versus Aspergillus niger. ISME J. 2011;5(9):1494–504.
Mela F, Fritsche K, de Boer W, van den Berg M, van Veen JA, Maharaj NN, et al. Comparative genomics of bacteria from the genus Collimonas: Linking (dis)similarities in gene content to phenotypic variation and conservation. Environ Microbiol Rep. 2012;4(4):424–32.
Wu JJ, de Jager VC, Deng WL, Leveau JH: Finished genome sequence of Collimonas arenae Cal35. Genome announcements 2015, 3(1) doi: 10.1128/genomeA.01408-14.
Mela F, Fritsche K, Boersma H, van Elsas JD, Bartels D, Meyer F, et al. Comparative genomics of the pIPO2/pSB102 family of environmental plasmids: Sequence, evolution, and ecology of pTer331 isolated from Collimonas fungivorans Ter331. FEMS Microbiol Ecol. 2008;66(1):45–62.
Grissa I, Vergnaud G, Pourcel C. CRISPRFinder: A web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 2007;35:W52–7.
Cianciotto NP. Many substrates and functions of type II secretion: Lessons learned from Legionella pneumophila. Future Microbiol. 2009;4(7):797–805.
DeShazer D, Brett PJ, Burtnick MN, Woods DE. Molecular characterization of genetic loci required for secretion of exoproducts in Burkholderia pseudomallei. J Bacteriol. 1999;181(15):4661–4.
Tomich M, Planet PJ, Figurski DH. The tad locus: Postcards from the widespread colonization island. Nat Rev Microbiol. 2007;5(5):363–75.
Planet PJ, Kachlany SC, Fine DH, DeSalle R, Figurski DH. The widespread colonization island of Actinobacillus actinomycetemcomitans. Nat Genet. 2003;34(2):193–8.
Nika JR, Latimer JL, Ward CK, Blick RJ, Wagner NJ, Cope LD, et al. Haemophilus ducreyi requires the flp gene cluster for microcolony formation in vitro. Infect Immun. 2002;70(6):2965–75.
Durand E, Bernadac A, Ball G, Lazdunski A, Sturgis JN, Filloux A. Type II protein secretion in Pseudomonas aeruginosa: the pseudopilus is a multifibrillar and adhesive structure. J Bacteriol. 2003;185(9):2749–58.
Hoppener-Ogawa S, Leveau JH, Hundscheid MP, van Veen JA, de Boer W. Impact of Collimonas bacteria on community composition of soil fungi. Environ Microbiol. 2009;11(6):1444–52.
Cornelis GR, Wolf-Watz H. The Yersinia Yop virulon: A bacterial system for subverting eukaryotic cells. Mol Microbiol. 1997;23(5):861–7.
Galan JE, Collmer A. Type III secretion machines: Bacterial devices for protein delivery into host cells. Science. 1999;284(5418):1322–8.
Warmink JA, van Elsas JD. Selection of bacterial populations in the mycosphere of Laccaria proxima: Is type III secretion involved? ISME J. 2008;2(8):887–900.
Tseng TT, Tyler BM, Setubal JC. Protein secretion systems in bacterial-host associations, and their description in the gene ontology. BMC Microbiol. 2009;9 Suppl 1:S2.
Jackson RW, Preston GM, Rainey PB. Genetic characterization of Pseudomonas fluorescens SBW25 rsp gene expression in the phytosphere and in vitro. J Bacteriol. 2005;187(24):8477–88.
Schwarz S, Hood RD, Mougous JD. What is type VI secretion doing in all those bugs? Trends Microbiol. 2010;18(12):531–7.
Russell AB, Hood RD, Bui NK, LeRoux M, Vollmer W, Mougous JD. Type VI secretion delivers bacteriolytic effectors to target cells. Nature. 2011;475(7356):343–U392.
Records AR. The type VI secretion system: A multipurpose delivery system with a phage-like machinery. Mol Plant Microbe In. 2011;24(7):751–7.
Stock AM, Robinson VL, Goudreau PN. Two-component signal transduction. Annu Rev Biochem. 2000;69:183–215.
Ulrich LE, Koonin EV, Zhulin IB. One-component systems dominate signal transduction in prokaryotes. Trends Microbiol. 2005;13(2):52–6.
Krell T, Busch A, Lacal J, Silva-Jimenez H, Ramos JL. The enigma of cytosolic two-component systems: A hypothesis. Environ Microbiol Rep. 2009;1(3):171–6.
Lewenza S, Conway B, Greenberg EP, Sokol PA. Quorum sensing in Burkholderia cepacia: identification of the LuxRI homologs CepRI. J Bacteriol. 1999;181(3):748–56.
Aguilar C, Friscina A, Devescovi G, Kojic M, Venturi V. Identification of quorum-sensing-regulated genes of Burkholderia cepacia. J Bacteriol. 2003;185(21):6456–62.
Hakvag S, Fjaervik E, Klinkenberg G, Borgos SE, Josefsen KD, Ellingsen TE, et al. Violacein-producing Collimonas sp. from the sea surface microlayer of costal waters in Trondelag, Norway. Mar Drugs. 2009;7(4):576–88.
Fritsche K, van den Berg M, de Boer W, van Beek TA, Raaijmakers JM, van Veen JA, et al. Biosynthetic genes and activity spectrum of antifungal polyynes from Collimonas fungivorans Ter331. Environ Microbiol. 2014;16(5):1334–45.
Rabea EI, Badawy MET, Stevens CV, Smagghe G, Steurbaut W. Chitosan as antimicrobial agent: Applications and mode of action. Biomacromolecules. 2003;4(6):1457–65.
Fritsche K, de Boer W, Gerards S, van den Berg M, van Veen JA, Leveau JH. Identification and characterization of genes underlying chitinolysis in Collimonas fungivorans Ter331. FEMS Microbiol Ecol. 2008;66(1):123–35.
Kobayashi DY, Reedy RM, Bick J, Oudemans PV. Characterization of a chitinase gene from Stenotrophomonas maltophilia strain 34S1 and its involvement in biological control. Appl Environ Microb. 2002;68(3):1047–54.
Chang WT, Chen CS, Wang SL. An antifungal chitinase produced by Bacillus cereus with shrimp and crab shell powder as a carbon source. Curr Microbiol. 2003;47(2):102–8.
Chernin L, Ismailov Z, Haran S, Chet I. Chitinolytic Enterobacter agglomerans antagonistic to fungal plant-pathogens. Appl Environ Microb. 1995;61(5):1720–6.
Dahiya N, Tewari R, Tiwari R, Hoondal G. Production of an antifungal chitinase from Enterobacter sp NRG4 and its application in protoplast production. World J Microb Biot. 2005;21(8–9):1611–6.
De Boer W, Gunnewiek PJAK, Kowalchuk GA, Van Veen JA. Growth of chitinolytic dune soil beta-subclass Proteobacteria in response to invading fungal hyphae. Appl Environ Microb. 2001;67(8):3358–62.
Schmiel DH, Miller VL. Bacterial phospholipases and pathogenesis. Microbes Infect. 1999;1(13):1103–12.
Page MJ, Di Cera E. Serine peptidases: classification, structure and function. Cell Mol Life Sci. 2008;65(7–8):1220–36.
Gottesman S. Proteolysis in bacterial regulatory circuits. Annu Rev Cell Dev Bi. 2003;19:565–87.
Hengge R, Bukau B. Proteolysis in prokaryotes: Protein quality control and regulatory principles. Mol Microbiol. 2003;49(6):1451–62.
Jenal U, Hengge-Aronis R. Regulation by proteolysis in bacterial cells. Curr Opin Microbiol. 2003;6(2):163–72.
de Bruijn I, Raaijmakers JM. Regulation of cyclic lipopeptide biosynthesis in Pseudomonas fluorescens by the ClpP protease. J Bacteriol. 2009;191(6):1910–23.
Chu BC, Garcia-Herrero A, Johanson TH, Krewulak KD, Lau CK, Peacock RS, et al. Siderophore uptake in bacteria and the battle for iron with the host; a bird’s eye view. Biometals: Intl J Role Met Ions Biol Biochem Med. 2010;23(4):601–11.
Hider RC, Kong XL. Chemistry and biology of siderophores. Nat Prod Rep. 2010;27(5):637–57.
Agnoli K, Lowe CA, Farmer KL, Husnain SI, Thomas MS. The ornibactin biosynthesis and transport genes of Burkholderia cenocepacia are regulated by an extracytoplasmic function sigma factor which is a part of the Fur regulon. J Bacteriol. 2006;188(10):3631–44.
Andrews SC. Iron storage in bacteria. Adv Microb Physiol. 1998;40:281–351.
Schalk IJ, Yue WW, Buchanan SK. Recognition of iron-free siderophores by TonB-dependent iron transporters. Mol Microbiol. 2004;54(1):14–22.
Braun V, Mahren S, Ogierman M. Regulation of the FecI-type ECF sigma factor by transmembrane signalling. Curr Opin Microbiol. 2003;6(2):173–80.
Holden MTG, Seth-Smith HMB, Crossman LC, Sebaihia M, Bentley SD, Cerdeno-Tarraga AM, et al. The genome of Burkholderia cenocepacia J2315, an epidemic pathogen of cystic fibrosis patients (vol 191, pg 261, 2009). J Bacteriol. 2009;191(8):2907–7.
Franken ACW, Lokman BC, Ram AFJ, van den Hondel CAMJJ, de Weert S, Punt PJ. Analysis of the role of the Aspergillus niger aminolevulinic acid synthase (hemA) gene illustrates the difference between regulation of yeast and fungal haem- and sirohaem-dependent pathways. Fems Microbiol Lett. 2012;335(2):104–12.
Raaijmakers JM, de Bruijn I, Nybroe O, Ongena M. Natural functions of lipopeptides from Bacillus and Pseudomonas: more than surfactants and antibiotics. Fems Microbiol Rev. 2010;34(6):1037–62.
Raaijmakers JM, de Bruijn I, de Kock MJD. Cyclic lipopeptide production by plant-associated Pseudomonas spp.: Diversity, activity, biosynthesis, and regulation. Mol Plant Microbe In. 2006;19(7):699–710.
Raaijmakers JM, Mazzola M. Diversity and natural functions of antibiotics produced by beneficial and plant pathogenic bacteria. Annu Rev Phytopathol. 2012;50:403–24.
Hashizume H, Igarashi M, Hattori S, Hori M, Hamada M, Takeuchi T. Tripropeptins, novel antimicrobial agents produced by Lysobacter sp. I. Taxonomy, isolation and biological activities. J Antibiot. 2001;54(12):1054–9.
Hashizume H, Hirosawa S, Sawa R, Muraoka Y, Ikeda D, Naganawa H, et al. Tripropeptins, novel antimicrobial agents produced by Lysobacter sp - II. Struct Elucidation J Antibiot. 2004;57(1):52–8.
Medema MH, Paalvast Y, Nguyen DD, Melnik A, Dorrestein PC, Takano E, et al. Pep2Path: Automated mass spectrometry-guided genome mining of peptidic natural products. Plos Comput Biol. 2014;10(9):e1003822.
Adesina MF, Lembke A, Costa R, Speksnijder A, Smalla K. Screening of bacterial isolates from various European soils for in vitro antagonistic activity towards Rhizoctonia solani and Fusarium oxysporum: Site-dependent composition and diversity revealed. Soil Biol Biochem. 2007;39(11):2818–28.
Opelt K, Berg G. Diversity and antagonistic potential of bacteria associated with bryophytes from nutrient-poor habitats of the Baltic Sea coast. Appl Environ Microb. 2004;70(11):6569–79.
Yu FG, Zaleta-Rivera K, Zhu XC, Huffman J, Millet JC, Harris SD, et al. Structure and biosynthesis of heat-stable antifungal factor (HSAF), a broad-spectrum antimycotic with a novel mode of action. Antimicrob Agents Ch. 2007;51(1):64–72.
Lou LL, Qian GL, Xie YX, Hang JL, Chen HT, Zaleta-Riyera K, et al. Biosynthesis of HSAF, a tetramic acid-containing macrolactam from Lysobacter enzymogenes. J Am Chem Soc. 2011;133(4):643–5.
Trapp SC, Croteau RB. Genomic organization of plant terpene synthases and molecular evolutionary implications. Genetics. 2001;158(2):811–32.
Tholl D. Terpene synthases and the regulation, diversity and biological roles of terpene metabolism. Curr Opin Plant Biol. 2006;9(3):297–304.
Gurtler H, Pedersen R, Anthoni U, Christophersen C, Nielsen PH, Wellington EM, et al. Albaflavenone, a sesquiterpene ketone with a zizaene skeleton produced by a streptomycete with a new rope morphology. J Antibiot. 1994;47(4):434–9.
Zhao B, Lin X, Lei L, Lamb DC, Kelly SL, Waterman MR, et al. Biosynthesis of the sesquiterpene antibiotic albaflavenone in Streptomyces coelicolor A3(2). J Biol Chem. 2008;283(13):8183–9.
Garbeva P, Hordijk C, Gerards S, de Boer W. Volatile-mediated interactions between phylogenetically different soil bacteria. Front Microbiol. 2014;5:289.
Cane DE, Ikeda H. Exploration and mining of the bacterial terpenome. Acc Chem Res. 2012;45(3):463–72.
Yamada Y, Kuzuyama T, Komatsu M, Shin-Ya K, Omura S, Cane DE, et al. Terpene synthases are widely distributed in bacteria. Proc Natl Acad Sci U S A. 2015;112(3):857–62.
Chou WKW, Ikeda H, Cane DE: Cloning and characterization of Pfl_1841, a 2-methylenebornane synthase in Pseudomonas fluorescens PfO-1. Tetrahedron 2011, 67(35):6627–6632.
Yoshikuni Y, Martin VJ, Ferrin TE, Keasling JD. Engineering cotton (+)-delta-cadinene synthase to an altered function: Germacrene D-4-ol synthase. Chem Biol. 2006;13(1):91–8.
Lauchli R, Pitzer J, Kitto RZ, Kalbarczyk KZ, Rabe KS. Improved selectivity of an engineered multi-product terpene synthase. Org Biomol Chem. 2014;12(23):4013–20.
Gallucci MN, Oliva M, Casero C, Dambolena J, Luna A, Zygadlo J, et al. Antimicrobial combined action of terpenes against the food-borne microorganisms Escherichia coli, Staphylococcus aureus and Bacillus cereus. Flavour Frag J. 2009;24(6):348–54.
Boisvert S, Laviolette F, Corbeil J. Ray: Simultaneous assembly of reads from a mix of high-throughput sequencing technologies. J Comput Biol. 2010;17(11):1519–33.
Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods. 2013;10(6):563. −+.
Galens K, Orvis J, Daugherty S, Creasy HH, Angiuoli S, White O, et al. The IGS standard operating procedure for automated prokaryotic annotation. Stand Genomic Sci. 2011;4(2):244–51.
Huang Y, Niu BF, Gao Y, Fu LM, Li WZ. CD-HIT Suite: A web server for clustering and comparing biological sequences. Bioinformatics. 2010;26(5):680–2.
Snel B, Bork P, Huynen MA. The identification of functional modules from the genomic association of genes. Proc Natl Acad Sci U S A. 2002;99(9):5890–5.
Tettelin H, Masignani V, Cieslewicz MJ, Donati C, Medini D, Ward NL, et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”. Proc Natl Acad Sci U S A. 2005;102(39):13950–5.
Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3.
Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14(7):1394–403.
Agren J, Sundstrom A, Hafstrom T, Segerman B. Gegenees: Fragmented alignment of multiple genomes for determining phylogenomic distances and genetic signatures unique for specified target groups. Plos One. 2012;7(6):e39107.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.
Saitou N, Nei M. The neighbor-joining method - a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4(4):406–25.
Felsenstein J. Phylogenies and the comparative method. Am Nat. 1985;125(1):1–15.
Tamura K, Nei M, Kumar S. Prospects for inferring very large phylogenies by using the neighbor-joining method. Proc Natl Acad Sci U S A. 2004;101(30):11030–5.
Rawlings ND, Waller M, Barrett AJ, Bateman A. MEROPS: The database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2014;42(Database issue):D503–509.
Medema MH, Blin K, Cimermancic P, de Jager V, Zakrzewski P, Fischbach MA, et al. antiSMASH: Rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39:W339–46.
Blin K, Medema MH, Kazempour D, Fischbach MA, Breitling R, Takano E, et al. antiSMASH 2.0-a versatile platform for genome mining of secondary metabolite producers. Nucleic Acids Res. 2013;41(W1):W204–12.
Rottig M, Medema MH, Blin K, Weber T, Rausch C, Kohlbacher O. NRPSpredictor2-a web server for predicting NRPS adenylation domain specificity. Nucleic Acids Res. 2011;39:W362–7.
Langille MGI, Brinkman FSL. IslandViewer: An integrated interface for computational identification and visualization of genomic islands. Bioinformatics. 2009;25(5):664–5.
Dhillon BK, Chiu TA, Laird MR, Langille MGI, Brinkman FSL. IslandViewer update: Improved genomic island discovery and visualization. Nucleic Acids Res. 2013;41(W1):W129–32.
Zhou Y, Liang YJ, Lynch KH, Dennis JJ, Wishart DS. PHAST: A fast phage search tool. Nucleic Acids Res. 2011;39:W347–52.
Li J, Yao Y, Xu HH, Hao L, Deng Z, Rajakumar K, et al. SecReT6: A web-based resource for type VI secretion systems found in bacteria. Environ Microbiol. 2015;17(7):2196–202.
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45.
This research was supported in part by funds from Ecolink (260–45140) and by The Netherlands Organization for Scientific Research (NWO) VIDI personal grant to P.G (864.11.015). This publication is No.5983 of the Netherlands Institute of Ecology (NIOO-KNAW).
The authors declare no conflict of interest.
CS and PG designed the experiments. CS, RS and VJ performed the bioinformatics analysis. CS, DK, EJ, KC, AV conducted the experiments. CS drafted the manuscript. PG, JB, KC, WB and JV revised the manuscript. All authors read and approved the final manuscript.
Phylogenetic tree based on 16S rDNA depicting the relationships of sequenced strains of Collimonas. Underneath the genes are the module and domain organization of PKS or NRPS genes. The domains are as follows: C, condensation; A, adenylation; T, thiolation; E, Epimerization; TE, thioesterification; CAL, Co-enzyme A ligase domain; KS, Ketosynthase domain; AT, Acyltransferase domain; ER, Enoylreductase domain; KR, Ketoreductase domain; DH, Dehydratase domain; and ACP, Acyl-carrier protein domain. Underneath the domains are the amino acids that are incorporated into the peptide moiety. The number associated with the amino acid refers to the position of the amino acid in the peptide chain. Figure S2. Synteny of the six Collimonas genomes. Pairwise alignments of genomes were generated using Mauve (A) C. fungivorans (B) C. pratensis (C) C. arenae and (D) The three species together. Colored outlined blocks surround the regions of the genomic sequence that aligned to another genome. The colored bars inside the blocks are related to the level of sequence similarities. The analysis showed that the highest number of rearrangements was evident between all the three species. Figure S3. (A) Conserved gene clusters for type II (T2SSa/T2SSb), III (T3SSa/T3SSb) and VI (T6SS) secretion systems identified in Collimonas strains. Quorum sensing assays of the Collimonas strains. Quorum sensing activity of Collimonas strains Ter331, Ter6, Ter91, Ter291, Ter10 and Ter282 with indicator strain (B) C. violaceum CV026 and (C) A. tumefaciens NT1 (outer colonies). A purple (C. violaceum CV026) or blue (A. tumefaciens NT1) pigment produced by the indicator strains is indicative of quorum sensing activity of the tested strains. Figure S4. Organization of the orphan gene clusters and predicted amino acid compositions. Figure S5. Amino acid alignment of Collimonas terpene synthases CPter91_2617 and CPter291_2730 with previously characterised bacterial terpene synthases. The Collimonas terpene synthases were aligned with the Streptomyces exfoliatus pentalenene synthase (Se_pentalenene), S. coelicolor geosmin synthase (Sc_geosmin, 336 amino acids of the N-terminus), Streptosporangium roseum epi-cubenol synthase (Sr_epicubenol), S. avermitilis avermitilol synthase (Sa_avermitilol), S. clavuligerus 1,8-cineole synthase (Scl_cineole) and Pseudomonas fluorescens 2-methylenebornane synthase (Pf_methylenebornane). The characteristic terpene synthase divalent metal-binding motifs, namely the acidic amino acid-rich motif and the NSE triad, are boxed. Figure S6. GC-MS chromatograms of Ter91 terpene cyclase incubated with different substrates, and mass spectra of major products. The GC-MS chromatograms of Ter291 were identical to Ter91 (data not shown). Empty vector chromatograms shows products from a control enzyme extract (pACYC-duet-1). (A) Ter91 terpene synthase with GPP. TIC 100 % = 1.19E4. Major monoterpene product at RT 6.79, identified as Beta-pinene by authentic standard. (B) Ter91 terpene synthase with GGPP. TIC 100 % = 2.12E4. Major diterpene product at RT 20.24, identified as 13-epimanool comparison of mass spectra to the NIST8 library. (PDF 570 kb)
Gene locus of the flagellar biosynthetic gene cluster of the Collimonas strains. Table S2. Gene locus of the secretion systems of the Collimonas strains. Table S3. Genomic distribution of signal transduction proteins in six Collimonas genomes. Table S4. Presence of quorum-sensing proteins in six Collimonas genomes. Table S5. Gene locus of the collimomycin biosynthetic gene cluster of the Collimonas strains. Table S6. Gene locus of the chitinase biosynthetic gene clusters of the Collimonas strains. Table S7. Gene locus of the phopholipase genes of the Collimonas strains. Table S8. Antimicrobial activities of the Collimonas strains. Table S9. Genes encoding secondary metabolites and secretion systems in Collimonas strains. Table S10. Strains and mutants used in this study. Table S11. Summary of Illumina paired-end sequences assembly data. Table S12. Summary of PacBio sequences assembly data. Table S13. Sequences of characterized bacterial terpene synthases used in the phylogenetic analysis. (XLSX 56 kb)
The IGS annotation of Collimonas fungivorans Ter331 genome. (GBF 10511 kb)
Materials and Methods. Additional detailed materials and methods. (DOCX 45 kb)
About this article
Cite this article
Song, C., Schmidt, R., de Jager, V. et al. Exploring the genomic traits of fungus-feeding bacterial genus Collimonas . BMC Genomics 16, 1103 (2015) doi:10.1186/s12864-015-2289-3
- Comparative genomics
- Secondary metabolites