Metagenome-mining indicates an association between bacteriocin presence and strain diversity in the infant gut
BMC Genomics volume 24, Article number: 295 (2023)
Our knowledge about the ecological role of bacterial antimicrobial peptides (bacteriocins) in the human gut is limited, particularly in relation to their role in the diversification of the gut microbiota during early life. The aim of this paper was therefore to address associations between bacteriocins and bacterial diversity in the human gut microbiota. To investigate this, we did an extensive screening of 2564 healthy human gut metagenomes for the presence of predicted bacteriocin-encoding genes, comparing bacteriocin gene presence to strain diversity and age.
We found that the abundance of bacteriocin genes was significantly higher in infant-like metagenomes (< 2 years) compared to adult-like metagenomes (2–107 years). By comparing infant-like metagenomes with and without a given bacteriocin, we found that bacteriocin presence was associated with increased strain diversities.
Our findings indicate that bacteriocins may play a role in the strain diversification during the infant gut microbiota establishment.
Gut colonization during infancy is a complex process, involving recruitment of strains and species until an adult-like gut microbiota is reached . The diversity increases drastically the first years of life, and factors involved are the mode of delivery, antibiotic usage, microbial exposure, and diet [2,3,4]. However, the underlying ecological forces of the early bacterial recruitment and increasing diversity remain poorly understood . The ecological forces shaping the gut microbiota have been considered to include the opposing selection pressures from the host and the bacteria themselves . Orchestration by the host on the gut microbiota, such as host immunity control of the bacterial composition, is termed top-down selection . Top-down selection has been suggested to be of the diversifying kind, ensuring stability and functional redundancy in the gut. On the other hand, bottom-up mechanisms are commonly considered to reduce diversity . However, in cases of intransitive competition, bacterial-bacterial interactions may increase the diversity in a bacterial community . Intransitive competition means that there is no dominant competitor among the species competing for the same resources. This allows coexistence between species in the same niche . The most widely explored model for this is the rock-paper-scissors (RPS) based niche competition [9,10,11,12]. RPS dynamics is the simplest form of intransitive competition, modelling competition outcomes in a system with three competing components [13, 14]. Such a scenario, where there is no definite winner, can occur when mechanisms for competitiveness come at a high price . An example of a costly, competitive strategy is the use of antimicrobial compounds . Therefore, the diversification process of microbial communities can involve the production of antimicrobial compounds .
One of the most widely distributed antimicrobial compound produced by bacteria are bacteriocins . Bacteriocins are ribosomally synthesized peptides or large proteins with antibacterial activity [17, 18]. These compounds are produced by bacteria and archaea to enhance their competitiveness for nutrients and ecological niches . Bacteriocins can target and disrupt the cell membrane integrity, or inhibit transcription, translation, replication or synthesis of the cell wall . They are divided into three classes (I-III) based on their structure, and each class includes several subclasses . Since the bacteriocin databases BAGEL4 and BACTIBASE do not contain complete subclass information, we have not focused on subclasses in this article. Class I bacteriocins are small (< 10 kDa) post-translationally modified peptides, while class II bacteriocins are unmodified (< 10 kDa). Bacteriocins in class III are large (> 10 kDa) and heat-lable, often with enzymatic activity . Bacteriocin genes of all three classes have been identified in bacteria from different sites of the human body, including the gut .
Genome mining projects have discovered a vast variety of gut bacteria with the potential to produce bacteriocins [21, 22]. Cultivating experiments of fecal samples from infant guts and mother’s milk have resulted in findings of bacteria with antimicrobial activity, demonstrating that the infant gut accommodates bacteriocin-producing bacteria [23,24,25,26,27]. The bacteriocin producers may contribute to the formation of the infant gut microbiota as it has been shown that addition of bacteriocins to a gut bacterial community modulates the bacterial composition [28, 29]. However, to our knowledge, a complete characterization of bacteriocin-producing bacteria in the infant gut microbiota, from a microbial ecological point of interest, has not been performed. Thus, there is a need for extensive exploration of bacteriocins and their ecological role in the infant gut.
The objective of this study was to test the idea of that bacteriocins can play a role in the gut microbiota establishment and diversification during the transition from an infant- to an adult-like microbiota. This was done by performing a comprehensive bacteriocin screening of gut metagenomes from healthy people to identify and characterize bacteriocins that are enriched in infants and investigate their potential role in association to composition and diversity of the infant gut microbiota. We collected all bacteriocin sequences from two public databases [30,31,32] and searched for their presence in publicly available gut metagenomes from healthy persons, as well as in the HumGut catalogue of human gut genomes . An outline of the strategies used in this paper is shown in Fig. 1.
Our main findings were that infant-like metagenomes contain a higher abundance of bacteriocin genes compared to adult-like metagenomes, and that the strain diversity in the infant-like metagenomes was higher if bacteriocin genes were present compared to if they were absent. These results indicate that bacteriocins are of importance in the early gut microbiota formation, possibly by promotion of strain diversity.
Overall distribution of bacteriocin genes in gut metagenomes
We executed a bacteriocin search (diamond blastx) in 2564 metagenomes from healthy persons consisting of more than 8 billion reads in total, with a collective size of around 10 terabases, using all the 1075 unique bacteriocin sequences from the bacteriocin databases BAGEL4 and BACTIBASE. Overall, 5.1 × 106 bacteriocin-matching reads were identified (E-value below 10–5).
Microbial diversity and distribution of bacteriocin genes in gut metagenomes
We divided the gut metagenomes into five age groups: infant (age 0–1), child (age 2–9), adolescent (age 10–19), adult (age 20–59) and elderly (age 60–107). In panel a of Fig. 2, the metagenomes in the infant group show a separation from the metagenomes in the other age groups. From panel b we observe that the infant metagenomes had a lower alpha diversity than the metagenomes in the other age groups since the top 10 families cover more of the total abundance, and this can also be seen in Shannon diversity plot in Supplementary Fig. S1.
From the bacteriocin search, we found that the fraction of bacteriocin-matching reads per total number of reads in the metagenomes of the infant group was higher compared to the metagenomes in the other age groups, as shown in Fig. 3. The same pattern was seen when comparing bacteriocin-matching reads of class I, II and III. The Wilcoxon rank-sum test indicated that the difference in abundance between the infant group and each of the other age groups was significant in all cases (p < 0.01), while comparisons between the other age groups yielded p-values above 0.05. Therefore, we treated the metagenomes of the age groups child, adolescent, adult, and elderly as one group in the downstream analyses, and defined all these metagenomes as adult-like metagenomes (n = 1963), while the metagenomes in the infant group from now on is specified as infant-like. A Wilcoxon rank-sum test performed for the fraction of bacteriocin-matching reads per total number of reads in the infant-like metagenomes and adult-like metagenomes indicated that there was a highly significant difference in all cases (p < 10–10).
Enriched bacteriocin genes in infant-like metagenomes
To identify which of the bacteriocin genes detected in the bacteriocin search that were enriched in the infant-like metagenomes, the bacteriocin gene abundance in the infant-like and adult-like metagenomes were compared using a Wilcoxon rank-sum test with multiple testing correction, identifying 53 bacteriocin genes (q < 0.01). These belonged to 42 bacteriocin clusters based on protein sequence similarity, determined by all-versus-all pairwise alignments using blast + (similarity > 0.95). The number of infant-like metagenomes in which these 42 bacteriocin genes were detected ranged between 10 and 53%. We therefore categorized these bacteriocin genes as highly prevalent bacteriocin genes (> 30%), medium prevalent bacteriocin genes (20–30%), and low prevalent bacteriocin genes (10–20%). The highly prevalent bacteriocin genes were the encoding genes of the following bacteriocins, presented in descending prevalence: BlpU, Colicin E9, Pyocin S1, BlpK, BlpD/Thermophilin 9, Colicin and Enterolysin A. The medium prevalent bacteriocin genes were the encoding genes of the following bacteriocins: Colicin Ia, Enterolysin A, BlpU, Salivaricin 9 and rSAM-modified RiPP 019. Notice that although some of the protein names of the medium prevalent bacteriocins genes resemble some of the highly prevalent ones, their protein sequences share less than 0.95 similarity. The low prevalent bacteriocin genes were not analyzed further. Bacteriocin gene prevalence, abundance, standard deviation, and q-value are shown for all the enriched and clustered bacteriocins in Supplementary Table S1.
Distribution of enriched bacteriocin genes in gut bacterial genomes
Bacteriocin screening of the metagenomes identified which bacteriocin genes that were present in the gut microbiomes, and we were interested in linking the genes to their producer species in the gut. To obtain an overview of the distribution of the enriched bacteriocin genes in gut bacteria, a bacteriocin search (diamond blastx) was performed on 381 000 gut bacterial genomes . For a given taxonomic rank (species or genus), a bacteriocin would typically align with a fraction of the genomes under some taxon. The average fractions within selected genera (average fractions > 0.01) shown in Fig. 4, indicate that the highly prevalent enriched bacteriocin genes are present in genomes belonging to bacteria found in the gut. The Pyocin S1-encoding gene was detected in just one genus, namely Pseudomonas. The Blp-encoding genes were found in two genera, both within the Firmicutes, showing the highest prevalence in Streptococcus. The gene encoding Enterolysin A was restricted to the Firmicutes, showing high prevalence in Enterococcus and Pediococcus. The genes encoding Colicin and Colicin E9 were detected in 10 and 17 genera, respectively, being most prevalent in Gammaproteobacteria, but they were also found in Sphingobacteriia and Negativicutes. The distribution of the highly prevalent genes among the different taxa at species-level varied, as some were restricted to specific species while others were detected in most species within one genus (Supplementary Fig. S2). The medium prevalent BlpU-encoding gene and Salivaricin 9-encoding gene displayed similar detection patterns as the highly prevalent Blp-encoding genes, showing the highest prevalence in Streptococcus. Compared to the highly prevalent Colicin-encoding genes, the detection of the medium prevalent Colicin Ia-encoding gene was restricted to Shigella, Escherichia and Klebsiella. The medium prevalent Enterolysin A-encoding gene showed a similar detection pattern as the highly prevalent Enterolysin A-encoding gene. However, the medium prevalent rSAM-modified Ripp 019 was detected in different genera then the other mentioned bacteriocins, being most prevalent in Tannerella and Phocaeicola (Supplementary Fig. S3).
Search for bacteriocin associated genes in bacteriocin containing contigs
To approach the question of functional bacteriocin production and secretion by the bacteria in the infant gut, we looked for bacteriocin associated genes adjacent to the enriched bacteriocin genes. Firstly, we assembled contigs from selected infant-like metagenomes. Next, 13 contigs that contained highly prevalent enriched bacteriocin genes were identified based on a tblastn search. The number of contigs aligning with one of the bacteriocins differed from one to three contigs, and in the case of the Blp bacteriocins, all these bacteriocins aligned to the same contig on the same location. Therefore, these genes were treated as one group in this analysis. Lastly, excerpts were made of maximum 10 000 bp upstream and downstream from the bacteriocin genes, and using blastp, genes involved in bacteriocin secretion, immunity or the bacteriocin itself, were detected. Table 1 shows that we detected (E-value < 10–5) both an immunity gene and a secretion protein in the proximity of the bacteriocin structural gene, indicating working operons. The taxonomic classification of the contigs concurred with the genera standing out as bacteriocin producers in the bacteriocin screening of the gut bacterial genomes (Table 1). The detected genes are illustrated on their respective contigs in Supplementary Fig. S4.
Association between bacteriocin genes and within-species diversity in the infant gut
To assess the effects of bacteriocins on the bacterial composition in the infant gut, we investigated the within-species diversity in the infant-like metagenomes, focusing on the enriched bacteriocin genes and the producer species of these. The reason for looking at within-species diversity is that bacteriocins are known to mostly affect close relatives. The producer species were chosen based on the bacteriocin search in gut bacterial genomes, selecting the species with the highest bacteriocin gene prevalence. We estimated within-species diversity for a given metagenome by classifying metagenome reads and determining the number of different genomes from the species in which the reads were assigned. A higher number of genomes indicates a higher within-species diversity, but this value depends on how abundant the species is in the metagenome. Therefore, we also collected the number of reads per 1 million that were assigned to the species. In Fig. 5, the relation between the number of genomes (y-axis) and the number of reads assigned to the same species (x-axis) is shown. But more important, when coloring metagenomes according to bacteriocin gene presence or absence, we observe a pattern. Metagenomes with a bacteriocin gene consistently exhibited a larger diversity, i.e. more genomes per species. This difference is significant in most cases, and in the remaining cases, the green regression line is always (slightly) above the pink, indicating the same trend. The case for the Pyocin S1-encoding gene and Pseudomonas aeruginosa (lower right panel) is not significant, likely because there were only three HumGut genomes for this species, and therefore the resolution became too low. A more extensive list of producer species was analyzed, and the same trend can be seen for these species as well (Supplementary Fig. S5). The medium prevalent bacteriocin genes display similar regression trends as the highly prevalent genes (Supplementary Fig. S5).
The bacteriocin gene enrichment in infants compared to adults can possibly be linked to active competition between early colonizers in the gut. Another body site enriched in bacteriocins is the oral cavity [21, 34, 35], which in similarity to the infant gut is inhabited by founder communities. Both habitats have fluctuations in the environmental conditions, leading to continual recolonization of the habitats [36,37,38]. However, the causal association between bacteriocins and founder communities remains to be determined.
The association between bacteriocin gene presence and increased strain diversity in the infant gut for bacteriocin-associated species was significant for the highly prevalent Colicin-encoding gene. Considering that Colicin is a bacteriocin with a narrow spectrum of activity , our finding indicates that intraspecies competition driven by bacteriocins can promote strain diversity. The same was seen for several of the Blp-encoding genes, and although the knowledge on the activity spectra of Blp bacteriocins is limited, it has been reported that four different Blp peptides compose the multipeptide bacteriocin Thermophilin 9 [40, 41], which has a narrow activity spectrum. The broad spectrum bacteriocin Enterolysin A [42, 43], however, did not display a difference in strain diversity in the metagenomes depending on bacteriocin detection, indicating that the inhibition spectrum of bacteriocins could be important for within-species diversity.
The finding of an association between bacteriocin gene presence and increased strain diversity might be explained by intransitive bacterial competition. As reported by Kerr et al.  and elaborated further by Abrudan, Brown, and Rozen , the presence of bacteriocin-producing strains may very well lead to an increased diversity. When three strains that compete for the same niche have one of the following traits each; bacteriocin producer and resistance, bacteriocin resistance only or bacteriocin sensitive but without the cost of neither production nor resistance, intransitive competition dynamics in form of the RPS model may occur [9, 10]. Assuming that the strains are counterbalanced in an RPS manner, the complexity of the bacterial community can be expanded by adding new bacteriocin producers and strains that are resistant and sensitive to the new bacteriocins . In this study, the within-species diversity in the representation of reads matching that species, was consistently larger for the bacteriocin-containing metagenomes compared to those without. Thus, bacteriocin gene presence was found to be associated to increased strain diversity within the producer species, and this can possibly be enabled by RPS dynamics. As for the bacteriocin enrichment, experimental evidence is needed to determine if intransitive bacterial competition is an explanation for the strain diversification in the infant gut.
Regarding the highly prevalent bacteriocin genes, the presence of Blp-encoding genes in gut bacterial genomes was predominantly observed in genera belonging to the Firmicutes, being most prevalent in Streptococcus. This genus is known for production of these bacteriocins [44, 45]. We identified the Enterolysin A-encoding gene in genera restricted to the Firmicutes, and the highest prevalence of this bacteriocin was in Enterococcus, which is in line with previous observations [42, 46]. We found the Pyocin S1-encoding gene in just one genus, Pseudomonas, suggesting that Pseudomonas is the only producer of this bacteriocin. However, the colicin-encoding genes, which are known to be produced by Escherichia coli and some close relatives , were found to be abundant in Pseudomonas. Homology between colicins and pyocins has been characterized previously . The wide distribution of colicin-encoding genes in different genera belonging to Proteobacteria may indicate that these genes can be disseminated by horizontal gene transfer of colicin-encoding plasmids . The classification of metagenome-assembled contigs that contained the encoding genes of the Blp bacteriocins, Colicin and Enterolysin A ties well with the bacteriocin-associated genera discussed.
The detection of genes related to bacteriocin secretion and immunity adjacent to bacteriocin genes on Blp- and Colicin-associated contigs indicates that the infant gut inhabits potential bacteriocin producers. The secretion protein gene detected for the Blp bacteriocins was the ABC transporter, a protein required for bacteriocin maturation and secretion . For Colicin, the gene encoding colicin lysis protein was detected, a protein involved in cell release of group A colicins, preventing the colicins from accumulating in the cytoplasm . No secretion proteins or immunity proteins are described for Enterolysin A . The illustrations of the detected Blp and Colicin gene clusters (Fig. S4) do not resemble known gene clusters for these bacteriocin groups [51,52,53,54]. However, among the different gene clusters described for these groups, there does not seem to be a consensus on how the genes are structured [44, 55], indicating that the gene cluster structure of these bacteriocins may be diverse.
It is clear that an in silico search for bacteriocin genes in metagenomes will also lead to a number of false positive matches. Even if open reading frames with a sequence matching a large part of a known bacteriocin gene are found, the verification of a truly functional operon requires more extensive studies. The results we present in Table 1 and Supplementary Fig. S4 are merely indicators of this.
In summary, the finding of enriched bacteriocin genes in the infant gut in this study suggests that bacteriocins conceivably are of significance in the early stages of human gut microbiota establishment. We have verified that the members of the infant microbiome harbor bacteriocin loci with essential genes for secretion and immunity, which substantiate their potential of bacteriocin production. Based on two previously suggested bottom-up selection theories, we hypothesized that bacteriocins can affect the strain diversity of the human gut microbiota. Our results support that bacteriocins could promote strain diversity in the infant gut microbiota, possibly enabled by intransitive competition. Yet, further work is necessary to study the bacteriocin-mediated interactions between strains in an experimental setup and identify possible mechanisms for intransitive competition.
The methods are described following the same structure as given in Fig. 1.
Bacteriocin screening of gut metagenomes
From all available BioProjects at NCBI/SRA  a collection of human gut metagenomes from healthy individuals was downloaded. Healthy individuals are in general the healthy controls in research projects, as described by Hiseni et al. . From these, we collected all metagenomes with information about the persons’ age, 2564 metagenomes in total. The number of metagenomes per age can be found in Supplementary Fig. S6. Next, all bacteriocin protein sequences in the bacteriocin databases BAGEL4  and BACTIBASE [31, 32], comprising 1231 sequences, were collected.
For each of the metagenomes, the taxonomic composition at the species-level was estimated, using the kraken2 [57, 58] and bracken  software, and using the HumGut genome collection as a reference database . The metagenomes were grouped into five age groups to explore presence of bacteriocins in different stages in life: infant (age 0–1), child (age 2–9), adolescent (age 10–19), adult (age 20–59) and elderly (age 60–107). Beta diversity was computed from the taxonomic profiles, by first using the Aitchisons log-ratio transform  and then a Principal Component Analysis of the resulting profile matrix, as seen in panel a of Fig. 2. The Shannon diversity for samples at each age showed that persons of age 0 and 1 year had substantially lower diversity than those of 2 years and above, which is as expected . See Supplementary Fig. S1 for details on this. Based on the similarities between the metagenomes in the child, adolescent, adult and elderly groups, these metagenomes were later grouped together and referred to as the adult-like metagenomes in the rest of the article. The infant group was kept, and in the rest of the article the metagenomes in this group is referred to as the infant-like metagenomes.
Sequence similarity search
All unique bacteriocin protein sequences, 1075 sequences, were used to build a diamond database . For each metagenome, all the reads were searched against the database of bacteriocin proteins, using the diamond blastx. This resulted in a set of bacteriocin-matching reads for each metagenome. As an estimate for the abundance of bacteriocin genes in a persons’ gut, the number of unique reads that gave at least one match against any bacteriocin was counted for each metagenome. The diamond results were processed, filtering out all hits having an E-value above 10–5. The number of unique matching reads was then divided by the total number of reads for the metagenome, to obtain the bacteriocin gene abundance for each metagenome. The downstream analysis focuses on the bacteriocin genes that are enriched in infants, i.e. they tend to have a higher bacteriocin gene abundance in infant-like metagenomes than in adult-like metagenomes. This enrichment was found by using a Wilcoxon rank-sum test for each bacteriocin, testing if the bacteriocin gene abundance has a larger expected value in infant-like metagenomes than in adult-like metagenomes. Since this test was performed for each of the bacteriocins separately, the multiple testing was corrected for by converting all p-values to q-values, controlling the False Discovery Rate . This resulted in a ranked list of enriched bacteriocin genes, ranked by q-value. It must be noted that some bacteriocin genes with a very small q-value was found present (abundance > 0) in only a very small number of metagenomes from both infant- and adult-like metagenomes. To eliminate such cases, only bacteriocins present in at least 10% of the infant-like metagenomes were considered.
Up to this point there had been made no attempt to group the bacteriocins, even if it was apparent from the start that several entries in the bacteriocin databases are variants of the same protein. However, the enriched bacteriocins were now grouped based on their sequence similarity. An all-versus-all pairwise alignments using blast + was performed  and the bit-score for each alignment was collected. The similarity of sequence A and B was then computed as the 2S(A,B)/(S(A,A) + S(B,B)) where S(A,B) is the bit-score of the alignment between A and B, and S(A,A) and S(B,B) are the corresponding self-alignment bit-scores. If A and B are identical this becomes 1.0. Pairs having a similarity of at least 0.95 were clustered as one bacteriocin.
Bacteriocin screening of genomes
Having the set of enriched and clustered bacteriocins, these were associated to some bacterial taxa in the human infant gut. Again, a diamond search was performed against the bacteriocins, this time using the HumGut genome collection instead of metagenome reads as queries. The HumGut collection contains roughly 30 000 genomes, where each represents a group of very similar genomes, in total around 381 000 genomes. All these genomes were used in the search, and from these results a taxonomic profile for each bacteriocin was computed, where the value for a given bacteriocin and taxon is the fraction of genomes from that taxon having some match against the bacteriocin. Thus, for each enriched and clustered bacteriocin the associated taxa are those with the largest values in this profile.
Within-species diversity analysis
For a given bacteriocin and its associated species, the within-species alpha-diversity across infant-like metagenomes was investigated. Infant-like metagenomes were categorized as either bacteriocin positive or negative, reflecting if the given bacteriocin was detected or not. In our HumGut genome collection there are typically several genomes within each species since genomes are clustered at 97.5% Average Nucleotide Identity. A custom kraken2 database was made where reads are either assigned to such a genome or not assigned at all. Then all the reads in all the infant-like metagenomes were classified using this database. The metagenomes were rarefied to 1 million reads. For each metagenome, positive or negative, the number of genomes having reads assigned was counted, as a proxy for strain diversity. The total number of reads within the species was also computed. Thus, for each infant-like metagenome, the total number of reads associated with a species, as well as how many genomes that was detected from that species was obtained.
Assembly and blasting of contigs
To verify that a metagenome where many reads have bacteriocin hits actually contained the bacteriocin gene, as well as looking for evidence of a truly functional operon, assembly of some of the metagenomes were done. For each bacteriocin, the infant-like metagenomes with the two highest number of hits were chosen for assembly. In total, 8 distinct metagenomes, containing 7.9 × 106 -3.8 × 108 read pairs, were assembled to contigs using metaspades . The contigs were screened for their corresponding bacteriocin gene using tblastn. From each contig with hits, the surrounding region was extracted, and all Open Reading Frames (ORFs) were extracted from the selected region. Then the ORFs were translated, and a blastp search was done against the full nr-database at NCBI to see if they contained genes required for a functional bacteriocin (e.g. immunity genes and secretion protein genes). The contigs were taxonomically classified using the kraken2 software and the HumGut genome collection as a reference database.
Statistics and reproducibility
The metagenomes analyzed in this study are all publicly available at NCBI/SRA .
As described above, the non-parametric Wilcoxon test was used to test for differing bacteriocin gene abundance between infants- and adult-like metagenomes, followed by a correction for multiple testing.
The difference in within-species diversity presented in Fig. 5 was analyzed by a simple analysis of covariance model
where the response yi is the number of genomes detected within the species for metagenome i, the explanatory variable xi is the abundance of the species (reads per 1 million reads) for metagenome i, the indicator variable Ii is 1 if metagenome i has bacteriocin and 0 if not and ei is the random error term. The interesting parameter is a0. If this is > 0 it means there is a significant increase in diversity between bacteriocin and non-bacteriocin metagenomes, regardless of how abundant the species is in the metagenomes. The p-values in Fig. 5 refers to the testing of a0 = 0 versus a0 is nonzero.
Milani C, Duranti S, Bottacini F, Casey E, Turroni F, Mahony J, et al. The first microbial colonizers of the human gut: composition, activities, and health implications of the infant gut microbiota. Microbiol Mol Biol Rev. 2017;81(4):e00036–17.
Stewart CJ, Ajami NJ, O’Brien JL, Hutchinson DS, Smith DP, Wong MC, et al. Temporal development of the gut microbiome in early childhood from the TEDDY study. Nature. 2018;562(7728):583–8.
Bokulich NA, Chung J, Battaglia T, Henderson N, Jay M, Li H, et al. Antibiotics, birth mode, and diet shape microbiome maturation during early life. Sci Transl Med. 2016;8(343):343ra82.
Linehan K, Dempsey EM, Ryan CA, Ross RP, Stanton C. First encounters of the microbial kind: perinatal factors direct infant gut microbiome establishment. Microbiome Res Rep. 2022;1(2):10.
Scanlan PD. Microbial evolution and ecological opportunity in the gut environment. Proc Biol Sci. 1915;2019(286):20191964.
Ley RE, Peterson DA, Gordon JI. Ecological and Evolutionary Forces Shaping Microbial Diversity in the Human Intestine. Cell. 2006;124(4):837–48.
Alcántara JM, Pulgar M, Rey PJ. Dissecting the role of transitivity and intransitivity on coexistence in competing species networks. Thyroid Res. 2017;10(2):207–15.
Soliveres S, Allan E. Everything you always wanted to know about intransitive competition but were afraid to ask. J Ecol. 2018;106(3):807–14.
Kerr B, Riley MA, Feldman MW, Bohannan BJM. Local dispersal promotes biodiversity in a real-life game of rock–paper–scissors. Nature. 2002;418(6894):171–4.
Abrudan MI, Brown S, Rozen DE. Killing as means of promoting biodiversity. Biochem Soc Trans. 2012;40(6):1512–6.
Liao MJ, Din MO, Tsimring L, Hasty J. Rock-paper-scissors: Engineered population dynamics increase genetic stability. Science. 2019;365(6457):1045–9.
Vallespir Lowery N, Ursell T. Structured environments fundamentally alter dynamics and stability of ecological communities. Proc Natl Acad Sci U S A. 2019;116(2):379–88.
Gilpin ME. Limit Cycles in Competition Communities. Am Nat. 1975;109(965):51–60.
May RM, Leonard WJ. Nonlinear Aspects of Competition Between Three Species. SIAM J Appl Math. 1975;29(2):243–53.
Maldonado-Barragán A, West SA. The cost and benefit of quorum sensing-controlled bacteriocin production in Lactobacillus plantarum. J Evol Biol. 2020;33(1):101–11.
Dobson A, Cotter PD, Ross RP, Hill C. Bacteriocin production: a probiotic trait? Appl Environ Microbiol. 2012;78(1):1–6.
Cotter PD, Hill C, Ross RP. Bacteriocins: developing innate immunity for food. Nat Rev Microbiol. 2005;3(10):777–88.
Alvarez-Sieiro P, Montalbán-López M, Mu D, Kuipers OP. Bacteriocins of lactic acid bacteria: extending the family. Appl Microbiol Biotechnol. 2016;100(7):2939–51.
Yang S-C, Lin C-H, Sung CT, Fang J-Y. Antibacterial activities of bacteriocins: application in foods and pharmaceuticals. Front Microbiol. 2014;5:241-.
Simons A, Alhanout K, Duval RE. Bacteriocins, antimicrobial peptides from bacterial origin: overview of their biology and their impact against multidrug-resistant bacteria. Microorganisms. 2020;8(5):639.
Zheng J, Gänzle MG, Lin XB, Ruan L, Sun M. Diversity and dynamics of bacteriocins from human microbiome. Environ Microbiol. 2015;17(6):2133–43.
Drissi F, Buffet S, Raoult D, Merhej V. Common occurrence of antibacterial agents in human intestinal microbiota. Front Microbiol. 2015;6:441-.
Birri DJ, Brede DA, Tessema GT, Nes IF. Bacteriocin Production, Antibiotic Susceptibility and Prevalence of Haemolytic and Gelatinase Activity in Faecal Lactic Acid Bacteria Isolated from Healthy Ethiopian Infants. Microb Ecol. 2013;65(2):504–16.
Kozak K, Charbonneau D, Sanozky-Dawes R, Klaenhammer T. Characterization of bacterial isolates from the microbiota of mothers’ breast milk and their infants. Gut microbes. 2015;6(6):341–51.
Khalkhali S, Mojgani N. Bacteriocinogenic potential and virulence traits of Enterococcus faecium and E. faecalis isolated from human milk. Iran J Microbiol. 2017;9(4):224–33.
Mohammadi F, Eshaghi M, Razavi S, Sarokhalil DD, Talebi M, Pourshafie MR. Characterization of bacteriocin production in Lactobacillus spp. isolated from mother’s milk. Microbial Pathogenesis. 2018;118:242–6.
Angelopoulou A, Warda AK, O’Connor PM, Stockdale SR, Shkoporov AN, Field D, et al. Diverse bacteriocins produced by strains from the human milk microbiota. Front Microbiol. 2020;11:788.
Bäuerl C, Umu Ö CO, Hernandez PE, Diep DB, Pérez-Martínez G. A method to assess bacteriocin effects on the gut microbiota of mice. J Vis Exp. 2017;(125):56053.
Umu ÖCO, Gueimonde M, Oostindjer M, Ovchinnikov KV, de los Reyes-Gavilán CG, Arbulu S, et al. Use of Fecal Slurry Cultures to Study In Vitro Effects of Bacteriocins on the Gut Bacterial Populations of Infants. Probiotics Antimicrobial Proteins. 2020;12(3):1218–25.
van Heel AJ, de Jong A, Song C, Viel JH, Kok J, Kuipers OP. BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins. Nucleic Acids Res. 2018;46(W1):W278–81.
Hammami R, Zouhir A, Ben Hamida J, Fliss I. BACTIBASE: a new web-accessible database for bacteriocin characterization. BMC Microbiol. 2007;7(1):89.
Hammami R, Zouhir A, Le Lay C, Ben Hamida J, Fliss I. BACTIBASE second release: a database and tool platform for bacteriocin characterization. BMC Microbiol. 2010;10(1):22.
Hiseni P, Rudi K, Wilson RC, Hegge FT, Snipen L. HumGut: a comprehensive human gut prokaryotic genomes collection filtered by metagenome data. Microbiome. 2021;9(1):165.
Alghamdi S. Isolation and identification of the oral bacteria and their characterization for bacteriocin production in the oral cavity. Saudi J Biol Sci. 2022;29(1):318–23.
Kreth J, Merritt J, Shi W, Qi F. Co-ordinated bacteriocin production and competence development: a possible mechanism for taking up DNA from neighbouring species. Mol Microbiol. 2005;57(2):392–404.
Deo PN, Deshmukh R. Oral microbiome: Unveiling the fundamentals. J Oral Maxillofac Pathol. 2019;23(1):122–8.
McLean JS. Advancements toward a systems level understanding of the human oral microbiome. Front Cell Infect Microbiol. 2014;4:98.
Yatsunenko T, Rey FE, Manary MJ, Trehan I, Dominguez-Bello MG, Contreras M, et al. Human gut microbiome viewed across age and geography. Nature. 2012;486(7402):222–7.
Cascales E, Buchanan SK, Duché D, Kleanthous C, Lloubès R, Postle K, et al. Colicin biology. Microbiol Mol Biol Rev. 2007;71(1):158–229.
Rossi F, Marzotto M, Cremonese S, Rizzotti L, Torriani S. Diversity of Streptococcus thermophilus in bacteriocin production; inhibitory spectrum and occurrence of thermophilin genes. Food Microbiol. 2013;35(1):27–33.
Fontaine L, Hols P. The inhibitory spectrum of thermophilin 9 from Streptococcus thermophilus LMD-9 depends on the production of multiple peptides and the activity of BlpG(St), a thiol-disulfide oxidase. Appl Environ Microbiol. 2008;74(4):1102–10.
Nilsen T, Nes IF, Holo H. Enterolysin A, a cell wall-degrading bacteriocin from Enterococcus faecalis LMG 2333. Appl Environ Microbiol. 2003;69(5):2975–84.
Khan H, Flint SH, Yu PL. Determination of the mode of action of enterolysin A, produced by Enterococcus faecalis B9510. J Appl Microbiol. 2013;115(2):484–94.
Fontaine L, Boutry C, Guédon E, Guillot A, Ibrahim M, Grossiord B, et al. Quorum-sensing regulation of the production of Blp bacteriocins in Streptococcus thermophilus. J Bacteriol. 2007;189(20):7195–205.
Kjos M, Miller E, Slager J, Lake FB, Gericke O, Roberts IS, et al. Expression of Streptococcus pneumoniae Bacteriocins Is Induced by Antibiotics via Regulatory Interplay with the Competence System. PLoS Pathog. 2016;12(2):e1005422.
Nigutova K, Morovsky M, Pristas P, Teather RM, Holo H, Javorsky P. Production of enterolysin A by rumen Enterococcus faecalis strain and occurrence of enlA homologues among ruminal Gram-positive cocci. J Appl Microbiol. 2007;102(2):563–9.
Barreteau H, Tiouajni M, Graille M, Josseaume N, Bouhss A, Patin D, et al. Functional and Structural Characterization of PaeM, a Colicin M-like Bacteriocin Produced by Pseudomonas aeruginosa*. J Biol Chem. 2012;287(44):37395–405.
DeWitt W, Helinski DR. Characterization of colicinogenic factor E1 from a non-induced and a mitomycin C-induced Proteus strain. J Mol Biol. 1965;13(3):692–703.
Havarstein LS, Diep DB, Nes IF. A family of bacteriocin ABC transporters carry out proteolytic processing of their substrates concomitant with export. Mol Microbiol. 1995;16(2):229–40.
Pérez-Ramos A, Madi-Moussa D, Coucheney F, Drider D. Current knowledge of the mode of action and immunity mechanisms of LAB-bacteriocins. Microorganisms. 2021;9(10):2107.
Miller EL, Abrudan MI, Roberts IS, Rozen DE. Diverse Ecological Strategies Are Encoded by Streptococcus pneumoniae Bacteriocin-Like Peptides. Genome Biol Evol. 2016;8(4):1072–90.
Bogaardt C, van Tonder AJ, Brueggemann AB. Genomic analyses of pneumococci reveal a wide diversity of bacteriocins - including pneumocyclicin, a novel circular bacteriocin. BMC Genomics. 2015;16(1):554.
Cole ST, Saint-Joanis B, Pugsley AP. Molecular characterisation of the colicin E2 operon and identification of its products. Mol Gen Genet MGG. 1985;198(3):465–72.
Waleh NS, Johnson PH. Structural and functional organization of the colicin E1 operon. Proc Natl Acad Sci U S A. 1985;82(24):8389–93.
Schramm E, Olschläger T, Tröger W, Braun V. Sequence, expression and localization of the immunity protein for colicin B. Mol Gen Genet. 1988;211(1):176–82.
Leinonen R, Sugawara H, Shumway M. The sequence read archive. Nucleic Acids Res. 2011;39(Database issue):D19-21.
Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15(3):R46.
Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20(1):257.
Lu J, Breitwieser F, Thielen P, Salzberg S. Bracken: Estimating species abundance in metagenomics data. PeerJ Computer Science. 2017;3:e104.
Aitchison J. The Statistical Analysis of Compositional Data. J Roy Stat Soc: Ser B (Methodol). 1982;44(2):139–77.
Avershina E, Lundgård K, Sekelja M, Dotterud C, Storrø O, Øien T, et al. Transition from infant- to adult-like gut microbiota. Environ Microbiol. 2016;18(7):2226–36.
Buchfink B, Reuter K, Drost H-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat Methods. 2021;18(4):366–8.
Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J Roy Stat Soc: Ser B (Methodol). 1995;57(1):289–300.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10(1):421.
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27(5):824–34.
This research was funded by the Faculty of Chemistry, Biotechnology and Food Sciences, Norwegian University of Life Sciences, NORWAY.
Ethics approval and consent to participate
Consent for publication
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Shannon diversity of the metagenomes in this study.
Prevalence and abundance of the enriched and clustered bacteriocin genes.
Bacteriocin gene distribution in gut bacterial genomes at species-level.
Distribution of highly prevalent and medium prevalent bacteriocin genes in gut bacterial genomes.
Distribution of bacteriocin associated genes on contigs.
Within-species diversity analysis of highly prevalent and medium prevalent enriched bacteriocin genes.
Age distribution of metagenomes.
Metagenome accession numbers.
About this article
Cite this article
Ormaasen, I., Rudi, K., Diep, D.B. et al. Metagenome-mining indicates an association between bacteriocin presence and strain diversity in the infant gut. BMC Genomics 24, 295 (2023). https://doi.org/10.1186/s12864-023-09388-0
- Genome mining
- Infant gut microbiota