Our analysis involved several steps: (a) Quantitative analysis of the vitamin biosynthetic and transporter related genes between different groups; (b) Charting the information of the identified genes at their corresponding species and phyla level; (c) Identification of co-occurrence patterns between vitamin producers and consumers, and using this information to gain insights into the possible metabolic correlations based on species abundance; (d) Metabolic reconstructions of genome-scale metabolic models were deployed to visualize the community-level metabolic potential of abundant gut species in diverse cohorts, and (e) Quantifying the abundance of vitamin biosynthetic enzymes and vitamin-dependent enzymes in an IBD (Inflammatory Bowel Disease) cohort in comparison to healthy controls.
Metagenomic abundance of vitamin biosynthetic pathways and transporters in healthy individuals from different countries
To evaluate if the abundances of microbial-mediated vitamin metabolic genes are dependent on the country of origin or not, linear and logistic regression analyses, without controlling for other confounding factors, was carried out. The result suggested an association between vitamin metabolic gene abundances with healthy samples originating from China (F-statistic, p < 0.01, as a country variable). However, the gene abundances from healthy samples of other countries, i.e., Denmark, Spain, and USA showed no significant association to the corresponding country variable. Extraction of DNA was performed in the Beijing Genome Institute (BGI) for the samples from China, Denmark, and Spain. Although none of these studies provided diet information for the samples, it would be reasonable to speculate that the observed differences could be a diet-dependent effect (based on Western and non-Western diets).
Henceforth, we compared the normalized abundances of eight B-vitamin and menaquinone related biosynthetic and transporter genes in the healthy subjects from these different populations. On comparison of healthy samples among four different countries, we found that individuals from China had significantly higher gene abundance (Mann–Whitney test, p < 0.01) for each vitamin type, than that from USA, Spain, and Denmark (Fig. 1 and Additional file 1: Table S1). For each vitamin, the normalized abundance of the biosynthetic genes ranged around similar range as shown in Fig. 1 for all the healthy individuals irrespective of their origin of country. Additionally, genes involved in the biosynthetic pathways were found to be significantly higher than their corresponding transport related pathways (Mann–Whitney test, p < 0.01). However, it is of caution that as the bacterial vitamin transporters could belong to a diverse class of protein families [15, 16] due to their evolutionary variability, functional annotation of these transporters through bioinformatic approach remains challenging [17, 18].
Metagenomic abundance of vitamin biosynthetic pathways and transporters in different states of health
Many functional features of gut microbiota have been shown to correlate with health and disease, and we therefore evaluated if vitamin metabolism by gut commensals is associated with the host health status. To do so, we performed similar analysis as described above between country-matched healthy controls and diseased subjects. The comparison for gene abundances included the following groups: healthy controls and the newly-onset IBD (inflammatory bowel disease) subjects from USA [14]; healthy controls and T2D (type 2 diabetes) subjects from China [13]; and healthy controls and IBD subjects under clinical remission from Spain [19]. The results suggested that gene abundances from T2D subjects in China (as a diagnosed variable) showed an association for vitamin metabolism (F-statistic, p < 0.01), while the gene abundances from IBD subjects from USA and Spain showed no significant association with disease status.
Comparing the gene abundances between the country-matched groups, we found that the T2D subjects from the Chinese cohort showed substantial and significant difference for all the vitamin types associated with biosynthetic enzymes and transporters (except for biotin and riboflavin) in comparison with the country-matched healthy controls (Mann–Whitney test, p < 0.01) (Fig. 2). On the other hand, IBD subjects (both onset and clinical remission ones) were not found to be significantly different concerning vitamin metabolism when compared to their respective country-matched healthy controls. To validate our analytical method, we tested and found that our findings were consistent with what has been reported earlier [20]. Henceforward, data for healthy subjects were used to get insight into how these genes map to bacteria from different phyla.
Phylogenetic origin of vitamin pathway genes
To compare the taxonomic profile of vitamin metabolic genes (biosynthesis and transporter genes) at phylum or species level, each of the mapped gene identifiers were used to retrieve the lineage information from UniProt database. In Fig. 3, we show how vitamin metabolic genes are prevalent across different phyla, which might suggest vitamin metabolism being as a part of core function in gut microbial metabolic modules. Additional file 2: Figure S1 shows a network of associations between species with mapped vitamin-related gene and its corresponding phylum. This figure reveals that the vitamin related pathways are mostly found in the phylum Firmicutes with about 49%, followed by Proteobacteria with about 19%, Bacteroidetes with about 14% and Actinobacteria with about 13%. Additional file 3: Figure S2 shows a network of associations between vitamin biosynthetic capabilities and phylum with species information. For every vitamin type, the number of species having prevalence of a biosynthetic pathway seemed to be higher than that of species with transporter mechanisms. Among the various vitamin transporters, species encoding thiamine transporter dominated in contrast to folate transporter being the lowest in number. However, from Additional file 3: Figure S2, it also becomes apparent that there are three types of species that differ with respect to functional characteristics related to vitamin metabolism, i.e. vitamin genes related to biosynthesis or transporters or both. Based on this, we categorized 171 microbial species as Producer (P), Consumer (C) and Producer-Consumer (Dual) for three specific vitamins (biotin, cobalamin, and thiamine) in the following analysis.
Network association between vitamin prototrophs and vitamin consumers
Given a hypothetical situation where there is a direct dependence between public-good-producing bacteria and its consumer, it is expected to observe a stronger association of their coexistence at the level of species abundance. Based on the presence or absence of genes encoding proteins required in vitamin biosynthetic pathways or transporter mechanisms or both, each microbial species was assigned as either vitamin prototrophs (producers) or vitamin consumers (consumers) or producer-consumer (dual) respectively for every vitamin type. Therefore, to estimate interactions between vitamin prototrophs and consumers, we performed a correlational analysis between species with all possible phenotypes. Of total 7021 possible pairs of species combinations (171 species (675 strains) considered), 2030 (i.e. 28.9%) and 590 (i.e. 8%) were significantly positive and negative correlated, respectively (Mann–Whitney test, p < 0.01). Among the positive correlations, with a correlation coefficient greater than 0.7, we observed that a pair of species originating from the same phylum could coexist, probably due to their evolutionary similarities. However, we also observed a similar level of coexistence for a pair of species originating from different phyla. For example, species from two different phyla, Lactobacillus acidophilus (Firmicutes) and Citrobacter koseri (Proteobacteria) had a high and positive correlation coefficient r = 0.81.
Based on the high prevalence of species with vitamin metabolic pathways among different phyla as derived from Fig. 3, we focused our next correlational analysis (using similar approached as described above) to the most prevalent three B-vitamins among microbial species, i.e. biotin, cobalamin, and thiamine involving both biosynthetic and transporter related genes respectively to estimate interactions between these specific vitamin prototrophs and consumers. Contrary to our hypothesis, we found that species with dual functional traits (i.e. containing genes for both biosynthesis and transport of the same vitamin) contributed to a notable fraction of positive correlations among the possible phenotypes (Additional file 4: Figure S3). On the other hand, the co-occurrence of species with single phenotype (i.e., either producer/P or consumer/C) and dual phenotypes belonged to both positive and negative correlations. These observations hint that the species with dual phenotypes could drive stronger relationships than the ones with either limited or less functional traits. It is well-known, that communities of species with multiple functional features respond well to changes in the environment, thereby influencing the net outcome of a community assembly [21,22,23]
Genome annotation coverage of biosynthetic pathways
Next, we sought to visualize the presence and absence of vitamin pathway related reactions in each cohort to gain insights into how conserved each pathway is, across different populations. As depicted in Additional file 5: Figure S4, riboflavin seems to be well represented and conserved among all species across all the four cohorts regardless of the health status. This conservation could also be due to the limited number of reactions involved in riboflavin pathway compared to other pathways. Contrarily, menaquinone or vitamin K2, has the least number of species with its biosynthesis pathway present. Perhaps, human microbiome might have evolved to have a reduced dependence of menaquinone production from the gut microbiota as proposed earlier [24]. In case of folate, the majority of the reactions involved in its biosynthesis were observed to be present except for four reactions which concerned interconversions of tetrahydrofolate, dihydrofolate, and folate using NADH or NADPH as cofactors and one ATP-forming reaction carried out by folylpolyglutamate synthase enzyme. Biotin and thiamine appeared to be vitamins with a major part of their biosynthetic reactions with their annotations missing. These are related to fatty acid metabolism and the upstream part of the biotin synthesis pathway provides a long-chain acyl as the starting point for the biosynthesis. In case of thiamine, three phosphatase reactions involving interconversion of thiamine monophosphate, −diphosphate, −triphosphate and thiamine were absent. For the rest of the vitamins, the incompleteness of their biosynthesis pathway was observed for many species, which is in line with previous studies as it has been suggested that vitamin biosynthesis could be carried out as complementary tasks between several bacteria, each harboring a part of the pathway [25].
Furthermore, for the American and Chinese cohort, we investigated the presence and absence of reactions in the biosynthesis pathway of each vitamin considering the abundance of bacterial species. As depicted in Fig. 4a, it is noticeable that species with biosynthetic reactions of riboflavin, pantothenate, and pyridoxine have relatively higher coverage when compared to the other vitamin types. It is also to be noted that species from genera Bacteroides, namely B. uniformis, B. stercoris, B. vulgatus, B. fragilis, and B. caccae are among the most abundant species with relatively high coverage of biosynthetic capabilities for all vitamins. Although the overall coverage of vitamin biosynthetic pathways is rather high, the same trend is not observable in case of biotin, cobalamin, and niacin which have incomplete pathways in many cases.
For the Chinese cohort (Fig. 4b), we noticed that the abundant species profile differs from that of the American cohort, possibly reflecting a population-specific (which is mostly due to country variation) gut microbiota. It is again evident from this that Bacteroides species have most of the vitamin biosynthesis pathways covered except for cobalamin due to differences at the species level. Besides, in the Chinese population, there are many species from Firmicutes amongst the abundant ones, which suggest that based on the Bacteroidetes/Firmicutes ratio there may be a change in abundance of vitamin related genes.
Expression level of vitamin biosynthetic enzymes and vitamin-dependent enzymes in an IBD cohort
Based on the availability of metatranscriptomics dataset from the American cohort analyzed above, we sought to investigate the mRNA sequences of microbe-mediated vitamin related enzyme as computed by HUMAnN2 [26]. Functional profiling of mRNA normalized features (i.e., vitamin biosynthesis) in this cohort reveals that there is a low abundance of cobalamin and thiamine biosynthesis related enzymatic genes expressed in the IBD samples compared to healthy controls (Mann–Whitney test, p < 0.01) (Fig. 5a). These findings were consistent with a similar pattern observed in a previous study where B-vitamins were measured to be lower in plasma samples of IBD subjects [27,28,29].
However, the expression level of biotin biosynthetic enzymes was found to be significantly higher in the IBD subjects compared to healthy controls (Mann–Whitney test, p < 0.01). To examine this, we inspected whether the abundance of biotin synthesizers were significantly abundant or not in the IBD subjects than that of the healthy controls (especially, for the coordinated set of samples where the metagenomics and metatranscriptomics were carried out). Interestingly, we found that species with only biosynthetic pathways for biotin were significantly more abundant in the IBD subjects whereas species with biotin transporters were significantly less abundant in the IBD subjects compared to healthy controls (Additional file 6: Figure S5).
These observations once again highlight the significance of the microbial contribution to human health status. Furthermore, we analyzed the expression level of these vitamin-dependent enzymes to discern the connection between their synthesis and usage. We found that the abundance of biotin-dependent enzymes had an inreverse pattern to that of their biosynthesis level, whereas, thiamine utilization potential as well as its biosynthetic potential were less abundant in IBD subjects when compared to the healthy controls (Fig. 5b) (Mann–Whitney test, p < 0.01). Variation in vitamin synthesis and utilization level among these three vitamins is likely to reflect the complexity of the gut microbiota towards resource allocation and reuse of enzymes.