Comparative genomics of the Bifidobacterium breve taxon
- Francesca Bottacini1,
- Mary O’Connell Motherway1,
- Justin Kuczynski3,
- Kerry Joan O’Connell1,
- Fausta Serafini2,
- Sabrina Duranti2,
- Christian Milani2,
- Francesca Turroni1,
- Gabriele Andrea Lugli2,
- Aldert Zomer4,
- Daria Zhurina5,
- Christian Riedel5,
- Marco Ventura2 and
- Douwe van Sinderen1
© Bottacini et al.; licensee BioMed Central Ltd. 2014
Received: 16 September 2013
Accepted: 19 February 2014
Published: 1 March 2014
Bifidobacteria are commonly found as part of the microbiota of the gastrointestinal tract (GIT) of a broad range of hosts, where their presence is positively correlated with the host’s health status. In this study, we assessed the genomes of thirteen representatives of Bifidobacterium breve, which is not only a frequently encountered component of the (adult and infant) human gut microbiota, but can also be isolated from human milk and vagina.
In silico analysis of genome sequences from thirteen B. breve strains isolated from different environments (infant and adult faeces, human milk, human vagina) shows that the genetic variability of this species principally consists of hypothetical genes and mobile elements, but, interestingly, also genes correlated with the adaptation to host environment and gut colonization. These latter genes specify the biosynthetic machinery for sortase-dependent pili and exopolysaccharide production, as well as genes that provide protection against invasion of foreign DNA (i.e. CRISPR loci and restriction/modification systems), and genes that encode enzymes responsible for carbohydrate fermentation. Gene-trait matching analysis showed clear correlations between known metabolic capabilities and characterized genes, and it also allowed the identification of a gene cluster involved in the utilization of the alcohol-sugar sorbitol.
Genome analysis of thirteen representatives of the B. breve species revealed that the deduced pan-genome exhibits an essentially closed trend. Our analyses therefore suggest that this number of B. breve representatives is sufficient to fully describe the pan-genome of this species. Comparative genomics also facilitated the genetic explanation for differential carbon source utilization phenotypes previously observed in different strains of B. breve.
Keywords: Bifidobacterium breve, Evolutionary genomics, Core genome, Dispensable genome, Pan-genome
Bifidobacteria are a common component of the microbiota of the gastrointestinal tract (GIT) of a broad range of hosts, and their presence is associated with a positive health status of the gut. However, little is known about the precise molecular mechanisms that explain these probiotic effects [1–3]. For this reason a considerable number of ongoing scientific efforts aim to precisely explain how these benefits are being provided, and in many cases such efforts involve comparative and functional genome analyses.
Sequenced bifidobacterial genomes range in size from 1.94 to 2.8 Mbp (Bifidobacterium animalis subsp. lactis DSM 10140 and Bifidobacterium longum subsp. infantis ATCC 15697, respectively), and their genomic organization is in line with that of a typical bacterial chromosome.
B. longum subsp. infantis, Bifidobacterium bifidum and Bifidobacterium breve are typical inhabitants of the infant intestine, which is presumed sterile at birth but becomes rapidly colonized by bacteria immediately following (vaginal) delivery [5, 6]. Functional analyses conducted on bifidobacterial genomes have also revealed how they adapt to a certain niche. For example, the presence of enzymes dedicated to the metabolism of human milk oligosaccharides (HMOs) in B. longum subsp. infantis showed how this species is specialized in colonizing the infant gut.
In vivo gene expression analyses conducted on B. breve UCC2003 and B. bifidum PRL2010 have revealed genes that encode functions required for gut colonization and persistence [8, 9]. Furthermore, Comparative Genome Hybridization (CGH) analyses on various B. breve isolates have highlighted the existence of a high level of sequence homology among members of this species, and also identified genetic functions that appear to be more variable within this bifidobacterial taxon. Such variable functions are associated with bifidobacterial adaptation to the host environment and defence against invasion of foreign DNA. They include, among others, CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) sequences, (type II) Restriction/Modification (R/M) systems and genes involved in the production of particular extracellular structures, such as capsule exopolysaccharides (EPS) and sortase-dependent pili. However, CGH analysis is not sufficient to describe the genetic diversity of a species, as it can only detect genes present in the reference genome, and cannot identify genes that are present in tested genomes yet absent in the reference genome. For this reason we decided to investigate the genome variability within the B. breve taxon by performing whole genome sequencing and comparative analysis of thirteen B. breve strains. The generated genome data sets were used to perform a pan-genomic analysis, which allowed the definition of the total number of different genes encoded by the entire B. breve group (the pan-genome), as well as the total number of common genes present in all isolates (the core-genome) [10, 11].
Corresponding pan- and core-genome information, as obtained from an increasing number of genome sequences, can be used to determine if sequenced representatives of a certain species have provided all expected gene diversity present in that taxon (closed trend), or if additional sequencing is still necessary before essentially all genes of the species have been identified (open trend) [10–12]. As this approach takes the overall collection of genetic functions assigned to a certain species (pan-genome) into consideration, rather than conducting individual analyses for each strain, it is believed to represent an accurate and advanced method to explore genomic diversity of a particular bacterial taxon.
Results and discussion
General genome features
List of Bifidobacterium breve representatives
B. breve UCC2003; University College Cork, Ireland
B. breve S27; Infant feces (breast fed); University of Ulm, Germany
B. breve 689b; University of Parma, Italy
B. breve NCFB 2258; National Collection of Food Bacteria, UK
B. breve JCM 7017; Japan Collection of Microorganisms, Japan
B. breve DSM 20213; Deutsche Sammlung von Mikroorganismen, Germany; DRAFT (103 contigs)
B. breve 12L; University of Parma, Italy
B. breve 2L; GenProbio Ltd., Parma, Italy; DRAFT (6 contigs)
B. breve 31L; GenProbio Ltd., Parma, Italy; DRAFT (4 contigs)
B. breve CECT 7263; Universidad de Madrid, Spain; DRAFT (34 contigs)
B. breve JCM 7019; Japan Collection of Microorganisms, Japan
B. breve DPC 6330; Elderly individual faecal sample; Food Research Centre, Moorepark, Cork, Ireland; DRAFT (47 contigs)
B. breve ACS-071-V-Sch8b; Craig Venter Institute, USA
Our sequencing efforts resulted in fully sequenced genomes for six strains (B. breve 689b, B. breve 12L, B. breve NCFB 2258, B. breve S27, B. breve JCM 7017 and B. breve JCM 7019), while the assembly of the two remaining genome sequences resulted in multiple contigs (Table 1). Furthermore, five complete and draft B. breve genomes (B. breve UCC2003, B. breve ACS-071-V-Sch8b, B. breve CECT 7263, B. breve DPC 6330, B. breve DSM 20213) were retrieved from the NCBI public database. Genome alignment conducted on the eight complete genomes, using B. breve UCC2003 as reference sequence, established an average sequence length of 2,323,100 bp, where B. breve JCM 7017 represents the strain with the smallest chromosome (2,288,919 bp), while B. breve UCC2003 possesses the largest chromosome (2,422,684 bp). All B. breve genomes analyzed here displayed an average G+C content of 58%, which is consistent with the range of G+C mol% content of genomes of the Bifidobacterium genus.
General features of eight complete genomes of Bifidobacterium breve
B. breve NCFB 2258
B. breve JCM 7017
B. breve JCM 7019
Genome length (bp)
Number of genes
Genes with assigned function
IS elements/ transposases
As displayed in Table 2, all fully sequenced genomes were observed and experimentally verified to encompass two identical rRNA loci located at non-adjacent positions in the genome, with the exception of B. breve S27, which contains three such loci; an average of 53 dispersed tRNA genes was noted per B. breve genome.
As previously observed in other bifidobacterial genomes [8, 13–15], the ATG sequence appears to be the preferred start codon (87%), while GTG, TTG and CTG seem to be less frequently used, with a calculated frequency percentage of 9.53%, 3.24% and 0.08%, respectively.
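Start codon frequencies of the kind reported above can be tallied with a few lines of code. The following is a minimal sketch, assuming the predicted ORFs are available as nucleotide strings in coding orientation; the function name `start_codon_usage` and the toy sequences are hypothetical illustrations and are not taken from the study.

```python
from collections import Counter


def start_codon_usage(orfs):
    """Return the percentage of ORFs that begin with each start codon."""
    counts = Counter(seq[:3].upper() for seq in orfs if len(seq) >= 3)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: 100.0 * n / total for codon, n in counts.items()}


# Toy input (hypothetical sequences, for illustration only)
toy_orfs = ["ATGAAATAA", "ATGCCCTGA", "GTGAAATAG", "atgggctaa"]
usage = start_codon_usage(toy_orfs)
```

Applied to the full set of predicted ORFs of a genome, the returned dictionary would give one percentage per observed start codon.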
The predicted mobilome of B. breve species
All complete genome sequences were investigated for the presence of mobile elements such as IS elements and genes specifying transposases, and this analysis revealed that the B. breve JCM 7019 genome contains the largest number (i.e. 54) of such mobile elements, while the B. breve ACS-071-V-Sch8b genome encompasses just 12 IS elements and transposase-encoding genes. The IS classification according to the ISFinder database also showed that IS30 is the most frequently occurring insertion family in B. breve.
The complete chromosomes were also examined for the presence of prophages and plasmids. The prophage-like DNA element Bbr-1 of B. breve UCC2003 was previously analysed and represents a likely prophage remnant. Notably, in our analysis, we identified two other prophage-like elements (Additional file 1: Figure S1): 689b-1 in B. breve 689b (represented by the DNA region occupied by locus tags B689b_0284 through to B689b_0311), which appears to be incomplete, and 7019-1 in B. breve JCM 7019 (encompassing locus tags B7019_0905 through to B7019_1003), which appears to represent a complete prophage. Notably, the B. breve S27 genome appears to contain an integrated episome S27-1 (encompassing locus tags BS27_1090 through to BS27_1136), which is predicted to specify several hypothetical proteins, an integrase (BS27_1090), a DNA transfer protein (BS27_1114), and a cell wall anchor domain protein (BS27_1125). An extra-chromosomal (plasmid) sequence was confirmed to be present in B. breve NCFB 2258, and this plasmid is 100% identical to the previously published pCIBb1 in full-length BLASTN alignment.
Whole-genome alignments and phylogenetic analysis
The eight fully sequenced B. breve genomes were also aligned using B. breve UCC2003 as the reference chromosome. The observed degree of alignment as displayed in a dot-plot exhibited a near-continuous straight line, indicating that all these genomes are highly syntenic, with the only exception of apparent inversions in the middle of the genome sequences of B. breve ACS-071-V-Sch8b (1126 Kb, corresponding to B. breve UCC2003 genome coordinates 611,964 – 1,653,404) and B. breve JCM 7017 (169 Kb, corresponding to UCC2003 genome coordinates 1,181,452 – 1,350,961) (Additional file 2: Figure S2). In the case of B. breve JCM 7017 we confirmed this genomic inversion by PCR, demonstrating that this was not due to an assembly error (data not shown). Furthermore, analysis of the DNA that directly flanks these two inversions revealed the presence of sequences specifying mobile elements/transposases, which may have acted as mechanistic drivers for this genomic reshuffling through homologous recombination. B. breve ACS-071-V-Sch8b contains truncated integrases and transposases flanking the inversion (HMPREF9228_0467-69 and HMPREF9228_1435-38, respectively), while the B. breve JCM 7017 genome contains a hypothetical protein and a putative conjugative transposon at the left end of the inversion and sequences encoding a replication initiation factor, excisionase and integrase at the other inversion end (B7017_0896-97 and B7017_1053-55, respectively). A BLAST alignment performed on the above-mentioned genes for B. breve JCM 7017 revealed high identity (88–100% in BLASTP alignment) with mobile elements found in Clostridium difficile 630, which suggests their acquisition by means of horizontal gene transfer (HGT).
In order to investigate the phylogenetic relationship between B. breve and other bifidobacteria, a phylogenetic supertree was computed based on 165 orthologues, selected on the basis of the comparison of the thirteen B. breve genomes (see above), other sequenced Bifidobacterium species (B. longum subsp. longum NCC2705, B. longum subsp. infantis ATCC 15697, B. bifidum PRL2010, B. adolescentis ATCC 15703, B. dentium Bd1, B. animalis subsp. animalis ATCC 25527, B. animalis subsp. lactis DSM 10140 and Bifidobacterium asteroides PRL2011), and three additional actinobacterial genomes (i.e. Gardnerella vaginalis ATCC 14019, Leifsonia xyli subsp. xyli CTCB07 and Tropheryma whipplei TW08/27), combined with a member of Firmicutes as a representative outgroup (Lactobacillus plantarum WCFS1). As shown in the resulting consensus tree (Figure 1, panel c), all thirteen B. breve members fall into the B. longum phylogenetic group, which is consistent with a previous assignment based on a multilocus approach. As shown in a previous study, the colonization of the infant gut by representatives of B. breve and B. longum occurs immediately after birth, with a correlation being observed between strains present in mother and progeny, thus suggesting that such bifidobacteria are transmitted from mother to child during vaginal delivery and/or breast feeding [5, 6]. The strains analyzed in this study possess very similar ecological origins and it was therefore not surprising that no clear separation of these strains was observed within the tree. However, B. breve JCM 7019, an isolate from adult faeces, clustered in a separate branch, while the B. breve milk isolates also clustered together. Additionally, most of the infant stool isolates were shown to cluster in the same group at the bottom of the tree.
B. breve core and dispensable genome
Comparative genomic analysis based on BLASTP comparisons and the MCL clustering algorithm between the eight complete B. breve genomes (see Methods) allowed the definition of a set of 1323 gene families, representing the core genome for the B. breve species, defined as a pool of gene families that is present in all of the considered genomes [10, 11], and representing the 1141 orthologues mentioned above plus an additional 182 paralogues. Inspection of corresponding COG assignments (Additional file 3: Figure S3) revealed that many components of this core genome represent functions related to cellular housekeeping. It is also worth mentioning that this set of core families is composed of common functions which can be present in single copy (named orthologues and including a large proportion of the identified housekeeping genes), but also functions present in multiple copies (named paralogues, of which ATP Binding Cassette (ABC)-type transporters represent a typical example). Variability among the B. breve genomes is due to a specific set of functions, also called dispensable genes, which are present in more than one of the examined B. breve genomes, yet not present in all, as well as genes that are specific for just one strain [10, 11]. Our analysis revealed a total of 924 families of variable genes, 426 of which are classified as unique. Of these 924 gene families, 49% encode hypothetical proteins, while the remainder is assigned to more informative features, such as genes predicted to encode proteins involved in capsular exopolysaccharide (EPS) synthesis, in phage resistance (CRISPR locus and R/M systems), in the production of sortase-dependent pili, and in carbohydrate metabolism, including various carbohydrate transporters (Additional file 4: Table S1). Notably, our in silico data corroborate and extend previously published CGH analyses that had been performed to explore the genomic diversity of B. breve.
Furthermore, the total gene pool (ORFeome) extracted from the eight B. breve complete genomes was compared with that of six complete and publicly available chromosome sequences of B. longum subsp. longum (B. longum subsp. longum NCC2705, B. longum subsp. longum DJO10A, B. longum subsp. longum JCM 1217, B. longum subsp. longum ATCC 15697, B. longum subsp. longum 157F, B. longum subsp. longum BBMN68), which is phylogenetically the closest related taxon to B. breve. This comparison showed that 564 gene families (Additional file 5: Table S2) are specifically present in B. breve, yet absent in B. longum subsp. longum. Of these 564 gene families, approximately 50% encode unknown or hypothetical functions, while the other 50% represent functions similar to the ones observed in the variable regions of B. breve (i.e. glycosyl hydrolases, ABC transporters, CRISPR genes and mobile elements). The analysis also showed that 581 gene families are specifically present in B. longum subsp. longum, yet absent in B. breve, of which approximately 68% encode unknown or hypothetical functions, while the remaining 32% specify mobile elements, ABC transporters and glycosyl hydrolases (data not shown).
Variability among B. breve genomes
These variable regions were shown to include the EPS cluster 2 (Bbr_0430-74, REG 3), containing two oppositely oriented operons (eps1 from Bbr_0441 to Bbr_0434 and eps2 from Bbr_0442 to Bbr_0451), type II R/M systems 1–3 (Bbr_0214-16 and Bbr_1118-21; REG 2 and REG 5, respectively), the conjugative transposon, excisionase and integrase of B. breve JCM 7017 (B7017_0896-97 and B7017_1053-55, REG 4), a CRISPR locus (Bbr_1405-11, REG 6), pilus-encoding genes (Bbr_1887-89), clusters encoding enzymes involved in the metabolism of carbohydrates (REG 7–8) and hypothetical proteins (REG 1) (Figure 2, panel a).
Bifidobacterium breve variable regions
B. breve NCFB 2258
B. breve JCM 7017
B. breve JCM 7019
EPS cluster 2
EPS cluster 1
R/M system 1
R/M system 2
R/M system 3
A varying number of predicted type II DNA R/M systems were identified in each of the eight completed B. breve genomes. The chromosomes of B. breve UCC2003 and B. breve JCM 7019 are each predicted to encode three R/M systems, while the chromosomes of B. breve NCFB 2258 and B. breve JCM 7017 each encompass two such systems, and the chromosomes of the remaining strains (B. breve 12L, B. breve 689b, B. breve ACS-071-V-Sch8b and B. breve S27) are each predicted to contain a single R/M system (Table 3).
Regarding genes that encode (predicted) adhesion factors, a type IVb tight adherence (tad) locus was previously characterized in B. breve UCC2003, and its presence was also observed in all other B. breve strains with a high degree of similarity (98–100% in BLASTP alignment; Additional file 7: Figure S4, panel a). In contrast to the Tad-like genes, the analyzed B. breve genomes were shown to contain a varying number of sortase-dependent pilus-encoding loci: B. breve UCC2003, B. breve NCFB 2258 and B. breve JCM 7017 contain three sortase-dependent pili loci (designated pil1, pil2 and pil3), while B. breve JCM 7019, B. breve 12L, B. breve 689b, B. breve ACS-071-V-Sch8b and B. breve S27 contain only two (pil1 and pil2; Table 3) (Additional file 7: Figure S4, panel b). In most cases (with the only exception of B. breve 12L, where the clusters also appear to lack a dedicated sortase-encoding gene), an apparent frameshift within a 10–11 guanine nucleotide stretch in the first surface protein-encoding gene of pil1 and pil3 was present, a phenomenon previously observed for B. breve UCC2003, as well as for B. bifidum PRL2010 [9, 25].
As in the previous CGH work performed on 18 B. breve isolates, genetic variability among the analysed representatives of this group was observed for genes previously characterized as being involved in the utilization of the carbohydrates ribose, sucrose and raffinose, as well as the plant-derived polysaccharides starch, galactan and cellodextrin.
Interestingly, in all analysed strains, genes are present that are predicted to encode enzymes involved in the uptake and utilization of host-derived mono/oligosaccharides, in particular mucin and Human Milk Oligosaccharides (HMOs). Examples of this include gene clusters predicted to be involved in the metabolism of sialic acid (Bbr_0160-73 and Bbr_1247), lacto-N-biose through a Leloir-like metabolic pathway (Bbr_1587, Bbr_0491, Bbr_1884 and Bbr_1585), fucose (Bbr_1740-42) and N-linked glycans (Bbr_1141-50). Although B. breve is not known to be able to grow on mucin or HMOs [35, 36], host-derived mono/oligosaccharides may become available through hydrolytic activities of other (bifido)bacteria present in the gut (e.g. B. bifidum PRL2010 and B. longum subsp. infantis), allowing B. breve strains to utilize such liberated carbohydrates through a phenomenon called cross-feeding.
Comparing carbohydrate fermentation profiles of nine B. breve strains (B. breve UCC2003, B. breve 689b, B. breve 12L, B. breve 2L, B. breve 31L, B. breve NCFB 2258, B. breve JCM 7017, B. breve JCM 7019 and B. breve S27) revealed that all strains are able to ferment a common set of sugars, such as glucose, lactose, lactulose, maltose and raffinose. In contrast, fermentation capabilities for other sugars, such as galactan, sucrose, pullulan, amylopectin, starch, maltodextrins, sorbitol, mannitol, fructose, melezitose, cellobiose, xylose and ribose, were shown to be variable among the strains tested. None of the B. breve strains assayed here was shown to be capable of utilizing inulin, arabinose, maltulose, mannose, trehalose or galactose (Figure 4, panel a).
In bifidobacteria, genes involved in the utilization of a given sugar are frequently organized in gene clusters containing genes that encode one or more specific GHs and associated transport system, and are frequently placed under the transcriptional control of a LacI-type regulator specified by a gene that is also located adjacent to or within such a gene cluster [27–32].
Gene-trait matching with functions resulting from hierarchical clustering analysis
Ribose transport system permease protein rbsD
Conserved hypothetical membrane spanning protein
pfkB family carbohydrate kinase
NADH-dependent butanol dehydrogenase 1
Inosine-uridine preferring nucleoside hydrolase
pfkB family carbohydrate kinase
N-(5’-phosphoribosyl) anthranilate isomerase
Cobalt transport protein cbiQ
Conserved hypothetical membrane spanning protein
Amylosucrase or alpha-glucosidase
Glycosyl hydrolases family 53, Endogalactanase, galactan metabolism
Conserved hypothetical protein, PhoU-like domain
Narrowly conserved hypothetical membrane spanning protein, MFS superfamily
Narrowly conserved hypothetical membrane spanning protein
Conserved hypothetical membrane spanning protein
Raffinose synthase or seed imbibition protein Sip1/Alpha-galactosidase
Glycosyl hydrolases family 32, Beta-fructosidase or sucrose-6-phosphate hydrolase
Cellodextrin binding protein
AraC family transcriptional regulator
Putative glyoxalase family protein
Sugar isomerase (SIS)
Putative ABC transporter, permease protein
ABC transporter, permease protein
ABC transporter, permease protein
Putative ROK family protein
Genome sequencing of eight B. breve strains and comparative analysis of these genomes, combined with five additional, publicly available B. breve genomes, allowed the description of the pan-genome of the B. breve species, which was shown to follow an essentially closed trend. As pan-genomic analysis was only recently introduced for the description of bacterial species, its application is still somewhat controversial and subject to scientific scrutiny. One clear limitation of pan-genome analysis is the difficulty of assessing whether a pan-genome is really closed or not, given the dynamic nature of a given bacterial population and its associated tendency to evolve and exchange genetic material. In general it can be said that a closed pan-genome implies that the gene exchange within a species is low and this certainly seems to be the case for the thirteen B. breve genomes analyzed here.
Moreover, the in silico prediction of ORFs with a deviating G+C mol% content, coupled with a comparative genomics analysis, allowed the identification of eight genomic regions of variability in the B. breve pan-genome, representing approximately 30% of the total gene content within the B. breve species and containing a large fraction of ORFs that have been acquired by HGT (which constitute 10% of the total gene content in B. breve). Apart from hypothetical proteins and mobile elements, the gene functions contained within these variable regions are predicted to be required for environmental niche adaptation of this group. For a gut commensal the process of colonization involves cell-cell and cell-host interactions (involving for example genes that encode sortase-dependent pili for adhesion, biofilm formation and cell aggregation [8, 9]) and evasion of the host adaptive immune response (requiring genes specifying the biosynthesis of an exopolysaccharide capsule), as well as metabolic flexibility to acquire energy from a variety of carbon sources independent of the age of the host (when the host shifts from a milk-only diet to a diversity of solid foods, thus explaining the predicted capacity of producing a wide variety of GHs). Furthermore, evolutionary pressures to resist invasion of foreign DNA (e.g. phages and plasmids) also appear to provide an explanation for the presence of CRISPR, CRISPR-associated genes, as well as R/M systems in the variable regions of B. breve.
The comparative genomics approach used in this study also facilitated the explanation of certain differences previously observed in carbon source utilization among some B. breve members and allowed the definition of a new cluster responsible for the fermentation of the sugar alcohol sorbitol.
For this reason the in silico analysis presented in this study represents a robust starting point for future functional genomics investigations focusing on (individual members of) this bifidobacterial species, in order to elucidate the spectrum of functions and mechanisms of interaction with the host environment to explain the presence of these bacteria in the human gut and the reported beneficial effects on their host.
Genome sequencing and data assembly
All genomes used in this study are human isolates of B. breve. B. breve UCC2003 was isolated and sequenced as part of a previous study; B. breve 12L, B. breve 2L, B. breve 31L, B. breve 689b and B. breve S27 were isolated from human milk and breast-fed infant faeces as previously described; B. breve JCM 7017 and B. breve JCM 7019 were obtained from the Japan Collection of Microorganisms, and B. breve NCFB 2258 from the National Collection of Food Bacteria. All genomes were sequenced using one or more Next Generation Sequencing (NGS) platforms. In order to construct an initial scaffold backbone, reads were first obtained using a 454 Roche genome sequencer FLX Titanium instrument employing a long-tag, paired-end library (average read length of 400 bp).
The genomes of B. breve 689b, B. breve 12L and B. breve S27 were sequenced using a Roche 454 FLX Titanium instrument by the commercial sequencing service providers Agencourt Bioscience (Beverly, MA) and Eurofins MWG Operon (Germany) and then assembled, after which remaining gaps were closed using Sanger sequencing of PCR products. The obtained consensus genome sequence represented an approximately 30-fold overall coverage, where any remaining low quality regions or other sequence conflicts were resolved using additional Sanger sequencing of PCR products. Assembly was performed using Newbler v2.6 (http://454.com/products/analysis-software/index.asp) and Gap4 (Staden package v1.6.0; http://sourceforge.net/projects/staden/).
In the case of the genomes of B. breve NCFB 2258, B. breve JCM 7017 and B. breve JCM 7019, sequences were obtained using a combination of the aforementioned 454 FLX Titanium and Illumina HiSeq 2000 sequencing platforms, both performed by Macrogen (Seoul, Republic of Korea) (using paired-end libraries with average read lengths of 450 bp and 101 bp, respectively). The obtained sequences were assembled employing a hybrid assembly using a combination of Newbler v2.6 (http://454.com/) for long reads and Abyss v1.3.4 (http://www.bcgsc.ca/) for short reads, resulting in a 200-fold coverage. Any remaining gaps and quality issues were resolved using Sanger sequencing of PCR products.
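The fold-coverage figures quoted in this section follow from simple arithmetic: total sequenced bases divided by genome length. The sketch below illustrates that relationship under the simplifying assumption that all reads share one mean length; the function names are hypothetical, and the read count it returns is a back-of-the-envelope estimate, not a number reported by the study.

```python
import math


def fold_coverage(n_reads, mean_read_len, genome_len):
    """Average depth of coverage = total sequenced bases / genome length."""
    return n_reads * mean_read_len / genome_len


def reads_needed(target_coverage, mean_read_len, genome_len):
    """Smallest number of reads that reaches the target average coverage."""
    return math.ceil(target_coverage * genome_len / mean_read_len)


# Illustration: 101 bp reads on a 2,288,919 bp chromosome (the JCM 7017 size
# given in the text), pretending that all coverage came from short reads.
n = reads_needed(200, 101, 2_288_919)
```

In a hybrid assembly the long and short reads each contribute part of the total, so the per-platform contributions would be summed before dividing by the genome length.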
Finally, draft genome sequencing of B. breve 2L and B. breve 31L was carried out by GenProbio Ltd. (http://genprobio.com), combining the output of two runs of Ion Torrent PGM (Life Technologies, Germany) following a previously described protocol and reaching a coverage of approximately 50-fold. The obtained raw data were assembled using MIRA v.3.9 (http://www.chevreux.org/projects_mira.html), applying default parameters recommended for Ion Torrent data processing. All reads were quality checked and trimmed in order to improve the assembly process; this quality check/trimming step was performed for both 454 FLX and Illumina data using NGSQCToolkit v.2.3. For Ion Torrent reads the pre-processing step was performed using a built-in function of the MIRA assembler software (v3.9) (http://www.chevreux.org/).
General features prediction
Open Reading Frame (ORF) prediction was performed with a combined approach of the predictor Prodigal v2.0 (http://prodigal.ornl.gov) and BLASTX v2.2.26 alignment for all the genomes analysed in this study; identified ORFs were then automatically annotated on the basis of BLASTP v2.2.26 analysis using B. breve UCC2003 as the reference genome (NCBI Reference Sequence: NC_020517.1). Functional assignment was performed and manually edited based on similarity searches against the non-redundant protein database curated by the National Centre for Biotechnology Information (ftp://ftp.ncbi.nih.gov/blast/db/).
Artemis v.14 (http://www.sanger.ac.uk/resources/software/artemis/) was used to combine and inspect the results of the ORF finder and the associated BLASTP results, while this software tool was also used for manual editing, where necessary, of the start codon of a predicted gene. Where appropriate, annotations were further refined, verified or adjusted using information retrieved from alternative databases, e.g. Uniprot/EMBL (http://www.uniprot.org/), protein family (Pfam) (http://pfam.sanger.ac.uk) and COGs.
Transfer RNA genes were identified using tRNAscan-SE v1.4, and ribosomal RNA genes were detected on the basis of Rnammer v1.2 and BLASTN v2.2.26 searches and annotated manually. Insertion sequence elements were identified and assigned using IS finder (https://www-is.biotoul.fr) and BLAST v2.2.26 and annotated manually. Carbohydrate-active enzymes were identified based on similarity to the carbohydrate-active enzyme (CAZy) database entries, Enzyme Commission numbers (http://enzyme.expasy.org) and Pfam alignments (http://pfam.sanger.ac.uk), and this combined information was used for manual annotation purposes.
Deviations in G+C mol% were computed based on the ORF nucleotide sequences using the Geecee function from the EMBOSS v6.5.7 package.
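To make the idea of a deviating G+C mol% concrete, the sketch below flags ORFs whose G+C percentage differs from the genome-wide average by more than a chosen margin. It is a plain-Python stand-in, not the EMBOSS Geecee program; the 10-point threshold, the function names and the toy sequences are assumptions made for illustration, while the 58% genome average is the value given in the text.

```python
def gc_percent(seq):
    """G+C percentage of a nucleotide sequence, ignoring non-ACGT symbols."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    return 100.0 * gc / acgt if acgt else 0.0


def gc_deviating_orfs(orfs, genome_gc, threshold=10.0):
    """Names of ORFs whose G+C% differs from genome_gc by more than threshold.

    `threshold` (in percentage points) is an assumed, illustrative cut-off.
    """
    return sorted(
        name for name, seq in orfs.items()
        if abs(gc_percent(seq) - genome_gc) > threshold
    )


# Toy ORFs (hypothetical sequences): 80%, 60% and 20% G+C respectively
toy_orfs = {"orf_a": "GCGCGCATGC", "orf_b": "GCGCGCATAT", "orf_c": "ATATATATGC"}
flagged = gc_deviating_orfs(toy_orfs, genome_gc=58.0)
```

ORFs flagged in this way are only candidates for horizontally acquired DNA; the text combines this signal with comparative genomics before drawing conclusions.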
All genome sequences were searched for the presence of Restriction/Modification systems using a BLASTP alignment function of the REBASE database (http://rebase.neb.com/rebase/rebase.html) (cut-off E-value of 0.0001, and showing at least 30% similarity over at least 80% of the sequence length).
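The cut-offs stated above can be expressed as a simple filter over alignment records. The following is a minimal sketch, assuming each hit has already been parsed into a dictionary; the keys (`evalue`, `similarity`, `align_len`, `seq_len`), the function name and the toy hits are hypothetical and do not correspond to a specific REBASE or BLAST output format.

```python
def passes_cutoffs(hit, max_evalue=1e-4, min_similarity=30.0, min_coverage=0.80):
    """Apply the three cut-offs quoted in the text to one alignment record."""
    coverage = hit["align_len"] / hit["seq_len"]
    return (
        hit["evalue"] <= max_evalue
        and hit["similarity"] >= min_similarity
        and coverage >= min_coverage
    )


# Toy records (invented values, for illustration only)
toy_hits = [
    {"id": "hit_1", "evalue": 1e-30, "similarity": 45.0, "align_len": 270, "seq_len": 300},
    {"id": "hit_2", "evalue": 1e-30, "similarity": 45.0, "align_len": 150, "seq_len": 300},
    {"id": "hit_3", "evalue": 0.01, "similarity": 90.0, "align_len": 300, "seq_len": 300},
    {"id": "hit_4", "evalue": 1e-10, "similarity": 25.0, "align_len": 290, "seq_len": 300},
]
kept = [h["id"] for h in toy_hits if passes_cutoffs(h)]
```

Only the first toy hit survives: the second fails on coverage, the third on E-value and the fourth on similarity.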
Whole-genome sequence alignments were performed at the DNA level using the software package MUMmer v3.0. Sequence comparisons at protein level were performed using an all-against-all, bi-directional BLAST alignment (cut-off: E-value 0.0001, with at least 50% identity across at least 50% of either protein sequence), and the resulting output was then clustered into protein families sharing the same function using the Markov Cluster Algorithm (MCL) implemented in the mclblastline pipeline v12-0678. The obtained gene families were classified as belonging to either the core or to the dispensable genome based on their presence in either all strains or in a subset of the investigated strains, respectively. In the orthologues extraction an additional filter for paralogues was applied by selecting only those families that were shown to contain a single protein member for each genome. Proteins identified as belonging to the mobilome, such as IS elements or phages, were also discarded from this pool of genes and orthologues were then functionally classified using COG category assignments.
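The classification rules described above (core versus dispensable, plus the single-member filter for orthologues) can be sketched as follows. This assumes the clustering step has already produced gene families as lists of (genome, protein) pairs; the function name, the category labels and the toy families are hypothetical, and the MCL clustering itself is not reproduced here.

```python
from collections import Counter


def classify_families(families, genomes):
    """Label each gene family by its distribution across genomes.

    families: dict mapping family id -> list of (genome, protein_id) pairs
    genomes:  list of all genome names under comparison
    """
    labels = {}
    for fam, members in families.items():
        per_genome = Counter(genome for genome, _ in members)
        if len(per_genome) == len(genomes):
            # Present in every genome: core; single copy everywhere: orthologue
            single_copy = all(n == 1 for n in per_genome.values())
            labels[fam] = "core_orthologue" if single_copy else "core_with_paralogues"
        elif len(per_genome) == 1:
            labels[fam] = "unique"       # specific to just one strain
        else:
            labels[fam] = "dispensable"  # shared by a subset of strains
    return labels


# Toy example with three hypothetical genomes
genomes = ["A", "B", "C"]
families = {
    "fam1": [("A", "a1"), ("B", "b1"), ("C", "c1")],
    "fam2": [("A", "a2"), ("A", "a3"), ("B", "b2"), ("C", "c2")],
    "fam3": [("A", "a4"), ("C", "c3")],
    "fam4": [("B", "b3")],
}
labels = classify_families(families, genomes)
```

Counting the labels over all families would give totals comparable in kind to the core, dispensable and unique figures reported in the Results.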
The supertree was computed from the alignment of a set of orthologous genes obtained using the same BLAST-based comparative approach described above (Additional file 8: Table S3).
Each protein family was aligned using CLUSTAL_W v1.83. Phylogenetic trees were computed by maximum likelihood in PhyML v3.0 and concatenated; the resulting consensus tree was computed with the majority rule method using the Consense module of the Phylip package v3.69 (http://evolution.genetics.washington.edu/phylip.html), and phylogenetic data were submitted to the TreeBASE database (http://treebase.org/treebase-web/home.html).
For the available B. breve genomes, the pan-genome was computed using PGAP v1.0, which performs this analysis according to the Heaps' law pan-genome model; the ORF content of each genome is organized into functional gene clusters using the GF (Gene Family) method, and a pan-genome profile is then built.
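To illustrate what a Heaps' law fit involves, the sketch below fits the commonly used power-law form n(N) = κ·N^(−α), where n is the number of new genes found when the N-th genome is added. In this formulation an exponent α > 1 is usually read as a closed pan-genome and α ≤ 1 as an open one. This is a generic log-log least-squares fit on invented data, not PGAP's own fitting procedure; the function name `fit_power_law` and the numbers are assumptions made for the example.

```python
import math
from typing import Sequence, Tuple


def fit_power_law(
    genomes: Sequence[int], new_genes: Sequence[float]
) -> Tuple[float, float]:
    """Fit n = kappa * N**(-alpha) by linear regression in log-log space.

    Returns (kappa, alpha). All inputs must be strictly positive.
    """
    xs = [math.log(g) for g in genomes]
    ys = [math.log(n) for n in new_genes]
    m = len(xs)
    mean_x = sum(xs) / m
    mean_y = sum(ys) / m
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # log n = log kappa - alpha * log N, so alpha is the negated slope
    return math.exp(intercept), -slope


# Synthetic data following n = 200 * N**(-1.5); not values from the study
genomes = list(range(2, 14))
new_genes = [200.0 * g ** -1.5 for g in genomes]
kappa, alpha = fit_power_law(genomes, new_genes)
pan_genome_closed = alpha > 1.0
```

With real data the new-gene counts would typically be averaged over many random orderings of the genomes before fitting, since the count depends on the order in which genomes are added.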
Carbohydrate fermentation profiles
To investigate their carbohydrate-utilization capabilities, the nine B. breve strains that were available to us (B. breve UCC2003, B. breve 689b, B. breve 12L, B. breve 2L, B. breve 31L, B. breve 2258, B. breve JCM 7017, B. breve JCM 7019 and B. breve S27) were experimentally tested for growth on 24 different carbohydrates (glucose, lactose, lactulose, maltose, raffinose, galactan, sucrose, pullulan, amylopectin, starch, maltodextrin, sorbitol, mannitol, fructose, melezitose, cellobiose, inulin, arabinose, maltulose, mannose, trehalose, galactose, xylose and ribose). The strains were grown in modified Rogosa medium supplemented with a given carbohydrate (final concentration 0.5%), and optical densities (OD at 600 nm) were recorded at regular intervals over 24 hours. To evaluate the phenotypic patterns of these strains, a lower-limit OD of 0.3 was used as a cut-off value to discriminate between carbohydrates that did or did not support growth of a given strain. An in silico gene-trait matching exercise was then performed to correlate each observed carbohydrate-linked growth phenotype with the presence or absence of particular genes. For this analysis, all shared gene families obtained from the comparative genomic analysis described above were organized into 51 clusters according to their presence in each strain. Subsequently, all the data (phenotypic and genomic) were binarized and compared on an individual basis. The resulting matching distances were then reported in a heatmap and manually inspected with the additional support of the Pfam database (http://pfam.sanger.ac.uk).
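The binarization and matching steps can be sketched as follows. The 0.3 OD cut-off comes from the text; everything else is an assumption made for illustration, including the use of a single OD value per strain, treating an OD equal to the cut-off as growth, the choice of Hamming distance as the matching distance, and all strain, sugar and cluster names.

```python
from typing import Dict, List, Tuple

OD_CUTOFF = 0.3  # lower-limit OD600 used in the text to call growth


def binarize_growth(
    od_by_strain: Dict[str, float], cutoff: float = OD_CUTOFF
) -> Dict[str, int]:
    """Return 1 where the OD reaches the cut-off, else 0 (assumed >=)."""
    return {strain: int(od >= cutoff) for strain, od in od_by_strain.items()}


def hamming_distance(a: List[int], b: List[int]) -> int:
    """Count positions at which two equal-length binary profiles differ."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same length")
    return sum(x != y for x, y in zip(a, b))


def match_traits(
    growth: Dict[str, Dict[str, int]],
    presence: Dict[str, Dict[str, int]],
    strains: List[str],
) -> Dict[Tuple[str, str], int]:
    """Distance between every growth profile and every gene-cluster profile."""
    distances = {}
    for sugar, g in growth.items():
        g_vec = [g[s] for s in strains]
        for cluster, p in presence.items():
            p_vec = [p[s] for s in strains]
            distances[(sugar, cluster)] = hamming_distance(g_vec, p_vec)
    return distances


# Toy data with invented strains and clusters
strains = ["strain_A", "strain_B", "strain_C"]
od = {"sorbitol": {"strain_A": 0.9, "strain_B": 0.1, "strain_C": 0.6}}
growth = {sugar: binarize_growth(values) for sugar, values in od.items()}
presence = {
    "cluster_1": {"strain_A": 1, "strain_B": 0, "strain_C": 1},
    "cluster_2": {"strain_A": 1, "strain_B": 1, "strain_C": 1},
}
distances = match_traits(growth, presence, strains)
```

A distance of zero means that the gene cluster is present in exactly those strains that grow on the carbohydrate, which is the pattern one would inspect first in a heatmap of such distances.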
Construction of B. breve JCM 7017 insertion mutant
To verify our predictions from the gene-trait matching, an insertion mutant was created in the alcohol dehydrogenase-encoding gene of the predicted sorbitol utilization gene cluster of B. breve JCM 7017. An internal fragment of open reading frame B7017_1848 (corresponding to codons 78 through 175 of the 335 codons present in B7017_1848) was amplified by PCR using B. breve JCM 7017 chromosomal DNA as a template and the primer pair IM1848F (5’-CCTACAAGCTTCAGAAGTCACCAACGTCAAG-3’) and IM1848R (5’-CGATGCTCTAGAGATTCCGGCAAGATCCACCTG-3’). The insertion mutation was generated as described previously to produce B. breve JCM7017-1848. Site-specific recombination in potential Tet-resistant mutant isolates was confirmed by colony PCR using the primer combination tetWFw (5’-ATGCTCATGTACGGTAAG-3’) and tetWRv (5’-CATTACCTTCTGAAACATA-3’) to verify tetW gene integration, and primer 1848-F (5’-GCTCCGCTGCCGCAGTTCCG-3’, positioned upstream of the selected internal fragment of B7017_1848) in combination with tetWFw to confirm integration at the correct chromosomal location.
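As a small worked check of the coordinates given above, the codon range of the internal fragment can be converted to nucleotide positions within the ORF. The sketch assumes 1-based numbering with codon 1 as the first codon of B7017_1848; the function name is hypothetical, and the result covers only the gene-internal portion, not the additional bases contributed by the primer tails.

```python
from typing import Tuple


def codon_range_to_nt(first_codon: int, last_codon: int) -> Tuple[int, int, int]:
    """Convert an inclusive, 1-based codon range to (nt_start, nt_end, length)."""
    if first_codon < 1 or last_codon < first_codon:
        raise ValueError("invalid codon range")
    nt_start = (first_codon - 1) * 3 + 1  # first base of the first codon
    nt_end = last_codon * 3               # last base of the last codon
    return nt_start, nt_end, nt_end - nt_start + 1


# Codons 78 through 175, as stated in the text
nt_start, nt_end, length = codon_range_to_nt(78, 175)
```

Under these assumptions the fragment spans nucleotides 232 to 525 of the ORF, that is 98 codons or 294 bp.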
Nucleotide sequence accession numbers
All sequences generated in this study have been submitted to the GenBank database under the following accession numbers: B. breve 689b [GenBank: CP006715], B. breve 12L [GenBank: CP006711], B. breve 2L [GenBank: AWUG00000000], B. breve 31L [GenBank: AWUF00000000], B. breve NCFB 2258 [GenBank: CP006714], B. breve S27 [GenBank: CP006716], B. breve JCM 7017 [GenBank: CP006712], B. breve JCM 7019 [GenBank: CP006713].
All other sequences used in our analysis were retrieved from the GenBank database under the following accession numbers: B. breve UCC2003 [GenBank: NC_020517], B. bifidum PRL2010 [GenBank: NC_014638], B. breve ACS-071-V-Sch8b [GenBank: NC_017218], B. breve CECT 7263 [GenBank: AFVV01000000], B. breve DPC 6330 [GenBank: AFXX00000000], B. breve DSM 20213 [GenBank: ACCG00000000], B. longum subsp. longum NCC2705 [GenBank: NC_004307], B. longum subsp. longum DJO10A [GenBank: NC_010816], B. longum subsp. longum JCM 1217 [GenBank: NC_015067], B. longum subsp. longum ATCC 15697 [GenBank: NC_017219], B. longum subsp. longum 157 F [GenBank: NC_015052], B. longum subsp. longum BBMN68 [GenBank: NC_014656], B. longum subsp. infantis ATCC 15697 [GenBank: NC_017219], B. adolescentis ATCC 15703 [GenBank: NC_008618], B. dentium Bd1 [GenBank: NC_013714], B. animalis subsp. animalis ATCC 25527 [GenBank: NC_017834], B. animalis subsp. lactis DSM 10140 [GenBank: NC_012815], B. asteroides PRL2011 [GenBank: NC_018720], G. vaginalis ATCC 14019 [GenBank: NC_014644], L. xyli subsp. xyli CTCB07 [GenBank: NC_006087], T. whipplei TW08/27 [GenBank: NC_004551], Lb. plantarum WCFS1 [GenBank: NC_004567], C. difficile 630 [GenBank: NC_009089].
- HMO: Human milk oligosaccharides
- CGH: Comparative genome hybridization
- ORF: Open reading frame
- CRISPR: Clustered regularly interspaced short palindromic repeats
- R/M system: Restriction/modification system
- NGS: Next generation sequencing
- COG: Clusters of orthologous groups
This publication has emanated from research conducted with the financial support of Science Foundation Ireland (SFI) under Grant Numbers 07/CE/B1368, SFI/12/RC/2273 and 08/SRC/B1393. In addition, F.B. was supported by an EMBARK postgraduate fellowship of the Irish Research Council, while M.O.C.M. is a recipient of a HRB postdoctoral fellowship (Grant no. PDTM/20011/9). We thank GenProbio Ltd. for the financial support of the Laboratory of Probiogenomics. D.Z. and C.U.R. were funded by the German Academic Exchange Service and the German Federal Ministry of Education and Research (grant no. D/09/04778). We thank all students and co-workers for their contributions and enthusiasm.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.