- Research article
- Open Access
Analysis of three genomes within the thermophilic bacterial species Caldanaerobacter subterraneus with a focus on carbon monoxide dehydrogenase evolution and hydrolase diversity
© Sant’Anna et al. 2015
- Received: 9 April 2015
- Accepted: 19 September 2015
- Published: 7 October 2015
The Caldanaerobacter subterraneus species includes thermophilic fermentative bacteria able to grow on carbohydrates substrates with acetate and L-alanine as the main products. In this study, comprehensive analysis of three genomes of C. subterraneus subspecies was carried in order to identify genes encoding key metabolic enzymes and to document the genomic basis for the evolution of these organisms.
Average nucleotide identity and in silico DNA relatedness were estimated for the studied C. subterraneus genomes. Genome synteny was evaluated using R2CAT software. Protein conservation was analyzed using mGenome Subtractor. Horizontal gene transfer was predicted through the GOHTAM pipeline (using tetranucleotide composition) and phylogenetic analyses (by maximum likelihood). Hydrolases were identified through the MEROPS and CAZy platforms.
The three genomes of C. subterraneus showed high similarity, although there are substantial differences in their gene composition and organization. Each subspecies possesses a gene cluster encoding a carbon monoxide dehydrogenase (CODH) and an energy converting hydrogenase (ECH). The CODH gene is associated with an operon that resembles the Escherichia coli hydrogenase hyc/hyf operons, a novel genetic context distinct from that found in archetypical hydrogenogenic carboxydotrophs. Apart from the CODH-associated hydrogenase, these bacteria also contain other hydrogenases, encoded by ech and hyd genes. An Mbx ferredoxin:NADP oxidoreductase homolog similar to that originally described in the archaeon Pyrococcus furiosus was uniquely encoded in the C. subterraneus subsp. yonseiensis genome. Compositional analysis demonstrated that some genes of the CODH-ECH and mbx operons present distinct sequence patterns in relation to the majority of the other genes of each genome. Phylogenetic reconstructions of the genes from these operons and those from the ech operon are incongruent to the species tree. Notably, the cooS gene of C. subterraneus subsp. pacificus and its homologs in C. subterraneus subsp. tengcongensis and C. subterraneus subsp. yonseiensis form distinct clades. The strains have diverse hydrolytic enzymes and they appear to be proteolytic and glycolytic. Divergent glycosidases from 14 families, among them amylases, chitinases, alpha-glucosidases, beta-glucosidases, and cellulases, were identified. Each of the three genomes also contains around 100 proteases from 50 subfamilies, as well about ten different esterases.
Genomic information suggests that multiple horizontal gene transfers conferred the adaptation of C. subterraneus subspecies to extreme niches throughout the carbon monoxide utilization and hydrogen production. The variety of hydrolases found in their genomes indicate the versatility of the species in obtaining energy and carbon from diverse substrates, therefore these organisms constitute a remarkable resource of enzymes with biotechnological potential.
- Caldanaerobacter subterraneus
- Horizontal gene transfer
- Carbon monoxide dehydrogenase
Thermophilic bacteria possess diverse adaptations in order to thrive under high temperatures [1, 2]. Therefore, these organisms are sources of potentially useful thermostable proteins, which is promising because of the increasing biotechnological interest in highly thermostable enzymes . Besides, the genomic study of these organisms can provide insights on interesting metabolic features characteristic of these bacteria, like the ability to generate hydrogen gas as metabolic product, a promising renewable fuel. With the advent of high throughput technologies of DNA sequencing, many genomes of thermophilic bacteria are being unraveled (e.g. [4–7], and the in silico analysis of the large amount of generated data is a fundamental initial approach to understand the full potential of these organisms.
Caldanaerobacter subterraneus includes fermentative thermophilic bacteria with relatively low genomic GC content (under 40 %) able to grow on carbohydrate substrates with acetate L-alanine, H2, and CO2 as the main products that have been isolated from a variety of hot environments [8–11]. C. subterraneus subsp. pacificus (formerly known as Carboxydobrachium pacificum) is known to grow on CO hydrogenogenically ; C. subterraneus subsp. tengcongensis–formerly Thermoanaerobacter tengcongensis–and C. subterraneus subsp. yonseiensis (but not C. subterraneus subsp. subterraneus) have been reported to oxidize CO ; however there is no mention if they produce hydrogen from CO. In 2002, the genome of C. subterraneus subsp. tengcongensis was sequenced. A CODH gene cooS was found in the genome and ascribed to the acetogenic Wood-Ljungdahl pathway . However, after this report it was noted that the genome lacks the acetyl-CoA synthase gene, indispensable for this pathway, and the CODH gene is clustered with ECH genes, suggesting that C. subteraneus subsp. tencongensis has the capacity for hydrogenogenic carboxydotrophy . Recently, the genome of C. subterraneus subsp. yonseiensis has also been published , which can contribute to the understanding of the evolution of the metabolic features in this species relative to its sibling strains. Moreover, these genomes constitute helpful resources for cloning and expression of novel enzymes of biotechnological importance (e.g. [1, 2, 15]).
In this study, the genome of C. subterraneus subsp. pacificus was sequenced. This bacterium grows from 50 to 80 °C, and was isolated from a submarine thermal vent in Japan, unlike the other subspecies (C. subterraneus subsp. tengcongensis and C. subterraneus subsp. yonseiensis were isolated from terrestrial high temperature environments, and C. subterraneus subsp. subterraneus strains are oilfield isolates). C. subterraneus subsp. pacificus is known to be able to grow chemolithotrophically on CO, producing H2 and CO2 during growth .
The main objective of this study was to explore the differences among the three genomes by comparative analysis. The analyses were focused on inferring the physiological and evolutionary aspects of these organisms. The role of horizontal gene transfer (HGT) in shaping these three genomes was also evaluated and key metabolic genes and proteins with potential biotechnological application, such as carbon monoxide dehydrogenase, hydrogenases, proteases, glycosidases and esterases were identified.
Phylogeny of the species
C. subterraneus subsp. tengcongensis is known to exhibit an exceptionally high level of sequence divergence among its intragenomic 16S rRNA gene copies (6.7 %) . As demonstrated in Fig. 1, the C. subterraneus subspecies with available genomes exhibit multiple 16S rRNA gene copies that are separated in two main clades (Clade A and Clade B, Fig. 1). This clade separation most probably represents the most ancient gene duplication that occurred before the diversification of subspecies. Considering Clade B, C. subterraneus subsp. pacificus is closer to C. subterraneus subsp. tengcongensis than to C. subterraneus subsp. yonseiensis. This pattern is not evident in Clade A due to the presence of intra-subspecies multiple 16S rRNA genes that interfere with interpretation of the true phylogenetic relationships among these subspecies.
Genomes overview and horizontal gene transfer detection
As expected, the average nucleotide identity (ANI) and in silico prediction of in vitro DNA-DNA hybridization (DDH) values for the genomes of C. subterraneus subspecies confirmed the conclusion of Fardeau et al.  about the affiliation of the C. subterraneus subspecies within the same species and once more showed the closest proximity of C. subterraneus subsp. pacificus to C. subterraneus subsp. tengcongensis (ANI value was 98.8 % and DDH value 85 %) than to C. subterraneus subsp. yonseiensis (ANI value was 98.0 % and DDH value was 80 %).
Overview of C. subterraneus genomes
C. subterraneus subsp. pacificus
C. subterraneus subsp. tengcongensis
C. subterraneus subsp. yonseiensis
Genome size (Mb)
Genome GC content (%)
Number of contigs
1 (complete chromosome)
962 (38.31 %)
855 (33.04 %)
836 (30.84 %)
Average gene length
rRNA average GC content (%)
tRNA average GC content (%)
Number of horizontally transferred CDSsb
173 (6.88 %)
121 (4.67 %)
127 (4.68 %)
Pacific Ocean hot vents
Terrestrial hot spring
Geothermal hot stream
Genes that putatively could have been acquired via horizontal transfer were identified in all three genomes. In C. subterraneus subsp. pacificus and C. subterraneus subsp. yonseiensis, most of the putative horizontally transferred genes correspond to hypothetical proteins, 99 of 173 (57.2 %), and 75 of 127 (59.1 %), respectively. C. subterraneus subsp. tengcongensis presents a lower proportion of hypothetical genes that could have been horizontally transferred, 55 of 121 CDSs (45.5 %). Also, some of the xenologous CDSs are transposases (9 in C. subterraneus subsp. tengcongensis, 8 in C. subterraneus subsp. pacificus, and 5 in C. subterraneus subsp. yonseiensis).
CODH dehydrogenase and Hyf/Hyc hydrogenase
Carbon monoxide dehydrogenases (CODH) are enzymes that catalyze the interconversion of CO and CO2, and they vary in their functional roles in the cell [18, 19]. All three C. subterraneus subspecies examined in this study possess a cooS gene encoding a CODH that is upstream of a hydrogenase gene cluster, with an invariant gene order identical to that found in Geobacillus thermoglucosidans strains .
Although Bao et al.  suggested that the CODH could also be utilized to fix carbon throughout the Wood-Ljungdahl pathway in C. subterraneus subsp. tengcongensis, it is improbable, because the genome of this strain does not present a gene putatively encoding for the key enzyme acetyl coenzyme A synthase (acsB) . As well, this gene was not found in the other two subspecies of C. subterraneus suggesting that the same argument can be applied to all these three subspecies, which indicates a limitation for the use of carbon monoxide or carbon dioxide as a carbon source. This contrasts to the capabilities of other thermophilic, CO-utilizing, hydrogenogenic or acetogenic Firmicutes (e.g., C. hydrogenoformans  and M. thermoacetica ) (Fig. 3).
Considering the species 16S rRNA tree as reference (Fig. 1), this result was unexpected. The CODH of C. subterraneus subsp. pacificus clusters with the homologous counterpart of Thermoanaerobacter sp. YS13 (having 98.8 % identity with this protein), and both are relatively distant from the CODHs from the other C. subterraneus subspecies. Following the classification described in Techtmann et al. , the CODH of C. subterraneus subsp. pacificus belongs to “Clade E”. On the other hand, the CODHs from C. subterraneus subsp. tengcongensis and C. subterraneus subsp. yonseiensis are immersed in “Clade F” (Fig. 4) and are rather closely related to the homologous proteins from Geobacillus thermoglucosidans strains (Order Bacillales), which are relatively distant from Caldanaerobacter (Order Clostridiales) (Fig. 1). However, despite their affiliation with the same clade, the CODHs of C. subterraneus subsp. tengcongensis and C. subterraneus subsp. yonseiensis are distant enough from each other to exclude their vertical inheritance from the LCA (last common ancestor) of these subspecies. While most of C. subterraneus subsp. yonseiensis proteins have H-values higher than 0.95 in relation to the C. subterraneus subsp. tengcongensis proteins (Fig. 2 and Additional file 2: Table S1), the CODH (O163_06470) has a H-value of 0.78 (Additional file 2: Table S1). These results suggest that the CODH evolutionary history of C. subterraneus includes several recent HGT events.
Taking these observations into account, all genes of the CODH-hydrogenase gene cluster were investigated for HGT by means of detection of phylogenetic discrepancies and parametric methods. Becq et al.  showed tetranucleotide composition and codon usage analyses had mean specificity of 87.8 and 89.2 %, and mean sensitivity of 77.2 and 91.5 %, respectively, when these methods were tested with artificial genomes. These values vary depending on intrinsic genome characteristics of the recipient organism, and on the origin of the HGT (e.g. a DNA sequence from a phylogenetically close related donor can be poorly detected by these methods). The greatest advantage of parametric methods is they do not rely on sequence data banks as phylogenetic approaches do . However, phylogenetic reconstruction is necessary to infer historical events from sequences .
Parametric analysis of nucleotide composition revealed that some CODH-hydrogenase genes from G. thermoglucosidans and prominently those from C. subterraneus subspecies have differential sequence patterns in relation to the “standard” gene sequence pattern of each genome (Fig. 3 and Additional file 2: Table S1), which suggests that these genes could have been acquired by horizontal transfer. The presence of a transposase gene downstream the CODH-hydrogenase gene cluster in C. subterraneus subsp. pacificus represents an additional evidence supporting this hypothesis (Fig. 3).
The close relationship of the hyf/hyc hydrogenase gene cluster between C. subterraneus and G. thermoglucosidans strains was confirmed by phylogenetic analyses. Apparently, these hyc and hyf genes shared common evolutionary histories and were acquired together as a cluster, not individually (Fig. 3 and Additional file 2: Table S1). Considering the phylogenetic distance between Geobacillus and C. subterraneus, the high identity levels between their CODH-hydrogenase proteins (~70–80 %) can hardly be interpreted as a result of vertical inheritance from the LCA but are rather a result of acquisition of the cluster by the C. subterraneus lineage via HGT or its acquisition by both C. subterraneus and G. thermoglucosidans lineages from the same source or sources related to each other.
In general, in most of the phylogenetic trees of the CODH-hydrogenase gene cluster, the three subspecies of C. subterraneus form monophyletic clades with Thermoanaerobacter sp. YS13 (Additional file 3: Figure S2 and Additional file 4: Figure S3), with the exception of cooF and cooS genes (Fig. 4). In fact, the hyc and hyf genes of Thermoanaerobacter sp. YS13 have identity values higher than 96 % at the nucleotide level when compared to the C. subterraneus subsp. tengcongensis orthologous genes. These observations and the presence of genes implicated with transposition within its CODH-hydrogenase gene cluster (Fig. 4) support the idea that this strain inherited the CODH-hydrogenase gene cluster from C. subterraneus. Since C. subterraneus subsp. pacificus is closer to C. subterraneus subsp. tengcongensis than any other organism in most phylogenetic reconstructions (Fig. 1 and Additional file 4: Figure S3) and gene comparisons (hyfCDEGHI and hycH are identical at nucleotide sequence level), it is likely that the gene transfer event occurred before the diversification of these subspecies. This observation implies that most probably their LCA harbored a “Clade E” CODH (as C. subterraneus subsp. pacificus). In spite of our contention that the hyf/hyc hydrogenase gene cluster was acquired as a cluster, not as individual genes, there is evidence that the cooF and cooS genes from C. subterraneus have distinct evolutionary histories with respect to the other genes of the cluster. An important point is that the GC contents of cooF and cooS genes from C. subterraneus subsp. tengcongensis (ca. 57 and 61 % respectively) and C. subterraneus subsp. yonseiensis (ca. 51 and 49 % respectively) are much higher than the mean gene GC content of these genomes (ca. 38 %) (Additional file 2: Table S1). Together with the above-mentioned considerable differences in the amino acid and nucleotide sequence patterns, the GC content data suggests that the CODHs of C. subterraneus subsp. tengcongensis and C. subterraneus subsp. yonseiensis were acquired recently via independent HGT events from prokaryotes having a higher genomic GC content after the diversification of the subspecies.
An important point to consider regarding the CODH tree is that its composition is very diverse taxonomically and does not reflect properly species relationships, indicating that HGT played an important role in the current distribution of carboxydotrophy among prokaryotes. Independent studies already pointed out that HGT of the cooS gene likely took place in several thermophilic species [29, 33, 34]. Despite the fact that the donor and acceptor organisms in these instances may be phylogenetically remote, they are usually able to grow in anaerobic environments at similar ranges of temperature and pH [33, 35]. The acquisition of new physiological characteristics would putatively allow the recipient organisms to be recruited to new thermophilic consortia, and consequently, the horizontal transference of important genes for adaptation to specialized niches would be facilitated.
Our analysis revealed at least four recent independent HGT events in the evolutionary history of the CODH-hydrogenase gene cluster: (1) cooS and cooF replacement in C. subterraneus subsp. tengcongensis; (2) cooS and cooF replacement in and C. subterraneus subsp. yonseiensis; (3) transfer of the cluster as whole to a recent ancestor of Thermoanaerobacter sp. YS13 from a common ancestor of C. subterraneus subsp. pacificus and C. subterraneus subsp. tengcongensis; (4) cooS. cooF, and cooC transfer from C. subterraneus subsp. tengcongensis to Thermodesulfobacterium thermophilum.
Besides the CODH-associated hydrogenase, C. subterraneus sbsp. tengcongensis harbors additional hydrogenases, a NiFe hydrogenase (encoded by ech genes) and a NADH-dependent Fe-only hydrogenase (encoded by hyd genes), which putatively catalyze the production of H2 from excess of reducing equivalents formed during the fermentation of saccharides at low p(H2) . The genes encoding these enzymes have been identified in C. subterraneus subsp. pacificus (although some ech are incomplete in the current genome assembly) and C. subterraneus subsp. yonseiensis. As pointed out by Calteau et al.  and Soboh et al. , both hydrogenase genes were wrongly assigned as NADH:ubiquinone oxidoreductase genes in C. subterraneus subsp. tengcongensis genome because of automatic annotation process. This error was also introduced to the C. subterraneus subsp. yonseiensis genome, and in the previous version of the genome of C. subterraneus subsp. pacificus. However, in the latest annotation of the genome of C. subterraneus subsp. pacificus, the proper description of these genes has been included.
Homologs of E. coli hyp genes are located upstream the ech genes in C. subterraneus. In E. coli, hyp genes are essential for the maturation of the hydrogenases . The synteny of hyp genes adjacent to ech genes could indicate their role in the maturation of Ech hydrogenase. However, it is noteworthy that possibly Hyp proteins could act on this as well as other hydrogenases, notably the CODH associated hydrogenase, which resembles the E. coli Hyc hydrogenase (Hyd-3), target of HypA and HypC proteins . In M. thermoacetica, the hypABFCDE operon, and in the Geobacillus thermoglucosidans strains investigated in this study, the hypAB genes are located downstream the hyf/hyc operon, which represents additional evidence supporting the probable interaction of their gene products.
Calteau et al.  had suggested that ech genes from an archaeon related to Methanosarcina were transferred horizontally to a C. subterraneus subsp. tengcongensis ancestor, however our analyses throughout parametric methods did not detect divergent sequence patterns in these genes (Additional file 2: Table S1). On the other hand, our phylogenetic analyses for most ech genes (with the exception of echB) demonstrated that C. subterraneus are immersed in the Thermoanaerobacter clade (Additional file 5: Figure S4). Consequently, the C. subterraneus subspecies last common ancestor would not have acquired the ech genes directly from an archaeon, but more likely indirectly through a Thermoanaerobacter species. Therefore, the alternative hypothesis by Calteau et al.  would be in agreement with our observations suggesting an initial transfer of these genes from an archaeon to a bacterial lineage followed by a second bacterium to bacterium transference. The hyp and hyd genes showed characteristics expected for this species in accordance to sequence composition (Additional file 2: Table S1) and phylogeny (Additional file 6: Figure S5 and Additional file 7: Figure S6).
E. coli has multiple hydrogenases that act differently depending on carbon source availability and on pH . Similarly, hydrogenases from C. subterraneus are expected to be active under different environmental conditions which would increase fitness in a variety of extreme environmental situations and carbon sources that C. subterraneus subspecies encounter in their natural niches .
The Mbx ferredoxin:NADP oxidoreductase
Evidence suggests that the mbx operon could have been transferred from a Thermoanaerobacter species to C. subterraneus subsp. yonseiensis. Calteau et al.  suggested that the mbx genes could have been originally transferred from Archaea to Bacteria. As the case of ech genes, this evolutionary event would have preceded the bacteria-to-bacteria HGTs.
Glycosidases from thermophiles have many industrial and biotechnological applications , thus the wealth of glycosidases in these species motivates detailed study. C. subterraneus subsp. tengcongensis presents 25 glycosidases distributed in 13 families, while C. subterraneus subsp. yonseiensis harbors 17 glycosidases from 8 families, and most of which have homologous counterparts in C. subterraneus subsp. tengcongensis. C. subterraneus subsp. pacificus have 21 glycosidases from 12 families, and two of them are specific to this subspecies (Additional file 2: Table S1 and Additional file 10: Table S2).
At this time three glycoside hydrolases deduced from C. subterraneus subsp. tengcongensis genome have been biochemically characterized. Two of them are starch-hydrolyzing enzymes, a glucoamylase (TTE1813)  and an alphaglucosidase (TTE0006) , and both have homologs in the other two C. subterrraneus subspecies (Additional file 10: Table S2). Exoglucohydrolases similar to these ones are extensively utilized for the hydrolysis of starch to glucose in industrial processes for food and ethanol production [15, 43]. Additional alphaglucosidases from the GH31 family remaining to be investigated were found in the genomes of C. subterraneus subsp. tengcongensis (TTE1934) and C. subterraneus subsp. pacificus (CDSM653_01802) (Additional file 10: Table S2). These orthologs have 32 % identity with the protein MalA of the archaeon Sulfolobus solfataricus, which has a substrate preference for maltose and maltooligosaccharides . Their neighbor genes are sugar permease genes in both C. subterraneus genomes, and they present different tetranucleotide composition, suggesting a likely horizontal inheritance for these genes (Additional file 2: Table S1).
The third type of identified glycosidase is a cellulase (endoglucanase) (TTE0359) , which was also found in C. subterraneus subsp. yonseiensis (Additional file 10: Table S2). This enzyme is able to break the internal bonds of cellulose, generating glucans of different lengths that are substrate for other enzymes to glucose production. One of these enzymes is the betaglucosidase, which hydrolyzes cellobiose disaccharides to glucose. Putative betaglucosidases from families GH1 and GH3 were found in the genomes of C. subterraneus subsp. tengcongensis and C. subterraneus subsp. pacificus, but the genome of C. subterraneus subsp. yonseiensis only hosts one belonging to the GH3 family (Additional file 10: Table S2). Currently, intensive studies of such enzymes are being carried out, due to their implication in the saccharification of lignocellulosic materials such as sugarcane bagasse for production of biofuel . Also, it is worth noting that four putative enzymes originally annotated as hypothetical in C. subterraneus subsp. tengcongensis genome belong by similarity to the GH18 family of glycosidases, and they were found in the other two C. subterraneus genomes (Additional file 10: Table S2). This family is known by containing chitinases, enzymes that hydrolyze chitin, one of the most common biopolymers in nature . Bacterial chitinases can be utilized as biological control of fungi and insects, but are also suitable for protoplast generation and the treatment of shellfish waste .
Esterases are widely utilized in industry for production of pharmaceuticals, detergents, biodiesel and other compounds [49, 50]. At least five esterases of C. subterraneus subsp. tengcongensis have been biochemically characterized [1, 2, 51–54]. These enzymes share high thermal stability at temperatures above 60 °C, and they use different substrates, as mentioned in Additional file 8: Table S3. Besides these esterases, Levisson et al.  detected through in silico approaches four additional esterases in C. subterraneus subsp. tengcongensis genome (Additional file 8: Table S3). In our study, it was verified that C. subterraneus subsp. yonseiensis possesses homologs of each one of the esterases referenced above. Two of them were not located in C. subterraneus subsp. pacificus genome, however using the LIPABASE proteins as reference, a specific lipase was found (CDSM653_00572). It matched a lipase from Acinetobacter baumannii [EMBL:A3M3C1], but because the coverage was 36 % and identity 31 %, more detailed studies are necessary to evaluate its catalytic properties.
Proteases are ubiquitous to all life forms, with in vivo functions ranging from protein turnover to growth substrate hydrolysis and amino acid acquisition. They have a highly diverse range of applications, such as tenderization of meat, composing detergent formulations, leather processing, molecular biology applications and peptide synthesis .
Around 100 proteases from 50 distinct subfamilies were found in each C. subterraneus genome (Additional file 11: Table S4). Among these proteins, metallo- and serineproteases were the most common. We note that the M42 subfamily proteases were originally annotated as cellulase-like proteins in C. subterraneus subsp. tengcongensis (Additional file 11: Table S4), but it is likely another case of misannotation. Dutoit et al.  verified experimentally that two proteins annotated as cellulases in the Thermotoga maritima and Clostridium thermocellum genomes were actually M42 aminopeptidases.
Although C. subterraneus possesses many proteases, only one peptidase has been already characterized, a serine protease named as tengconlysin (TTE0824) . Therefore the potential of C. subterraneus as a source for proteases is underexploited.
The study of C. subterraneus genomes is important to understand the adaptations allowing them to thrive in extreme habitats, as well as to analyze enzymes with biotechnological potential showing functionality under high temperatures. H2 is an important compound to the chemical industry, and a future clean biofuel . Genomic data indicate that C. subterraneus is able to produce H2 throughout different hydrogenase systems, markedly one associated with a CODH that permits obtaining energy from carbon monoxide, widely available in syngas and other industrial fuel gases. Horizontal gene transfer seems to be an important evolutionary driving force in carboxydotrophy and hydrogenogenesis in this species, abilities that permitted it to survive in niches where multiple inorganic and organic substrates may be available at low concentrations. In this sense, it is also worth noting that these bacteria encode a wide repertoire of hydrolase genes, such as glycosidases, esterases and proteases that act on a wide variety of substrates to provide them with carbon and energy. Therefore, the metabolic versatility of this species makes it a good source to target for novel enzymes with biotechnological potential.
Bacterial strain, genome sequencing, and operon prediction
C. subterraneus subsp. pacificus was isolated from a submarine hot vent in Okinawa Trough . Genome DNA was mainly sequenced and assembled at the J. Craig Venter Institute. Contigs of C. subterraneus subp. pacificus genome were automatically annotated with the xBase platform  using as reference the C. subterraneus subsp. tengcongensis genome. Genes of interest were inspected carefully and had their annotation refined manually. Operons were predicted using DOOR software . This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession ABXP00000000. The version described in this paper is version ABXP02000000.
16S rRNA gene phylogeny
Most of the 16S rRNA gene sequences were retrieved from the SILVA rRNA Database (Additional file 12: Table S5) . Sequences were aligned using SINA software , and gap positions were removed. Phylogenetic reconstructions were performed using the Phylogeny.fr platform  with the maximum likelihood method implemented in the PhyML program (v3.0 aLRT) . For each phylogeny, the GTR (Generalized Time Reversible) substitution model was selected assuming an estimated proportion of invariant sites and 4 gamma-distributed rate categories to account for rate heterogeneity across sites. The gamma shape parameter was estimated directly from the data. Reliability for internal branching was assessed using the aLRT (approximate Likelihood Ratio Test) .
Comparison of C. subterraneus genomes
In addition to the genome sequenced in this study, two genomes of C. subterraneus are publicly available from the following subspecies: C. subterraneus subsp. yonseiensis (AXDC00000000.1) and C. subterraneus subsp. tengcongensis (NC_003869.1). Only the C. subterraneus subsp. tengcongensis genome is complete. Therefore, the following comparative analyses were made using this genome as reference.
The average nucleotide identity (ANI) values (species boundary is 95 %) for the genomes of Caldanaerobacter subterraneus subspecies were determined . The in silico prediction of in vitro DNA-DNA hybridization (DDH) values (species boundary is 70 %) were calculated using GGDC 2.0 BLAST+ and recommended formula 2 .
Synteny plots were generated with the R2CAT software , aligning and ordering the contigs of C. subterraneus subsp. pacificus and C. subterraneus subsp. yonseiensis against C. subterraneus subsp. tengcongensis genome.
mGenome Subtractor  was utilized to compare the conservation of proteins of C. subterraneus subsp. pacificus and C. subterraneus subsp. yonseiensis genomes in relation to those from C. subterraneus subsp. tengcongensis.
The homology score (H-value) between two proteins is the product of the identity level (expressed as a value between 0 and 1) and of the ratio of the match length to query length . Conserved proteins were defined by having a homology score H-value above 0.64.
Horizontal gene transfer analysis
The genomes of C. subterraneus and the genomes from related species of interest were utilized for HGT detection throughout the GOHTAM platform , which detects horizontal gene transfers based on tetranucleotide composition and/or codon usage. GC content of the CDSs was computed using EMBOSS package [72, 73]. Genes containing a GC content higher or lower than two standard deviations from the average CDS GC content for all CDSs in a genome were highlighted.
Other phylogenetic analyses
Amino acid sequences of Caldanaerobacter subterraneus subspecies were utilized as query in blastp searches against the Genbank NR database. The most similar sequences were retrieved. Also, for some proteins (e.g. CooS), homologous counterparts from PDB database were also retrieved. Subsequently, amino acid sequences were aligned using MUSCLE software embedded in MEGA . Sites from the alignment containing gaps were removed. The phylogeny was constructed on the Phylogeny.fr platform basically as described previously above, but this time the WAG substitution model  was utilized.
Genome mining for hydrolases
Predicted proteins of the three subspecies of C. subterraneus were used as queries in blastp searches for glycohydrolase, esterase, peptidase and lipase databases. For protease identification the batch blast tool from MEROPS database  was utilized. For each protein, the hit with the lowest e-value (<10−10) was considered. Glycoside hydrolases were identified using CAT (Cazymes Analysis Tool)  with the following parameters: complete genome and e-value threshold of 10−10. Only glycosidases with domain and length consistency were considered. Esterases homologous to those already identified in C. subterraneus subsp. tengcongesis [1, 2, 51–55] were identified in the other subspecies genomes using as criterion the H-value >0.64. Lipases were searched among the C. subterraneus proteins using the blastp tool embedded in Bioedit  against the lipase database LIPABASE , with an e-value cutoff set to 10−10.
Supporting data are included as Additional files. Phylogenetic data have been deposited at TreeBASE under the accession URL http://purl.org/phylo/treebase/phylows/study/TB2:S18113.
FHS received scholarship from Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES, Brazil), process number 2403-13-7. The work of AVL and TGS was supported by the Russian Scientific Fund grant no. 14-24-00165. JMG acknowledges funding through projects CSD2009-00006, CGL2014-58762-P and GEN2006-26423-E from the Spanish Ministry of Economy and Competitiveness and RNM2529 and BIO288 from the Andalusian Government; Feder funds cofinanced these projects. C. subterraneus pacificus genome sequencing was carried out by the J. Craig Venter Institute through the Microbial Genome Sequencing Project sponsored by The Gordon and Betty Moore Foundation’s Marine Microbiology Initiative. Funding from the mobility programme 003-ABEL-CM-2013 (NILS Science and Sustainability programme, EEA grants) is also acknowledged. FTR acknowledges the support of a US National Science Foundation Grant. FTR acknowledges grant support from the Center for Dark Energy Biosphere Investigations (C-DEBI), an NSF Research Center at the University of Southern California. We acknowledge support of the publication fee by the CSIC Open Access Publication Support Initiative through its Unit of Information Resources for Research (URICI).
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Rao L, Xue Y, Zhou C, Tao J, Li G, Lu JR, et al. A thermostable esterase from Thermoanaerobacter tengcongensis opening up a new family of bacterial lipolytic enzymes. Biochim Biophys Acta. 2011;1814:1695–702.View ArticlePubMedGoogle Scholar
- Moriyoshi K, Koma D, Yamanaka H, Sakai K, Ohmoto T. Expression and characterization of a thermostable acetylxylan esterase from Caldanaerobacter subterraneus subsp. tengcongensis involved in the degradation of insoluble cellulose acetate. Biosci Biotechnol Biochem. 2013;77:2495–8.View ArticlePubMedGoogle Scholar
- De Champdoré M, Staiano M, Rossi M, D’Auria S. Proteins from extremophiles as stable tools for advanced biotechnological applications of high social interest. J R Soc Interface. 2007;4:183–91.PubMed CentralView ArticlePubMedGoogle Scholar
- Pierce E, Xie G, Barabote RD, Saunders E, Han CS, Detter JC, et al. The complete genome sequence of Moorella thermoacetica (f. Clostridium thermoaceticum). Environ Microbiol. 2008;10:2550–73.PubMed CentralView ArticlePubMedGoogle Scholar
- Oehler D, Poehlein A, Leimbach A, Müller N, Daniel R, Gottschalk G, et al. Genome-guided analysis of physiological and morphological traits of the fermentative acetate oxidizer Thermacetogenium phaeum. BMC Genomics. 2012;13:723.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhao Y, Caspers MP, Abee T, Siezen RJ, Kort R. Complete genome sequence of Geobacillus thermoglucosidans TNO-09.020, a thermophilic sporeformer associated with a dairy-processing environment. J Bacteriol. 2012;194:4118.PubMed CentralView ArticlePubMedGoogle Scholar
- Visser M, Worm P, Muyzer G, Pereira IAC, Schaap PJ, Plugge CM, et al. Genome analysis of Desulfotomaculum kuznetsovii strain 17(T) reveals a physiological similarity with Pelotomaculum thermopropionicum strain SI(T). Stand Genomic Sci. 2013;8:69–87.PubMed CentralView ArticlePubMedGoogle Scholar
- Sokolova T, Gonzalez J, Kostrikina N, Chernyh N, Tourova T, Kato C, et al. Carboxydobrachium pacificum gen. nov., sp. nov., a new anaerobic, thermophilic, CO-utilizing marine bacterium from Okinawa Trough. Int J Syst Evol Microbiol. 2001;51:141–9.View ArticlePubMedGoogle Scholar
- Xue Y, Xu Y, Liu Y, Ma Y, Zhou P. Thermoanaerobacter tengcongensis sp. nov., a novel anaerobic, saccharolytic, thermophilic bacterium isolated from a hot spring in Tengcong, China. Int J Syst Evol Microbiol. 2001;51:1335–41.View ArticlePubMedGoogle Scholar
- Kim B, Grote R, Lee D, Antranikian G, Pyun Y. Thermoanaerobacter yonseiensis sp. nov., a novel extremely thermophilic, xylose-utilizing bacterium that grows at up to 85 degrees C. Int J Syst Evol Microbiol. 2001;51:1539–48.View ArticlePubMedGoogle Scholar
- Fardeau M-L, Bonilla Salinas M, L’Haridon S, Jeanthon C, Verhé F, Cayol J-L, et al. Isolation from oil reservoirs of novel thermophilic anaerobes phylogenetically related to Thermoanaerobacter subterraneus: reassignment of T. subterraneus, Thermoanaerobacter yonseiensis, Thermoanaerobacter tengcongensis and Carboxydibrachium pacificum to. Int J Syst Evol Microbiol. 2004;54(Pt 2):467–74.View ArticlePubMedGoogle Scholar
- Bao Q, Tian Y, Li W, Xu Z, Xuan Z, Hu S, et al. A complete sequence of the T. tengcongensis genome. Genome Res. 2002;12:689–700.PubMed CentralView ArticlePubMedGoogle Scholar
- Sokolova TG, Henstra A-M, Sipma J, Parshina SN, Stams AJM, Lebedinsky AV. Diversity and ecophysiological features of thermophilic carboxydotrophic anaerobes. FEMS Microbiol Ecol. 2009;68:131–41.View ArticlePubMedGoogle Scholar
- Lee S-J, Lee Y-J, Park G-S, Kim B-C, Lee SJ, Shin J-H, et al. Draft genome sequence of an anaerobic and extremophilic bacterium, Caldanaerobacter yonseiensis, isolated from a geothermal hot stream. Genome Announc. 2013;1:e00923-13.PubMed CentralPubMedGoogle Scholar
- Zhou C, Xue Y, Ma Y. Enhancing the thermostability of alpha-glucosidase from Thermoanaerobacter tengcongensis MB4 by single proline substitution. J Biosci Bioeng. 2010;110:12–7.View ArticlePubMedGoogle Scholar
- Subbotina IV, Chernyh NA, Sokolova TG, Kublanov IV, Bonch-Osmolovskaya EA, Lebedinsky AV. Oligonucleotide probes for the detection of representatives of the genus Thermoanaerobacter. Microbiology. 2003;72:331–9.View ArticleGoogle Scholar
- Galtier N, Lobry JR. Relationships between genomic G + C content, RNA secondary structures, and optimal growth temperature in prokaryotes. J Mol Evol. 1997;44:632–6.View ArticlePubMedGoogle Scholar
- Ferry JG. CO dehydrogenase. Annu Rev Microbiol. 1995;49:305–33.View ArticlePubMedGoogle Scholar
- Wu M, Ren Q, Durkin AS, Daugherty SC, Brinkac LM, Dodson RJ, et al. Life in hot carbon monoxide: the complete genome sequence of Carboxydothermus hydrogenoformans Z-2901. PLoS Genet. 2005;1:e65.PubMed CentralView ArticlePubMedGoogle Scholar
- Sokolova T, Lebedinsky A. CO-oxidizing anaerobic thermophilic prokaryotes. In: Satyanarayana T, Littlechild J, Kawarabayasi Y, editors. Thermophilic microbes in environmental and industrial biotechnology: biotechnology of thermophiles. 2nd ed. Netherlands: Springer; 2013. p. 203–31.View ArticleGoogle Scholar
- Soboh B, Linder D, Hedderich R. Purification and catalytic properties of a CO-oxidizing:H2-evolving enzyme complex from Carboxydothermus hydrogenoformans. Eur J Biochem. 2002;269:5712–21.View ArticlePubMedGoogle Scholar
- Fox JD, Kerby RL, Roberts GP, Ludden PW. Characterization of the CO-induced, CO-tolerant hydrogenase from Rhodospirillum rubrum and the gene encoding the large subunit of the enzyme. J Bacteriol. 1996;178:1515–24.PubMed CentralPubMedGoogle Scholar
- Forzi L, Sawers RG. Maturation of [NiFe]-hydrogenases in Escherichia coli. Biometals. 2007;20:565–78.View ArticlePubMedGoogle Scholar
- Lim JK, Kang SG, Lebedinsky AV, Lee J-H, Lee HS. Identification of a novel class of membrane-bound [NiFe]-hydrogenases in Thermococcus onnurineus NA1 by in silico analysis. Appl Environ Microbiol. 2010;76:6286–9.PubMed CentralView ArticlePubMedGoogle Scholar
- Kim M-S, Bae SS, Kim YJ, Kim TW, Lim JK, Lee SH, et al. CO-dependent H2 production by genetically engineered Thermococcus onnurineus NA1. Appl Environ Microbiol. 2013;79:2048–53.PubMed CentralView ArticlePubMedGoogle Scholar
- Böhm R, Sauter M, Böck A. Nucleotide sequence and expression of an operon in Escherichia coli coding for formate hydrogenlyase components. Mol Microbiol. 1990;4:231–43.View ArticlePubMedGoogle Scholar
- Hedderich R. Energy-converting [NiFe] hydrogenases from archaea and extremophiles: ancestors of complex I. J Bioenerg Biomembr. 2004;36:65–75.View ArticlePubMedGoogle Scholar
- Henstra AM, Stams AJ. Deep conversion of carbon monoxide to hydrogen and formation of acetate by the anaerobic thermophile Carboxydothermus hydrogenoformans. Int J Microbiol. 2011; 2011: 641582. doi: 10.1155/2011/641582
- Techtmann SM, Lebedinsky AV, Colman AS, Sokolova TG, Woyke T, Goodwin L, et al. Evidence for horizontal gene transfer of anaerobic carbon monoxide dehydrogenases. Front Microbiol. 2012;3:132.PubMed CentralView ArticlePubMedGoogle Scholar
- Becq J, Churlaud C, Deschavanne P. A benchmark of parametric methods for horizontal transfers detection. PLoS One. 2010;5:e9989.PubMed CentralView ArticlePubMedGoogle Scholar
- Lawrence JG, Ochman H. Reconciling the many faces of lateral gene transfer. Trends Microbiol. 2002;10:1–4.View ArticlePubMedGoogle Scholar
- Eisen JA. Horizontal gene transfer among microbial genomes: new insights from complete genome analysis. Curr Opin Genet Dev. 2000;10:606–11.View ArticlePubMedGoogle Scholar
- Techtmann SM, Colman AS, Robb FT. “That which does not kill us only makes us stronger”: the role of carbon monoxide in thermophilic microbial consortia. Environ Microbiol. 2009;11:1027–37.View ArticlePubMedGoogle Scholar
- Gonzalez JM, Robb FT. Genetic analysis of Carboxydothermus hydrogenoformans carbon monoxide dehydrogenase genes cooF and cooS. FEMS Microbiol Lett. 2000;191:243–7.View ArticlePubMedGoogle Scholar
- Wagner ID, Wiegel J. Diversity of thermophilic anaerobes. Ann N Y Acad Sci. 2008;1125:1–43.View ArticlePubMedGoogle Scholar
- Calteau A, Gouy M, Perrière G. Horizontal transfer of two operons coding for hydrogenases between bacteria and archaea. J Mol Evol. 2005;60:557–65.View ArticlePubMedGoogle Scholar
- Jacobi A, Rossmann R, Böck A. The hyp operon gene products are required for the maturation of catalytically active hydrogenase isoenzymes in Escherichia coli. Arch Microbiol. 1992;158:444–51.PubMedGoogle Scholar
- Trchounian K. Transcriptional control of hydrogen production during mixed carbon fermentation by hydrogenases 4 (hyf) and 3 (hyc) in Escherichia coli. Gene. 2012;506:156–60.View ArticlePubMedGoogle Scholar
- Silva PJ, van den Ban EC, Wassink H, Haaker H, de Castro B, Robb FT, et al. Enzymes of hydrogen metabolism in Pyrococcus furiosus. Eur J Biochem. 2000;267:6541–51.View ArticlePubMedGoogle Scholar
- Schut GJ, Bridger SL, Adams MWW. Insights into the metabolism of elemental sulfur by the hyperthermophilic archaeon Pyrococcus furiosus: characterization of a coenzyme A- dependent NAD(P)H sulfur oxidoreductase. J Bacteriol. 2007;189:4431–41.PubMed CentralView ArticlePubMedGoogle Scholar
- Schut GJ, Boyd ES, Peters JW, Adams MWW. The modular respiratory complexes involved in hydrogen and sulfur metabolism by heterotrophic hyperthermophilic archaea and their evolutionary implications. FEMS Microbiol Rev. 2013;37:182–203.View ArticlePubMedGoogle Scholar
- Vieille C, Zeikus GJ. Hyperthermophilic enzymes: sources, uses, and molecular mechanisms for thermostability. Microbiol Mol Biol Rev. 2001;65:1–43.PubMed CentralView ArticlePubMedGoogle Scholar
- Zheng Y, Xue Y, Zhang Y, Zhou C, Schwaneberg U, Ma Y. Cloning, expression, and characterization of a thermostable glucoamylase from Thermoanaerobacter tengcongensis MB4. Appl Microbiol Biotechnol. 2010;87:225–33.View ArticlePubMedGoogle Scholar
- Rolfsmeier M, Blum P. Purification and characterization of a maltase from the extremely thermophilic crenarchaeote Sulfolobus solfataricus. J Bacteriol. 1995;177:482–5.PubMed CentralPubMedGoogle Scholar
- Liang C, Xue Y, Fioroni M, Rodríguez-Ropero F, Zhou C, Schwaneberg U, et al. Cloning and characterization of a thermostable and halo-tolerant endoglucanase from Thermoanaerobacter tengcongensis MB4. Appl Microbiol Biotechnol. 2011;89:315–26.View ArticlePubMedGoogle Scholar
- Kuhad RC, Gupta R, Singh A. Microbial cellulases and their industrial applications. Enzyme Res. 2011;2011:280696.PubMed CentralView ArticlePubMedGoogle Scholar
- Dahiya N, Tewari R, Hoondal GS. Biotechnological aspects of chitinolytic enzymes: a review. Appl Microbiol Biotechnol. 2006;71:773–82.View ArticlePubMedGoogle Scholar
- Bhattacharya D, Nagpure A, Gupta RK. Bacterial chitinases: properties and potential. Crit Rev Biotechnol. 2008;27:21–8.View ArticleGoogle Scholar
- Hasan F, Shah AA, Hameed A. Industrial applications of microbial lipases. Enzyme Microb Technol. 2006;39:235–51.View ArticleGoogle Scholar
- Panda T, Gowrishankar BS. Production and applications of esterases. Appl Microbiol Biotechnol. 2005;67:160–9.View ArticlePubMedGoogle Scholar
- Zhang J, Liu J, Zhou J, Ren Y, Dai X, Xiang H. Thermostable esterase from Thermoanaerobacter tengcongensis: high-level expression, purification and characterization. Biotechnol Lett. 2003;25:1463–7.View ArticlePubMedGoogle Scholar
- Grosse S, Bergeron H, Imura A, Boyd J, Wang S, Kubota K, et al. Nature versus nurture in two highly enantioselective esterases from Bacillus cereus and Thermoanaerobacter tengcongensis. Microb Biotechnol. 2010;3:65–73.PubMed CentralView ArticlePubMedGoogle Scholar
- Royter M, Schmidt M, Elend C, Höbenreich H, Schäfer T, Bornscheuer UT, et al. Thermostable lipases from the extreme thermophilic anaerobic bacteria Thermoanaerobacter thermohydrosulfuricus SOL1 and Caldanaerobacter subterraneus subsp. tengcongensis. Extremophiles. 2009;13:769–83.PubMed CentralView ArticlePubMedGoogle Scholar
- Abokitse K, Wu M, Bergeron H, Grosse S, Lau PCK. Thermostable feruloyl esterase for the bioproduction of ferulic acid from triticale bran. Appl Microbiol Biotechnol. 2010;87:195–203.View ArticlePubMedGoogle Scholar
- Levisson M, van der Oost J, Kengen SWM. Carboxylic ester hydrolases from hyperthermophiles. Extremophiles. 2009;13:567–81.PubMed CentralView ArticlePubMedGoogle Scholar
- Sinha R, Khare SK. Thermostable proteases. In: Satyanarayana T, Littlechild J, Kawarabayasi Y, editors. Thermophilic microbes in environmental and industrial biotechnology: biotechnology of thermophiles. 2nd ed. Netherlands: Springer; 2013. p. 859–80.View ArticleGoogle Scholar
- Dutoit R, Brandt N, Legrain C, Bauvois C. Functional characterization of two M42 aminopeptidases erroneously annotated as cellulases. PLoS One. 2012;7:e50639.PubMed CentralView ArticlePubMedGoogle Scholar
- Koma D, Yamanaka H, Moriyoshi K, Ohmoto T, Sakai K. Overexpression and characterization of thermostable serine protease in Escherichia coli encoded by the ORF TTE0824 from Thermoanaerobacter tengcongensis. Extremophiles. 2007;11:769–79.View ArticlePubMedGoogle Scholar
- Lee H-S, Vermaas WFJ, Rittmann BE. Biological hydrogen production: prospects and challenges. Trends Biotechnol. 2010;28:262–71.View ArticlePubMedGoogle Scholar
- Chaudhuri RR, Loman NJ, Snyder LAS, Bailey CM, Stekel DJ, Pallen MJ. xBASE2: a comprehensive resource for comparative bacterial genomics. Nucleic Acids Res. 2008;36(Database issue):D543–6.PubMed CentralPubMedGoogle Scholar
- Mao X, Ma Q, Zhou C, Chen X, Zhang H, Yang J, et al. DOOR 2.0: presenting operons and their functions through dynamic and integrated views. Nucleic Acids Res. 2014;42(Database issue):D654–9.PubMed CentralView ArticlePubMedGoogle Scholar
- Yarza P, Richter M, Peplies J, Euzeby J, Amann R, Schleifer K-H, et al. The All-Species Living Tree project: a 16S rRNA-based phylogenetic tree of all sequenced type strains. Syst Appl Microbiol. 2008;31:241–50.View ArticlePubMedGoogle Scholar
- Pruesse E, Peplies J, Glöckner FO. SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes. Bioinformatics. 2012;28:1823–9.PubMed CentralView ArticlePubMedGoogle Scholar
- Dereeper A, Guignon V, Blanc G, Audic S, Buffet S, Chevenet F, et al. Phylogeny.fr: robust phylogenetic analysis for the non-specialist. Nucleic Acids Res. 2008;36(Web Server issue):W465–9.PubMed CentralView ArticlePubMedGoogle Scholar
- Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–21.View ArticlePubMedGoogle Scholar
- Anisimova M, Gascuel O. Approximate likelihood-ratio test for branches: a fast, accurate, and powerful alternative. Syst Biol. 2006;55:539–52.View ArticlePubMedGoogle Scholar
- Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007;57(Pt 1):81–91.View ArticlePubMedGoogle Scholar
- Meier-Kolthoff JP, Auch AF, Klenk H-P, Göker M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics. 2013;14:60.PubMed CentralView ArticlePubMedGoogle Scholar
- Husemann P, Stoye J. r2cat: synteny plots and comparative assembly. Bioinformatics. 2010;26:570–1.PubMed CentralView ArticlePubMedGoogle Scholar
- Shao Y, He X, Harrison EM, Tai C, Ou H-Y, Rajakumar K, et al. mGenomeSubtractor: a web-based tool for parallel in silico subtractive hybridization analysis of multiple bacterial genomes. Nucleic Acids Res. 2010;38(Web Server issue):W194–200.PubMed CentralView ArticlePubMedGoogle Scholar
- Ménigaud S, Mallet L, Picord G, Churlaud C, Borrel A, Deschavanne P. GOHTAM: a website for “Genomic Origin of Horizontal Transfers, Alignment and Metagenomics”. Bioinformatics. 2012;28:1270–1.PubMed CentralView ArticlePubMedGoogle Scholar
- Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16:276–7.View ArticlePubMedGoogle Scholar
- Carver T, Bleasby A. The design of Jemboss: a graphical user interface to EMBOSS. Bioinformatics. 2003;19:1837–43.View ArticlePubMedGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28:2731–9.PubMed CentralView ArticlePubMedGoogle Scholar
- Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001;18:691–9.View ArticlePubMedGoogle Scholar
- Rawlings ND, Waller M, Barrett AJ, Bateman A. MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2014;42(Database issue):D503–9.PubMed CentralView ArticlePubMedGoogle Scholar
- Park BH, Karpinets TV, Syed MH, Leuze MR, Uberbacher EC. CAZymes Analysis Toolkit (CAT): web service for searching and analyzing carbohydrate-active enzymes in a newly sequenced organism using CAZy database. Glycobiology. 2010;20:1574–84.View ArticlePubMedGoogle Scholar
- Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999;41:95–8.Google Scholar
- Messaoudi A, Belguith H, Ghram I, Ben Hamida J. LIPABASE: a database for “true” lipase family enzymes. Int J Bioinform Res Appl. 2011;7:390–401.View ArticlePubMedGoogle Scholar
- Dobbek H, Svetlitchnyi V, Gremer L, Huber R, Meyer O. Crystal structure of a carbon monoxide dehydrogenase reveals a [Ni-4Fe-5S] cluster. Science. 2001;293:1281–5.View ArticlePubMedGoogle Scholar