- Research article
Comparative genomic hybridizations reveal absence of large Streptomyces coelicolor genomic islands in Streptomyces lividans
BMC Genomics volume 8, Article number: 229 (2007)
The genomes of Streptomyces coelicolor and Streptomyces lividans bear a considerable degree of synteny. While S. coelicolor is the model streptomycete for studying antibiotic synthesis and differentiation, S. lividans is almost exclusively considered as the preferred host, among actinomycetes, for cloning and expression of exogenous DNA. We used whole genome microarrays as a comparative genomics tool for identifying the subtle differences between these two chromosomes.
We identified five large S. coelicolor genomic islands (larger than 25 kb) and 18 smaller islets absent in S. lividans chromosome. Many of these regions show anomalous GC bias and codon usage patterns. Six of them are in close vicinity of tRNA genes while nine are flanked with near perfect repeat sequences indicating that these are probable recent evolutionary acquisitions into S. coelicolor. Embedded within these segments are at least four DNA methylases and two probable methyl-sensing restriction endonucleases. Comparison with S. coelicolor transcriptome and proteome data revealed that some of the missing genes are active during the course of growth and differentiation in S. coelicolor. In particular, a pair of methylmalonyl CoA mutase (mcm) genes involved in polyketide precursor biosynthesis, an acyl-CoA dehydrogenase implicated in timing of actinorhodin synthesis and bldB, a developmentally significant regulator whose mutation causes complete abrogation of antibiotic synthesis belong to this category.
Our findings provide tangible hints for elucidating the genetic basis of important phenotypic differences between these two streptomycetes. Importantly, absence of certain genes in S. lividans identified here could potentially explain the relative ease of DNA transformations and the conditional lack of actinorhodin synthesis in S. lividans.
Streptomyces spp. include some of the world's most prolific producers of naturally occurring bioactive molecules, many of which are in current therapeutic use. These soil-dwelling filamentous bacteria exhibit a remarkably complex life style. Emerging from uni-genomic spores, they colonize the nutrient layer and eventually surface as multi-genomic aerial hyphae, often synthesizing secondary metabolites in the process. The extraordinary diversity of secondary metabolite gene clusters found in these microbes is likely the result of their existence in hostile ecological niches and consequent genomic evolutionary processes including large scale rearrangements, insertions and deletions to cope with these exigencies. The likelihood of these events is further enhanced by the unusual propensity of Streptomyces to undergo spontaneous recombination events, especially at chromosome extremities [2, 3]. Multiple occurrences of such events over the course of millions of years lead to eventual speciation.
Much of the current knowledge of streptomycetes is based largely on the foundations of genetic and genomic studies conducted in S. coelicolor A3(2) . S. coelicolor is known for its ability to synthesize pigmented metabolites that serve as excellent phenotypic markers in genetic studies. As a model organism, its genome became the first among streptomycetes to be completely sequenced . Nevertheless, S. lividans, a close relative of S. coelicolor is almost exclusively considered as the preferred host, among actinomycetes, for heterologous protein expression [6–8]. The primary reasons attributed for this include a significantly relaxed restriction-modification system which enhances exogenous DNA uptake  and considerably attenuated endogenous protease activity leading to improved product recovery from S. lividans . Notable examples of biologically active heterologous protein productions in S. lividans include proteins of eukaryotic origins like human T-cell receptor CD4 , tumor necrosis factor-α , human interleukin  and salmon calcitonin  as well as bacterial proteins like mycobacterial antigens with appropriate glycosylation patterns . Leucotropin™ – a recombinant therapeutic agent used in treatment of Hodgkin's disease is commercially produced through S. lividans fermentation (Cangene, Winnipeg, Canada).
The 16S rRNA sequences of S. coelicolor and S. lividans share > 99.5% identity. Historically, the two species have been distinguished by the inability of the latter to produce the deep blue antibiotic, actinorhodin under many conditions. In addition, S. lividans (1) fails to methylate its own DNA or restrict exogenous methylated DNA (2) possesses diminished extracellular protease activity (3) lacks a mechanism to degrade agar and (4) forms unstable φ C31 lysogens .
Despite considerable research and economic interests in S. lividans and the availability of complete genome sequence of S. coelicolor, there have been surprisingly few systematic studies comparing the genome compositions of the two species. Early work by Leblond et al. using pulse-field gel electrophoresis (PFGE) and Southern hybridizations with restriction-fragment linking cosmid probes revealed an essentially similar genomic organization in the two species with identical ordering of the cosmid sequences . More recently, Zhou et al. deduced the absence of a ~90 kb S. lividans genomic island in S. coelicolor through analysis of a DNA modification deficient S. lividans derivative ZX7 .
In the post-genomic era, DNA microarrays have emerged as the tool of choice for genome scale comparisons of closely related organisms [17, 18]. Comparative genomic hybridizations (CGH) using microarrays have already demonstrated their utility in characterization of pathogenicity islands and drug resistance factors in Yersinia pestis , Vibrio cholerae , Staphylococcus aureus  and Mycobacterium tuberculosis  among many others. Elsewhere, they have been used to assess genome plasticity and microbial evolution . Among Streptomyces, microarrays have been previously used to detect gross genomic duplications and presence of long terminal repeats at chromosome ends in different strains of S. coelicolor . In this study, we performed a microarray-based whole genome comparison to identify S. coelicolor M145 genes absent or divergent in S. lividans TK21. Expression levels of these genes are analyzed and relevant observations are discussed in the context of known phenotypic differences that arose during the recent microevolution of these two species.
Genome-scale comparison of S. coelicolor and S. lividans
We previously reported the construction of a whole-genome PCR-product based S. coelicolor microarray with probes for more than 95% of the predicted ORFs. Comparative genomic hybridizations between S. coelicolor M145 and S. lividans TK21 using this microarray revealed extensive homology between the two chromosomes. This is apparent from Figure 1a, which shows that the majority of ORFs have relatively similar signal intensity levels in the S. coelicolor and S. lividans gDNA channels (i.e. log2 of hybridization signal ratio close to zero). The observation concurs well with earlier reports of genome-scale conservation between the two species based on restriction fragment based linkage maps. Genes with log2 hybridization ratios less than the overall mean (of all genes) minus one standard deviation were designated as potentially absent or divergent in S. lividans. Of 7579 S. coelicolor genes probed in our array, we found that only ~8% fell in this category. Most often, these genes are clustered in localized regions of the chromosome. The clusters themselves, however, are scattered extensively across the entire length of the chromosome, albeit with a definite bias toward the right half. The absence of these clusters in S. lividans was confirmed by PCR with primers flanking the missing region (refer to supplementary material for details). Based on the size of these clusters and gene order in S. coelicolor M145, we classified the resident genes as belonging to one of 5 large genomic islands (GI) (≥25 kb) or 18 smaller islets (Gi) (< 25 kb). These are in addition to the two chromosome ends and about 70 other genes scattered across the chromosome which, despite having low S. lividans gDNA signal, did not satisfy the three-adjacent-gene criterion used here to define a genomic island (see Methods section).
The chromosome ends were not termed as genomic islands because large-scale DNA rearrangements are known to occur in the terminal regions leading to difficulties in ascertaining the exact boundaries of islands. Also PCR verification using flanking primers was impossible in such cases.
To assess whether these genomic deletions/insertions are consequences of horizontal gene transfer from distant species, we examined each island for characteristic features that define laterally acquired elements. Hallmarks of laterally acquired elements include some or all of the following features: (1) unusual GC bias, (2) anomalous codon usage, (3) presence at the 3' end of tRNA genes, (4) tendency to be flanked by direct repeats and (5) inclusion of mobile genetic elements [26, 27]. We therefore sought to evaluate the GC bias and codon usage pattern of genes across the entire genome of S. coelicolor. We adopted a windowless cumulative GC plot based on the z' curve (see Methods section) to evaluate genome-wide GC variation. This approach is more robust, gives higher resolution and has been shown to be useful when the traditional sliding-window based GC plots fail. A sharp rise in the z' value at a localized region in the chromosome implies an unusually AT-rich segment. For assessing codon usage patterns, we calculated the Codon Adaptation Index (CAI), a measure of codon optimality, for every ORF with respect to the most frequent codons occurring in S. coelicolor. CAI values range from 0 to 1.0, with higher values indicating more optimal codon usage. Both the cumulative GC plot and CAI plot revealed excellent correlation with regions identified using microarrays (Figure 1b and 1c). In particular, 18 out of 23 islands/islets identified from microarray analysis have an average CAI ≤ 0.60, a significant departure from an average of 0.73 for the whole genome. There were, however, some regions with low CAI that were not identified as absent in S. lividans. Although it is quite possible that these segments were laterally acquired, our results suggest that those gene transfer events occurred before the evolutionary divergence of these two species.
In addition, six of the 23 regions are located at 3' ends of tRNA genes while nine are flanked by near-perfect direct repeat sequences. Some of these repeat sequences had significant homology to the att sites of certain plasmids or phage related elements. For example, the repeat sequences flanking Gi-4, Gi-6, Gi-11 and GI-5 resembled the att sites of actinophage VWB, plasmid pSG1 (S. griseus), pSLP2 (S. lividans) and pMB400 (Bacillus megaterium) respectively. The list of genomic islands and islets identified and their associated features are shown in Table 1. Taken together, these results suggest that many of these segments are recent evolutionary incorporations into the S. coelicolor chromosome.
Analysis of GIs and resident genes
A total of 625 genes reside within the S. coelicolor genomic islands/islets identified above. They were categorized based on their functional assignments (Table 2). Unsurprisingly, more than two-thirds of laterally acquired elements in S. coelicolor are absent in S. lividans. Mobile genetic elements like integrative plasmids, bacteriophages and transposon related genes constitute this class. Other genes missing in S. lividans are largely hypothetical and periplasmic/exported proteins, although a significant fraction belongs to a diverse set of functional categories shown in Table 2.
GI-1 (22 ORFs; ~26 kb) consists of a variety of genes including those coding for acetate uptake, amino acid biosynthesis and possible iron uptake systems. Superoxide dismutase – SCO0999 (sodF2), an Fe-dependent antioxidant was previously hypothesized to have been acquired by horizontal gene transfer . SCO0981-83 constitutes an acetate uptake system with a DNA-binding regulator (aceR), isocitrate lyase (aceA) and malate synthase (aceB2) genes. Isocitrate lyase and malate synthase are required for a functional glyoxylate pathway which promotes acetate utilization; yet, expression of these genes in S. cinnamonensis did not restore growth of acetate-uptake-deficient mutants on acetate as sole carbon source giving rise to the possibility that some, as yet unknown, factors are missing . SCO0997 and SCO0998 are annotated as iron-uptake system proteins – ftrD and ftrE respectively. However, the presence of an upstream gene (SCO0996) homologous to a lipoprotein found in daptomycin biosynthesis cluster has raised questions about their involvement in iron uptake . This is further corroborated by the presence of chromosomally linked genes homologous to SCO0991/92 (a hypothetical protein and a putative cysteine synthase) near the daptomycin cluster of S. roseosporus . If true, the presence of these antibiotic biosynthesis linked genes in a genomic island of S. coelicolor would be intriguing in view of evidence for horizontal transfer of antibiotic clusters among streptomycetes isolated from soil .
GI-2 (70 ORFs; ~75 kb) includes ORFs encoding a number of transposases and insertion elements. Interspersed within this region are a putative extracytoplasmic function sigma factor (SCO3450), a methyltransferase (SCO3452), a set of putative ABC-type transporters related to spermidine/putrescine transport family (Pot) (SCO3453-56) and an assortment of consecutive genes with various putative metabolic activities (SCO3473-3506). It is noteworthy here that the only extracellular agarase (dagA, SCO3471) of S. coelicolor maps to this genomic segment. This explains the lack of agarase activity in cultures of S. lividans noted elsewhere .
GI-3 (53 ORFs; ~57 kb) contains elements of three putative two-component systems (SCO6353/54, SCO6362/63/64 and an orphan kinase SCO6369). The region also encompasses ORFs with bacitracin transport permease domains (SCO6356, 6360, 6378). The last 16 ORFs in this island consist mainly of mobile genetic elements like transposases and recombinases.
GI-4 (18 ORFs; ~30 kb) contains a phage growth limitation (Pgl) system. This is a defense mechanism against φ C31 bacteriophage infections in which an infecting phage undergoes a single burst of attack and the resulting progeny are severely attenuated in subsequent infectious cycles. Two loci, pglWX and pglYZ, both of which map to this genomic locus, are necessary for the Pgl+ phenotype. Absence of these genes in S. lividans leads to unstable φ C31 lysogens, while complementing S. lividans with these elements gives rise to Pgl+ colonies.
GI-5 (148 ORFs; ~153 kb) is the largest genomic island identified in this study. The size of this cluster implies that it hosts a multitude of genes with varied functions. Importantly, this cluster hosts genes that are involved in various modes of microbial defense. SCO6826/27 encodes a pair of type I modular polyketide synthase genes. In close proximity are two genes (SCO6832/33) coding for subunits of methylmalonyl-CoA mutase, an enzyme that catalyzes conversion of succinyl-CoA to methylmalonyl-CoA, which serves as a building block for the synthesis of certain polyketide antibiotics. Two other genes, SCO6929/30, are similar to those involved in lantibiotic biosynthesis. In addition, SCO6835/36/37 belong to the arsenic resistance family. SCO6809/10 are similar to several multi-drug efflux transporters. Also, a probable acyl-CoA dehydrogenase (SCO6938) belonging to this island has been implicated in control of actinorhodin production and timing of sporulation. This observation is particularly interesting considering the extremely low levels of actinorhodin production and absence (or silencing) of the ram-independent developmental pathway in S. lividans. Furthermore, two putative DNA methylases (SCO6844 and SCO6885) also map to this locus.
The smaller genomic islets (Gi-1 to Gi-18) also contain several potentially interesting genes. Foremost among them is Gi-15. This islet includes the bldB locus (SCO5723), which has been reported to regulate pleiotropically both antibiotic synthesis and morphological differentiation. Two other genes found in this islet (SCO5722 and SCO5731) encode putative secreted serine proteases. This islet is also unusual in being flanked on one side by a highly expressed rpsO gene rather than a tRNA. Among other smaller islets, Gi-12 contains an adenine-specific DNA methylase (SCO5331, pglS). This gene has been confirmed to interfere with the Pgl system of GI-4 and might extend resistance to phages other than φ C31. Gi-10 and Gi-11 each encode a methyl-specific restriction endonuclease: SCO4213 (mrr-like) and SCO4631 (mcrA-like) respectively. The rest of the genes in the genomic islands/islets largely encode hypothetical proteins or have functions that are yet to be elucidated.
Comparison with transcriptome and proteome data
Assuming that these genomic islands have evolved under selective pressure, we asked which, if any, of these genes are actively expressed under normal (laboratory) culture conditions. We chose to analyze mRNA and protein expression data derived from two distinct liquid media: one, an R5-based rich medium with yeast extract and the other, a minimal medium supplemented with casaminoacids (SMM). Due to significant differences in their composition, we expected that they would elicit markedly distinctive gene expression patterns. Data for the R5-based cultures were generated using our microarray and LC-MS experiments (Jayapal et al., in preparation) while those for SMM cultures were downloaded from a public repository http://streptobase.org/ or from previously published 2-D gel proteomic analyses [43, 44]. Since the microarray hybridizations were performed with S. coelicolor gDNA reference, the resulting log2 expression ratios (= log2 [mRNA Scoe /gDNA Scoe ]) are an indication of the expression level for each gene. The highest log2 expression ratio observed for a given gene over the range of growth phases and conditions analyzed can be taken as its potential for transcription under laboratory conditions. Figure 2 shows this transcriptome expression data plotted against comparative genomics hybridization data. Genes in the top-left region of the plot (marked in Figure 2) are, in general, designated as absent in S. lividans, yet highly expressed in S. coelicolor at least in some conditions. Note that this region also includes a few genes with quite low log2 [gDNA Sliv /gDNA Scoe ] values that were not designated as absent/divergent in S. lividans simply because they did not co-localize as three- or more-gene clusters and hence did not satisfy our absence criteria (see Methods section). Reciprocally, some genes with high log2 [gDNA Sliv /gDNA Scoe ] were assigned to certain genomic islands because of overwhelming evidence of absence from neighboring genes.
Many such genes with considerable homology are transposon/plasmid related elements and are probably present elsewhere in the S. lividans chromosome. 42 genes from various genomic islands exceeded a threshold of max(log2 [mRNA Scoe /gDNA Scoe ]) value greater than 2.0 indicating appreciably high mRNA expression. 20 of these genes also had an average(log2 [mRNA Scoe /gDNA Scoe ]) > 0 indicating probable constitutive basal expression (data not shown in Figure). Currently, more than half of those 42 genes (23/42) do not have a reasonably descriptive functional annotation.
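The two expression thresholds used above (maximum log2 ratio greater than 2.0 for high expression, average greater than 0 for probable constitutive basal expression) can be illustrated with a minimal Python sketch. The function name, the dictionary layout and the toy values are assumptions made for illustration only; they are not the analysis code or data from this study.

```python
def summarize_expression(log2_ratios, max_threshold=2.0, mean_threshold=0.0):
    """Summarize log2(mRNA/gDNA) ratios per gene across conditions.

    log2_ratios maps a gene name to a list of log2 expression ratios,
    one per growth phase or culture condition (hypothetical layout).
    """
    summary = {}
    for gene, values in log2_ratios.items():
        peak = max(values)                    # potential for transcription
        average = sum(values) / len(values)   # overall expression level
        high = peak > max_threshold
        summary[gene] = {
            "max": peak,
            "mean": average,
            "highly_expressed": high,
            # constitutive here means: high peak AND positive average
            "constitutive": high and average > mean_threshold,
        }
    return summary


# Invented toy values, not measurements from the study
example = {
    "geneA": [2.5, 1.0, 0.5],    # high peak, positive mean
    "geneB": [2.2, -3.0, -2.0],  # high peak, negative mean
    "geneC": [0.5, -0.1, 0.2],   # never exceeds the peak threshold
}
result = summarize_expression(example)
```

In this toy input, geneA would count toward both the highly expressed and the constitutive groups, geneB only toward the highly expressed group, and geneC toward neither.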
The most striking observation from the transcriptome analysis was the high expression of ~30 contiguous genes (SCO3473-3506) within GI-2 exclusively in SMM cultures. These genes encode a variety of probable carbohydrate metabolism enzymes including putative aldolase, isomerase, dehydrogenase, beta-galactosidase, sugar kinase and sugar permease; the precise context of these functions remains to be elucidated. Nevertheless, such coordinated but conditional transcription of genes within genomic islands leads one to speculate that, in addition to collective horizontal transfer as an island, they are also regulated in an independent and rather concerted fashion in response to varying environmental stimuli. In addition, the extracellular agarase precursor (SCO3471) in GI-2 is also expressed highly only in SMM cultures.
Several genes in GI-1 are also highly expressed. The superoxide dismutase (SCO0999, sodF2) has an average(log2 [mRNA Scoe /gDNA Scoe ]) of over 2.0 in cultures of R5-based medium and was also identified in LC-MS proteomic studies with multiple peptide hits (Jayapal et al., in preparation). However, this gene is not as highly expressed in SMM cultures. On the contrary, adjacent genes SCO0985/0993/0995 were identified in the 2-D gel experiments conducted on minimal medium derived samples . Genes in bldB locus (Gi-15) are also expressed at moderate to high levels from microarray data (in both rich and minimal medium). This is corroborated by the identification of some of these proteins (SCO5723/24 and SCO5729) in proteomic experiments. Interestingly, SCO5724/25 possibly codes for a novel secretion system . Such transporters are widely associated with pathogenicity islands in many bacteria . Furthermore, phage resistance proteins of the Pgl system (SCO6627/28/37/38; GI-4) were identified in several proteomic experiments. Notably, one of them – SCO6627 (PglX) was identified in multiple forms in the membrane fraction of mycelium undergoing programmed cell death . Likewise, polyketide precursor synthesis enzymes (SCO6832/33, methylmalonyl-CoA mutase subunits in GI-5) were identified in LC-MS experiments of R5 medium samples (Jayapal et al., in preparation).
S. coelicolor contains large laterally acquired genomic segments
Our results indicate that over 90% of S. coelicolor genes are conserved remarkably well in S. lividans. This observation is not surprising considering that even bacteria from other genera, namely Mycobacterium and Thermobifida, share significant homology with genes in Streptomyces. Punctuating this pattern of widespread conservation are 23 genomic islands comprising > 600 genes that are conspicuously absent in S. lividans. Many of these islands exhibit hallmarks of horizontal gene transfer. Although it is generally believed that chromosome extremities are more prone to massive gene flux compared to the central "core", we did not observe a higher occurrence of genomic islands near the chromosome ends (10 genomic islands in the "core" compared to eight in the chromosome "arms"). This could partly be explained by the fact that certain islands contain phage-related elements or integrative plasmids that target specific sequences rather than rely on genome instability for chromosome integration. There is at least one report of DNA amplifications in the "core" region of S. coelicolor, indicating that chromosomal instabilities are not strictly limited to the extremities. Many of these regions (e.g. GI-1, GI-4, GI-5, Gi-15) harbor elements that could potentially benefit the host, suggesting that they truly are "fitness islands". Such elements include antibiotic biosynthesis genes, drug resistance determinants, phage defense systems, and numerous metabolic functions.
It is notable that 14 of the 23 islands detected in this study were reported as putative laterally acquired regions through in silico analysis during sequencing of the S. coelicolor genome . Half of those (7/14) were reported precisely as described here; for the rest, the exact boundaries of the islands do not match our results. This does not, however, imply that our results contradict previous observations. One likely possibility is that genes in any extended genomic island predicted by Bentley et al., although recently acquired, are well-conserved in S. lividans (probably acquired before speciation). Another possibility is that the apparent discrepancies are merely consequences of our strict criteria for absence that eliminated certain marginal cases. On the other hand, all boundary extensions (and new islands) reported in this work provide conclusive experimental evidence of recent gene transfer in those regions.
Hints for genetic basis of suppressed antibiotic synthesis in S. lividans
Over the years, identification of genetic determinants that suppress actinorhodin production in S. lividans has aroused considerable interest among researchers. Although numerous mutational and overexpression studies have led to activation of antibiotic synthesis in S. lividans [48–52], to our knowledge, none have succeeded in identifying the evolutionary transformation that led to modulation of actinorhodin synthesis. Our results provide two important clues in this regard. One is the absence of a probable acyl-CoA dehydrogenase (SCO6938) in S. lividans, whose mutation in S. coelicolor caused a marked delay in onset of actinorhodin synthesis as well as aerial mycelium formation. Acyl-CoA dehydrogenases are enzymes associated with breakdown of long-chain fatty acids and might potentially generate precursors for polyketide biosynthesis. Of particular note is a mutant of an acyl-CoA synthetase (another enzyme involved in fatty acid catabolism) in S. coelicolor that was severely impaired in actinorhodin synthesis, particularly in glucose-containing medium, an observation curiously reminiscent of the conditional suppression of actinorhodin biosynthesis in S. lividans in glucose-containing medium. The other clue comes from the apparent absence of the bldB (SCO5723) locus in S. lividans. At least two other paralogs of bldB (SCO3424 and SCO4542) are also known to exist in the S. coelicolor chromosome, both of which are conserved in S. lividans. However, these paralogs do not substitute for bldB: S. coelicolor mutants of bldB fail to erect aerial hyphae and lack actinorhodin production; yet, S. lividans appears to be quite capable of forming aerial mycelia and spores. Despite the apparent absence of the SCO5723 locus in S. lividans, we also noted that an S. lividans gene annotated as bldB (GenBank accession AF071232) has been cloned and expressed in E. coli. A closer look revealed that this S. lividans bldB shares only ~82% homology with SCO5723 and hence gave a weak hybridization signal, an observation further corroborated by the failure of stringent Southern hybridizations to reveal the presence of bldB in S. lividans. Perhaps this level of similarity is sufficient for most of the functional activity of BldB. Notably, all the conserved residues reported by Eccleston et al. are present in both versions of the protein. Nonetheless, our data suggest that this ortholog in S. lividans is present at an alternate locus, since an entire 17-gene S. coelicolor segment (Gi-15) including SCO5723 is absent in S. lividans. We note that the other bldB paralogs of S. coelicolor have a much lower degree of homology with S. lividans bldB.
Absence of certain endonucleases and proteases in S. lividans
Another question of particular interest in this comparative study is what genetic factors make S. lividans a preferred host for heterologous protein expression? As mentioned earlier, the two contributing factors in this regard are the absence of a strong restriction-modification system and diminished extracellular protease activity. The phage growth limitation (Pgl) system found in GI-4 contains elements required for DNA restriction. Although this explains the unstable φ C31 lysogens of S. lividans, introduction of this system into S. lividans did not impair its ability to uptake methylated DNA . Two other potential candidates – SCO4213 and SCO4631 (both annotated as hypothetical proteins) were identified in a scan for methyl-sensing restriction endonucleases. SCO4213 contains a signature Mrr_cat type II restriction enzyme domain while SCO4631 is similar to an E. coli methylcytosine-specific restriction enzyme with an HNH endonuclease domain. In addition, SCO2863 – a putative helicase, also contains an hsdR-like type I restriction endonuclease domain. Moreover, the absence of two DNA-methylases, SCO6844 and SCO6885 might explain why DNA obtained from S. lividans is readily transformable into other streptomycetes. The attenuated extracellular protease activity in S. lividans is, perhaps, explained by the absence of two secreted serine proteases – SCO5722 and SCO5731.
TTA codon modulated gene expression is a recent evolutionary incorporation
An interesting observation from our analysis is the unusually high proportion of genes containing the rare TTA codon among those designated as absent in S. lividans. About 30% (43/145) of TTA-containing genes are absent in S. lividans, statistically a much higher frequency than the ~8% of all genes designated as absent. Considering that AT-rich segments frequently occur in bacterial phages and plasmids and that TTA codons are quite rare in a GC-rich organism like S. coelicolor, it is likely that many of these genes are of foreign origin. In fact, a recent report by Chater and Chandra postulated that over 80% of TTA-containing S. coelicolor genes were acquired through horizontal gene transfer. They speculated that bldA (the sole tRNA that can efficiently translate TTA) might itself have been laterally acquired, and that TTA-modulated protein expression is a very recent evolutionary adaptation. Our experimental results are consistent with their hypothesis.
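As a rough check on the enrichment quoted above, the counts can be compared with the genome-wide background using a one-sided binomial tail probability. The text does not state which statistical test was applied, so the choice of a binomial model (rather than, for example, a hypergeometric test on exact gene counts) and the use of 0.08 as the background rate are assumptions of this sketch.

```python
from math import comb


def binomial_upper_tail(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p), summed term by term."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


tta_absent = 43      # TTA-containing genes designated absent in S. lividans
tta_total = 145      # all TTA-containing genes considered
background = 0.08    # approximate fraction of all genes designated absent

observed_fraction = tta_absent / tta_total      # about 0.30
expected_count = tta_total * background         # about 11.6 genes
p_value = binomial_upper_tail(tta_absent, tta_total, background)

print(f"observed fraction: {observed_fraction:.3f}")
print(f"expected count under background rate: {expected_count:.1f}")
print(f"one-sided binomial tail probability: {p_value:.2e}")
```

Under these assumptions roughly 12 absent TTA-containing genes would be expected by chance, against 43 observed, so the tail probability is very small. A hypergeometric test on the exact genome-wide counts would be the more rigorous choice if those counts are available.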
Despite the presence of over 600 genes in these S. coelicolor genomic islands, we found that over 93% were not highly expressed under the typical laboratory conditions analyzed in this study. This is probably because these elements require specific environmental stimuli for activation; experiments covering a wider range of physiological conditions will be needed to test this. Another rather likely possibility is that our genomic tools were simply not sensitive enough to detect certain physiologically relevant gene expression levels. We also note that genomic islands present in S. lividans and absent in S. coelicolor could not be identified in this study. Notwithstanding these limitations, our work sheds light on possible genetic determinants contributing to phenotypic differences between S. coelicolor and S. lividans. More importantly, it lays a strong foundation for identification of specific gene targets in the genetically well-characterized S. coelicolor to engineer it for industrial protein or secondary metabolite production processes.
Strains and culture conditions
Spores for S. coelicolor M145 and S. lividans TK21 were generated on Mannitol-Soy flour or R5 agar . Cultures for genomic DNA preparation were performed in YEME medium with 0.5% glycine supplement at 30°C until early stationary phase.
Genomic DNA extraction and labeling
Genomic DNA (gDNA) extraction was carried out using the Kirby mix procedure as described elsewhere. About 500 μl of 20 μg/μl gDNA was sonicated briefly for 30–40 sec to shear it to ~500 bp average size (confirmed by gel electrophoresis). The DNA was then labeled with Label IT® Cy3 or Cy5 Labeling Kit (Mirus Bio Corp., Madison, WI) according to the supplier's instructions.
Microarray hybridizations and image analysis
Samples containing ~500 ng each of Cy3 and Cy5 labeled gDNA from S. coelicolor M145 and S. lividans TK21, respectively were hybridized to a whole-genome S. coelicolor microarray as described previously . Hybridizations were carried out in triplicate for ~16 hr at 50°C; arrays were washed and scanned using ScanArray5000 (Perkin Elmer, Wellesley, MA). Images were analyzed using GenePix (Axon Instruments, Union City, CA) to obtain raw intensity data for each spot. The median fluorescence intensity from each spot was used for all subsequent analysis.
Array normalization and data analysis
Raw intensity values for each spot were first normalized by scaling the signal intensities of one gDNA channel by a normalization factor so that the total intensity of the two channels was equal. Log2 hybridization signal ratios were then calculated from the normalized intensities as log2 [gDNA Sliv /gDNA Scoe ], and the median of the triplicate experiments was taken for each gene. The presence of at least three consecutive (in chromosome order) log2 [gDNA Sliv /gDNA Scoe ] values below the global mean minus one standard deviation was taken as evidence for the absence (or divergence) of a contiguous genomic segment in S. lividans. Certain marginal cases were thereafter manually reassigned as present or absent in S. lividans based on the overwhelming trend of the neighboring genes. For visualization purposes, median intensity ratios from successive 19-gene sliding windows were plotted as a function of gene position in chromosome order.
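The normalization and segment-calling steps described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, the choice to scale the S. lividans channel (rather than the S. coelicolor channel) and the use of the population standard deviation are assumptions, and the manual reassignment of marginal cases is not modeled.

```python
import math
from statistics import mean, median, pstdev


def log2_ratios(sliv, scoe):
    """Scale the S. lividans channel so both channels have the same total
    intensity (assumed choice of channel), then return log2(Sliv / Scoe)."""
    factor = sum(scoe) / sum(sliv)
    return [math.log2(s * factor / c) for s, c in zip(sliv, scoe)]


def median_of_replicates(replicates):
    """Per-gene median across replicate arrays (lists of equal length)."""
    return [median(values) for values in zip(*replicates)]


def absent_runs(ratios, min_run=3):
    """Return (start, end) index pairs, end exclusive, for runs of at least
    min_run consecutive genes whose ratio is below mean - 1 SD."""
    cutoff = mean(ratios) - pstdev(ratios)  # population SD is an assumption
    runs, start = [], None
    # A trailing +inf sentinel closes a run that reaches the last gene.
    for i, r in enumerate(list(ratios) + [float("inf")]):
        if r < cutoff:
            if start is None:
                start = i
        elif start is not None:
            if i - start >= min_run:
                runs.append((start, i))
            start = None
    return runs


def sliding_median(values, window=19):
    """Median of every full window of consecutive genes (for plotting)."""
    return [median(values[i:i + window])
            for i in range(len(values) - window + 1)]
```

With genes listed in chromosome order, `absent_runs(median_of_replicates([...]))` would return candidate absent segments as index ranges, which could then be inspected against neighboring genes as described in the text.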
For transcriptome analysis, cDNA-gDNA based microarray data were normalized using quantile normalization.
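For readers unfamiliar with quantile normalization, the following is a minimal sketch of the generic technique in Python. How it was applied to the cDNA-gDNA data here (which channels or arrays were normalized together) is not stated in this section, so the input layout, the function name and the tie-breaking rule are assumptions.

```python
def quantile_normalize(columns):
    """Quantile-normalize equal-length lists (one list per array or channel).

    Each value is replaced by the mean, across columns, of the values that
    share its rank. Ties are broken by original order (a simplification).
    """
    n = len(columns[0])
    sorted_cols = [sorted(col) for col in columns]
    # Reference distribution: mean of the k-th smallest value across columns.
    reference = [sum(col[k] for col in sorted_cols) / len(columns)
                 for k in range(n)]
    normalized = []
    for col in columns:
        order = sorted(range(n), key=lambda i: col[i])
        out = [0.0] * n
        for rank, idx in enumerate(order):
            out[idx] = reference[rank]
        normalized.append(out)
    return normalized
```

After normalization every column has the same set of values, so differences between arrays reflect only the rank of each gene within its array.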
All microarray data discussed here are available at Gene Expression Omnibus (GEO): Accession – GSE7167 (comparative genomics hybridizations) and GSE7172 (transcriptome data).
Regional GC variations in the S. coelicolor genome were calculated using the z' curve method proposed by Zhang and Zhang. Briefly, $z_n$ was calculated at genomic location $n$ as:
$$z_n = (A_n + T_n) - (G_n + C_n), \quad n = 0, 1, 2, \ldots, N$$
where $A_n$, $T_n$, $G_n$ and $C_n$ are the cumulative numbers of bases A, T, G and C occurring in the sequence from the 1st to the $n$th base. To amplify the deviations of $z_n$ from its average trend, a linear least-squares fit $z_n = kn$ is performed and $z'_n$ is calculated as
$$z'_n = z_n - kn$$
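The z' curve defined above can be computed in a few lines. The sketch below is written in Python rather than the Matlab used in the study, the function name is hypothetical, and leaving the curve unchanged at ambiguous bases is an assumption. The slope k is the least-squares solution for a line through the origin, k = Σ n·z_n / Σ n².

```python
def z_prime_curve(sequence):
    """Return z'_n for n = 0..N, where z_n = (A_n + T_n) - (G_n + C_n)
    and z'_n = z_n - k*n, with k fitted by least squares through the origin."""
    z = [0]
    for base in sequence.upper():
        if base in ("A", "T"):
            z.append(z[-1] + 1)
        elif base in ("G", "C"):
            z.append(z[-1] - 1)
        else:
            z.append(z[-1])  # ambiguous base: no change (assumption)
    # Least-squares slope for the model z_n = k * n.
    num = sum(n * zn for n, zn in enumerate(z))
    den = sum(n * n for n in range(len(z)))
    k = num / den if den else 0.0
    return [zn - k * n for n, zn in enumerate(z)]
```

In a GC-rich genome the fitted slope is negative, so a locally AT-rich segment appears as a rise in z' relative to the flat genomic trend.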
The Codon Adaptation Index (CAI), a measure of the utilization of "optimal codons", was calculated as described previously. First, a relative adaptiveness factor for every codon $i$ coding for amino acid $j$ is estimated as:
$$w_{i,j} = \frac{f_{i,j}(G)}{f_{\max,j}(G)}$$
where $f_{i,j}(G)$ is the frequency of occurrence of codon $i$ among the set for amino acid $j$ across the entire genome $G$ and $f_{\max,j}(G)$ is the corresponding value for the most frequently used codon for amino acid $j$. The CAI for a gene $g$ is then calculated as
$$\mathrm{CAI}(g) = \left( \prod_{k=1}^{N} w_k \right)^{1/N}$$
where $N$ is the number of amino acids in gene $g$ and $w_k$ is the relative adaptiveness of the codon at position $k$. The algorithms for both the cumulative GC plot and the CAI plot were implemented using Matlab 7.0 with the Bioinformatics Toolbox.
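As an illustration of the CAI calculation, the sketch below computes genome-wide relative adaptiveness values and the geometric mean for a single gene. It is a Python sketch under stated assumptions, not the Matlab code used in the study: the function names are hypothetical, stop codons are excluded from the geometric mean, and codons never observed in the reference set are skipped.

```python
import math
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def codons(seq):
    """Split a coding sequence into complete codons."""
    seq = seq.upper()
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def relative_adaptiveness(genes):
    """w[codon] = count(codon) / count(most used synonymous codon),
    pooled over all coding sequences in the genome."""
    counts = Counter(c for g in genes for c in codons(g) if c in CODON_TABLE)
    best = Counter()
    for codon, n in counts.items():
        aa = CODON_TABLE[codon]
        best[aa] = max(best[aa], n)
    return {c: n / best[CODON_TABLE[c]] for c, n in counts.items()}


def cai(gene, w):
    """Geometric mean of w over the sense codons of one gene."""
    logs = [math.log(w[c]) for c in codons(gene)
            if c in w and CODON_TABLE[c] != "*"]
    return math.exp(sum(logs) / len(logs)) if logs else float("nan")
```

Because the ratio is taken within each amino acid's codon set, raw counts and frequencies give the same w values. Genes with a CAI well below that of their neighbors are candidates for atypical codon usage.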
PCR verification of islands absent in S. lividans
The absence of genomic islands in S. lividans was verified by PCR using primers flanking each island. Primers were chosen to fall within or outside the DNA probe segments used on the microarray so as to improve the chances of amplification. The PCRs were conducted using the GC-rich™ PCR system or the Expand™ long template PCR system (Roche Applied Science, Indianapolis, IN) with 2% DMSO. In each case, successful amplification of a relatively small product from S. lividans indicates absence of the intervening genomic island.
Challis GL, Hopwood DA: Synergy and contingency as driving forces for the evolution of multiple secondary metabolite production by Streptomyces species. Proc Natl Acad Sci USA. 2003, 100 (Suppl 2): 14555-61. 10.1073/pnas.1934677100.
Choulet F, Aigle B, Gallois A, Mangenot S, Gerbaud C, Truong C, Francou FX, Fourrier C, Guerineau M, Decaris B: Evolution of the terminal regions of the Streptomyces linear chromosome. Mol Biol Evol. 2006, 23: 2361-9. 10.1093/molbev/msl108.
Leblond P, Decaris B: Unstable linear chromosomes: the case of Streptomyces. Organization of the prokaryotic genome. Edited by: Charlebois RL. 1999, ASM press, Washington DC, 263-89.
Kieser T, Bibb MJ, Buttner MJ, Chater KF, Hopwood DA: Practical Streptomyces Genetics. 2000, The John Innes Foundation, Norwich, UK
Bentley SD, Chater KF, Cerdeno-Tarraga AM, Challis GL, Thomson NR, James KD, Harris DE, Quail MA, Kieser H, Harper D: Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature. 2002, 417: 141-7. 10.1038/417141a.
Anne J, Van Mellaert L: Streptomyces lividans as host for heterologous protein production. FEMS Microbiol Lett. 1993, 114: 121-8. 10.1111/j.1574-6968.1993.tb06561.x.
Binnie C, Cossar JD, Stewart DI: Heterologous biopharmaceutical protein expression in Streptomyces. Trends Biotechnol. 1997, 15: 315-20. 10.1016/S0167-7799(97)01062-7.
Gilbert M, Morosoli R, Shareck F, Kluepfel D: Production and secretion of proteins by streptomycetes. Crit Rev Biotechnol. 1995, 15: 13-39.
MacNeil DJ: Characterization of a unique methyl-specific restriction system in Streptomyces avermitilis. J Bacteriol. 1988, 170: 5607-12.
Fornwald JA, Donovan MJ, Gerber R, Keller J, Taylor DP, Arcuri EJ, Brawner ME: Soluble forms of the human T cell receptor CD4 are efficiently expressed by Streptomyces lividans. Biotechnology (N Y). 1993, 11: 1031-6. 10.1038/nbt0993-1031.
Pozidis C, Lammertyn E, Politou AS, Anne J, Tsiftsoglou AS, Sianidis G, Economou A: Protein secretion biotechnology using Streptomyces lividans: large-scale production of functional trimeric tumor necrosis factor alpha. Biotechnol Bioeng. 2001, 72: 611-9. 10.1002/1097-0290(20010320)72:6<611::AID-BIT1026>3.0.CO;2-0.
Zhang Y, Wang WC, Li Y: Cloning, expression, and purification of soluble human interleukin-4 receptor in Streptomyces. Protein Expr Purif. 2004, 36: 139-45. 10.1016/j.pep.2004.04.010.
Hong B, Wu B, Li Y: Production of C-terminal amidated recombinant salmon calcitonin in Streptomyces lividans. Appl Biochem Biotechnol. 2003, 110: 113-23. 10.1385/ABAB:110:2:113.
Lara M, Servin-Gonzalez L, Singh M, Moreno C, Cohen I, Nimtz M, Espitia C: Expression, secretion, and glycosylation of the 45- and 47-kDa glycoprotein of Mycobacterium tuberculosis in Streptomyces lividans. Appl Environ Microbiol. 2004, 70: 679-85. 10.1128/AEM.70.2.679-685.2004.
Leblond P, Redenbach M, Cullum J: Physical map of the Streptomyces lividans 66 genome and comparison with that of the related strain Streptomyces coelicolor A3(2). J Bacteriol. 1993, 175: 3422-9.
Zhou X, He X, Li A, Lei F, Kieser T, Deng Z: Streptomyces coelicolor A3(2) lacks a genomic island present in the chromosome of Streptomyces lividans 66. Appl Environ Microbiol. 2004, 70: 7110-8. 10.1128/AEM.70.12.7110-7118.2004.
Dorrell N, Hinchliffe SJ, Wren BW: Comparative phylogenomics of pathogenic bacteria by microarray analysis. Curr Opin Microbiol. 2005, 8: 620-6. 10.1016/j.mib.2005.08.012.
Ochman H, Santos SR: Exploring microbial microevolution with microarrays. Infect Genet Evol. 2005, 5: 103-8. 10.1016/j.meegid.2004.09.002.
Zhou D, Han Y, Song Y, Tong Z, Wang J, Guo Z, Pei D, Pang X, Zhai J, Li M: DNA microarray analysis of genome dynamics in Yersinia pestis: insights into bacterial genome microevolution and niche adaptation. J Bacteriol. 2004, 186: 5138-46. 10.1128/JB.186.15.5138-5146.2004.
Dziejman M, Balon E, Boyd D, Fraser CM, Heidelberg JF, Mekalanos JJ: Comparative genomic analysis of Vibrio cholerae: genes that correlate with cholera endemic and pandemic disease. Proc Natl Acad Sci USA. 2002, 99: 1556-61. 10.1073/pnas.042667999.
Fitzgerald JR, Sturdevant DE, Mackie SM, Gill SR, Musser JM: Evolutionary genomics of Staphylococcus aureus: insights into the origin of methicillin-resistant strains and the toxic shock syndrome epidemic. Proc Natl Acad Sci USA. 2001, 98: 8821-6. 10.1073/pnas.161098098.
Brosch R, Pym AS, Gordon SV, Cole ST: The evolution of mycobacterial pathogenicity: clues from comparative genomics. Trends Microbiol. 2001, 9: 452-8. 10.1016/S0966-842X(01)02131-X.
Murray AE, Lies D, Li G, Nealson K, Zhou J, Tiedje JM: DNA/DNA hybridization to microarrays reveals gene-specific differences between closely related microbial genomes. Proc Natl Acad Sci USA. 2001, 98: 9853-8. 10.1073/pnas.171178898.
Weaver D, Karoonuthaisiri N, Tsai HH, Huang CH, Ho ML, Gai S, Patel KG, Huang J, Cohen SN, Hopwood DA: Genome plasticity in Streptomyces: identification of 1 Mb TIRs in the S. coelicolor A3(2) chromosome. Mol Microbiol. 2004, 51: 1535-50. 10.1111/j.1365-2958.2003.03920.x.
Mehra S, Lian W, Jayapal KP, Charaniya SP, Sherman DH, Hu WS: A framework to analyze multiple time series data: a case study with Streptomyces coelicolor. J Ind Microbiol Biotechnol. 2006, 33: 159-72. 10.1007/s10295-005-0034-7.
Hacker J, Carniel E: Ecological fitness, genomic islands and bacterial pathogenicity. A Darwinian view of the evolution of microbes. EMBO Rep. 2001, 2: 376-81.
Ochman H, Lerat E, Daubin V: Examining bacterial species under the specter of gene transfer and exchange. Proc Natl Acad Sci USA. 2005, 102 (Suppl 1): 6595-9. 10.1073/pnas.0502035102.
Zhang R, Zhang CT: A systematic method to identify genomic islands and its applications in analyzing the genomes of Corynebacterium glutamicum and Vibrio vulnificus CMCP6 chromosome I. Bioinformatics. 2004, 20: 612-22. 10.1093/bioinformatics/btg453.
Jansen R, Bussemaker HJ, Gerstein M: Revisiting the codon adaptation index from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models. Nucleic Acids Res. 2003, 31: 2242-51. 10.1093/nar/gkg306.
Chung HJ, Kim EJ, Suh B, Choi JH, Roe JH: Duplicate genes for Fe-containing superoxide dismutase in Streptomyces coelicolor A3(2). Gene. 1999, 231: 87-93. 10.1016/S0378-1119(99)00088-8.
Akopiants K, Florova G, Li C, Reynolds KA: Multiple pathways for acetate assimilation in Streptomyces cinnamonensis. J Ind Microbiol Biotechnol. 2006, 33: 141-50. 10.1007/s10295-005-0029-4.
Bunet R, Brock A, Rexer HU, Takano E: Identification of genes involved in siderophore transport in Streptomyces coelicolor A3(2). FEMS Microbiol Lett. 2006, 262: 57-64. 10.1111/j.1574-6968.2006.00362.x.
Miao V, Coeffet-Legal MF, Brian P, Brost R, Penn J, Whiting A, Martin S, Ford R, Parr I, Bouchard M: Daptomycin biosynthesis in Streptomyces roseosporus: cloning and analysis of the gene cluster and revision of peptide stereochemistry. Microbiology. 2005, 151: 1507-23. 10.1099/mic.0.27757-0.
Egan S, Wiener P, Kallifidas D, Wellington EM: Transfer of streptomycin biosynthesis gene clusters within streptomycetes isolated from soil. Appl Environ Microbiol. 1998, 64: 5061-3.
Kendall K, Cullum J: Cloning and expression of an extracellular-agarase from Streptomyces coelicolor A3(2) in Streptomyces lividans 66. Gene. 1984, 29: 315-21. 10.1016/0378-1119(84)90060-X.
Sumby P, Smith MC: Genetics of the phage growth limitation (Pgl) system of Streptomyces coelicolor A3(2). Mol Microbiol. 2002, 44: 489-500. 10.1046/j.1365-2958.2002.02896.x.
Birch A, Leiser A, Robinson JA: Cloning, sequencing, and expression of the gene encoding methylmalonyl-coenzyme A mutase from Streptomyces cinnamonensis. J Bacteriol. 1993, 175: 3511-9.
Gehring AM, Wang ST, Kearns DB, Storer NY, Losick R: Novel genes that influence development in Streptomyces coelicolor. J Bacteriol. 2004, 186: 3570-7. 10.1128/JB.186.11.3570-3577.2004.
Romero NM, Parro V, Malpartida F, Mellado RP: Heterologous activation of the actinorhodin biosynthetic pathway in Streptomyces lividans. Nucleic Acids Res. 1992, 20: 2767-72. 10.1093/nar/20.11.2767.
Keijser BJ, van Wezel GP, Canters GW, Kieser T, Vijgenboom E: The ram-dependence of Streptomyces lividans differentiation is bypassed by copper. J Mol Microbiol Biotechnol. 2000, 2: 565-74.
Eccleston M, Ali RA, Seyler R, Westpheling J, Nodwell J: Structural and genetic analysis of the BldB protein of Streptomyces coelicolor. J Bacteriol. 2002, 184: 4270-6. 10.1128/JB.184.15.4270-4276.2002.
Sumby P, Smith MC: Phase variation in the phage growth limitation system of Streptomyces coelicolor A3(2). J Bacteriol. 2003, 185: 4558-63. 10.1128/JB.185.15.4558-4563.2003.
Hesketh AR, Chandra G, Shaw AD, Rowland JJ, Kell DB, Bibb MJ, Chater KF: Primary and secondary metabolism, and post-translational protein modifications, as portrayed by proteomic analysis of Streptomyces coelicolor. Mol Microbiol. 2002, 46: 917-32. 10.1046/j.1365-2958.2002.03219.x.
Manteca A, Mader U, Connolly BA, Sanchez J: A proteomic analysis of Streptomyces coelicolor programmed cell death. Proteomics. 2006
Pallen MJ: The ESAT-6/WXG100 superfamily – and a new Gram-positive secretion system? Trends Microbiol. 2002, 10: 209-12. 10.1016/S0966-842X(02)02345-4.
Chater KF, Chandra G: The evolution of development in Streptomyces analysed by genome comparisons. FEMS Microbiol Rev. 2006, 30: 651-72. 10.1111/j.1574-6976.2006.00033.x.
Redenbach M, Kleinert E, Stoll A: Identification of DNA amplifications near the center of the Streptomyces coelicolor M145 chromosome. FEMS Microbiol Lett. 2000, 191: 123-9. 10.1111/j.1574-6968.2000.tb09328.x.
Butler MJ, Bruheim P, Jovetic S, Marinelli F, Postma PW, Bibb MJ: Engineering of primary carbon metabolism for improved antibiotic production in Streptomyces lividans. Appl Environ Microbiol. 2002, 68: 4731-9. 10.1128/AEM.68.10.4731-4739.2002.
Butler MJ, Takano E, Bruheim P, Jovetic S, Marinelli F, Bibb MJ: Deletion of scbA enhances antibiotic production in Streptomyces lividans. Appl Microbiol Biotechnol. 2003, 61: 512-6.
Hu H, Zhang Q, Ochi K: Activation of antibiotic biosynthesis by specified mutations in the rpoB gene (encoding the RNA polymerase beta subunit) of Streptomyces lividans. J Bacteriol. 2002, 184: 3984-91. 10.1128/JB.184.14.3984-3991.2002.
Kim ES, Hong HJ, Choi CY, Cohen SN: Modulation of actinorhodin biosynthesis in Streptomyces lividans by glucose repression of afsR2 gene transcription. J Bacteriol. 2001, 183: 2198-203. 10.1128/JB.183.7.2198-2203.2001.
Sola-Landa A, Moura RS, Martin JF: The two-component PhoR-PhoP system controls both primary metabolism and secondary metabolite biosynthesis in Streptomyces lividans. Proc Natl Acad Sci USA. 2003, 100: 6133-8. 10.1073/pnas.0931429100.
Zhang YX, Denoya CD, Skinner DD, Fedechko RW, McArthur HA, Morgenstern MR, Davies RA, Lobo S, Reynolds KA, Hutchinson CR: Genes encoding acyl-CoA dehydrogenase (AcdH) homologues from Streptomyces coelicolor and Streptomyces avermitilis provide insights into the metabolism of small branched-chain fatty acids and macrolide antibiotic production. Microbiology. 1999, 145 (Pt 9): 2323-34.
Banchio C, Gramajo H: A stationary-phase acyl-coenzyme A synthetase of Streptomyces coelicolor A3(2) is necessary for the normal onset of antibiotic production. Appl Environ Microbiol. 2002, 68: 4240-6. 10.1128/AEM.68.9.4240-4246.2002.
Mishig-Ochiriin T, Won HS, Lee CJ, Kang SO, Lee BJ: Biophysical and structural property of the putative DNA-binding protein, BldB, from Streptomyces lividans. Biopolymers. 2003, 69: 343-50. 10.1002/bip.10387.
Harasym M, Zhang LH, Chater K, Piret J: The Streptomyces coelicolor A3(2) bldB region contains at least two genes involved in morphological development. J Gen Microbiol. 1990, 136: 1543-50.
Eccleston M, Willems A, Beveridge A, Nodwell JR: Critical residues and novel effects of overexpression of the Streptomyces coelicolor developmental protein BldB: evidence for a critical interacting partner. J Bacteriol. 2006, 188: 8189-95. 10.1128/JB.01119-06.
Rocha EP, Danchin A: Base composition bias might result from competition for metabolic resources. Trends Genet. 2002, 18: 291-4. 10.1016/S0168-9525(02)02690-2.
Zhang R, Zhang CT: Genomic islands in the Corynebacterium efficiens genome. Appl Environ Microbiol. 2005, 71: 3126-30. 10.1128/AEM.71.6.3126-3130.2005.
This study was supported in part by a grant from the National Institutes of Health (GM55850). Bioinformatics support was provided by the University of Minnesota Supercomputing Institute.
KPJ analyzed and interpreted comparative genomic hybridization data, performed sequence analysis, carried out PCR verifications, performed transcriptomic and proteomic experiments and drafted the manuscript. WL participated in preliminary data analysis including data normalization and interpretations. FG conceived of the study and carried out comparative genomic hybridizations. DHS and WSH participated in discussions, critical review of results presented here and assisted in manuscript preparation. All authors have read and approved the final manuscript.