Genome-wide structural and evolutionary analysis of the P450 monooxygenase genes (P450ome) in the white rot fungus Phanerochaete chrysosporium : Evidence for gene duplications and extensive gene clustering
© Doddapaneni et al; licensee BioMed Central Ltd. 2005
Received: 24 November 2004
Accepted: 14 June 2005
Published: 14 June 2005
Phanerochaete chrysosporium, the model white rot basidiomycetous fungus, has the extraordinary ability to mineralize (to CO2) lignin and detoxify a variety of chemical pollutants. Its cytochrome P450 monooxygenases have recently been implied in several of these biotransformations. Our initial P450 cloning efforts in P. chrysosporium and its subsequent whole genome sequencing have revealed an extraordinary P450 repertoire ("P450ome") containing at least 150 P450 genes with yet unknown function. In order to understand the functional diversity and the evolutionary mechanisms and significance of these hemeproteins, here we report a genome-wide structural and evolutionary analysis of the P450ome of this fungus.
Our analysis showed that P. chrysosporium P450ome could be classified into 12 families and 23 sub-families and is characterized by the presence of multigene families. A genome-level structural analysis revealed 16 organizationally homogeneous and heterogeneous clusters of tandem P450 genes. Analysis of our cloned cDNAs revealed structurally conserved characteristics (intron numbers and locations, and functional domains) among members of the two representative multigene P450 families CYP63 and CYP505 (P450foxy). Considering the unusually complex structural features of the P450 genes in this genome, including microexons (2–10 aa) and frequent small introns (45–55 bp), alternative splicing, as experimentally observed for CYP63, may be a more widespread event in the P450ome of this fungus. Clan-level phylogenetic comparison revealed that P. chrysosporium P450 families fall under 11 fungal clans and the majority of these multigene families appear to have evolved locally in this genome from their respective progenitor genes, as a result of extensive gene duplications and rearrangements.
P. chrysosporium P450ome, the largest known todate among fungi, is characterized by tandem gene clusters and multigene families. This enormous P450 gene diversity has evolved by extensive gene duplications and intragenomic recombinations of the progenitor genes presumably to meet the exceptionally high metabolic demand of this biodegradative group of basidiomycetous fungi in ecological niches. In this context, alternative splicing appears to further contribute to the evolution of functional diversity of the P450ome in this fungus. The evolved P450 diversity is consistent with the known vast biotransformation potential of P. chrysosporium. The presented analysis will help design future P450 functional studies to understand the underlying mechanisms of secondary metabolism and oxidative biotransformation pathways in this model white rot fungus.
The cytochrome P450 monooxygenases ("P450s") constitute a large superfamily of heme-thiolate proteins widely distributed in different life forms including prokaryotes (archaea, bacteria), lower eukaryotes (fungi, insects), and higher eukaryotes (plants and animals). P450s play an important role in the metabolism of a wide variety of endogenous and xenobiotic compounds. The current P450 nomenclature  is based on divergent evolution of the P450 superfamily. On the basis of sequence homology, all P450s can be categorized into two main classes, B ('bacterial') and E ('eukaryotic') . Bacterial P450s with three component systems [an FAD-containing flavoprotein (NADPH or NADH-dependent reductase), an iron-sulphur protein, and the P450 hemeprotein] and the fungal P450nor (CYP55)  belong to the 'B'-class. All the other known P450s from distinct systems, including eukaryotic P450s and bacterial P450s such as P450BM3 (CYP102) from Bacillus megaterium , belong to the 'E'-class. The eukaryotic microsomal P450 system contains two components, the NADPH:P450 oxidoreductase (POR), a flavoprotein containing both FAD and FMN, and the P450 monooxygenase containing the heme domain. The prokaryotic (bacterial) soluble P450 monooxygenase P450BM3 (CYP102) exists as a single protein with both heme and flavin functional domains. Typically, the bacterial P450s are soluble and shorter (approximately 400 amino acids) in length, whereas the eukaryotic P450s average around 500–600 amino acids or larger and are membrane-bound. The amino acid (aa) sequence of these P450 monooxygenase proteins is extremely diverse, with levels of identity as low as 16% in some cases, but their structural fold has remained the same throughout evolution. The existing data suggest that divergence of the P450 superfamily into B and E classes, and further divergence into stable P450 groups within the E class, is very ancient and had occurred before the appearance of eukaryotes . From the phylogenetic classification point of view, families have been identified based primarily on amino acid sequence similarity, with less than 40% similarity defining a family and 40–55% similarity defining a sub-family. Recently, the concept of "Clan" which represents higher order grouping of P450 families is gaining wider acceptance in the P450 community [6, 7].
Among the different phyla, plants have the highest number of P450 sequences, followed by fungi. So far, more than 380 fungal P450s have been identified and the number is increasing with the sequencing of new fungal genomes http://drnelson.utmem.edu/cytochromeP450.html. In comparison to the yeast forms (Saccharomyces and Candida), the higher order fungi have a larger number of P450 sequences in their genomes, with 41 P450 genes predicted in Neurospora crassa , 111 in Aspergillus spp ., 123 in Magnaporthe grisea , 110 in Fusarium graminearum and about 150 in Phanerochaete chrysosporium. Assuming the possibility that the present day P450s have evolved from a single ancestor P450 gene, the large disparity in the P450 gene count among the different phyla and genera is reflective of the differences in metabolic demand. According to Nelson , while plants have invested heavily on biochemistry for development of survival strategies thereby driving the P450s to expand rapidly, animals have invested on development of higher order sensory systems and hence carry comparatively fewer P450s. Fungi, which resemble plants in their sedentary habitat, appear to have driven P450s to diversify rapidly.
Phanerochaete chrysosporium , the model white rot fungus, has the extraordinary ability to degrade and mineralize (to CO2) lignin, the earth's most abundant aromatic polymer, and a wide range of toxic chemical pollutants. Lignin biodegradation occurs under nutrient-limited conditions when the fungus enters secondary metabolism. Only white rot fungi are known to possess the ability to completely degrade lignin ,,. The working hypothesis is that initial depolymerization of the lignin by extracellular peroxidases releases chemical compounds that are internalized and further degraded by diverse intracellular enzymes, including P450 monooxygenases. Further, several studies have shown that this fungus can degrade and mineralize a broad spectrum of aromatic, alicyclic, and aliphatic chemical pollutants under both nutrient-limited (ligninolytic) and nutrient-sufficient (non-ligninolytic) conditions. Lately, P450 monooxygenases have been shown to be involved in several of these biotransformations and this has led to an increasing interest in the characterization of cytochrome P450 systems in this model white rot fungus.
Initial cloning efforts from this laboratory identified the first complete P450 system genes in P. chrysosporium. These efforts led to the isolation  and functional characterization [14, 15, 17] of three full-length P450 genes pc-1, pc-2 and pc-3 that were assigned to CYP63 family, and the P450 oxidoreductase gene (POR ) that is responsible for electron transfer to the multiple P450 monooxygenases . Subsequent whole genome sequencing by the Joint Genome Institute (JGI) of the US Department of Energy (US-DOE) has led to the identification of a large P450 contingent (P450ome) in P. chrysosporium . The P450 genes constitute nearly 1% of the coding sequences in the genome of this basidiomycetous fungus . Of the nearly 150 P450 monooxygenase genes identified in P. chrysosporium genome, 108 have been assembled full-length and tentatively annotated based on general overall sequence homology analysis . To date, this is the highest number of P450s identified in any fungus, which appears to be one of the major underlying factors for its extraordinary catalytic potential. Understanding the structural and evolutionary aspects of such large family of P450s in conjunction with their physiological characteristics will help understand the functional versatility of this fungus. This study reports a detailed genome-wide structural and phylogenetic analysis of the P450ome of P. chrysosporium , coupled with cloning and characterization of new cDNAs for selected multigene P450 families, with a broader aim to facilitate the classification/nomenclature and to understand the functional diversity and evolution of the P450s in this wood-rotting model white rot fungus.
Results and Discussion
I. Structural analysis and characterization of P. chrysosporium P450ome
P450ome of P. chrysosporium
Among the 12 P450 families, the CYP64 family has the highest number (more than fifty) of member genes, followed by CYP67 (sixteen), CYP503 (fourteen), CYP58/53 (ten), CYP63 (seven), CYP505 (seven), CYP614/534 (seven), CYP617/547 (six), CYP5031/CYP547 (two), whereas P450 families CYP51, CYP61 and CYP62 consist of only one member gene each.
The International Nomenclature Committee on cytochrome P450s has been annotating the new P450 sequences on the basis of the existing primary amino acid sequence similarity criteria. However, there have been instances where the 40% similarity rule in defining a family has been relaxed, such as in naming the mammalian CYP4 family . Simultaneously, the usage of CYP "Clans" which are higher-order clusters of related families is gaining consensus [6, 7]. In view of this, a clan-based classification of the P450 families in P. chrysosporium is presented and discussed in subsequent sections, with an aim to understand their evolution and functional significance.
Cloning of new P450 cDNAs
Our initial studies on cloning of cDNAs for the first cloned P450 gene pc-1  and the related gene pc-3 , both belonging to CYP63 family, indicated a complex structural organization of P450 genes in P. chrysosporium. In an effort to further understand the structural organization of P450 genes in this system, the following additional cDNA sequences for the other CYP63 genes were isolated in the present study: pc-2 (1842 bp), pc-4 (430 bp), pc-5 (324 bp) and pc-6 (330 bp). The isolated cDNA sequences (full-length and partial) were then compared with the corresponding predicted coding sequences in the genome. This gene/cDNA analysis helped understand the gene features of the first multigene P450 family (CYP63) identified in this organism . The analysis also helped identify an additional gene, pc-7, groupable under this family. Furthermore, we isolated a cDNA sequence representing the C-terminus half (the reductase domain) of one of the fused P450foxy-like genes, pc-foxy 1. The cloned cDNA (681 bp), that spans 747 bp of the corresponding genomic region, helped identify the structural features of the P450foxy gene family (CYP505) of the P450ome.
Gene features (introns, exons and deduced coding regions) of the P. chrysosporium P450s
i). CYP63 family
ii). P450foxy gene family (CYP505)
The five P450foxy member genes on genome scaffold 73 show divergent transcription, with the three member genes (ug.73.17.1, pc.73.4.1 and pc.73.14.1) transcribing from the same strand, whereas the other two genes (ug.73.15.1 and ug.73.16.1) transcribing from the opposite strand of DNA. The deduced P450foxy proteins containing 624–1111 amino acid residues show high aa similarity (> 55%) among them and hence can be classified under the same sub-family. The general domain architecture of these proteins is the same as that of the earlier reported P450foxy protein from F. oxysporum , with an N-terminal P450 segment and a C-terminal reductase segment. All seven proteins have a full-length P450 segment with conserved P450 signature sequences. However, the reductase segment is full-length in only 6 of the seven members and is truncated in the member gene gx.187.5.1. These six member proteins contain a transmembrane domain, centered around 908–920 aa positions with a matrix score of more than 1000, as detected by TMpred analysis. The truncated member contains a transmembrane domain centered on amino acid residue 570. The above analysis points to the membrane-bound nature of these fused proteins similar to the homologous F. oxysporum protein and consistent with the membrane-bound nature of most of the eukaryotic P450 and POR proteins.
It is unusual that P. chrysosporium has only one P450 oxidoreductase (POR) gene to cater to such a large contingent of P450 monooxygenases for the electron supply, with the exception of P450foxy proteins (see below). This suggests a more significant role of alternate electron transfer proteins such as cytochromes b5 and b5 reductase in this organism; it will, however, require further functional characterization to prove this assumption. The P450foxy proteins appear to be self sufficient in their electron transfer mechanism considering the presence of a complete reductase partner (with all needed functional domains) as a part of the fusion protein. Such electron transfer function of the reductase component was experimentally demonstrated in the first known P450foxy fusion protein of the ascomycete fungus Fusarium oxysporum .
iii). Gene features of other P450ome genes
Among the other P450 families in the P450ome, the CYP64 family has the most genes (54), with member genes carrying 5–12 introns of size 21–268 bp, and encoding 350–534 aa size proteins. Detailed gene features of the other individual P450 families of the P. chrysosporium P450ome are listed in Table 1.
Typically, all the full-length assembled P450 genes were found to carry multiple exons frequently interrupted by small introns. In this analysis, we observed a relatively narrow size range for introns (21 to 620 bp) as compared to the exons; the shortest exon is 6 bp (2 aa) and the longest is 1048 bp (Table 1). Nearly half of the P450 genes show small exons (upto 67 bp), with 41 genes carrying 29 bp or shorter exons (Table 1). Interestingly, the P450ome genes contain microexons (encoding 2–10 aa) that are spread genome-wide and are not restricted to any particular P450 family. Such ubiquitous occurrence of the microexons in P. chrysosporium P450 genes points to their potential role in conferring functional diversity and emphasizes the fact that the functional divergence is not merely a product of gene duplication/translocation. In comparison to the P450 genes, the intron size range in the most intensively characterized peroxidase gene family of this organism is 49–78 bp, with 49 to 78 bp for the lignin peroxidase (LiP) sub-family and 50 to 72 bp for the manganese-dependent peroxidase (MnP) sub-family. Furthermore, as revealed by the whole genome sequencing, the presence of relatively small introns (average 117 bp) appears typical of this genome as compared to the higher eukaryotic genomes .
P450 gene clustering and tandem genes
The P. chrysosporium genome shows a total of 16 P450 gene clusters, with up to 11 member genes in a cluster. Of the 16 clusters, three clusters are located on scaffold 30, two each on scaffolds 20, 24, and 73, and one each on scaffolds 1, 16, 50, 53 59, 79 and 97. For convenience, the clusters have been assigned numbers 1 through 16 in the order they appear on the individual genomic scaffolds. Among these, cluster-5 has the highest number of tandem genes (eleven) followed by cluster-3 (five) and cluster-6 (four) (Table 2). The earlier cloned tandem CYP63 genes pc-1, pc-2 and pc-3 represent cluster-4.
There is a high sequence similarity (up to 84%) among the members of a given cluster and the cluster members fall under the same P450 family on the phylogenetic tree (Figure 1; Table 2). The number of introns and exons and their relative positions in the member genes of a cluster are conserved in 10 of the 16 clusters (Table 2). Two distinct patterns exist for the intron organization (number and position) within each cluster. In the case of small gene clusters (2–3 member genes), the number and relative position of introns as well as the size of the exons are conserved among the members, whereas the larger gene clusters (> 3 member genes) show varying degrees of dissimilarity in these characteristics. While the conserved gene characteristics suggest more recent duplications, the dissimilarity suggests either more distant duplication events or translocation events. Therefore, the P. chrysosporium genome shows both homogeneous and heterogeneous clusters of tandem genes. Tandem duplicates (paralogous genes), which initially are structurally and functionally identical, diverge with time due to mutations or translocations. It is plausible that one copy retains the original function, while the second copy either acquires a new function that is selected to meet the increased metabolic demand or gets deleted from the genome. In this context, it is noteworthy that member genes of cluster-4 (pc-1, pc-2 and pc-3), though closely spaced and structurally highly homogeneous, showed non-coordinate regulation of transcription and varying substrate inducibility [14, 17]. While such substrate diversity among the members of tandemly linked P450 genes has been observed in the yeast Candida maltosa , this observation is new in the context of filamentous fungi.
Analysis of spatial organization of the P450 gene clusters in the P. chrysosporium genome revealed a variable pattern. For instance, the two P450 clusters on scaffold-20 are spread over a 220 kb genomic region separated by more than 200 kb, whereas the gene clusters on other scaffolds are more closely spaced. On scaffold-24, the clusters 5 and 6 are spread over a 62 kb of genomic region and are separated by less than 10 kb genomic region. Similarly, clusters 7 to 9 on scaffold-30 are spread over an 80 kb region with a 27–28 kb gap between them. The two clusters on scaffold-73 are separated by less than 10 kb DNA and are spread over a 30 kb genomic region. This analysis constitutes the first report on P450 clustering and spatial organization in filamentous fungi. Nevertheless, in fungi, it has been frequently observed that the genes coding for enzymes involved in secondary metabolism, such as those involved in the synthesis of ergot alkaloids , HC-toxin , and mycotoxins such as sterigmatocystin and aflatoxins [22, 23], are heterologous clusters (containing P450 and other metabolic genes). Clustering of secondary metabolic genes has been proposed to favor their survival and dispersal, at least in part, via horizontal gene transfer [23, 24]. The close association seen in some of the P. chrysosporium gene clusters might point to their co-ordinated regulation as observed in the case of the above secondary metabolic gene clusters in different fungi. However, experimental evidence is needed to extrapolate this assumption to the fungal P450 clusters. Our initial transcription-based analysis demonstrated lack of such co-ordinated regulation among the tandem CYP63 genes of cluster-4 in P. chrysosporium .
Alternative splicing and functional diversity
Alternative splicing is an important mechanism for regulation of gene expression, which expands the coding capacity of a single gene to allow production of different protein isoforms, often with diverse functions . More than 50% of human genes are alternatively spliced . Reports in fungi on alternative splicing are few [12, 27], in comparison to humans where this mechanism has been well documented. We have experimentally identified two splice variants of the first characterized P450 gene pc-1 (CYP63A1) from P. chrysosporium  and predicted more such variants based on the in silico analysis. Although splice variants have not been identified so far in the other two tandemly arranged members pc-2 and pc-3 of this cluster, their similar gene organization (typically marked by multiple introns and exons as small as 4 to 10 aa length) suggests the existence of splice variants. Further, while validating our custom 70-mer microarray analysis data  on P450 gene transcription and induction using RT-PCR (wherein gene specific primers were chosen from different locations on the gene), we observed that the transcript quantification for a given sample in some cases varied with the location of the primer chosen (Unpublished data). This observed variation points to the existence of alternative splice variants. A recently proposed system of nomenclature for such splice variants suggests that the transcript name should include the exon involved in the splicing event . It is noteworthy that nearly one-third of the P. chrysosporium P450ome shows microexons, which are likely candidates that promote alternative splicing events during transcription. Such alternative splicing helps increase the diversity of the transcriptome, and is likely to significantly contribute to the metabolic diversity of this organism.
II. Evolutionary analysis of P. chrysosporium P450ome
Fungal P450 clans in P. chrysosporium
Finding meaningful associations and evolutionary relationships among members of the rapidly expanding superfamily of the P450 proteins at the species level is becoming a challenge using the existing family level classification. For instance, the CYP6 family that is exclusive to insects forms a close cluster with the CYP3 and CYP5 families from mammals, CYP30 from clams, and CYP25 from C. elegans on the phylogenetic tree, indicating that these five families probably have evolved from a common ancestor with similar function before the deuterostome-protostome split . However, this is not reflected in the family names, as the family classification is based on an arbitrary 40% amino acid similarity criterion. To explain such higher order groupings, the term "Clan" was recently introduced . Typically, a clan represents a cluster of P450 families across species, grouped based on relationships that are beyond the family designations. There are 9 clans in vertebrates and 10 in plants.
A detailed phylogenetic analysis was carried out to understand the evolutionary relationship of P. chrysosporium P450 gene families and their relatedness with other fungal clans. In lower forms of fungi (yeasts), 4 P450 families (CYP51, 52, 53 and 61) have been characterized, of which CYP51 and 61 are conserved. In higher forms of fungi (filamentous fungi), 13 P450 families (CYP51, 53, 61, 65, 68, 505, 531, 532, 537, 539, 540, 548, and 552) are common as exemplified by the hitherto sequenced genomes of four ascomycetous fungal species: Neurospora crassa , Magnoporthe grisea , Fusarium graminearum and Aspergillus nidulans . Interestingly, based on homology analysis, the P450 families CYP51, 53, 61, and 505 are also present in P. chrysosporium (a basidiomycetous fungus) albeit with a widely varying degree of similarity. Further, clan level comparisons revealed that 12 P. chrysosporium P450 families (Figure 1) have resemblances in 11 fungal P450 clans and show varying degrees of structural similarities to the P450 genes from different ascomycetous fungi such as Aspergillus and Fusarium , suggesting that P. chrysosporium has acquired these P450 families as a part of vertical descent from a common ancestor followed by further diversification. It will be interesting to compare the hitherto uncharacterized zygomycetous P450ome to arrive at the ancestral P450ome that led to the current P450 diversity among these three major fungal groups (ascomycetes, zygomycetes and basidiomycetes).
CYP 61 clan
The fatty acid hydroxylase P450BM3 (CYP102) of the bacterium Bacillus megaterium , containing a P450 monooxygenase gene fused with a P450 reductase gene, was the first identified fusion protein member of the P450 superfamily . Later, a similar fused P450 gene coding for fatty acid hydroxylase was identified from the fungus Fusarium oxysporum . The latter, named P450foxy, shows 40.6% and 35.3% amino acid similarity in its P450 and reductase domains to the corresponding domains in the bacterial P450 fusion protein P450BM3. The current hypothesis suggests that such fused proteins are of eukaryotic origin and their occurrence in the prokaryotic (bacterial) cells is due to horizontal gene transfer .
Many questions regarding the origin and distribution of these fused P450 proteins in fungi remain unanswered. One can conclude from the phylogenetic tree (Figure 9) that more than one original fusion event happened in the ancestral fungus either before the ascomycetous-basidiomycetous split approximately 400 million years ago , or a second fusion event happened in ascomycetous fungi immediately after these two groups split. Either way, during the course of evolution, these proteins have diversified further, possibly due to more gene fusions or due to gene duplications (Figure 10). Assuming that the ancestral fungus carried two fusion proteins, the question as to what happened to the other gene lineage in basidiomycetous fungi arises; has it been lost immediately after these two groups split without further diversification or was it never there and the second lineage in the ascomycetous fungi originated after these groups had separated? However, there is one complication to this argument; if P450foxy proteins predate the ascomycetous-basidiomycetous split, then why are these proteins missing entirely in archiascomycotina and hemiascomycotina (fission and budding yeasts)? This is an open question and may be answered by analyzing the genomes of vesicular-arbuscular mycorrhizas (VAM) or chitrids, which predate these two groups' split . The third possible reason for occurrence of these fused proteins in P. chrysosporium could be horizontal gene transfer from one or more of the ascomycetous fungi. When the P. chrysosporium P450foxy proteins were compared to the recently completed whole genome shotgun sequences of other basidiomycetes Coprinus cinereus , Cryptococcus neoformans , and Ustilago maydis , no P450 fused protein homologues were found indicating the unique presence of these proteins in P. chrysosporium, a member of the wood-rooting group of basidiomycetes. However, as more fungal genomes become available, especially from the basidiomycetous group, it will be clear if these fusion proteins are actually unique in P. chrysosporium (or wood-rotting basidiomycetes subgroup) among basidiomycetous fungi. Nevertheless, it appears that fused proteins are predominantly present across ascomycetous fungi, and their presumed exceptional occurrence in the basidiomycete P. chrysosporium could possibly be a result of a horizontal gene transfer, an event otherwise rare in fungal genome evolution [37, 23, 24]. Looking at the gene organization and flanking sequence homology of these P450 genes in P. chrysosporium in Figures 9 and 10, and their distribution on the genome, it appears that six of the seven genes have branched out from a single progenitor gene, while the origin of the seventh gene pc.17.40.1 remains unclear and could possibly be a result of independent transfer. Role of intragenomic duplication event in the origin of pc.17.40.1 is ruled out based on the fact that there is no flanking sequence similarity between this and the other six genes. Furthermore, P. chrysosporium has the largest contingent (7 member genes) of P450foxy proteins (CYP505) among the fungi (3–5 member genes) containing this family of proteins. These observations collectively point to a rapid evolution of this P450 fusion proteins family (CYP505) in P. chrysosporium, possibly to meet the metabolic demand for fatty acid hydroxylation in the ecological niches of this fungus.
Clan-level relationships of other P. chrysosporium P450 families
Based on the clan-level comparison, it is evident that, although the P. chrysosporium P450ome has member counterparts among different fungal groups, in most cases, their independent clustering suggests significant sequence diversification and likely unique functionality. Such diversification emphasizes the need for assigning new family names to the P. chrysosporium P450s.
The recognized extraordinary catalytic diversity of the white rot fungus P. chrysosporium correlates with its enormous P450 repertoire (P450ome), which is one of the largest among lower eukaryotes. Our structural and phylogenetic analyses of the P450ome, meant to understand the genesis of such a large number of P450 genes and facilitate their classification/nomenclature, have provided important clues to the evolution of the enormous catalytic diversity in this fungus. Considering the fact that certain P450 families (such as CYP64) have diversified more extensively than others, it appears that the P. chrysosporium P450ome has evolved in specific directions to meet the metabolic demand in its environmental niches. While the ancestral genes like CYP51 and CYP61 have remained unchanged, possibly due to their minimal role in P. chrysosporium (and other saprophytic fungi) versus that in their parasitic cousins such as pathogenic Aspergilli, other P450 gene families with suggestive roles in secondary metabolism (such as CYP64) have evolved as multigene families and even exist as gene clusters in this fungus. Such family-specific evolution was warranted presumably due to an extensive demand for generation of a broad range of metabolites in the secondary metabolic switch required for degradation of complex natural substrates such as lignin. The presented analysis indicates that the progenitor P450 families, originally acquired as a part of vertically descending P450ome, have diversified rapidly via multiple genetic mechanisms such as tandem duplications, translocations, mutations, and possibly gene fusions, to give rise to majority of the multigene families in the P. chrysosporium P450ome. Consequently, this structural diversification within individual multigene P450 families, as experimentally demonstrated in the case of CYP63 family in our studies, seems to have led to the acquisition of novel functions hitherto unseen in their ancestral counterparts (CYP52 genes of yeasts in this case). In addition, our experimental evidence for the presence of alternatively spliced variants in the P. chrysosporium P450 transcriptome further explains the evolution of expanded substrate diversity in this organism. The P. chrysosporium P450ome forms a model to investigate extrapolation of the evolved P450 gene diversity to the known vast biodegradation potential and will help design the future functional studies to understand the individual P450 gene functions in order to dissect the P450 functional diversity in white rot fungi.
Fungal cultures and cDNA cloning
Phanerochaete chrysosporium strain BKM-F-1767 (ATCC 24725) was grown as shaken cultures for 4 days in defined low nitrogen (LN) medium (2.4 mM N, 1% glucose) as described previously . Total RNA (500 ng) was isolated from frozen fungal mycelia using the TRI Reagent kit (Molecular Research Center, Cincinnati, OH, USA) and a cDNA pool was generated using the SMART™ PCR cDNA synthesis kit (Clontech, Palo Alto, CA, USA) per the manufacturer's instructions. Briefly, first-strand cDNA synthesis step was carried out in a 10 μl reaction volume using 200 units of MMLV reverse transcriptase and 1 μM each of the SMART III Oligonucleotide and the CDS III/3' PCR Primer, at 42°C for 1 hour. One-fifth of the first-strand reaction (2 μl) was then added to a 100 μl long distance (LD)-PCR reaction with 1 μM each of the 5' PCR Primer and the CDS III/3' PCR Primer, for the synthesis of a double-stranded (ds) cDNA pool. Amplification parameters included initial denaturation at 95°C for 1 min., followed by a two step-PCR protocol involving use of 95°C for 1 min. and 68°C for 6 min., for 24 cycles. Quality of the ds cDNA amplified was analyzed on a 1.1% Agarose/EtBr gel and the product was quantified using spectrophotometer. Gene-specific cDNA isolation involved use of 100 ng of this cDNA mix as the template, in conjunction with an appropriate pair of gene-specific primers listed in Table 3. The gene-specific cDNA synthesis reaction contained 100 ng each of the forward and the reverse primer in a 50 μl PCR reaction volume and the amplification included 37 cycles, each involving denaturation at 95°C for 30 sec, annealing at 60°C for 30 sec, and extension at 72°C for 1 min.
The cDNA amplicons generated were cloned using 2.1-TOPO vector (Invitrogen, Carlsbad, CA, USA) per the manufacturer's instructions. The recombinant plasmid DNA for amplicon sequencing was isolated and purified using QIAprep Spin Miniprep kit (Qiagen, Valencia, CA) per the manufacturer's specifications. The DNA sequencing was performed at the university's DNA core facility. The cloned cDNA sequences generated in this study have been submitted to the GenBank under the accession numbers-AY835607 (pc-2), AY321373 (pc-4), AY321374 (pc-5), AY835606 (pc-6) and AY835608 (pc-foxy 1).
Sequence alignments and phylogenetic analysis
The P. chrysosporium P450 sequences used in the phylogenetic analysis were retrieved from the website http://genome.jgi-psf.org/whiterot1/whiterot1.home.html of the Joint Genome Institute of US Department of Energy- (US-DOE) and from the P450 website http://drnelson.utmem.edu/whiterot.html. The deduced amino acid sequence for a given P450 was compared from the above two sources and a sequence with the longest aa stretch was selected. However, the gene number assigned in the published white rot genome was retained for uniformity and convenience. For the cloned cDNAs, Gene Runner program (version 3.05, Hastings Software, Inc. Hastings, NY, USA) was used to extract and analyze the corresponding gene sequences from the genome, design the primers for RT-PCR amplification, and deduce the amino acid sequences. P450 sequences for other fungi were obtained from the NCBI GenBank database and the P450 website. Sequences were aligned using the CLUSTALW program at the EMBL-EBI website http://www.ebi.ac.uk/clustalw/. Alignment of the 126 sequences of the P. chrysosporium P450ome was generated using the following customized parameters that varied from the default parameters- Matrix-BLOSUM (Henikoff), and Gap Open Penalty -1. All the other multiple alignments were generated using the default alignment parameters including the Matrix-GONNET 250 and Gap Open Penalty -10. Phylogenetic trees were constructed using the MEGA 2.1 software http://www.megasoftware.net . The minimal evolution trees (Figures 5 to 10 and 11 to 14) were generated by heuristic search using the Close-Neighbor-Interchange (CNI) algorithm, with the Neighbor-Joining tree serving as the temporary tree. The topological distance (dT) was set at 2 for searching the minimal evolution tree. The P. chrysosporium P450ome (Figure 1) was constructed using the Unweighted Pair Group Method with Arithmatic Mean (UPGMA) method with gamma distance model. The alignment gaps and missing data sites were deleted and a bootstrap value based on 1000 replications was set for all the phylogenetic trees generated in this study. Protein sequences used for constructing the phylogenetic trees were obtained either from the GenBank (those shown with accession numbers) or from the P450 web site http://drnelson.utmem.edu/cytochromeP450.html (those with preassigned CYP names).
This work was supported by the NIH's National Institute of Environmental Health Sciences (NIEHS) grant R01-ES10210 (JSY).
- Nelson DR, Koymans L, Kamataki T, Stegeman JJ, Feyereisen R, Waxman DJ, Waterman MR, Gotoh O, Coon MJ, Estabrook RW, Gunsalus IC, Nebert DW: P450 superfamily: update on new sequences, gene mapping, accession numbers and nomenclature. Pharmacogenetics. 1996, 6: 1-42.PubMedView ArticleGoogle Scholar
- Gotoh O: Evolution and differentiation of P450 genes. Cytochrome P450 Second Edition. Edited by: Omura T, Ishimura Y, FujiiKuriyama Y. 1993, Tokyo: Kodansha, 255-272.Google Scholar
- Kizawa H, Tomura D, Oda M, Fukamizu A, Hoshino T, Gotoh O, Yasui T, Shoun H: Nucleotide sequence of the unique nitrate/nitrite inducible cytochrome P450 cDNA from Fusarium oxysporum. J Biol Chem. 1991, 266: 10632-10637.PubMedGoogle Scholar
- Ruettinger RT, Wen LP, Fulco AJ: Coding nucleotide, 5' regulatory, and deduced amino acid sequences of P450BM3, a single peptide cytochrome P450: NADPH-P450 reductase from Bacillus megaterium. J Biol Chem. 1989, 264: 10987-10995.PubMedGoogle Scholar
- Degtyarenko KN: Structural domains of P450-containing monooxygenases systems. Protein Eng. 1995, 8: 737-747.PubMedView ArticleGoogle Scholar
- Nelson DR: Metazoan cytochrome P450 evolution. Comp Biochem Physiol. 1998, 121: 15-22.Google Scholar
- Nelson DR, Zeldin DC, Hoffman SM, Maltais LJ, Wain HM, Nebert DW: Comparison of cytochrome P450 (CYP) genes from the mouse and human genomes, including nomenclature recommendations for genes, pseudogenes and alternative splice variants. Pharmacogenetics. 2004, 14: 1-18. 10.1097/00008571-200401000-00001.PubMedView ArticleGoogle Scholar
- Nelson DR: Cytochrome P450 and the individuality of species. Arch Biochem Biophys. 1999, 369: 1-10. 10.1006/abbi.1999.1352.PubMedView ArticleGoogle Scholar
- Eriksson KE, Blanchette RA, Ander P: Biodegradation of Lignin. Microbial and enzymatic degradation of wood and wood components. Edited by: Timell TE. 1990, New York: Springer, 225-333.View ArticleGoogle Scholar
- Kirk TK, Farrell RL: Enzymatic "combustion": the microbial degradation of lignin. Annu Rev Microbiol. 1987, 41: 465-505. 10.1146/annurev.mi.41.100187.002341.PubMedView ArticleGoogle Scholar
- Orth AB, Tien M: Biotechnology of lignin degradation. The Mycota II Genetics and Biotechnology. Edited by: Kuck. 1995, Berlin: Springer-Verlag, 287-302.View ArticleGoogle Scholar
- Yadav JS, Soellner MB, Loper JC, Mishra PK: Tandem cytochrome P450 monooxygenase genes and splice variants in the white rot fungus Phanerochaete chrysosporium : cloning, sequence analysis, and regulation of differential expression. Fungal Genet Biol. 2003, 38: 10-21. 10.1016/S1087-1845(02)00508-X.PubMedView ArticleGoogle Scholar
- Martinez D, Larrondo LF, Putnam N, Gelpke MD, Huang K, Chapman J, Helfenbein KG, Ramaiya P, Detter JC, Larimer F, Coutinho PM, Henrissat B, Berka R, Cullen D, Rokhsar D: Genome sequence of the lignocellulose degrading fungus Phanerochaete chrysosporium strain RP78. Nature Biotechnol. 2004, 22: 695-700. 10.1038/nbt967.View ArticleGoogle Scholar
- Doddapaneni H, Yadav JS: Differential regulation and xenobiotic induction of tandem P450 monooxygenase genes pc-1 (CYP63A1) and pc-2 (CYP63A2) in the white rot fungus Phanerochaete chrysosporium. Appl Microb Biotechnol. 2004, 65: 559-565.View ArticleGoogle Scholar
- Yadav JS, Doddapaneni H: Genome-wide expression profiling and xenobiotic inducibility of P450 monooxygenase genes in the white rot fungus Phanerochaete chrysosporium . proceedings of the 13th Internat. Conf. On Cytochromes P450 Biochemistry, Biophysics And Drug Metabolism: June 29-July 3, Prague (Czech Republic). Edited by: Anzenbacher P, Hudecek J. 2003, Monduzzi Editore, Bologna, Italy, 333-340.Google Scholar
- Yadav JS, Loper JC: Cytochrome P450 oxidoreductase gene and its differentially terminated cDNAs from the white rot fungus Phanerochaete chrysosporium. Curr Genet. 2000, 37: 65-73. 10.1007/s002940050010.PubMedView ArticleGoogle Scholar
- Doddapaneni H, Subramanian V, Yadav JS: Physiological regulation, xenobiotic induction and heterologous expression of P450 monooxygenase gene pc-3, a new member of the CYP63 gene cluster in the white rot fungus Phanerochaete chrysosporium . Curr Microbiol. 2005, DOI: 10.1007/s00284-005-4480-2Google Scholar
- Kitazume T, Takaya N, Nakayama N, Shoun H: Fusarium oxysporum fatty-acid subterminal hydroxylase (CYP505) is a membrane-bound eukaryotic counterpart of Bacillus megaterium cytochrome P450BM3. J Biol Chem. 2000, 275: 39734-39740. 10.1074/jbc.M005617200.PubMedView ArticleGoogle Scholar
- Ohkuma M, Muraoka S, Tanimoto T, Fujii M, Ohta A, Takagi M: CYP52 (cytochrome P450alk) multigene family in Candida maltosa : identification and characterization of eight members. DNA Cell Biol. 1995, 14: 163-173.PubMedView ArticleGoogle Scholar
- Tudzynski P, Holter K, Correia T, Arntz C, Grammel N, Keller U: Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol Gen Genet. 1999, 261: 133-141. 10.1007/s004380050950.PubMedView ArticleGoogle Scholar
- Ahn JH, Walton JD: Chromosomal organization of TOX2, the complex locus controlling host-selective toxin biosynthesis in Cochliobolus carbonum. Plant Cell. 1996, 8: 887-897. 10.1105/tpc.8.5.887.PubMedPubMed CentralView ArticleGoogle Scholar
- Brown DW, Yu JH, Kelkar HS, Fernandes M, Nesbitt TC, Keller NP, Adams TH, Leonard TJ: Twenty-five coregulated transcripts define a sterigmatocystin gene cluster in Aspergillus nidulans. Proc Natl Acad Sci USA. 1996, 93: 1418-1422. 10.1073/pnas.93.4.1418.PubMedPubMed CentralView ArticleGoogle Scholar
- Keller NP, Hohn TM: Metabolic pathway gene clusters in filamentous fungi. Fung Genet Biol. 1997, 21: 17-29. 10.1006/fgbi.1997.0970.View ArticleGoogle Scholar
- Watson JD: Horizontal gene transfer and the evolution of secondary metabolite gene clusters in fungi: An hypothesis. Fung Genet Biol. 2000, 30: 167-171. 10.1006/fgbi.2000.1224.View ArticleGoogle Scholar
- Black DL: Protein diversity from alternative splicing: a challenge for bioinformatics and post-genome biology. Cell. 2000, 103: 367-370. 10.1016/S0092-8674(00)00128-8.PubMedView ArticleGoogle Scholar
- Leipzig J, Pevzner P, Heber S: The Alternative Splicing Gallery (ASG): bridging the gap between genome and transcriptome. Nucleic Acids Res. 2004, 32: 3977-3983. 10.1093/nar/gkh731.PubMedPubMed CentralView ArticleGoogle Scholar
- Larrondo LF, Gonzalez B, Cullen D, Vicuna R: Characterization of a multicopper oxidase gene cluster in Phanerochaete chrysosporium and evidence of altered splicing of the mco transcripts. Microbiology. 2004, 150: 2775-2783. 10.1099/mic.0.27072-0.PubMedView ArticleGoogle Scholar
- Nelson DR, Schuler MA, Paquette SM, Werck-Reichhart D, Bak S: Comparative genomics of rice and Arabidopsis. Analysis of 727 cytochrome p450 genes and pseudogenes from a monocot and a dicot. Plant Physiol. 2004, 135: 756-772. 10.1104/pp.104.039826.PubMedPubMed CentralView ArticleGoogle Scholar
- Joseph-Horn T, Hollomon DW: Molecular mechanisms of azole resistance in fungi. FEMS Microbiol Lett. 1997, 149: 141-149. 10.1016/S0378-1097(97)00043-8.View ArticleGoogle Scholar
- Skaggs BA, Alexander JF, Pierson CA, Schweitzer KS, Chun KT, Koegel C, Barbuch R, Bard M: Cloning and characterization of the Saccharomyces cerevisiae C-22 sterol desaturase gene, encoding a second cytochrome P-450 involved in ergosterol biosynthesis. Gene. 1996, 169: 105-109. 10.1016/0378-1119(95)00770-9.PubMedView ArticleGoogle Scholar
- Sanglard D, Loper JC: Characterization of the alkane-inducible cytochrome P450 (P450alk) gene from the yeast Candida tropicalis : identification of a new P450 gene family. Gene. 1989, 76: 121-136. 10.1016/0378-1119(89)90014-0.PubMedView ArticleGoogle Scholar
- Zimmer T, Iida T, Schunck WH, Yoshida Y, Ohta A, Takagi M: Relation between evolutionary distance and enzymatic properties among the members of the CYP52A sub-family of Candida maltosa. Biochem Biophys Res Commun. 1998, 251: 244-247. 10.1006/bbrc.1998.9450.PubMedView ArticleGoogle Scholar
- Yadav JS, Loper JC: Multiple p450alk (cytochrome P450 alkane hydroxylase) genes from the halotolerant yeast Debaryomyces hansenii. Gene. 1999, 226: 139-46. 10.1016/S0378-1119(98)00579-4.PubMedView ArticleGoogle Scholar
- Iida T, Sumita T, Ohta A, Takagi M: The cytochrome P450ALK multigene family of an n-alkane-assimilating yeast, Yarrowia lipolytica : cloning and characterization of genes coding for new CYP52 family members. Yeast. 2000, 16: 1077-87. 10.1002/1097-0061(20000915)16:12<1077::AID-YEA601>3.0.CO;2-K.PubMedView ArticleGoogle Scholar
- Bhatnagar D, Ehrlich KC, Cleveland TE: Molecular genetic analysis and regulation of aflatoxin biosynthesis. Appl Microb Biotechnol. 2003, 61: 83-93.View ArticleGoogle Scholar
- Berbee ML, Taylor JW: Dating the evolutionary radiations of the true fungi. Canadian J Bot. 1993, 71: 1114-1127.View ArticleGoogle Scholar
- Gonzalez P, Barosso G, Labarere J: Molecular analysis of the split cox1 gene from the basidiomycota Agrocybe aegerita : relationship of its introns with homologous ascomycota introns and divergence levels from common ancestral copies. Gene. 1998, 220: 45-53. 10.1016/S0378-1119(98)00421-1.PubMedView ArticleGoogle Scholar
- Hohn TM, Desjardins AE, McCormick SP: The Tri4 gene of Fusarium sporotrichioides encodes a cytochrome P450 monooxygenase involved in trichothecene biosynthesis. Mol Gen Genet. 1995, 248: 95-102.PubMedView ArticleGoogle Scholar
- Hahn M, Mendgen K: Characterization of in planta-induced rust genes isolated from a haustorium-specific cDNA library. Mol Plant Microbe Interact. 1997, 10: 427-437.PubMedView ArticleGoogle Scholar
- Kumar S, Tamura K, Jakobsen IB, Nei M: MEGA2: Molecular Evolutionary Genetics Analysis software. Bioinformatics. 2001, 12: 1244-1245. 10.1093/bioinformatics/17.12.1244.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.