- Research article
- Open Access
Classification of fungal and bacterial lytic polysaccharide monooxygenases
© Busk and Lange; licensee BioMed Central. 2015
- Received: 4 November 2014
- Accepted: 29 April 2015
- Published: 9 May 2015
Lytic polysaccharide monooxygenases are important enzymes for the decomposition of recalcitrant biological macromolecules such as plant cell wall and chitin polymers. These enzymes were originally designated glycoside hydrolase family 61 and carbohydrate-binding module family 33 but are now classified as auxiliary activities 9, 10 and 11 in the CAZy database. To obtain a systematic analysis of the divergent families of lytic polysaccharide monooxygenases we used Peptide Pattern Recognition to divide 5396 protein sequences resembling enzymes from families AA9 (1828 proteins), AA10 (2799 proteins) and AA11 (769 proteins) into subfamilies.
The results showed that the lytic polysaccharide monooxygenases have two conserved regions identified by conserved peptides specific for each AA family. The peptides were used for in silico PCR discovery of the lytic polysaccharide monooxygenases in 79 fungal and 95 bacterial genomes. The bacterial genomes encoded 0 – 7 AA10s (average 0.6). No AA9 or AA11 were found in the bacteria. The fungal genomes encoded 0 – 40 AA9s (average 7) and 0 – 15 AA11s (average 2) and two of the fungi possessed a gene encoding a putative AA10. The AA9s were mainly found in plant cell wall-degrading asco- and basidiomycetes in agreement with the described role of AA9 enzymes. In contrast, the AA11 proteins were found in 36 of the 39 ascomycetes and in only two of the 32 basidiomycetes and their abundance did not correlate to the degradation of cellulose and hemicellulose.
These results provides an overview of the sequence characteristics and occurrence of the divergent AA9, AA10 and AA11 families and pave the way for systematic investigations of the of lytic polysaccharide monooxygenases and for structure-function studies of these enzymes.
- Lytic polysaccharide monooxygenases
- Sequence analysis
- Peptide pattern recognition
- Genomic annotation
Copper-dependent lytic polysaccharide monooxygenases (LPMOs) are important enzymes for degradation of biological macromolecules such as plant cell wall and chitin polymers [1-6]. The LPMOs are metalloenzymes that oxidize the glycosidic bonds in cellulose [7-9], hemicellulose  and chitin . The catalysis involves binding of an active oxygen molecule to the copper atom [5,7,8,10] and interaction of a large surface of the LPMO with several chains of the crystalline polysaccharide substrate [11-13]. It was hypothesized that LPMOs could only act on crystalline substrates but recently it was found that some LPMOs oxidize soluble short-chain polysaccharides hence expanding the known LPMO substrates to soluble polymers .
The LPMOs were originally classified as glycoside hydrolases (GHs) and carbohydrate-binding modules (CBMs) but are now placed in the auxiliary activity families AA9, AA10 and AA11 in the CAZy database .
Family AA9 include fungal enzymes formerly classified in the GH61 family. Their classification as GHs was based on the report that one of the GH61s has weak endocellulase activity . Furthermore, certain AA9 family members enhance enzymatic degradation of cellulose  and are important industrial enzymes for conversion of lignocellulotic biomass into soluble sugars.
The enzymatically characterized AA9 proteins oxidise the glycosidic bonds in cellulose [7-9] but less than ten of the more than 200 AA9 proteins in the CAZy database (www.cazy.org) have been investigated. Furthermore, based on the large sequence diversity of the AA9 proteins it has been suggested that some AA9s may oxidize other substrates than cellulose [12,16,17]. This hypothesis is supported by a recent report of an AA9 protein that degrades hemicellulose  and different substrates induce expression of different AA9 genes [18-20]. Moreover, some fungi possess more than 30 different AA9-encoding genes in their genomes , which seems to be an excessive number of LPMOs for degradation of just one substrate.
In this context it is interesting that some of the bacterial LPMOs in the AA10 family degrade chitin whereas other degrade cellulose [2,6,21,22]. In addition, the substrate for the only characterized enzymes of the other type of fungal LPMOs, the AA11 proteins, is chitin . The AA11 LPMOs were originally classified as a subfamily of the GH61 proteins  but later renamed as AA11. However, only a few of these proteins have been annotated and only a single one is functionally characterized .
Peptide Pattern Recognition (PPR) is a new approach for sequence analysis. It was used to classify 743 GH61-like proteins and pinpoint conserved amino acid residues in their sequences . Moreover, PPR was used to annotate all GHs and LPMOs in 40 fungal genomes . However, only LPMOs similar to the sequences found in CAZy were annotated. To further investigate the large number of fungal LPMOs of the AA9 and AA11 families and the high sequence diversity we have used PPR to divide 833 LPMOs found in CAZy and additional 4456 LPMO-like proteins found by BLAST search in GenBank into subfamilies. Each of the three families AA9, AA10 and AA11 were characterized by a non-overlapping set of conserved peptides that mapped to similar regions in the LPMO proteins. The conserved sequences were used to annotate the LPMOs in 79 fungal and 95 bacterial genomes. AA9 and AA11 were exclusively found in fungal genomes whereas AA10s were found in bacteria and a single AA10-encoding gene was found in each of two fungi. Furthermore, AA9s were more abundant in plant cell wall-degrading fungi in agreement with that the substrates for AA9 enzymes are cellulose and hemicellulose. In contrast, AA11-encoding genes were found in all the ascomycetes examined, except for three saccharomycetes with different life styles, but only in a few of the white and brown rot basidiomycetes and the number of genes did not correlate to the degradation of cellulose and hemicellulose.
Division of large families of CAZyme proteins like the GH13 and GH5 into subfamilies has proven to be an important tool for characterization of the families [26,27]. To obtain subfamilies of the LPMOs 248 AA9, 537 AA10 and 48 AA11 unique protein sequences were downloaded from GenBank. Furthermore, 36 GH61 subfamily 7 proteins  were pooled with the AA11 sequences yielding a total of 80 unique AA11/GH61 subfamily 7 sequences after removal of duplicates. PPR analysis of the proteins after removal of duplicates and of irrelevant protein domains generated 12 subfamilies of AA9 proteins, 20 subfamilies of AA10 proteins and 3 subfamilies of AA11 proteins (Additional file 1).
Due to the careful curation of sequences in CAZy [28,29] this database may not contain all sequences relevant for sequence analysis of the LPMOs . Therefore, we used BLASTp search to find the 1000 sequences with the highest sequence similarity to the sequence with highest score in each of the LPMO subfamilies. These sequences were curated by removal of duplicates and irrelevant proteins domains and all sequences recognized as AA9 by PPR from the list of AA11-like sequences and vice versa (see Methods). A procedure of removing irrelevant protein domains rather than relying on positive identification of LPMO-relevant domains by CDD search was used to include putative LPMOs that might not contain a domain recognized by the CDD (see Methods, Sequence Analysis). Although this approach increases the risk of including non-LPMO proteins in the analysis we have previously found that PPR analysis separates unrelated proteins into different subfamilies [24,30]. Hence, these subfamilies may be removed after PPR analysis by manual curation.
The expanded LPMO families consisted of 1798 AA9-like proteins (AA9exp), 2799 AA10-like proteins (AA10exp) and 692 AA11-like proteins (AA11exp), respectively. This represents an additional 1550 AA9-like proteins, 2262 AA10-like proteins and 612 AA11-like proteins.
We have previously shown that the conserved peptides identified by PPR in LPMOs are useful for design of degenerate primers for PCR amplification of related genes . Moreover, by searching in proteins sequences for homology to peptide patterns (implemented as Hotpep), it can be investigated whether a protein sequence contains a sufficient number of conserved peptides from a specific PPR-generated LPMO subfamily to be identified as an LPMO and member of this subfamily . This is a kind of in silico PCR with degenerated primers.
Percent of LPMO-encoding genes annotated with the conserved peptides
To investigate the potential of different fungi for expression of LPMOs we used Hotpep  to mine 79 fungal genomes for LPMO-encoding genes. This was done by dividing the contigs of each genome into 2000 nucleotides long fragments and search for conserved peptides from each subfamily of the AA9exp, AA10exp and AA11exp families in all six reading frames. All reading frames that satisfied the threshold conditions (see Methods) were considered as hits.
For 39 of the fungal genomes the LPMOs have previously been annotated based on similarity to the LPMOs found in CAZy . This analysis identified 284 LPMOs in the 39 genomes. However, by using the expanded LPMO families, a total of 373 LPMOs were found in the same 39 genomes.
Average number of LPMO-encoding genes in 79 fungal and 95 bacterial genomes
Cell-wall degrading fungi
Six AA10-encoding genes were predicted in the 79 fungal genomes but upon closer examination by BLASTp search of the encoded amino acid sequenced it was found that only two of these genes from the litter-decomposing basidiomycete Galerina marginata and the smut Ustilago maydis, which is a maize pathogen, encode putative AA10 proteins. The closest BLAST hit to the gene from G. marginata was a bacterial gene from Streptomyces rimosus (E value = 3×10−51) whereas the closest hits to the gene from U. maydis were seven genes from Basidiomycota (Pseudozyma hubeiensis, Ustilago hordei, Sporisorium reilianum, Pseudozyma antarctica, Pseudozyma aphidis, Pseudozyma flocculosa and Pseudozyma brasiliensis (E values = 8×10−128 - 2×10−68) but also genes from Streptomyces sp. were closely related (E value = 4×10−55).
To investigate whether the AA9 and AA11 are found in bacteria we used Hotpep to mine 95 bacterial genomes for LPMOs. However, no AA9- or AA11-encoding genes were found in the bacteria, which had from 0 to 7 AA10-encoding genes (Table 2 and Additional file 5). The finding of 545 AA9s, 156 AA11s and only 2 AA10s in the 79 fungal genomes and 61 AA10s but no AA9s or AA11s in the bacterial genomes supports the notion that AA9 and AA11 are eukaryotic LPMOs whereas AA10 are bacterial.
The reclassification of the LPMOs as auxiliary activities based on their enzymatic and structural similarities is an important step towards a systematic analysis of these enzymes and their substrate preferences . However, each of the three families of LPMOs contains a large number of very divergent sequences and only a few of these have been enzymatically characterized. Hence, the further division of the LPMOs into PPR-generated subfamilies of proteins with related sequences provides a short cut to a systematic characterization of the LPMOs. To obtain as comprehensive sequence information about the LPMOs as possible we included not only the AA9, AA10 and AA11 sequences listed in the CAZy database in the PPR analysis but also a wide selection of LPMO-like sequences found by similarity search in GenBank. We have previously used this approach to analyze the AA9 proteins, which lead to the discovery of the AA11 proteins that were classified as GH61 subfamily 7 . In the present work the expanded LPMO families were divided into 47 AA9, 31 AA10 and 11 AA11 subfamilies compared to only 12 AA9, 20 AA10 and 3 AA11 subfamilies when only LPMO sequences from the CAZy database were included. Thus, the expansion leads to a more comprehensive characterization of the families and is easily handled by the PPR algorithm. One drawback is that a number of proteins that are not true LPMOs were included in the expanded sequence pools. This complication was reduced by removing most domains not relevant for the LPMO catalytic activity from the sequences before PPR analysis. Furthermore, after PPR analysis each subfamily was examined by CDD and BLAST search [42,43] and inspected manually to remove all subfamilies that do not include LPMOs. We have previously found that grouping proteins with PPR leads to exclusion of unrelated proteins [24,30]. Thus, any non-LPMO sequence that might have sneaked in to the sequence pool was excluded from the subfamilies of LPMOs. This approach made it possible to annotate 31% more LPMOs based on the expanded LPMO lists in the 39 fungal genomes that have previously been investigated and annotated based only on the LPMOs found in CAZy .
For GH families including proteins with different functions PPR divides the proteins into subfamilies that correlate largely to the functions of the proteins . An important purpose of the present analysis was to correlate sequence information for the LPMOs to functional data. As the enzymatically characterized AA9 proteins were classified to only six of the 47 subfamilies there is a host of AA9s with very different sequences that remain to be characterized. Comparison of AA9s from plant cell wall degrading asco- and basidiomycetes to AA9s in non-degrading asco- and basidiomycetes showed which of these subfamilies may be associated to degradation of lignocellulose. However, LPMOs with high ability to enhance lignocellulose degradation may belong to different subfamilies as exemplified by the T. aurantiacus LPMO and the Thielavia terrestris LPMO  classified in subfamily 1 and subfamily 2 that were very distant in the cluster analysis of the AA9 subfamilies. The catalytic activities of the LPMOs in subfamily 1 have been reported as endoglucanase for the enzyme from Trichoderma reesei  and monooxygenase yielding C1- and C4 oxidized products for the enzyme from Podospora anserina . Three AA9 enzymes from Neurospora crassa fall into the closely related subfamilies 3 and 4 but represent different catalytic mechanisms as the two enzymes in subfamily 3 are C1-oxidizing and C4-oxidizing and the enzyme in subfamily 4 is both C1- and C4-oxidizing [5,7,10,45]. Likewise, the enzymes in the related subfamilies 2 and 5 are a C1-oxidising LPMO from N. crassa and a C1- and C4-oxidizing LPMO from Phanerochaete chrysosporium [5,7,9,13,45]. Hence, there does not seem to be a clear correlation between the catalytic mechanisms of the AA9 LPMOs and their subfamily annotation. The classification of a hemicellulose-oxidizing LPMO  into subfamily 3 together with a cellulose-oxidizing LPMO [5,7] does not point to a clear division of the LPMOs based on substrate recognition. However, there is limited knowledge about the substrate specificity of the LPMOs. Hence, more LPMOs need to be characterized to reach a definite conclusion. Interestingly, many genomes of fungi degrading plant cell wall material possess several genes encoding betaglucosidases, endoglucanases and AA9-type LPMOs indicating that degradation of different, complex substrates may require different enzyme with the same activity . This notion points to that a comprehensive understanding of the properties of different LPMOs requires studies on complex natural substrates.
The AA10 family proteins have been described to oxidize two substrates, chitin and cellulose, that are built from different monomer carbohydrates. Interestingly, the cellulose- oxidizing AA10s [2,11,22] are found in the closely related subfamilies 6, 9 and 20 that are far from the subfamilies 1, 13 and 36 that contain only chitin-oxidizing AA10s [6,11,21,46,47]. Although the characterized AA10 protein from Thermobifida fusca in subfamily 9 is able to oxidize both cellulose and chitin  this separation of the LPMOs in the AA10 family according to substrate indicates that the subfamily classification may be used to predict the putative substrate of uncharacterized LPMOs. However, a confirmation of this possibility will await biochemical characterization of more LPMOs.
In contrast to the AA9 LPMOs, the number of AA11-encoding genes did not correlate to the degradation of cellulose and hemicellulose but rather to the fungal taxonomy as these genes were found in 36 of the 39 ascomycetes and in only four of the 32 basidiomycetes: The non-cellulose-degrading Cryptococcus neoformans and Tremella mesenterica, the white rot fungus Ceriporiopsis subvermispora and the white rot-causing fungus S. commune. The putative amino acid sequences encoded by the genes from C. neoformans and T. mesenterica share 83 of 150 amino acids and 76 of 160 amino acids, respectively, with a gene from the dermatophyte Blastomyces dermatitidis. The seven genes from S. commune are closely related in groups of two and five genes and appear to originate from two different genes. The sequence from C. vermispora is only 100 amino acids long and does not have any significant hits in BLASTp search. Interestingly, the comparison of the occurrence of AA11 subfamilies showed that dermatophytes have more than three times as many AA11 subfamily 1 encoding-genes than other ascomycetes. This result suggests that skin may contain a substrate for AA11 subfamily 1. Dermatophyte invasion in immunocompetent individuals is usually restricted to the keratinized, non-living material and involves the expression of a host of keratinases [38,48,49]. It is likely that the AA11 takes part in this degradation and may attack keratin or another macromolecule of the extracellular matrix. The only enzymatically characterized AA11 protein from A. oryzae is a chitinase , which is classified in subfamily 1. A number of genes encoding proteins with a chitin-binding LysM domain have been identified in the genomes of dermatophytes . The dermatophyte genomes do also contain chitinase-encoding genes and it is likely that some of their LysM-domain containing proteins are involved in chitin degradation . However, some of the LysM-encoding genes of T. rubrum are specifically expressed when this dermatophyte grows on keratin thus suggesting a link between degradation of keratin and apparent chitin-binding proteins . In analogy with the LysM-containing proteins the AA11 proteins from subfamily 1 may be used by the dermatophytes for growth on chitin but it is also possible that they have a role in growth on keratin.
In the context of industrial use of LPMOs for degradation of plant cell wall materials and productions of biofuels such as ethanol [1,3] the present results indicate the search for new LPMO enzymes should focus on the AA9 family and possibly on the AA10 subfamilies that contain cellulose-degrading enzymes and on related subfamilies.
The two basidiomycetes C. neoformans and S. commune that posses AA11-encoding genes are known to cause human infections [51-53] and it would be interesting to investigate whether their AA11 genes are induced during infection. On the other hand, no reports link the basidiomycete T. mesenterica that has an AA11-encoding gene, to human infections but this fungus is a fungal parasite  and could use the AA11 protein for degradation of the chitin-containing cell wall of its host.
Overall, the genome analysis is consistent with that AA9 and AA11 are eukaryotic LPMOs whereas AA10 are bacterial. Nevertheless, the genomes of the litter-decomposing basidiomycete G. marginata and the smut U. maydis encoded AA10-like proteins. These genes were closely related to bacterial LPMOs suggesting that G. marginata and U. maydis acquired their AA10 genes through horizontal gene transfer . For example, in its yeast form U. maydis interacts with bacteria in the phyllosphere, which contains a large variety of bacteria and fungi [56,57]. The considerably larger number of LPMOs that were generally found in the fungal genomes compared to bacterial genomes is consistent with the notion that eukaryotic genomes are larger, often redundant and contains more genes than eubacterial genomes .
Comparison of the distribution of the conserved peptides on the LPMO sequences indicated two conserved regions in accordance with previous findings for the GH61 family . One of the conserved regions is consistent with the substrate binding site of the LPMOs and the conserved surface residues in each subfamily have been identified and may be involved in recognition of different substrates . In the present work we found that also the AA10 family had a conserved region around amino acids 180 - 200 but a slightly different structure of conserved peptides towards the N-terminal than the AA9 and AA11 families.
We have previously shown that PPR can be used for dividing 743 AA9 and AA11 proteins into subfamilies . In the present work PPR demonstrated its ability to subdivide 5396 LPMOs. This subdivision could be used to annotate the LPMOs in fungal and bacterial genomes. It is also possible to annotate CAZymes by the use of a gene prediction algorithm followed by mapping of conserved domains  or directly through the CAZy database . However, by using PPR the training set can be expanded beyond the sequences found in CAZy, which is a useful feature when an exhaustive analysis is required. This feature is also important for analysis of protein families that are not classified in high-quality databases such as CAZy.
The LPMOs are important enzymes for microbiological degradation of cellulose, chitin and related poly- and oligoshaccharides [1-6]. Nevertheless, only a few of these enzymes have been characterized. The classification of the LPMOs into the three protein families AA9, AA10 and AA11 is an important step towards understanding the divergence of this large group of very divergent sequences . In the present work we have collected the LPMOs and available LPMO-like sequences from public databases and arranged them into subfamilies. This analysis showed that the functionally characterized LPMOs do only represent a small part of the natural sequence variation of these enzymes. Moreover, it pinpointed the subfamilies of LPMOs that have not been functionally characterized and thus have the highest probability of having different properties than the already described LPMOs. In this context it is relevant that the AA10 enzymes that oxidize cellulose belong to three closely related subfamilies. Moreover, some AA11 subfamilies were mainly found in dermatophytic ascomycetes pointing to a putative function of these enzymes in keratinolysis. In contrast to the substrate preference of the AA10 enzymes there does not seem to be a correlation between the subfamilies and a C1- or C4-oxidation as catalytic mechanism.
Genome mining of the LPMOs in fungi showed that the AA9 LPMOs were found mainly in the genomes of asco- and basidiomycetes capable of degrading plant biomass whereas genes for AA11 LPMOs were present in the genomes of ascomycetes and in only few of the basidiomycetes.
Genome sequences were downloaded from GenBank as indicated (Additional file 3). The classification of the fungi as lignocellulose-degraders was based on description of the fungi as degraders of complex plant cell wall material [33,34]. Hence, brown and white rot basidiomycota and saprophytic ascomycetes able to live on cell walls from dead plants are designated as cellulose-degraders [32-35,37]. On the other hand, Zygomycota such as Thermomucor indicae-seudaticae were classified as non-degraders based on their classification as fungi growing on easily accessible substrates [59,60].
Amino acid sequences of all AA9 and AA10 proteins in CAZy  were downloaded from GenBank in June 2013. Likewise, AA11 protein sequences  were downloaded in January, 2014 and pooled with the GH61 subfamily 7 , which includes the same type of proteins as AA11.
Protein domains were mapped in the sequences by CDD search  and domains clearly not related to the LPMO enzymatic function, and not overlapping with a relevant domain, were deleted from the sequences. Relevant domains included all domains designated as “Chitin_bind_3”, "Chitin_bind_3 superfamily", "ChtBD1 superfamily", "ChtBD1_1", "Cu_bind_like", "Cu_bind_like superfamily", "Glyco_hydro_61" or "Glyco_hydro_61 superfamily". For proteins belonging to the AA10exp sequence pool domains designated as "COG3397", "COG3979", "Cellulase", "Cellulase superfamily", "ChiA1_BD", "ChiC_BD", "ChtBD3 superfamily", "Collagen", "FN3", "FN3 superfamily", "GH18_chitinase", "GH18_chitinase-like superfamily", "Glyco_18", "PHA03325 superfamily", "PKD", "PKD superfamily" or "PRK13211" were also kept. All sequences that were shorter than 51 amino acids after deletion of unrelated domains were removed.
The curated protein families were analyzed by PPR as previously described for the GH13 and GH61 protein families . The length of the conserved peptides (hexamers), the number of conserved peptides per protein (10) and the total number of conserved peptides per group (70) were chosen as they were the conditions that gave the best rate of prediction of protein function in empirical testing of peptide lengths from trimers to decamers, 5 – 40 conserved peptides per protein and 30 – 200 conserved peptides per group .
To generate expanded protein families the top hit in each PPR-generated subfamily of AA9, AA10 and AA11 was used for BLAST search  in GenBank. For each of the three AA families, the 1000 best hits of each search were pooled and duplicates were removed. Next, protein domains were mapped in the sequences by CDD search  and domains clearly not related to the LPMO enzymatic function were deleted. All sequences shorter than 51 amino acids after deletion of unrelated domains were removed. Furthermore, all AA9-like proteins that were classified as AA11 by PPR search were removed from the AA9 expanded family. Likewise, all AA11-like proteins that were classified as AA9 by PPR search were removed from the AA11 expanded family where after the expanded protein families for AA9, AA10 and AA11 (Additional file 6) were analyzed by PPR as described above.
Finally, the proteins in each subfamily were analyzed by CDD search and manually curated to remove subfamilies that did not include LPMO-like sequences.
Gene annotation in fungal genomes by finding homology to peptide patterns (Hotpep)
Hotpep analysis was performed as previously described . The procedure can be described by the following steps:
Finding all the conserved peptides from the list that were present in the reading frame.
Sum the frequency of these peptides to obtain the subfamily-specific frequency score.
Included three or more conserved peptides from a subfamily.
The frequency score for the peptides was higher than 1.0
The conserved peptides represented at least ten amino acids of the sequence. (Three hexapeptides with maximal overlap represent 8 amino acids of a sequence whereas three non-overlapping hexapeptides represent 18 (no overlap) amino acids of a sequence).
If a sequence fragment satisfied all three conditions it was assigned to the AA family and to the PPR subfamily with the highest subfamily-specific frequency score as previously described . If two fragments were assigned to the same AA family and the distance between them in the original genome sequence was less than 5800 bases, the fragments were considered to be part of the same gene and counted as one hit.
Distribution of hexapeptides in the proteins
The position of a conserved hexapeptide was defined as the median of the position in all the protein sequences that contained the hexapeptide sequence. Using the median instead of the average position compensates for the different length of the LPMO sequences and for the occurrence of truncated proteins.
Conserved peptides on the 3D structures were visualized with Cn3D .
Gene annotation in bacterial genomes by finding homology to peptide patterns (Hotpep)
Bacterial genomes were translated in all six reading frames and all open reading frames longer than 50 amino acids were given a score for each subfamily-specific peptide lists for each AA family and significant hits were found as described for gene annotation in fungal genomes.
Each hit was considered as independent of neighboring hits.
Genome and sequence comparisons
R package version 3.0.1 was used for Ward hierarchical clustering. BLAST search  for sequence alignment was done by standard methods and conserved domains were identified in the CDD database at NCBI .
Student’s T-test was used for comparisons of data sets as indicated.
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files.
This work was supported by The Danish Council for Strategic Research projects 09-065251 (Bio4Bio), 858028 (Keratin2Protein) and by Novozymes A/S.
- Agger JW, Isaksen T, Várnai A, Vidal-Melgosa S, Willats WGT, Ludwig R, et al. Discovery of LPMO activity on hemicelluloses shows the importance of oxidative processes in plant cell wall degradation. Proc Natl Acad Sci U S A. 2014;111:6287–92.View ArticlePubMed CentralPubMedGoogle Scholar
- Forsberg Z, Vaaje-Kolstad G, Westereng B, Bunæs AC, Stenstrøm Y, MacKenzie A, et al. Cleavage of cellulose by a CBM33 protein. Protein Sci. 2011;20:1479–83.View ArticlePubMed CentralPubMedGoogle Scholar
- Harris PV, Welner D, McFarland KC, Re E, Navarro Poulsen J-C, Brown K, et al. Stimulation of lignocellulosic biomass hydrolysis by proteins of glycoside hydrolase family 61: structure and function of a large, enigmatic family. Biochemistry. 2010;49:3305–16.View ArticlePubMedGoogle Scholar
- Langston JA, Shaghasi T, Abbate E, Xu F, Vlasenko E, Sweeney MD. Oxidoreductive cellulose depolymerization by the enzymes cellobiose dehydrogenase and glycoside hydrolase 61. Appl Environ Microbiol. 2011;77:7007–15.View ArticlePubMed CentralPubMedGoogle Scholar
- Phillips CM, Beeson WT, Cate JH, Marletta MA. Cellobiose dehydrogenase and a copper-dependent polysaccharide monooxygenase potentiate cellulose degradation by Neurospora crassa. ACS Chem Biol. 2011;6:1399–406.View ArticlePubMedGoogle Scholar
- Vaaje-Kolstad G, Westereng B, Horn SJ, Liu Z, Zhai H, Sørlie M, et al. An oxidative enzyme boosting the enzymatic conversion of recalcitrant polysaccharides. Science. 2010;330:219–22.View ArticlePubMedGoogle Scholar
- Beeson WT, Phillips CM, Cate JH, Marletta MA. Oxidative cleavage of cellulose by fungal copper-dependent polysaccharide monooxygenases. J Am Chem Soc. 2011;134:890–2.View ArticlePubMedGoogle Scholar
- Quinlan RJ, Sweeney MD, Lo Leggio L, Otten H, Poulsen J-CN, Johansen KS, et al. Insights into the oxidative degradation of cellulose by a copper metalloenzyme that exploits biomass components. Proc Natl Acad Sci U S A. 2011;108:15079–84.View ArticlePubMed CentralPubMedGoogle Scholar
- Westereng B, Ishida T, Vaaje-Kolstad G, Wu M, Eijsink VGH, Igarashi K, et al. The putative endoglucanase PcGH61D from Phanerochaete chrysosporium is a metal-dependent oxidative enzyme that cleaves cellulose. PLoS One. 2011;6:e27807.View ArticlePubMed CentralPubMedGoogle Scholar
- Isaksen T, Westereng B, Aachmann FL, Agger JW, Kracher D, Kittl R, et al. A C4-oxidizing lytic polysaccharide monooxygenase cleaving both cellulose and cello-oligosaccharides. J Biol Chem. 2014;289:2632–42.View ArticlePubMed CentralPubMedGoogle Scholar
- Aachmann FL, Sørlie M, Skjåk-Bræk G, Eijsink VGH, Vaaje-Kolstad G. NMR structure of a lytic polysaccharide monooxygenase provides insight into copper binding, protein dynamics, and substrate interactions. Proc Natl Acad Sci U S A. 2012;109:18779–84.View ArticlePubMed CentralPubMedGoogle Scholar
- Li X, Beeson 4th WT, Phillips CM, Marletta MA, Cate JHD. Structural basis for substrate targeting and catalysis by fungal polysaccharide monooxygenases. Structure. 2012;20:1051–61.View ArticlePubMed CentralPubMedGoogle Scholar
- Wu M, Beckham GT, Larsson AM, Ishida T, Kim S, Payne CM, et al. Crystal structure and computational characterization of the lytic polysaccharide monooxygenase GH61D from the basidiomycota fungus Phanerochaete chrysosporium. J Biol Chem. 2013;288:12828–39.View ArticlePubMed CentralPubMedGoogle Scholar
- Levasseur A, Drula E, Lombard V, Coutinho PM, Henrissat B. Expansion of the enzymatic repertoire of the CAZy database to integrate auxiliary redox enzymes. Biotechnol Biofuels. 2013;6:41.View ArticlePubMed CentralPubMedGoogle Scholar
- Karlsson J, Saloheimo M, Siika-Aho M, Tenkanen M, Penttilä M, Tjerneld F. Homologous expression and characterization of Cel61A (EG IV) of Trichoderma reesei. Eur J Biochem. 2001;268:6498–507.View ArticlePubMedGoogle Scholar
- Horn SJ, Vaaje-Kolstad G, Westereng B, Eijsink VG. Novel enzymes for the degradation of cellulose. Biotechnol Biofuels. 2012;5:45.View ArticlePubMed CentralPubMedGoogle Scholar
- Leggio LL, Welner D, De Maria L. A structural overview of GH61 proteins - fungal cellulose degrading polysaccharide monooxygenases. Comput Struct Biotechnol J. 2012;2:e201209019.PubMed CentralPubMedGoogle Scholar
- Hori C, Igarashi K, Katayama A, Samejima M. Effects of xylan and starch on secretome of the basidiomycete Phanerochaete chrysosporium grown on cellulose. FEMS Microbiol Lett. 2011;321:14–23.View ArticlePubMedGoogle Scholar
- Ray A, Saykhedkar S, Ayoubi-Canaan P, Hartson SD, Prade R, Mort AJ. Phanerochaete chrysosporium produces a diverse array of extracellular enzymes when grown on sorghum. Appl Microbiol Biotechnol. 2012;93:2075–89.View ArticlePubMedGoogle Scholar
- Yakovlev I, Vaaje-Kolstad G, Hietala AM, Stefańczyk E, Solheim H, Fossdal CG. Substrate-specific transcription of the enigmatic GH61 family of the pathogenic white-rot fungus Heterobasidion irregulare during growth on lignocellulose. Appl Microbiol Biotechnol. 2012;95:979–90.View ArticlePubMed CentralPubMedGoogle Scholar
- Forsberg Z, Røhr AK, Mekasha S, Andersson KK, Eijsink VGH, Vaaje-Kolstad G, et al. Comparative study of two chitin-active and two cellulose-active AA10-type lytic polysaccharide monooxygenases. Biochemistry. 2014;53:1647–56.View ArticlePubMedGoogle Scholar
- Forsberg Z, Mackenzie AK, Sørlie M, Røhr AK, Helland R, Arvai AS, et al. Structural and functional characterization of a conserved pair of bacterial cellulose-oxidizing lytic polysaccharide monooxygenases. Proc Natl Acad Sci U S A. 2014;111:8446–51.View ArticlePubMed CentralPubMedGoogle Scholar
- Hemsworth GR, Henrissat B, Davies GJ, Walton PH. Discovery and characterization of a new family of lytic polysaccharide monooxygenases. Nat Chem Biol. 2014;10:122–6.View ArticlePubMed CentralPubMedGoogle Scholar
- Busk PK, Lange L. Function-based classification of carbohydrate-active enzymes by recognition of short, conserved peptide motifs. Appl Environ Microbiol. 2013;79:3380–91.View ArticlePubMed CentralPubMedGoogle Scholar
- Busk PK, Lange M, Pilgaard B, Lange L. Several genes encoding enzymes with the same activity are necessary for aerobic fungal degradation of cellulose in nature. PLoS One. 2014;9:e114138.View ArticlePubMed CentralPubMedGoogle Scholar
- Stam MR, Danchin EGJ, Rancurel C, Coutinho PM, Henrissat B. Dividing the large glycoside hydrolase family 13 into subfamilies: towards improved functional annotations of alpha-amylase-related proteins. Protein Eng Des Sel. 2006;19:555–62.View ArticlePubMedGoogle Scholar
- Aspeborg H, Coutinho PM, Wang Y, Brumer 3rd H, Henrissat B. Evolution, substrate specificity and subfamily classification of glycoside hydrolase family 5 (GH5). BMC Evol Biol. 2012;12:186.View ArticlePubMed CentralPubMedGoogle Scholar
- Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B. The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009;37(Database issue):D233–8.View ArticlePubMed CentralPubMedGoogle Scholar
- Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42(Database issue):D490–5.View ArticlePubMed CentralPubMedGoogle Scholar
- Busk PK, Lange L. A Novel Method of Providing a Library of N-Mers or Biopolymers. 2012. [WO2012101151].Google Scholar
- Hemsworth GR, Taylor EJ, Kim RQ, Gregory RC, Lewis SJ, Turkenburg JP, et al. The copper active site of CBM33 polysaccharide oxygenases. J Am Chem Soc. 2013;135:6069–77.View ArticlePubMed CentralPubMedGoogle Scholar
- Floudas D, Binder M, Riley R, Barry K, Blanchette RA, Henrissat B, et al. The Paleozoic origin of enzymatic lignin decomposition reconstructed from 31 fungal genomes. Science. 2012;336:1715–9.View ArticlePubMedGoogle Scholar
- Kuhad RC, Kuhar S, Kapoor M, Sharma KK, Singh A. Lignocellulolytic microorganisms, their enzymes and possible biotechnologies based on lignocellulolytic microorganisms and their enzymes. In Lignocellulose Biotechnology: Future Prospects. I. K. International Pvt Ltd; 2007. p. 3–22.Google Scholar
- Moore-Landecker E. Fungi as saprophytes. In Fundamentals of the fungi. Prentice-Hall; New Jersey, USA. 1972. p. 291–325.Google Scholar
- Riley R, Salamov AA, Brown DW, Nagy LG, Floudas D, Held BW, et al. Extensive sampling of basidiomycete genomes demonstrates inadequacy of the white-rot/brown-rot paradigm for wood decay fungi. Proc Natl Acad Sci USA. 2014;111:9923-9928.Google Scholar
- Ainsworth GC, Bisby GR, Hawksworth DL. Ainsworth & Bisby’s dictionary of the fungi. 8th revised edition edition. Wallingford, Oxon, UK: CABI Publishing; 1995.Google Scholar
- Glass NL, Schmoll M, Cate JHD, Coradetti S. Plant cell wall deconstruction by ascomycete fungi. Annu Rev Microbiol. 2013;67:477–98.View ArticlePubMedGoogle Scholar
- Baldo A, Monod M, Mathy A, Cambier L, Bagut ET, Defaweux V, et al. Mechanisms of skin adherence and invasion by dermatophytes. Mycoses. 2012;55:218–23.View ArticlePubMedGoogle Scholar
- Ng KP, Ngeow YF, Yew SM, Hassan H, Soo-Hoo TS, Na SL, et al. Draft genome sequence of Daldinia eschscholzii isolated from blood culture. Eukaryot Cell. 2012;11:703–4.View ArticlePubMed CentralPubMedGoogle Scholar
- Gianoulis TA, Griffin MA, Spakowicz DJ, Dunican BF, Alpha CJ, Sboner A, et al. Genomic analysis of the hydrocarbon-producing, cellulolytic, endophytic fungus Ascocoryne sarcoides. PLoS Genet. 2012;8:e1002558.View ArticlePubMed CentralPubMedGoogle Scholar
- Ohm RA, de Jong JF, Lugones LG, Aerts A, Kothe E, Stajich JE, et al. Genome sequence of the model mushroom Schizophyllum commune. Nat Biotechnol. 2010;28:957–63.View ArticlePubMedGoogle Scholar
- Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402.View ArticlePubMed CentralPubMedGoogle Scholar
- Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, et al. CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2011;39(Database issue):D225–9.View ArticlePubMed CentralPubMedGoogle Scholar
- Bey M, Zhou S, Poidevin L, Henrissat B, Coutinho PM, Berrin J-G, et al. Cello-oligosaccharide oxidation reveals differences between Two lytic polysaccharide monooxygenases (family GH61) from Podospora anserina. Appl Environ Microbiol. 2013;79:488–96.View ArticlePubMed CentralPubMedGoogle Scholar
- Vu VV, Beeson WT, Phillips CM, Cate JHD, Marletta MA. Determinants of regioselective hydroxylation in the fungal polysaccharide monooxygenases. J Am Chem Soc. 2014;136:562–5.View ArticlePubMedGoogle Scholar
- Vaaje-Kolstad G, Bøhle LA, Gåseidnes S, Dalhus B, Bjørås M, Mathiesen G, et al. Characterization of the chitinolytic machinery of Enterococcus faecalis V583 and high-resolution structure of its oxidative CBM33 enzyme. J Mol Biol. 2012;416:239–54.View ArticlePubMedGoogle Scholar
- Gudmundsson M, Kim S, Wu M, Ishida T, Momeni MH, Vaaje-Kolstad G, et al. Structural and electronic snapshots during the transition from a Cu(II) to Cu(I) metal center of a lytic polysaccharide monooxygenase by X-ray photoreduction. J Biol Chem. 2014;289:18782–92.View ArticlePubMedGoogle Scholar
- Monod M. Secreted proteases from dermatophytes. Mycopathologia. 2008;166:285–94.View ArticlePubMedGoogle Scholar
- Martinez DA, Oliver BG, Gräser Y, Goldberg JM, Li W, Martinez-Rossi NM, et al. Comparative genome analysis of Trichophyton rubrum and related dermatophytes reveals candidate genes involved in infection. MBio. 2012;3:e00259–00212.View ArticlePubMed CentralPubMedGoogle Scholar
- Zaugg C, Monod M, Weber J, Harshman K, Pradervand S, Thomas J, et al. Gene expression profiling in the human pathogenic dermatophyte Trichophyton rubrum during growth on proteins. Eukaryot Cell. 2009;8:241–50.View ArticlePubMed CentralPubMedGoogle Scholar
- Loftus BJ, Fung E, Roncaglia P, Rowley D, Amedeo P, Bruno D, et al. The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science. 2005;307:1321–4.View ArticlePubMed CentralPubMedGoogle Scholar
- Baron O, Cassaing S, Percodani J, Berry A, Linas M-D, Fabre R, et al. Nucleotide sequencing for diagnosis of sinusal infection by Schizophyllum commune, an uncommon pathogenic fungus. J Clin Microbiol. 2006;44:3042–3.View ArticlePubMed CentralPubMedGoogle Scholar
- Tanaka H, Takizawa K, Baba O, Maeda T, Fukushima K, Shinya K, et al. Basidiomycosis: Schizophyllum commune osteomyelitis in a dog. J Vet Med Sci. 2008;70:1257–9.View ArticlePubMedGoogle Scholar
- Zugmaier W, Bauer R, Oberwinkler F. Mycoparasitism of some tremella species. Mycologia. 1994;86:49–56.View ArticleGoogle Scholar
- Fitzpatrick DA. Horizontal gene transfer in fungi. FEMS Microbiol Lett. 2012;329:1–8.View ArticlePubMedGoogle Scholar
- Jumpponen A, Jones KL. Seasonally dynamic fungal communities in the Quercus macrocarpa phyllosphere differ between urban and nonurban environments. New Phytol. 2010;186:496–513.View ArticlePubMedGoogle Scholar
- Redford AJ, Bowers RM, Knight R, Linhart Y, Fierer N. The ecology of the phyllosphere: geographic and phylogenetic variability in the distribution of bacteria on tree leaves. Environ Microbiol. 2010;12:2885–93.View ArticlePubMed CentralPubMedGoogle Scholar
- Spring J. Major transitions in evolution by genome fusions: from prokaryotes to eukaryotes, metazoans, bilaterians and vertebrates. J Struct Funct Genomics. 2003;3:19–25.View ArticlePubMedGoogle Scholar
- Richardson M. The ecology of the Zygomycetes and its impact on environmental exposure. Clin Microbiol Infect. 2009;15 Suppl 5:2–9.View ArticlePubMedGoogle Scholar
- Busk PK, Lange L. Cellulolytic potential of thermophilic species from four fungal orders. AMB Express. 2013;3:47.View ArticlePubMed CentralPubMedGoogle Scholar
- Busk PK. The meaning of life - On cactus finches, evolution and chaos. 1st ed. Saxo: Copenhagen, Denmark; 2013.Google Scholar
- Van den Brink J, de Vries RP. Fungal enzyme sets for plant polysaccharide degradation. Appl Microbiol Biotechnol. 2011;91:1477–92.View ArticlePubMed CentralPubMedGoogle Scholar
- Sánchez C. Lignocellulosic residues: biodegradation and bioconversion by fungi. Biotechnol Adv. 2009;27:185–94.View ArticlePubMedGoogle Scholar
- Wang Y, Geer LY, Chappey C, Kans JA, Bryant SH. Cn3D: sequence and structure views for Entrez. Trends Biochem Sci. 2000;25:300–2.View ArticlePubMedGoogle Scholar
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.