Echinocandin B biosynthesis: a biosynthetic cluster from Aspergillus nidulans NRRL 8112 and reassembly of the subclusters Ecd and Hty from Aspergillus pachycristatus NRRL 11440 reveals a single coherent gene cluster
© The Author(s). 2016
Received: 30 October 2015
Accepted: 6 July 2016
Published: 8 August 2016
Echinocandins are nonribosomal lipopeptides produced by ascommycete fungi. Due to their strong inhibitory effect on fungal cell wall biosynthesis and lack of human toxicity, they have been developed to an important class of antifungal drugs. Since 2012, the biosynthetic gene clusters of most of the main echinocandin variants have been characterized. Especially the comparison of the clusters allows a deeper insight for the biosynthesis of these complex structures.
In the genome of the echinocandin B producer Aspergillus nidulans NRRL 8112 we have identified a gene cluster (Ani) that encodes echinocandin biosynthesis. Sequence analyses showed that Ani is clearly delimited from the genomic context and forms a monophyletic lineage with the other echinocandin gene clusters. Importantly, we found that the disjunct genomic location of the echinocandin B gene cluster in A. pachycristatus NRRL 11440 on two separate subclusters, Ecd and Hty, at two loci was likely an artifact of genome misassembly in the absence of a reference sequence. We show that both sequences can be aligned resulting a single cluster with a gene arrangement collinear compared to other clusters of Aspergillus section Nidulantes. The reassembled gene cluster (Ecd/Hty) is identical to a putative gene cluster (AE) that was previously deposited at the NCBI as a sequence from A. delacroxii NRRL 3860. PCR amplification of a part of the gene cluster resulted a sequence that was very similar (97 % identity), but not identical to that of AE.
The Echinocandin B biosynthetic cluster from A. nidulans NRRL 8112 (Ani) is particularly similar to that of A. pachycristatus NRRL 11440 (Ecd/Hty). Ecd/Hty was originally reported as two disjunct sub-clusters Ecd and Hty, but is in fact a continuous sequence with the same gene order as in Ani. According to sequences of PCR products amplified from genomic DNA, the echinocandin B producer A. delacroxii NRRL 3860 is closely related to A. pachycristatus NRRL 11440. A PCR-product from the gene cluster was very similar, but clearly distinct from the sequence published for A. delacroxii NRRL 3860 at the NCBI (No. AB720074). As the NCBI entry is virtually identical with the re-assembled Ecd/Hty cluster, it is likely that it originates from A. pachycristatus NRRL 11440 rather than A. delacroxii NRRL 3860.
They have a strong antifungal activity because they specifically inhibit β-1,3-glucan synthase, the enzyme responsible for the main polymer in cell wall biosynthesis. Semi-synthetic derivatives have become important antifungal drugs for treatment of invasive mycoses [1–3]. Besides their pharmacological properties, their extraordinary structures offer numerous challenging questions for biosynthetic studies. Depending on the individual echinocandin, four or five of the six amino acids are non-proteinogenic. Most amino acid modifications are due to hydroxylations catalyzed by cytochrome P450 monoxygenases or non-heme dioxygenases. However, in two amino acids, 3-hydroxy-4-methylproline and dihyroxyhomothyrosine, the carbon skeleton is also non-canonical. The peptide ring of echinocandins is closed by an N-acyl-hemiacetal moiety, which is very sensitive to hydrolysis as a rule. Nevertheless, in echinocandins it is conformationally stabilized, so that the cyclopeptides can be handled without protection. The first systematic studies on echinocandin biosynthesis were carried out the early 1990s when researchers from Merck & Co. Research Laboratories identified the biosynthetic building blocks of the pneumocandin through 13C-labeling experiments and developed a comprehensive concept of pneumocandin biosynthesis [4, 5]. However, it was not until two decades later that echinocandin biosynthesis was elucidated at the genetic level [6, 7]. In the genome of Aspergillus pachycristatus NRRL 11440 , previously named as Emericella rugulosa, two separate partial clusters were located on different contigs of the assembly and were found to be responsible for encoding the synthesis of echinocandin B. The larger section, Ecd, encoded most of the enzymes required for echinocandin assembly and decoration. This included genes for a nonribosomal peptide synthetase (NRPS) with domains for the assembly of six amino acids and peptide cyclization, an Acyl Amp ligase for acylation with linoleic acid, three oxygenases, and an ABC transporter. The other cluster, Hty, contained the genes for the homotyrosine biosynthetic pathway and two additional oxygenases. The functions of many of the biosynthetic enzymes encoded by Ecd were demonstrated experimentally [6, 7]. In 2014, we described an α-ketoglutarate/Fe2+-dependent proline hydroxylase involved in pneumocandin biosynthesis in Glarea lozoyensis . More recently, the functions of four oxygenases in G. lozoyensis cluster GL were demonstrated by deletion in the host strain [10, 11]. The results were consistent with and complemented the findings for A. pachycristatus.
Taking all these results together, the nearly complete echinocandin biosynthesis can be modelled, though the precise order of some of the oxidation steps remain to be confirmed.
Aspergillus aculeatus ATCC 16872
GOLD Project ID: Gp0010055
Aspergillus mulundensis Y-30462 = DSMZ 5745 (= Aspergillus sydowii var. mulundensis)
Aspergillus nidulans NRRL 8112 (= Emericella nidulans DSM 946)
Coleophoma empetri F-11899
Coleophoma empetri No. 14573
Ecd/Hty (identical with AE)
Ecd/Hty: Aspergillus pachycristatus NRRL 11440 (= Emericella rugulosa, Aspergillus nidulans var. roseus ATCC 58397) AE: stated strain: Aspergillus delacroxii NRRL 3860 (= Emericella nidulans var. echinulatus
Ecd/Hty: assembly of Ecd JX421684 and Hty JX421685
GL: Glarea lozoyensis wildtype strain ATCC 20868
Glo: G. lozoyensis mutant strain ATCC 74030
Phialophora cf. hyalina No. 16616 (=Tolypocladium parasiticum)
Here we present the echinocandin B gene cluster from A. nidulans NRRL 8112 (Ani) identified by whole genome sequencing and sequence analysis. The Ani gene cluster is compared with the known echinocandin biosynthesis clusters from Aspergillus section Nidulantes. We also found that the two separated semi-clusters Ecd and Hty encoding echinocandin B biosynthesis in A. pachycristatus can be aligned into a continuous sequence (Ecd/Hty), which is virtually identical with the echinocandin B biosynthesis cluster AE deposited at the NCBI as a sequence from A. delacroxii NRRL 3860 (= E. nidulans var. echinulata).
Results and discussion
Echinocandin biosynthesis cluster from A. nidulans NRRL 8112
A. nidulans strain NRRL 8112 was originally isolated from India and was described as a weak producer of echinocandin B in a patent from Eli Lilly in 1976 (3.5 g/200 L culture broth) . As several other A. nidulans strains are well characterized and often used for molecular genetic research, we thought that echinocandin B biosynthesis by this strain would be of particular interest. To identify the corresponding cluster from A. nidulans, genomic DNA of NRRL 8112 was sequenced using the Illumina technique. Originally a reference assembly with the genome of A. nidulans FGSC A4 (assembly ASM14920v1) was attempted, however, more than one million reads remained unmapped. As an alternative, the reads were assembled de novo resulting a genome of 31.6 Mbp partitioned in 1,572 contigs. By BLASTn search using genes of the Ecd biosynthetic cluster as query sequences we localized an echinocandin biosynthesis cluster of about 51 kb (Ani) in a relatively small contig (56 kbp). The set of genes in Ani is equivalent to that of echinocandin B biosynthesis cluster AE (≙ Ecd/Hty) and the mulundocandin biosynthesis cluster. The gene orders are fully collinear with those from Aspergilli depicted previously .
Determination of terminal regions by comparison with AE from A. delacroxii
A comparison with the currently known echinocandin biosynthetic clusters (Table 1) revealed a highly conserved “core” set of genes, most of them encoding enzymes required for biosynthesis. Though evidence was rather vague, possible genes in the periphery of the core cluster were considered to be linked to echinocandin biosynthesis [7, 15]. However, similar genomic context in the flanking regions of the clusters indicating a function for echinocandin biosynthesis was only found in two closely related clusters from Coleophoma empetri strains (CE_1 and CE_2 in Table 1) .
Comparison of cluster AE with Ecd and Hty
Surprisingly, the calmodulin sequence of A. delacroxii NRRL 3860 (Additional file 1: Figure S8) differed in only two basepairs from that of A. pachycristatus NRRL 11440. Both sequences could be clearly assigned to the A. rugulosa/A. pachychristatus group rather than to the E. delacroxii (E. echinulata) strains (Additional file 1: Figure S7).
In addition to the common genomic marker sequences, a region between the NRPS and the adjacent cytochrome p450 oxygenase gene of the echinocandin biosynthesis cluster was analyzed. It was the same region which had already been sequenced from A. pachycristatus to confirm the connectivity of Ecd and Hty. As for the calmodulin fragments, the sequences of A. delacroxii and A. pachychrytatus (and the AE cluster) were very similar, but clearly distinct (97 % identity, see Additional file 1: Figure S10). From these results, we concluded that the AE cluster most likely originates from A. pachycristatus NRRL 11440 and not from A. delacroxii NRRL 3860.
In summary, we have identified the echinocandin B biosynthesis cluster Ani in the genome of the echinocandin B producer strain A. nidulans NRRL 8112. The gene order in Ani is collinear with that in the other known clusters of Aspergilli section Nidulantes (AE ≙ Ecd/Hty and AM). Sequence comparison of the individual proteins revealed that Ani forms a strictly monophyletic clade with the other echinocandin gene clusters. In the course of sequence comparisons, we found that the seemingly distant subclusters in A. pachychristatus NRRL 11440, Ecd and Hty, can be assembled into a single sequence (Ecd/Hty). The overlap of both sequences was confirmed by PCR. Moreover, Ecd/Hty was found to be virtually identical to echinocandin B biosynthesis-cluster AE which was deposited at the NCBI as a sequence from A. delacroxii NRRL 3860. To scrutinize this unexpected finding, genomic DNA from this strain and A. pachychristatus NRRL 11440 was amplified by PCR and sequenced. The calmodulin marker sequences of both strains were very similar, but not entirely identical (2 different bp). BLAST search and phylogenetic analysis showed that both of them were most similar to calmodulin sequences from other A. pachycristatus strains (99–100 % identity), and they clearly differed from those of A. delacroxii strains (92–94 % identity). The sequence of a PCR product from a section of the echinocandin gene cluster in A. delacroxii was very similar (97 % identity), but clearly distinct from the corresponding sequence of A. pachycristatus (≙ AE). Therefore, we suppose that AE originates from A. pachycristatus NRRL 11440 and not A. delacroxii NRRL 3860. Moreover, the partial sequence of the calmodulin gene and the remarkably high similarity of the gene cluster fragments strongly suggest that strain NRRL 3860 belongs to the A. pachychristatus/A. rugulosa group rather than to A. delacroxii. An in-depth examination of this strain will be necessary to clarify the taxonomic status.
Aspergillus nidulans NRRL 8112 (= ATCC 58396) and A. pachycristatus NRRL 11440 (= A. nidulans var. roseus ATCC 58397) were purchased from LGC Standards/ATCC (UK). A. delacroxii NRRL 3860 (= E. nidulans var. echinulata) was obtained from the Agricultural Research Service Collection (ARS) of the United States Department of Agriculture (USDA), Peoria IL (USA).
The genome of A. nidulans NRRL 8112 was sequenced and assembled by the De Novo Sequencing Service of BaseClear (Netherlands) using Illumina technique. (Assembly statistics: N50: 65,796 bp, L50: 138, largest contig: 462,827 bp). The paired-end library was deposited at the NCBI in Bioproject PRJNA89151 SRA experiment SRX128799. The genome was annotated with the automatic gene prediction tool AUGUSTUS  and the “Antibiotics and Secondary Metabolite Analysis Shell” (antiSMASH) .
For PCR-experiments genomic DNA from A. pachycristatus NRRL 11440 and A. delacroxii NRRL 3860 was prepared as follows: A culture was grown from a spore suspension in potato dextrose medium (250 mL) at 24 °C and 160 rpm for 14 days. The mycelium was harvested, frozen in liquid nitrogen and ground to a fine powder. The DNA was then extracted as per GenElute™ Plant Genomic DNA Miniprep Kit (Qiagen, Germany).
PCR experiments were conducted as per NEB protocol using Phusion™ Master Mix. The following profile was used for amplification: 96 °C for 10 s (hot start), pause and addition of polymerase; 96 °C for 5 min; then 30 cycles at 96 °C for 30 s; 57 °C (TM-2) for 20 s; 72 °C for 2 min; and a final extension at 72 °C for 10 min. For primer pair WH2_fw and WH2_rv a melting temperature of TM = 53 °C was used. Primer sequences are given in Table S1 in Additional file 1.
DNA and protein sequences were edited, compared and visualized with the Geneious 8.1 software platform . Sequence identities were determined by aligning each set of protein sequences with MUSCLE  implemented in Geneious 8.1. with the default settings. The global sequence identity of the clusters was determined through a concatenation of the alignments of all proteins which occur in all clusters.
The acronyms for the biosynthetic clusters are explained in Table 1.
BLAST(n), basic local alignment search tool (nucleotide); FGSC, Fungal Genetics Stock Center; NCBI, National Center for Biotechnology Information; NRPS, nonribosomal peptide synthase; NRRL, Northern Regional Research Laboratory; PCR, polymerase chain reaction
We thank Prof. Michael Müller for his generous support and James Swezey (NRRL) for assistance in obtaining strain NRRL 3860.
L.Y. was supported by an Alexander von Humboldt Research Fellowship. The article processing charge was funded by the German Research Foundation (DFG) and the Albert Ludwigs University Freiburg in the funding program Open Access Publishing.
Availability of data and material
The data sets supporting the results for this article are included within the article and Additional file 1.
The annotated sequence of the Ani-cluster is available from the NCBI nucleotide database under accession number KT806042. The partial calmodulin gene sequence of A. delacroxii NRRL 3860 was deposited under accession number KX159480.
WH conceived and supervised the study and drafted the manuscript. KGH performed the PCR-experiments and worked on the manuscript. LY prepared genomic DNA from A. nidulans NRRL 8112, coordinated the genome sequencing and performed initial cluster analyses. B.A.G and S.G. analyzed and annotated the genome. All authors have read and approved the final manuscript.
The authors declare that they have no competing interest.
Consent for publication
Ethics approval and consent to participate
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Balkovec JM, Hughes DL, Masurekar PS, Sable CA, Schwartz RE, Singh SB. Discovery and development of first in class antifungal caspofungin (CANCIDAS®) – A case study. Nat Prod Rep. 2014;31(1):15–34.View ArticlePubMedGoogle Scholar
- Stan CD, Tuchilus C, Stan CI. Echinocandins – new antifungal agents. Rev Med Chir Soc Med Nat Iasi. 2014;118(2):528–36.PubMedGoogle Scholar
- Emri T, Majoros L, Tóth V, Pócsi I. Echinocandins: production and applications. Appl Microbiol Biotechnol. 2013;97(8):3267–84.View ArticlePubMedGoogle Scholar
- Adefarati AA, Giacobbe RA, Hensens OD, Tkacz JS. Biosynthesis of L-671,329, an echinocandin-type antibiotic produced by Zalerion arboricola – origins of some of the unusual amino-acids and the dimethylmyristic acid side-chain. J Am Chem Soc. 1991;113(9):3542–5.View ArticleGoogle Scholar
- Adefarati AA, Hensens OD, Jones ETT, Tkacz JS. Pneumocandins from Zalerion arboricola V. Glutamic acid-derived and leucine-derived amino-acids in pneumocandin A0 (L-671,329) and distinct origins of the substituted proline residues in pneumocandins A0 and B0. J Antibiot. 1992;45(12):1953–7.View ArticlePubMedGoogle Scholar
- Jiang W, Cacho RA, Chiou G, Garg NK, Tang Y, Walsh CT. EcdGHK are three tailoring iron oxygenases for amino acid building blocks of the echinocandin scaffold. J Am Chem Soc. 2013;135(11):4457–66.View ArticlePubMedPubMed CentralGoogle Scholar
- Cacho RA, Jiang W, Chooi Y-H, Walsh CT, Tang Y. Identification and characterization of the eechinocandin B biosynthetic gene cluster from Emericella rugulosa NRRL 11440. J Am Chem Soc. 2012;134(40):16781–90.View ArticlePubMedPubMed CentralGoogle Scholar
- Matsuzawa T, Tanaka R, Horie Y, Hui Y, Abliz P, Yaguchi T. The correlation among molecular phylogenetics, morphological data, and growth temperature of the genus Emericella, and a new species. Mycoscience. 2012;53(6):433–45.View ArticleGoogle Scholar
- Houwaart S, Youssar L, Hüttel W. Pneumocandin biosynthesis: involvement of a trans-selective proline hydroxylase. ChemBioChem. 2014;15(16):2365–9.View ArticlePubMedGoogle Scholar
- Li Y, Chen L, Yue Q, Liu X, An Z, Bills GF. Genetic manipulation of the pneumocandin biosynthetic pathway for generation of analogues and evaluation of their antifungal activity. ACS Chem Biol. 2015;10:1702–10.View ArticlePubMedGoogle Scholar
- Chen L, Yue Q, Li Y, Niu X, Xiang M, Wang W, Bills GF, Liu X, An Z. Engineering of Glarea lozoyensis for exclusive production of the Pneumocandin B0 precursor of the antifungal drug caspofungin acetate. Appl Environ Microbiol. 2015;81(5):1550–8.View ArticlePubMedGoogle Scholar
- Yue Q, Chen L, Zhang X, Li K, Sun J, Liu X, An Z, Bills GF. Evolution of chemical diversity in echinocandin lipopeptide antifungal metabolites. Eukaryot Cell. 2015;14(7):698–718.View ArticlePubMedPubMed CentralGoogle Scholar
- Bills GF, Li Y, Chen L, Yue Q, Niu X-M, An Z. New insights into the echinocandins and other fungal non-ribosomal peptides and peptaibiotics. Nat Prod Rep. 2014;31(10):1348–75.View ArticlePubMedGoogle Scholar
- Higgens CE, Michel KH. Antibiotic A-22082 and process for production thereof, US patent 4024246. 1977.Google Scholar
- Chen L, Yue Q, Zhang X, Xiang M, Wang C, Li S, Che Y, Ortiz-López F, Bills GF, Liu X, et al. Genomics-driven discovery of the pneumocandin biosynthetic gene cluster in the fungus Glarea lozoyensis. BMC Genomics. 2013;14(1):339.View ArticlePubMedPubMed CentralGoogle Scholar
- Benz F, Knüsel F, Nüesch J, Treichler H, Voser W, Nyfeler R, Keller-Schierlein W. Stoffwechselprodukte von Mikroorganismen 143. Mitteilung. Echinocandin B, ein neuartiges Polypeptid-Antibioticum aus Aspergillus nidulans var. echinulatus: Isolierung und Bausteine. Helv Chim Acta. 1974;57(8):2459–77.View ArticleGoogle Scholar
- CIBA GEIGY AG. Antifungal antibiotic A32204 production by aerobic cultivation of Aspergillus nidulans var. echinulatus A32204. Swiss patent 568386. 1975Google Scholar
- Samson RA, Visagie CM, Houbraken J, Hong SB, Hubka V, Klaassen CHW, Perrone G, Seifert KA, Susca A, Tanney JB, et al. Phylogeny, identification and nomenclature of the genus Aspergillus. Stud Mycol. 2014;78:141–73.View ArticlePubMedPubMed CentralGoogle Scholar
- Boeck LVD, Kastner RE. Method of producing the A-30912 antibiotics, US Patent 4288549. 1981.Google Scholar
- Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA, Chen W. Fungal Barcoding Consortium. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for fungi. Proc Natl Acad Sci U S A. 2012;109(16):6241–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Hong S-B, Go S-J, Shin H-D, Frisvad JC, Samson RA. Polyphasic taxonomy of Aspergillus fumigatus and related species. Mycologia. 2005;97(6):1316–29.View ArticlePubMedGoogle Scholar
- Gueidan C, Ruibal C, de Hoog GS, Schneider H. Rock-inhabiting fungi originated during periods of dry climate in the late devonian and middle triassic. Fungal Biol. 2011;115(10):987–96.View ArticlePubMedGoogle Scholar
- Hoff KJ, Stanke M. WebAUGUSTUS – a web service for training AUGUSTUS and predicting genes in eukaryotes. Nucleic Acids Res. 2013;41(Web Server issue):W123–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Weber T, Blin K, Duddela S, Krug D, Kim HU, Bruccoleri R, Lee SY, Fischbach MA, Müller R, Wohlleben W, et al. antiSMASH 3.0 – a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res. 2015;43(W1):W237–43.View ArticlePubMedPubMed CentralGoogle Scholar
- Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28(12):1647–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7.View ArticlePubMedPubMed CentralGoogle Scholar
- Hino M, Fujie A, Iwamoto T, Hori Y, Hashimoto M, Tsurumi Y, Sakamoto K, Takase S, Hashimoto S. Chemical diversity in lipopeptide antifungal antibiotics. J Ind Microbiol Biotechnol. 2001;27(3):157–62.View ArticlePubMedGoogle Scholar
- Bills GF, Yue Q, Chen L, Li Y, An Z, Frisvad JC. Aspergillus mulundensis sp. nov., a new species for the fungus producing the antifungal echinocandin lipopeptides, mulundocandins. J Antibiot. 2016;69(3):141–8.View ArticlePubMedGoogle Scholar