Identification of the Hevea brasiliensisAP2/ERF superfamily by RNA sequencing
BMC Genomics volume 14, Article number: 30 (2013)
Rubber tree (Hevea brasiliensis) laticifers are the source of natural rubber. Rubber production depends on endogenous and exogenous ethylene (ethephon). AP2/ERF transcription factors, and especially Ethylene-Response Factors, play a crucial role in plant development and response to biotic and abiotic stresses. This study set out to sequence transcript expressed in various tissues using next-generation sequencing and to identify AP2/ERF superfamily in the rubber tree.
The 454 sequencing technique was used to produce five tissue-type transcript libraries (leaf, bark, latex, embryogenic tissues and root). Reads from all libraries were pooled and reassembled to improve mRNA lengths and produce a global library. One hundred and seventy-three AP2/ERF contigs were identified by in silico analysis based on the amino acid sequence of the conserved AP2 domain from the global library. The 142 contigs with the full AP2 domain were classified into three main families (20 AP2 members, 115 ERF members divided into 11 groups, and 4 RAV members) and 3 soloist members. Fifty-nine AP2/ERF transcripts were found in latex. Alongside the microRNA172 already described in plants, eleven additional microRNAs were predicted to inhibit Hevea AP2/ERF transcripts.
Hevea has a similar number of AP2/ERF genes to that of other dicot species. We adapted the alignment and classification methods to data from next-generation sequencing techniques to provide reliable information. We observed several specific features for the ERF family. Three HbSoloist members form a group in Hevea. Several AP2/ERF genes highly expressed in latex suggest they have a specific function in Hevea. The analysis of AP2/ERF transcripts in Hevea presented here provides the basis for studying the molecular regulation of latex production in response to abiotic stresses and latex cell differentiation.
Natural rubber accounts for 42.3% of the 23.9 million tons of rubber consumed worldwide . Hevea brasiliensis is the sole commercial source of natural rubber. The increasing demand for natural rubber calls for improved productivity in rubber plantations. Cis-1,4-polyisoprene chains are synthesized in the rubber particles of latex cells. Rubber particles account for up to 90% of the dry matter in latex cytoplasm, which is harvested by tapping the soft bark of rubber trees . Latex production depends on genetic, environmental and harvesting components. Harvesting systems use ethephon, an ethylene (ET) releaser applied to the tapping panel, to stimulate latex production by improving the flow and regeneration of latex. Tapping and ethephon stimulation frequencies are adjusted to Hevea clones according to their metabolism . Given the high pressure in the phloem tissue, latex is expelled after tapping. Tapping and ethephon are likely to be sources of stress conducive to the production of secondary metabolites and consequent rubber, but over a certain stress limit they lead to tapping panel dryness (TPD) . Mechanical wounding and osmotic stresses related to tapping trigger the production of endogenous ethylene and oxylipins such as jasmonic acid (JA) [5, 6]. Both mechanical wounding and methyl-jasmonate treatments induce the differentiation of secondary latex cells in extended young stems [7–9]. In trees, secondary latex cells are differentiated from cambium and then anastomosed to create laticifer vessels . Ethephon application also induces several biochemical processes in laticifers, such as sucrose loading, water uptake, nitrogen assimilation or synthesis of defence proteins [11, 12], involving a large number of ethylene-response genes [13–18], whereas its direct role in rubber biosynthesis is controversial .
Given the major role of ethylene and jasmonic acid in regulating latex cells, Ethylene-Response Factors are greatly expected to be involved in latex cell functioning. Indeed, ET and JA signalling pathways involve transcription factors such as Ethylene-Responsive Element Binding Proteins (EREBP), also called the Ethylene-Response Factors (ERF) family . ERFs have been shown to act as activators or repressors of additional downstream ethylene responsive genes. ERFs are a control point for crosstalk with other signals and they function as an integrator of the ethylene and jasmonic acid pathway . Multiple signalling pathways converge on ERFs by transcriptional and post-transcriptional regulation . Ethylene and jasmonate pathways converge in the transcriptional activation of ETHYLENE RESPONSE FACTOR1 (ERF1), which regulates in vivo the expression of a large number of genes responsive to both ethylene and jasmonate. ERF1 acts downstream of the intersection between the ethylene and jasmonate pathways suggesting that this transcription factor is a key element in the integration of both signals for the regulation of defence response genes [23, 24]. In biotic stress, AP2/ERF transcription factor ORA59 acts as the integrator of the JA and ET signalling pathways and is the key regulator of JA- and ET-responsive PDF1.2 expression [21, 25].
The ERF family was first discovered in Nicotiana tabaccum by Ohme-Takagi and Shinshi . The ERF family is one of the largest families of transcription factors with 122 genes out of the 2016 predicted transcription factors from 58 families in Arabidopsis[26, 27]. The ERF family belongs to the AP2/ERF superfamily. This superfamily encodes transcriptional regulators that serve a variety of functions in plant development and responses to biotic and abiotic stimuli [28–30]. Members of the AP2/ERF superfamily contain at least one AP2 domain, which consists of about 60 amino acids. This domain is involved in DNA binding to a conserved AGCCGCC sequence called the GCC-box [20, 31] or to a dehydration response element (DRE: TACCGACAT) containing the C-repeat [32, 33]. The structure of the AP2 domain was first reviewed by Riechmann and coll. . Initially, the APETALA2 (AP2) gene was isolated by T-DNA insertional mutagenesis in Arabidopsis. This gene encodes a 432-amino acid protein with two copies of a 68-amino acid direct repeat called the AP2 domain. The AP2 domain consists of three anti-parallel β-sheets and one α-helix. Two conserved elements, YRG and RAYD, have been identified. The latter is an 18-amino acid core region that is predicted to form an amphipathic α-helix . In some AP2 genes, the AP2 domain contains a 37-amino acid serine-rich acidic domain putatively functioning as an activation domain, and a 10-amino acid domain including a putative nuclear localization sequence KKSR . While previously thought to be plant-specific transcription factors, AP2 domain-containing genes were recently found in bacteria and viruses, which are predicted to be HNH endonucleases [37, 38].
Several ways of classifying the AP2/ERF superfamily have been proposed in plants. Although all of them are based on the number of AP2 domains, some differences exist. Firstly, Sakuma et al. described five subfamilies including AP2, RAV, Dehydration Responsive Element Binding Proteins (DREB), Ethylene-Responsive Element Binding Proteins (EREBP), also called the Ethylene-Response Factors (ERF) family , and others based on a homology of the DNA binding domain, and the DNA sequences that bind it, namely the DRE element or GCC-box separately . The AP2, ERF/DREB and RAV subfamilies have two AP2 domains, one AP2 domain, or one AP2 and one B3 domain, respectively. Groups A1 to A6 and B1 to B6 have been assigned to the DREB and ERF families . Secondly, Nakano et al. classified these proteins in only three major families: AP2, ERF and RAV . The ERF family was then divided into ten groups according to the structure of the AP2 domain, with groups I to IV corresponding to the DREB subfamily in Sakuma’s classification. To date, Nakano’s classification method has remained a reference for organizing the AP2/ERF superfamily in three families (AP2, ERF, RAV) and the ERF family in ten groups. In the construction of phylogenetic trees, methods for multiple sequence alignment and tree reconstruction have to be considered with caution. In the analyses by Sakuma and Nakano, ClustalW followed by a neighbor-joining method was chosen. Currently, although computationally intensive, the multiple sequence alignment software MUSCLE followed by a maximum likelihood method (PhyML) is more relevant [40–44].
The availability of the whole genome sequence of several plant species has made it possible to confirm a relatively well-conserved organization of the AP2/ERF superfamily with 147, 149, 202, 180 and 146 genes in Arabidopsis thaliana, Vitis vinifera, Populus trichocarpa Oryza sativa, and Solanum lycopersicon, mostly represented by the ERF family [27, 28, 45–47]. Transcript sequencing is also an alternative for identifying such gene families. For instance, 156 AP2/ERF genes consisting of 148 ERFs, 4 AP2s and 4 RAVs were identified in Gossypium hirsutum from EST databases . In Hevea brasiliensis, transcriptome sequencing has been carried out on latex, bark, leaf and shoot apex tissues using various methods [49–54]. A few number of ERF sequences have been released in the Genbank (HbEREBP1: HQ171930.1; EREBP2: HQ171931.1; DREB1p: HQ902146.1; CBF1: AY960212.1) . As preliminary experiment, we identified AP2/ERF unigenes from latex and leaf tissues of the Hevea clone Reyan 7-33-97 members . This analysis revealed 45 AP2/ERF with partial AP2 domain that did not allow gene classification (Additional file 1). Given the involvement of wounding, jasmonate and ethylene in natural rubber production, we developed a reference transcriptome that covers a large number of tissues and environmental conditions to have a fully comprehensive Hevea transcriptome and we examined the organization of the AP2/ERF superfamily in Hevea. Firstly, RNAs were isolated from different tissues of plants at several stages of development growing under various conditions, and transcripts were sequenced using GS-FLX next-generation sequencing (NGS) technologies. Secondly, contigs harbouring at least one AP2 domain were identified in tissue-type libraries for leaves, bark, latex, roots and embryogenic tissues and from a global library which pooled reads from all tissue-type libraries. AP2 domain-containing genes were aligned with the Arabidopsis AP2/ERF sequences and classified according to Nakano’s method based on a phylogenetic analysis of the conserved AP2 domain, which was optimized using a maximum likelihood method (PhyML) [40–42]. Post-transcriptional regulation was checked by predicting microRNA-targeted AP2/ERF genes. This study suggested that some HbAP2/ERF genes expressed in latex cells could be involved in specific biological processes.
Transcript sequence libraries
Transcript sequences were produced for five tissue-type libraries (embryogenic tissues, leaf, bark, latex and root) from the Hevea clone PB 260 by the pyrosequencing GS-FLX 454 technique. Total mRNAs were isolated from different tissues collected from plants at different stages of development and having undergone different treatments in order to have the most complete representation of the transcriptome (Table 1). A half run of 454 sequencing generated more than 500,000 reads for each tissue-type library (Table 2). An automatic pipeline was used to remove reads shorter than 120 bp and non-coding sequences and for clustering and assembly of contigs with TGICL (Figure 1). The annotation of contigs has been proceeded using miR target prediction by MIRANDA, and protein function by similarity with several protein sequence databases by BLAST. For the embryogenic tissues, leaf, bark, latex and root libraries, the number of contigs was 44,988, 29,910, 45,114, 29,016 and 50,146 respectively (Table 2). Reads from all libraries (2,390,118) were re-assembled in a global library to generate 94,981 contigs. The large coverage of the global library led to improve contig lengths, which reached 807 bp on average.
Identification and classification of AP2/ERF superfamily genes in hevea
Hevea AP2/ERF transcripts were identified using TBLASTN to search for translated nucleotide in the global Hevea transcript library using the amino acid Arabidopsis thaliana AP2 domain sequences as the query. The 173 putative AP2/ERF superfamily contigs identified in the global transcript sequence database contained full-length and partial transcripts. Of them, 142 contigs had the full-length AP2 domain sequences of 58–59 amino acids. Multiple alignment analysis was performed on full-length AP2 domain sequences from Hevea, Arabidopsis and Populus. Group classification was firstly achieved by constructing the general phylogenetic tree of AP2 domains in Arabidopsis and Hevea with the neighbour-joining method (data not shown), and then the phylogenetic relationships between these genes were analysed by constructing another phylogenetic tree using the PhyML method only for Hevea (Figure 2). The Nakano classification method was used to organize the Hevea AP2/ERF superfamily into families and groups. The alignments indicated three clusters corresponding to the AP2, ERF and RAV families, with the ERF family divided into eleven major groups including an additional VI-L group, and the three soloists rooted with the AP2 family. The AP2 family was organized in two groups including eight AINTEGUMENTA (ANT) and twelve AP2 genes.
The number of members of the Hevea AP2/ERF superfamily was compared with six other species (Table 3). The AP2/ERF superfamily has a similar number of genes in Vitis (149) and Arabidopsis (147). This number is higher for Gossypium (156), Populus (202) and Hevea (173), while it is lower for Solanum (112) and Triticum (117). These differences were mostly induced by a change in the number of ERF genes. In Hevea, twenty-five genes were assigned to the AP2 family based on the identity of their amino acid sequences with the A. thaliana AP2 proteins and the presence of a double AP2 domain in their sequences. This number included contigs with one or two partial AP2 domains. Ten genes containing a single complete/partial AP2 domain were classified in the AP2 family given their greater homology with the AP2 family. The largest family was the ERF family with 141 genes harbouring a single AP2 domain, including twenty-six contigs with a partial sequence of the AP2 domain. Four genes were classified in the RAV family, which had one single AP2 domain and one B3 domain. Three soloist genes were identified in Hevea whereas only one has been reported for Arabidopsis and Populus, and no soloists have been identified in Solanum and Gossypium.
One hundred and fifteen Hevea genes with a full AP2 domain from the ERF family were organized in eleven groups according to the Nakano classification, including the VI-Like group (Table 4). The number of genes (115) for the Hevea ERF family was comparable to that for Arabidopsis, Gossypium and Vitis (122, 148 and 135, respectively). The Hevea ERF groups showed several characteristics. Firstly, several ERF groups and subgroups found in Arabidopsis, such as subgroup IIc and groups IVb, Xc and Xb-L, were not found in Hevea (Figure 2). Secondly, Gossypium (22 genes) and Hevea (23 genes) have the largest number of ERF genes for group VII and conversely they have the smallest number of genes for group IV with 2 and 3 genes, respectively for these two species.
The alignment of nucleotide and deduced amino acid sequences of the three Hevea soloists revealed that they shared 64.8 to 72.9% and 73.2 to 93.2% of identity, respectively (Table 5). For the AP2 domain, this identity reached 92.3 to 97.4%. A multiple alignment analysis was carried out on AP2 domain sequences from Hevea, Arabidopsis and Populus (Figure 3). This phylogenetic analysis revealed an evolution of the three Hevea soloists after speciation phenomena with Arabidopsis and more recently with Populus. Nakano’s classification method was compared with Sakuma’s for the 142 Hevea genes with a complete AP2 domain (Table 6). Families and groups were noted as subfamilies and subgroups previously by Sakuma. ERF genes were classified in two subfamilies consisting of thirty-three DREB (ERF family group I to IV) and eighty-two ERF (ERF family group V to X) genes. ERF subfamily genes were twice as large as the DREB subfamily in Hevea.
Structure and group-specific residues of the AP2 domains of ERFgenes
The amino acid sequences of the AP2 domain from fifty-five representative ERF genes with full-length transcript sequences were aligned in order to identify the structure and the group-specific residues (Figure 4). Tertiary structures of the AP2 domain were predicted and revealed similarity to AtERF1 for each gene consisting of a three-stranded anti-parallel β-sheet and one α-helix (Protein Database number 2GCC). Specific amino acid residues were also identified for each group. AP2 domains from ERF family proteins contained the WLG motif and most of them also contained the YRG and RAYD elements. The positions of the AP2 domain were numbered according to the three-dimensional structure of AtERF1. Ten amino acids were totally conserved in each group (G148, R150, R152, G155, E160, I161, W172, L173, G174 and A182, Figure 4). Most AP2 domain sequences had conserved amino acid residues: V158 and E163 for groups I to IV and A158 and D163 for groups V to X, which corresponded to V14 and E19 for DREB and A14 and D19 for the ERF subfamilies according to Sakuma’s classification, respectively (Figure 4 and Additional file 2: Figure S1). A few members that did not show any conservation at these positions 158 and 163 were categorized based on their placement in the phylogenetic tree. A conservative sequence motif of 5 amino acid residues (KREYD) only occurred in group VI-L.
The group-specific amino acid residues observed in Hevea were compared with those of Arabidopsis and Gossypium (Table 7). At least one group-specific residue could be identified for each group, two for groups II and VIII, and three for group VII. Hevea group VI-L revealed one more group-specific residue (M196) in addition to the K189 found in all species. For group IX, one additional residue at position 167 was identified for all species leading to an AP2 domain of 59 amino acids long, as opposed to 58 for the other groups. In the Hevea, Arabidopsis and Gossypium AP2 family, the AP2 domains contain a conserved amino acid, T150 (92%) or A150 (8%). The AP2 domains of the RAV family have the amino acid residue V150 conserved at 100% in Hevea, Arabidopsis and Gossypium.
Distribution of reads from AP2/ERFcontigs in the various tissue-type libraries
The distribution of reads constituting AP2/ERF contigs in each tissue-type library reflected the global level of expression of AP2/ERF genes in each tissue (Figure 5). The number of reads was more abundant in roots with 29.8% (1883 reads), bark with 22.2% (1403 reads), followed by latex with 21.2% (1341 reads), embryogenic tissues with 16.4% (1037 reads) and then leaves with 10.4% (654 reads).
The sum of reads for the various ERF groups showed that some groups were more represented in some tissues (Table 8). A higher read number was observed in latex for groups II, VII and VIII, in bark for groups III, VI-L and IX, in leaves for groups II, VIII and IX, in roots for groups I, IV, V, VI and VII, and in embryogenic tissues for group X only. The DESeq statistical analysis of the read distribution for each contig did not revealed any significant differential gene expression (Additional file 3; Figure 6). All AP2/ERF families and all ERF groups were represented. Thirty-seven transcripts were detected in all five tissues. Fifty-nine contigs were built from reads found in latex.
Expression profile in various tissues for ten selected AP2/ERF genes
Ten AP2/ERF genes were selected for their high number of reads per contig or their presence in some tissue-type libraries. Primer were designed (Table 9), and their specificity was confirmed for each gene by a unique pick of the fusion curve after real-time RT-PCR amplification (Figure 7). Their relative transcript abundance was checked by real-time RT-PCR using HbRH2b as stable internal control between each tissue (Figure 8). The highest relative transcript abundance was found for HbERF-VIIa12, which ranged from 0.4 to 114. Interestingly, HbERF-IIb4 and HbERF-VIIa4 showed significant higher relative transcript abundance in latex than other tissues, 1.8 and 28 respectively, like it was observed for the read distribution. Nevertheless, latex specificity of HbERF-IIb5 expression was not proved since transcripts of this gene were also highly accumulated in embryogenic callus. Relative transcript accumulation was noted in embryogenic tissues (proliferating callus, embryogenic callus or somatic embryos) for HbERF-IIb5, HbERF-VIIa1, HbAP2-3 and HbAP2-6. The high read distribution counted in root was confirmed by high relative transcript abundance in the tap root of one-year-old plants. Finally, no significant difference could be found in relative transcript abundance for HbERF-VIIIa4 in contrast with the higher read distribution in latex and leaf compared with other tissues.
Prediction of microRNA-targeted AP2/ERFgenes
Twelve microRNAs (miR156, miR159, miR172, miR393, miR395, miR396, miR408, miR894, miR1511, miRn11, miRn12, miRn14) were predicted to inhibit Hevea transcripts of twenty-nine HbAP2/ERF genes (Table 10). Eight microRNA families (miR159, miR172, miR393, miR408, miR894, miRn11, miRn12, miRn14) were predicted to inhibit thirteen latex-expressed transcripts (HbAP2-4, HbAP2-6, HbAP2-10, HbAP2-18, HbERF-Ib5, HbERF-IIa1, HbERF-VI5, HbERF-VI-L1, HbERF-VIIa4, HbERF-VIIa13, HbERF-VIIa20, HbERF-VIIIa7, HbSoloist3). Although inhibition mostly involved a transcript cleavage, the inhibition of translation was predicted for nine genes (HbERF-Ib5, HbERF-IIa1, HbERF-VI5, HbERF-VIIa4, HbERF-VIIIa7, HbERF-IXc3, HbERF-IXc5, HbERF-Xa1, HbSoloist3). Predicted microRNA sites were in the conserved AP2 domain (HbERF-IIId3, HbERF-IXb1, HbERF-IXc6, HbERF-Xa1 and HbERF-Xa8 genes), in the CDS outside the AP2 domain for sixteen genes, and in the UTR for eight genes.
NGS data combined with an optimized method of alignment and classification led to the identification of the HeveaAP2/ERF superfamily
The AP2/ERF superfamily has been identified in several species from both genome and EST sequences. For the first time to our knowledge, this study presented the identification of most members of the AP2/ERF superfamily using the 454 sequencing technology for crop plants for which few data are available and especially for rubber. The one hundred and seventy-three AP2/ERF members identified in Hevea were clustered into three main families (25 AP2, 141 ERF, and 4 RAV members) and a group of 3 soloists using a maximum likelihood phylogenetic analysis. The ERF family was then subdivided into 11 major groups, which corresponded to groups I to X, and group VI-like described by Nakano . The stringency used for the read assembly led to discriminate the various allelic forms. Hevea brasiliensis is highly heterozygous that should lead to have various allelic forms in the assembled contigs and consequently less genes than AP2/ERF members. The number of Hevea AP2/ERF genes was comparable to the number observed in other species. RNA sequencing of additional tissues, such as flowers, should lead to cover the whole transcriptome.
The first phylogenetic analyses came up against the low quality of contig sequences from NGS (data not shown). The minimum overlap length was increased to 60 bp compared to the 40 bp used in Jatropha curcas for instance, with a minimum overlap identity of 95% . Finally, the assembly strategy for Hevea reads delivered robust contigs from current programs since the clustering method discriminated conserved domains from the various AP2/ERF genes. Sequences of AP2/ERF genes were shown to be from unique transcript for 10 genes in this study and more recently for 132 genes by analysis of the fusion curve after real-time RT-PCR amplification (data not shown). In addition, homopolymer correction by mapping Solexa reads was not required. The Neighbour-Joining tree built from the protein distance matrix with manual correction proposed by Nakano was widely adopted for classification of the ERF family. Based on NGS contigs, the classification method proposed by Nakano provided inconsistent results due to errors and the accuracy rate of contig sequences. An AP2 domain block of 57 amino acids was selected for the alignment of 142 sequences with a full AP2 domain using a combination of MUSCLE and Gblock softwares. The use of Gblocks reduced the need for manual editing of multiple alignments. This method facilitated the construction of a consistent phylogenetic tree with PhyML software without requiring a Bayesian Inference method. The latter method was successfully used to classify the Arabidopsis ERF protein family . These authors included groups VI-like and Xb-like described by Nakano et al. in their phylogenetic reconstruction, and ultimately assigned these groups as new groups XI and XII, respectively. Group VI-L genes were close to group VI, with a modification in the second element suggesting that the evolution of group VI-L is more recent than that of the other groups. This independent cluster on the Hevea phylogenetic tree led us to propose it as a new group (see below).
Hevea AP2/ERFgenes have common and several specific features compared to other species
Several functionally important conserved motifs described in Arabidopsis and tomato were also found in Hevea AP2/ERF deduced proteins suggesting that they are likely to function as transcription factors . The putative nuclear localization signal (NLS) motif near the R1 domain was found in Hevea AP2/ERF transcription factor sequences (data not shown). The residues G148, R150, R152, G155, E160, I161, G174 and A182 were completely conserved among all 437 ERF proteins collected from three species (Hevea, Arabidopsis and Gossypium). These observations are generally consistent with earlier reports on this topic [27, 34, 39]. The conserved Ala-37 (corresponding to A182 in this paper) in the ERF domain has been suggested to play a major role in the stability of the ERF domain or DNA binding with the DRE element or GCC box [56, 59]. The ERF-associated amphiphilic repression (EAR) motif was first described by Ohta . The EAR motif is found in group II and VIII. DEAR1, a DREB protein-containing EAR motif, has been shown to mediate crosstalk between signalling pathways for biotic and abiotic stress responses . The EAR motif exists in all members of ERF group VIII in tomato  and in ERF group VIIIa in Arabidopsis[27, 28, 45].
Soloists have been characterized by low conservation at the ERF DNA-binding domain in all plant genomes considered . In Hevea, we showed that this low conservation could be explained by 6 missing amino acid residues in their AP2 domain, including R152, which directly contacts the GCC box . The three HbSoloist genes only shared between 65% and 73% identity in their nucleotide sequences, which led us to consider these as three different HbSoloist genes and not as allelic forms. Although the three HbSoloist genes have only a single AP2 domain, they formed a group and clustered together with the AP2 family, as has been reported in Vitis vinifera. The greater conservation in amino acid sequences than in nucleotides, especially for the AP2 domain, revealed an evolutionary constraint suggesting a putative function for Hevea soloists since there were recent duplications. However, no functional information has been published for soloist genes.
Based on an analysis of 437 AP2 domain sequences of ERF genes from three species, ten amino acid residues were shown to be strictly group-specific for all ERF groups except for group II and group VIII. A previous study on 315 AP2 domain sequences from Arabidopsis, Gossypium and Oryza led to the identification of 14 group-specific residues with a certain error rate . The group-specific residues reported in this study could be proposed as a group marker of the ERF family for several species. In addition, Hevea AP2/ERF genes harboured unique group-specific residues in their AP2 domain, such as VI-L (M196), which are not found in other species. This difference could be explained by the distance between Gossypium and Arabidopsis in the Eurosides II (Brassicales and Malvales, respectively) and Hevea in the Eurosides I (Malpighiales) . We also identified that position 150 was conserved in Hevea, Arabidopsis, Gossypium, Populus with T150, T150 and V150 for the ERF, AP2 and RAV families, respectively. Position 150 directly contacts with DNA. These interactions determine the geometry of the GCC-box binding domain (GBD) relative to DNA and thereby comprise a framework for specific base recognition .
Several AP2/ERF genes highly expressed in latex could be related to a specific function in Hevea
AP2/ERF genes are regulated by developmental processes and environmental cues . As rubber trees are subjected to frequent mechanical wounding and osmotic stress upon tapping to collect latex, and ethephon stimulation to increase latex yield, some of these transcription factors are likely to play a unique role in Hevea defence mechanisms and latex production. Latex cells are differentiated in phloem tissue from cambium . Members of the AP2 family play an important role in angiosperm reproductive organ development [65–68]. Members of the RAV family were reported to be induced in ethylene response and in brassinosteroid response and to be involved in flower senescence . Consequently, the AP2 and RAV genes present in latex are suggested to play an important role in Hevea development.
Several of the fifty-one ERF transcripts accumulated in latex could be related to responses to stress. High read abundance was found in latex for ERF groups II, VII and VIII. Latex cells are differentiated in roots, leaves and bark. This might explain why genes expressed in latex could also be identified in the other tissues. In addition, thirteen other transcripts were highly accumulated in latex compared with other tissues: one for the AP2 family, one for the RAV family and eleven for the ERF family. The ERF transcripts highly accumulated in latex were distributed as follows: one for group I, four for group II, two for group VII and four for group VIII. Large number of genes was identified for groups VII, VIII and IX with 23, 15 and 19 genes, respectively.
A few members of the AP2/ERF superfamily have been previously reported in Hevea. The HbERF1, HbERF2, HbERF3 and HbRAV1 genes were suggested to be induced by JA in bark during JA-induced laticifer differentiation . According to our analysis, the HbERF1, HbERF2 and HbERF3 genes corresponded to HbERF-VIIa3, HbERF-VIIa17 and HbERF-VIIa1 in our classification with 99%, 98%, 99% identity, respectively. The HbCBF1 gene , and the HbCBF2 gene  have been reported to be regulated by cold and drought stresses, like other members of the DREB subfamily. We classified these genes in ERF group III. The HbCBF1 gene corresponded to the HbERF-IIIc1 gene with an identity of 100%, and the HbCBF2 gene to the HbERF-IIIb2 gene with 82% identity. Another member of the AP2/ERF superfamily is the HbEREBP1 gene recently identified by Chen et al. from Hevea laticifers . This gene was down-regulated by tapping and mechanical wounding in laticifers from adult trees, and was also regulated by both exogenous ethephon or methyl jasmonate treatments. This suggests that the HbEREBP1 gene may be a negative regulator of defence mechanisms in laticifers . The HbEREBP1 gene corresponded to the HbERF-VIIIa12 gene with 100% identity in our analysis.
Eleven new microRNAs are predicted to inhibit HeveaAP2/ERF transcripts
The mode of action of miR172-regulated AP2 genes has been well described in reproductive and vegetative organs as well as in the transition of developmental phases [73, 74], where multiple feedback loops involve the microRNAs miR156e targeting Squamosa Promoter Binding Protein-like (SPL) and miR172b targeting AP2 . Seven gymnosperm AP2 homologs were found to contain a sequence corresponding to miR172 with an average identity of approximately 84.4%, suggesting that mechanisms regulating gene expression using microRNAs have been conserved over the three hundred million years since the divergence of gymnosperm and flowering plant lineages . The cleavage site of miR172 is conserved between plant lineages and is located between the second AP2 domain and the 3′ terminus . This site is also observed in Hevea. However, miR172 regulates flowering time by down-regulating AP2-like target genes by a translational mechanism rather than by RNA cleavage , and could explain our failure in detecting cleaved HbAP2-18 and HbAP2-20 transcripts (data not shown). In addition to miR172, eleven other microRNAs were predicted to inhibit Hevea transcripts of twenty-nine HbAP2/ERF genes. Seven microRNA families were only found in various tissues of plantlets , and five others only in the latex of mature trees, including three novel microRNA families (miRn11, miRn12, miRn14) (data not shown). For the first time to our knowledge, both cleavage and translation inhibition were predicted with miR binding in the CDS sequence and especially for 5 genes in the AP2 domain.
NGS sequencing of five tissue-type libraries led to the generation of transcriptome data from the broadest coverage of tissues in Hevea compared with previous work done on latex, leaf and bark [49, 50, 78, 79]. This allowed identifying 173 AP2 domain-containing transcripts, of which 142 had a full-length AP2. We have proposed an optimized alignment and classification method enabling the use of NGS data with repeatable outputs. Analysis of read abundance led to the prediction that ERF genes play a major role in laticifers. A comparison with Populus and Vitis did not provide any specific features for woody species as assumed earlier , but the AP2 family appeared to be well represented for these species. Several AP2/ERF genes highly expressed in latex could be related to a specific function in Hevea. Further studies focusing on latex cells should provide a clearer understanding of the involvement of genes from the AP2/ERF superfamily in the regulation of latex production and latex cell differentiation. Finally, analysis of allelic variations between transcript sequences of several Hevea clones could be useful for the development of functional molecular markers.
Plant material of clone PB 260 was grown according to the conditions described in the Table 1. Self-rooted plants were produced by somatic embryogenesis at the CIRAD laboratory [55, 80]. Total mRNAs were isolated from different tissues. The embryogenic tissue sample was a mix of proliferating callus, embryogenic callus and somatic embryos. Leaf, root and bark tissues were taken from in vitro plantlets and grown for up to 1 month and 1 year after acclimatization. At each time point, in vitro plants were treated for 4 and 24 h with 1 ppm of ethylene or by wounding, or by water deficiency up to wilting leaves [14, 81]. Leaf, root and bark tissues were also taken from three-month-old shoots from grafted plants treated by wounding and 1 ppm of ethylene. The leaves were mechanically wounded by squeezing the entire surface of the leaves with pincers whilst the bark was wounded every 0.5 cm by scarification with a razor blade. Latex and bark samples were taken at IRRI’s Sembawa Centre from 5-year-old trees that were either untapped, tapped or both tapped and stimulated with 2.5% ethephon before RNA isolation.
Total RNA isolation
Leaves, bark, roots, embryogenic tissues (somatic embryos and callus) were frozen in liquid nitrogen and stored in the freezer at -80°C pending total RNA extraction. Total RNA was extracted using the caesium chloride cushion method adapted from Sambrook  by Duan and coll. . One gram of fresh matter was ground and transferred to a tube containing 30 ml of extraction buffer consisting of 4 M guanidium isothiocyanate, 1% sarcosine, 1% polyvinylpyrrolidone and 1% ß-mercapto-ethanol. After homogenization, tubes were kept on ice and then centrifuged at 10,000 g at 4°C for 30 minutes. The supernatant was transferred to a new tube containing 8 ml of 5.7 M CsCl. Ultracentrifugation in a swinging bucket was carried out at 115,700 g at 20°C for 20 hours. The supernatant and caesium cushion were discarded whilst the RNA pellet was washed with 70% ethanol. After 30 minutes of air drying, the pellet was dissolved in 200 μl of sterile water. RNAs were conserved at −80°C.
The procedure for total RNA isolation from latex was derived from the method described by Kush et al. . Six ml of latex was mixed with 6 ml of 2X alkaline fixing buffer (0.1 M Tris–HCl, 0.3 M LiCl, 10 mM EDTA, 10% SDS pH 9) and immediately deep-frozen in liquid nitrogen for storage. The mixture was then thawed and centrifuged for 30 min at 15,000 g. The aqueous fraction was treated with a phenol:chloroform solution twice, including centrifugation for 15 min at 10,000 g at 4°C. RNAs were precipitated overnight at 4°C after the addition of 1/3 volume of 8 M LiCl to the aqueous phase. After centrifugation for 30 min at 10,000 g at 4°C, the RNA pellet was resuspended in 400 uL of DEPC water on ice and then treated with a phenol:chloroform solution twice. The RNAs were finally precipitated with a 1/10 volume of 3 M Na acetate, pH 5.2, and 3 volumes of absolute ethanol. After centrifugation, the RNA pellet was resuspended and the solution kept at −80°C.
Sequencing technique and contig assembly
Total RNA samples of each tissue from plants grown under the various conditions were pooled together to generate five transcript libraries (embryogenic tissues, leaf, bark, latex and root). Single-stranded cDNA was synthesised from pooled RNA samples. Pyrosequencing was carried out using GS FLX (Roche / 454) Titanium run (Roche Applied Science) by the GATC-Biotech company in Germany. A 454 sequencing half-run (200 Mbp) generated more than 500,000 reads with an average read length of 400 bp for each library according to the manufacturer. Reads were analysed using the ESTtik tool (Expressed Sequence Tag Treatment and investigation kit)  modified for the analysis of 454 data (Table 2). Reads were first cleaned to avoid mis-assembly by discarding sequences that were both lower than 120 bp and of low complexity. We then discarded non-coding reads by comparing the reads against the fRNAdb database using the Megablast algorithm with an e-value cutoff of 1e-20 . More than 400,000 cleaned reads were obtained for each library. Reads were then assembled in contigs using the TGICL program , integrated in the ESTtik pipeline (Figure 1). The removal of poor end regions of reads and the computation of overlaps between reads has been done using default parameters of CAP3 program (best hit cut-off for difference -b = 20; best hit for clipping –c = 12) . Clustering was carried out for reads with an overlap of at least 60 bp and 94% identity between reads.
The second step was an assembly of reads from each cluster with greater stringency: the length of sequence overlap was then 60 bp with 95% identity between reads. The transcript sequence database consisted of contigs. An automatic annotation of each contig was attempted using the BLAST algorithm to find similar sequences using the Arabidopsis thaliana peptide database Tair9, the Uniprot databases Swissprot and TrEMBL, the non-redundant protein sequence database NR and the nucleotide sequence database NT from GenBank. Contigs were then annotated with Gene Ontology terms using Blast2GO on our Blast results . We predicted peptide sequences for each contig using the prot4EST pipeline . The peptide sequences were then annotated comparing the sequences on the InterPro signature database using the InterProScan web service . A first assembly set was generated from reads of each tissue separately to create tissue-type transcript databases. The reads of all five tissue-type libraries were then re-assembled to generate one global transcript sequence database for Hevea clone PB260, subsequently called the global database. Contig sequences of the global library are available on CIRAD’s website http://esttik.cirad.fr/ and in public databases.
Identification of AP2 domain-containing contigs
Firstly, we downloaded the AP2 domain of the 147 Arabidopsis thaliana AP2/ERF genes from the Arabidopsis Transcription Factor Database (ArabTFDB) (http://plntfdb.bio.uni-potsdam.de/v3.0/). BLASTX was carried out using the 147 AtAP2 domain amino acid sequences as protein subjects and nucleic acid sequences of contigs assembled in the HbPB260 transcript database as the query . Conversely, TBLASTN was carried out using nucleic acid sequences of contigs as the subject and the 147 AtAP2 domain amino acid sequences as the query. The two BLAST files were combined in order to keep information obtained in both BLASTX analyses. Contigs were translated using prot4EST  or FrameDP  and AP2 domain-containing contigs were then identified using the Interpro database (IPR001471) . This analysis was validated with the Conserved Domain Database (CDD) and Resource Group on NCBI . The method led to the identification of contigs with a full and partial AP2 domain.
Phylogenetic analysis of the AP2 domain from putative AP2/ERFgenes
A multiple alignment analysis was performed on full-length AP2 domain sequences from Hevea, Arabidopsis and Gossypium. Phylogenetic trees were firstly generated with the Neighbour-Joining method for Hevea, Arabidopsis and Gossypium in order to classify the groups (data not shown). The full AP2-domain sequences derived from 142 H. brasiliensis AP2-domain proteins of around 60 amino acids were then aligned using MUSCLE software [43, 95], which uses a progressive multiple alignment method. The alignment was curated by Gblocks software , searching for at least 10-amino-acid-long conserved blocks, and the block with 57 amino acids was extracted. This block of 57 amino acids was used to construct the phylogenetic tree using PhyML software , which implements a maximum likelihood tree reconstruction method, using the LG + gamma model, starting from a BioNJ tree . The tree was drawn and displayed with the Dendroscope program, and rooted on the branch separating the AP2 and RAV family from the rest of the tree. Branch supports were computed using the aLRT-SHlike method, and those under 0.70 were discarded. For genes of the AP2 family having two AP2 domains, the sequence of the first AP2 domain (repeat-1 or R1) was preferentially selected for alignment. For three partial transcripts, the second AP2 domain (repeat-2 or R2) was chosen for alignment instead of the first, which is not present.
Comparison of the classification between various species
Genes from the AP2/ERF superfamily are listed in Tables 3, 5 and 6 from publications on Arabidopsis thaliana, Populus trichicarpa, Vitis vinifera, Solanum lycopersicum, Gossypium hirsutum and Triticum aestivum. For Hevea brasiliensis, the classification of the AP2/ERF superfamily was based on the phylogenetic analysis presented in this paper. In addition to data from the phylogenetic analysis, contigs corresponding to partial transcripts harbouring either a partial AP2 domain sequence or only one AP2 domain instead of two for genes of the AP2 family are included in the presentation of Table 3.
Identification of conserved motifs and specific amino acid residues
AP2 domain amino acid sequences from the Hevea ERF genes were aligned using CLUSTALX. Conserved residues observed in Hevea sequences were compared with those of other species such as Gossypium and Arabidopsis in order to identify ERF group-specific residues [48, 62].
Extraction of read data from AP2/ERF contigs of each library and statistical analysis
Perl script was used to parse the alignment .ace file provided by the TGICL assembler in order to count the number of reads for each transcript and to identify the number of reads from each tissue (bark, leaf, latex, root and embryogenic tissues). The data are presented in Figure 6. Statistical analysis of differentially expressed genes was carried out using DESeq (v1.10.1) package in R software [98, 99]. Firstly, we have estimated the effective library size. Secondly, the estimated dispersion for all transcripts are fitted using “blind” method, “fit-only” sharing mode and “local” fitType as parameter for the “estimateDispersions” function. Then, we performed the “nbinomTest” to get p-values. The p-values were adjusted for multiple testing using the Benjamini and Hochberg as proposed in the DEseq package.
Prediction of microRNA-targeted AP2/ERFgenes
Deep sequencing of Hevea was performed with Solexa/Illumina technology and led to the identification of miRNA sequences conserved between plant species and putative novel miRNAs specific to Hevea using the LeARN pipeline . The AP2/ERF sequences from Hevea were scanned with conserved and non-conserved miRNA sequences using both psRNATarget server (http://plantgrn.noble.org/psRNATarget/,  and MIRANDA, which is included in the LeARN pipeline  with custom parameters (gap_value = 2, mm_value = 1, gu_value = 0.5, score_threshold = 3, min_length_alignment = 15 and no_mismatch_positions = 10;11). Only the miRNA/target pairs displaying a free energy below −20 kcal/mol are presented in Table 10.
Analysis of transcript abundances by real-time RT-PCR
Several rules were applied in order to reduce the risk of error in relative gene expression data. The integrity of total RNA was checked by electrophoresis. Primers were designed at the 3’ side of each sequence in order to reduce the risk of error due to short cDNA synthesis using the Primer 3 module of Geneious (Biomatters Ltd., New Zealand). Real-time PCR amplification and the fusion curve were carried out using a mix of cDNAs in order to check the specificity of each pair of primers. Primer sequences are listed for 10 selected genes according to their distribution of reads per contigs in Table 9.
cDNAs were synthesized from 2 μg of total RNA to the final 20 μL reaction mixture using a RevertAidTM M-MuLV Reverse Transcriptase (RT) kit according to the manufacturer’s instructions (MBI, Fermentas, Canada). Full-length cDNA synthesis was checked on each cDNA sample by PCR amplification of the Actin cDNA using primers at the cDNA ends. Quantitative gene expression analysis was finally carried out by real-time RT-PCR using a Light Cycler 480 (Roche, Switzerland). Real-time PCR reaction mixtures consisted of 2 μL RT product cDNA, 0.6 μL of 5 μM of each primer, and 3 μL 2 × SYBR green PCR master mix (LightCycler® 480 SYBR Green I Master, Roche Applied Sciences) in a 6-μL volume. PCR cycling conditions comprised one denaturation cycle at 95°C for 5 min, followed by 45 amplification cycles (95°C for 20 s, 60°C for 15 s, and 72°C for 20s). Expression analysis was performed in a 384-well plate. Samples were loaded using an automation workstation (Biomek NX, Beckman Coulter).
Real-time PCR was previously carried out for eleven housekeeping genes in order to select the most stable gene as the internal control for all compared tissues (HbelF1Aa, HbUBC4, HbUBC2b, HbYLS8, HbRH2b, HbRH8, HbUBC2a, HbalphaTub, Hb40S, HbUbi, HbActin) (Data not shown). HbRH2b was selected as the best reference gene according to its stability in the various tissues. The HbRH2b gene was amplified in each reaction plate in parallel with target genes. The transcript abundance level for each gene was relatively quantified by normalization with the transcript abundance of the reference HbRH2b gene. Relative transcript abundance took into account primer efficiencies. All the normalized ratios corresponding to transcript accumulation were calculated automatically by Light Cycler Software version 1.5.0 provided by the manufacturer using the following calculation: Normalized Ratio = 2 -Δ(Cp target-Cp RH2b).
Real-time PCR reactions were set up with three biological replications. Statistical analysis was performed with an ANOVA after logarithmic transformation of raw data. The ANOVA was followed by a Student Newman-Keuls test. Values with the same letter did not differ significantly at the 0.05 probability level.
Cold responsive element binding factor
Drought responsive element binding protein
Ethylene responsive factor
Polymerase chain reaction
Related to APETALA2
Related to ABI3/VP1
Sainte-Beuve J: Cyclope: les marches mondiaux. Caoutchouc. 2012, 387-390.
d’Auzac J, Jacob J-L, Prévôt J-C, Clément A, Gallois R, Chrestin H, Lacote R, Pujade-Renaud V, Gohet E: The regulation of cis-polyisoprene production (natural rubber) from hevea brasiliensis. Recent Research Developments in Plant Physiology. Edited by: Pandalai SG. 1997, Kerala India: Research Signpost, 273-332. 1
Jacob J-L, Prévôt J-C, Roussel D, Lacrotte R, Serres E, d’Auzac J, Eschbach J-M, Omont H: Yield limiting factors, latex physiological parameters, latex diagnosis, and clonal typology. Physiology of Rubber Tree Latex. Edited by: d’Auzac J, Jacob J-L, Chrestin H. 1989, Boca Raton, Florida: CRC press, Inc, 345-382.
Venkatachalam P, Thulaseedharan A, Raghothama K: Molecular identification and characterization of a gene associated with the onset of tapping panel dryness (TPD) syndrome in rubber tree (hevea brasiliensis muell.) by mRNA differential display. Mol Biotechnol. 2009, 41 (1): 42-52. 10.1007/s12033-008-9095-y.
Cai HB, Hu YS, Huang HS, Cheng H: Cloning and expression analysis of HbCBF2 gene in hevea brasiliensis. J Trop Agric Sci Technol. 2008, 31 (3): 424-425.
Chen YY, Wang LF, Dai LJ, Yang SG, Tian WM: Characterization of HbEREBP1, a wound-responsive transcription factor gene in laticifers of hevea brasiliensis muell. Arg Mol Biol Rep. 2012, 39 (4): 3713-3719. 10.1007/s11033-011-1146-y.
Hao B-Z, Wu J-L: Effects of wound (tapping) on laticifer differentiation in hevea brasiliensis. Acta Botanica Sinica. 1982, 24: 388-391.
Hao B-Z, Wu J-L: Laticifer differentiation in hevea brasiliensis: induction by exogenous jasmonic acid and linolenic acid. Ann Bot. 2000, 85 (1): 37-43. 10.1006/anbo.1999.0995.
Wu J-L, Hao B-Z, Tan H-Y: Wound-induced laticifer differentiation in hevea brasiliensis shoots mediated by jasmonic acid. J Rubber Res. 2002, 5: 53-63.
de Faÿ E, Jacob JL: Anatomical organization of the laticiferous system in the bark. Physiology of rubber tree latex. Edited by: d’Auzac J, Jacob JL, Chrestin H. 1989, Boca Raton (FL): CRC Press, 4-14.
Dusotoit-Coucaud A, Brunel N, Kongsawadworakul P, Viboonjun U, Lacointe A, Julien JL, Chrestin H, Sakr S: Sucrose importation into laticifers of hevea brasiliensis, in relation to ethylene stimulation of latex production. Ann Bot. 2009, 104 (4): 635-647. 10.1093/aob/mcp150.
Tang C, Huang D, Yang J, Liu S, Sakr S, Li H, Zhou Y, Qin Y: The sucrose transporter HbSUT3 plays an active role in sucrose loading to laticifer and rubber productivity in exploited trees of Hevea brasiliensis (para rubber tree). Plant Cell Environ. 2010, 33 (10): 1708-1720. 10.1111/j.1365-3040.2010.02175.x.
Dusotoit-Coucaud A, Kongsawadworakul P, Maurousset L, Viboonjun U, Brunel N, Pujade-Renaud V, Chrestin H, Sakr S: Ethylene stimulation of latex yield depends on the expression of a sucrose transporter (HbSUT1B) in rubber tree (Hevea brasiliensis). Tree Physiol. 2010, 30 (12): 1586-1598. 10.1093/treephys/tpq088.
Duan C, Rio M, Leclercq J, Bonnot F, Oliver G, Montoro P: Gene expression pattern in response to wounding, methyl jasmonate and ethylene in the bark of Hevea brasiliensis. Tree Physiol. 2010, 30 (10): 1349-1359. 10.1093/treephys/tpq066.
Tungngoen K, Viboonjun U, Kongsawadworakul P, Katsuhara M, Julien JL, Sakr S, Chrestin H, Narangajavana J: Hormonal treatment of the bark of rubber trees (hevea brasiliensis) increases latex yield through latex dilution in relation with the differential expression of two aquaporin genes. J Plant Physiol. 2011, 168 (3): 253-262. 10.1016/j.jplph.2010.06.009.
Duan B, Li Y, Zhang X, Korpelainen H, Li C: Water deficit affects mesophyll limitation of leaves more strongly in sun than in shade in two contrasting picea asperata populations. Tree Physiol. 2009, 29 (12): 1551-1561. 10.1093/treephys/tpp085.
Tungngoen K, Kongsawadworakul P, Viboonjun U, Katsuhara M, Brunel N, Sakr S, Narangajavana J, Chrestin H: Involvement of HbPIP2;1 and HbTIP1;1 aquaporins in ethylene stimulation of latex yield through regulation of water exchanges between inner liber and latex cells in hevea brasiliensis. Plant Physiol. 2009, 151 (2): 843-856. 10.1104/pp.109.140228.
Pujade-Renaud V, Clement A, Perrot-Rechenmann C, Prevot JC, Chrestin H, Jacob JL, Guern J: Ethylene-induced increase in glutamine synthetase activity and mRNA levels in hevea brasiliensis latex cells. Plant Physiol. 1994, 105 (1): 127-132.
Zhu J, Zhang Z: Ethylene stimulation of latex production in hevea brasiliensis. Plant Signal Behav. 2009, 4 (11): 1072-1074. 10.4161/psb.4.11.9738.
Ohme-Takagi M, Shinshi H: Ethylene-inducible DNA binding proteins that interact with an ethylene-responsive element. Plant Cell. 1995, 7 (2): 173-182.
Zarei A, Korbes AP, Younessi P, Montiel G, Champion A, Memelink J: Two GCC boxes and AP2/ERF-domain transcription factor ORA59 in jasmonate/ethylene-mediated activation of the PDF1.2 Promoter in arabidopsis. Plant Mol Biol. 2011, 75 (4–5): 321-331.
Shinshi H: Ethylene-regulated transcription and crosstalk with jasmonic acid. Plant Sci. 2008, 175: 18-23. 10.1016/j.plantsci.2008.03.017.
Lorenzo O, Piqueras R, Sanchez-Serrano JJ, Solano R: ETHYLENE RESPONSE FACTOR1 integrates signals from ethylene and jasmonate pathways in plant defense. Plant Cell. 2003, 15 (1): 165-178. 10.1105/tpc.007468.
Solano R, Stepanova A, Chao Q, Ecker JR: Nuclear events in ethylene signaling: a transcriptional cascade mediated by ETHYLENE-INSENSITIVE3 and ETHYLENE-RESPONSE-FACTOR1. Genes Dev. 1998, 12 (23): 3703-3714. 10.1101/gad.12.23.3703.
Pre M, Atallah M, Champion A, De Vos M, Pieterse CM, Memelink J: The AP2/ERF domain transcription factor ORA59 integrates jasmonic acid and ethylene signals in plant defense. Plant Physiol. 2008, 147 (3): 1347-1357. 10.1104/pp.108.117523.
Zhuang J, Chen JM, Yao QH, Xiong F, Sun CC, Zhou XR, Zhang J, Xiong AS: Discovery and expression profile analysis of AP2/ERF family genes from triticum aestivum. Mol Biol Rep. 2011, 38 (2): 745-753. 10.1007/s11033-010-0162-7.
Nakano T, Suzuki K, Fujimura T, Shinshi H: Genome-wide analysis of the ERF gene family in arabidopsis and rice. Plant Physiol. 2006, 140 (2): 411-432. 10.1104/pp.105.073783.
Zhuang J, Cai B, Peng RH, Zhu B, Jin XF, Xue Y, Gao F, Fu XY, Tian YS, Zhao W: Genome-wide analysis of the AP2/ERF gene family in populus trichocarpa. Biochem Biophys Res Commun. 2008, 371 (3): 468-474. 10.1016/j.bbrc.2008.04.087.
Xu ZS, Ni ZY, Liu L, Nie LN, Li LC, Chen M, Ma YZ: Characterization of the TaAIDFa gene encoding a CRT/DRE-binding factor responsive to drought, high-salt, and cold stress in wheat. Mol Genet Genomics. 2008, 280 (6): 497-508. 10.1007/s00438-008-0382-x.
Gutterson N, Reuber TL: Regulation of disease resistance pathways by AP2/ERF transcription factors. Curr Opin Plant Biol. 2004, 7 (4): 465-471. 10.1016/j.pbi.2004.04.007.
Hao D, Ohme-Takagi M, Sarai A: Unique mode of GCC box recognition by the DNA-binding domain of ethylene-responsive element-binding factor (ERF domain) in plant. J Biol Chem. 1998, 273 (41): 26857-26861. 10.1074/jbc.273.41.26857.
Jiang C, Lu B, Singh J: Requirement of a CCGAC cis-acting element for cold induction of the BN115 gene from winter brassica napus. Plant Mol Biol. 1996, 30 (3): 679-684. 10.1007/BF00049344.
Stockinger EJ, Gilmour SJ, Thomashow MF: Arabidopsis thaliana CBF1 encodes an AP2 domain-containing transcriptional activator that binds to the C-repeat/DRE, a cis-acting DNA regulatory element that stimulates transcription in response to low temperature and water deficit. Proc Natl Acad Sci U S A. 1997, 94 (3): 1035-1040. 10.1073/pnas.94.3.1035.
Riechmann JL, Meyerowitz EM: The AP2/EREBP family of plant transcription factors. Biol Chem. 1998, 379 (6): 633-646.
Jofuku KD, den Boer BG, Van Montagu M, Okamuro JK: Control of Arabidopsis flower and seed development by the homeotic gene APETALA2. Plant Cell. 1994, 6 (9): 1211-1225.
Okamuro JK, Caster B, Villarroel R, Van Montagu M, Jofuku KD: The AP2 domain of APETALA2 defines a large new family of DNA binding proteins in Arabidopsis. Proc Natl Acad Sci U S A. 1997, 94 (13): 7076-7081. 10.1073/pnas.94.13.7076.
Magnani E, Sjolander K, Hake S: From endonucleases to transcription factors: evolution of the AP2 DNA binding domain in plants. Plant Cell. 2004, 16 (9): 2265-2277. 10.1105/tpc.104.023135.
Wessler SR: Homing into the origin of the AP2 DNA binding domain. Trends Plant Sci. 2005, 10 (2): 54-56. 10.1016/j.tplants.2004.12.007.
Sakuma Y, Liu Q, Dubouzet JG, Abe H, Shinozaki K, Yamaguchi-Shinozaki K: DNA-binding specificity of the ERF/AP2 domain of arabidopsis DREBs, transcription factors involved in dehydration- and cold-inducible gene expression. Biochem Biophys Res Commun. 2002, 290 (3): 998-1009. 10.1006/bbrc.2001.6299.
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O: New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010, 59 (3): 307-321. 10.1093/sysbio/syq010.
Guindon S, Delsuc F, Dufayard JF, Gascuel O: Estimating maximum likelihood phylogenies with PhyML. Methods Mol Biol. 2009, 537: 113-137. 10.1007/978-1-59745-251-9_6.
Guindon S, Lethiec F, Duroux P, Gascuel O: PHYML Online--a web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Res. 2005, 33 (Web Server issue): W557-W559.
Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinforma. 2004, 5: 113-10.1186/1471-2105-5-113.
Thompson JD, Linard B, Lecompte O, Poch O: A comprehensive benchmark study of multiple sequence alignment methods: current challenges and future perspectives. PLoS One. 2011, 6 (3): e18093-10.1371/journal.pone.0018093.
Licausi F, Giorgi FM, Zenoni S, Osti F, Pezzotti M, Perata P: Genomic and transcriptomic analysis of the AP2/ERF superfamily in vitis vinifera. BMC Genomics. 2010, 11: 719-10.1186/1471-2164-11-719.
Zhuang J, Chen JM, Yao QH, Xiong F, Sun CC, Zhou XR, Zhang J, Xiong AS: Discovery and expression profile analysis of AP2/ERF family genes from triticum aestivum. Mol Biol Rep. 2011, 2011:
Pirrello J, Prasad N, Zhang W, Chen K, Mila I, Zouine M, Latche A, Pech JC, Ohme-Takagi M, Regad F: Functional analysis and binding affinity of tomato ethylene response factors provide insight on the molecular bases of plant differential responses to ethylene. BMC Plant Biol. 2012, 12 (1): 190-10.1186/1471-2229-12-190.
Champion A, Hebrard E, Parra B, Bournaud C, Marmey P, Tranchant C, Nicole M: Molecular diversity and gene expression of cotton ERF transcription factors reveal that group IXa members are responsive to jasmonate, ethylene and xanthomonas. Mol Plant Pathol. 2009, 10 (4): 471-485. 10.1111/j.1364-3703.2009.00549.x.
Chow KS, Mat-Isa MN, Bahari A, Ghazali AK, Alias H, Mohd-Zainuddin Z, Hoh CC, Wan KL: Metabolic routes affecting rubber biosynthesis in hevea brasiliensis latex. J Exp Bot. 2012, 63 (5): 1863-71. 10.1093/jxb/err363.
Chow KS, Wan KL, Isa MN, Bahari A, Tan SH, Harikrishna K, Yeang HY: Insights into rubber biosynthesis from transcriptome analysis of hevea brasiliensis latex. J Exp Bot. 2007, 58 (10): 2429-2440. 10.1093/jxb/erm093.
Li D, Deng Z, Chen C, Xia Z, Wu M, He P, Chen S: Identification and characterization of genes associated with tapping panel dryness from Hevea brasiliensis latex using suppression subtractive hybridization. BMC Plant Biol. 2010, 10: 140-10.1186/1471-2229-10-140.
Xia Z, Xu H, Zhai J, Li D, Luo H, He C, Huang X: RNA-Seq analysis and de novo transcriptome assembly of hevea brasiliensis. Plant Mol Biol. 2011, 77 (3): 299-308. 10.1007/s11103-011-9811-z.
Li D, Deng Z, Qin B, Liu X, Men Z: De novo assembly and characterization of bark transcriptome using illumina sequencing and development of EST-SSR markers in rubber tree (hevea brasiliensis muell. Arg.). BMC Genomics. 2012, 13: 192-10.1186/1471-2164-13-192.
Triwitayakorn K, Chatkulkawin P, Kanjanawattanawong S, Sraphet S, Yoocha T, Sangsrakru D, Chanprasert J, Ngamphiw C, Jomchai N, Therawattanasuk K: Transcriptome sequencing of hevea brasiliensis for development of microsatellite markers and construction of a genetic linkage Map. DNA Res. 2011, 18 (6): 471-482. 10.1093/dnares/dsr034.
Lardet L, Dessailly F, Carron MP, Rio MA, Ferriere N, Montoro P: Secondary somatic embryogenesis in hevea brasiliensis (mull. Arg.): an alternative process for long-term somatic embryogenesis. J Rubber Res. 2009, 12 (4): 215-228.
Sharma MK, Kumar R, Solanke AU, Sharma R, Tyagi AK, Sharma AK: Identification, phylogeny, and transcript profiling of ERF family genes during development and abiotic stress treatments in tomato. Mol Genet Genomics. 2010, 284 (6): 455-475. 10.1007/s00438-010-0580-1.
Natarajan P, Parani M: De novo assembly and transcriptome analysis of five major tissues of jatropha curcas L. Using GS FLX titanium platform of 454 pyrosequencing. BMC Genomics. 2011, 12: 191-10.1186/1471-2164-12-191.
Goremykin V, Moser C: Classification of the arabidopsis ERF gene family based on bayesian analysis. Mol Biol (Mosk). 2009, 43 (5): 789-794.
Liu Y, Zhao TJ, Liu JM, Liu WQ, Liu Q, Yan YB, Zhou HM: The conserved Ala37 in the ERF/AP2 domain is essential for binding with the DRE element and the GCC box. FEBS Lett. 2006, 580 (5): 1303-1308. 10.1016/j.febslet.2006.01.048.
Ohta M, Matsui K, Hiratsu K, Shinshi H, Ohme-Takagi M: Repression domains of class II ERF transcriptional repressors share an essential motif for active repression. Plant Cell. 2001, 13 (8): 1959-1968.
Tsutsui T, Kato W, Asada Y, Sako K, Sato T, Sonoda Y, Kidokoro S, Yamaguchi-Shinozaki K, Tamaoki M, Arakawa K: DEAR1, A transcriptional repressor of DREB protein that mediates plant defense and freezing stress responses in arabidopsis. J Plant Res. 2009, 122 (6): 633-643. 10.1007/s10265-009-0252-6.
Allen MD, Yamasaki K, Ohme-Takagi M, Tateno M, Suzuki M: A novel mode of DNA recognition by a beta-sheet revealed by the solution structure of the GCC-box binding domain in complex with DNA. EMBO J. 1998, 17 (18): 5484-5496. 10.1093/emboj/17.18.5484.
Chase MW: An update of the angiosperm phylogeny group classification for the orders and families of flowering plants: APG II. Bot J Linn Soc. 2003, 141: 399-436.
Fujita M, Fujita Y, Noutoshi Y, Takahashi F, Narusaka Y, Yamaguchi-Shinozaki K, Shinozaki K: Crosstalk between abiotic and biotic stress responses: a current view from the points of convergence in the stress signaling networks. Curr Opin Plant Biol. 2006, 9 (4): 436-442. 10.1016/j.pbi.2006.05.014.
Bowman JL, Smyth DR, Meyerowitz EM: Genetic interactions among floral homeotic genes of Arabidopsis. Development. 1991, 112 (1): 1-20.
Modrusan Z, Reiser L, Feldmann KA, Fischer RL, Haughn GW: Homeotic transformation of ovules into carpel-like structures in arabidopsis. Plant Cell. 1994, 6 (3): 333-349.
Elliott RC, Betzner AS, Huttner E, Oakes MP, Tucker WQ, Gerentes D, Perez P, Smyth DR: AINTEGUMENTA, an APETALA2-like gene of arabidopsis with pleiotropic roles in ovule development and floral organ growth. Plant Cell. 1996, 8 (2): 155-168.
Klucher KM, Chow H, Reiser L, Fischer RL: The AINTEGUMENTA gene of arabidopsis required for ovule and female gametophyte development is related to the floral homeotic gene APETALA2. Plant Cell. 1996, 8 (2): 137-153.
Woo HR, Kim JH, Kim J, Lee U, Song IJ, Lee HY, Nam HG, Lim PO: The RAV1 transcription factor positively regulates leaf senescence in Arabidopsis. J Exp Bot. 2010, 61 (14): 3947-3957. 10.1093/jxb/erq206.
Wu HL, Yu B, Cheng QQ, Zeng RZ, Duan CF, Nie ZY, Li Y: Cloning and characterization of jasmonic acid-induced AP2/EREBP genes in laticifer from rubber tree (hevea brasiliensis muell. Arg.). Chinese Agricultural Science Bulletin. 2010, 26 (5): 287-293.
Cheng H, An ZW, Huang HS: Cloning and sequence analysis of HbCBF1 gene in hevea brasiliensis. Chinese Journal of Tropical Crops Research. 2005, 26 (3): 50-55.
Chen YY, Wang LF, Dai LJ, Yang SG, Tian WM: Characterization of HbEREBP1, a wound-responsive transcription factor gene in laticifers of Hevea brasiliensis Muell. Arg Mol Biol Rep. 2012, 39 (4): 3713-9. 10.1007/s11033-011-1146-y.
Rubio-Somoza I, Weigel D: MicroRNA networks and developmental plasticity in plants. Trends Plant Sci. 2011, 16 (5): 258-264. 10.1016/j.tplants.2011.03.001.
Aukerman MJ, Sakai H: Regulation of flowering time and floral organ identity by a MicroRNA and its APETALA2-like target genes. Plant Cell. 2003, 15 (11): 2730-2741. 10.1105/tpc.016238.
Yant L, Mathieu J, Dinh TT, Ott F, Lanz C, Wollmann H, Chen X, Schmid M: Orchestration of the floral transition and floral development in arabidopsis by the bifunctional transcription factor APETALA2. Plant Cell. 2011, 22 (7): 2156-2170.
Shigyo M, Hasebe M, Ito M: Molecular evolution of the AP2 subfamily. Gene. 2006, 366 (2): 256-265. 10.1016/j.gene.2005.08.009.
Gébelin V, Argout X, Engchuan W, Pitollat B, Duan C, Montoro P, Leclercq J: Identification of novel microRNAs in hevea brasiliensis and computational prediction of their targets. BMC Plant Biol. 2012, 12 (2): 18-
Li XN, Xiao XH, Hu SN, Tang CR: Transcriptome profiling of the three tissues (latex, bark and leaf) of two Hevea brasiliensis clones (CATAS 7-33-97 and CATAS 8–79) using 454 GSFlx. 2010, Sanya, China: IRRDB Rubber Conference 2010, 539-540.
Ko JH, Chow KS, Han KH: Transcriptome analysis reveals novel features of the molecular events occurring in the laticifers of Hevea brasiliensis (para rubber tree). Plant Mol Biol. 2003, 53 (4): 479-492.
Lardet L, Dessailly F, Carron MP, Montoro P, Monteuuis O: Influences of aging and cloning methods on the capacity for somatic embryogenesis of a mature hevea brasiliensis genotype. Tree Physiol. 2009, 29 (2): 291-298.
Putranto R, Sanier C, Leclercq J, Duan C, Rio M, Sabau X, Argout X, Montoro P: Differential spatial gene expression in different types of hevea brasiliensis roots. Plant Sci. 2012, 183: 149-158.
Sambrook J, Fritsch EF, Maniatis T: Molecular cloning, a laboratory manual. CHS Press. 1989, 1 (7): 19-22.
Kush A, Goyvaerts E, Chye ML, Chua NH: Laticifer-specific gene expression in hevea brasiliensis (rubber tree). Proc Natl Acad Sci U S A. 1990, 87 (5): 1787-1790. 10.1073/pnas.87.5.1787.
Argout X, Fouet O, Wincker P, Gramacho K, Legavre T, Sabau X, Risterucci AM, Da Silva C, Cascardo J, Allegre M: Towards the understanding of the cocoa transcriptome: production and analysis of an exhaustive dataset of ESTs of theobroma cacao L. Generated from various tissues and under various conditions. BMC Genomics. 2008, 9: 512-10.1186/1471-2164-9-512.
Mituyama T, Yamada K, Hattori E, Okida H, Ono Y, Terai G, Yoshizawa A, Komori T, Asai K: The Functional RNA Database 3.0: databases to support mining and annotation of functional RNAs. Nucleic Acids Res. 2009, 37 (Database issue): D89-D92.
Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B: TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003, 19 (5): 651-652. 10.1093/bioinformatics/btg034.
Huang X, Madan A: CAP3: A DNA sequence assembly program. Genome Res. 1999, 9 (9): 868-877. 10.1101/gr.9.9.868.
Gotz S, Garcia-Gomez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, Robles M, Talon M, Dopazo J, Conesa A: High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008, 36 (10): 3420-3435. 10.1093/nar/gkn176.
Wasmuth JD, Blaxter ML: Prot4EST: translating expressed sequence tags from neglected genomes. BMC Bioinforma. 2004, 5: 187-10.1186/1471-2105-5-187.
Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, Bernard T, Binns D, Bork P, Burge S: InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res. 2012, 40 (Database issue): D306-D312.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410.
Wasmuth JD, Blaxter ML: Prot4EST: translating expressed sequence tags from neglected genomes. BMC Bioinforma. 2004, 5:
Gouzy J, Carrere S, Schiex T: FrameDP: sensitive peptide detection on noisy matured sequences. Bioinformatics. 2009, 25 (5): 670-671. 10.1093/bioinformatics/btp024.
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR: CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2011, 39 (Database issue): D225-D229.
Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.
Talavera G, Castresana J: Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007, 56 (4): 564-577. 10.1080/10635150701472164.
Gascuel O: BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data. Mol Biol Evol. 1997, 14 (7): 685-695. 10.1093/oxfordjournals.molbev.a025808.
Anders S, Huber W: Differential expression analysis for sequence count data. Genome Biol. 2010, 11 (10): R106-10.1186/gb-2010-11-10-r106.
R: A language and environment for statistical computing. http://www.R-project.org/,
Noirot C, Gaspin C, Schiex T, Gouzy J: LeARN: a platform for detecting, clustering and annotating non-coding RNAs. BMC Bioinforma. 2008, 9: 21-10.1186/1471-2105-9-21.
Dai X, Zhao PX: PsRNATarget: a plant small RNA target analysis server. Nucleic Acids Res. 2011, 39 (Suppl 2): W155-W159.
This work was supported by the Chinese Scholarship Council and the Institut Français du Caoutchouc. We are grateful to Dr Tetty Chaidamsari for her technical support for RNA isolation from the bark of trees. The authors thank Peter Biggins for his revision of the English.
The authors declare that they have no competing interests.
XA and MS carried out contig assembly and generated the transcript sequence database. CD and JFD carried out the phylogenetic analysis. CD and AC studied the features of the AP2 domain. K carried out field experiments. K and MR carried out RNA isolations. PP carried out design of primers and real-time RT-PCR analyses. VG and JL identified microRNAs and their target genes. JP carried out the DESeq statistical analysis. PM and CD planned the experiments. PM, CD, JFD and MS participated in drafting the manuscript. All the authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Amino acid sequences of partial AP2 domain from the 45 AP2/ERF members found in Xia et al. (2011).(TXT 3 KB)
Additional file 2: Figure S1: Alignment of the AP2/ERF domains from H.brasiliensis 115 ERF family proteins. Black and light gray shading indicate identical and conserved amino acid residues, respectively. Dark gray shading indicates conserved amino acid residues in group VI-L. Green color indicates the V14, E19 residue conserved ; blue color indicates the residue conserved in each group individually;pink color indicates the supplementary residue in group IX The black bar and block arrows represent predicted a-helix and b-sheet regions, respectively, within the AP2/ERF domain . Asterisks represent amino acid residues that directly make contact with DNA . (PPT 238 KB)
Additional file 3: Outputs from the DESeq analysis using read distribution for contigs of each tissue-type libraries.(XLSX 97 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Duan, C., Argout, X., Gébelin, V. et al. Identification of the Hevea brasiliensisAP2/ERF superfamily by RNA sequencing. BMC Genomics 14, 30 (2013). https://doi.org/10.1186/1471-2164-14-30
- Transcription factor