Genome-wide identification, organization and phylogenetic analysis of Dicer-like, Argonaute and RNA-dependent RNA Polymerase gene families and their expression analysis during reproductive development and stress in rice
© Kapoor et al; licensee BioMed Central Ltd. 2008
Received: 11 March 2008
Accepted: 01 October 2008
Published: 01 October 2008
Important developmental processes in both plants and animals are partly regulated by genes whose expression is modulated at the post-transcriptional level by processes such as RNA interference (RNAi). Dicers, Argonautes and RNA-dependent RNA polymerases (RDR) form the core components that facilitate gene silencing and have been implicated in the initiation and maintenance of the trigger RNA molecules, central to process of RNAi. Investigations in eukaryotes have revealed that these proteins are encoded by variable number of genes with plants showing relatively higher number in each gene family. To date, no systematic expression profiling of these genes in any of the organisms has been reported.
In this study, we provide a complete analysis of rice Dicer-like, Argonaute and RDR gene families including gene structure, genomic localization and phylogenetic relatedness among gene family members. We also present microarray-based expression profiling of these genes during 14 stages of reproductive and 5 stages of vegetative development and in response to cold, salt and dehydration stress. We have identified 8 Dicer-like (OsDCLs), 19 Argonaute (OsAGOs) and 5 RNA-dependent RNA polymerase (OsRDRs) genes in rice. Based on phylogeny, each of these genes families have been categorized into four subgroups. Although most of the genes express both in vegetative and reproductive organs, 2 OsDCLs, 14 OsAGOs and 3 OsRDRs were found to express specifically/preferentially during stages of reproductive development. Of these, 2 OsAGOs exhibited preferential up-regulation in seeds. One of the Argonautes (OsAGO2) also showed specific up-regulation in response to cold, salt and dehydration stress.
This investigation has identified 23 rice genes belonging to DCL, Argonaute and RDR gene families that could potentially be involved in reproductive development-specific gene regulatory mechanisms. These data provide an insight into probable domains of activity of these genes and a basis for further, more detailed investigations aimed at understanding the contribution of individual components of RNA silencing machinery during reproductive phase of plant development.
In plants, small RNAs have been widely implicated in varied developmental events and as guide RNAs in many gene silencing pathways [1–6]. These RNAs are usually generated by the activities of Dicers, Argonautes and RNA-dependent RNA polymerases (RDRs), which are also sometimes referred to as the core proteins mediating RNA interference. These proteins are involved in the initiation and maintenance of the trigger RNA that is central to this mode of gene regulation. Briefly, initiation of gene silencing involves generation of double stranded RNA (dsRNA) by several mechanisms, for example, bidirectional transcription of DNA, self-complementary RNA foldbacks or RNA-dependent transcription of aberrantly synthesized mRNAs . The complementary dsRNAs are then processed by the RNaseIII-type activities of Dicers into small RNAs, ~19–31 nucleotides in length (siRNA or miRNA). Generally, one strand of these RNAs then associates with the silencing effector complexes through the Argonaute proteins. This confers sequence-specific guide functions to these complexes that find target RNAs with sequences complementary to the small RNAs. Silencing/repression of the target genes then occurs by either blocking translation or cleavage of the target mRNA. These small RNAs could also mediate transcriptional gene silencing by recruitment of histone and/or DNA methyltransferases to regulatory sequences of the target genes [8, 9].
Dicers and Argonautes are multidomain ribonucleases. Dicers are characterized by the presence of six types of domains, viz., DExD-helicase, helicase-C, Duf283, PAZ, RNaseIII and double stranded RNA-binding (dsRB) domain . All Argonaute proteins share the domain structure that comprises of an N terminal, PAZ, Mid and a C-terminal PIWI domain. The PAZ domain (~100 a.a) facilitates binding of 3' end of siRNA, while, the PIWI domain binds the 5' end of siRNA and to the target RNA. This domain has marked similarity with RNaseH family of ribonucleases and it possesses the catalytic amino acid residues required for endonucleolytic cleavage of the target RNA . At least three subfamilies of Argonaute proteins have been identified in eukaryotes . These include the AGO subfamily that is present in plants, animals and yeasts; the Piwi subfamily that has been found only in animals and the worm-specific Argonaute (WAGO) subfamily that are present in C. elegans [13, 14, 6]. Members of both AGO and Piwi subfamilies possess the characteristic DDH metal binding signature residues in their Piwi domains, while most of the WAGO proteins lack them. The Piwi proteins are expressed specifically in the germline cells and are known to interact with a subset of small RNA called Piwi-interacting RNA that are longer (26–31 nt) than siRNA and miRNA (21–24 nt) . This clade of Argonaute proteins have not been identified in any plant species. Recently, however, a novel kind of Argonaute, OsMEL1 has been described in rice that is involved specifically in male meiosis . RNA-dependent RNA polymerase was first isolated from tomato . These proteins are required for initiation and amplification of silencing signal and they possess a conserved sequence motif that resembles the catalytic β' subunit of DNA-dependent RNA polymerases .
Multiple copies of Dicers, Argonautes and RDR genes are known to exist in both plants and animals. With the exception of fission yeast (Schizosaccharomyces pombe) that codes for only one copy each of the Dicer, the Argonaute and the RDR, these proteins are encoded by multigene families in insects, nematodes, mammals and plants . In Arabidopsis, 4 Dicer-like (DCL), 10 Argonautes and 6 RDR genes have been identified. Functions of few of these genes have also been elucidated. Arabidopsis Dicer-like 1 (DCL1), but not DCL2, 3 and 4, has been implicated in biogenesis of miRNA . ARGONAUTE1 (AGO1) regulates floral meristem and organ identity by affecting expression of LEAFY, APETALA1 and AGAMOUS genes. It also affects expression of the polycomb group protein, CURLYLEAF (CLF) that maintains repression of both KNOX and AGAMOUS in vegetative organs . In Drosophila, DICER-1 (Dcr-1) has been found to be essential for miRNA formation and it functions redundantly with Dcr-2 downstream of siRNA production. Dcr-2, on the other hand, has also been implicated in siRNA formation [21, 22]. In Neurospora crassa, distinct RDR and Argonaute proteins are required for meiotic silencing of unpaired DNA (in nucleus) and RNAi (in cytoplasm; ). In Caenorhabditis elegans, the 22 nucleotide endogenous miRNA encoded by lin-4 and let-7 genes are processed by Dcr-1 and these small RNAs regulate lin-14 and lin-41 linked timing of developmental stages in the nematode life cycle .
In rice, components of the RNAi machinery are known to be involved in maintenance of undifferentiated cells in shoot apical meristem (SAM), initiation of lateral organ primodia from SAM and floral meristems and formation of male and female germ cells [15, 25, 26]. The present investigation has been carried out with the aim to obtain comprehensive expression overview of all the members of rice Dicer-like, Argonaute and RDR gene families to gain insight into the domains of activity of these genes and to provide a firm foundation for further, more detailed investigations aimed at understanding the contribution of individual components of RNA silencing machinery in regulating gene expression during reproductive development of plants. For this, an in-house generated (previously described; ) rice microarray data set comprising of 17 stages of vegetative and reproductive development, along with 5 newly added tissues/developmental stages corresponding to Y-leaf, SAM and three very early stages of panicle initiation, was utilized. Where possible, the expression profiles of rice genes have been compared with those in Arabidopsis at similar stages of flower and seed development to identify genes with similar expression profiles in dicots.
Identification and structural organization of rice Dicer-like, Argonaute and RDR genes
Structural characteristics of Dicer-like, Argonaute and RNA dependent RNA Polymerase genes identified in rice
Accession Numbers of Gene Models
TIGR Locus ID
ORF Length (bp)
No. of Introns
TIGR (Release 5)
Mol. Wt. (Da)
1183184 – 1174351
21469900 – 21455783
8649185 – 8646060
39932566 – 39922935
18031152 – 18042777
25262283 – 25255247
10924931 – 10927291
14769389 – 14772506
27335674 – 27324616
28214275 – 28207729
35762233 – 35755714
31069900 – 31076078
31020218 – 31024093
31025382 – 31030370
9633946 – 9628308
3547911 – 3542867
4710731 – 4701446
33319166 – 33310479
32753781 – 32760315
9466401 – 9459149
19198976 – 19203042
23539785 – 23545931
3754861 – 3748703
27110631 – 27102684
27127103 – 27114562
16891936 – 16898240
RNA-DEPENDENT RNA POLYMERASES
19272378 – 19267489
23065706 – 23070916
5321158 – 5304556
30727018 – 30730508
5289905 – 5285242
Rice and Arabidopsis Argonaute proteins with missing catalytic residue(s) in PIWI domains
All the 5 RDR genes present in the rice genome were found to encode proteins that share a common sequence motif corresponding to the catalytic β' subunit of DNA-dependent RNA polymerases . The length of the open reading frames of OsRDR s varied from 1311 bp for OsRDR3 to 3657 bp for SHL2, with the coding potential of 437 and 1219 amino acids, respectively. Interestingly, the genomic sequence of SHL2 that encodes the longest open reading frame is interrupted by a single intron, while OsRDR4 that encodes the second longest open reading frame of 3621 bp has 18 intervening sequences as annotated in TIGR (release 5; Table 1).
Phylogeny and chromosomal localization
Phylogenetic analysis of rice, Arabidopsis, Zea mays, Populus trichocarpa, C. elegans and Schizosaccharomyces pombe RDR genes revealed that dicot and monocot RDR genes cluster into four clades, I, II, III and IV (Figure 2B). Members in each clade show monophyletic pattern of origin. The rice RDR genes have been named as OsRDR1 to 4 on the basis of sequence similarity with the corresponding proteins in Arabidopsis. LOC_Os01g10130 was named OsRDR3 and LOC_Os01g10140 has been named OsRDR4 on the basis of phylogeny and the results of homology scores of the rice genes with the corresponding Arabidopsis RDR3 and RDR4 genes in BLAST searches (data not shown). The RDR genes of C. elegans also originated from a common ancestral gene that diverged from the clades that gave rise to plant and the single yeast gene.
Plant DCL genes form a monophyletic group with OsDCL genes showing high sequence conservation with their counterparts in Arabidopsis (Figure 2C). Dicers from Drosophila (Dcr-1), nematode and humans form one group that is distinct from Drosophila Dicer2 and the only Dicer from S. pombe due to absence of the conserved PAZ domain in these proteins (Figure 2C). The newly identified OsDCL loci closely grouped with the highly similar AtDCL1 and OsDCL1. Similar associations were observed when only dsRBa domains of rice and Arabidopsis proteins were used to construct a phylogenetic tree (data not shown). Therefore, we have renamed OsDCL1 as OsDCL1a, and named the two new loci as OsDCL1b and OsDCL1c (Table 1, Figure 2C).
The OsDCL genes were distributed on 7 chromosomes (Figure 3). Except chromosome 3 that contains OsDCL1 and OsDCL2a, all the other chromosomes (1, 5, 6, 4, 9 and 10) have single representative of OsDCL. When these data were superimposed with those for segmental duplications (TIGR), OsDCL2a and OsDCL2b were found to lie in duplicated regions of chromosomes 3 and 9. It has been previously shown that at amino acid levels too, these two paralogs share 99% similarity while at the genomic level these genes differ by a 200 bp deletion within an intron and deletion of a part of Duf domain in OsDCL2b . OsDCL1a, OsDCL1b and OsDCL1c that share significant homology were localized on regions of chromosomes 3, 5 and 6, which are not included in segmental duplication database, suggesting that events other than large segmental duplications might be responsible for their evolution. The expressed sequence tags (EST) showing similarity to these rice Dicer-like genes are present in Zea mays and Saccharum officinarum (AZM5_100042, CA252311) but not in Arabidopsis, thereby suggesting that duplication of OsDCL1 may have occurred after the divergence of monocots and dicots (~60 my).
A gene pair was considered tandemly duplicated if the members were separated by less than five intervening genes and shared ≥ 40% sequence similarity at amino acid level. Three OsAGO gene pairs, (OsAGO4a-15; OsAGO2-3 and OsAGO11-12) localized close to each other on chromosomes 1, 4 and 3, respectively, thus are considered to have arisen due to tandem duplications (Figure 3). They share 55%, 54% and 51% identity at amino acid level between the pair partners. None of the OsDCLs or OsRDRs seems to have undergone tandem duplications.
Transcript profiling of genes during vegetative and reproductive development and abiotic stress
To analyze transcript abundance of Dicer-like, Argonaute and RDR genes at different stages of reproductive development in rice, microarray datasets generated in-house were utilized. The stages of vegetative and reproductive development analyzed in the present study are summarized [see Additional file 1]. Microarray hybridization was performed using the 57K Affymetrix GeneChip® Rice Genome Arrays as described previously . Normalization of data was done using GCRMA algorithm and was log2 transformed before differential expression analyses were undertaken. In the microarray dataset, > 99.9% of negative controls and non-rice probe-sets were found to have average signal intensity values less than 15. Therefore, the value '15' was considered as the cut off value to distinguish between expressed and non-expressed genes in a particular tissue/developmental stage.
Of the five OsRDR genes, four expressed both in vegetative and floral tissues with average intensity values of > 15 in any or all stages of vegetative, floral- and seed-development (Figure 4C). However, a significant increase in the transcript abundance for OsRDR1, OsRDR2 and OsRDR3 (56-, 4.2- and 26.6-fold, respectively, at p-value ≤ 0.05) was observed during stages of early panicle development (P1-I to P1-III) in comparison to their expression in mature leaves (Figure 4C) [see Additional files 3 and 4]. The expression of OsRDR1 was detectable up to S2 stage of seed formation and declined thereafter. The expression profile of the closely related Arabidopsis genes, RDR1 and RDR2, did not show significant increase in transcript levels as compared to rice genes during early flower development; nevertheless, their transcripts were detectable until early stages of seed development. Unlike any other corresponding gene in Arabidopsis, the expression of OsRDR4 (that did not show any detectable expression in vegetative tissues except SAM) was found to be up-regulated by 36 folds in early stages of panicle development in comparison to mature vegetative tissues.
RNA interference plays an important role in regulating gene expression at post transcriptional level during vegetative and reproductive development in plants . This process is mediated by concerted activities of proteins like, Dicers, Argonautes and RNA-dependent RNA polymerases. In the present investigation, we have identified the genes encoding these proteins families in rice and studied their distribution on rice chromosome pseudomolecules. Phylogenetic analysis has shed light on the evolution of members in each gene family in rice and has provided insight into multiplicity of Argonaute genes in rice. Microarray based expression analysis at different stages of reproductive development in rice plants was attempted with the view to understand the contribution and developmental timings of activity of these proteins.
Rice Dicer-like genes
While there is only one Dicer in mammals and nematodes that is involved in processing all sizes of small RNAs, four DCL genes found in Arabidopsis seem to have specialized in the type of small RNAs they process . For example, DCL1 and DCL4 are required for biogenesis of 21 nt small RNAs that correspond mainly to miRNA and tasiRNA, DCL2 affects accumulation of 21 nt RNA that protects against viral infection and DCL3 acts to produce the 24 nt RNAs that mediate de novo DNA methylation, gene silencing and chromatin modification [19, 30]. We have identified all four classes of DCL genes in rice, that total eight in number. While duplication of DCL2 genes (OsDCL2a and OsDCL2b), which share more than 90% similarity at the amino acid level, has also been reported in Arabidopsis and Populus trichocarpa. However, duplication of DCL3 that gave rise to OsDCL3a and OsDCL3b is considered specific to monocots and predates the divergence of rice and maize . In the present study, microarray based expression pattern analysis of rice and Arabidopsis DCL genes at different stages of reproductive development shows conservation in expression pattern of genes in dicot and monocot plants, however, duplication events in rice may also have given rise to genes with novel expression (and probably functional) profiles.
Null mutants of AtDCL1 exhibit pleiotropic developmental defects and are lethal due to loss of accumulation of miRNAs. Henderson et al. (2006) also demonstrated that miRNAs dominate the pool of small RNAs (67.5%) accumulated in dcl2/dcl3/dcl4 triple mutants . In rice too, it has been shown that loss-of-function of OsDCL1 but not OsDCL4 affects processing of miRNA . Consistent with the roles of miRNAs in regulation of various plant developmental processes, AtDCL1 was found to express ubiquitously in all tissues. The presence of three DCL1 like genes in rice with divergent expression profiles is suggestive of diversification of DCL1 function in rice.
AtDCL3 together with RDR2 in Arabidopsis is known to affect accumulation of small RNAs that mediate de novo DNA methylation and transgene silencing. In rice, of the two OsDCL3s, OsDCL3a was previously considered as the functional counterpart of AtDCL3 on the basis of overall homology and conservation of dsRBb domain. On the other hand, the dsRBb domain in OsDCL3b was so divergent from that of AtDCL3 that it was suggested to categorize it as an altogether different, fifth class of dicer-like protein . AtDCL3 and OsDCL3b show low level expression in vegetative tissues with enhanced transcript accumulation in floral tissues and early stages of seed development. However, OsDCL3b expression was confined only to early (pre-meiotic) stages of panicle development. Hence, on the basis of expression profiles AtDCL3 and OsDCL3b share similar expression domains, although the encoded proteins possess highly divergent dsRBb domains and could therefore be interacting with different RNA substrates.
Arabidopsis DCL4 has been shown to function redundantly with DCL2 and DCL3 to process SHL2(OsRDR6) generated double stranded RNA into tasiRNA The RNA template for SHL2 is in turn produced by the activity of miR173 and miR390 generated by DCL1, HYL1, HEN1 and AGO1 . The expression patterns of rice genes, OsDCL4, SHL2 and OsAGO1 members (OsAGO1a, OsAGO1b, OsAGO1c and OsAGO1d) matched that of the corresponding genes in Arabidopsis especially in exhibiting enhanced expression during early stages of floral/panicle development. This further suggests that the diversification in function, as speculated from their expression pattern, of most plant DCL genes may have occurred after the split of plants and animal lineages but before the divergence of monocots and dicots.
Multiple AGOs and their roles in rice plant development
Argonautes form an evolutionarily conserved gene family whose members are present in both single celled and multicellular eukaryotes. Multiple genes have been reported in various organisms. Rice has the largest number of Argonautes among plants, almost double the number reported in Arabidopsis and second only to C. elegans that possesses 27 AGOs . Rice genes with similar/overlapping expression domains with Arabidopsis AGO1, (OsAGO1a, OsAGO1b, OsAGO1c and OsAGO1d) AGO5, (OsAGO13, OsAGO14 and OsMEL), and AGO4, (OsAGO4a and OsAGO4b), have been identified in the present study. Most of these genes appear to have evolved by duplication events (tandem or segmental) followed by differentiation of expression patterns. These duplication events may also have given rise to redundancy among OsAGO genes as is evident from the overlapping expression profiles of OsAGO1a, OsAGO1b, OsAGO1c and OsAGO1d and OsAGO4a and OsAGO4b genes. On the other hand, OsMEL1 and OsAGO14, that appear to have arisen by duplication of segments in chromosomes 3 and 7, exhibit contrasting expression profiles during panicle and seed development.
Argonautes are highly basic RNA binding proteins characterized by presence of PAZ and PIWI domains. The PIWI domain adopts an RNAse-like fold and has predicted endonuclease activity (slicer). It binds to 5' end and also interacts with the target RNA [31, 32]. The PAZ domain interacts with 3' of small RNAs and is the candidate region that determines the specificity of Argonautes. Consistent with this, homology among various rice AGOs was observed to be minimum in the PAZ regions (~21%) while their PIWI domains exhibited more than 90% conservation of amino acids. The endonuclease property of AGO proteins involved in RNAi resides in the PIWI domain that possesses three conserved metal-chelating amino acids (DDH). Subfamilies of AGO proteins, primary and secondary Argonautes, have been described in C. elegans on the basis of presence or absence of these catalytic amino acid residues . Five of the Arabidopsis AGO proteins, AGO2, AGO3, AGO4, AGO6 and AGO9 lack either the critical DDH or the H798 residue. In rice too, eleven genes do not code for the conserved catalytic residues in their PIWI domains. Of these, OsAGO3, OsAGO5a, OsAGO5b and OsAGO15, are expressed specifically or preferentially in reproductive tissues. Absence of catalytic amino acids could inhibit the processing of target RNA by endonucleolytic cleavage in these proteins. These proteins could, therefore, require accessory factors for mediating mRNA turnover. However, it has been reported that an aspartate at the third position of the catalytic triad in related RNase proteins, Integrase and Tn5, can chelate the divalent metal ions as efficiently as histidine residue and restore the catalytic activity [32, 33]. Accordingly, AGO2 and AGO3 of rice and Arabidopsis that possess DDD could still function as slicer components of silencing effector complexes. In rice, while OsAGO2 is ubiquitously expressed, OsAGO3 along with OsAGO14, OsAGO13 and OsMEL1 are expressed specifically in reproductive tissues. These genes can, therefore, be considered as candidate genes regulating expression of endogenous genes via RNA interference that may or may not involve cleavage of target RNA as their mode of activity.
In rice, knockout studies by three groups have provided insights into the functions of three OsAGO genes, OsMEL1, SHL4/SHO2 and OsPNH1 [15, 26, 34]. OsMEL1 is a unique AGO protein that regulates meiosis in germ cells and proper development of male and female gametes by chromatin modification probably mediated by H3K9 methylation. Whether OsMEL1 directly interacts with regulatory small RNAs to bring about desired changes at the chromatin is not yet known. OsMEL1 also has a duplicated partner, OsAGO14, located on chromosome 7 that does not show any expression in vegetative tissues and is specifically expressed at stages beginning from P3 in panicles and until S2 stage in seeds. Furthermore, diversity in their PAZ domains suggests that these two proteins could interact with different classes of small RNAs, while retaining the mode of processing the target RNA by virtue of the catalytic DDH motif in their PIWI domains.
The rice SHL4, SHL2 and SHO1 genes have been shown to be involved in initiation and formation of SAM during rice embryogenesis. This is mediated by miRNA and tasiRNA regulation of homeodomain-leucine zipper (HD-ZIPIII) and ETT/ARF gene families . OsPNH1, is the third gene whose functional analysis revealed that it affects the development of SAM and leaf development . This gene is closely related to OsAGO1.
Rice RDR genes in panicle development
RNA-dependent RNA polymerases are the regulatory components of RNAi machinery, which enhance the potency of RNAi by amplifying the aberrant RNA population . These proteins are required for both cytoplasmic gene silencing that is triggered by transgenes or viral infection, as well as, nuclear gene silencing for transposons and inverted repeats in the genome. In Arabidopsis, 6 genes encode RDRs out of which only 3 genes have been shown to be involved in viral defense, chromatin silencing and PTGS. RDR6, also known as SGS2 and SDE1 amplifies improper terminated and unpolyadenylated RNAs generated from transgenes or inverted repeats to trigger degradation of complementary RNA species . Genetic studies in Arabidopsis have also revealed that RDR6 functions in the same pathway as AGO7 and Asymmetric leaves 1 and 2 (AS1 and AS2) to control adaxial/abaxial patterning in leaves . This role of RDR6 appears to be evolutionarily conserved in dicot and monocot lineages as in rice it was recently reported that SHL2 along with SHL4, and SHO1 affect siRNA and tasiRNA mediated regulation of endogenous genes involved in SAM and leaf development . Arabidopsis RDR2 is required for combating viral infection through PTGS and is involved in siRNA mediated de novo methylation of direct repeats. Recently, it has been shown that the homolog of Arabidopsis RDR2 in maize, MOP1, is involved in paramutation of b1 locus . The MOP1 gene maintains a threshold level of RNA encoded by repeat sequences that modify chromatin at b1 locus. This gene is also closely related to OsRDR2. The role of OsRDR2 is not known yet, however its expression was observed to overlap with Arabidopsis RDR2 at earlier stages of flower development and it phylogenetically relates to both maize and Arabidopsis genes. It will therefore be interesting to study the role of this gene in rice to validate the speculation of conservation of their roles, based on their similar expression patterns, in dicots and monocots.
Regulation of gene expression at post-transcriptional level plays a critical role in plant development. RNA interference mediated by activities of Dicers, Argonautes and RNA-dependent RNA polymerases is an important regulatory process that checks transcript accumulation in cells. These components are also shared by other gene-silencing pathways. Dicer-like, Argonautes and RDRs are encoded by small multigene families and rice encodes the largest number of these genes among the plant species analyzed so far. Phylogenetic analysis and localization on rice pseudomolecules have revealed that duplication of genes, both segmental and tandem, have contributed to increase in number of these genes in rice. However, function of only a few of these genes has been established in both rice and Arabidopsis. A key objective of this study was to generate and compile an expression profile data set to facilitate selection of candidate genes for validation of their roles during reproductive phase of plant development and abiotic stress responses. More than 20 genes belonging to these three gene families were observed to express preferentially/specifically during panicle and seed development. This work has provided insights into the probable domains of activity of these genes.
Identification of genes, chromosomal localization and phylogenetic analysis
Name search and Hidden Markov Model (HMM) analysis was employed to search for Dicer-like, Argonautes and RDRs genes encoded in the rice genome. The sequences were downloaded from TIGR, release 5 http://www.tigr.org. An HMM profile was generated using HMMER 2.1.1 software package http://hmmer.janelia.org/ which was then used to search the proteome database of rice available at TIGR using the Basic Local Alignment Search Tool, BLAST with filter off. Amino acid sequences of all the rice genes were downloaded and conserved domains were searched using the Simple Modular Architecture Research Tool (SMART) version 3.4 or National Center for Biotechnology Information Conserved Domain Database, NCBI-CD . The newly identified genes in this study were named on lines of nomenclature used for the previously identified genes and on the basis of their phylogenetic relatedness to other members of the same family. ORF length, and details of encoded proteins (length, PI, molecular weight) were downloaded from TIGR. For proteins whose molecular weight and PI was not available in TIGR, Gene Runner program version 3.04 was used to calculate the same. Genes were localized on the chromosomes based on their chromosomal positions on psedudomolecules as mentioned in TIGR. For phylogenetic analysis, total protein sequences were downloaded from TIGR and aligned using ClustalX 1.83 program. An unrooted neighbor-joining tree was constructed in ClustalX using default parameters . Bootstrap analysis was performed using 1000 replicates.
Plant tissue for all the panicle and seed stages as well as mature leaf and Y-leaf was collected from field grown rice plants (Oryza sativa ssp. indica var. IR64). Rice seedlings were subjected to abiotic stress treatments, viz. salt, drought and cold as described previously .
Microarray Hybridization and Data Analysis
Affymetrix GeneChip® Rice Genome Arrays representing 49,824 rice transcripts were used to prepare a compendium of transcriptome profiles for 22 stages of vegetative and reproductive development and stress response in rice. Of these, microarray analysis of 17 stages was described previously ; deposited in the Gene Expression Omnibus database at the National Center for Biotechnology Information under the series accession numbers GSE6893 and GSE6901). Here, five more stages viz, Y-leaf, SAM (shoot apical meristem), P1-I (< 2 mm panicle), P1-II (0.2 to 0.5 mm panicle) and P1-III (5 to 10 mm panicle) have been added to emphasize changes in gene expression patterns during initial stages of panicle differentiation and floral primodia emergence in addition to stages of floral organ- and seed development. Total RNA from these five stages was isolated by using TRIzol method (Invitrogen Inc., USA; ). After checking the quality on agarose formaldehyde gels, the RNA samples were quantified using spectrophotometer (ND-1000, Nanodrop). Five micrograms of RNA with 260:280 ratios of 1.9–2.0 and 260:230 ratios more than 2.0 was used for cDNA synthesis. Labeling and hybridizations were carried out according to Affymetrix manual for one-cycle target labeling (Affymetrix, Santa Clara, CA). Hybridization was performed in GeneChip® Hybridization Oven 640 for 16 hours at 45°C and 60 rpm. GeneChips were washed and stained with streptavidin-phycoerythrin using the fluidics protocol EukGE_WS2V5_450 in Affymetrix fluidic station model 450. Finally, chips were scanned using the Gene-Chip® Scanner 3000-6G.
Sixty six cell intensity (CEL) files generated by GeneChip Operating Software (GCOS) were further analyzed using Arrayassist™ version 5.0 (Stratagene, La Jolla). Data were normalized using GC-RMA algorithm and log2 transformed. To get the expression values, averages of three biological replicates were used. The expression data for Dicer-like, Argonaute and RDR genes was extracted by using unique probe-set IDs mentioned for each gene in TIGR. Wherever more than one probe set was available for one gene, the probe set designed from 3' end was given preference. Cluster analysis on rows of expression values was performed by using Euclidean distance metric, and Ward's Linkage rule of hierarchical clustering. Differential expression analysis was performed by taking mature leaf as reference to identify genes expressing at more than two-fold level in different stages of reproductive development (panicle and seed), with p-values ≤ 0.05. Similarly, for identifying stress-induced genes, differential expression analysis was performed by taking seedling as reference with no correction applied and p-values less ≤ 0.05 [see Additional file 3]. Since the Affymetrix microarrays used in this study did not contain probes representing OsAGO15, data for this gene was extracted from rice MPSS database http://mpss.udel.edu/rice.
Arabidopsis expression analysis
To analyze the expression of Arabidopsis genes, Affymetrix GeneChip® ATH1 Genome Array data for 21 stages (55 .cel files) comparable to that used for rice were downloaded from Gene Expression Omnibus (GEO) database at the NCBI under the series accession numbers GSE5620, GSE5621, GSE5623, GSE5624, GSE5629, GSE5630, GSE5631, GSE5632 and GSE5634. The data were imported in ArrayAssist™ (Stratagene, La Jolla, CA) microarray analysis software wherein GCRMA algorithm was used for normalization and Log2transformation. [see Additional file 4]. The downstream processing and generation of heat maps for selected genes was as described for rice.
Real time PCR reactions were carried out by using the same RNA samples, which were used for microarrays as described earlier . In brief, primers were designed preferentially from 3' end of the genes by using PRIMER EXPRESS version 2.0 (PE Applied Biosystems, USA) with default parameters. Each primer was checked by using BLAST tool for homology with other regions of the genome. 4 μg of total RNA was used for first strand cDNA synthesis in 100 μl reaction volume by using high-capacity cDNA Archive kit (Applied Biosystems, USA). Diluted cDNA samples were used for Real time PCR analysis with 200 nM of each primer mixed with SYBR Green PCR master by using ABI Prism 7000 Sequence Detection System and software (PE Applied Biosystems, USA). Actin was used as endogenous control to normalize the variance among samples. Relative expression values were calculated after normalizing against the maximum expression value. These data were further normalized with the normalized expression values obtained from microarrays and bar charts plotted by using Microsoft Excel.
TIGR database resources are acknowledged for making available the detailed sequence information on rice. Expression data for Arabidopsis has been obtained from GEO database at NCBI. Senior Research fellowship by the Council for Scientific and Industrial Research (CSIR) to R.A. and A.N. is also acknowledged. The project has been funded by the Department of Science and Technology and Department of Biotechnology, Government of India.
- Hamilton AJ, Baulcombe DC: A species of small antisense RNA in posttranscriptional gene silencing in plants. Science. 1999, 286 (5441): 950-952. 10.1126/science.286.5441.950.PubMedView ArticleGoogle Scholar
- Volpe TA, Kidner C, Hall IM, Teng G, Grewal SI, Martienssen RA: Regulation of heterochromatic silencing and histone H3 lysine-9 methylation by RNAi. Science. 2002, 297 (5588): 1833-1837. 10.1126/science.1074973.PubMedView ArticleGoogle Scholar
- Hall IM, Noma K, Grewal SI: RNA interference machinery regulates chromosome dynamics during mitosis and meiosis in fission yeast. Proc Natl Acad Sci USA. 2003, 100 (1): 193-198. 10.1073/pnas.232688099.PubMedPubMed CentralView ArticleGoogle Scholar
- Mochizuki K, Fine NA, Fujisawa T, Gorovsky MA: Analysis of a piwi-related gene implicates small RNAs in genome rearrangement in tetrahymena. Cell. 2002, 110 (6): 689-699. 10.1016/S0092-8674(02)00909-1.PubMedView ArticleGoogle Scholar
- Chen X: A microRNA as a translational repressor of APETALA2 in Arabidopsis flower development. Science. 2004, 303 (5666): 2022-2025. 10.1126/science.1088060.PubMedView ArticleGoogle Scholar
- Faehnle CR, Joshua-Tor L: Argonautes confront new small RNAs. Curr Opin Chem Biol. 2007, 11 (5): 569-577. 10.1016/j.cbpa.2007.08.032.PubMedPubMed CentralView ArticleGoogle Scholar
- Matzke MA, Birchler JA: RNAi-mediated pathways in the nucleus. Nat Rev Genet. 2005, 6 (1): 24-35. 10.1038/nrg1500.PubMedView ArticleGoogle Scholar
- Wassenegger M, Heimes S, Riedel L, Sanger HL: RNA-directed de novo methylation of genomic sequences in plants. Cell. 1994, 76 (3): 567-576. 10.1016/0092-8674(94)90119-8.PubMedView ArticleGoogle Scholar
- Xie Z, Johansen LK, Gustafson AM, Kasschau KD, Lellis AD, Zilberman D, Jacobsen SE, Carrington JC: Genetic and functional diversification of small RNA pathways in plants. PLoS Biol. 2004, 2 (5): E104-10.1371/journal.pbio.0020104.PubMedPubMed CentralView ArticleGoogle Scholar
- Margis R, Fusaro AF, Smith NA, Curtin SJ, Watson JM, Finnegan EJ, Waterhouse PM: The evolution and diversification of Dicers in plants. FEBS Lett. 2006, 580 (10): 2442-2450. 10.1016/j.febslet.2006.03.072.PubMedView ArticleGoogle Scholar
- Rivas FV, Tolia NH, Song JJ, Aragon JP, Liu J, Hannon GJ, Joshua-Tor L: Purified Argonaute2 and an siRNA form recombinant human RISC. Nat Struct Mol Biol. 2005, 12 (4): 340-349. 10.1038/nsmb918.PubMedView ArticleGoogle Scholar
- Hall TM: Structure and function of argonaute proteins. Structure. 2005, 13 (10): 1403-1408. 10.1016/j.str.2005.08.005.PubMedView ArticleGoogle Scholar
- Girard A, Sachidanandam R, Hannon GJ, Carmell MA: A germline-specific class of small RNAs binds mammalian Piwi proteins. Nature. 2006, 442 (7099): 199-202.PubMedGoogle Scholar
- Yigit E, Batista PJ, Bei Y, Pang KM, Chen CC, Tolia NH, Joshua-Tor L, Mitani S, Simard MJ, Mello CC: Analysis of the C. elegans Argonaute family reveals that distinct Argonautes act sequentially during RNAi. Cell. 2006, 127 (4): 747-757. 10.1016/j.cell.2006.09.033.PubMedView ArticleGoogle Scholar
- Nonomura K, Morohoshi A, Nakano M, Eiguchi M, Miyao A, Hirochika H, Kurata N: A germ cell specific gene of the ARGONAUTE family is essential for the progression of premeiotic mitosis and meiosis during sporogenesis in rice. Plant Cell. 2007, 19 (8): 2583-2594. 10.1105/tpc.107.053199.PubMedPubMed CentralView ArticleGoogle Scholar
- Schiebel W, Pelissier T, Riedel L, Thalmeir S, Schiebel R, Kempe D, Lottspeich F, Sanger HL, Wassenegger M: Isolation of an RNA-directed RNA polymerase-specific cDNA clone from tomato. Plant Cell. 1998, 10 (12): 2087-2101. 10.1105/tpc.10.12.2087.PubMedPubMed CentralGoogle Scholar
- Iyer LM, Koonin EV, Aravind L: Evolutionary connection between the catalytic subunits of DNA-dependent RNA polymerases and eukaryotic RNA-dependent RNA polymerases and the origin of RNA polymerases. BMC Struct Biol. 2003, 3: 1-10.1186/1472-6807-3-1.PubMedPubMed CentralView ArticleGoogle Scholar
- Baumberger N, Baulcombe DC: Arabidopsis ARGONAUTE1 is an RNA Slicer that selectively recruits microRNAs and short interfering RNAs. Proc Natl Acad Sci USA. 2005, 102 (33): 11928-11933. 10.1073/pnas.0505461102.PubMedPubMed CentralView ArticleGoogle Scholar
- Henderson IR, Zhang X, Lu C, Johnson L, Meyers BC, Green PJ, Jacobsen SE: Dissecting Arabidopsis thaliana DICER function in small RNA processing, gene silencing and DNA methylation patterning. Nat Genet. 2006, 38 (6): 721-725. 10.1038/ng1804.PubMedView ArticleGoogle Scholar
- Kidner CA, Martienssen RA: The role of ARGONAUTE1 (AGO1) in meristem formation and identity. Dev Biol. 2005, 280 (2): 504-517. 10.1016/j.ydbio.2005.01.031.PubMedView ArticleGoogle Scholar
- Lee YS, Nakahara K, Pham JW, Kim K, He Z, Sontheimer EJ, Carthew RW: Distinct roles for Drosophila Dicer-1 and Dicer-2 in the siRNA/miRNA silencing pathways. Cell. 2004, 117 (1): 69-81. 10.1016/S0092-8674(04)00261-2.PubMedView ArticleGoogle Scholar
- Kennerdell JR, Yamaguchi S, Carthew RW: RNAi is activated during Drosophila oocyte maturation in a manner dependent on aubergine and spindle-E. Genes Dev. 2002, 16 (15): 1884-1889. 10.1101/gad.990802.PubMedPubMed CentralView ArticleGoogle Scholar
- Shiu PK, Metzenberg RL: Meiotic silencing by unpaired DNA: properties, regulation and suppression. Genetics. 2002, 161 (4): 1483-1495.PubMedPubMed CentralGoogle Scholar
- Grishok A, Pasquinelli AE, Conte D, Li N, Parrish S, Ha I, Baillie DL, Fire A, Ruvkun G, Mello CC: Genes and mechanisms related to RNA interference regulate expression of the small temporal RNAs that control C. elegans developmental timing. Cell. 2001, 106 (1): 23-34. 10.1016/S0092-8674(01)00431-7.PubMedView ArticleGoogle Scholar
- Liu B, Li P, Li X, Liu C, Cao S, Chu C, Cao X: Loss of function of OsDCL1 affects microRNA accumulation and causes developmental defects in rice. Plant Physiol. 2005, 139 (1): 296-305. 10.1104/pp.105.063420.PubMedPubMed CentralView ArticleGoogle Scholar
- Nagasaki H, Itoh J, Hayashi K, Hibara K, Satoh-Nagasawa N, Nosaka M, Mukouhata M, Ashikari M, Kitano H, Matsuoka M: The small interfering RNA production pathway is required for shoot meristem initiation in rice. Proc Natl Acad Sci USA. 2007, 104 (37): 14867-14871. 10.1073/pnas.0704339104.PubMedPubMed CentralView ArticleGoogle Scholar
- Arora R, Agarwal P, Ray S, Singh AK, Singh VP, Tyagi AK, Kapoor S: MADS-box gene family in rice: genome-wide identification, organization and expression profiling during reproductive development and stress. BMC Genomics. 2007, 8: 242-10.1186/1471-2164-8-242.PubMedPubMed CentralView ArticleGoogle Scholar
- Shi Z, Wang J, Wan X, Shen G, Wang X, Zhang J: Over-expression of rice OsAGO7 gene induces upward curling of the leaf blade that enhanced erect-leaf habit. Planta. 2007, 226 (1): 99-108. 10.1007/s00425-006-0472-0.PubMedView ArticleGoogle Scholar
- Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, Scholkopf B, Weigel D, Lohmann JU: A gene expression map of Arabidopsis thaliana development. Nat Genet. 2005, 37 (5): 501-506. 10.1038/ng1543.PubMedView ArticleGoogle Scholar
- Gasciolli V, Mallory AC, Bartel DP, Vaucheret H: Partially redundant functions of Arabidopsis DICER-like enzymes and a role for DCL4 in producing trans-acting siRNAs. Curr Biol. 2005, 15 (16): 1494-1500. 10.1016/j.cub.2005.07.024.PubMedView ArticleGoogle Scholar
- Parker JS, Roe SM, Barford D: Structural insights into mRNA recognition from a PIWI domain-siRNA guide complex. Nature. 2005, 434 (7033): 663-666. 10.1038/nature03462.PubMedPubMed CentralView ArticleGoogle Scholar
- Ma JB, Yuan YR, Meister G, Pei Y, Tuschl T, Patel DJ: Structural basis for 5'-end-specific recognition of guide RNA by the A. fulgidus Piwi protein. Nature. 2005, 434 (7033): 666-670. 10.1038/nature03514.PubMedPubMed CentralView ArticleGoogle Scholar
- Nowotny M, Gaidamakov SA, Crouch RJ, Yang W: Crystal structures of RNase H bound to an RNA/DNA hybrid: substrate specificity and metal-dependent catalysis. Cell. 2005, 121 (7): 1005-1016. 10.1016/j.cell.2005.04.024.PubMedView ArticleGoogle Scholar
- Nishimura A, Ito M, Kamiya N, Sato Y, Matsuoka M: OsPNH1 regulates leaf development and maintenance of the shoot apical meristem in rice. Plant J. 2002, 30 (2): 189-201. 10.1046/j.1365-313X.2002.01279.x.PubMedView ArticleGoogle Scholar
- Sijen T, Fleenor J, Simmer F, Thijssen KL, Parrish S, Timmons L, Plasterk RH, Fire A: On the role of RNA amplification in dsRNA-triggered gene silencing. Cell. 2001, 107 (4): 465-476. 10.1016/S0092-8674(01)00576-1.PubMedView ArticleGoogle Scholar
- Luo Z, Chen Z: Improperly terminated, unpolyadenylated mRNA of sense transgenes is targeted by RDR6-mediated RNA silencing in Arabidopsis. Plant Cell. 2007, 19 (3): 943-958. 10.1105/tpc.106.045724.PubMedPubMed CentralView ArticleGoogle Scholar
- Xu L, Yang L, Pi L, Liu Q, Ling Q, Wang H, Poethig RS, Huang H: Genetic interaction between the AS1–AS2 and RDR6-SGS3-AGO7 pathways for leaf morphogenesis. Plant Cell Physiol. 2006, 47 (7): 853-863. 10.1093/pcp/pcj057.PubMedView ArticleGoogle Scholar
- Alleman M, Sidorenko L, McGinnis K, Seshadri V, Dorweiler JE, White J, Sikkink K, Chandler VL: An RNA-dependent RNA polymerase is required for paramutation in maize. Nature. 2006, 442 (7100): 295-298. 10.1038/nature04884.PubMedView ArticleGoogle Scholar
- Schultz J, Milpetz F, Bork P, Ponting CP: SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci USA. 1998, 95 (11): 5857-5864. 10.1073/pnas.95.11.5857.PubMedPubMed CentralView ArticleGoogle Scholar
- Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.PubMedGoogle Scholar
- Nobuta K, Venu RC, Lu C, Belo A, Vemaraju K, Kulkarni K, Wang W, Pillay M, Green PJ, Wang GL: An expression atlas of rice mRNAs and small RNAs. Nat Biotechnol. 2007, 25 (4): 473-477. 10.1038/nbt1291.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.