Genome-wide analysis of coordinated transcript abundance during seed development in different Brassica rapa morphotypes
© Basnet et al.; licensee BioMed Central Ltd. 2013
Received: 25 June 2013
Accepted: 13 November 2013
Published: 1 December 2013
Brassica seeds are important as basic units of plant growth and sources of vegetable oil. Seed development is regulated by many dynamic metabolic processes controlled by complex networks of spatially and temporally expressed genes. We conducted a global microarray gene co-expression analysis by measuring transcript abundance of developing seeds from two diverse B. rapa morphotypes: a pak choi (leafy-type) and a yellow sarson (oil-type), and two of their doubled haploid (DH) progenies, (1) to study the timing of metabolic processes in developing seeds, (2) to explore the major transcriptional differences in developing seeds of the two morphotypes, and (3) to identify the optimum stage for a genetical genomics study in B. rapa seed.
Seed developmental stages were similar in developing seeds of pak choi and yellow sarson of B. rapa; however, the colour of embryo and seed coat differed among these two morphotypes. In this study, most transcriptional changes occurred between 25 and 35 DAP, which shows that the timing of seed developmental processes in B. rapa is at later developmental stages than in the related species B. napus. Using a Weighted Gene Co-expression Network Analysis (WGCNA), we identified 47 “gene modules”, of which 27 showed a significant association with temporal and/or genotypic variation. An additional hierarchical cluster analysis identified broad spectra of gene expression patterns during seed development. The predominant variation in gene expression was according to developmental stages rather than morphotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated in more detail the regulation of lipid metabolism. Four co-regulated gene clusters were identified with 17 putative cis-regulatory elements predicted in their 1000 bp upstream region, either specific or common to different lipid metabolic pathways.
This is the first study of genome-wide profiling of transcript abundance during seed development in B. rapa. The identification of key physiological events, major expression patterns, and putative cis-regulatory elements provides useful information to construct gene regulatory networks in B. rapa developing seeds and provides a starting point for a genetical genomics study of seed quality traits.
Brassica rapa (2n = 2x = 20; AA) is an important crop that consists of diverse morphotypes (also called crop types), including oilseed (annual crops yellow sarson and brown sarson, and biannual winter oils), leafy vegetables (Chinese cabbage, pak choi and many non-heading leafy types), turnip (fodder and vegetable turnip) and broccoletto. It contributes the A-genome to the amphidiploid oil crop canola (B. napus L; n = 19; AACC). Yellow sarson and brown sarson are grown for oil production in the Indian sub-continent, and in Canada, because of their early maturity and shatter resistance. Brassica seed is important for both plant propagation and oil production.
Brassica seed is non-endospermic, which means that the endosperm is not retained in mature seeds and only the embryo is enclosed by the seed coat . Seed development goes through basically three overlapping stages: morphogenesis, seed filling and seed desiccation [2, 3]. Embryo development, also known as embryogenesis, starts after the double fertilization process of fusion of two sperm nuclei with the egg cell and the central cell nuclei, respectively, and the zygote goes through a series of cell divisions and differentiation events from a pre-globular and globular embryo stage, a heart stage, a torpedo stage, a bent-cotyledon stage to the mature embryo [3, 4]. Embryogenesis consists of two phases; morphogenesis and seed filling, as the seeds are non-endospermic.
Seed development goes through a complex network of many dynamic developmental, biochemical and metabolic processes such as cell division and differentiation, carbohydrate, protein, cell wall, lipid, amino acid, hormone and secondary metabolite biosynthesis . Several hundreds of genes are reported to be involved in spatial and temporal regulation of these metabolic processes. A systematic overview of metabolic processes and gene expression patterns during seed development has been well documented for the closely related model plant Arabidopsis thaliana[5–7]. In B. napus, transcript profiling was mainly reported in relation to oil biosynthesis and storage seed reserves [2, 8]. For oil biosynthesis, starch is synthesized at the early seed developmental stage, but after intermediate processes such as malonyl-CoA and fatty acid biosynthesis, converted into triacylglycerol (TAG), lipids and storage proteins during the seed-filling phase at a later stage of seed development in both A. thaliana and B. napus[5, 9, 10]. A starchless mutant contained up to 40% less lipids in mature Arabidopsis seed than the wild-type, while starch was undetectable . Starch turnover, breakdown of cytosolic and plastidic glycolytic pathways, malonyl-CoA and fatty acid (FA) synthesis, TAG assembly and oil body formation takes place during TAG synthesis in seed . The plant hormones gibberellin, auxin, ethylene and abscisic acid (ABA) play key regulatory roles in seed development and growth [12, 13] and changes in hormonal levels affect the seed size and seed number in B. napus, especially during the 10–20 days after pollination (DAP) period . Transcription factors, for example, ABI3 (Abscisic acid insensitive-3), ABI4, ABI5, LEC1 (leafy cotyledon1), LEC2 and FUS3 (FUSCA3) are important regulators of the complex gene network during the process of seed development, maturation and germination [15, 16].
Understanding the regulatory mechanisms of seed development is essential to identify the molecular basis of seed development. Transcript profiling of developing seeds has been a widely used strategy to identify functional genes and their regulatory elements for seed development that can be used as tools in breeding programs for seed quality traits. Transcriptomics provides a powerful tool and is widely used to examine the temporal and spatial changes in transcript abundance during seed development in Arabidopsis[4, 7, 10, 17], B. napus[2, 18, 19], wheat [20, 21], maize [22, 23], barley , rice [13, 25], soybean , Jatropha and many other crops. So far, we are not aware of any studies connecting global gene expression profiles to seed developmental stages in the diploid Brassica species B. rapa. The release of the whole-genome sequence of B. rapa morphotype Chinese cabbage var. Chiifu  facilitates genomic studies, such as gene expression analysis and genetical genomics studies . The knowledge on changes in gene expression associated with specific stages of seed development is crucial to unravel the molecular and biochemical events that influence optimal seed metabolite composition . Timing of major transition stages differs between metabolic pathways (carbohydrates, fatty acids, storage proteins) and also between species. The higher number of differentially expressed sequence tags (ESTs) at 15 DAP than at 25 DAP in B. napus suggest that most developmental changes take place at 10–20 DAP . Major changes in gene expression profiles of genes involved in protein translation, starch metabolism and hormonal regulation were reported between 17–21 DAP in B. napus, whereas fatty acid synthesis related genes were highly expressed at 21 DAP as compared to earlier and later time points . In developing B. napus spring cultivar seeds, 20 DAP was the most active stage to measure variation in transcript abundance of genes related to the biosynthesis of starch, lipids, carotenoids, isoprenoids, proteins and storage reserves .
Recently, genetical genomics has become a powerful tool to find candidate genes for complex traits [28, 30], such as seed quality and seedling vigour traits . In this approach variation in transcript abundance is considered as quantitative traits in quantitative trait loci (QTL) analyses per gene, resulting in identification of genomic regions regulating gene expression (called expression quantitative trait loci: eQTL). It is important to find an optimum stage during seed development for eQTL mapping studies, where large numbers of genes show differences in transcript abundance between genotypes in a segregating population. To obtain a comprehensive insight into transcriptional changes during seed development in B. rapa, we carried out morphological characterization and global transcriptome analysis in a time range of developing seeds of a black/brown-seeded pak choi vegetable-type (PC175), a yellow-seeded oil-type yellow sarson (YS143) and both a yellow and a black/brown-seeded doubled haploid (DH) progeny line from their cross. In this study, we first describe embryo and seed morphological changes in time. Second, the differential expression profiles of genes from different metabolic pathways and transcription factors in developing seeds of the four genotypes are presented. Third, a window around the optimum seed development stage was defined based on genotypic and developmental transcriptomic profiles for more extended gene expression studies. Fourth, we investigated the regulation of lipid metabolism in more detail. Using a comparative analysis of gene expression networks among these four different genotypes, we explore the differential gene expression profiles and conserved regulatory mechanisms for seed development across these morphotypes of the diploid crop species B. rapa.
Morphology of developing seeds and embryos
Real-time gene expression profiling in developing seeds
The transcript abundance of 10 selected genes from a few key metabolic processes and transcription factors (Additional file 1: Figure S1) was measured from 10 to 60 DAP to obtain an overview of gene expression patterns during seed development. Three patterns were observed, with peak levels at 10–25 DAP, 25–40 DAP and 35–60 DAP which are defined as early-, mid- and later- stage, respectively (Additional file 1: Figure S1). These patterns were not very different among the four genotypes tested. Out of the ten genes, transcription factors LEC1 and Glabra2 and the starch gene GBSSI were expressed higher during earlier stages, lipid metabolism genes DGAT2 and FAE1, and storage protein 12S-CRA1 were expressed higher at mid stages, while the lipid metabolism gene DGAT1 (also called TAG1), carbohydrate metabolism gene SUS3, and the storage protein LEA and CHD3-chromatine-remodeling factor PICKLE were expressed highest at late stages. For the whole genome microarray gene expression profiling, six time points were selected: (i) 18 DAP (torpedo), (ii) 20 DAP (bent-cotyledon), (iii) 25 DAP (transition bent-embryo fully fills seed), and the developmental stages where the embryo fully fills the seed, being (iv) 30 DAP (v) 35 DAP and (vi) 40 DAP. These time points captured transcriptional changes at early, mid and late stages of seed development.
Microarray hybridization and probe annotation
In a dedicated B. rapa Agilent array, 61,546 probes (99.7% of total 61,654 probes) represent 42,162 Brassica rapa gene ID (called Bra ID). Out of 42,162 Bra IDs, 30,363 Bra IDs (72%) were assigned to 34 MapMan functional annotation categories. The remaining 11,799 (28% ) Bra IDs were not assigned to any functional category (Additional file 2: Table S1).
Pearson correlation coefficients were calculated to quantify how similar transcript abundance was between time-points and also between four replicates in each genotype (YS143, PC175, DH42 and DH78). All the replicates of each genotype from each time point had high correlations (r > 0.95) in all four genotypes (Additional file 3: Figure S2A-B). The correlation coefficients between time points decrease as the time points increase. Pearson correlation coefficients of transcript abundance between time-points were high (r > 0.9) from 18 to 25 DAP in PC175, and from 18 to 30 DAP in YS143, DH42 and DH78, but after those time points a transition from high (r > 0.95) to lower (r < 0.85) correlation coefficients occurs between early and later time points.
Correlation of transcript abundance of genes from real-time PCR and microarray analysis
Since transcript abundance was measured using two different techniques: qRT-PCR and microarray that might lead to a non-linear relationship, Spearman’s rank correlation coefficients, which are free from parametric assumptions, were used to compare the outcome of these two techniques. The transcript abundance from qRT-PCR and microarray of 10 selected genes were significantly and positively correlated except for transcription factors LEC1 and CHD3-chromatine-remodeling factor PICKLE. The rank correlation coefficients ranged from 0.43 for DGAT2 to 0.94 for LEA protein (Additional file 4: Table S2).
Genome-wide variation in transcript abundance during seed development
We investigated the loading values of probes on PC1, where probes with very low negative loadings were associated with the early stage of seed development (18–25 DAP) while the probes with very high positive loading were in response to later stages (35–40 DAP) (Figure 2). Among 34 MapMan functional categories, probes with high positive or low negative loadings mainly belong to metabolic pathways such as photosynthesis, cell wall metabolism, lipid metabolism, amino acid metabolism, protein metabolism, signalling, RNA (RNA processing, RNA binding and transcription factors), stress, transport, developmental processes, hormone metabolism, phosphate metabolism and secondary metabolism (Additional file 5: Figure S3).
Apparent changes in numbers of selected probes in contrasts between developmental stages or genotypes lead to selection of metabolic pathways
After excluding probes with rather constant transcript levels (< 2-fold change) across seed development and between genotypes, 11,244 probes (18.2% of total 61,554 probes) were retained for further analysis (Additional file 6: Table S3). Based on either a high number of selected probes per pathway or apparent changes in the number of selected probes from contrasts between consecutive time points or between genotypes at each time point, the top thirteen metabolic pathways were emphasized in this study. These top thirteen metabolic pathways correspond to metabolic pathways highlighted based on higher PC1 and PC2 loadings in PCA analysis. Those top thirteen metabolic pathways are represented by 9606 probes (i.e. 5520 Bra ID) and used for network analysis to separate the gene clusters according to temporal (4178 probes) and/or genotypic variation (3169 probes) during seed development (Additional file 7: Table S4).
Signed weighted gene co-expression network analysis (WGCNA) identifies gene modules associated with temporal and or genotype effects
Signed WGCNA grouped the selected probes (> 2 fold-change) into 47 co-expression gene modules, each one containing probes with a similar transcript abundance across genotypes and seed developmental stages. In an analysis of variance (ANOVA) test, 17 gene modules (3169 probes) showed a genotype effect, 4 modules (4179 probes) a time effect, and 6 modules (555 probes) a genotype as well as a time effect at 0.001 significance level and the remaining 20 gene modules did not show any effect (Additional file 8: Table S5; Additional file 9: Figure S4A-C). Since some of the gene modules showed similar expression patterns with subtle differences, gene modules were combined according to the time or genotype or time and genotype effects, and subjected to hierarchical clustering to have a broader overview of the patterns of transcript abundance.
Temporal variation across seed development stages
Putative cis-regulatory elements underlying co-expressed genes of lipid metabolism
We looked in more detail to changes in transcript abundance related to lipid metabolism because oil is the major storage compound of Brassica seeds. B. rapa and B. napus are widely grown for oil production, while B. rapa is also grown as vegetable crop. Therefore, it is interesting to know the variation in transcript abundance of genes related to oil biosynthesis during seed development in oil-type and non-oil type morphotypes. For this study, two genotypes: a yellow-seeded oil-type genotype YS143 and a black/brown-seeded vegetable-type genotype PC175 were chosen. In addition, two DH progeny lines, a yellow-seeded and a black/brown-seeded line, resembling the two parental lines were also used to develop ideas on segregation of transcript abundance of oil biosynthesis genes. In Additional file 11: Figure S6, the pathway for oil biosynthesis is depicted, with acetyl-CoA as the main precursor for the synthesis of fatty acids (FA), triacylglycerol (TAG) and phospholipids. Transcript abundance was visualized separately for genes involved in FA synthesis, FA elongation, lipid degradation, FA desaturation, biosynthesis of TAG and phospholipids, and oleosin (oil bodies).
In the process of FA synthesis and elongation, transcript abundance of genes revealed patterns with either a clear temporal effect or with a clear genotype effect (Additional file 12: Figure S7A:I-II). Transcript abundance of 63% of FA synthesis and FA elongation related probes was high at early stages (18–30 DAP), followed by a gradual decrease, while other probes (37%) show clear genotype differences with higher transcript abundance in the two progeny lines (DH42 and DH78) as compared to the parental genotypes. FA desaturation genes such as ADS1, FAD6 and FAD7 were up-regulated before 30 DAP, but FAD3 and ADS2 genes including FAD6 and FAD7 paralogs were up-regulated after 25 DAP (Additional file 12: Figure S7B). Triacylglycerides are the main constituents of vegetable oil and expressed at late stages of seed development. Genes involved in triacylglycerol biosynthesis, such as DGAT-1 and −2, GRP (glycine rich protein) and oleosin (storage proteins) were mainly up-regulated after 25 DAP (Additional file 12: Figure S7D).
For lipid degradation, four different patterns of transcript abundance were observed. A set of probes (18.5%) had high transcript abundance at later stages of seed development (after 25 DAP) (Additional file 12: Figure S7C: I-IV), while a larger number of probes (40.7%) showed higher transcript abundance at earlier stages before 30 DAP (Additional file 12: Figure S7C: II). Additional file 12: Figure S7C: III consists of a set of probes (13%) with high transcript abundance only at 35 and 40 DAP. Probes (27.8%) from Additional file 12: Figure S7C: IV showed genotype differences in transcript abundance with lower levels in parental genotypes PC175 and YS143, than in the DH lines.
A set of the genes functionally related and/or co-expressed often share conserved regulatory motifs, which might be responsible for coordinated expression of the set of genes. In this study, genes related to lipid metabolism with different co-expression patterns (different clusters) were searched to computationally predict cis-acting regulatory elements for potential roles in regulating lipid metabolism during seed development in B. rapa species. For all the selected 194 B. rapa genes (> absolute 2-fold change), the 1000 bp upstream sequence from the gene start were retrieved.
List of overrepresented motifs identified in promoter regions (1000 bp upstream) of genes involved in FA synthesis and elongation, FA desaturation, FA degradation and triacylglycerol (TAG) synthesis
Dof2, Dof3, MNB1A, PBF
High mobility group
High mobility group
Genotypic variation in overall metabolism
Genotypic as well as temporal variation in overall metabolism
The understanding of morphological and transcriptional changes during seed development has fundamental applications in Brassica breeding, both for high quality vegetable oil content and for crop establishment. In this study, we focused on analysis of morphological characteristics and global transcriptome analysis in developing seeds of four genotypes including two diverse B. rapa morphotypes: a leafy-type pak choi and an annual yellow-seeded oil-type yellow sarson. We also predict putative regulatory elements for lipid metabolism to understand this complex regulatory network during seed development.
Seed morphology varies at the later stages of seed development
Seed developmental stages, which are defined based on the shape of the embryo, were similar in both YS143 and PC175, irrespective of apparent differences in phenological characteristics, such as flowering time or seed colour in the two distant morphotypes pak choi and yellow sarson (Figure 1). However, the colour of embryo differed among these two genotypes at early stages (in period 15–25 DAP, PC175 embryos are yellowish, while YS143 embryos are green); and at later stages (at 40 DAP PC175 embryo’s turn from green to yellow, while YS143 embryo’s turn yellow only at 55 DAP). Also, seed coat colour changes differed among these two morphotypes, as the seed coat of PC175 turns from green to brownish at 40 DAP, while the YS143 seed coat turns yellowish at 50 DAP (Figure 1). Also in the two DH lines, the black/brown-seeded line DH78 lost the green colour earlier than the yellow-seeded DH42. Yellow seed colour is a desired quality trait in breeding Brassica oilseed species, because of its association with higher oil content and more easily digestible seed meal as compared to dark coloured seeds. The accumulation of proanthocyanidins (PAs) in the seed coat of immature black/brown seeds (20 DAP) but not in yellow seed  might be an explanation for the earlier change in seed colour. In this study, we observed that the embryo completely filled the seed at the bent-cotyledon stage (30 DAP); also Li et al.,  described that this stage was not yet reached at 25 DAP, but fully reached at 35 DAP in B. campestris (Synonymous: B. rapa). Brassica seed is non-endospermic, so, the endosperm is not retained in mature seeds, but only the embryo is enclosed by the seed coat . Evaluation of transcript abundance using real-time PCR was effective to define six time-points when abundance levels of a set of genes representative for the seed filling process varied with respect to their morphology: 18 DAP (torpedo), 20 DAP (bent- cotyledon), 25 DAP (transition bent-embryo fully fills seed) and 30 DAP, 35 DAP and 40 DAP (embryo fully fills the seed).
Seed developmental stages are the predominant cause for variation in transcript abundance
Genome-wide transcriptome analysis was used to explore global gene expression at six time points as representative stages for seed development in four genotypes of B. rapa. Despite the fact that B. rapa is an important vegetable and/or oil crop, this is the first study in which transcript abundance was profiled genome-wide during seed development in this species. The availability of whole genome sequence of B. rapa facilitated the design of a 60-mer oligonucleotide microarray platform (62,654 probes targeting 42,162 Brassica genes) based on predicted gene models from the genome sequence.
We used four approaches to define sets of genes with different transcript abundance during seed development in time (developmental stages) or between genotypes or both. First PCA was used to obtain an overview of variation in seed developmental stages and also between different genotypes using all the transcripts present in the microarray (Figure 2). The first principal component (PC1: 38.8% explained variance) captured mostly temporal variation in transcript abundance, supporting the earlier findings that seed developmental stages are major sources of transcriptional and metabolic variation in Arabidopsis[7, 33]. A comparative study of the transcript and metabolite profiles in both wild-type and transgenic genotypes of Arabidopsis also showed more variation across seed developmental stages than changes due to genotypic differences . The genotypic variation was captured in PC2 (15.6% explained variance), which suggests that metabolic processes inside developing seed are largely conserved, even between yellow-seeded oil and black/brown-seeded genotypes. Secondly, we selected a subset of genes with variation in transcript abundance patterns between developmental stages as well as between genotypes based on PCA loadings with a minimum two fold change criterion for further analysis. These subsets of genes represent the most active metabolic processes occurring in B. rapa developing seeds, such as photosynthesis, hormonal regulation, stress tolerance, cell wall, lipid, phosphate, amino acid, protein, signal transduction, transport, secondary metabolites, developmental process, and RNA processing and regulation of transcription (Additional file 7: Table S4). Those selected metabolic processes were also reported as major metabolic processes during seed development in close relatives A. thaliana and B. napus, but also in maize . Thirdly, a WGCNA approach was used to discover possible modules consisting of groups of genes with similar transcript abundance, either across time or between genotypes of both, and 27 modules out of a total of 47 modules showed significant variation in transcript abundance across time points or genotypes or their combinations (Additional file 8: Table S5; Additional file 9: Figure S4A-C). Since WGCNA uses Pearson correlation coefficients to identify co-expressed modules, it could not group genes that have similar patterns of transcript abundance but different levels into separate modules. So, in addition a separate hierarchical clustering using Euclidean distance was done in all gene modules according to the type of effects. The combined analysis using both Pearson correlation coefficients with WGCNA and hierarchical clustering with Euclidean distance resulted in clusters that are both similar in transcript abundance and level among genotypes across time points. Finally, we focused on transcriptional profiling related to lipid metabolism, in order to correlate co-expression patterns within pathways and to predict putative regulatory elements of lipid metabolism.
Global variation in transcript abundance: 25–35 DAP is a key period for major changes in B. rapadeveloping seed
In PCA, the early time points, before the embryo fills the seed (25 DAP), cluster tightly in PC1 but the later time points (35–40 DAP) cluster loosely, suggesting that physiological processes differentiate more at later stages. Higher correlations (r > 0.9) between the early time-points within genotypes and decreasing correlations between later stages also supports that there is more variation in transcript abundance at later stages (after 25 DAP) than at earlier stages (Additional file 3: Figure S2A-B). Variation in metabolite content, seed maturity, desiccation and dormancy induction occurred during the maturation phase , which corresponds to 25 DAP in this study. Interestingly, sequential changes in transcript abundance follow developmental changes in the black/brown-seeded genotypes (PC175 and DH78) but an extreme shift from 30 to 35 DAP and reversed at 40 DAP occurred in yellow-seeded genotypes (YS143 and DH42). This signifies the different transcriptome signatures of seed development in different genotypes, especially at the later stage. These findings are in agreement with a different timing of seed and embryo colour changes from 40 DAP onwards (Figure 1). The spatial position of the two DH lines between the two distant parental genotypes in the PC2 dimension points to variation in transcript abundance that can be used for genetic studies.
The largest changes in transcript abundance during seed development were observed during 25–35 DAP (bent-cotyledon to stage when embryo fully fills the seed), suggesting that this is the most optimal stage for genetical genomics studies for mapping eQTL in B. rapa developing seeds. In contrast, for B. napus, the major transcriptional transition was reported to be much earlier during heart-shaped to torpedo embryo stages i.e. 17–21 DAP, and for FA synthesis-related genes at 21 DAP in a spring and winter type B. napus L. cv HuYou15 .
Temporal changes in transcript abundance conserved across different morphotypes
The WGCNA method is a powerful and widely used tool to identify co-expressed gene clusters and to construct scale-free networks using topological properties of network construction . Among 47 gene modules identified, four (4179 probes) show temporal variation in transcript abundance across seed development (Additional file 8: Table S5, Additional file 9: Figure S4B), and these were reduced to three clusters after hierarchical clustering using Euclidean distance (Figure 3). This result, like PCA, confirms that variation in transcript abundance during seed development is predominantly conserved across genotypes in B. rapa. Similar observations were made for FA biosynthesis genes, which were conserved between B. napus and A. thaliana. The annotations of many genes belonging to these three clusters fitted what is known about different processes occurring during seed development. Among the three clusters, cluster I (48% genes) had high transcript abundance before 25 DAP with a gradual decrease till 35 DAP, with genes involved in photosynthesis, secondary metabolic pathways, and biosynthesis of tocopherols, mevalonate and carotenoids, and amino acids were over-represented. Amino acids are known as essential precursors for biosynthesis of secondary metabolites, proteins and other metabolic biosynthetic processes. Tocopherols are fat-soluble antioxidants and are one of the breeding goals to improve oil quality. Tocopherols accumulate slowly during 12–41 DAP and reach a maximum concentration during 41-53 DAP in developing seeds of B. napus. It has been suggested that production of tocopherols during seed development might be needed for the protection of polyunsaturated fatty acids against peroxidation . In cluster II (21% of genes with transcript abundance differences in time) and cluster III (31% genes) transcript abundance increased gradually or abruptly at 35–40 DAP, respectively (Figure 3). In these clusters, cytochrome P450, late embryogenesis abundant proteins (LEA), LTP (lipid transfer protein) and storage proteins, and abscisic acid and ethylene (hormone metabolism) were over-represented. This observation is in agreement with a number of other studies where storage proteins, abscisic acid and ethylene were highly expressed during late seed developmental stages because of their roles in growth and development of seed tissues, accumulation of seed reserves, maturation, desiccation tolerance, induction of seed dormancy and the utilization of storage reserves to support germination [1, 2, 12, 14, 39].
Gene co-expression patterns associated with genotypic differences, or genotype- and temporal differences
WGCNA analysis organized 3169 probes associated with genetic variation into 17 gene modules (3169 probes) (Additional file 8: Table S5; Additional file 9: Figure S4A), which could be represented by three gene clusters (cluster IV to VI) through hierarchical clustering (Figure 6). These clusters reveal genetic variation in patterns of transcript abundance during seed development, with distinct variation between the two parents with many genes showing transgressive segregation in DH lines.
Similarly, sets of genes (555 probes) displayed variation in transcript abundance due to both genotype and time contrasts in six gene modules (Additional file 8: Table S5; Additional file 9: Figure S4C). Four different patterns were identified in hierarchical clustering, mainly either with a gradual decrease in transcript abundance from early stages to late stages or a continuous increase across seed development (Figure 7). The leafy-type PC175 usually showed different patterns of transcript abundance compared to the other three genotypes (Figure 7A, 7C-E, 7G-H), while variation in transcript abundance of the two DH lines is more similar to that of the maternal genotype YS143. This could be due to maternal effects on seed and seed characteristics, as reported before in another study .
Predicting cis-regulatory elements for co-expressed genes related to lipid metabolism
Brassica species are widely cultivated for seed oil, and seed oil is also a major source of energy during germination and seedling growth. Thus, we want to get an insight in the genetic regulation of lipid metabolism in both oil- and vegetable- morphotypes. First, we defined pathways, such as FA synthesis and elongation, FA desaturation, lipid degradation, triacylglycerol. The co-expression analysis identified clusters of genes in the respective pathways with different transcript abundance. For example, FA synthesis and elongation related genes shared a similar time-dependent (high at 18–25 DAP, decrease thereafter) and a genotype-dependent transcript abundance (Additional file 12: Figure S7A). Lipid degradation related genes showed four different patterns of transcript abundance. However, triacylglycerol and FA desaturation biosynthesis processes were highly conserved with similar transcript abundance, increasing during late stages or early to middle stages of development respectively, among all four studied genotypes (Additional file 12: Figure S7B, D).
All these different sets of co-expressed genes in different pathways can be regulated by common or specific regulatory elements. The prediction of putative regulatory elements in co-regulated genes can increase our understanding of seed development and results in tools to breed for improved oil content. Transcription factors play regulatory roles not only in seed development but also in lipid metabolism  and transcription factor binding sites (or cis-regulating elements) are usually located in upstream regulatory regions of genes.
The ABI4 binding motif was shared by genes from the triacylglycerol biosynthesis pathway, FA desaturation and lipid degradation (Additional file 12: Figure S7C: III-IV), which were all up-regulated 25 DAP. Motif ABI4 was reported as an important cis-regulator of the DGAT gene of triacylglycerol biosynthesis [39, 41] and repressor of lipid degradation , and is known for its role during seed maturation, seed size, seed germination and seedling growth. The AAAG binding domain was conserved in motifs Dof2, Dof3, PBF and MNB1A (DOF family) and was found specifically in triacylglycerol biosynthesis genes in our seed samples. The roles of DOF genes are in activating seed storage protein genes during seed development and germination in rice , barley , maize , wheat  and Arabidopsis. The interwoven connection of different regulatory motifs in Figure 5 supports the fact that target genes are regulated by multiple interacting TFs. The interaction between Dof proteins and HMG proteins was reviewed in maize seed . Similarly, the other identified motifs, in this study, that belong to the bZIP, MADS-box, MYB family, beta-beta-alpha zinc finger families, as well as unknown motifs, likely play roles in regulating gene expression during seed development and maturation in B. rapa. Some motifs reported in Arabidopsis seed that are similar to our findings, such as AG, ABI4, squamosa, bZIP and PEND for triacylglycerol biosynthesis genes, and HMG-1 and Gamyb for FA synthesis genes . Moreover, they also reported many more motifs than our findings, and in addition, several motifs observed for triacylglycerol biosynthesis in our study were reported for FA synthesis in this study or vice versa. The possible explanations for finding different numbers of motifs with some disagreement could be (i) the sequence form 1000 bp upstream plus the UTR region was used by , but we considered only 1000 bp upstream sequences because the majority of cis-regulatory elements are located in this region , and (ii) the use of different motif finding tools; TFBS  and fdrMotif  by  but MEME tool  in this study. The different tools use different algorithms and that could lead to some differences in finding motifs . Besides the UTR region and the 1000 bp upstream region, cis-regulatory elements can also be located in the downstream sequence, in the gene’s introns or in neighbouring genes’ introns  and consideration of these genomic regions can potentially improve in finding TFs binding motifs.
A morphological characterization of developing embryos and seeds of two different morphotypes of Brassica rapa, a pak choi and a yellow sarson, showed that the seed developmental stages based on the shape of the embryo were similar in both morphotypes, but the colour of embryo and seed coat differed at both earlier (15–25 DAP) and later stages (after 40 DAP). Analysis of transcript abundance measured with qRT-PCR of ten selected genes from different metabolic processes suggested to use six time points (18, 20, 25, 30, 35 and 40 DAP) for a global gene expression study using microarrays. In this study, done on pak choi, yellow sarson, and two doubled haploid lines from their cross we found that most changes in transcript abundance occur between 25 and 35 DAP, suggesting that the timing of metabolic processes during seed development in B. rapa is later than in B napus. We identified 47 gene modules of which 17 showed genotypic variation in transcript abundance, 4 showed temporal variation and 6 showed both temporal and genotypic variation. This study shows that temporal transcriptional variation is more dominant than morphotype or genotype differences. Since lipids are the major storage compounds of Brassica seeds, we investigated putative cis-regulatory elements of co-regulated gene clusters involved in lipid metabolism. In total 17 putative cis-regulatory elements were predicted in 1000 bp upstream region, which are either specific for or common to four co-regulated gene clusters. This study provides detailed information on transcriptional changes during Brassica seed development and provides a starting point for a genetical genomics study of seed quality traits.
Plant materials and monitoring seed development
For this study two different B. rapa morphotypes were used; an oil-type yellow sarson (YS143) and a vegetable-type pak choi (PC175), as well as two DH lines (DH42 and DH78) from a cross of parental genotypes YS143 and PC175. These two parental morphotypes were selected based on their genetic distance, different plant phenology, flowering time and metabolite content in the seed (Additional file 14: Table S6). The two progeny DH lines, which also differ in morphological characteristics such as seed colour, flowering time and metabolite content were also included in this study (Additional file 14: Table S6). Three plants of parental genotypes and a single plant of each DH line was grown in a heated greenhouse under 16/8 hours light/dark from February to June, 2010 at Wageningen UR. Flowers were tagged the day they opened, assuming self-pollination on the day of flower opening. PC175 and other self-incompatible DH lines of the population were manually bud pollinated to get enough seed. For each genotype, siliques were harvested at 15 time points: 10, 15, 16, 17, 18, 20, 21, 25, 30, 35, 40, 45, 50, 55 and 60 DAP. About 100–150 seeds were excised from the seed pods, frozen in liquid nitrogen and used for RNA isolation. Randomly five seeds from each genotype at each time point (developmental stage) were dissected under the binocular stereo microscope at 1.6x magnification and pictures were taken using Axio Vision Rel. 4.8 software (Carl Zeiss Imaging Solutions, Wrek, Göttingen, Germany) to observe the morphological characteristics of embryos and seeds at each time point.
Siliques harvested at defined stages were kept in liquid nitrogen (−196°C), and around 100–150 seeds were extracted under dry ice and ground in liquid nitrogen (−196°C). For real-time PCR, RNA was isolated using KingFisher Flex system (Thermo Scientific, Finland) and Ambion’s MagMAX™-96 Total RNA isolation kit according to the manufacturer’s instruction and RNA pellets were dissolved in nuclease-free water. For microarray, RNA isolation was done using Trizol reagent according to the manufacturer’s instructions (Invitrogen, Burlington, ON, Canada) followed by DNase treatment (AmpGrade I, Invitrogen, Burlington, ON, Canada) and a purification step (RNeasy Mini Kit, Qiagen). The quantity of RNA was determined by NanoDrop ND-100 UV–VIS spectrophotometer and quality was assessed by A260/A280 and A260/A230 ratio (NanoDrop Technologies, Inc., Wilmington, DE, USA) as well as by 1% agarose gel.
Quantitative real-time PCR (qRT-PCR)
Ten genes involved in major metabolic processes of seed development according to the literature were selected to measure transcript abundance across seed development stages ranging from 10 to 60 DAP using real time-PCR (Additional file 15: Table S7). These candidate genes represent fatty acid biosynthesis (DGAT1, DGAT2 and FAE1), carbohydrate metabolism (GBSSI and SuSy3), storage proteins (12S-CRA1 and LEA), transcription factors (LEC1 and Glabra2) and one CHD3-chromatine-remodeling factor (PICKLE). The detailed procedure of qRT-PCR and normalization is described in Additional file 16. The normalized transcript abundance (∆∆CT) of each gene for each sample was determined with respect to the reference gene β-actin. We use the term gene expression for this normalized transcript abundance in this paper. In order to identify common profiles of transcript abundance across the seed development stages, genes were grouped using hierarchical cluster analysis with Euclidean distance of normalized data (∆∆CT). Transcript abundance of ten genes obtained from real-time PCR were visualized using a heatmap tool in Additional file 1: Figure S1.
Microarray probe design
The whole genome sequence of B. rapa cv. Chiifu (a leafy vegetable inbred line) is publicly available . We designed microarray probes for two-colour Agilent microarray platform based on the predicted gene models of the reference genome sequence. In this custom array, 61,654 probes were assembled, which represent 40,879 (99.74%) B. rapa gene IDs (Bra ID) and 108 (0.26%) scaffold IDs with no assignment of Bra ID (Additional file 2: Table S1). All the probes were annotated into 35 different functional categories or “BINS” as defined by MapMan software (Additional file 17). MapMan is an open source software tool to categorize and display functional genomics data .
Experimental design for microarray hybridization
Microarray hybridization was done on developing seeds from four genotypes; the two parents (YS143 and PC175) and two DH lines (DH42 and DH78) at six time points: 18, 20, 25, 30, 35 and 40 DAP. Two independent experiments were done to compare two parental genotypes (hereafter, called experiment A) and two DH lines (hereafter, called experiment B). Cy3 and Cy5 dyes were incorporated into cRNA samples according to the Agilent two-colour microarray based gene expression analysis (Low input quick Amp labelling G4140-90050) protocol (Agilent Technologies, Inc., Santa Clara, CA, USA) and hybridized on arrays following a double-loop design (Additional file 18: Figure S9A-B). In one array, two samples from the two consecutive time points of the same genotype or two genotypes from the same time point were hybridized. The same hybridization scheme was used for experiment B using the two DH lines. In both experiments A and B, each sample was hybridized four times generating four technical replicates. Loess was used for within-array normalization and quantile normalization for between-array normalization using the limma package in R . The normalized Cy3 and Cy5 intensities were used as measures of transcript abundance and are sometimes referred to as gene expression in this paper.
Microarray data analysis
The aim of this study was to explore the effects of seed developmental stages, genotypic variation or both on transcript abundance of genes with special focus on important metabolic processes. Principal components analysis (PCA) was used to examine the global profiles of transcript abundance of the four B. rapa genotypes across six seed developmental stages.
For further analyses, we excluded probes with little variation in transcript abundance across seed development as well as between genotypes using a minimum two-fold change threshold (in absolute value). Fold change differences were calculated in contrasts between two consecutive time points (18 vs. 20, 20 vs. 25, 25 vs. 30 and 35 vs. 40 DAP) as well as between two pairs of genotypes (YS143 vs. PC175 and DH42 vs. DH78) per time point. In this study, we emphasized the metabolic processes that have either a high number of selected probes or apparent changes in the number of selected probes among time point or genotype contrasts for further analysis.
WGCNA is a widely used correlation-based network construction method to construct a scale-free network . A signed WGNCA approach was applied in this study to find gene co-expression modules, so-called “gene modules” while keeping track of positive or negative correlation coefficients, where each gene module represents a group of genes having similar co-expression patterns across seed developmental stages or genotypes or their combinations. WGCNA first calculates Pearson’s correlation matrix of all genes, and transforms the correlation matrix into an adjacency matrix by raising all values to a soft threshold power β (default value 12) to emphasize strong correlations and penalize weaker correlations on an exponential scale. Then, the adjacency matrix is transformed into a topological overlap matrix (TOM), which summarizes the degree of shared connections between any two genes, and then converted into a dissimilarity matrix. A hierarchical cluster of genes is created based on a dissimilarity matrix and finally, gene co-expression modules were defined from the cluster dendrogram at a threshold of 0.2 dissimilarity value using the dynamic tree-cutting algorithm. Once gene modules were identified, the “Module Eigengene” (ME; the first principal component of the expression values across subjects) was calculated using all probes in each gene module. The module eigengene represents the expression profiles of all probes from a gene module across subjects (i.e. genotypes at each time point), and high or low eigengene values of subjects correspond to over- or under expression in the corresponding subjects, respectively. The details of this method are described in [36, 55], and the analysis was performed in R software using the WGCNA package . The module eigengene of each subject was examined to determine the effects of time or genotype or both using an ANOVA test. In this case, genotype and time were two independent factors and a module’s eigengene values as the response, consecutively for each module. The significance of the effects was determined at 0.001 FDR correction proposed by . The probes belonging to gene modules significant in ANOVA were grouped into three categories according to genotype or time or both genotype and time effect. Hierarchical clustering using Euclidean distance as a criterion for dissimilarity then was applied independently on the data sets of these three categories. From this hierarchical clustering, genes were broadly organized into clusters considering the height of the dendrogram, and each category was annotated with MapMan metabolic pathways. Fisher’s exact test was used to test for over- and under-representation of metabolic pathways in a selected cluster of genes using R software. If a particular pathway was significantly over- or under-represented in the gene cluster that indicates a statistically significant number of probes from the pathway are present in the gene clusters with specific patterns of gene expression across seed development stages over four genotypes .
We focused on discovering transcription factor binding sites or DNA motifs for the co-expressed genes of lipid metabolism. The 1000 bp upstream sequences of co-expressed Brassica genes from the transcription start site (TSS) were retrieved from Brassica database (http://brassicadb.org/brad/). Conserved DNA motifs were searched in the upstream regions using the expectation maximization algorithm implemented in MEME version 4.9.0 . Motifs with 6–12 nucleotides length were searched on both strands of the input sequence using both “zero or one occurrence per sequence” and “any number of repetitions” options. Motifs with and E-value ≤ 1 were used to assess similarity to known motifs using TOMTOM  in the JASPAR plant specific database . This plant specific JASPAR database was considered because of the potential roles of these motifs in regulating lipid metabolism during seed development in higher plants.
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files.
We thank the members of the Unifarm facility of Wageningen University and Research Centre (WUR) for taking care of the plants and all the necessary support. Authors highly appreciate the contributions of Aalt-Jan van Dijk, Plant Research International, Bioscience, Wageningen, the Netherlands for carefully evaluating motif prediction methods used in this study. Finally, the authors are thankful to the Centre for BioSystems Genomics (CBSG), Netherlands which is a part of the Netherlands Genomics Initiative (NGI) for partial financial support for this study.
- Sabelli PA: Seed development: a comparative overview on biology of morphology, physiology, and biochemistry between monocot and dicot plants. Seed Development: OMICS Technologies toward Improvement of Seed Quality and Crop Yield. Edited by: Agrawal GK, Rakwal R. 2012, Springer Netherlands, 3-25.View ArticleGoogle Scholar
- Yu B, Gruber M, Khachatourians GG, Hegedus DD, Hannoufa A: Gene expression profiling of developing Brassica napus seed in relation to changes in major storage compounds. Plant Sci. 2010, 178 (4): 381-389. 10.1016/j.plantsci.2010.02.007.View ArticleGoogle Scholar
- Li W, Gao Y, Xu H, Zhang Y, Wang J: A proteomic analysis of seed development in Brassica campestri L. PLoS ONE. 2012, 7 (11): e50290-10.1371/journal.pone.0050290.PubMed CentralView ArticlePubMedGoogle Scholar
- Le BH, Cheng C, Bui AQ, Wagmaister JA, Henry KF, Pelletier J, Kwong L, Belmonte M, Kirkbride R, Horvath S, et al: Global analysis of gene activity during Arabidopsis seed development and identification of seed-specific transcription factors. Proc Natl Acad Sci. 2010, 107 (18): 8063-8070. 10.1073/pnas.1003530107.PubMed CentralView ArticlePubMedGoogle Scholar
- Baud S, Boutin J-P, Miquel M, Lepiniec L, Rochat C: An integrated overview of seed development in Arabidopsis thaliana ecotype WS. Plant Physiol Biochem. 2002, 40 (2): 151-160. 10.1016/S0981-9428(01)01350-X.View ArticleGoogle Scholar
- Girke T, Todd J, Ruuska S, White J, Benning C, Ohlrogge J: Microarray analysis of developing Arabidopsis seeds. Plant Physiol. 2000, 124 (4): 1570-1581. 10.1104/pp.124.4.1570.PubMed CentralView ArticlePubMedGoogle Scholar
- Peng F, Weselake R: Gene coexpression clusters and putative regulatory elements underlying seed storage reserve accumulation in Arabidopsis. BMC Genomics. 2011, 12 (1): 286-10.1186/1471-2164-12-286.PubMed CentralView ArticlePubMedGoogle Scholar
- Jolivet P, Boulard C, Bellamy A, Valot B, D’Andréa S, Zivy M, Nesi N, Chardot T: Oil body proteins sequentially accumulate throughout seed development in Brassica napus. J Plant Physiol. 2011, 168 (17): 2015-2020. 10.1016/j.jplph.2011.06.007.View ArticlePubMedGoogle Scholar
- Jiang H, Wu P, Zhang S, Song C, Chen Y, Li M, Jia Y, Fang X, Chen F, Wu G: Global analysis of gene expression profiles in developing physic nut (Jatropha curcas L.) Seeds. PLoS ONE. 2012, 7 (5): e36522-10.1371/journal.pone.0036522.PubMed CentralView ArticlePubMedGoogle Scholar
- Niu Y, Wu G-Z, Ye R, Lin W-H, Shi Q-M, Xue L-J, Xu X-D, Li Y, Du Y-G, Xue H-W: Global analysis of gene expression profiles in Brassica napus developing seeds reveals a conserved lipid metabolism regulation with Arabidopsis thaliana. Mol Plant. 2009, 2 (5): 1107-1122. 10.1093/mp/ssp042.View ArticlePubMedGoogle Scholar
- Andriotis VME, Pike MJ, Schwarz SL, Rawsthorne S, Wang TL, Smith AM: Altered starch turnover in the maternal plant has major effects on Arabidopsis fruit growth and seed composition. Plant Physiol. 2012, 160 (3): 1175-1186. 10.1104/pp.112.205062.PubMed CentralView ArticlePubMedGoogle Scholar
- Bogatek R, Gniazdowska A: Ethylene in seed development, dormancy and germination. Annual plant reviews volume 44: the plant hormone ethylene. Edited by: McManus MT. 2012, Oxford, UK: Wiley-Blackwell, 189-218. 1View ArticleGoogle Scholar
- Xue L-J, Zhang J-J, Xue H-W: Genome-wide analysis of the complex transcriptional networks of rice developing seeds. PLoS ONE. 2012, 7 (2): e31081-10.1371/journal.pone.0031081.PubMed CentralView ArticlePubMedGoogle Scholar
- Walton LJ, Kurepin LV, Yeung EC, Shah S, Emery RJN, Reid DM, Pharis RP: Ethylene involvement in silique and seed development of canola, Brassica napus L. Plant Physiol Biochem. 2012, 58: 142-150.View ArticlePubMedGoogle Scholar
- Wang H, Guo J, Lambert K, Lin Y: Developmental control of Arabidopsis seed oil biosynthesis. Planta. 2007, 226 (3): 773-783. 10.1007/s00425-007-0524-0.View ArticlePubMedGoogle Scholar
- Santos-Mendoza M, Dubreucq B, Baud S, Parcy F, Caboche M, Lepiniec L: Deciphering gene regulatory networks that control seed development and maturation in Arabidopsis. Plant J. 2008, 54 (4): 608-620. 10.1111/j.1365-313X.2008.03461.x.View ArticlePubMedGoogle Scholar
- Ruuska SA, Girke T, Benning C, Ohlrogge JB: Contrapuntal networks of gene expression during Arabidopsis seed filling. The Plant Cell Online. 2002, 14 (6): 1191-1206. 10.1105/tpc.000877.View ArticleGoogle Scholar
- Dong J, Keller W, Yan W, Georges F: Gene expression at early stages of Brassica napus seed development as revealed by transcript profiling of seed-abundant cDNAs. Planta. 2004, 218 (3): 483-491. 10.1007/s00425-003-1124-2.View ArticlePubMedGoogle Scholar
- Beisson F, Koo AJK, Ruuska S, Schwender J, Pollard M, Thelen JJ, Paddock T, Salas JJ, Savage L, Milcamps A, et al: Arabidopsis genes involved in acyl lipid metabolism. A 2003 census of the candidates, a study of the distribution of expressed sequence tags in organs, and a web-based database. Plant Physiol. 2003, 132 (2): 681-697. 10.1104/pp.103.022988.PubMed CentralView ArticlePubMedGoogle Scholar
- Laudencia-Chingcuanco D, Stamova B, You F, Lazo G, Beckles D, Anderson O: Transcriptional profiling of wheat caryopsis development using cDNA microarrays. Plant Mol Biol. 2007, 63 (5): 651-668. 10.1007/s11103-006-9114-y.View ArticlePubMedGoogle Scholar
- Wan Y, Poole R, Huttly A, Toscano-Underwood C, Feeney K, Welham S, Gooding M, Mills C, Edwards K, Shewry P, et al: Transcriptome analysis of grain development in hexaploid wheat. BMC Genomics. 2008, 9 (1): 121-10.1186/1471-2164-9-121.PubMed CentralView ArticlePubMedGoogle Scholar
- Lee J-M, Williams M, Tingey S, Rafalski A: DNA array profiling of gene expression changes during maize embryo development. Funct Integr Genomics. 2002, 2 (1–2): 13-27.View ArticlePubMedGoogle Scholar
- Liu X, Fu J, Gu D, Liu W, Liu T, Peng Y, Wang J, Wang G: Genome-wide analysis of gene expression profiles during the kernel development of maize (Zea mays L.). Genomics. 2008, 91 (4): 378-387. 10.1016/j.ygeno.2007.12.002.View ArticlePubMedGoogle Scholar
- Druka A, Muehlbauer G, Druka I, Caldo R, Baumann U, Rostoks N, Schreiber A, Wise R, Close T, Kleinhofs A, et al: An atlas of gene expression from seed to seed through barley development. Funct Integr Genomics. 2006, 6 (3): 202-211. 10.1007/s10142-006-0025-4.View ArticlePubMedGoogle Scholar
- Zhu T, Budworth P, Chen W, Provart N, Chang H-S, Guimil S, Su W, Estes B, Zou G, Wang X: Transcriptional control of nutrient partitioning during rice grain filling. Plant Biotechnol J. 2003, 1 (1): 59-70.View ArticlePubMedGoogle Scholar
- Asakura T, Tamura T, Terauchi K, Narikawa T, Yagasaki K, Ishimaru Y, Abe K: Global gene expression profiles in developing soybean seeds. Plant Physiol Biochem. 2012, 52: 147-153.View ArticlePubMedGoogle Scholar
- Wang X, Wang H, Wang J, Sun R, Wu J, Liu S, Bai Y, Mun J-H, Bancroft I, Cheng F, et al: The genome of the mesopolyploid crop species Brassica rapa. Nat Genet. 2011, 43 (10): 1035-1039. 10.1038/ng.919.View ArticlePubMedGoogle Scholar
- Jansen RC, Nap J-P: Genetical genomics: the added value from segregation. Trends Genet. 2001, 17 (7): 388-391. 10.1016/S0168-9525(01)02310-1.View ArticlePubMedGoogle Scholar
- Hu Y, Wu G, Cao Y, Wu Y, Xiao L, Li X, Lu C: Breeding response of transcript profiling in developing seeds of Brassica napus. BMC Mol Biol. 2009, 10 (1): 49-10.1186/1471-2199-10-49.PubMed CentralView ArticlePubMedGoogle Scholar
- Gaffney D, Veyrieras J-B, Degner J, Pique-Regi R, Pai A, Crawford G, Stephens M, Gilad Y, Pritchard J: Dissecting the regulatory architecture of gene expression QTLs. Genome Biol. 2012, 13 (1): R7-10.1186/gb-2012-13-1-r7.PubMed CentralView ArticlePubMedGoogle Scholar
- Jordan MC, Somers DJ, Banks TW: Identifying regions of the wheat genome controlling seed development by mapping expression quantitative trait loci. Plant Biotechnol J. 2007, 5 (3): 442-453. 10.1111/j.1467-7652.2007.00253.x.View ArticlePubMedGoogle Scholar
- Li X, Chen L, Hong M, Zhang Y, Zu F, Wen J, Yi B, Ma C, Shen J, Tu J, et al: A large insertion in bHLH transcription factor BrTT8 resulting in yellow seed coat in Brassica rapa. PLoS ONE. 2012, 7 (9): e44145-10.1371/journal.pone.0044145.PubMed CentralView ArticlePubMedGoogle Scholar
- Fait A, Angelovici R, Less H, Ohad I, Urbanczyk-Wochniak E, Fernie AR, Galili G: Arabidopsis seed development and germination is associated with temporally distinct metabolic switches. Plant Physiol. 2006, 142 (3): 839-854. 10.1104/pp.106.086694.PubMed CentralView ArticlePubMedGoogle Scholar
- Angelovici R, Fait A, Zhu X, Szymanski J, Feldmesser E, Fernie AR, Galili G: Deciphering transcriptional and metabolic networks associated with lysine metabolism during Arabidopsis seed development. Plant Physiol. 2009, 151 (4): 2058-2072. 10.1104/pp.109.145631.PubMed CentralView ArticlePubMedGoogle Scholar
- Teoh KT, Requesens DV, Devaiah S, Johnson D, Huang X, Howard J, Hood E: Transcriptome analysis of embryo maturation in maize. BMC Plant Biol. 2013, 13 (1): 19-10.1186/1471-2229-13-19.PubMed CentralView ArticlePubMedGoogle Scholar
- Horvath S, Dong J: Geometric interpretation of gene coexpression network analysis. PLoS Comput Biol. 2008, 4 (8): e1000117-10.1371/journal.pcbi.1000117.PubMed CentralView ArticlePubMedGoogle Scholar
- Goffman FD, Velasco L, Becker HC: Tocopherols accumulation in developing seeds and pods of rapeseed (Brassica napus L.). Fett-Lipid. 1999, 101 (10): 400-403. 10.1002/(SICI)1521-4133(199910)101:10<400::AID-LIPI400>3.0.CO;2-#.View ArticleGoogle Scholar
- Kamal-Eldin A, Appelqvist L-Å: The chemistry and antioxidant properties of tocopherols and tocotrienols. Lipids. 1996, 31 (7): 671-701. 10.1007/BF02522884.View ArticlePubMedGoogle Scholar
- Yang Y, Yu X, Song L, An C: ABI4 activates DGAT1 expression in Arabidopsis seedlings during nitrogen deficiency. Plant Physiol. 2011, 156 (2): 873-883. 10.1104/pp.111.175950.PubMed CentralView ArticlePubMedGoogle Scholar
- Deng W, Chen G, Peng F, Truksa M, Snyder CL, Weselake RJ: Transparent testa16 plays multiple roles in plant development and is Involved in lipid synthesis and embryo development in Canola. Plant Physiol. 2012, 160 (2): 978-989. 10.1104/pp.112.198713.PubMed CentralView ArticlePubMedGoogle Scholar
- Wind JJ, Peviani A, Snel B, Hanson J, Smeekens SC: ABI4: versatile activator and repressor. Trends Plant Sci. 2013, 18 (3): 125-132. 10.1016/j.tplants.2012.10.004.View ArticlePubMedGoogle Scholar
- Penfield S, Li Y, Gilday AD, Graham S, Graham IA: Arabidopsis ABA INSENSITIVE4 regulates lipid mobilization in the embryo and reveals repression of seed germination by the endosperm. The Plant Cell Online. 2006, 18 (8): 1887-1899. 10.1105/tpc.106.041277.View ArticleGoogle Scholar
- Gaur V, Singh US, Kumar A: Transcriptional profiling and in silico analysis of Dof transcription factor gene family for understanding their regulation during seed development of rice Oryza sativa L. Mol Biol Rep. 2011, 38 (4): 2827-2848. 10.1007/s11033-010-0429-z.View ArticlePubMedGoogle Scholar
- Mena M, Vicente-Carbajosa J, Schmidt Robert J, Carbonero P: An endosperm-specific DOF protein from barley, highly conserved in wheat, binds to and activates transcription from the prolamin-box of a native B-hordein promoter in barley endosperm. Plant J. 1998, 16 (1): 53-62. 10.1046/j.1365-313x.1998.00275.x.View ArticlePubMedGoogle Scholar
- Vicente-Carbajosa J, Moose SP, Parsons RL, Schmidt RJ: A maize zinc-finger protein binds the prolamin box in zein gene promoters and interacts with the basic leucine zipper transcriptional activator Opaque2. Proc Natl Acad Sci. 1997, 94 (14): 7685-7690. 10.1073/pnas.94.14.7685.PubMed CentralView ArticlePubMedGoogle Scholar
- Stamm P, Ravindran P, Mohanty B, Tan E, Yu H, Kumar P: Insights into the molecular mechanism of RGL2-mediated inhibition of seed germination in Arabidopsis thaliana. BMC Plant Biol. 2012, 12 (1): 179-10.1186/1471-2229-12-179.PubMed CentralView ArticlePubMedGoogle Scholar
- Yanagisawa S: Dof domain proteins: plant-specific transcription factors associated with diverse phenomena unique to plants. Plant Cell Physiol. 2004, 45 (4): 386-391. 10.1093/pcp/pch055.View ArticlePubMedGoogle Scholar
- Maeo K, Tokuda T, Ayame A, Mitsui N, Kawai T, Tsukagoshi H, Ishiguro S, Nakamura K: An AP2-type transcription factor, WRINKLED1, of Arabidopsis thaliana binds to the AW-box sequence conserved among proximal upstream regions of genes involved in fatty acid synthesis. Plant J. 2009, 60 (3): 476-487. 10.1111/j.1365-313X.2009.03967.x.View ArticlePubMedGoogle Scholar
- Lenhard B, Wasserman WW: TFBS: computational framework for transcription factor binding site analysis. Bioinformatics. 2002, 18 (8): 1135-1136. 10.1093/bioinformatics/18.8.1135.View ArticlePubMedGoogle Scholar
- Li L, Bass RL, Liang Y: fdrMotif: identifying cis-elements by an EM algorithm coupled with false discovery rate control. Bioinformatics. 2008, 24 (5): 629-636. 10.1093/bioinformatics/btn009.PubMed CentralView ArticlePubMedGoogle Scholar
- Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS: MEME Suite: tools for motif discovery and searching. Nucleic Acids Res. 2009, 37 (suppl 2): W202-W208.PubMed CentralView ArticlePubMedGoogle Scholar
- Meireles-Filho ACA, Stark A: Comparative genomics of gene regulation—conservation and divergence of cis-regulatory information. Curr Opin Genet Dev. 2009, 19 (6): 565-570. 10.1016/j.gde.2009.10.006.View ArticlePubMedGoogle Scholar
- Usadel B, Nagel A, Thimm O, Redestig H, Blaesing OE, Palacios-Rojas N, Selbig J, Hannemann J, Piques MC, Steinhauser D, et al: Extension of the visualization tool MapMan to allow statistical analysis of arrays, display of corresponding genes, and comparison with known responses. Plant Physiol. 2005, 138 (3): 1195-1204. 10.1104/pp.105.060459.PubMed CentralView ArticlePubMedGoogle Scholar
- Smyth GK: limma: linear models for microarray data. Bioinformatics and Computational Biology Solutions Using R and Bioconductor. Edited by: Gentleman R, Carey V, Huber W, Irizarry RA, Dudoit S. 2005, New York: Springer, 397-420.View ArticleGoogle Scholar
- Mason M, Fan G, Plath K, Zhou Q, Horvath S: Signed weighted gene co-expression network analysis of transcriptional regulation in murine embryonic stem cells. BMC Genomics. 2009, 10 (1): 327-10.1186/1471-2164-10-327.PubMed CentralView ArticlePubMedGoogle Scholar
- Langfelder P, Horvath S: WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008, 9 (1): 559-10.1186/1471-2105-9-559.PubMed CentralView ArticlePubMedGoogle Scholar
- Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995, 57 (1): 289-300.Google Scholar
- Merico D, Isserlin R, Stueker O, Emili A, Bader GD: Enrichment Map: a network-based method for gene-set enrichment visualization and interpretation. PLoS ONE. 2010, 5 (11): e13984-10.1371/journal.pone.0013984.PubMed CentralView ArticlePubMedGoogle Scholar
- Gupta S, Stamatoyannopoulos J, Bailey T, Noble W: Quantifying similarity between motifs. Genome Biol. 2007, 8 (2): R24-10.1186/gb-2007-8-2-r24.PubMed CentralView ArticlePubMedGoogle Scholar
- Portales-Casamar E, Thongjuea S, Kwon AT, Arenillas D, Zhao X, Valen E, Yusuf D, Lenhard B, Wasserman WW, Sandelin A: JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles. Nucleic Acids Res. 2010, 38 (suppl 1): D105-D110.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.