A cross-species transcriptomics approach to identify genes involved in leaf development
- Nathaniel Robert Street†1,
- Andreas Sjödin†1,
- Max Bylesjö2,
- Petter Gustafsson1,
- Johan Trygg2 and
- Stefan Jansson1Email author
© Street et al; licensee BioMed Central Ltd. 2008
Received: 06 June 2008
Accepted: 05 December 2008
Published: 05 December 2008
We have made use of publicly available gene expression data to identify transcription factors and transcriptional modules (regulons) associated with leaf development in Populus. Different tissue types were compared to identify genes informative in the discrimination of leaf and non-leaf tissues. Transcriptional modules within this set of genes were identified in a much wider set of microarray data collected from leaves in a number of developmental, biotic, abiotic and transgenic experiments.
Transcription factors that were over represented in leaf EST libraries and that were useful for discriminating leaves from other tissues were identified, revealing that the C2C2-YABBY, CCAAT-HAP3 and 5, MYB, and ZF-HD families are particularly important in leaves. The expression of transcriptional modules and transcription factors was examined across a number of experiments to select those that were particularly active during the early stages of leaf development. Two transcription factors were found to collocate to previously published Quantitative Trait Loci (QTL) for leaf length. We also found that miRNA family 396 may be important in the control of leaf development, with three members of the family collocating with clusters of leaf development QTL.
This work provides a set of candidate genes involved in the control and processes of leaf development. This resource can be used for a wide variety of purposes such as informing the selection of candidate genes for association mapping or for the selection of targets for reverse genetics studies to further understanding of the genetic control of leaf size and shape.
Leaves are of fundamental importance to life on earth, representing the powerhouses of most food chains and are, ultimately, the source of energy that sustains humanity. Although much has been learnt about the biochemical and physiological functioning of leaf-level processes such as photosynthesis , still relatively little is known about why leaves are the size and shape they are and how they come to be so. There are genes known to affect meristomatic pattern formation (e.g. AS1 and WUS, KNOX and CLV, see  for a review of leaf development) and the rate of leaf primordia initiation  as well as genes that contribute to the determination of leaf length (ROT3 , LNG ) and width (AN, ), with less being known about the determination of leaf size (although  provide exciting insight). Although the two now classic examples of ROT and AN clearly affect leaf width and length, each will likely explain only a small proportion of the variation that exists for these traits, as numerous examples suggest that leaf size and shape are under complex polygenic control [8–17]. The vast majority of work identifying genes that function in the control of leaf size and shape has, to date, been conducted using the model species Arabidopsis thaliana, with most genes being identified through mutant screens, and, to the best of our knowledge, with no examples of evidence that such genes show variation between ecotypes of Arabidopsis with contrasting leaf shape or size. Beyond this, the work in  showed how conclusions drawn about gene function in Arabidopsis, which has simple leaves, may not hold true in species with more complex, dissected leaf forms.
We are interested in identifying genes that are functionally important in determining natural variation in leaf size, shape, and development. Forward genetic screens represent an important approach for identifying genes that have the potential to contribute to natural variation of phenotypic traits. However, such mutant screens are not directed by natural variation in phenotype and there is no reason to expect that genes identified through induced mutation (or other means of disrupting gene function) are those that have been acted upon by natural selection, which could render them interesting but unimportant in an ecological/evolutionary context. The opposite paradigm of reverse genetics offers a way of identifying functionally important loci but, in the case of QTL or association mapping, is time-consuming and often significantly limited by the biology of the system being studied – for example in forest tree species the generation of mapping populations is a very long-term commitment and the subsequent QTL (Quantitative Trait Loci) mapping resolution is of questionable value if the aim is to identify genes, or even causal polymorphisms, underlying a QTL (although association mapping largely overcomes this resolution limitation, for example see , but requires greater starting information to enable targeting of relevant loci). A third approach, which represents a refinement to a reverse genetics pipeline, is to identify genes whose expression pattern(s) suggest they might be functioning during a developmental process or response to stimuli/stress (i.e. functional genomics, or perhaps initially 'guilt-by-association') and to combine this information with QTL mapping, and approach that was termed 'genetical genomics' . This proposed view of how to consider gene expression (or similarly peptide/metabolite abundance) in a genetical context is currently receiving considerable attention, extension and refinement of methodology [21–25]. Association mapping and genetical genomics both represent major motivations for the efficient identification of candidate genes, particularly where whole-genome assays cannot be utilised, such as in non-model systems or species such as aspens, where whole-genome SNP assays with adequate genomic resolution for association mapping due to rapid LD decay are still a long way from being available. In such cases well-conceived 'omics' studies may act as a magnet to help find the needles in the haystack that is the genome.
The use of microarrays is now a firmly established method, with transcriptomics being the most mature of the now multitude 'omics' fields. Expression microarrays allow the parallel profiling of most or all genes in a genome, and sensibly designed biological experiments allow the application of microarrays to identify patterns of gene expression associated with a trait of interest. Such traits can be temporal, spatial, or adaptive. As well as conducting individual experiments aimed at answering specific hypotheses, the concentration on MIAME (Minimum Information About a Microarray Experiment, ) compliance has enabled the use of meta-analysis approaches that aim to identify patterns across many experiments, or across species. Much effort has been put into the development and subsequent refinement of methods to isolate biologically meaningful information from the background noise [27–30].
An interest in the application of clustering methods to microarray data, whereby groups of genes exhibiting similar expression profiles are identified, was borne from the regulon concept whereby it is expected that a single 'master regulator' (typically a transcription factor) will control the expression of many genes as an efficient way of initiating gene expression 'programs'. For example, leaf development can be considered as a progression through multiple, successive, modules of a developmental program running from primordia initiation to leaf maturity. We have previously shown that some such successive program modules can be identified by profiling gene expression of a field-grown aspen throughout the growing season in multiple years. This identified modules that could broadly be defined as 'cell division', 'cell elongation', and 'differentiation/maturation' . Identifying such clusters (regulons/transcriptional modules, hereafter referred to as transcriptional modules) also allows inference about the function of unknown genes . Clustering was first carried out at the level of an individual experiment using mainly hierarchical clustering , k-means clustering  and Self Organising Maps (SOMs, ) with other methods following. Interest has now extended to identifying transcriptional modules across more extensive collections of expression data [36–41] as well as to the application of network theory and analysis to reconstruct gene regulatory networks [42–47].
We have previously established UPSC-BASE, a database repository with integrated transcriptomics analysis tools  for our own Populus cDNA microarray platform, which now contains by-far the largest collection of transcriptomics data available for Populus. Here we show how this data can be mined using a meta-analysis and cross-validation approach (using external datasets and cross-experiment comparisons) to identify candidate genes involved in leaf development. Our aim was to answer the questions a) by comparing the expression profiles of contrasting developing tissue types, can we identify genes that are of particular importance in leaves? b) Using the available microarray data, can we identify transcription factors and 'leaf transcriptional modules'? c) Can we use the results to identify meaningful patterns associated with the process of leaf development? This work represents the first meta-analysis of Populus transcriptomics data and is one of only a small number of cross-experiment transcriptomics studies in plants.
Identifying genes of importance to leaves using the UPSC-TC experiment
Transcription factors preferentially expressed in leaves
a) Digital Northern
CCAAT-box binding transcription factor subunit HAP3-related
loop-helix (bHLH) family protein
homeobox-leucine zipper family protein
High mobility group (HMG1/2) family protein
High mobility group (HMG6)
myb family transcription factor (MYB3)
zinc finger homeobox family protein (ZFHD4)
zinc finger homeobox family protein ATHB24)
zinc finger hmeobox family protein ATHB22/MEE68)
zinc finger homeobox family protein (ZFHD2)
As well as transcription factors being important regulators, it has also become clear that miRNAs play a key regulatory role, especially in restricting the zone of expression of genes thought to be involved in development. Using the target site predictions provided in , we tested the OPLS leaf genes to see if any of the miRNAs currently in miRBase  were over-represented for predicted target sites within this gene list. As not every gene model is represented on the microarrays used, we first restricted the target site prediction dataset to only those gene models profiled. We additionally had to translate the pre v1.0 gene models used in  to their v1.1 equivalents. Five of the seven members (c-g) of the ptc-MIR396 family were found to be over-represented for predicted target sites within the OPLS leaf genes, with no other miRNAs showing over-representation. Of these five significant family members, ptc-MIR396g was by-far the most significant with seven of its 11 predicted targets being found within the OPLS leaf gene list. Examination of the genomic location of these miRNAs in the PopGenIE Browser revealed that ptc-MIR396c,f,g are located within clusters of QTL for leaf development traits on LG III, VI and VII respectively. The other members of MIR396 do not collocate to leaf development QTL and additionally, none of the predicted target genes collocate with leaf QTL. As another route to examining the potential role of miRNAs in leaf development, we examined the number of predicted miRNAs targetting the OPLS leaf genes (Additional file 3). Six genes were predicted to be targetted by seven miRNAs, with five of those genes having maximum homology to Arabidopsis Growth Regulating Factor (GRF) genes, which are discussed below. For the five genes annotated as GRFs, the seven miRNAs predicted to target them are the seven members of the ptc-MIR396 family. The sixth gene is predicted to be targetted by seven members of ptc-MIR169 (i-m, o,p). This gene is of unknown function and has no reported Arabidopsis mutant phenotypes in TAIR currently. We additionally performed a broader examination of all genes (not just the leaf genes or those represented on the UPSC cDNA microarray) predicted to be highly targetted by miRNAs to see which GO Biological Process categories were over-represented. For this analysis we used only genes predicted to be targetted by more than 10 miRNAs using the entire set of predicted miRNAs from  as well as only the 'official' miRBase miRNAs. In both cases, there was an almost exclusive and dramatic over-representation of gene involved in developmental processes and pattern formation (Additional file 4 shows the results of the analysis using the ptc-miRBase miRNA subset).
Identifying and characterising transcriptional modules in leaves
The most highly connected gene within transcription modules
similar to cupin family protein
cyclin dependent kinase B2:1
acyl carrier protein 4 (ACP4)
DEAD box RNA helicase (RH20)
Transparent Testa 7 (TT7), required for flavonoid 3' hydroxylase activity
ethylene-responsive protein, similar to Universal Stress Protein (USP)
Cross-experiment and cross-species validation
In Figure 5A–D it can be seen that some of the selected transcription factors have expression patterns that were highly consistent between Populus and Arabidopsis (A, B), while other examples (C, D) showed contrasting expression profiles. This is possibly due to differentiation of function between the two orthologs but is also possibly due to the fact that the ortholog identified by best-BLAST score analysis is not the true ortholog. Here we do not attempt to differentiate between the two possibilities, we simply define orthologs based on our analysis pipeline. Figure 5A shows an example of a gene that is a good marker of leaf maturity, with low expression at early leaf development stages in all Populus experiments and with high expression particularly in older leaves and floral organs in Arabidopsis and low expression in the apex samples. Figure 5B shows the opposite case – a gene that is a good marker for early leaf development stages, dropping rapidly in expression as leaves mature and with a distinct peak of expression in the Arabidopsis apex samples and no expression in older leaf samples. Figure 5C–D shows two additional markers of early leaf development stages in Populus but in both cases, expression in the Arabidopsis apex samples is either absent (C) or very low (D). In the case of Figure 5C, a distinct and almost unique peak is seen in Arabidopsis roots, which contrasts to the Populus expression pattern of this gene. In Figure 5D, a distinct peak is seen in Arabidopsis seed samples but not in the apex or leaves, which contrasts to the expression pattern observed across the Populus experiments, where expression is only high in young leaves and in internode samples of the Pt-TC experiment (which contain a secondary vegetative meristem). Populus trichocarpa shows weak apical dominance and so it is unsurprising that expression of a gene that is high in young leaves and in the shoot apex would also be high in internode samples. In general, we found that ~60% of genes had broadly similar expression patterns between Arabidopsis and Populus with ~30% showing more than broad similarity. The remaining 40% of genes showed a range of divergence between Populus and Arabidopsis, with some (such as those in Figure 5C–D) showing distinctly contrasting patterns between to the two species.
As an additional means of examining how robust the results from the UPSC-TC experiment were, we performed an OPLS analysis on the Pt-TC experiment, which compares different tissue type of Populus trichocarpa using a different array technology, to examine the degree of agreement between the two datasets. A direct comparison of the results from the two microarray platforms is problematic due to the considerably greater number of gene models represented on the Nimblegen microarray used for the Pt-TC experiment (54,794 compared to 15,789). We therefore performed the OPLS analysis for this experiment on the entire set of 54,794 nuclear gene models and then restricted the gene list obtained to only those gene models represented on the UPSC POP2 array (we consider this to be a more stringent approach than limiting the dataset to only those genes present on the UPSC POP2 array before performing the OPLS analysis). The union between the restricted Pt-TC and the UPSC-TC gene lists was then examined. 452 gene models (40.5%) were identified in common (Additional file 6).
An example extended-application of the results
To examine the distribution of candidate genes across the genome, we plotted their locations to allow a visual overview (Figure 6C). There appear to be two potential clusters of genes on LG VI and another, smaller, cluster on LG XIV. Perhaps interestingly (but certainly a coincidence of note), AS1 is located at the site of the first cluster on LG VI, ER at the site of the second and AN within the cluster on LG XIV. The above example does not provide any evidence that the genes we have selected in any way have a causal role in controlling leaf development, size or shape. It does, however, provide a useful and interesting example of how the dataset we present can be used and integrated within a genetical genomics context.
After an early wave of enthusiasm for high-throughput transcriptomics, attention is shifting somewhat towards the next steps up the ladder of the central dogma, proteomics and metabolomics, and to the integration of information across all levels i.e. the systems biology approach. This shift in attention is partly due to the fact that control at the level of the transcriptome tells only part of the whole story in the plot of genotype to phenotype. However, transcriptional data is still important, telling which genes are being transcribed even if they are eliminated as characters from the plot at later stages by factors such as miRNA degradation. The criticism now commonly placed on transcriptomics (i.e. the potential for transcription not to represent protein activity) will, of course, hold equally true when proteomics data is examined in increasing detail; knowing which proteins are present tells you nothing about their half life, rate of replacement or side-branch decoration.
Here we have used an extensive set of transcriptomics data from a diverse range of sources, covering different microarray platforms, species of Populus as well as Arabidopsis to identify genes that are actively transcribed during the process of leaf development. A similar starting point to ours was previously reported for Arabidopsis, where it was found that different tissue types could be differentiated on the basis of gene expression data . However, only four leaf-specific markers were identified. We believe that Populus represents a more suitable model for answering these questions as the degree of tissue differentiation is greater than in Arabidopsis (contrast the trunk of a Populus tree to the stem of Arabidopsis), a fact that should increase the resolution with which tissue-specific expression patterns can be discriminated. Here we were also able to clearly separate tissue types on the basis of gene expression but with greater within tissue-type resolution than was possible using Arabidopsis. Figure 1 shows that leaf samples form a unique group that is distinct from other tissue types. A second group was found for 'wood' samples, including old roots (which were most similar to bark), and a third, less distinct, group containing young roots and flowers. When considering only those gene models predicted to have transcription factor activity, we could see further differentiation within the three main groups (Figure 1B). EST library representation (Figure 2) and functional enrichment analysis (Figure 3) of gene models that were highly representative of leaves in comparison to other tissues identified apical shoot and young leaf EST libraries and GO categories for photosynthesis and chloroplasts (as one would expect) but also a number of categories likely to be involved in leaf development. As functional enrichment was tested for in a list of genes that are more representative of leaves than other tissues, these genes are not simply general markers of high cell division/expansion but are more specifically involved in those processes primarily in leaves. We were interested primarily not in identifying genes that are highly representative of leaves per se but more specifically, genes that are involved in leaf development or that could serve as developmental markers. Our own interest in this approach is in the identification of candidate genes for association mapping in the SwAsp collection of Swedish aspens . We also wanted to develop a set of developmental marker genes as we have previously shown that the magnitude of transcriptional regulation during development is greater than that induced by changes in the weather for a field-grown aspen  and that the induced transcriptional response to a treatment can be masked by developmental alterations induced by the treatment . The availability of developmental markers would allow rapid comparisons to be made between leaves thought to be at the same developmental state to confirm whether like is being compared with like. As many more genes than can be sensibly used for downstream applications were identified in the OPLS method, we took the approach of concentrating on transcription factors and of identifying transcriptional modules within our list of candidate genes as a means of selecting a subset of genes that would maximise the information content. We were able to identify transcriptional modules (Figure 4A) that appear to be robust (Figure 4B) and for which broad biological function could be assigned to. For each module, we plotted the expression of the most highly connected gene across a range of microarray data sets in both Populus and Arabidopsis (Figure 5E–F). As can be seen, the green and blue modules appear to be important in early stages of leaf development, the red module in later stages (differentiation rather than cell production), and the turquoise module important in later development and in mature leaf functioning. These patterns of expression make sense when considered alongside the functional classification for each module: The green (protein biosynthesis) and blue (DNA replication and structure) are highly expressed in young leaves that are involved primarily in cell division, the red (organelle/intracellular) module then having a later peak as cells begin to expand and differentiate, with the latest peak in the turquoise (photosynthesis/chloroplast) module, which peaks and remains high as differentiation nears completion and expansion reaches an end, marking the point where leaves switch from sinks to sources. This progression through development is also revealed by the connectivity seen between modules in Figure 4B, with there being a progression in developmental time from the bottom (green and blue) to the top (turquoise) of the diagram. The orange (secondary metabolism) module has a more variable expression profile (data not shown) and is also the least inter-connected of the network modules (Figure 4). Although a regulatory role cannot easily be ascribed to each of these genes (Table 2 – one would expect the most highly connected gene to be the best candidate for being the module regulator gene), for some a regulatory role does seem plausible. For example, the most highly connected gene in the blue module is estExt fgenesh4 pm.C LG V0721, the ortholog of AT1G76540, a cyclin-dependant kinase involved in regulation of the G2/M transition of the mitotic cell cycle . Most of the genes identified in Table 2 do not have reported under- or over-expression phenotypes and are currently of undetermined function. There is currently little information on how transferable and conserved such complex developmental transcriptional modules identified from transcriptional data are likely to be, especially in plants.
Due to their key role in controlling developmental programs and their ability to influence the expression of a number of downstream targets (trans effects), we concentrated our analysis on transcription factors. Transcription factors of interest were selected using OPLS and EST library over-representation tests. This combined approach provides evidence that the C2C2-YABBY, CCAAT-HAP3 and 5, ZF-HD, and MYB families of transcription factor families are of particular importance in leaves, with C2C2-YABBY being the most highly represented family. YABBY transcription factors are known to be involved in the process of abaxialisation [59–61, 65–67] and ectopic expression of YABBY genes is sufficient to cause the abaxialisation of adaxial epidermal tissues [59–61]. Five gene models in the C2C2-YABBY family were identified by the OPLS analysis of the UPSC-TC experiment with one of those (grail3.001817701) also identified by the EST library over-representation test. Three of the five gene models are orthologs of AT2G45190 (AFO/FIL/YAB1) and the other two of AT2G26580 (YAB5). Although no functional evidence is available for these genes in Populus yet, homologous Arabidopsis genes have been chacterized.  report that there was no observable leaf phenotype in the fil single mutant but suggest this is the result of redundant activities of other members of the YABBY family.  report aberrant leaf phenotypes for 35S over-expression lines of FIL, with strong over-expression apparently being lethal and weaker over-expressing lines showing the formation of wrinkled leaves. The expression of YAB5 was recently suggested to be negatively modulated in the adaxial domain by AS1 and AS2, two genes critical for the development of properly expanded leaves . Two of the identified transcription factors have mutant phenotypes reported in TAIR. Eugene3.00021070 is the ortholog of AT4G37740, Growth-Regulating Factor 2, the mutants of which have smaller leaves , and which is one of the predicted targets of the ptc-MIR396 miRNA family that we found to be over-represented for target sites in the OPLS leaf genes. EstExt fgenesh4 pm.C LG VI0283 is the ortholog of ASYMETRIC LEAVES 1 (AS1, AT2G37630), which is homologous to the Antirrhinum PHANTASTICA (PHAN) and maize ROUGH SHEATH2 (RS2) genes [70–72] and which acts as a negative repressor of class I KNOTTED1-like homeobox (KNOX, in particular KNAT1 and KNAT2) genes and that shows genetic interaction with SHOOT MERISTEMLESS (STM; ) and BREVIPEDICELLUS (BP; ). as1 mutants have highly lobed leaves, occasionally forming ectopic shoots on leaves [70, 73] show that AS1 and AS2 together with ERECTA (ER) are involved in the establishment of adaxial-abaxial polarity (with ER appearing to protect the AS1/AS2 pathway from heat stress; ).  show interaction of AS1 with CUP-SHAPED COTYLEDON (CUC) 1, a gene involved in shoot apical meristem formation. It would therefore appear that AS1 is a key and central player in leaf formation and pattern development. Through a combination of the various analyses undertaken in this study, it would appear that miRNA family ptc-MIR396 and the GRF gene family may represent interesting targets for follow-up studies.
As a means of indicating how generally-applicable and relevant our finding are, we were interested to see what degree of overlap would be found between the UPSC-TC and Pt-TC experiments if the same OPLS analysis was carried out on the Pt-TC experiment. We did not expect a substantial overlap as the two experiments have some key differences: The UPSC-TC experiment samples tissues to a greater resolution and, for the leaf samples, includes a set of leaves that are considerably younger than the young leaf samples in the Pt-TC experiment (see GEO for sample details of the Pt-TC experiment). However, 452 (40.5%) of genes were found in common between the UPSC-TC and Pt-TC experiments, which suggests that the analysis method is robust and applicable beyond the samples contained in the original experiment (data not shown). This also suggests that our identified genes are likely to be functionally important within the SwAsp aspen trees that we wish to use for future association mapping, which was a major motivation for this undertaking rather than simply selecting candidate genes on the basis of published forward genetics screens.
As an example of the potential uses of the information presented by this work, we examined the collocation of identified candidate genes and transcription factors to QTL for leaf development that we had mapped in previous work  and to those additional QTL available in the PopGenIE Browser. This allowed us to identify two collocating transcription factors, one of which (AFO) has functional support for a role in leaf development in Arabidopsis [59–61]. We have also shown that multiple members of a miRNA family identified as targeting a number of genes within the OPLS leaf gene set additionally collocate to QTL for leaf development, raising the potential for miRNAs being the underlying genetic loci of a QTL.
There are at least six publications containing QTL data for leaf related traits in the mapping population considered here alone (and a greater number if one considers all results published in any Populus mapping population), but in no case (our own publication included) would this collocation analysis have been possible using only the published data. We would therefore strongly suggest that a MIAME-equivalent standard should be established for the publication of QTL results to ensure that map, QTL, and phenotype data are always made publicly available as a requirement for the publication of QTL results, especially with the increasingly availability of eQTL results in addition to more classical phenotypic QTL data as this will ensure that such cross-species, comparative genomics studies are achievable in the same way that they are in the transcriptomics field thanks to standards compliance.
We have shown that a large, diverse collection of microarray data can be used in a combined-analysis approach to identify candidate genes involved in the processes of leaf development. Network-clustering analysis on a set of genes that are highly-representative of leaves identified modules with distinct patterns of expression associated with leaf development and functional enrichment analysis of genes within each module revealed that the pattern of expression between the modules makes biological sense in the context of their likely functional roles. Our cross-experiment and cross-species validation approach showed that identified genes, particularly the transcription factors, have patterns of expression that suggest they represent a robust set of candidate genes involved in leaf development. The analysis approach used identified genes with proven functional roles in leaf development in Arabidopsis and other species, suggesting that our method is biologically robust and the results meaningful. This suggests that the identified genes of as-yet unknown function are sensible targets for further targeted functional characterisation or as candidates for association mapping, as well as providing a set of genes that can serve as robust markers of leaf developmental age.
UPSC Tissue Comparison microarray experiment
Tissue samples were collected from hybrid aspen (P. tremula × P. tremuloides 'T89') grown under natural light in the greenhouse (UMA-0030) and at eleven positions covering early leaf developmental stages (UMA-0020): the Apex (L1); tissue just below the apex (L2); primordia < 100 μm (L3), 200–250 μm (L4), 500–1000 μm (L5) and 1–3 mm (L6) in diameter; and leaves 1 cm (L7), 2 cm (L8), 3 cm (L9), 4 cm (L10) and 5 cm (L11) long (experiment UMA-0020). In experiment UMA-0030, young root and root meristem samples were collected from hydroponically grown plants. The extended Leaf Primordia series is available in UPSC-BASE as experiment UMA-0020, and we refer to it here as the UPSC-LP experiment.
Material used for the microarrays was prepared and handled as described in . The experiment made use of the POP2 microarrays described in  and  and for which information can be found in UPSC-BASE . For the UPSC tissue comparison (UPSC-TC) experiment, microarray slides were hybridised against a common reference containing a mixture of all samples. Slides were scanned at 10 μm resolution, using a Scanarray 4000 microarray analysis system scanner (Perkin-Elmer, Boston, MA, USA). Scanner settings were calibrated for PMT (Photo Multiplier Tube) and laser power to ensure even level signal strength for both channels. Spot data were extracted using GenePix version 5.0 (Axon Instruments Inc, Union City, CA, USA). The data output from GenePix was imported into UPSC-BASE, and is publicly available as experiment number UMA-0030. A step-wise normalisation  was applied for all slides before further analysis. Based on a Principal Component Analysis (PCA) visualisation of the data (see below) the samples were classified into three groups for OPLS analysis. The three groups were 'leaf' (containing Leaf 8 and 10, Apical region, and Primordia arrays), 'wood' (containing xylem, phloem, bark, and old roots arrays), and 'flower/roots' (containing flower, primary roots, and root meristem arrays).
UPSC-BASE leaf microarrays
In April 2008, UPSC-BASE contained 22 experiments totalling 642 slides (of which 602 passed our filtering criteria) performed on Populus spp. leaves (experiments UMA 0001, 0009, 0011, 0013, 0016, 0020, 0025, 0031, 0032, 0033, 0035, 0036, 0037, 0040, 0042, 0050, 0054, 0063, 0068, 0069, 0078, 0082). These experiments represent a highly diverse set of conditions including biotic and abiotic stress treatments, developmental series and RNAi lines. The data for each experiment were step-wise normalised  and filtered to remove spots with an A-value below 8. Finally the separate data files were median aggregated based on gene model and merged to a single matrix. As the microarrays selected were performed on both the POP1 and POP2 platforms, we considered only the set of gene models common to both array platforms. The POP1 array contained 13,872 features targeting 9532 gene models and the POP2 array 27,648 features targeting 15,789 gene models. All gene models represented on the POP1 array are also present on the POP2 array. See  for details of the POP1 array and  for details of the POP2 array. For both arrays, see PopulusDB  for details of the EST resource used for their production. The merged dataset is available in UPSC-BASE as experiment UMA-9992 and is referred to here as the UPSC-Leaf dataset.
Populus trichocarpa 'Nisqually-1' Nimblegen microarrays
 and  make use of an experiment profiling gene expression in different tissues of Populus trichocarpa 'Niqually-1', the sequenced clone. This experiment has been deposited in GEO (Gene Expression Omnibus, ) as series GSE6442. We used only the subset of 16 arrays specific to P. trichocarpa 'Nisqually-1'. A matrix of normalised expression values was downloaded and gene model means calculated. All subsequent analysis was performed only on probes for nuclear gene models. For OPLS analysis (see below), samples were classed as either leaf or non-leaf. The microarrays used are whole-genome arrays with probes designed to profile 55,794 nuclear and 126 chloroplast and mitochondria gene model sequences in addition to 9,995 unigenes derived from EST sequences . The array platform is in GEO as accession GPL2618. We refer to this dataset as the P. trichocarpa Tissue Comparison (Pt-TC) experiment.
We used the AtGenExpress developmental baseline microarray dataset  to visualise the expression of Arabidopsis orthologs (defined by best-hit BLAST analysis, ) of genes identified in this study. Absolute intensity values were downloaded using the visualisation tool available at AtGenExpress homepage and subsequently visualised using R. The dataset was profiled using the ATH1 Affymetrix expression microarrays which targets 22,746 (> 80%) of known genes.
Digital Northerns and over-representation anlaysis
Digital Northern heat maps representing the library distribution of ESTs representing gene models within PopulusDB [53, 83] were produced using the PopGenIE DigitalNorthern tool . Over-representation of transcription factor families, of transcription factor gene models within leaf libraries, and of miRNAs with predicted target sites within the OPLS leaf genes was tested using the PopGenIE CatFisher tool.
2,723 and 2,576 gene models predicted to be transcription factors were downloaded from the public databases Populus Transcription Factor Database  and Database of Populus Transcription Factors  respectively. The two subsets of transcription factors were merged into a modified single classification scheme by taking the union inside each transcription factor family.
PCA and OPLS analysis
Principal Component Analysis (PCA) was performed using SIMCA-P (version 11.5, Umetrics, Umeå, Sweden). Normalised M-values were mean-centred and a two-component model was then fitted and visualised.
Orthogonal Projections to Latent Structures (OPLS) was used to model and predict tissue types, treated as different categories (the classes used are described above for the UPSC-TC and Pt-TC experiments). Gene models that were particularly helpful in the discrimination between leaf tissue and non-leaf tissues were subsequently identified using a permutation test essentially according to . All OPLS modelling was performed using R based on in-house produced code. To minimize the problem of over fitting, cross-validation [86, 87] was utilised to identify OPLS models with good generalisation properties. The dataset was mean-centred for each gene model prior to modelling. Significant genes were classed as those with FDR-corrected p values < 0.05. P values were generated using the above described permutation test method.
Clustering and network analysis
Expression information for gene models that were particularly helpful in the discrimination between leaf tissue and non-leaf tissues was extracted from the UPSC-Leaf microarray dataset. The dataset was first filtered to remove microarray slides with >50% missing values followed by a second filter to remove gene models containing >50% missing values, resulting in a final set of 602 arrays. Topological Overlap Matrix (TOM) weighted co-expression was used to construct networks on the filtered dataset and to define transcriptional modules. For a general overview of the method used see . Beta = 6 was used for soft thresholding to fulfil the assumption of a scale-free network (i.e. that some nodes within the network are of greater importance, and will be more highly connected; ). Modules of highly interconnected genes were identified using hierarchical clustering and intra-modular connectivity was calculated. Biological function of the individual modules was indicated by testing for over-represented Gene Ontology categories as described below. All analysis was performed in R using scripts based on methods detailed in [39, 40] and .
Functional enrichment analysis
Functional over-representation analysis was performed using the BiNGO plugin  for the network visualisation software Cytoscape (version 2.5.1, ). We used the hypergeometric test and set a Benjamini and Hochberg FDR-adjusted significance level of 0.05 for declaring a GO (Gene Ontology, ) category as significantly over-represented. As there is not yet a mature GO release for Populus, we have used the best BLAST hit results of EST sequences to Arabidopsis thaliana  to infer GO using the TAIR (The Arabidopsis Information Resource ) 6 release of the Arabidopsis genome. Test were performed in the Arabidopsis GO-plant-slim ontology. Gene lists identified using OPLS were split into those genes that were highly and lowly expressed in leaves compared to other tissues and the highly expressed gene models were then tested for GO over-representation.
Quantitative Trait Loci
Principal Component Analysis
Orthogonal Projection to Latent Sqaures
Topological Overlap Matrix [weighted co-expression]
Expressed Sequence Tag
Minimum Information About a Microarray Experiment
The Arabidopsis Information Resource
Umeå Plant Science Centre
Basic Local Alignment and Search Tool
UPSC Tissue Comparison
UPSC Leaf Primordia
Populus trichocarpa Tissue Comparison
Zinc Finger Homeobox Domain.
This work was supported by the Knut and Alice Wallenberg Foundation, the Swedish Foundation for Strategic Research, the Swedish Research Council and Kempestiftelserna. Jerry Tuskan for supplying genetic map information.
- Nelson N, Yocum CF: Structure and function of photosystems I and II. Annual Review of Plant Biology. 2006, 57: 521-565. 10.1146/annurev.arplant.57.032905.105350.PubMedView ArticleGoogle Scholar
- Fleming AJ: The control of leaf development. New Phytologist. 2005, 166: 9-20. 10.1111/j.1469-8137.2004.01292.x.PubMedView ArticleGoogle Scholar
- Pien S, Wyrzykowska J, Mason S, Smart C, Fleming A: Local expression of expansin induces the entire process of leaf development and modifies leaf shape. Proceedings of the National Academy of Sciences, USA. 2001, 98 (20): 11812-11817. 10.1073/pnas.191380498.View ArticleGoogle Scholar
- Tsuge T, Tsukaya H, Uchimiya H: Two independent and polarized processes of cell elongation regulate leaf blade expansion in Arabidopsis thaliana (L.) Heynh. Development. 1996, 122 (5): 1589-1600.PubMedGoogle Scholar
- Lee Y, Kim GT, Kim IJ, Park J, Kwak SS, Choi G, Chung WI: LONGIFOLIA1 and LONGIFOLIA2, two homologous genes, regulate longitudinal cell elongation in Arabidopsis. Development. 2006, 133 (21): 4305-4314. 10.1242/dev.02604.PubMedView ArticleGoogle Scholar
- Tsukaya H, Tsuge T, Uchimiya H: The cotyledon: A superior system for studies of leaf development. Planta. 1994, 195 (2): 309-312. 10.1007/BF00199692.View ArticleGoogle Scholar
- Savaldi-Goldstein S, Peto C, Chory J: The epidermis both drives and restricts plant shoot growth. Nature. 2007, 446 (7132): 199-202. 10.1038/nature05618.PubMedView ArticleGoogle Scholar
- Rae AM, Ferris R, Tallis MJ, Taylor G: Elucidating genomic regions determining enhanced leaf growth and delayed senescence in elevated CO2. Plant Cell and Environment. 2006, 29: 1730-1741. 10.1111/j.1365-3040.2006.01545.x.View ArticleGoogle Scholar
- Street NR, Skogstrom O, Sjodin A, Tucker J, Rodriguez-Acosta M, Nilsson P, Jansson S, Taylor G: The genetics and genomics of the drought response in Populus. Plant Journal. 2006, 48 (3): 321-341. 10.1111/j.1365-313X.2006.02864.x.PubMedView ArticleGoogle Scholar
- Juenger T, Perez-Perez J, Bernal S, Micol J: Quantitative trait loci mapping of floral and leaf morphology traits in Arabidopsis thaliana: evidence for modular genetic architecture. Evolution and Development. 2005, 7 (3): 259-271. 10.1111/j.1525-142X.2005.05028.x.PubMedView ArticleGoogle Scholar
- Wullschleger S, Yin T, Difazio S, Tschaplinski T, Gunter L, Davis M, Tuskan G: Phenotypic variation in growth and biomass distribution for two advanced-generation pedigrees of hybrid poplar. Canadian Journal of Forest Research. 2005, 35 (8): 1779-1789. 10.1139/x05-101.View ArticleGoogle Scholar
- El-Lithy M, Clerkx E, Ruys G, Koornneef M, Vreugdenhil D: Quantitative Trait Locus Analysis of Growth-Related Traits in a New Arabidopsis Recombinant Inbred Population. Plant Physiology. 2004, 135: 444-458. 10.1104/pp.103.036822.PubMedPubMed CentralView ArticleGoogle Scholar
- Frary A, Fritz L, Tanksley S: A comparative study of the genetic bases of natural variation in tomato leaf, sepal, and petal morphology. Theoretical and Applied Genetics. 2004, 109 (3): 523-533. 10.1007/s00122-004-1669-x.PubMedView ArticleGoogle Scholar
- Perez-Perez J, Serrano-Cartagena J, Micol J: Genetic Analysis of Natural Variations in the Architecture of Arabidopsis thaliana Vegetative Leaves. Genetics. 2002, 162 (2): 893-915.PubMedPubMed CentralGoogle Scholar
- Jiang C, Wright RJ, Woo SS, Delmonte TA, Paterson AH: QTL analysis of leaf morphology in tetraploid Gossypium (cotton). Theoretical and Applied Genetics. 2000, 100 (3): 409-418. 10.1007/s001220050054.View ArticleGoogle Scholar
- Wu R, Stettler R: Quantitative genetics of growth and development in Populus. III. Phenotypic plasticity of crown structure and function. Heredity. 1998, 81 (3): 299-310. 10.1046/j.1365-2540.1998.00397.x.View ArticleGoogle Scholar
- Wu R, Bradshaw HD, Stettler RF: Molecular genetics of growth and development in Populus (Salicaceae). 5. Mapping quantitative trait loci affecting leaf variation. American Journal of Botany. 1997, 84 (2): 143-153. 10.2307/2446076.PubMedView ArticleGoogle Scholar
- Hay A, Tsiantis M: The genetic basis for differences in leaf form between Arabidopsis thaliana and its wild relative Cardamine hirsuta. Nature Genetics. 2006, 38 (8): 942-947. 10.1038/ng1835.PubMedView ArticleGoogle Scholar
- Ingvarsson P: Nucleotide Polymorphism and Linkage Disequilibrium Within and Among Natural Populations of European Aspen (Populus tremula L., Salicaceae). Genetics. 2005, 169 (2): 945-953. 10.1534/genetics.104.034959.PubMedPubMed CentralView ArticleGoogle Scholar
- Jansen RC, Nap JP: Genetical genomics: the added value from segregation. Trends in Genetics. 2001, 17 (7): 388-391. 10.1016/S0168-9525(01)02310-1.PubMedView ArticleGoogle Scholar
- Bing N, Hoeschele I: Genetical genomics analysis of a yeast segregant population for transcription network inference. Genetics. 2005, 170 (2): 533-542. 10.1534/genetics.105.041103.PubMedPubMed CentralView ArticleGoogle Scholar
- Deutsch S, Lyle R, Dermitzakis E, Attar H, Subrahmanyan L, Gehrig C, Parand L, Gagnebin M, Rougemont J, Jongeneel V, Antonarakis S: Gene expression variation and expression quantitative trait mapping of human chromosome 21 genes. Human Molecular Genetics. 2005, 14 (23): 3741-3749. 10.1093/hmg/ddi404.PubMedView ArticleGoogle Scholar
- Ghazalpour A, Doss S, Zhang B, Wang S, Plaisier C, Castellanos R, Brozell A, Schadt EE, Drake TA, Lusis AJ, Horvath S: Integrating Genetic and Network Analysis to Characterize Genes Related to Mouse Weight. PLoS Genetics. 2006, 2 (8): e130-10.1371/journal.pgen.0020130.PubMedPubMed CentralView ArticleGoogle Scholar
- Kliebenstein D, West M, van Leeuwen H, Loudet O, Doerge RW, Dina : Identification of QTLs controlling gene expression networks defined a priori. BMC Bioinformatics. 2006, 7: 308-10.1186/1471-2105-7-308.PubMedPubMed CentralView ArticleGoogle Scholar
- West M, Kim K, Kliebenstein D, van Leeuwen H, Michelmore R, Doerge RW, St Clair D: Global eQTL Mapping Reveals the Complex Genetic Architecture of Transcript-Level Variation in Arabidopsis. Genetics. 2007, 175 (3): 1441-1450. 10.1534/genetics.106.064972.PubMedPubMed CentralView ArticleGoogle Scholar
- Brazma A, Hingamp P, Quackenbush J, Sherlock G, Spellman P, Stoeckert C, Aach J, Ansorge W, Ball CA, Causton HC, Gaasterland T, Glenisson P, Holstege FC, Kim IF, Markowitz V, Matese JC, Parkinson H, Robinson A, Sarkans U, Schulze-Kremer S, Stewart J, Taylor R, Vilo J, Vingron M: Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nature Genetics. 2001, 29 (4): 365-371. 10.1038/ng1201-365.PubMedView ArticleGoogle Scholar
- Kerr MK, Martin M, Churchill GA: Analysis of variance for gene expression microarray data. Journal of Computational Biology. 2000, 7 (6): 819-837. 10.1089/10665270050514954.PubMedView ArticleGoogle Scholar
- Smyth G: Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004, 3: Article 3Google Scholar
- Bylesjo M, Eriksson D, Kusano M, Moritz T, Trygg J: Data integration in plant biology: the O2PLS method for combined modeling of transcript and metabolite data. Plant Journal. 2007, 52: 1181-1191.PubMedView ArticleGoogle Scholar
- Bylesjo M, Eriksson D, Sjodin A, Jansson S, Moritz T, Trygg J: Orthogonal projections to latent structures as a strategy for microarray data normalization. BMC Bioinformatics. 2007, 8: 207-10.1186/1471-2105-8-207.PubMedPubMed CentralView ArticleGoogle Scholar
- Sjodin A, Wissel K, Bylesjo M, Trygg J, Jansson S: Global expression profiling in leaves of free-growing aspen. BMC Plant Biology. 2008,Google Scholar
- Zhou Y, Young JA, Santrosyan A, Chen K, Yan SF, Winzeler EA: In silico gene function prediction using ontology-based pattern identification. Bioinformatics. 2005, 21 (7): 1237-1245. 10.1093/bioinformatics/bti111.PubMedView ArticleGoogle Scholar
- Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences, USA. 1998, 95 (25): 14863-14868. 10.1073/pnas.95.25.14863.View ArticleGoogle Scholar
- Soukas A, Cohen P, Socci N, Friedman J: Leptin-specific patterns of gene expression in white adipose tissue. Genes & Development. 2000, 14 (8): 963-980.Google Scholar
- Kohonen T: Self-organized formation of topologically correct feature maps. Biological Cybernetics. 1982, 43: 59-69. 10.1007/BF00337288.View ArticleGoogle Scholar
- Ihmels J, Friedlander G, Bergmann S, Sarig O, Ziv Y, Barkai N: Revealing modular organization in the yeast transcriptional network. Nature Genetics. 2002, 31 (4): 370-377.PubMedGoogle Scholar
- Ihmels J, Bergmann S, Barkai N: Defining transcription modules using large-scale gene expression data. Bioinformatics. 2004, 20 (13): 1993-2003. 10.1093/bioinformatics/bth166.PubMedView ArticleGoogle Scholar
- Subramanian A, Tamayo P, Mootha V, Mukherjee S, Ebert B, Gillette M, Paulovich A, Pomeroy S, Golub T, Lander E, Mesirov J: Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences, USA. 2005, 102 (43): 15545-15550. 10.1073/pnas.0506580102.View ArticleGoogle Scholar
- Zhang B, Horvath S: A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005, 4: Article17-PubMedGoogle Scholar
- Yip A, Horvath S: Gene network interconnectedness and the generalized topological overlap measure. BMC Bioinformatics. 2007, 8: 22-10.1186/1471-2105-8-22.PubMedPubMed CentralView ArticleGoogle Scholar
- Dong J, Horvath S: Understanding Network Concepts in Modules. BMC Systems Biology. 2007, 1: 24-10.1186/1752-0509-1-24.PubMedPubMed CentralView ArticleGoogle Scholar
- Wellmer F, Riechmann JL: Gene network analysis in plant development by genomic technologies. International Journal of Developmental Biology. 2005, 49 (5–6): 745-759. 10.1387/ijdb.051991fw.PubMedView ArticleGoogle Scholar
- Margolin AA, Wang K, Lim WK, Kustagi M, Nemenman I, Califano A: Reverse engineering cellular networks. Nature Protocols. 2006, 1 (2): 662-671. 10.1038/nprot.2006.106.PubMedView ArticleGoogle Scholar
- Wang Y, Joshi T, Zhang XS, Xu D, Chen L: Inferring gene regulatory networks from multiple microarray datasets. Bioinformatics. 2006, 22 (19): 2413-2420. 10.1093/bioinformatics/btl396.PubMedView ArticleGoogle Scholar
- Ernst J, Vainas O, Harbison CT, Simon I, Bar-Joseph Z: Reconstructing dynamic regulatory maps. Molecular Systems Biology. 2007, 3: 74-10.1038/msb4100115.PubMedPubMed CentralView ArticleGoogle Scholar
- Wittkopp PJ: Variable gene expression in eukaryotes: a network perspective. Journal of Experimental Biology. 2007, 210 (Pt 9): 1567-1575. 10.1242/jeb.002592.PubMedView ArticleGoogle Scholar
- Bansal M, Belcastro V, Ambesi-Impiombato A, di Bernardo D: How to infer gene networks from expression profiles. Molecular Systems Biology. 2007, 3: 78-PubMedPubMed CentralView ArticleGoogle Scholar
- Sjodin A, Bylesjo M, Skogstrom O, Eriksson D, Nilsson P, Ryden P, Jansson S, Karlsson J: UPSC-BASE-Populus transcriptomics online. Plant Journal. 2006, 48 (5): 806-817. 10.1111/j.1365-313X.2006.02920.x.PubMedView ArticleGoogle Scholar
- Trygg J, Wold S: Orthogonal projections to latent structures (O-PLS). Journal of Chemometrics. 2002, 16: 119-128. 10.1002/cem.695.View ArticleGoogle Scholar
- Bylesjo M, Rantalainen M, Cloarec O, Nicholson J, Holmes E, Trygg J: OPLS discriminant analysis: combining the strengths of PLS-DA and SIMCA classification. Journal of Chemometrics. 2006, 20: 341-351. 10.1002/cem.1006.View ArticleGoogle Scholar
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature Genetics. 2000, 25: 25-9. 10.1038/75556.PubMedPubMed CentralView ArticleGoogle Scholar
- PopGenIE: Populus Genome Integrative Explorer. [http://www.popgenie.db.umu.se/]
- Sterky F, Bhalerao RR, Unneberg P, Segerman B, Nilsson P, Brunner AM, Charbonnel-Campaa L, Lindvall JJ, Tandre K, Strauss SH, Sundberg B, Gustafsson P, Uhlen M, Bhalerao RP, Nilsson O, Sandberg G, Karlsson J, Lundeberg J, Jansson S: A Populus EST resource for plant functional genomics. Proceedings of the National Academy of Sciences, USA. 2004, 101 (38): 13951-13956. 10.1073/pnas.0401641101.View ArticleGoogle Scholar
- Lindow M, Jacobsen A, Nygaard S, Mang Y, Krogh A: Intragenomic Matching Reveals a Huge Potential for miRNA-Mediated Regulation in Plants. PLoS Computational Biology. 2007, 3 (11): e238-10.1371/journal.pcbi.0030238.PubMedPubMed CentralView ArticleGoogle Scholar
- Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006, D140-D144. 10.1093/nar/gkj112. 34 Database
- Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, Scholkopf B, Weigel D, Lohmann JU: A gene expression map of Arabidopsis thaliana development. Nature Genetics. 2005, 37 (5): 501-6. 10.1038/ng1543.PubMedView ArticleGoogle Scholar
- Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen GL, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, Cunningham R, Davis J, Degroeve S, Dejardin A, Depamphilis C, Detter J, Dirks B, Dubchak I, Duplessis S, Ehlting J, Ellis B, Gendler K, Goodstein D, Gribskov M, Grimwood J, Groover A, Gunter L, Hamberger B, Heinze B, Helariutta Y, Henrissat B, Holligan D, Holt R, Huang W, Islam-Faridi N, Jones S, Jones-Rhoades M, Jorgensen R, Joshi C, Kangasjarvi J, Karlsson J, Kelleher C, Kirkpatrick R, Kirst M, Kohler A, Kalluri U, Larimer F, Leebens-Mack J, Leple JC, Locascio P, Lou Y, Lucas S, Martin F, Montanini B, Napoli C, Nelson DR, Nelson C, Nieminen K, Nilsson O, Pereda V, Peter G, Philippe R, Pilate G, Poliakov A, Razumovskaya J, Richardson P, Rinaldi C, Ritland K, Rouze P, Ryaboy D, Schmutz J, Schrader J, Segerman B, Shin H, Siddiqui A, Sterky F, Terry A, Tsai CJ, Uberbacher E, Unneberg P, Vahala J, Wall K, Wessler S, Yang G, Yin T, Douglas C, Marra M, Sandberg G, de Peer YV, Rokhsar D: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313 (5793): 1596-1604. 10.1126/science.1128691.PubMedView ArticleGoogle Scholar
- Plant Genome Duplication Database. [http://chibba.agtec.uga.edu/duplication/]
- Zhao W, Su H, Song J, Zhao X, Zhang X: Ectopic expression of TaYAB1, a member of YABBY gene family in wheat, causes the partial abaxialization of the adaxial epidermises of leaves and arrests the development of shoot apical meristem in Arabidopsis. Plant Science. 2006, 170 (2): 364-371. 10.1016/j.plantsci.2005.09.008.View ArticleGoogle Scholar
- Sawa S, Watanabe K, Goto K, Kanaya E, Morita E, Okada K: FILAMENTOUS FLOWER, a meristem and organ identity gene of Arabidopsis, encodes a protein with a zinc finger and HMG-related domains. Genes & Development. 1999, 13 (9): 1079-1088. 10.1101/gad.13.9.1079.View ArticleGoogle Scholar
- Siegfried KR, Eshed Y, Baum SF, Otsuga D, Drews GN, Bowman JL: Members of the YABBY gene family specify abaxial cell fate in Arabidopsis. Development. 1999, 126 (18): 4117-28.PubMedGoogle Scholar
- Luquez V, Hall D, Albrectsen B, Karlsson J, Ingvarsson P, Jansson S: Natural phenological variation in aspen (Populus tremula): the SwAsp collection. Tree Genetics & Genomes. 2007, 4: 279-292. 10.1007/s11295-007-0108-y.View ArticleGoogle Scholar
- Taylor G, Street NR, Tricker PJ, Sjodin A, Graham L, Skogstrom O, Calfapietra C, Scarascia-Mugnozza G, Jansson S: The transcriptome of Populus in elevated CO2. New Phytologist. 2005, 167: 143-154. 10.1111/j.1469-8137.2005.01450.x.PubMedView ArticleGoogle Scholar
- Kono A, Umeda-Hara C, Lee J, Ito M, Uchimiya H, Umeda M: Arabidopsis D-Type Cyclin CYCD4;1 Is a Novel Cyclin Partner of B2-Type Cyclin-Dependent Kinase. Plant Physiology. 2003, 132 (3): 1315-1321. 10.1104/pp.103.020644.PubMedPubMed CentralView ArticleGoogle Scholar
- Eshed Y, Izhaki A, Baum S, Floyd S, Bowman J: Asymmetric leaf development and blade expansion in Arabidopsis are mediated by KANADI and YABBY activities. Development. 2004, 131 (12): 2997-3006. 10.1242/dev.01186.PubMedView ArticleGoogle Scholar
- Bowman J: The YABBY gene family and abaxial cell fate. Current Opinion in Plant Biology. 2000, 3: 17-22. 10.1016/S1369-5266(99)00035-7.PubMedView ArticleGoogle Scholar
- Golz J, Hudson A: Plant development: YABBYs claw to the fore. Current Biology. 1999, 9 (22): R861-R863. 10.1016/S0960-9822(00)80047-0.PubMedView ArticleGoogle Scholar
- Iwakawa , Hidekazu , Iwasaki , Mayumi , Kojima , Shoko , Ueno , Yoshihisa , Soma , Teppei , Tanaka , Hirokazu , Semiarti , Endang , Machida , Yasunori , Machida , Chiyoko : Expression of the ASYMMETRIC LEAVES2 gene in the adaxial domain of Arabidopsis leaves represses cell proliferation in this domain and is critical for the development of properly expanded leaves. Plant Journal. 2007, 51 (2): 173-184. 10.1111/j.1365-313X.2007.03132.x.PubMedView ArticleGoogle Scholar
- Kim JH, Choi D, Kende H: The AtGRF family of putative transcription factors is involved in leaf and cotyledon growth in Arabidopsis. The Plant Journal. 2003, 36: 94-104. 10.1046/j.1365-313X.2003.01862.x.PubMedView ArticleGoogle Scholar
- Byrne M, Barley R, Curtis M, Arroyo J, Dunham M, Hudson A, Martienssen R: Asymmetric leaves1 mediates leaf patterning and stem cell function in Arabidopsis. Nature. 2000, 408 (6815): 967-971. 10.1038/35050091.PubMedView ArticleGoogle Scholar
- Theodoris G, Inada N, Freeling M: Conservation and molecular dissection of ROUGH SHEATH2 and ASYMMETRIC LEAVES1 function in leaf development. Proceedings of the National Academy of Sciences, USA. 2003, 100 (11): 6837-6842. 10.1073/pnas.1132113100.View ArticleGoogle Scholar
- Eckardt N: The Role of PHANTASTICA in Leaf Development. Plant Cell. 2004, 16 (5): 1073-1075. 10.1105/tpc.060510.PubMed CentralView ArticleGoogle Scholar
- Xu L, Xu Y, Dong A, Sun Y, Pi L, Xu Y, Huang H: Novel as1 and as2 defects in leaf adaxial-abaxial polarity reveal the requirement for ASYMMETRIC LEAVES1 and 2 and ERECTA functions in specifying leaf adaxial identity. Development. 2003, 130 (17): 4097-4107. 10.1242/dev.00622.PubMedView ArticleGoogle Scholar
- Qi Y, Sun Y, Xu L, Xu Y, Huang H: ERECTA is required for protection against heat-stress in the AS1/AS2 pathway to regulate adaxial-abaxial leaf polarity in Arabidopsis. Planta. 2004, 219 (2): 270-276. 10.1007/s00425-004-1248-z.PubMedView ArticleGoogle Scholar
- Hibara K, Takada S, Tasaka M: CUC1 gene activates the expression of SAM-related genes to induce adventitious shoot formation. Plant Journal. 2003, 36 (5): 687-696. 10.1046/j.1365-313X.2003.01911.x.PubMedView ArticleGoogle Scholar
- Hertzberg M, Aspeborg H, Schrader J, Andersson A, Erlandsson R, Blomqvist K, Bhalerao R, Uhlen M, Teeri TT, Lundeberg J, Sundberg B, Nilsson P, Sandberg G: A transcriptional roadmap to wood formation. Proceedings of the National Academy of Sciences, USA. 2001, 98 (25): 14732-14737. 10.1073/pnas.261293398.View ArticleGoogle Scholar
- Moreau C, Aksenov N, Lorenzo MG, Segerman B, Funk C, Nilsson P, Jansson S, Tuominen H: A genomic approach to investigate developmental cell death in woody tissues of Populus trees. Genome Biology. 2005, 6 (4): R34-10.1186/gb-2005-6-4-r34.PubMedPubMed CentralView ArticleGoogle Scholar
- Wilson DL, Buckley MJ, Helliwell CA, Wilson IW: New normalization methods for cDNA microarray data. Bioinformatics. 2003, 19 (11): 1325-1332. 10.1093/bioinformatics/btg146.PubMedView ArticleGoogle Scholar
- Andersson A, Keskitalo J, Sjodin A, Bhalerao R, Sterky F, Wissel K, Tandre K, Aspeborg H, Moyle R, Ohmiya Y, Bhalerao R, Brunner A, Gustafsson P, Karlsson J, Lundeberg J, Nilsson O, Sandberg G, Strauss S, Sundberg B, Uhlen M, Jansson S, Nilsson P: A transcriptional timetable of autumn senescence. Genome Biology. 2004, 5 (4): R24-10.1186/gb-2004-5-4-r24.PubMedPubMed CentralView ArticleGoogle Scholar
- Ramirez-Carvajal GA, Morse AM, Davis JM: Transcript profiles of the cytokinin response regulator gene family in Populus imply diverse roles in plant development. New Phytologist. 2007, 177: 77-89.PubMedGoogle Scholar
- Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R: NCBI GEO: mining tens of millions of expression profiles-database and tools update. Nucleic Acids Research. 2007, D760-D765. 10.1093/nar/gkl887. 35 Database
- Groover A, Mansfield S, DiFazio S, Dupper G, Fontana J, Millar R, Wang Y: The Populus homeobox gene ARBORKNOX1 reveals overlapping mechanisms regulating the shoot apical meristem and the vascular cambium. Plant Molecular Biology. 2006, 61 (6): 917-932. 10.1007/s11103-006-0059-y.PubMedView ArticleGoogle Scholar
- Segerman B, Jansson S, Karlsson J: Characterization of genes with tissue-specific differential expression patterns in Populus. Tree Genetics & Genomes. 2007, 3 (4): 351-362. 10.1007/s11295-006-0077-6.View ArticleGoogle Scholar
- Riano-Pachon DM, Ruzicic S, Dreyer I, Mueller-Roeber B: PlnTFDB: an integrative plant transcription factor database. BMC Bioinformatics. 2007, 8: 42-10.1186/1471-2105-8-42.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhu QH, Guo AY, Gao G, Zhong YF, Xu M, Huang M, Luo J: DPTF: a database of poplar transcription factors. Bioinformatics. 2007, 23 (10): 1307-1308. 10.1093/bioinformatics/btm113.PubMedView ArticleGoogle Scholar
- Wold S: Cross Validatory Estimation of the Number of Components in Factor and Principal Components Models. Technometrics. 1978, 20: 397-406. 10.2307/1267639.View ArticleGoogle Scholar
- Shao J: Linear-Model Selection by Cross-Validation. Journal of The American Statistical Association. 1993, 88 (422): 486-494. 10.2307/2290328.View ArticleGoogle Scholar
- Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabasi AL: Hierarchical Organization of Modularity in Metabolic Networks. Science. 2002, 297 (5586): 1551-1555. 10.1126/science.1073374.PubMedView ArticleGoogle Scholar
- Maere S, Heymans K, Kuiper M: BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005, 21 (16): 3448-3449. 10.1093/bioinformatics/bti551.PubMedView ArticleGoogle Scholar
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 2003, 13 (11): 2498-2504. 10.1101/gr.1239303.PubMedPubMed CentralView ArticleGoogle Scholar
- Huala E, Dickerman AW, Garcia-Hernandez M, Weems D, Reiser L, LaFond F, Hanley D, Kiphart D, Zhuang M, Huang W, Mueller LA, Bhattacharyya D, Bhaya D, Sobral BW, Beavis W, Meinke DW, Town CD, Somerville C, Rhee SY: The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Research. 2001, 29: 102-5. 10.1093/nar/29.1.102.PubMedPubMed CentralView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.