Transcript profiling of two alfalfa genotypes with contrasting cell wall composition in stems using a cross-species platform: optimizing analysis by masking biased probes
- S Samuel Yang†1,
- Wayne Wenzhong Xu†2,
- Mesfin Tesfaye3,
- JoAnn FS Lamb1, 4,
- Hans-Joachim G Jung1, 4,
- Kathryn A VandenBosch3,
- Carroll P Vance1, 4 and
- John W Gronwald1, 4
© Yang et al; licensee BioMed Central Ltd. 2010
Received: 22 September 2009
Accepted: 24 May 2010
Published: 24 May 2010
The GeneChip® Medicago Genome Array, developed for Medicago truncatula, is a suitable platform for transcript profiling in tetraploid alfalfa [Medicago sativa (L.) subsp. sativa]. However, previous research involving cross-species hybridization (CSH) has shown that sequence variation between two species can bias transcript profiling by decreasing sensitivity (number of expressed genes detected) and the accuracy of measuring fold-differences in gene expression.
Transcript profiling using the Medicago GeneChip® was conducted with elongating stem (ES) and post-elongation stem (PES) internodes from alfalfa genotypes 252 and 1283 that differ in stem cell wall concentrations of cellulose and lignin. A protocol was developed that masked probes targeting inter-species variable (ISV) regions of alfalfa transcripts. A probe signal intensity threshold was selected that optimized both sensitivity and accuracy. After masking for both ISV regions and previously identified single-feature polymorphisms (SFPs), the number of differentially expressed genes between the two genotypes in both ES and PES internodes was approximately 2-fold greater than the number detected prior to masking. Regulatory genes, including transcription factor and receptor kinase genes that may play a role in development of secondary xylem, were significantly over-represented among genes up-regulated in 252 PES internodes compared to 1283 PES internodes. Several cell wall-related genes were also up-regulated in genotype 252 PES internodes. Real-time quantitative RT-PCR of differentially expressed regulatory and cell wall-related genes demonstrated increased sensitivity and accuracy after masking for both ISV regions and SFPs. Over 1,000 genes that were differentially expressed in ES and PES internodes of genotypes 252 and 1283 were mapped onto putative orthologous loci on M. truncatula chromosomes. Clustering simulation analysis of the differentially expressed genes suggested co-expression of some neighbouring genes on Medicago chromosomes.
The problems associated with transcript profiling in alfalfa stems using the Medicago GeneChip as a CSH platform were mitigated by masking probes targeting ISV regions and SFPs. Using this masking protocol resulted in the identification of numerous candidate genes that may contribute to differences in cell wall concentration and composition of stems of two alfalfa genotypes.
Alfalfa [Medicago sativa (L.) subsp. sativa] is the most widely cultivated forage legume in the world and the fourth most widely grown crop in the United States. In 2008, over 60 million metric tons of alfalfa dry hay with a value of over $10 billion were harvested from over 8.5 million hectares in the US. In addition to being a valuable forage crop for livestock, alfalfa has considerable potential as a sustainable, cellulosic feedstock for ethanol production. Alfalfa is a relatively high biomass crop that also provides environmental benefits. For example, alfalfa improves soil and water quality, promotes wildlife diversity and provides its own nitrogen fertilizer through symbiotic nitrogen fixation [2, 4–6].
A promising strategy for developing alfalfa as a cellulosic ethanol crop involves separating leaves and stems following harvest. The leaves would be used as a protein supplement for livestock while the stems would be used to produce ethanol. Our research has focused on selecting for large stem, non-lodging, biomass-type alfalfa germplasm and developing management strategies to optimize biomass yield. To date, these efforts have resulted in a 40% increase in total biomass and a doubling of theoretical ethanol yield. We have also initiated research to modify the composition of alfalfa stem cell walls via a transgenic approach. The efficiency of ethanol production from cellulosic biomass is positively correlated with cellulose content but negatively correlated with lignin content [8, 9]. Thus, the value of alfalfa as a cellulosic feedstock would be enhanced by developing new alfalfa varieties that have increased cellulose and decreased lignin in stem cell walls [8, 9]. To facilitate the identification of key genes regulating cell wall composition, we selected alfalfa germplasm (genotypes 252 and 1283) that exhibit significant differences in lignin and cellulose concentrations in stem cell walls. On a dry matter basis, stem cellulose and Klason lignin concentrations of plants at flowering are significantly higher in genotype 252 compared to genotype 1283 (302 g kg-1 vs. 257 g kg-1 for cellulose and 117 g kg-1 vs. 98 g kg-1 for Klason lignin, respectively).
A high-density oligonucleotide microarray is not yet available for global transcript profiling in alfalfa. However, the GeneChip® Medicago Genome Array is available. This GeneChip contains a total of 52,796 Medicago probe sets designed from 50,900 and 1,896 sequences from M. truncatula and alfalfa, respectively. Each probe set in the GeneChip consists of 11 perfect match (PM) and 11 mismatch (MM) 25-mer probes. An underlying assumption when using microarrays for cross-species hybridization (CSH) is that the level of sequence homology among genes of closely-related species is significant enough to enable detection by probes originally designed for their orthologs. Previous research indicated that the Medicago GeneChip® is a suitable cross-species platform for transcript profiling in alfalfa [11, 12]. In large part, this is because there is a significant level of gene homology. For example, a previous study reported that DNA sequence identity was 93% or greater between protein coding regions of selected homologous genes in alfalfa and M. truncatula. However, in previous research using the Medicago GeneChip® for transcript profiling in alfalfa tissues, we observed decreased sensitivity (number of genes detected) and decreased accuracy in measuring fold-changes in gene expression compared to results obtained with M. truncatula tissues [11, 12].
Numerous studies, conducted with both animals and plants, have reported transcript profiling involving CSH to DNA microarrays of a closely-related species [13–32]. In a number of these studies, electronic masking was used to remove biased probes prior to microarray data analysis. For example, Ranz et al.  introduced a probe selection method based on genomic DNA hybridizations of the target and non-target species to the GeneChip. This approach has been used for CSH studies involving plant species [19, 27, 30]. However, a recent CSH study involving Xenopus species questioned the reliability of this method for selecting unbiased probes . Transcript profiling in non-human primates using the human GeneChip for CSH was optimized by identifying inter-species conserved probe sets . These probe sets were identified by aligning expressed sequence tags (ESTs) in non-human primate with probe sequences on the Affymetrix human GeneChip® platform. However, this approach is not feasible for species with limited sequence information such as alfalfa. In a study using the human GeneChip as a cross-species platform to measure gene expression in heart and liver tissues of non-human mammals (e.g. cattle, pig, dog, mouse), Ji et al.  developed a protocol to selectively mask poorly hybridized probes using the match/mismatch feature of the GeneChip. To evaluate whether masking improved the accuracy of measuring gene expression, it was hypothesized that different organs (heart, liver) of humans and non-human mammals have similar gene expression patterns. After masking low intensity probes in the microarray data of the cross-species, Ji et al.  found a linear correlation (r = 0.93) for Ln(heart/liver) values between human and mouse GeneChip data. These authors concluded that comparisons of gene expression patterns in defined tissues of related species could be used to optimize CSH studies involving other mammals or plants.
In earlier research, we examined gene expression at two stages of stem development for alfalfa and Medicago truncatula. In both species, transcript profiling was conducted in elongating stem internodes (ES) and post-elongation stem internodes (PES). Genes associated with primary cell wall development were preferentially expressed in ES internodes while genes associated with secondary xylem development were enriched in PES internodes. The objective of this study was to identify genes that are differentially expressed in ES and PES internodes of alfalfa genotypes 252 and 1283 using the GeneChip® Medicago Genome Array as a cross-species platform. To optimize cross-species hybridization analysis, we developed a protocol for masking probes targeting inter-species variable (ISV) regions. After masking for ISV regions and single-feature polymorphisms (SFPs) previously detected in genotypes 252 and 1283, we identified numerous genes that were differentially expressed in ES and PES internodes of the two genotypes.
Results and discussion
Masking probes targeting inter-species variable regions
As a preliminary analysis of sequence divergence between orthologous genes of Medicago truncatula and Medicago sativa (alfalfa), we blasted 550,074 M. truncatula probe sequences (25-mer) on the GeneChip® Medicago Genome Array against the 12,072 alfalfa expressed sequence tag (EST) sequences that are currently available from the public database (e-value cut-off = 0.001, minimum nucleotide alignment length = 20). A total of 21,176 M. truncatula probe sequences had alfalfa EST hits and 14,960 of them (~70%) showed at least one base mismatch (data not shown). These results suggested that masking ISV regions would optimize transcript profiling when using the Medicago GeneChip® as a cross-species platform for measuring gene expression in alfalfa.
To evaluate the effect of masking for ISV regions on the accuracy of measuring fold-changes in gene expression between PES and ES internodes of alfalfa, we examined the correlation of the hybridization intensity signal ratio of PES and ES internodes for the commonly-selected genes from the two Medicago species as the signal intensity threshold was increased. The Pearson correlation coefficient of the PES/ES ratio between M. truncatula and alfalfa increased from 0.29 to 0.34 as the signal intensity threshold increased up to about 100, reflecting increased accuracy (Figure 3). The decline in correlation detected at signal intensity thresholds above 100 may be due to masking too many informative probes. Although the highest correlation of the PES/ES ratio between the two Medicago species was achieved with a signal intensity threshold of 100, the number of commonly-selected genes was reduced (Figure 3). The data in Figure 3 show that the effects of masking on accuracy and sensitivity intersect at a signal intensity threshold of 40, where over 50,000 probe sets (about 85% of the total number on the GeneChip) were retained (Figure 2). On the basis of these results, we used a signal intensity threshold of 40 for masking biased probes due to ISV regions. The use of this masking threshold significantly improved sensitivity (the number of expressed genes detected) while maintaining a high level of accuracy in measuring fold-difference in gene expression.
Most CSH studies in plants have used a genomic DNA-based strategy for probe selection [19, 27, 30]. To our knowledge, this study is the first to employ an RNA-based probe selection protocol to mask ISV regions when using a cross-species platform for transcript profiling in plants. The masking protocol that we developed has some advantages over previously reported masking protocols especially for crops with limited sequence information. For example, neither DNA hybridization  nor prerequisite sequence information  is needed to identify inter-species conserved probe sets. In addition, with careful experimental design including adequate replication, the masking protocol developed in this study is relatively simple to implement (see Methods). The protocol is based on the assumption that the ratio of gene expression in PES and ES internodes of two closely-related Medicago species (M. truncatula and M. sativa) is similar. In mammals, a similar approach involving comparisons of gene expression between organs has been used successfully in CSH studies . In our study, the ratio of expression of probe sets in ES and PES internodes of M. truncatula was used to optimize both the sensitivity and the accuracy of detecting genes differentially expressed in alfalfa stem internodes. Our results suggest that a similar RNA-based approach for masking ISV regions could be successfully applied to other closely-related plant species where a microarray platform is available for one species.
Although the masking protocol used in this study is a useful tool for optimizing CSH GeneChip data, it does not correct for all bias in the data. It is important that candidate genes selected for further study based on masking results be validated by real-time quantitative RT-PCR. In addition, one limitation of a masking protocol based on RNA hybridization intensity is bias toward abundant transcripts. Low abundance genes (probe sets) would most likely be masked using this protocol.
Masking probes for both ISV regions and SFPs
Single-feature polymorphisms (SFPs) are polymorphisms detected by single probes in microarrays . Previously, we identified 10,890 SFPs between alfalfa genotypes 252 and 1283 using the GeneChip expression data files for ES and PES internodes . These allelic variations between the two genotypes can bias transcript profiling by causing both false positives and false negatives [35, 36]. The effect of masking for both ISV regions and SFPs (i.e. double-masking) on the number of probe sets retained was minimal. Only about 450 additional probe sets were lost after further masking for SFPs (data not shown). By masking for probes targeting both inter- and intra-species variable regions, we improved the quality of the CSH GeneChip data for the two alfalfa genotypes examined. The double-masking strategy employed in this study can be applied to other species when using a cross-species platform for transcript profiling between two genotypes.
Effect of masking on detection of differentially expressed genes
Differences in gene expression between stem internodes of genotypes 252 and 1283
The role of various transcription factors in regulating cell wall development has been examined primarily in Arabidopsis and poplar (Populus spp.). For example, over-expression of some NAC (NAM/ATAF/CUC) and MYB proteins in Arabidopsis led to abnormal ectopic deposition of secondary cell walls and suppression of their functions resulted in a decrease in secondary cell wall thickening [40–47]. Some MYB family transcription factors also regulate the expression of genes involved in lignin biosynthesis [47–50]. Interestingly, a recent study suggested that the NAC family transcription factor SND1 (SECONDARY WALL-ASSOCIATED NAC DOMAIN PROTEIN1) acts as a master transcriptional switch for activating secondary cell wall biosynthetic pathways by regulating the expression of 11 transcription factors (1 homeobox-, 2 NAC- and 8 MYB-domain containing genes) essential for normal secondary cell wall development. In the alfalfa genotypes examined in this study, putative NAC genes Mtr.50934.1.S1_at and Mtr.25921.1.S1_at were up-regulated in both ES and PES internodes of genotype 252 compared to the same tissues in genotype 1283. Three MYB genes (Mtr.6897.1.S1_at, Mtr.42648.1.S1_at, Mtr.44850.1.S1_at) were up-regulated in 252 PES internodes compared to 1283 PES internodes. Among these, Mtr.42648.1.S1_at is a putative homolog of AtMYB63 (86% identical at the protein level), an SND1-regulated MYB transcription factor that specifically activates lignin biosynthetic genes during secondary cell wall formation in Arabidopsis. AtMYB63 was specifically expressed in fibers and vessels undergoing secondary cell wall thickening. Over-expression of AtMYB63 resulted in specific activation of lignin biosynthetic genes causing ectopic deposition of lignin in normally non-lignifying cells. Suppression of AtMYB63 led to a reduction in secondary cell wall thickening and lignin content.
Mtr.42648.1.S1_at also has high sequence homology with two other SND1-regulated AtMYB genes (AtMYB85 and AtMYB103) with 73% and 70% identity at the protein level, respectively. Over-expression of AtMYB103 and AtMYB85 led to an increase in secondary cell wall thickening in fibers and ectopic deposition of lignin in epidermal and cortical cells in stems . Dominant repression of AtMYB103 and AtMYB85 resulted in significantly reduced secondary cell wall thickening in fiber cells. We also identified numerous other differentially expressed transcription factor families that have not been previously reported to play a role in cell wall development. For example, zinc finger (14 genes total) and WRKY (18 genes total) were the most abundant families among the differentially expressed transcription factors in genotypes 252 and 1283. Other significant transcription factor families identified include bHLH, b-ZIP, and AP2/EREBP (Additional file 1).
Receptor-like kinases (RLKs) were also significantly over-represented among genes up-regulated in genotype 252 PES internodes compared to genotype 1283 PES internodes (Figure 7, Additional file 2). RLKs are known to play significant roles in plant growth, development and defence responses [51–53]. There are more than 600 RLKs in the Arabidopsis genome. Several recent reports suggested a significant role for RLKs in regulating cell wall development. For example, a loss of function mutant of THESEUS1, a plasma membrane receptor kinase, suppressed the ectopic lignification and growth inhibition phenotype of prc1-1, a recessive CELLULOSE SYNTHASE 6 Arabidopsis mutant, by repressing the induction of stress responses. These results suggested that the THESEUS1 RLK acts as sensor of cell wall integrity . Mutations in two leucine-rich repeat (LRR) RLKs (FEI1 and FEI2) disrupt anisotropic expansion and the synthesis of cell wall polymers including cellulose biosynthesis . WAKs (wall-associated Ser/Thr receptor kinases) are tightly bound to the cell wall and are thought to play a significant role in regulating cell wall function as well [56, 57]. Among the 32 putative RLKs identified in genotypes 252 and 1283, 23 were up-regulated in 252 PES internodes, one was up-regulated in both ES and PES internodes of 252, and one was up-regulated in 252 ES internodes compared to 1283 ES or PES internodes. One of the RLKs up-regulated in 252 PES internodes (Mtr.9325.1.S1_at) is a homolog of Arabidopsis FEI1. Two putative WAKs (Mtr.13054.1.S1_at and Mtr.3807.1.S1_at) were up-regulated in 252 PES internodes as well.
We also identified differentially expressed genes involved in lignin biosynthesis in the stems of alfalfa genotypes 252 and 1283 (Figure 8). For example, hydroxycinnamoyltransferase (HCT: Mtr.34427.1.S1_at) and caffeoyl-CoA 3-O-methyltransferase (CCoAOMT: Mtr.40942.1.S1_at) were up-regulated in both ES and PES internodes of genotype 252 compared to 1283. However, the degree of up-regulation in 252 was greater in PES internodes compared to ES internodes. In addition, three putative laccase genes (LAC17: Mtr.39737.1.S1_at, Mtr.45364.1.S1_at, and Mtr.4126.1.S1_at) and a putative 4-coumarate-CoA ligase gene (4CL: Mtr.42330.1.S1_at) were up-regulated in genotype 252 PES compared to genotype 1283. Interestingly, some genes involved in lignin biosynthesis were down-regulated in genotype 252 compared to genotype 1283. For example, two putative CCoAOMT genes (Mtr.31539.1.S1_at and Mtr.4850.1.S1_at) and cinnamyl-alcohol dehydrogenase (CAD: Mtr.8985.1.S1_at and Mtr.6181.1.S1_at) were down-regulated in genotype 252 compared to genotype 1283.
Differences in gene expression between stem internodes of genotypes 252 and 1283 are consistent with differences in cell wall composition
Overall, our results show significant up-regulation of a number of regulatory and cell wall-related genes in PES internodes of genotype 252 compared to genotype 1283. Many of the regulatory genes that were up-regulated are putative transcription factors and receptor kinases. Several of these up-regulated genes are known to play a role in the development of secondary xylem in Arabidopsis (e.g., AtMYB63, AtMYB85, AtMYB103, and FEI1). In addition, putative CesA genes that play a role in secondary cell wall development and putative genes involved in lignin synthesis were up-regulated in 252 PES internodes compared to 1283 PES internodes. The up-regulation of these regulatory and cell wall-related genes may play a role in the greater cell wall concentration and modified composition of PES internodes of genotype 252. On a dry matter basis, cellulose and lignin concentrations of stems of flowering plants (primarily PES internodes) are significantly higher in genotype 252 compared to genotype 1283 (302 g kg-1 vs. 257 g kg-1 for cellulose and 117 g kg-1 vs. 98 g kg-1 for Klason lignin, respectively). The greater cellulose and lignin concentrations in stems of genotype 252 compared to genotype 1283 are associated with an 11% increase in total cell wall dry matter and a reduction in concentration of pectin sugar residues in the cell wall. The genotypic differences in cell wall concentration and composition of PES internodes are consistent with greater deposition of secondary xylem in internodes of genotype 252 compared to genotype 1283. Previous research has shown that increased development of secondary xylem increases cell wall concentration in alfalfa stems expressed on a dry weight basis. Furthermore, the thick secondary walls of this tissue are rich in cellulose, xylan and lignin, but contain little if any pectin compared to primary cell walls.
The candidate genes identified in this study, especially transcription factor genes and genes involved in secondary cell wall synthesis, may play important roles in the development of secondary xylem in PES internodes of alfalfa. Future research involving transgenic approaches will be used to evaluate the role of these genes in the deposition of secondary xylem in alfalfa stems. Modifying the amount and composition of secondary xylem in stems of alfalfa will improve the value of alfalfa as a cellulosic feedstock.
Validation of selected candidate genes
A subset of 50 differentially expressed candidate genes from three functional categories (regulatory, signalling and cell wall-related genes) was initially selected for real-time quantitative RT-PCR validation. However, only 34 of these genes produced a single amplicon based on dissociation curves (see Methods for details). Most primers for real-time quantitative RT-PCR were designed using M. truncatula sequences because most probe sets selected for validation were designed from M. truncatula sequences. Sequence variation between the two Medicago species and among multi-gene families within species may explain the lower than expected RT-PCR success rate.
Candidate genes used for real-time quantitative RT-PCR validation. [Table body not recovered. Surviving column header: Probe Set ID. Surviving row groups and annotations: Cell Wall-related Genes (reversibly glycosylated polypeptide); annotations listed without a surviving group header (LOB gene family; bZIP transcription factor; C2H2 zinc finger family, three entries; homeobox transcription factor, three entries; MYB transcription factor; NAC transcription factor; PHD finger family; zinc finger family); Signal Transduction Genes.]
Physical mapping of differentially expressed genes
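Clustering simulation analysis of co-expressed genes (50 kb window) in ES and PES internodes of alfalfa genotypes 252 and 1283. [Table values not recovered. Surviving column headers: Number of Genes†, SD from Simulation Mean¶. Surviving row labels: "up in 252" and "down in 252", each listed twice, presumably once for ES and once for PES internodes.]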
To some degree, the clustering of co-expressed genes detected in this study may represent tandem repeats of duplicated genes. During the physical mapping process, some probe sets were targeted to closely linked multiple loci on the M. truncatula genome. If multiple loci hits per probe set were detected, we mapped only the top hit locus for each differentially expressed probe set onto the M. truncatula genome. By doing so, we reduced the chances that clusters detected in our cluster simulation analysis were due to tandem repeats of duplicated genes. Thus, the majority of co-expressed gene loci used for clustering simulation analysis are sequence unrelated. Previous studies conducted in other model systems reported chromosomal clustering of co-expressed genes even after removing duplicated genes [68, 69].
Numerous studies have reported co-expression of neighbouring genes in eukaryotes [70–74] including Arabidopsis [68, 75, 76] and rice. Co-regulated gene clusters often share the same biological functions and/or are in the same pathway [72, 75]. These co-expressed genes could be regulated by the same transcription factor or share the same promoter elements for co-regulation. Natural selection may promote the clustering of co-expressed genes as well. However, the mechanism behind the clustering of co-expressed genes is still unclear.
Chromosomal segments with clusters of co-expressed candidate genes will be useful for alfalfa breeding, especially for wide-crosses involving introgression of foreign chromosomal segments from alien species into elite alfalfa cultivars. In addition, the genomic DNA sequence of multiple candidate genes can be obtained by sequencing a BAC containing the candidate gene cluster.
Masking biased probes due to inter-species variable (ISV) regions and SFPs increased the sensitivity and accuracy of the transcript profiling data for alfalfa when using the Medicago GeneChip as a cross-species platform. The masking protocol developed in this study can be applied to other CSH studies involving the use of GeneChips for transcript profiling. The transcript profiling data, indicating up-regulation of putative cellulose and lignin genes involved in secondary cell wall thickening in 252 PES internodes compared to 1283 PES internodes, are consistent with the differences in cell wall concentration and composition between the two genotypes. Numerous cell wall and regulatory genes that may contribute to differences in cell wall composition and concentration of the two alfalfa genotypes were identified. These candidate genes will be useful for improving alfalfa as a cellulosic feedstock via a transgenic approach. Physical mapping and clustering simulation analysis of the differentially expressed alfalfa genes on orthologous loci of M. truncatula suggested chromosomal regions where statistically significant co-expression of neighbouring genes occurred.
Alfalfa [Medicago sativa (L.) subsp. sativa] clonal lines 252 and 1283 were selected as previously described.
RNA extraction, labelling and GeneChip hybridization
Elongating and post-elongation stem internodes of genotypes 252 and 1283 grown in the greenhouse were harvested as previously described. Methods used for RNA extraction, labelling, and GeneChip hybridization were previously described. The raw data cel files used in this study are available in the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE13602).
Masking probes targeting variable regions
where Ts is the total number of sample files, R is the replicate number of each sample type and S is the number of sample types. The floor and ceiling are mathematical functions that map a real number to the next smallest and next largest integer, respectively. In this study, we used genotypes 252 and 1283 with ES and PES internode tissues for each genotype (S = 4) and three replicates (R = 3) for a total of 12 sample files (Ts = 12, P = 0.83). Based on the defined percentage (P), we created a series of mask files at each intensity point for all samples. A file was also created for masking the probe locations of the previously identified 10,890 single-feature polymorphisms (SFPs) in ES and PES internodes of alfalfa genotypes 252 and 1283 .
Next, we applied the series of masking files to the cel files of ES and PES internodes of genotype 252 using Expressionist Refiner module (http://www.genedata.com). Briefly, the raw data cel files were loaded into Refiner with the masking files. The RMA algorithm summarized the probe-level signals into probe set expression indexes. The resulting expression index files were evaluated by comparing them to the expression files obtained from the same internode tissues in the reference species, M. truncatula. For each signal intensity, we examined the correlation of the PES/ES expression ratio for the commonly-selected genes from the two Medicago species. Commonly-selected genes are defined as genes that exhibit at least a 2-fold difference in gene expression in ES and PES internodes of the two species. The masked expression data that yielded the best performance, as evaluated by optimization of both sensitivity and accuracy, was selected for differential gene expression analysis.
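The evaluation step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the inputs are assumed to be log2(PES/ES) values per probe set obtained after RMA summarization at one masking threshold, and the 2-fold criterion is applied in either direction because the text does not say whether the direction of change must agree between species.

```python
import math

def commonly_selected(alfalfa, truncatula, min_fold=2.0):
    """Probe sets with at least a min_fold PES/ES difference in both species.

    Both arguments map probe set ID -> log2(PES/ES). Direction of change is
    not required to agree (an assumption, not stated in the text).
    """
    cut = math.log2(min_fold)
    shared = alfalfa.keys() & truncatula.keys()
    return sorted(g for g in shared
                  if abs(alfalfa[g]) >= cut and abs(truncatula[g]) >= cut)

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def score_threshold(alfalfa, truncatula):
    """Return (sensitivity proxy, accuracy proxy) for one masking threshold."""
    genes = commonly_selected(alfalfa, truncatula)
    r = pearson([alfalfa[g] for g in genes], [truncatula[g] for g in genes])
    return len(genes), r
```

Calling score_threshold once per mask file would give the two curves (number of commonly-selected genes and correlation) whose trade-off is used to pick the threshold.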
Detection of differentially expressed genes
After selection of the optimum intensity threshold for masking, the differentially expressed genes were identified in the masked data set by applying a t-test to the expression values in ES or PES internodes (3 replications for each tissue type) of the two alfalfa genotypes (e.g., 252 ES vs. 1283 ES) with a p-value and FDR cutoff of 0.001 and 0.05, respectively. An additional ratio cutoff of 2-fold was applied using the Genedata Expressionist Analyst module (http://www.genedata.com/). The gene expression signals corresponding to the bacterial microsymbiont (Sinorhizobium meliloti) probe sets were excluded in this analysis.
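The three cutoffs can be combined as in the following sketch. It assumes that per-probe-set t-test p-values and log2(252/1283) ratios have already been computed, and it uses the Benjamini-Hochberg procedure for the FDR step; the text does not state which FDR method the Genedata software applied, so that choice, the function names, and the strict inequalities are assumptions for illustration.

```python
import math

def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so monotonicity can be enforced.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

def select_de(probe_ids, pvalues, log2_ratios,
              p_cut=0.001, fdr_cut=0.05, fold_cut=2.0):
    """Probe sets passing the p-value, FDR, and fold-change cutoffs."""
    q = benjamini_hochberg(pvalues)
    min_log2 = math.log2(fold_cut)
    return [pid for pid, p, qv, lr in zip(probe_ids, pvalues, q, log2_ratios)
            if p < p_cut and qv < fdr_cut and abs(lr) > min_log2]
```

For example, select_de(["g1", "g2"], [0.0001, 0.0005], [1.5, 0.4]) keeps only "g1", because "g2" fails the 2-fold criterion.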
Functional classification and over-representation analysis
The MapMan gene functional classification system was assigned to the probe sets on the Medicago GeneChip following the method previously described. The functional class over-representation analysis was performed using PageMan as previously described except that the log2(252/1283) values for ES and PES internodes were given to the selected probe sets and the remainder of the probe sets on the GeneChip (not selected) were given a false expression value of "zero". For over-representation analysis, the z-value cutoff was set as 1 after Bonferroni correction.
Real-time quantitative RT-PCR
Total RNA used for GeneChip hybridization was also used to make cDNAs for real-time quantitative RT-PCR. First strand cDNAs for each sample were made using random hexamers and Taqman Reverse Transcription Reagents (Applied Biosystems, CA) following the manufacturer's recommendations. Gene specific primers for the selected probe sets were designed based on the consensus sequences (http://www.affymetrix.com) using Primer Express (Applied Biosystems, CA) (Additional file 5). Samples and standards were run in triplicate on each plate and repeated on at least two plates using SYBR-Green PCR Master Mix (Applied Biosystems, CA) on a GeneAmp 7500 Sequence Detection System (Applied Biosystems, CA) following the manufacturer's recommendations. Real-time quantitative RT-PCR was performed in a 20 μl reaction containing 7 μl ddH2O, 10 μl 2× PCR mix, 1 μl forward primer (4 μM), 1 μl reverse primer (4 μM), and 1 μl of template cDNA (10 ng/μl). The PCR conditions were two minutes of pre-incubation at 50°C, 10 minutes of pre-denaturation at 94°C, 40 cycles of 15 seconds at 95°C and one minute at 60°C, followed by steps for dissociation curve generation (30 seconds at 95°C, 60 seconds at 60°C and 30 seconds at 95°C). The 7500 System SDS software v.1.2.2 was used for data collection and analysis. Dissociation curves for each amplicon were carefully examined to confirm the specificity of the primer pair used. Relative transcript levels for each sample were obtained using the "comparative CT method". The threshold cycle (CT) value obtained after each reaction was normalized to the CT value of 18S rRNA. The relative expression level was obtained by calibrating the ΔΔCT values for other samples using a normalized CT value (ΔΔCT) for the PES internodes of alfalfa genotype 252.
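As a worked illustration of the comparative CT calculation, the sketch below uses the standard definitions: ΔCT is the target CT minus the 18S rRNA CT, and ΔΔCT is the sample ΔCT minus the calibrator ΔCT (here the PES internodes of genotype 252). The conversion to relative expression as 2 to the power of -ΔΔCT assumes roughly 100% amplification efficiency, which the text does not state; the function names and the CT values in the usage note are hypothetical.

```python
def delta_ct(ct_target, ct_reference):
    """Normalize a target CT to the reference gene CT (18S rRNA in the text)."""
    return ct_target - ct_reference

def relative_expression(sample_ct, sample_ref_ct,
                        calibrator_ct, calibrator_ref_ct):
    """Relative transcript level by the comparative CT method.

    Assumes a doubling of product per cycle (2 ** -ddCT).
    """
    ddct = (delta_ct(sample_ct, sample_ref_ct)
            - delta_ct(calibrator_ct, calibrator_ref_ct))
    return 2.0 ** (-ddct)
```

With made-up values, a sample with CT 24 (18S CT 10) compared to a calibrator with CT 22 (18S CT 10) gives ΔΔCT = 2 and a relative expression of 0.25; the calibrator compared to itself gives 1.0.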
Physical mapping and frequency distribution
The Medicago genome release version 2.0 (Mt2.0) (http://www.medicago.org/genome/) contains 38,844 coding sequences with various degrees of annotation and predicted chromosome locations. We used these coding sequences to search against the Medicago GeneChip Probe consensus sequences database (http://www.affymetrix.com) using blastn with match matrix BLOSUM62 and a mismatch penalty of -3. Hits were retained using an E value cutoff of 0.0001 and a bit score cutoff of 100. If a probe set hit multiple loci, only the top hit locus was mapped for that probe set to minimize the effect of tandem repeats during the clustering simulation analysis. This analysis generated putative orthologous chromosome locations for 36,709 of the Medicago GeneChip probe sets. We used an R script to map the differentially expressed genes (p < 0.001, >2-fold difference) between alfalfa genotypes 252 and 1283 [ES internodes (red triangles) and PES internodes (blue circles)] onto the putative orthologous loci on M. truncatula chromosomes 1 through 8.
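The hit filtering and top-hit selection can be sketched as follows. This Python example is illustrative only (the mapping in this study was done with an R script). It assumes that the cutoffs are inclusive (E value ≤ 0.0001, bit score ≥ 100) and that the "top hit" is the hit with the highest bit score; the `Hit` record, the function name, and all identifiers are hypothetical.

```python
from typing import Dict, Iterable, NamedTuple


class Hit(NamedTuple):
    locus: str        # Mt2.0 coding sequence (query)
    probe_set: str    # GeneChip probe set consensus sequence (subject)
    evalue: float
    bit_score: float


def top_hit_per_probe_set(hits: Iterable[Hit],
                          max_evalue: float = 1e-4,
                          min_bit_score: float = 100.0) -> Dict[str, str]:
    """Map each probe set to the locus of its best-scoring retained hit."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        # Discard hits that fail either cutoff (inclusive cutoffs assumed).
        if hit.evalue > max_evalue or hit.bit_score < min_bit_score:
            continue
        current = best.get(hit.probe_set)
        if current is None or hit.bit_score > current.bit_score:
            best[hit.probe_set] = hit
    return {probe_set: hit.locus for probe_set, hit in best.items()}


# Hypothetical hits: ps1 matches two loci, ps2 and ps3 fail one cutoff each.
example_hits = [
    Hit("locusA", "ps1", 1e-50, 300.0),
    Hit("locusB", "ps1", 1e-60, 350.0),
    Hit("locusC", "ps2", 1e-3, 150.0),
    Hit("locusD", "ps3", 1e-10, 80.0),
]
mapping = top_hit_per_probe_set(example_hits)
```

Keeping a single locus per probe set means that a probe set matching several tandemly repeated genes contributes only one position to the later clustering analysis.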
Frequencies of differentially expressed genes on chromosomes 1 through 8 in ES and PES internodes were examined in a 50 kb sliding window. For each tissue, the physical location information of the differentially expressed genes on the M. truncatula chromosomes was extracted. The frequency of genes selected within a sliding 50 kb window was calculated. This sliding window shifted every 10 kb along the chromosome. Within each window, the calculated gene frequency was plotted against chromosome distance in kb.
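The sliding-window count can be sketched in Python as shown here. The sketch is not the code used in this study; it assumes half-open windows that start at position 0 and stop once a full window no longer fits on the chromosome, and the function name and gene positions are hypothetical.

```python
from bisect import bisect_left


def sliding_window_counts(positions, chrom_length, window=50_000, step=10_000):
    """Count genes in each window [start, start + window), shifted by `step`.

    positions    -- gene start coordinates in bp on one chromosome
    chrom_length -- chromosome length in bp
    Returns a list of (window_start, gene_count) tuples.
    """
    ordered = sorted(positions)
    last_start = max(chrom_length - window, 0)
    counts = []
    for start in range(0, last_start + 1, step):
        lo = bisect_left(ordered, start)
        hi = bisect_left(ordered, start + window)
        counts.append((start, hi - lo))
    return counts


# Hypothetical gene positions on a 100 kb stretch of chromosome.
genes = [5_000, 12_000, 55_000, 58_000, 99_000]
profile = sliding_window_counts(genes, chrom_length=100_000)
```

Because the window is five times wider than the step, each gene is counted in up to five consecutive windows, which smooths the plotted frequency profile.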
The simulation protocol described by Grant et al. was used to test for clustering of selected differentially expressed candidate genes on chromosomes. Mt2.0_pseudomolecule contains a total of 266,102,767 bases on 8 chromosomes. This genome sequence was partitioned into a total of 3,856 and 2,122 bins based on 50 kb and 100 kb windows, respectively. The simulation program randomly positioned a defined number of selected genes on the genome, and the number of bins containing each frequency of assigned genes was determined. The simulation was repeated 2,000 times. The difference between experimental data (selected gene distribution) and simulated data (random distribution) was considered statistically significant if the absolute value of (experimental data − simulation mean)/(simulation standard deviation) was ≥ 2. A significant difference is indicative of clustering within the defined 50 kb or 100 kb window.
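A minimal Python sketch of this kind of permutation test is given below. It is not the simulation program of Grant et al.: it assumes that every bin is equally likely to receive a gene, compares the number of bins holding exactly k genes, and uses the sample standard deviation of the simulated counts. The function names, the seed, and the example data are hypothetical.

```python
import math
import random
import statistics
from collections import Counter


def bins_with_k_genes(bin_ids, k, n_bins):
    """Number of bins that hold exactly k genes (k = 0 counts empty bins)."""
    per_bin = Counter(bin_ids)
    if k == 0:
        return n_bins - len(per_bin)
    return sum(1 for count in per_bin.values() if count == k)


def clustering_z(observed_bin_ids, n_bins, k, n_sim=2000, seed=0):
    """(observed - simulation mean) / simulation SD for bins holding k genes."""
    rng = random.Random(seed)
    n_genes = len(observed_bin_ids)
    simulated = []
    for _ in range(n_sim):
        # Place the same number of genes at random, one bin per gene.
        random_bins = [rng.randrange(n_bins) for _ in range(n_genes)]
        simulated.append(bins_with_k_genes(random_bins, k, n_bins))
    mean = statistics.mean(simulated)
    sd = statistics.stdev(simulated)
    observed = bins_with_k_genes(observed_bin_ids, k, n_bins)
    if sd == 0:
        # All simulations agreed; any departure is unbounded in SD units.
        return 0.0 if observed == mean else math.copysign(math.inf, observed - mean)
    return (observed - mean) / sd


# Hypothetical data: 30 genes packed pairwise into 15 of 1,000 bins.
observed = [b for b in range(15) for _ in range(2)]
z = clustering_z(observed, n_bins=1000, k=2, n_sim=200)
significant = abs(z) >= 2
```

Running the function once per value of k gives the full comparison between the observed and simulated bin-occupancy distributions; an excess of bins with two or more genes is what would indicate clustering.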
EST: expressed sequence tag
SNP: single nucleotide polymorphism
RMA: robust multi-array average
This work was carried out in part using computing resources at the University of Minnesota Supercomputing Institute for Advanced Computational Research. Funding for this research was provided by USDA-ARS CRIS Project 3640-12210-001-00D. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture.
- Michaud R, Lehman WF, Rumbaugh MD: World distribution and historical development. Alfalfa and alfalfa improvement -- Agronomy Monograph no. 29. 1988, Madison, WI: ASA-CSSA-SSSA, 25-91.
- Samac DA, Jung H-JG, Lamb JFS: Development of alfalfa (Medicago sativa L.) as a feedstock for production of ethanol and other bioproducts. Alcoholic Fuels. Edited by: Minteer SD. 2006, Boca Raton, FL: CRC Press, 79-98.
- National Agricultural Statistics Service: On-line resource. 2008, [http://www.nass.usda.gov]
- Russelle MP, Birr AS: Large-scale assessment of symbiotic dinitrogen fixation by crops: Soybean and alfalfa in the Mississippi river basin. Agron J. 2004, 96: 1754-1760.
- Angers DA: Changes in soil aggregation and organic carbon under corn and alfalfa. Soil Sci Soc Am J. 1992, 56: 1244-1249.
- Russelle MP, Lamb JFS, Turyk NB, Shaw BH, Pearson B: Managing nitrogen contaminated soils: benefits of N2-fixing alfalfa. Agron J. 2007, 99: 738-746. 10.2134/agronj2005.0325.
- Lamb JFS, Jung H-JG, Sheaffer CC, Samac DA: Alfalfa leaf protein and stem cell wall polysaccharide yields under hay and biomass management systems. Crop Sci. 2007, 47: 1407-1415. 10.2135/cropsci2006.10.0665.
- Chapple C, Ladisch M, Meilan R: Loosening lignin's grip on biofuel production. Nature Biotech. 2007, 25: 746-748. 10.1038/nbt0707-746.
- Chen F, Dixon RA: Lignin modification improves fermentable sugar yields for biofuel production. Nature Biotech. 2007, 25: 759-761. 10.1038/nbt1316.
- Yang SS, Xu WW, Tesfaye M, Lamb JFS, Jung H-JG, Samac DA, Vance CP, Gronwald JW: Single-feature polymorphism discovery in the transcriptome of tetraploid alfalfa. Plant Genome. 2009, 2: 224-232. 10.3835/plantgenome2009.03.0014.
- Tesfaye M, Silverstein KAT, Bucciarelli B, Samac DA, Vance CP: The Affymetrix Medicago GeneChip® array is applicable for transcript analysis of alfalfa (Medicago sativa). Func Plant Biol. 2006, 33: 783-788. 10.1071/FP06065.
- Tesfaye M, Yang SS, Lamb JFS, Jung H-JG, Samac DA, Vance CP, Gronwald JW, VandenBosch KA: Medicago truncatula as a model for dicot cell wall development. Bioenergy Res. 2009, 2: 59-76. 10.1007/s12155-009-9034-1.
- Enard W, Khaitovich P, Klose J, Zöllner S, Heissig F, Giavalisco P: Intra- and interspecific variation in primate gene expression patterns. Science. 2002, 296: 340-343. 10.1126/science.1068996.
- Cáceres M, Lachuer J, Zapala MA, Redmond JC, Kudo L, Geschwind DH: Elevated gene expression levels distinguish human from non-human primate brains. Proc Natl Acad Sci USA. 2003, 100: 13030-13035. 10.1073/pnas.2135499100.
- Horvath DP, Schaffer R, West M, Wisman E: Arabidopsis microarrays identify conserved and differentially expressed genes involved in shoot growth and development from distantly related plant species. Plant J. 2003, 34: 125-134. 10.1046/j.1365-313X.2003.01706.x.
- Meiklejohn CD, Parsch J, Ranz JM, Hartl DL: Rapid evolution of male-biased gene expression in Drosophila. Proc Natl Acad Sci USA. 2003, 100: 9894-9899. 10.1073/pnas.1630690100.
- Michalak P, Noor MAF: Genome-wide patterns of expression in Drosophila pure-species and hybrid males. Mol Biol Evol. 2003, 20: 1070-1076. 10.1093/molbev/msg119.
- Ranz JM, Castillo-Davis CI, Meiklejohn CD, Hartl DL: Sex-dependent gene expression and evolution of the Drosophila transcriptome. Science. 2003, 300: 1742-1745. 10.1126/science.1085881.
- Becher M, Talke IN, Krall L, Krämer U: Cross-species microarray transcript profiling reveals high constitutive expression of metal homeostasis genes in shoots of the zinc hyperaccumulator Arabidopsis halleri. Plant J. 2004, 37: 251-268.
- Close TJ, Wanamaker SI, Caldo RA, Turner SM, Ashlock DA, Dickerson JA, Wing RA, Muehlbauer GJ, Kleinhofs A, Wise RP: A new resource for cereal genomics: 22 K barley genechip comes of age. Plant Physiol. 2004, 134: 960-968. 10.1104/pp.103.034462.
- Ji W, Zhou W, Gregg K, Yu N, Davis S, Davis S: A method for cross-species gene expression analysis with high-density oligonucleotide arrays. Nucleic Acids Res. 2004, 32: e93. 10.1093/nar/gnh084.
- Khaitovich P, Weiss G, Lachmann M, Hellmann I, Enard W, Muetzel B, Wirkner U, Ansorge W, Pääbo S: A neutral model of transcriptome evolution. PLoS Biology. 2004, 2: 682-689. 10.1371/journal.pbio.0020132.
- Nuzhdin SV, Wayne ML, Harmon KL, McIntyre LM: Common pattern of evolution of gene expression level and protein sequence in Drosophila. Mol Biol Evol. 2004, 21: 1308-1317. 10.1093/molbev/msh128.
- Uddin M, Wildman DE, Liu G, Xu W, Johnson RM, Hof PR: Sister grouping of chimpanzees and humans as revealed by genome-wide phylogenetic analysis of brain gene expression profiles. Proc Natl Acad Sci USA. 2004, 101: 2957-2962. 10.1073/pnas.0308725100.
- Weber M, Harada E, Vess C, Roepenack-Lahaye E, Clemens S: Comparative microarray analysis of Arabidopsis thaliana and Arabidopsis halleri roots identifies nicotianamine synthase, a ZIP transporter and other genes as potential metal hyperaccumulation factors. Plant J. 2004, 37: 269-281. 10.1111/j.1365-313X.2003.02013.x.
- Wang Z, Lewis MG, Nau ME, Arnold A, Vahey MT: Identification and utilization of inter-species conserved (ISC) probesets on Affymetrix human GeneChip® platforms for the optimization of the assessment of expression patterns in non human primate (NHP) samples. BMC Bioinformatics. 2004, 5 (1): 165. 10.1186/1471-2105-5-165.
- Hammond JP, Broadley MR, Craigon DJ, Higgins J, Emmerson ZF, Townsend HJ, White PJ, May ST: Using genomic DNA-based probe-selection to improve the sensitivity of high-density oligonucleotide arrays when applied to heterologous species. Plant Methods. 2005, 1: 10. 10.1186/1746-4811-1-10.
- Moore S, Payton P, Wright M, Tanksley S, Giovannoni J: Utilization of tomato microarrays for comparative gene expression analysis in the Solanaceae. J Exp Bot. 2005, 56: 2885-2895. 10.1093/jxb/eri283.
- Vallée M, Robert C, Méthot S, Palin M-F, Sirard M-A: Cross-species hybridizations on a multi-species cDNA microarray to identify evolutionarily conserved genes expressed in oocytes. BMC Genomics. 2006, 7: 113. 10.1186/1471-2164-7-113.
- Hammond JP, Bowen HC, White PJ, Mills V, Pyke KA, Baker AJM, Whiting SN, May ST, Broadley MR: A comparison of the Thlaspi caerulescens and Thlaspi arvense shoot transcriptomes. New Phytologist. 2006, 170: 239-260. 10.1111/j.1469-8137.2006.01662.x.
- Nieto-Díaz M, Pita-Thomas W, Nieto-Sampedro M: Cross-species analysis of gene expression in non-model mammals: reproducibility of hybridization on high density oligonucleotide microarrays. BMC Genomics. 2007, 8: 89. 10.1186/1471-2164-8-89.
- Chain FJJ, Ilieva D, Evans BJ: Single-species microarrays and comparative transcriptomics. PLoS ONE. 2008, 3 (9): e3279. 10.1371/journal.pone.0003279.
- Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003, 31: e15. 10.1093/nar/gng015.
- Borevitz JO, Liang D, Plouffe D, Chang H-S, Zhu T, Weigel D, Berry CC, Winzeler E, Chory J: Large-scale identification of single-feature polymorphisms in complex genomes. Genome Res. 2003, 13: 513-523. 10.1101/gr.541303.
- Walter NAR, McWeeney SK, Peters ST, Belknap JK, Hitzemann R, Buck KJ: SNPs matter: impact on detection of differential expression. Nature Methods. 2007, 4: 679-680. 10.1038/nmeth0907-679.
- DeCook R, Lall S, Nettleton D, Howell SH: Genetic regulation of gene expression during shoot development in Arabidopsis. Genetics. 2006, 172: 1155-1164. 10.1534/genetics.105.042275.
- Thimm O, Bläsing O, Gibon Y, Nagel A, Meyer S, Krüger P, Selbig J, Müller LA, Rhee SY, Stitt M: MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J. 2004, 37: 914-939. 10.1111/j.1365-313X.2004.02016.x.
- Usadel B, Nagel A, Steinhauser D, Gibon Y, Bläsing OE, Redestig H, Sreenivasulu N, Krall L, Hannah MA, Poree F, Fernie AR, Stitt M: PageMan: An interactive ontology tool to generate, display, and annotate overview graphs for profiling experiments. BMC Bioinformatics. 2006, 7: 535. 10.1186/1471-2105-7-535.
- Demura T, Fukuda H: Transcriptional regulation in wood formation. Trends Plant Sci. 2007, 12: 64-70. 10.1016/j.tplants.2006.12.006.
- Kubo M, Udagawa M, Nishikubo N, Horiguchi G, Yamaguchi M, Ito J, Mimura T, Fukuda H, Demura T: Transcription switches for protoxylem and metaxylem vessel formation. Genes Dev. 2005, 19: 1855-1860. 10.1101/gad.1331305.
- Mitsuda N, Seki M, Shinozaki K, Ohme-Takagi M: The NAC transcription factors NST1 and NST2 of Arabidopsis regulate secondary wall thickenings and are required for anther dehiscence. Plant Cell. 2005, 17: 2993-3006. 10.1105/tpc.105.036004.
- Mitsuda N, Iwase A, Yamamoto H, Yoshida M, Seki M, Shinozaki K, Ohme-Takagi M: NAC transcription factors, NST1 and NST3, are key regulators of the formation of secondary walls in woody tissues of Arabidopsis. Plant Cell. 2007, 19: 270-280. 10.1105/tpc.106.047043.
- Zhong R, Demura T, Ye Z-H: SND1, a NAC domain transcription factor, is a key regulator of secondary wall synthesis in fibers of Arabidopsis. Plant Cell. 2006, 18: 3158-3170. 10.1105/tpc.106.047399.
- Zhong R, Richardson EA, Ye Z-H: The MYB46 transcription factor is a direct target of SND1 and regulates secondary wall biosynthesis in Arabidopsis. Plant Cell. 2007, 19: 2776-2792. 10.1105/tpc.107.053678.
- Zhong R, Lee C, Zhou J, McCarthy RL, Ye Z-H: A battery of transcription factors involved in the regulation of secondary cell wall biosynthesis in Arabidopsis. Plant Cell. 2008, 20: 2763-2782. 10.1105/tpc.108.061325.
- Yang C, Xu Z, Song J, Conner K, Barrena GV, Wilson ZA: Arabidopsis MYB26/MALE STERILE35 regulates secondary thickening in the endothecium and is essential for anther dehiscence. Plant Cell. 2007, 19: 534-548. 10.1105/tpc.106.046391.
- Zhou J, Lee C, Zhong R, Ye Z-H: MYB58 and MYB63 are transcriptional activators of the lignin biosynthetic pathway during secondary cell wall formation in Arabidopsis. Plant Cell. 2009, 21: 248-266. 10.1105/tpc.108.063321.
- Kawaoka A, Kaothien P, Yoshida K, Endo S, Yamada K, Ebinuma H: Functional analysis of tobacco LIM protein Ntlim1 involved in lignin biosynthesis. Plant J. 2000, 22: 289-301. 10.1046/j.1365-313x.2000.00737.x.
- Rogers LA, Campbell MM: The genetic control of lignin deposition during plant growth and development. New Phytol. 2004, 164: 17-30. 10.1111/j.1469-8137.2004.01143.x.
- Goicoechea M, Lacombe E, Legay S, Mihaljevic S, Rech P, Jauneau A, Lapierre C, Pollet B, Verhaegen D, Chaubet-Gigot N, Grima-Pettenati J: Eg MYB2, a new transcriptional activator from Eucalyptus xylem, regulates secondary cell wall formation and lignin biosynthesis. Plant J. 2005, 43: 553-567. 10.1111/j.1365-313X.2005.02480.x.
- Shiu S-H, Bleecker AB: Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinases. Proc Natl Acad Sci USA. 2001, 98: 10763-10768. 10.1073/pnas.181141598.
- Shiu S-H, Karlowski WM, Pan R, Tzeng Y-H, Mayer KFX, Li W-H: Comparative analysis of the receptor-like kinase family in Arabidopsis and rice. Plant Cell. 2004, 16: 1220-1234. 10.1105/tpc.020834.
- Afzal AJ, Wood AJ, Lightfoot DA: Plant receptor-like serine threonine kinases: Roles in signaling and plant defense. Molecular Plant-Microbe Interactions. 2008, 21: 507-517. 10.1094/MPMI-21-5-0507.
- Hématy K, Sado P-E, Van Tuinen A, Rochange S, Desnos T, Balzergue S, Pelletier S, Renou J-P, Höfte H: A receptor-like kinase mediates the response of Arabidopsis cells to the inhibition of cellulose synthesis. Curr Biol. 2007, 17: 922-931. 10.1016/j.cub.2007.05.018.
- Xu S-L, Rahman A, Baskin TI, Kieber JJ: Two leucine-rich repeat receptor kinases mediate signalling, linking cell wall biosynthesis and ACC synthase in Arabidopsis. Plant Cell. 2008, 20: 3065-3079. 10.1105/tpc.108.063354.
- He Z-H, Fujiki M, Kohorn BD: A cell wall-associated, receptor-like protein kinase. J Biol Chem. 1996, 271: 19789-19793. 10.1074/jbc.271.33.19789.
- Anderson CM, Wagner TA, Perret M, He Z-H, He D, Kohorn BD: WAKs: cell wall-associated kinases linking the cytoplasm to the extracellular matrix. Plant Mol Biol. 2001, 47: 197-206. 10.1023/A:1010691701578.
- Turner SR, Somerville CR: Collapsed xylem phenotype of Arabidopsis identifies mutants deficient in cellulose deposition in the secondary cell wall. Plant Cell. 1997, 9: 689-701. 10.1105/tpc.9.5.689.
- Taylor NG, Scheible W-R, Cutler S, Somerville CR, Turner SR: The irregular xylem3 locus of Arabidopsis encodes a cellulose synthase required for secondary cell wall synthesis. Plant Cell. 1999, 11: 769-779. 10.1105/tpc.11.5.769.
- Taylor NG, Laurie S, Turner SR: Multiple cellulose synthase catalytic subunits are required for cellulose synthesis in Arabidopsis. Plant Cell. 2000, 12: 2529-2539. 10.1105/tpc.12.12.2529.
- Taylor NG, Howells RM, Huttly AK, Vickers K, Turner SR: Interactions among three distinct CesA proteins essential for cellulose synthesis. Proc Natl Acad Sci USA. 2003, 100: 1450-1455. 10.1073/pnas.0337628100.
- Liepman AH, Wilkerson CG, Keegstra K: Expression of cellulose synthase-like (Csl) genes in insect cells reveals that CslA family members encode mannan synthases. Proc Natl Acad Sci USA. 2005, 102: 2221-2226. 10.1073/pnas.0409179102.
- Jung H-G, Engels FM: Alfalfa stem tissues: Cell-wall deposition, composition, and degradability. Crop Sci. 2002, 42: 524-534.
- Brouwer DJ, Osborn TC: A molecular marker linkage map of tetraploid alfalfa (Medicago sativa L.). Theor Appl Genet. 1999, 99: 1194-1200. 10.1007/s001220051324.
- Choi H-K, Mun J-H, Kim D-J, Zhu H, Baek J-M, Mudge J, Roe B, Ellis N, Doyle J, Kiss GB, Young ND, Cook DR: Estimating genome conservation between crop and model legume species. Proc Natl Acad Sci USA. 2004, 101: 15289-15294. 10.1073/pnas.0402251101.
- Choi H-K, Kim D, Uhm T, Limpens E, Lim H, Mun J-H, Kalo P, Penmetsa RV, Seres A, Kulikova O, Roe BA, Bisseling T, Kiss GB, Cook DR: A sequence-based genetic map of Medicago truncatula and comparison of marker colinearity with M. sativa. Genetics. 2004, 166: 1463-1502. 10.1534/genetics.166.3.1463.
- Grant D, Cregan P, Shoemaker RC: Genome organization in dicots: Genome duplication in Arabidopsis and synteny between soybean and Arabidopsis. Proc Natl Acad Sci USA. 2000, 97: 4168-4173. 10.1073/pnas.070430597.
- Zhan S, Horrocks J, Lukens LN: Islands of co-expressed neighboring genes in Arabidopsis thaliana suggest higher-order chromosome domains. Plant J. 2006, 45: 347-357. 10.1111/j.1365-313X.2005.02619.x.
- Lercher MJ, Blumenthal T, Hurst LD: Coexpression of neighboring genes in Caenorhabditis elegans is mostly due to operons and duplicate genes. Genome Res. 2003, 13: 238-243. 10.1101/gr.553803.
- Hurst LD, Williams EJB, Pál C: Natural selection promotes the conservation of linkage of co-expressed genes. Trends Genetics. 2002, 18 (12): 604-606. 10.1016/S0168-9525(02)02813-5.
- Oliver B, Parisi M, Clark D: Gene expression neighborhoods. J Biol. 2002, 1 (1): 4. 10.1186/1475-4924-1-4.
- Vogel JH, Von Heydebreck A, Purmann A, Sperling S: Chromosomal clustering of a human transcriptome reveals regulatory background. BMC Bioinformatics. 2005, 6: 230. 10.1186/1471-2105-6-230.
- Kosak ST, Scalzo D, Alworth SV, Li F, Palmer S, Enver T, Lee JSJ, Groudine M: Coordinate gene regulation during hematopoiesis is related to genomic organization. PLoS Biology. 2007, 5 (11): 2602-2613. 10.1371/journal.pbio.0050309.
- Michalak P: Coexpression, coregulation, and cofunctionality of neighboring genes in eukaryotic genomes. Genomics. 2008, 91 (3): 243-248. 10.1016/j.ygeno.2007.11.002.
- Williams EJB, Bowles DJ: Coexpression of neighboring genes in the genome of Arabidopsis thaliana. Genome Res. 2004, 14: 1060-1067. 10.1101/gr.2131104.
- Ren X-Y, Fiers MWEJ, Stiekema WJ, Nap J-P: Local coexpression domains of two to four genes in the genome of Arabidopsis. Plant Physiol. 2005, 138: 923-934. 10.1104/pp.104.055673.
- Ren X-Y, Stiekema WJ, Nap J-P: Local coexpression domains in the genome of rice show no microsynteny with Arabidopsis domains. Plant Mol Biol. 2007, 65: 205-217. 10.1007/s11103-007-9209-0.
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.