- Research article
- Open Access
A comparison of microarray and MPSS technology platforms for expression analysis of Arabidopsis
© Chen et al; licensee BioMed Central Ltd. 2007
- Received: 31 May 2007
- Accepted: 12 November 2007
- Published: 12 November 2007
Several high-throughput technologies can measure in parallel the abundance of many mRNA transcripts within a sample. These include the widely-used microarray as well as the more recently developed methods based on sequence tag abundances such as the Massively Parallel Signature Sequencing (MPSS) technology. A comparison of microarray and MPSS technologies can help to establish the metrics for data comparisons across these technology platforms and determine some of the factors affecting the measurement of mRNA abundances using different platforms.
We compared transcript abundance (gene expression) measurement data obtained using Affymetrix and Agilent microarrays with MPSS data. All three technologies were used to analyze the same set of mRNA samples; these samples were extracted from various wild type Arabidopsis thaliana tissues and floral mutants. We calculated correlations and used clustering methodology to compare the normalized expression data and expression ratios across samples and technologies. Abundance expression measurements were more similar between different samples measured by the same technology than between the same sample measured by different technologies. However, when expression ratios were employed, samples measured by different technologies were found to cluster together more frequently than with abundance expression levels.
Furthermore, the two microarray technologies were more consistent with each other than with MPSS. We also investigated probe-position effects on Affymetrix data and tag-position effects in MPSS. We found a similar impact on Affymetrix and MPSS measurements, which suggests that these effects were more likely a characteristic of the RNA sample rather than technology-specific biases.
Comparisons of transcript expression ratios showed greater consistency across platforms than measurements of transcript abundance. In addition, for measurements based on abundances, technology differences can mask the impact of biological differences between samples and tissues.
The availability of whole genome sequences and the development of high-throughput technologies have enabled global or genome-wide measurements of gene expression. The complete genome sequence of Arabidopsis thaliana  is available and this plant serves as an important model for gene expression research. A number of high-throughput technologies have been developed, such as the widely used microarray technologies [2, 3] and tag- or sequence-based technologies [4–7], that measure the abundance of many different mRNA transcripts within a biological sample. Several microarray platforms have been developed with probe sequences based on the sequenced Arabidopsis genome. Some of the most recent generation of commercially-produced Arabidopsis microarrays represent more than 21,000 genes: Affymetrix  and Agilent Technologies. . The Affymetrix microarray contains sets of 25-nucleotide probes per gene, while Agilent utilizes a single 60-nucleotide probe per gene on the microarray.
Hybridization signal intensities can be used to generate relative or abundance measurements that correspond to the amount of target mRNA that has hybridized to a specific probe, but relative measurements are determined by a comparison of two samples. Tag-based technologies measure the expression level of a gene by counting the abundance of a specific transcript in a sample. This count provides an abundance measure of each gene's expression level within the sample. Until recently, only two tag-based technologies have been widely used: SAGE (Serial Analysis of Gene Expression) [4, 5] and MPSS (Massively Parallel Signature Sequencing) technology [6, 7]. Both SAGE and MPSS produce short (10–22 nucleotide) sequence tags that are derived from a defined position in a mRNA molecule. A significant advantage of the MPSS technology relative to SAGE is the large number of distinct signatures (more than 1,000,000) that can be identified in a single analysis. Therefore, MPSS potentially provides a greater coverage of the transcriptome than SAGE.
Previous studies have compared different gene expression measurement technologies to determine how well they correlate with each other and/or to use one technology as a benchmark for another [9–24]. Some early studies, comparing different microarray platforms, showed that commercial microarrays were typically more consistent than non-commercial microarrays [11, 13, 18]. The study of Kuo et al. , which compared data from ten different microarray platforms for mouse, found that results were more similar between laboratories using the same platform than across platforms, that one-dye platforms were typically more consistent than two-dye platforms. Recently, some studies have compared microarray and MPSS technology, showing that there is moderate correlation between the two platforms but that one technology will often detect expression for some genes that are not measured by the other platform [22–24]. Oudes et al.  therefore suggested that using a combination of transcription profiling technologies would provide more complete coverage of gene expression measurements. Liu et al.  provided a comparison between many different microarrays and MPSS technology using mouse tissues but they did not consider as broad a range of biological samples for comparison as in our study.
In a preliminary study, Coughlan et al.  compared microarray and MPSS data from Arabidopsis thaliana. Here, we extend this preliminary work by carrying out a more detailed data analysis and by including data obtained with the Affymetrix platform. We compare transcript abundance measurements obtained with two types of microarrays (Affymetrix and Agilent technologies) and with MPSS. These data were generated from the same set of mRNA samples extracted from a variety of Arabidopsis thaliana tissues, mutants and treatments. We investigate the consistency of transcript abundance measurements across the three technology platforms using comparison of abundance expression levels and expression level ratios. We discuss factors which may affect the measured abundance and contribute to variation in measurements across different technology platforms. This study uses a broad range of samples in order to assess these different platforms.
Results and Discussion
Comparison of Gene Expression Data Obtained from MPSS and Microarrays
We believe that there should be some biologically meaningful relationships among different samples from different technology platforms. For example, samples from same tissues with different condition/treatment are expected to have higher correlation than samples from different tissues, and the same samples measured by different technology platforms would be expected to have higher correlation than others. To test our hypothesis, we make comparisons of abundance expression level and comparisons of expression level ratios.
Comparison of Abundance Expression Levels
Data used in comparison of microarrays and MPSS technology platforms.
Leaves SA 4 hr.
Leaves SA 52 hr.
Basic statistics of the NEU normalized expression data for 6,501 genes detectable by all three technologies.
Pearson's rank correlation of NEU normalized expression data for 1,648 genes in MPSS, Affymetrix and Agilent.
When comparing the same RNA samples across technologies, the highest correlations were observed for MPSS versus Affymetrix microarrays (0.66 to 0.76) and for Affymetrix microarrays versus Agilent microarrays (0.65 to 0.74) while the lowest correlations were exhibited for Agilent microarrays versus MPSS (0.53 to 0.56) (Table 3). The correlations among different tissues within the MPSS data were lower than the correlations among different tissues measured by Affymetrix and Agilent microarrays. The correlation values among MPSS expression data ranged from 0.41 to 0.64 while the same measurements using Affymetrix or Agilent microarrays ranged from 0.60 to 0.85 and 0.67 to 0.82, respectively. Additional analyses using different Affymetrix and Agilent technical replicates also showed that each pair of technical replicates were closely correlated together, as expected (see Figure 1 in additional file 1, e.g, AFRoA and AFRoB).
Comparison of Expression Level Ratios
Probe-position Effects in Affymetrix and Tag-position Effects in MPSS
The position of the microarray probe or MPSS signature within a given transcript may impact the measurement of transcript abundance due to the use of different polyadenylation sites or technical biases . Therefore, we compared the effects of the probe position on the Affymetrix microarray data and the effects of the tag position on the MPSS data. This type of analysis may be useful to distinguish differences in the measured transcript abundances that arise from biological causes in the mRNA sample from those that come from technological biases.
Based on the analysis of probe position in the Affymetrix microarrays and the tag position in MPSS samples, we found that both probe position and tag position, as measured from the 3' end, demonstrated a consistent trend. With the increase in probe position or tag position from the 3' end, the PM value and tag-abundance tended to increase then plateau. One reason for this tendency could be that there are many alternative transcripts for some genes which are not correctly annotated in the current Arabidopsis genome which could be alternative splice products or transcripts utilizing alternative polyadenylation sites. In fact, some probes and tags that we used for this analysis were not located within the most recently annotated genome [27, 28]. To check whether this tag- or probe-position bias is not merely a consequence of alternative splicing, we were able to obtain similar results when only single exon-containing genes were included (results not shown).
Our study indicated that the MPSS platform may be more variable in measuring RNA abundance than Affymetrix and Agilent platforms. Our analysis showed that the microarray and MPSS technologies did not correlate well on a quantitative basis for transcript abundance measurements, although within-technology clustering of samples was largely consistent. Expression ratio comparisons were more comparable and consistent across the platform technologies, and ratios for sample pairs involving different tissues were found to correlate quite well across technologies. Therefore, for the purpose of hierarchical clustering of global data sets, the ratios were a more biologically meaningful measure to use for cluster analysis than abundance measurements of gene expression. However, when the difference between the ratios was subtle, as in the case of ratios involving the same pair of tissues but with one derived from a different mutant or treatment, then technological differences often overwhelmed the underlying biological signal. Moreover, the two microarray technologies (Affymetrix and Agilent) were more consistent with each other than with MPSS.
The position of the probe or tag in the transcript has previously been shown to affect the measured transcript abundance . Our analysis demonstrated that there exists a probe-position effect on the abundance measured by Affymetrix, consistent with a similar tag-position effect observed in MPSS data. Specifically, Affymetrix probes and MPSS tags closer to the 3' end of the transcript showed a lower average abundance. The fact that this trend was observed using both technologies suggests that it may reflect an inherent characteristic of the RNA sample. The majority of probes and MPSS tags were within coding regions, and we confirmed that the trend also exists for genes that are annotated as containing only a single exon. This seems to exclude the possibility of alternative splicing as an explanation, unless a significant number of the current gene models are incorrect. One possibility is variation in the polyadenylation site within the 3' UTR. Other possible explanations may be bias in the RNA sample, a misannotation of the coding regions, or a coincidental technological bias shared by Affymetrix microarrays and MPSS.
Data Preparation and Pre-processing
The microarray data and MPSS data for the manuscript has been submitted to the GEO Omnibus . The Series record number is GSE8994 and details relating to sample and data preparation are available there.
Eleven different distinct RNA samples were used in this study (Table 1). The RNA samples were obtained from different Arabidopsis thaliana tissues (samples 1 to 5 included callus, inflorescence, leaves, roots and silique), homeotic floral mutants (samples 6 to 9 included agamous inflorescence, apetala1-10 inflorescence, apetala3-6 inflorescence and a superman/apetala1-10 inflorescence double mutant) and two leaf treatments (samples 10 and 11 include leaves sampled 4 and 52 hours after salicylic acid treatment, respectively) . Only one technical replication for each sample was available for the MPSS data. Two technical replicates for each sample were available for most of the Affymetrix and Agilent data (codes with last letter "A" or "B"). For the Agilent platform, some microarrays were self-self microarrays (e.g, AFCaA indicates both red and green channels are callus RNA samples) while others are were non-self microarrays (e.g, AGCaLeA indicates that red channel is callus and green channel is leaf RNA sample).
The Arabidopsis MPSS data has been published previously and is available on our website . In the present study, we utilized the "Signature" MPSS samples, rather than "Classic" MPSS samples, because there exists a significant tag-position bias in Classic MPSS samples [26, 31]. We also focused on our analyses on the 17-base signatures, rather than the 20-base signatures, because the shorter tags represent a less-biased data set due to a lower frequency of palindromic sequences or other "bad words"  that are poorly sequenced by MPSS technology. The raw abundance of each signature in each sample is normalized to a metric of TPM (Transcripts Per Million), which facilitates comparisons across samples and technologies. To obtain the expression level for each gene, we summed the abundance of signatures that match the sense-strand of the gene and are found only once in the Arabidopsis genome and transcriptome.
To generate data that was comparable across technologies, the same Arabidopsis RNA samples used for the MPSS analyses were hybridized to two different high-density Arabidopsis microarray platforms: the Affymetrix ATH1 GeneChip  (22,800 features, ~11 probes per gene and 25-nucleotides per probe) and Agilent 22 K Arabidopsis microarray  (21,500 features, 60 nucleotide probes and one probe per gene). MAS5.0 software  was used to obtain the expression level for each gene on the Affymetrix microarray; Agilent data was preprocessed by Agilent G2566AA Feature Extraction Software . Two technical replicates were obtained for almost all Affymetrix microarrays used to analyze each RNA sample from mutants/treatments/tissues (e.g, AFRoA and AFRoB) and for all Agilent microarrays (e.g, AGRoA and AGRoB which are self-self microarrays and AGRoLeA and AGRoLeB which are non-self microarrays). For the Affymetrix microarray data, for each gene the transcript abundance (expression) measurements from each pair of technical replicates was used to obtain the average (e.g., data was averaged for AFRoA and AFRoB to obtain AFRo). For self-self Agilent microarrays (both dyes associated with the same sample), because each microarray has two-dye channels, for each gene we first averaged the data from the red channel and green channels, then averaged these data from each pair of technical replicates to obtain the final average (e.g., data was averaged for AGRoA and AGRoB to obtain AGRo). Likewise, for Agilent microarrays which were non-self microarrays (each dye associated with a different sample), we averaged the pairs of technical replicates to get the average log ratio (e.g, data averaged for AGRoLeA and AGRoLeB to get AGRoLe).
We downloaded the most recent Arabidopsis genome release data (TIGR version 6, November 2005) from TAIR  and focused on the 26,751 protein-coding genes among the total of 31,407 annotated genes. Because each of these microarray platforms and the MPSS platform contained a subset of the total annotated Arabidopsis genes, we compared only the genes represented or detected by all three platforms. A total of 16,269 genes were potentially detectable by all three platforms (Figure 1). For the MPSS data, signatures which were duplicated (i.e., matching multiple genomic locations) were excluded from the analysis; genes matching these signatures were also excluded to avoid a bias that could result if only a subset of the signatures from each gene are used. For the comparison of MPSS and Affymetrix platforms, 6,501 genes were selected since these were detected as present in all eleven Affymetrix microarray data sets and had mean MPSS tag counts across all eleven samples greater than 4 TPM (i.e., the lower identification limit of MPSS signatures ). Moreover, for the comparison of the three platforms, 1,648 genes were selected with all MPSS tag counts greater than 4 TPM, detected as present in Affymetrix and detected as present in Agilent. The Agilent ratio data was produced in two different ways: the Agilent ratio data named without "Z" as the last letter was from a non-self microarray, with the log ratio derived from the red channel intensities divided by the green channel intensities; the Agilent ratio data with "Z" as the last letter was from self-self microarrays, with the log ratio obtained from two different microarrays. For example, AGRoLe data is from the root/leaf microarray, while AGRoLeZ data is from root/root and leaf/leaf.
where x represented the normalized expression level of gene calculated by each platform. For MPSS platform, NEU has the same meaning as Transcript Per Million (TPM).
Contour-plots were used to represent the density of data. Expression data were log2-transformed. When investigating the correlation relationship between different samples, hierarchical clustering method was used with both Pearson's correlation and Spearman's rank correlation as the distance metrics and using a complete linkage algorithm [36, 37]. The distance for clustering was calculated by 1 - r, where r represents correlation coefficient value. Correlation coefficient and total least-square values  were calculated for measuring the relationship among data from different platforms. A contour-plot matrix  was used to show correlation relationships. Because the relationship between two variables was often non-linear, we used a regression method called LOWESS (local weighted scatter-plot smoother)  to obtain smooth curves which showed the mean tendency of dependence pattern between two variables. When using LOWESS, we used a Gaussian kernel to calculate the weight contributing to each point from its neighboring points. We also used the bootstrap method  to evaluate the confidence intervals (5–95%) of the LOWESS method.
We thank Babatunde A. Ogunnaike (University of Delaware) for help with some of the statistics. We thank Affymetrix and Agilent for providing the arrays for this study.
- Initiative TA: Analysis of genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408: 796-815. 10.1038/35048692.View ArticleGoogle Scholar
- Schena M, Shalon D, Davis RW, Brown PO: Quantitative monitoring of gene expression patterns with a complemetary DNA microarray. Science. 1995, 270: 467-470. 10.1126/science.270.5235.467.PubMedView ArticleGoogle Scholar
- Lockhart DJ, Dong H, Byrne MC, Follettie MT, Gallo MV, Chee MS, Mittmann M, Wang C, Kobayashi M, Horton H, Brown EL: Expression monitoring by hybridization to high-density oligonucleotide arrays. Nature Biotechnology. 1996, 14: 1675-1680. 10.1038/nbt1296-1675.PubMedView ArticleGoogle Scholar
- Velculescu VE, Zhang L, Vogelstein B, Kinzler KW: Serial analysis of gene expression. Science. 1995, 270: 484-487. 10.1126/science.270.5235.484.PubMedView ArticleGoogle Scholar
- Velculescu VE, Zhang L, Zhou W, Vogelstein J, Basrai MA, Bassett DEJ, Hieter P, Vogelstein B, Kinzler KW: Characterization of the Yeast Transcriptome. Cell. 1997, 88: 243-251. 10.1016/S0092-8674(00)81845-0.PubMedView ArticleGoogle Scholar
- Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, Johnson D, Luo S, McCurdy S, Foy M, Ewan M, Roth R, George D, Eletr S, Albrecht G, Vermaas E, Williams SR, Moon K, Burcham T, Pallas M, DuBridge RB, Kirchner J, Fearon K, Mao J, Corcoran K: Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nature Biotechnology. 2000, 18: 597-598. 10.1038/76469.View ArticleGoogle Scholar
- Brenner S, Williams SR, Vermaas EH, Storck T, Moon K, McCollum C, Mao JI, Luo S, Kirchner JJ, Eletr S, DuBridge RB, Burcham T, Albrecht G: In vitro cloning of complex mixtures of DNA on microbeads: physical separation of differentially expressed cDNAs. PNAS. 2000, 97: 1665-1670. 10.1073/pnas.97.4.1665.PubMed CentralPubMedView ArticleGoogle Scholar
- Hughes TR, Mao M, Jones AR, Burchard J, Marton MJ, Shannon KW, Lefkowitz SM, Ziman M, Schelter JM, Meyer MR, Kobayashi S, Davis C, Dai H, He Y, Stephaniants SB, Cavet G, Walker WL, West A, Coffey E, Shoemaker DD, Stoughton R, Blanchard AP, Friend SH, Linsley PS: Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer. Nature Biotechnology. 2001, 19: 342-347. 10.1038/86730.PubMedView ArticleGoogle Scholar
- Baldwin D, Crane V, Rice D: A comparison of gel-based, nylon filter and microarray techniques to detect differential RNA expression in plants. Current Opinion in Plant Biology. 1999, 2: 96-103. 10.1016/S1369-5266(99)80020-X.PubMedView ArticleGoogle Scholar
- Ishii M, Hashimoto S, Tsutsumi S, Wada Y, Matsushima K, Kodama T, Aburatani H: Direct comparison of GeneChip and SAGE on the quantitative accuracy in transcript profiling analysis. Genomics. 2000, 68: 136-143. 10.1006/geno.2000.6284.PubMedView ArticleGoogle Scholar
- Kuo W, Jenssen T, Butte A, Ohno-Machado L, Kohane I: Analysis of matched mRNA measurements from two different microarray technologies. Bioinformatics. 2002, 18: 405-412. 10.1093/bioinformatics/18.3.405.PubMedView ArticleGoogle Scholar
- Lee M, Kuo F, Whitmore G, Sklar J: Importance of replication in microarray gene expression studies: statistical methods and evidence from repetitive cDNA hybridizations. PNAS. 2000, 97: 9834-9839. 10.1073/pnas.97.18.9834.PubMed CentralPubMedView ArticleGoogle Scholar
- Li J, Pankratz M, Johnson J: Differential Gene Expression Patterns Revealed by Oligonucleotide Versus Long cDNA Arrays. Toxicological Sciences. 2002, 69: 383-390. 10.1093/toxsci/69.2.383.PubMedView ArticleGoogle Scholar
- Quackenbush J: Microarray data normalization and transformation. Nature Genetics. 2002, 32: 496-501. 10.1038/ng1032.PubMedView ArticleGoogle Scholar
- Rhodes DR, Barrette TR, Rubin MA, Ghosh D, Chinnaiyan AM: Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. Cancer Res. 2002, 62 (15): 4427-4433.PubMedGoogle Scholar
- Saghizadeh M, Brown D, Tajbakhsh J, Chen Z, Kenney M, Farber D, Nelson S: Evaluation of techniques using amplified nucleic acid probes for gene expression profiling. Biomolecular Engineering. 2003, 20: 97-106. 10.1016/S1389-0344(03)00006-6.PubMedView ArticleGoogle Scholar
- Tan P, Downey T, Spitznagel E, Xu P, Fu D, Dimitrov D, Lempicki R, Raaka B, Cam M: Evaluation of gene expression measurements from commercial microarray platforms. Nucleic Acids Research. 2003, 31: 5676-5684. 10.1093/nar/gkg763.PubMed CentralPubMedView ArticleGoogle Scholar
- Kuo W, Liu F, Trimarchi J, Punzo C, Lombardi M, Sarang J, Whipple M, Maysuria M, Serikawa K, Lee S, McCrann D, Kang J, Shearstone J, Burke J, Park D, Wang X, Rector T, Ricciardi-Castagnoli P, Perrin S, Choi S, Bumgarner R, Kim J, Short Gr, Freeman M, Seed B, Jensen R, Church G, Hovig E, Cepko C, Park P, Ohno-Machado L, Jenssen T: A sequence-oriented comparison of gene expression measurements across different hybridization-based technologies. Nature Biotechnology. 2006, 24: 832-840. 10.1038/nbt1217.PubMedView ArticleGoogle Scholar
- Ibrahim A, Hedley P, Cardle L, Kruger W, Marshall D, Muehlbauer G, Waugh R: A comparative analysis of transcript abundance using SAGE and Affymetrix arrays. Functional & Integrative Genomics. 2005, 5 (3): 163-174. 10.1007/s10142-005-0135-4.View ArticleGoogle Scholar
- Kim HL: Comparison of oligonucleotide-microarray and serial analysis of gene expression (SAGE) in transcript profiling analysis of megakaryocytes derived from CD34+ cells. Exp Mol Med. 2003, 35 (5): 460-466.PubMedView ArticleGoogle Scholar
- van Ruissen F, Ruijter JM, Schaaf GJ, Asgharnegad L, Zwijnenburg DA, Kool M, Baas F: Evaluation of the similarity of gene expression data estimated with SAGE and Affymetrix GeneChips. BMC Genomics. 2005, 6 (): 91-10.1186/1471-2164-6-91.PubMed CentralPubMedView ArticleGoogle Scholar
- Oudes AJ, Roach JC, Walashek LS, Eichner LJ, True LD, Vessella RL, Liu AY: Application of Affymetrix array and Massively Parallel Signature Sequencing for identification of genes involved in prostate cancer progression. BMC Cancer. 2005, 5 (): 86-10.1186/1471-2407-5-86.PubMed CentralPubMedView ArticleGoogle Scholar
- Liu F, Jenssen TK, Trimarchi J, Punzo C, Cepko CL, Ohno-Machado L, Hovig E, Kuo W: Comparison of hybridization-based and sequencing-based gene expression technologies on biological replicates. BMC Genomics. 2007, 8 (): 153-10.1186/1471-2164-8-153.PubMed CentralPubMedView ArticleGoogle Scholar
- Coughlan SJ, Agrawal V, Meyers B: A comparison of global gene expression measurement technologies in Arabidopsis thaliana. Comparative and Functional Genomics. 2004, 5: 245-252. 10.1002/cfg.397.PubMed CentralPubMedView ArticleGoogle Scholar
- Schimek MG: Smoothing and Regression: Approaches, Computation, and Application. 2000, United States of America: A Wiley-InterscienceView ArticleGoogle Scholar
- Chen J, Rattray M: Analysis of tag-position bias in MPSS technology. BMC Genomics. 2006, 7 (): 77-10.1186/1471-2164-7-77.PubMed CentralPubMedView ArticleGoogle Scholar
- Brett D, Pospisil H, Valcárcel J, Reich J, Bork P: Alternative splicing and genome complexity. Nature Genetics. 2001, 30: 29-30. 10.1038/ng803.PubMedView ArticleGoogle Scholar
- Suzuki Y, Taira H, Tsunoda T, Mizushima-Sugano J, Sese J, Hata H, Ota T, Isogai T, Tanaka T, Morishita S, Okubo K, Sakaki Y, Nakamura Y, Suyama A, Sugano S: Diverse transcriptional initiation revealed by fine, large-scale mapping of mRNA start sites. EMBO Rep. 2001, 2 (5): 388-393.PubMed CentralPubMedView ArticleGoogle Scholar
- Gene Expression Omnibus (GEO). [http://www.ncbi.nlm.nih.gov/geo/]
- Arabidopsis MPSS Database from Meyer's Lab. [http://mpss.udel.edu/at/]
- Jongeneel CV, Delorenzi M, Iseli C, Zhou D, Haudenschild CD, Khrebtukova I, Kuznetsov D, Stevenson BJ, Strausberg RL, Simpson AJ, Vasicek TJ: An atlas of human gene expression from massively parallel signature sequencing (MPSS). Genome Research. 2005, 15: 1007-1014. 10.1101/gr.4041005.PubMed CentralPubMedView ArticleGoogle Scholar
- Meyers BC, Tej SS, Vu TH, Haudenschild CD, Agrawal V, Edberg SB, Ghazal H, Decola S: The Use of MPSS for Whole-Genome Transcriptional Analysis in Arabidopsis. Genome Research. 2004, 14: 1641-1653. 10.1101/gr.2275604.PubMed CentralPubMedView ArticleGoogle Scholar
- Affymetrix Website. [http://www.affymetrix.com/]
- Agilent Website. [http://www.chem.agilent.com/]
- TAIR Genome Release 6 (9/11/2005). [ftp://ftp.arabidopsis.org/home/tair/Genes/TAIR6_genome_release/]
- D'Andrade R: U-statistic hierarchical clustering. Psychometrika. 1978, 43: 59-67. 10.1007/BF02294089.View ArticleGoogle Scholar
- Johnson S: Hierarchical clustering schemes. Psychometrika. 1967, 32 (3): 241-254. 10.1007/BF02289588.PubMedView ArticleGoogle Scholar
- MathWorld. [http://mathworld.wolfram.com/]
- Cook RD: Regression Graphice: Ideas for Studying Regressions through Graphics. 1998, USA: A Wiley-Interscience PubilcationView ArticleGoogle Scholar
- Duda RO, Hart PE, Stork DG: Pattern Classification. 2001, New York: John Wiley & Sons Pubilcation, SecondGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.