BMC Genomics BioMed Central Methodology article Triple-target microarray experiments: a novel experimental

Background Several high throughput technologies have been employed to identify differentially regulated genes that may be molecular targets for drug discovery. Here we compared the sets of differentially regulated genes discovered using two experimental approaches: a subtracted suppressive hybridization (SSH) cDNA library methodology and Affymetrix GeneChip® technology. In this "case study" we explored the transcriptional pattern changes during the in vitro differentiation of human monocytes to myeloid dendritic cells (DC), and evaluated the potential for novel gene discovery using the SSH methodology. Results The same RNA samples isolated from peripheral blood monocyte precursors and immature DC (iDC) were used for GeneChip microarray probing and SSH cDNA library construction. 10,000 clones from each of the two-way SSH libraries (iDC-monocytes and monocytes-iDC) were picked for sequencing. About 2000 transcripts were identified for each library from 8000 successful sequences. Only 70% to 75% of these transcripts were represented on the U95 series GeneChip microarrays, implying that 25% to 30% of these transcripts might not have been identified in a study based only on GeneChip microarrays. In addition, about 10% of these transcripts appeared to be "novel", although these have not yet been closely examined. Among the transcripts that are also represented on the chips, about a third were concordantly discovered as differentially regulated between iDC and monocytes by GeneChip microarray transcript profiling. The remaining two thirds were either not inferred as differentially regulated from GeneChip microarray data, or were called differentially regulated but in the opposite direction. This underscores the importance both of generating reciprocal pairs of SSH libraries, and of real-time RT-PCR confirmation of the results. Conclusions This study suggests that SSH could be used as an alternative and complementary transcript profiling tool to GeneChip microarrays, especially in identifying novel genes and transcripts of low abundance.


Background
Microarray technology is a high-throughput and parallel platform that enables research on whole genomes, thereby helping to increase our understanding of the regulation of biological systems. All variations of this technique involve the deposition of a large number of probe sequences (e.g. oligonucleotides, cDNA) -representing a whole genome or subsets thereof -in a regular grid-like array on a physical substrate, usually a glass slide for custom spotted arrays. Microarray studies are costly in terms of equipment, consumables and time, therefore careful design and replication are particularly important if the resulting experiment is to be maximally informative. As opposed to high-density arrays like Affymetrix (probes produced in-situ in a process resembling lithography), standard experiments using spotted arrays on glass slides usually involve the co-hybridisation of two differently labelled targets to one slide. This is normally referred to as dual-target or dual-dye arrays. In such a cDNA microarray dual-target experiment, it is usually the fluorescent dyes Cy5 and Cy3 that are used in tandem. One of the dyes is used to label an experimental target sample, the other to label a reference or control sample. After measuring the fluorescence of each dye-label channel separately, the scan images are processed and this results in a numerical value of abundance (often termed expression) of this gene in the experimental sample and the reference sample. The relative abundance of each gene is usually presented as the log-ratio of these two values, and used as the measure of gene expression for an array. There are practical issues concerning this experimental approach, in that constraints on the number of arrays processed and/or the amount of RNA available can have a detrimental effect on the experimental design [1].
We have introduced a third dye-label (Alexa 594) in an attempt to improve on such practical limitations without sacrificing data quality. This novel experimental strategy was specifically developed to investigate, using cDNA microarrays, the changes in gene expression patterns during the normal development of spermatogenesis in wild type mouse, and in parallel, in a known fertility mutant (Dazl-null mouse) [2]. We envisage this approach to be useful in similar complex multi-factorial experiments, e.g., time-series data with comparisons between different genotypes, or cancer studies where comparisons are required both in a single patient (e.g. before and after therapy) and across a population of patients. In each case, the benefit lies in producing more than one on-chip measurement of relative expression. For example, using a common reference sample in combination with a test sample of type A and type B on one array allows a direct comparison of the relative gene expression levels in both test samples, without having to compare relative expression values from two or more dual-target arrays containing only a ref-erence and one of the test samples. With a suitable laboratory protocol, and including the necessary levels of replication, this considerably reduces the number of arrays required for a microarray study without having to reduce the complexity of the biology under study.
There have been previous publications [3,4] investigating the use of three dye-labels per microarray, but it must be stressed here that the third dye was used to label the probes rather than a third target sample. This is a different objective to the one described here, and, due to the manipulation of the actual array platform, has larger requirements in terms of repeat array hybridisations. To avoid confusion between these different concepts and aims, we refer to the approach described in this paper as "triple-target" rather than "triple-dye".
In this paper, we report on the methodology and evaluation of using three dye-labelled target samples per array, specifically Cy3, Cy5 and Alexa 594.

Results and Discussion
In order to assess both the use of Alexa 594 in combination with the other two dyes and the use of Alexa 594 by itself or in combination with only one of the other dyes, three separate analyses were performed. The first is based on the analysis of arrays which have been co-hybridised with three differently labelled target samples, with n = 5 arrays. The second analysis is based on arrays which have been co-hybridised with only two of the available three dye-labels, with n = 2 arrays for each of the three possible dye-label combinations. The third analysis involves arrays that have been hybridised with only one dye-labelled target sample, with n = 3 arrays for Alexa594, n = 2 arrays for Cy5 and n = 2 arrays for Cy3. The only difference imposed on these three analyses was the use of a different print-run of slides for the single-target analysis. There was one consistent pool of target sample material for all hybridisations and labellings. QuantArray software was used to convert the images to numerical data, and all interpretation of data is done on these numerical data. The parameter values for the QuantArray algorithm were chosen based on manufacturer recommendations and kept consistent for all arrays and dye-label samples. Any choice of algorithm and parameter values results in some level of observed discrepancy between image and numerical presentation of the image (e.g. background estimates). Where such differences are apparent, they have been pointed out in the evaluation.

Triple-target self hybridisations
For dual-target experiments, any setup involving one sample dye-labelled with both dyes and hybridised to one array is referred to as self-to-self hybridisation, and in building on this terminology the experimental setup for this evaluation experiment will be referred to as triple-self hybridisations. The theoretical outcome of any self-to-self experiment is clear: absolute expression values for genes should be identical across array dye-label channels. Similarly, relative expression value for every gene should be 1 (or 0 on a log scale), independent of the dye-label combination in question. Evaluation of the validity of adding a third dye-labelled sample therefore consists of testing to which degree this theoretical outcome is true, and if it is different to outcomes from standard dual-target arrays. In practice, the ideal theoretical outcome is normally not achieved in experimental data, due to factors like differences in RNA extraction, dye-label incorporation, hybridisation quality, slide quality and scanning parameters etc., all of which increase variance in the data [5]. To minimise dye-label incorporation differences, aminoallyl labelling has been used, which results in higher labelling efficiency and improved incorporation of the different dyes compared to the direct labelling method [6][7][8]. Normalisation algorithms have been devised to further limit the effect of such sources of variation, and as a result, any self-to-self hybridisation should then approximate the ideal outcome. In the following section this statement is evaluated.
A first investigation of the expression values obtained from the three individual samples on an array (Fig. 1, Table 1) shows that, across all probes on an array, Cy3 and Alexa594 share similar average expression and spread of values before applying any normalisation methods. Cy5 labelled samples appear to have a greater spread of data values, with differences apparent in the lower signal intensities. This may be caused by dye-label incorporation differences, which are known to occur in most common Cy5/Cy3 dual-target experiments. These differences are not evident for Cy3/Alexa594 combinations here. Before normalisation, log-ratios for pair-wise sample comparisons on the arrays are therefore showing slightly greater variance for those combinations that involve Cy5 (Fig.  2a), in addition to global differences that systematically move the ratios away from zero 0. Subsequent location and scale normalisation reduces these systematic differences and results in very comparable data distributions, all gene probes on an array contained in the Inter-Quartile-Range having log-ratios within the interval [+0.25; -0.25] (Fig. 2b).
The sample-to-sample differences for all three samples on an array were also assessed by visualising their standard deviation in relation to the average level of expression, both calculated across all gene-probes in a dye-label channel (Fig. 3). Due to the limitation of only having three dye-label observations per gene per array, the calculation of the relative standard deviation is not a statistically relevant procedure, and influenced by outlier values. Here it only serves as a global quality indicator. Nonetheless, this exercise clearly emphasises that most variance in the expression levels of a gene on an array is occurring at lower levels of expression. The majority of medium and high expressed genes have a relative standard deviation <0.05 even before applying normalisation methods.
A common visualisation method for dual-target arrays is the MA plot (in a similar version often referred to as R-I plot), which combines information about the log-ratio of a gene probe with its expression level, and this is also used as the basis for a LoWeSS normalisation. A standard dualtarget array will obviously have only one such plot, whereas a triple-target array increases the number of possible sample-combinations on the array to 3. In a real biological experiment, the interest may be focused on only two of these, but for the purpose of this evaluation we examine all possible relationships (Fig. 4). All five arrays show reproducible patterns of dependence or nondependence of the log-ratio on the log-intensity. The dyeincorporation differences discussed above are clearly visible for the standard Cy5/Cy3 combination as well as for the Cy5/Alexa594 combination. The relationship between Cy3 and Alexa594 is equally reproducible, but, in contrast to the other two pairings, there is no evidence of pronounced differences between the two labels at low expression levels. After removing systematic linear and nonlinear differences between any two dye-labelled samples, the resulting log-ratios for all dye-label combinations are centred around 0 (Fig. 5). Since the LoWeSS normalisation needs to apply a greater change to log-ratios involving Cy5, at least at lower levels of signal intensity, it could be surmised that the corrected log-ratios from the Cy3/ Alexa combination will be associated with less introduced bias.

Dual-target self hybridisations
Whereas triple-target self hybridisations are ideal for testing the validity of adding a third dye-labelled sample on an array, to further evaluate Alexa 594, we performed dual-target self hybridisations. Such an experiment allows a) to examine the effect of the new dye-label Alexa594 without the presence of a third dye-labelled sample (which may have unknown effects), and b) to examine a standard dual-target experiment in the same system as is used for the triple-target hybridisations. The third scan for each array was still performed although no material had been hybridised for this dye. In theory, this blank channel should produce no signal values, and deviations from this show potential "signal bleed" from one dye-label (or rather, the laser frequency it is scanned at) to another. Limitations of target sample material and printed arrays did not allow for the inclusion of more than two arrays per dual-target combination, which limits the statistical interpretability of these results. However, for the purposes of this study, they serve as supporting evidence and s x / should identify gross problems with the inclusions of a third dye.
All arrays confirm the results obtained from analysis I: Cy5 is associated with more gene probes values in the low expression range (Fig. 6). The dependence of log-ratio on log-intensity of a gene probe is also consistent with the triple-target arrays, in that the use of Cy5 with either of the other two dyes leads to non-linear effects (Fig. 7). A positive aspect of Cy5 are the very small signal values close to Triple-target: data before normalisation Figure 1 Triple-target: data before normalisation Each panel shows the data distribution for 5 individual samples (one per array) for a particular dye-label. Array names are numbered 1-5. Samples labelled with Cy5 have a wider distribution and generally lower centre than the other two channels, with high consistency across the 5 replicate arrays. Triple-target: Log-ratio data Figure 2 Triple-target: Log-ratio data (A) Each panel shows the data distribution for 5 log-ratios of a particular dye-label combination within an array before normalisation. Triple-self hybridisation log-ratios for all genes and dye-label combinations should theoretically be centred on 0. Log-ratios involving the Cy5 labelled sample clearly show a larger log-ratio error and global shift away from 0 than the pairing of Cy3/Alexa594. This is consistent for all 5 replicate arrays. (B) Log-ratio distributions after removal of systematic global effects (dye-label incorporation, hybridisation quality, scan settings) by means of LoWeSS normalisation followed by a median absolute deviation adjustment of scale. Interpreted globally (across all genes on an array), log-ratio distributions are now directly comparable across all arrays and dye-label combinations. B zero when it is the blank channel, which cannot be said for Alexa594 and Cy3, both of which result in low level signal values even if no sample has been hybridised to the array with the corresponding dye-label. This is probably an indication of the relative closeness of the dye-labels in the light frequency spectrum, leading for example to fluorescence of the Cy3 channel when the array is subjected with the laser frequency corresponding to Alexa594. However, the level of signal obtained from these blank channels is small in proportion to the hybridised channels, with linear slopes between 0.02 and 0.03 and intercept signal values between 52 and 115. Given the assumption that this effect is also present in the triple-target hybridisation, it does not present itself as a large or non-systematic problem. It does not cause signal interpretation problems that are greater than those created by using two dyes with a non-linear relationship (i.e. Cy5 vs. Cy3) at comparable low levels of expression.

Single-target hybridisations
The hybridisation of only one dye-labelled sample to an array allows a closer investigation of signal "bleeding" from one dye-labelled sample to another, as described above. For this analysis, spike controls were included for all three dye-labels to allow calibration between channels, and their relation to the other gene probes on the array is shown in figure 8. Visual inspection of the images obtained from this experiment revealed that there is some visible signal bleeding between Alexa594 and Cy3, as well as between Alexa594 and Cy5. However, the results of this Triple-target: data consistency between dyes Figure 3 Triple-target: data consistency between dyes Relative standard deviation (or CV) of 3 values (one per dye-label) for each gene on an array plotted against the mean, showing intensity dependent variation of signal between the three dye-labels. Reproducibility of gene values across dye-labels begins to brake down at a low signal intensity of about 500. However, the last panel provides information about the distribution of CV values for each array, and it is evident that the majority of all genes on an array fall below 0.05 even before applying any normalisation methods. where Cy5 also has a similar level of bleed into the other channels (see below). This is most likely due to the image conversion algorithm interpreting pixels and features differently from the human eye. Although the number of replicates is not sufficient to draw statistical conclusions, the results are supporting those of the dual-target hybridisations. If Alexa594 is the hybridised sample, there is little to no signal present if scanning the blank Cy5 channel, whereas the scan for the blank Cy3 channel is resulting in consistent, but low levels of signal values (Fig. 9). The same is true where Cy3 is the hybridised sample: the blank Cy5 channel exhibits little to no signal, Alexa594 results in a low level of signal values. However, the current standard situation of dual-target hybridisations involving the combination of Cy5 and Cy3 appears to be subject to the same problem, as indicated by the single-target hybridisation with Cy5. The scans for the two blank channels Alexa594 and Cy3 have a low level of signal of the same relative proportion as the blank channels in the other single-target and dual-target hybridisations. This would seem to show that it is not just the relative closeness of Cy3 and Triple-target: MA plots before normalisation Figure 4 Triple-target: MA plots before normalisation MA plots are a commonly used visualisation tool to show global trends in dual-target hybridisation data. Instead of only one possible combination per array, we have included the other two dye-label pairs. The X axis represents the intensity level, the Y axis the corresponding expression ratio for a gene between the two selected channels. A clustering of data points around the horizontal line is the theoretical result of any self-to-self hybridisation. A local scatterplot smoother function (LoWeSS) is fitted to the data to show global trends and to provide an intensity-dependent correction function to remove any bends in the global trend and adjust it so that the global trend approximates the theoretical ideal. A bend in the global trend indicates that the log2-ratio differs with intensity level. This graph clearly shows that log 2 -ratios involving Cy5 have non-linear intensity-dependent trends which require normalisation, whereas the combination Cy3/Alexa594 is already closer to ideal prior to normalisation. Alexa594 in the light emission spectrum which results in signal bleeding to the other channel. The blank Cy5 channel in a Cy3-labelled single-target hybridisation remains unaffected by signal bleeding, whereas in the reverse situation the blank Cy3 channels is affected and shows some signal. In all of the above cases of signal bleed into other channels, this occurs at the intensity level around the detection threshold.

Evaluation of combined self hybridisation results
In order to provide quantitative indicators in addition to the graphical evaluation, tables 2 and 3 present comparative estimates of centrality and spread for log-ratios obtained from triple-self and self-to-self arrays after normalisation. There is good agreement in those estimates between the triple-self and self-self arrays for each of the three combinations of dye-label log-ratios. The theoretical centre of the log-ratio distributions is 0, and this value is approximately met by the majority of gene probes on the array, with only small differences between the different dye-label combinations. The combination of Cy5/Cy3 is marginally worse in its approximation of the theoretical ideal, and this is likely to be the result of the LoWeSS normalisation having to make greater adjustments to low expressed genes.
In light of the triple-, dual-and single-target hybridisation experiments carried out, it is clear that the addition of Alexa 594 does not introduce negative effects that are not already present or smaller than in the commonly used Triple-target: MA plots after normalisation Figure 5 Triple-target: MA plots after normalisation Log-ratios for triple-target hybridisations after performing LoWeSS and MAD scale normalisation. All log 2 -ratios are now centred closely around 0, with a small amount of variance remaining at lower expression levels. Only individual outliers remain, and these are usually irrelevant if an experiment is replicated to a sufficient degree. 10   combination of Cy5 and Cy3. The triple-self hybridisations have shown that, for low levels of signal, there is less inherent difference between Cy3 and Alexa 594 than there is between Cy5 and Cy3. The problem of differential dyelabel incorporation in traditional dual-target Cy5/Cy3 hybridisations seems largely due to Cy5, at least where low-level signal values are concerned. Co-hybridisation of Cy3/Alexa594 does not present this problem and in theory does not require a non-linear approach to normalisation (although there is no harm in using it). A nonlinear normalisation like LoWeSS will make more of a numerical adjustment at this level of expression for the Cy5/Cy3 combination. However, following a successful normalisation, all three dye-label combinations have comparable log-ratio data, with only minor differences between them (tables 2 and 3). In cases where low levels of signal are relevant to the question under study, these different levels of noise/error in the low expression regions must be accounted for in the usual way by biological replication of the experiment.
With the dual-and single-target hybridisations, we investigated the issue of inherent signal bleeding between channels. The outcome of these has shown that in any hybridisations with Cy3 or Alexa 594, Cy5 will not suffer from this problem. On the other hand, both Cy3 and Alexa 594 will present this problem in any hybridisation involving Cy5. Although to our knowledge this has not Dual-target: data before normalisation Figure 6 Dual-target: data before normalisation For all dual-target hybridisations, these box-plots show the signal distributions across all genes for each individual dye-label sample. The non-hybridised "blank" dye-label channel has also been scanned and processed, in order to assess the effect of signal bleeding from one dye-label to another. The only truly blank scan is Cy5 for the Cy3/Alexa594 co-hybridisations. Co-hybridisations of the standard Cy5/Cy3 and new Cy5/Alexa594 dye-labels leads to small, but detectable levels of signal in the blank scan. Regarding spread of data, the dual-target hybridisation are identical to the results of the triple-target hybridisations, with a wider distribution and lower average of signal for Cy5. Cy5 Cy3 been investigated before, it may therefore be present in most 'standard' experiments with Cy5/Cy3 dual-target hybridisations. Our data do not show that this is either a serious or a non-systematic problem. It usually occurs below or at the safe detection level of an array (as identified by negative control probes, data not shown), and contributes only a small amount of signal to one channel which is corrected together with other systematic dyelabel and hybridisation differences during the normalisation procedure.

Experimental design issues
The ability to directly compare two target samples to a third condition (be it treatment or reference) on one array is of potentially great use for experiments that include multiple factors. However, such an approach also requires good planning of logistics and analysis. It is important to consider which dye is used to label a particular biological target, the logistics of using dye-labelling kits, and the allocation of dye-labelled samples to particular arrays [9,10]. Simple reference designs can be extended to include a third-dye label in a straightforward manner. However, loop-designs will require a larger extent of planning to identify all necessary target sample combinations, with an inherent property of being inflexible in terms of adding more conditions/arrays or removing individual arrays due to hybridisation failure.
Dual-target: MA plots before/after normalisation  Dye-swap arrays also require consideration, since it is possible to only perform a dye-swap on two of the samples, or on all three. For our study of mouse spermatogenesis [2], Cy3 and Cy5 dye-swaps were only performed on the target samples of interest (wild type and Dazl-null mouse, from different developmental time-points), while the reference sample was always labelled with Alexa 594. This partial swap design was chosen to reduce the number of slides, cutting costs and time. To increase the reliability of the data, an extra level of replication was added by repeating this process with a second independent pool of mice.
Although it could be argued that residual colour bias is introduced because the reference sample was not dyeswapped, in our experimental design this was not critical. The main focus of the experiment is on the relative changes in sample expression across the six developmental time-points and not in comparison to the reference.

Data processing and analysis issues
With respect to data normalisation and analysis, current software packages or analysis modules are geared towards the analysis of single-or dual-dye experiments, providing Single-target: spike controls in single-target arrays Figure 8 Single-target: spike controls in single-target arrays Each panel shows the relation of spiked control probes (present for each dye-label) to other gene-probes on an array. Markers connected by lines indicate mean intensity level of genes on an array, corresponding markers not connected by lines indicate the mean of only the spiked control probes on the array. Spikes should be present at similar levels in all channels, although only one channel has been hybridised. As expected, spikes remain constant across all dye-labelled samples and blank channels. Normalisation constants based on the spike controls subset are small for all dye-label channels, and the variable hybridisation quality of the Cy3 arrays exceeds the adjustments that would be made by applying a subset normalisation. The consistency between the array replicates is still of sufficient degree to conclude that all single-target hybridisations result in small but measurable levels of signal for at least one of the two supposedly blank dye-label channels. Single-target: data before normalisation Figure 9 Single-target: data before normalisation For all single-target hybridisations, these boxplots show the signal distributions across all genes for each individual dye-label sample, before normalisation. Only one sample per array has been hybridised, but the other two scans on the "blank" channels were still performed at their respective wave-lengths. Similar to the dual-target hybridisations, the data suggest that dye-labelling with Alexa594 will result in a small but noticeable signal in the Cy3 signal wavelength, Cy3 will result in a similarly small signal in the Alexa594 wavelength. However, dye-labelling with Cy5 also leads to quantifiable levels of signal in the other two channels. This is not visible in the array images themselves, but quantification of false-colour images by eye will inherently be different from algorithm-based quantification. For each gene on the array, the median and median absolute deviation (MAD) of a particular log-ratio (e.g. Cy5/Cy3) across the 5 arrays * 2 spot replicates per array (effective n = 10) was obtained. The median value of all genes' median log-ratios and all genes' MAD is the basis of the estimate in this table. The effective n for the self-self hybridisations was 2 arrays per dye-label combination * 2 spot replicates per array = 4 measurements.  little facility to deal effectively with three samples per array. Although either working around these problems or customising analyses could be time-consuming, in the practical application of the triple-target method we found that data processing works well if the triple-target results are split into two files; file one contains the signal and background measurements of the wild type and the reference sample, and file two contains the signal and background measurements of the Dazl knock-out and the same reference sample.
For the purposes of this paper, we have used existing methods for normalisation and visualisation. If triple-target approaches are to be used on a regular basis, there is scope for developing statistical models that include all three dye-labels rather than multiple pair-wise combinations. This also applies to visualisation techniques. Naturally, assumptions applying to dual-hybridisation experiments still apply to our approach. The majority of genes in all samples need to be biologically unaffected by an experimental condition in order to allow global normalisation methods to be applied. Where this assumption is not met, control probes on the array are a necessity.

Discussion
In summary, there has been no evidence that the inclusion of Alexa594 as a third dye-label causes additional noise or unexpected results in the data. We used a theoretically well-controlled system of replicated triple-self hybridisations to evaluate any effect this addition may have on the expression of gene probes on an array. In conjunction with Cy5 and Cy3, this dye has shown similar levels of inherent quality of labelling and subsequent data acquisition. Standard normalisation methods work as well as they do for single-or dual-dye experiments, and the resulting data can be expected to allow identification of expression differences in a biological experiment, given sufficient levels of biological replication (as is necessary for most microarray experiments). We consider this novel triple-target hybridisation strategy to be useful for the analysis of complex multifactorial experiments. As such, it provides an additional option to the current choice of array experimental designs. On a solely technical basis there is little reason not to include a third dye-label (although we have to limit this conclusion to Alexa594) for two-factorial experiments, the only prerequisite being a working laboratory protocol for using three dyes and good planning of logistics. Triple-target hybridisations can be performed within the same technological framework as current conventional approaches and, since they perform more than one absolute or relative gene expression assay per array, constitute a possible solution to practical constraints of finance, logistics or availability of biological sample material in the design of an experiment.

Conclusions
The use of triple-target microarray experiments is a valid addition to the experiment design toolbox. Although the method adds complexity to the experiment planning stage and the later data handling, this is offset by the benefits for studies where there are multiple experimental factors to consider, for example a combined time-series and treatment study. We limit this conclusion to the dye-labels and combination of biological system (adult mouse testes) and platform (custom spotted mouse array) used for this proof of concept, although in theory the same approach is usable in other experiments. We have developed in-house standards for the specifics of using three dye-labels for microarrays, but this may have to be adapted by other researchers working on different systems. Given further and more widespread use of the triple-target approach, it may prove a valuable tool that can be standardised for multiple biological systems and dye-labels.

Microarray preparation
An in-house created, subtracted and normalised adult mouse testis cDNA library consisting of 5,225 clones with an average insert size of 500 base pairs, plus 118 negative control (buffer spots) and 32 positive control spots, was deposited in duplicate (resulting in 10,750 individual features) onto glass slides. All were produced to the same standards and coated with poly-L-lysine according to a protocol available online at http://cmgm.stanford.edu/ pbrown/protocols/1_slides.html. A detailed description of the probe cDNA library generation and characterisation, along with the microarray construction, is provided elsewhere [2]. In total, 18 arrays were used in this study. Rather than summarising log-ratios on a per-gene basis first, log-ratios for each dye-label combination were pooled across all genes and arrays, median and MAD were then used to provide estimates for centrality and spread.
A ij and B ij denote the jth spot value on the ith array for two dye-label channels. The LoWeSS function estimating the dependence of log-ratio y on log-intensity x for a given array is denoted by y (ij) . y ij = log 2 (A ij / B ij ) x ij = log 2 (A ij * B ij ) NLR ij = y ij -y(x i ) For the triple-target self-hybridisation arrays this method was used for three dye-label combinations per array: Cy5/ Cy3, C5/Alexa594 and Cy3/Alexa594.

Log-ratio scale normalisation
Median Average Deviation (MAD) NLR ij denotes the already location normalised log-ratios for the jth spot on the ith array. SLR ij denotes the scale normalised log-ratios.

Data analysis
Three separate analyses were performed, one for the tripletarget self-hybridisations, one for the dual-target selfhybridisations and one for single-target hybridisations. They were performed on dye-label channels, i.e. data for each differently labelled sample on an array, and on ratio data, i.e. the relative values between two such dye-label channels. Analyses were carried out on log 2 -transformed intensities and ratios. In addition to graphical output for evaluation of data distributions and global effects of variation, pooled and by-gene medians and median average distances (MAD) were computed for all log-ratios of interest across the replicate arrays, i.e. Cy5 vs. Alexa594, Cy3 vs. Alexa594 and Cy5 vs. Cy3.

Data availability
A complete MIAME-compliant catalogue of this data, including a complete listing of annotated gene content and clone sequences for this microarray, together with all raw and normalized expression measurement files, will be made available at the MRC Human Genetics Unit web site http://www.hgu.mrc.ac.uk/Research/Cooke/germ line.html. Experimental data will also be made available on the EBI ArrayExpress database http://www.ebi.ac.uk/ arrayexpress.