Skip to main content

Identification of rat lung – prominent genes by a parallel DNA microarray hybridization



The comparison of organ transcriptomes is an important strategy for understanding gene functions. In the present study, we attempted to identify lung-prominent genes by comparing the normal transcriptomes of rat lung, heart, kidney, liver, spleen, and brain. To increase the efficiency and reproducibility, we first developed a novel parallel hybridization system, in which 6 samples could be hybridized onto a single slide at the same time.


We identified the genes prominently expressed in the lung (147) or co-expressed in lung-heart (23), lung-liver (37), lung-spleen (203), and lung-kidney (98). The known functions of the lung-prominent genes mainly fell into 5 categories: ligand binding, signal transducer, cell communication, development, and metabolism. Real-time PCR confirmed 13 lung-prominent genes, including 5 genes that have not been investigated in the lung, vitamin D-dependent calcium binding protein (Calb3), mitogen activated protein kinase 13 (Mapk13), solute carrier family 29 transporters, member 1 (Slc29a1), corticotropin releasing hormone receptor (Crhr1), and lipocalin 2 (Lcn2).


The lung-prominent genes identified in this study may provide an important clue for further investigation of pulmonary functions.


With the completion of genome projects of human and other model species, functional studies on a genomic scale are coming to a frontier. The investigation of transcriptome reveals gene expression of organs and cells from normal and diseased animals and humans. By comparing transcriptomes of multiple organs, physiological functions in different organs can be further explored. Identifying the genes expressed prominently in the lung may reveal its unique physiological functions in the respiratory system.

The expression of some individual genes in the lung and other organs may be found in literature and public databases. In literature, newly discovered genes have been tested in various organs at the mRNA level with Northern blotting and RT-PCR and at the protein level with Western blotting. In public databases, gene expression are compiled from literature, cDNA library (e.g. UniGene) and high throughput tools such as serial analysis gene expression (SAGE) and DNA microarray (e.g., GEO) [1]. Several studies using DNA microarray have been reported for profiling differential gene expression among normal human and mouse organs, but very little information is available for the rat [26].

Dual color hybridizations are commonly used for differential expression of thousands of genes between two samples [7]. For three or more samples, a reference or loop design has to be employed to adapt dual color hybridization [8, 9]. In the reference design, several samples are hybridized onto different slides separately with a common reference, which is prepared by pooling all the samples or using genomic DNA [10]. In the loop design, samples are paired in a loop pattern for hybridization and each sample is hybridized twice. However, the efficiency and reproducibility of both designs are poor for the identification of organ-prominent genes. Only two samples are hybridized on one slide, and the hybridization on different slides is known to have high variations due to slide printing and hybridization conditions [7]. For instance, there are 15 pair-wise combinations among 6 distinct organs. Consequently, 15 co-hybridizations between samples are required for a single replication and 60 slides for an experiment with 4 biological replications.

To eliminate these problems, we developed a parallel hybridization system in which 6 samples can be hybridized onto one single slide. This technique simplifies the investigation of multiple samples, reduces experimental errors and improves experimental efficiency. Using this system, we investigated lung-prominent genes by comparing gene expression profiles among rat lung, heart, kidney, liver, spleen, and brain. The lung-prominent genes identified in the present study may provide a clue for further exploration of pulmonary functions.


Reproducibility and efficiency of parallel hybridization

Our parallel hybridization system consists of three identical blocks: A, B, and C, on a single slide (Table 1). Each block contains ~10,000 50-mer oligonucleotides (6,221 known rat genes, 3,594 rat ESTs, and 169 Arabidopsis negative controls). Six labeled cDNA samples (3 Cy3 and 3 Alexa 647) were combined into 3 green-red pairs and hybridized onto each block of one slide. During the hybridization step, the blocks were separated by thermostatic tapes. The latter was removed during the washing and scanning steps. To examine whether there was cross-contamination among blocks, blocks A and C on the same slide were hybridized simultaneously for 3 days with Alexa 647-labeled lung cDNA. No signals were detected in block B (data not shown), indicating no cross contaminations among blocks.

Table 1 Slide layout and hybridization design

Self-self hybridizations were performed on three slides to assess the reproducibility of hybridizations using Cy3- and Alexa 647-labeled lung cDNA samples. We observed the highest correlation coefficient between two samples co-hybridized in one block (within-block group, Fig. 1A), and the lowest one between two samples hybridized in two different blocks on two separate slides (among-slide group, Fig. 1C). The within-slide group (two samples in two distinct blocks on one slide, Fig. 1B) possessed a significantly higher reproducibility than the among-slide group, but lower than the within-block group (Fig. 1D, p < 0.01). The lower reproducibility of the among-slide group may be due to the experimental variations among slides, such as hybridization temperature fluctuation, washing, and scanning. These conditions were identical for the within-block and within-slide groups, in which samples were hybridized in a single slide.

Figure 1

Reproducibility of hybridizations. (A-C):typical scatter plots of self-self hybridization of lung cDNAs between two channels within a block (within-block, panel A), two different blocks in one slide (within-slide, panel B), and among slides (among-slide, panel C), respectively. The cDNAs from an identical lung tissue were labeled with Cy3 or Alexa 647, and hybridized to each block of the slides. The numbers on x- and y-axis were background-subtracted fluorescence intensities of each spot with log2 transformation. (D) A comparison of correlation coefficients from replicated hybridizations. The results were expressed as means ± SE. *P < 0.01 v.s. among-slide; #P < 0.01 v.s. within-slide. (E) Comparison of accumulated errors between within-slide and among-slide groups. For the within-slide group, the log ratios were from parallel hybridization on a single slide. For the among-slides, the log ratios were from different slides. The accumulated errors were calculated as described in Materials and Methods.

Next, we investigated the relative gene expression levels in 6 rat organs: lung, heart, kidney, brain, spleen, and liver. The hybridization of each organ was repeated 20 times: 4 biological replications (rats), each with 5 technical replications (slides). Six samples from each of four rats were split into 5 aliquots for hybridization on 5 slides. The labeling dyes, the sample pairing, and the hybridization blocks on a slide were randomly assigned for each biological replication. This minimized the variations among biological and technical replications, including animals, fluorescence dyes, sample combinations, blocks on a slide, slides, and experimental conditions (Table 1). Statistically, each slide was a random block containing 6 samples. There were 60 sample-sample hybridizations performed on 20 slides (60 Alexa 647-cDNA and 60 Cy3-cDNAs) in this experiment. To achieve similar statistical results, a traditional reference design requires 120 slides for co-hybridizations of sample and reference. Alternatively, in a loop design, 60 slides are required for co-hybridization of sample-sample.

The difference of fluorescence intensity between the parallel hybridization and traditional dual-color hybridization was evaluated. We first compared the difference of log ratios between the traditional and parallel hybridization systems by SAM [11]. The samples of lung and heart were used as an example. The log ratios of fluorescence intensity between lung and heart were normalized with the print-tip based LOWESS [7]. The traditional log ratios were from 4 slides, in which lung and heart were paired and co-hybridized onto the same block of each slide. The parallel log ratios were from 4 other slides, in which lung and heart were hybridized onto two different blocks of each slide. The 2-class SAM test identified no genes that showed a significant difference between the traditional co-hybridization group and the parallel hybridization group (false discovery ratio<0.047, q-value>0.05). Other organ pairs showed similar results. These results demonstrated that the log ratios of two samples from two different blocks in the parallel hybridization were not significantly different from that of the traditional two sample co-hybridization. Consequently, any two of the six samples hybridized onto one slide in the parallel hybridization can be directly compared as if these samples were pair-wise combined and co-hybridized onto one traditional slide.

We also tested the accumulated error of the log2 ratios among 6 organs. In a traditional loop design, the sum of log ratios along the loop should be zero, but frequently fluctuating. Therefore, the square sum of log ratios can be adapted to assess the accumulated error of each gene or the data fluctuation in one experiment. We selected one block from each of the six different slides and simulated the traditional loop design. The 6 blocks formed a loop as if they were 6 traditional co-hybridization slides. In another group, a loop was formed from a single parallel hybridization slide. The slides for both groups were randomly selected. The accumulated errors were calculated as described in the Materials and Methods, followed by being sorted ascendingly, and plotted against ranked genes. We found that 21% of the genes showed an accumulated error of >5 in the traditional hybridization group, but only 4% in the parallel hybridization group (Fig. 1E). A paired t-test of the accumulated errors between the two groups revealed that the fluctuation of the traditional co-hybridization was significantly higher than that of the parallel hybridization (p < 0.05).

Prominent genes expressed in the lung

Lung-prominent genes were identified through quality filter, statistics filter, and image confirmation. Several steps of data analysis were followed (see Materials and Methods for details): (i) After hybridization, we first checked the qualities of whole hybridization images and excluded the images from poor slides (one out of 20 slides was discarded); (ii) We filtered 2,829 low quality spots based on a mean quality index of <1 as our quality filter; (iii) Statistics test using SAM analysis revealed that the expression levels of 3,576 genes were significantly different among 6 organs (false-positive ratio <5%, and median false discovery ratio <0.05); (iv) In order to identify organ-prominent or co-expressed genes, the genes passed SAM test were further analyzed by multiple comparisons using Turkey's honestly significant difference (HSD) tests at an overall confidence level of 95%. Organ-prominent genes are defined as genes that are expressed significantly higher in one particular organ than any other organs (P < 0.05). Similarly, co-expressed genes are the genes that are expressed significantly higher in two organs than any other 4 organs (P < 0.05). There were some duplicated genes in single and two organ-prominent groups. The duplicated genes with a lower OSI were filtered. The duplication was due to the HSD-based multiple comparisons. For instance, endothelial cell growth factor protein precursor (VEGF, Genbank ID: NM_031836) was expressed significantly higher in the lung than other organs (p < 0.05, OSI for lung = 0.975). This gene was also co-expressed significantly higher in the lung and the liver than in other organs (p < 0.05, OSI for lung and heart = 0.778). In this case, we thus deleted this gene from the lung-liver group; (v) Finally, we further verified the genes identified above by directly comparing the results with spot images in a spreadsheet using the RealSpot software [12]. The visually inconsistent genes with spot images were filtered. The final genes were summarized in Fig. 2 and the hot maps of these genes were shown in Fig. 3. The liver showed the highest number of prominent genes (306 genes) and spleen the lowest (75 genes). The numbers of other organ-prominent genes were brain (218), kidney (163), lung (147), and heart (95). The lung had a high number of co-expressed genes with other organs: lung-spleen (203), lung-heart (23), lung-liver (37), lung-kidney (98), and lung-brain (10). The kidney also had a high number of co-expressed genes, kidney-liver (151) and kidney-brain (19). A list of all the organ-prominent genes is given in Additional file 1 (Supplementary Table E1, the excel format to show gene expression data) and Additional files 2 and 3 (Supplementary Table E2A and E2B, the PDF format to show spot images).

Figure 2

Summary of differentially expressed genes among 6 organs. The number under an organ represents the genes that are expressed significantly higher in the respective organ compared to other organs (p < 0.05). Similarly, the number between any two organs represents the genes that are expressed significantly higher in the two organs compared to other organs (p < 0.05). Thicker lines highlight a larger number of the genes co-expressed in the respective two organs.

Figure 3

Hot maps of Organ-prominent genes. Left and right panels are the relative expression levels of genes differentially expressed in one and two organs, respectively. Each column represents 19 replicated hybridizations of each organ and each row shows the spot signals of the organ-prominent genes. The scale of normalized spot signals was indicated on the top of the graph. (A): lung: 166 genes; (B) heart: 100 genes; (C) kidney: 186 genes; (D) liver: 324 genes; (E) spleen: 88 genes; (F) brain: 225 genes; (G) lung-heart: 47 genes; (H) lung-liver: 33 genes; (I) lung-spleen: 95 genes; (J) kidney-liver: 174 genes; (K) lung-kidney: 21 genes; (E) kidney-brain: 21 genes.

The prominent genes for one or two organs were further classified into 4 functional categories: function unclear, cellular location, molecular function, and biological process, using ontology annotations from Rat Genome Database and Gene Ontology The main functional categories of prominent genes were summarized in Additional file 4 (Supplementary Table E3) for one organ and in Additional file 5 (Supplementary Table E4) for two organs. The functions of the lung-prominent genes include ligand binding, signal transducer, cell communication, development, and metabolism. The cellular location was omitted since only a few genes were documented at the sub-cellular level. It is worthy to note that the functions of 60% or more genes we identified remain unclear in the present time.

Real-time PCR verification

Based on our research interests, we focused on lung-prominent genes for real-time PCR verification. We selected genes based on both mRNA abundance (signal intensity) and organ specificity index (OSI). OSI was defined as the correlation coefficient of expression levels between an interested gene and a putative gene that had 100% specificity (see Materials and Methods). The known lung marker genes have high OSIs, e.g. T1α, 0.996; SP-A, 0.993; SP-D, 0.993; SP-B, 0.933; CCSP, 0.972; and SP-C, 0.912 (Additional file 1). We chose 13 genes, which ranked in the top 30% in signal intensity (high expression level) and the top 10% in OSI (high specificity). In addition, we selected 3 genes that ranked below 30% in signal intensity (low expression level). Real-time PCR verified 13 genes that were expressed significantly higher in the lung than in other organs (Fig. 4). These genes include BD-2, K19, Calb3, SP-D, ICAM-1, Mapk13, Crhr1, Slc29a1, Ager, Slc34a2, Lcn2, Ddr2, and Mg50. Furthermore, the expression level for most of the genes in the lung was 10 times or more greater than that in other organs. The expression pattern of these genes was consistent with DNA microarray signals (Additional file 6). Three genes, Nup155, MMP9 and Sp4, did not show a significantly higher mRNA abundance in the lung when compared to other organs under our experiment conditions. This is due to high variations between samples.

Figure 4

Relative mRNA abundance of lung-prominent genes determined by relative real-time PCR. The mRNAs from six organs were reverse-transcribed to cDNA and quantified by relative real-time PCR. All of the genes were run on the same plate with 18S rRNA as an endogenous reference. The results were expressed as % of lung. Data shown are means ± S.E. (n = 3 biological replications).The mRNA expression level of all the genes in the lung was significantly higher in other organs (P < 0.05).


In the current study, we developed a parallel hybridization, in which 6 samples can be hybridized onto one single slide. This method provides higher reproducibility and efficiency than the standard co-hybridization, and should be suitable for experiments investigating multiple biological samples. Using this system, we identified genes prominently expressed in one or two organs of the rat lung, heart, kidney, liver, spleen, and brain. Thirteen out of 16 selected lung-prominent genes were verified by real-time PCR. The genes identified in present study may be useful for further functional investigation in the lung or other organs.

The organ-prominent genes we identified were directly based on statistical comparisons of normalized spot signals. These genes were further ranked by organ specificity index (OSI). The "standard" DNA microarray data process extracts fluorescence intensities of both channels from hybridization images, and calculates and normalizes ratios for further statistical analysis. Our method is different from the "standard" analysis in several ways: (i) we linearly transformed all of the spot signals from each channel of hybridization images into a 0–1,000 scale, which made different channels and slides comparable. Unlike the ratio normalization, we retained relative expression levels in each channel. This is especially useful for multiple sample comparisons; (ii) Gene classification was based on multiple comparison. Differentially expressed genes among the 6 organs were identified from SAM test, followed by multiple comparison using Tukey's HSD; and (iii) we ranked the genes by organ specificity index (OSI), higher OSI, more specific a gene in one or two organs. In this investigation, we selected lung-prominent genes for verification based on the combination of OSI and normalized spot intensity. We chose the genes ranked in the top 10% in OSI and the top 30% in spot intensity, which ensures both the lung-specificity and the gene expression level.

Recently, several studies have compared gene expression profiles in human and mouse [26]. Only one report was done on rats with a focus on the brain using commercial Affymetrix chips (7,000 known genes and 1,000 EST) [6]. In this data set, the lung and liver were not included and only two replications for the spleen, heart, and kidney were used. In comparison, 2,426 genes out of the 3,576 differential genes (current study, without image-filter) were found to be common with the Walker's study [6]. The correlation coefficient of relative expression between the two data sets was around 0.4 for heart, kidney, or spleen. The low quantitative correlation may be due to the differences between Affymetrix and our in-house microarray platforms such as glass slide/silicon wafer, two/one channel, and 50-mer oligonucleotide/25-mer oligonucleotide set. However, the two data sets showed a consistent gene expression pattern among heart, kidney, and spleen, when we manually compared differential expression of the genes with top OSI for each pattern.

We also compared our dataset with the published datasets from other species. In the Novartis GNF dataset, transcriptomes of mouse organs were compared, each organ with duplicated single channel hybridizations [5]. Of the 147 lung-specific genes in our dataset, 102 were found in the mouse microarray dataset (totally 31,770 genes). Based on OSI>0.75, calculated from their dataset, we found that 36 lung-prominent genes are common with our dataset. Six of them were on the list of our 13 real-time PCR-verified lung-prominent genes, including Ager, K19, SP-D, ICAM-1, Slc34a2, and Lcn2. Another verified gene, MAPK13 was not in the 36 genes. Its signals were less than 50 in all of the mouse organs.

We further compared our dataset with available human datasets. The dataset and listed 43 human lung-specific genes. Many known lung-specific genes such as T1α, caveolin and CCSP were not on this list. Among 43 genes on the list, 18 genes were found in our list of lung-prominent genes, including known lung-specific genes, surfactant proteins, ager and a verified gene, Slc34a2. Similarly, in another human tissue dataset (PubmedID: 15774023), 50 lung-specific genes were identified based on one human lung tissue hybridization. Once again surfactant proteins and ager were not in list. The only common gene between the list and our lung-prominent genes was caveolin. Finally, when comparing our rat dataset (10 K genes) and the Novartis GNF dataset human (10 K genes) datasets, we found 368 common genes between the datasets. Only 2 common genes, MAPK13 and latent transforming growth factor beta binding protein 2 (LTBP2), appeared to be lung-prominent based on the OSI. There were more common genes between the rat and the mouse datasets than these between the rat and the human datasets.

The published dataset was based on one or two hybridization of normal lung tissue. Our lung-prominent genes were based on 20 replicated DNA microarray hybridizations (4 biological and 5 technical replications). We believe that our gene lists were statistically confident and had a lower false-positive or false-negative genes.

The 13 lung-prominent genes we verified by real-time PCR have various functions, including pulmonary defenses, ion/solute transport, hormone receptor, differentiation, oxidant response and tumorgenesis. Five of them are defense genes. BD-2 (β-defensin 2) is a cationic peptide with a broad-spectrum antimicrobial activity and contributes to innate immunity in the lung [13]. It is expressed in the airway epithelia [14]. BD-2 was increased in the patients with inflammation and infections [15, 16]. SP-D (surfactant protein D) is highly expressed in alveolar epithelial type II cells and plays a pivotal role in cell defense against microbes [17, 18]. For instance, it has been reported that SP-D inhibited the proliferation of bacteria by increasing the permeability of the microbial cell membrane. ICAM-1 (intercellular adhesion molecule 1) is a cell adhesion molecule and a ligand for leukocyte adhesion molecule LFA-1. ICAM-1 also participates in the inflammatory response to lipopolysaccharide-induced lung injury by interacting mainly with neutrophils [19]. Lipcocalin 2 (also known as α2u-globulin-related protein, X13295) is a member of lipocalin protein family composed of small secreted proteins that have the ability to bind to small hydrophobic ligands [20]. Lipocalin 2 expression in the lung is markedly increased in acute lung injury caused by diesel exhaust particles and lipopolysaccharide [21]. Mapk13 (mitogen activated protein kinase 13) plays a role in stress and inflammatory responses via the MAPK cascade signaling pathway. Mapk13 is predominantly expressed in the lung although a small amount of Mapk 13 is also present in kidney [22], which is consistent with our results (Fig. 4).

Three of the identified lung prominent genes are ion/solute transporters. Calb3 (Calbindin 3), a vitamin D-dependent Ca2+ binding protein, was previously studied in the intestine, uterus, placenta, and lung epithelium [23]. It is a Ca2+ transporter and regulates Ca2+ homeostasis. Slc29a1 (solute carrier family 29 transporter, member 1) is an equilibrative nitrobenzylthioinosine-sensitive nucleoside transporter (ENT1), which transports nucleosides into or out of the cells in a Na+-independent manner [24]. Northern blot analysis has shown that Slc29a1 is highly expressed in the lung and testes [25]. It plays a role in nucleotide biosynthesis and cellular signaling. Slc34a2 (solute carrier family 34 sodium phosphate, member 2) is a sodium dependent phosphate transporter. It has been shown that Slc34a2 was predominantly expressed in the lung and in situ hybridization revealed that it is localized in alveolar type II cells [26]. Slc34a2 provides inorganic phosphate for the synthesis of lung surfactant.

Crhr1 (corticotropin releasing hormone receptor 1) is a receptor that binds corticotropin-releasing hormone. The mice null for the CRFR1 gene died within 48 hours after birth because of a pronounced lung dysplasia [27]. Interestingly, variation of Crhr1 was associated with improved function in the asthma patients who were treated with inhaled corticosteroids [28]. K19 (keratin 19) is expressed in epithelial cells, involved in testicular differentiation and lung cancer [29, 30]. Ager (advanced glycosylation end product-specific receptor) is a member of the immunoglobin superfamily and is involved in oxidant response. It is specifically expressed in alveolar epithelial type I cells [31]. Lung type I cells are squamous, covering >90% of alveolar surface, and, thus, are easily damaged by oxidants. Ager may protect lung type I cells from oxidative injury.

The two lung-prominent genes with lower mRNA abundance, Ddr2 and Mg50, may be involved in human tumorgenesis, and the regulation of collagen remodeling in the lung

The functions of 13 verified genes as well as some highly abundant co-expressed genes in the lung and another organ were summarized in Table 2. These co-expressed genes were previously studied in the lung or another organ. The most prominent genes expressed in the lung were relevant to pulmonary protection, including oxidant response, injury and repair, inflammatory, cell defense, and immune response. These genes also contribute to organ construction such as lung veins, energy supply, and epithelial tight junction. Some of these genes may be important for cell proliferation, such as anp and nf2. Two genes, anp and aqp5 may play a role in asthma and edema, respectively. The function of cd37 is currently unclear in any of the organs. Its prominent and specific expression in the lung may imply its important role for lung function. Cd37 may participate in cell proliferation in the lung based on the studies from other members of this gene family. Similarly, cathepsin Y may play a role of surfactant protein processing or apoptosis considering its endopeptidase activity in the spleen and the functions of cathepsin D and H in the lung. These hypothesized functions may serve as a starting point for further functional studies in the respective organs.

Table 2 Gene functions in the lung and 2nd organ

The parallel hybridization system has several advantages over the traditional two-color hybridization. First, in this hybridization system 6 paired and dual-color labeled samples were hybridized onto one slide and scanned under identical conditions. The homogenous conditions on one slide improved the reproducibility and decreased the variation, especially accumulated experimental errors. The latter is problematic in microarray experiments involving a series of samples such as a time course study. Second, any two of the six samples in a parallel hybridization can be directly compared, whereas only two paired samples can be directly compared in the traditional two-color hybridization of a reference or loop design. This increases the experimental efficiency and reduces the number of slides and the amount of RNAs in a whole experiment. In the parallel hybridization system, only one slide is needed for six samples. In contrast, 6 slides are required for a reference or loop design of six samples in the traditional two-color co-hybridization. The RNA amount is reduced to half that of the traditional hybridization. This is because each sample needs to hybridize twice with neighboring samples in the loop design or hybridize to a common reference consisting of all the samples in the reference design. Multiple-color hybridization on one slide could be developed for three or more samples labeled with distinct fluorescence dyes. However, the potential cross-talk among fluorescence dyes and the need for multiple lasers of a scanner limit its application.

The organ-prominent genes in the current study were identified from 6 organs. Some of them may be expressed higher in other tissues outside the 6 organs we monitored. This limitation may be overcome by further improvement of the parallel hybridization system. One possibility is to include one common control organ (e.g. lung) in all of the parallel hybridization slides. Although it reduces the efficiency, the transcriptomes of more than 6 organs can be directly compared. Another possibility is the potential technical improvement of spot printing and sample arrangement, which may result in more than 6 samples on one parallel slide. In the present study, we printed 10 K rat genes in triplicate on three blocks on one slide. Each block contains 16 sub-arrays (4.5 × 4.5 mm) consisting of 625 genes. Therefore, 6 samples can be hybridized to 10 K genes in this system. If we print 625 genes onto 48 sub-arrays in replicate, 96 dual-color labeled samples can be hybridized on one slide. Furthermore, if we increase the printing resolution from 160 to 80 microns, we can print 2,500 spots on one sub-array. Consequently, 96 samples can be hybridized to one slide containing 2,500 genes. Another improvement may be the separation of the slide regions. We used thermostatic tapes to divide 3 blocks, which may not be appropriate for more samples. The chambered coverslips of 24 or 48 wells such as CultureWell™ coverslip system or array of arrays glass wafer [32] may be adapted for this purpose.


Microarray preparation

The DNA microarray slides used in this study were in-house printed on epoxy-coated glass slides with 50-mer aminated oligonucleotides, Pan Rat 10 K Oligonucleotide Set (MWG Biotech Inc., High Point, NC). It contains 6,221 known rat genes, 3,594 rat ESTs, and 169 Arabidopsis negative controls. The oligonucleotides were suspended in 3× SSC at 25 μM and printed on epoxy-coated slides (CEL Associates, Pearland, Texas) with an OmniGrid 100 arrayer (GeneMachine, San Carlos, CA) per manufacturer's instructions. Each oligonucleotide was spotted in triplicate on three identical 18 × 18 mm blocks: A, B, and C (Table 1). The total spots on one slide were 30,000 including 186 blank spots. The spot-spot distance was 180 μm and the space between blocks was 4 mm. The printed slides were incubated in 65% humidity overnight at room temperature. The slides were then dried and stored in room temperature. Prior to hybridization, the slides were washed one time with 0.2% SDS, four times with water, and dried by centrifugation. The 3 blocks on a slide were separated by two 2 × 25 × 1 mm thermostatic transparent tape stripes during the hybridization. The stripes were removed after hybridization to wash and scann slides.

Sample collection and hybridization

Six organs, the lung, heart, kidney, liver, spleen, and brain of male Sprague-Dawley rats (200 g, Charles River Laboratories, Inc., Wilmington, MA) were dissected. The organs were briefly washed with deionized water and immediately homogenized in 10 ml TRI reagents (Molecular Research Center, Cincinnati, OH). Total RNA was subsequently extracted according to the manufacturer's protocol. RNA quality and quantity were assessed by spectrophotometer (NanoDrop Technologies, Inc, Rockland, DE) and agarose gel electrophoresis. Total RNA samples were aliquoted (20 μg each) for cDNA synthesis and 2-step microarray hybridization with 3DNA 50 Expression kit (Genisphere Inc., Hatfield, PA). Briefly, total RNA was reverse-transcribed with Cy3- or Alexa 647-specific primers. The cDNA products were purified with the Microcom YM-30 columns (Millipore, Billerica, MA) and mixed with 2× formamide hybridization buffer (50% formamide, 6× SSC, 0.2% SDS). The DNA microarray slides were hybridized with the cDNA samples at 42°C for 48 hours. The slides were washed and re-hybridized with Cy3- and Alexa 647-specific capture reagents at 42°C for 2 hours. In our experiments, the concentration of purified cDNA samples were normalized to 0.5 – 0.6 μg/μl before hybridization. The cDNA aliquots from 6 organs of the same rat were randomly paired and independently hybridized onto one of 3 blocks on a glass slide. Each sample was repeated 20 times: 4 biological replications and 5 technical replications. The arrangement of samples, fluorescence dyes, and blocks for one of the biological replications is shown in Table 1. The other 3 biological replications were similarly arranged in a style of random block design. Each hybridized slide was scanned twice by a laser confocal scanner, ScanArray Express (PerkinElmer Life and Analytical Sciences, Boston, MA). The first scanning was used for quantification and performed with 90% laser power and 70~80% PMT so that about 5% spots were saturated. The second scanning was used for spot alignment and was carried out with 90% laser power and 95% PMT. Hybridization images were analyzed with GenePix pro 4 (Axon Instruments, Inc. Union City, CA).

Data analysis

Hybridization reproducibility

The reproducibility was assessed by Pearson correlation coefficients of spot signals from self-self hybridizations. The spot signals were background-subtracted fluorescence intensity extracted from hybridization images by GenePix. To estimate the variations among 6 paired organs, accumulated errors of log ratios were calculated. The log ratios between two samples were assessed from the respective spot signals, normalized by local weighted scatter plot smooth (LOWESS) based on print-tip. The accumulated error of ratios of each gene was assessed as (1)

e = ( log ' s 1 s 2 + log ' s 2 s 3 + log ' s 3 s 4 + log ' s 4 s 5 + log ' s 5 s 6 + log ' s 6 s 1 ) 2 ( 1 ) MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGLbqzcqGH9aqpdaqadaqaaiGbcYgaSjabc+gaVjabcEgaNjabcEcaNmaalaaabaGaem4CamNaeGymaedabaGaem4CamNaeGOmaidaaiabgUcaRiGbcYgaSjabc+gaVjabcEgaNjabcEcaNmaalaaabaGaem4CamNaeGOmaidabaGaem4CamNaeG4mamdaaiabgUcaRiGbcYgaSjabc+gaVjabcEgaNjabcEcaNmaalaaabaGaem4CamNaeG4mamdabaGaem4CamNaeGinaqdaaiabgUcaRiGbcYgaSjabc+gaVjabcEgaNjabcEcaNmaalaaabaGaem4CamNaeGinaqdabaGaem4CamNaeGynaudaaiabgUcaRiGbcYgaSjabc+gaVjabcEgaNjabcEcaNmaalaaabaGaem4CamNaeGynaudabaGaem4CamNaeGOnaydaaiabgUcaRiGbcYgaSjabc+gaVjabcEgaNjabcEcaNmaalaaabaGaem4CamNaeGOnaydabaGaem4CamNaeGymaedaaaGaayjkaiaawMcaamaaCaaaleqabaGaeGOmaidaaOGaaCzcaiaaxMaadaqadaqaaiabigdaXaGaayjkaiaawMcaaaaa@74A6@

Where e is the accumulated error, and log ' s i s j MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacyGGSbaBcqGGVbWBcqGGNbWzcqGGNaWjdaWcaaqaaiabdohaZjabdMgaPbqaaiabdohaZjabdQgaQbaaaaa@3746@ are normalized log ratios between samples s1~s6, the 6 organs arranged in a loop design. The e was calculated in 2 groups, within-slide group and among-slide group. The log ratios of within-slide group were obtained from one slide with 6 samples, and those of among-slide group were from 6 slides comprised of a loop design for 6 samples.

Identification of lung-prominent genes

To identify differentially expressed genes among 6 organs, we first globally normalized 16-bit mean fluorescence intensity of each gene from original images using the software RealSpot developed in our laboratory [12] (freely available for download for academic usage, The global normalization converted the weakest 5% fluorescence intensities to 0 (background) and the strongest 5% fluorescence intensities to 1,000 (saturated spots, reflecting normally scanned images). The other fluorescence intensities were scaled to the range of 0 to 1,000. This transformation makes different slides and different channels comparable. It is similar to Affymetrix single channel data normalization. The transformed images and intensities were used for data quality filters, statistics tests, and direct confirmation of the data analysis results with spot images.

For spot quality evaluation, a quality index (QI) was assigned to each spot based on signal intensity and signal-to-noise ratio. QI 0–4 indicate empty, weak, middle, strong, and saturated spots, respectively. By default, QI 0 and 4 were assigned to the empty and saturated spots, whose intensities were less than 30% and greater than 95%, respectively. QI 1–3 was calculated, based on the intensity of spot signals, as: Q I i j = r o u n d ( I i j I 0 I 1 I 0 4 ) MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGrbqucqWGjbqsdaWgaaWcbaGaemyAaKMaemOAaOgabeaakiabg2da9iabdkhaYjabd+gaVjabdwha1jabd6gaUjabdsgaKjabcIcaOmaalaaabaGaemysaK0aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGHsislcqWGjbqsdaWgaaWcbaGaeGimaadabeaaaOqaaiabdMeajnaaBaaaleaacqaIXaqmaeqaaOGaeyOeI0IaemysaK0aaSbaaSqaaiabicdaWaqabaaaaOGaey4fIOIaeGinaqJaeiykaKcaaa@4A2C@ , where QIij is the quality index of spot j on slide i and Iij the intensity of the spot j on slide i. By default, I0 is the intensity at 30th percentile, and I1 at 95th percentile of the plot (intensity vs gene rank percentage) of the slide image. A QI of 5 was assigned to a contaminated or bad spot based on signal background ratio (SBR). By default, any spots with a SBR of <2.0 were given a QI of 5. A mean quality index was calculated from the replicated spots of a gene from multiple slides, excluding bad spots (QI = 5). Data were filtered if a mean quality index was 1.0 or less.

For the genes that passed the quality index filter, statistical tests were performed. The genes with a significantly differential expression among 6 organs for at least one organ-pair were identified by a software package, SAM, (Significant Analysis of Microarray, [11]. The median false discovery ratio (FDR) cutoff for a multiple class response test by SAM was set to 5%. The genes with a minimal FDR (q-value) of >5% were discarded. The genes that passed the SAM test were further classified into organ-prominent genes or co-expressed genes in two organs by pair-wise multiple comparisons with Tukey's honestly significant difference (HSD) at an overall confidence level of 95%. Organ-prominent genes were defined as the genes that were expressed significantly higher in one particular organ than in other organs (p < 0.05). Similarly, co-expressed genes in two organs were defined as the genes that were expressed in the two organs than the other 4 organs.

To determine the relative specificity of a gene among organs, an organ specificity index (OSI) was defined as the correlation coefficient of gene expression levels between a gene and a putative gene. The expression levels of a putative gene were 1,000 in prominent organs and 0 in other organs. For example, the expression level of a putative gene prominent in the lung will be (from left to right are lung, heart, kidney, liver, spleen, and brain) 1,000, 0, 0, 0, 0, 0. The OSI is calculated as

O S I = i = 1 n ( X i P i ) i = 1 n X i i = 1 n P i n i = 1 n ( X i 2 ) ( i = 1 n X i ) 2 n i = 1 n ( P i 2 ) ( i = 1 n P i ) 2 n ( 2 ) MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGpbWtcqWGtbWucqWGjbqscqGH9aqpdaWcaaqaamaaqahabaGaeiikaGIaemiwaGLaemyAaKMaey4fIOIaemiuaaLaemyAaKMaeiykaKIaeyOeI0YaaSaaaeaadaaeWbqaaiabdIfayjabdMgaPjabgEHiQmaaqahabaGaemiuaaLaemyAaKgaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGUbGBa0GaeyyeIuoaaSqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6gaUbqdcqGHris5aaGcbaGaemOBa4gaaaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdaakeaadaGcaaqaamaaqahabaGaeiikaGIaemiwaGLaemyAaK2aaWbaaSqabeaacqaIYaGmaaGccqGGPaqkcqGHsisldaWcaaqaaiabcIcaOmaaqahabaGaemiwaGLaemyAaKMaeiykaKYaaWbaaSqabeaacqaIYaGmaaaabaGaemyAaKMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdaakeaacqWGUbGBaaaaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGUbGBa0GaeyyeIuoaaSqabaGcdaGcaaqaamaaqahabaGaeiikaGIaemiuaaLaemyAaK2aaWbaaSqabeaacqaIYaGmaaGccqGGPaqkcqGHsisldaWcaaqaaiabcIcaOmaaqahabaGaemiuaaLaemyAaKMaeiykaKYaaWbaaSqabeaacqaIYaGmaaaabaGaemyAaKMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdaakeaacqWGUbGBaaaaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGUbGBa0GaeyyeIuoaaSqabaaaaOGaaCzcaiaaxMaadaqadaqaaiabikdaYaGaayjkaiaawMcaaaaa@9084@

Where Xi and Pi are the mean gene expression levels of each organ of a gene and the putative gene, respectively, in organ i. N is the total number of organs (n = 6 in this study). A higher correlation coefficient indicates a higher tendency of a gene for expression in a particular organ.

Finally, the gene expression data were directly compared with the respective spot images. The spot images of the genes in each sorted data set were searched and organized by RealSpot. The genes with visual consistence between differential gene expression and spot images were marked as highly prominent genes for the organ(s). The functional categories of these highly prominent genes were assessed based on gene ontology annotation from Rat Genome Database gene association file (RGD, and gene ontology definitions (GO,

Real-time PCR

Selected lung-specific genes were validated by SYBR Green I based real-time PCR (QIAGEN, Foster City, CA) as previously described [33]. Total RNA (5 μg) was reverse-transcribed into cDNA with 0.2 μg/μl dT17, 0.3 μg/μl random hexamer primer, and MMLV reverse transcriptase (Invitrogen Inc., Carlsbad, CA). The primer pairs were as follows ("_F": forward, "_R": reverse): beta defensin-2, BD-2_F, AAT CAC ATG CCT GAC CAA AGGA; BD-2_R, GGA GCA AAT TCT GTT CAT CCCA; keratin19, K19_F, CCA GGT CGC TGT CCA CAC TAC; K19_R, CCT TCC AGG GCA GCT TTC AT; vitamin D-dependent calcium-binding protein, Calb3_F, CAG CAC TCA CTG ACA GCA AGCA, Calb3_R, TCC TCC TTG GAC AGC TGG TTT; surfactant protein D, SP-D_F, TTC TCT CCA TGC TTG TCC TGC T, SP-D_R, GAC TAG GGT GCA CGT GTT GGT T; intercellular adhesion molecule 1, ICAM-1_F, GGA GTC TCA TGC CCG TGA AAT, ICAM-1_R, GTG CCT ACC CTC CCA CAA CA; mitogen activated protein kinase 13, Mapk13_F, CCC AGC AGC CAT TTG ATG AT, Mapk13_R, CAC TGC AGC TTC ATC CCA CTT; corticotropin releasing hormone receptor, Crhr1_F, GGT CTC CAG GGT CGT CTT CAT C, Crhr1_R, ACG CCA CCT CTT CCG GAT AG; solute carrier family 29 transporters, member 1, Slc29a1_F, GGA CAA TGG TCT CTG ACG GAC A; Slc29a1_R, CCT GGA ACA GGC ACA GAA GAA A; advanced glycosylation end product-specific receptor, Ager_F, TCC GGT GTC GGG CAA CTA, Ager_R, GGG ACA TTG GCT GTG AGT TCAG; solute carrier family 34 sodium phosphate, member 2, Slc34a2_F, GCC CAT AGG TGT GAG CCT TTC, Slc34a2_R, CCC CAT TCA CTC CAT CCT AGG A; lipocalin 2, Lcn2_F, TCT GGG CCT CAA GGA TAA CAAC, Lcn2_R, AGA CAG GTG GGA CCT GAA CCA; matrix metalloproteinase 9, MMP9_F, TGG GCA TTA GGG ACA GAG GAAT, MMP9_R, GGG CTG TTT CCC CTG TGA GT; nucleoporin 155kd, Nup155_F, AAG TGG ATC AAA ACC GAG TTCG, Nup155_R, TCG CTG CTG CAG TGA AAT TTC; discoidin domain receptor family, member 2, Ddr2_F, AAC CAA GCA CCG ACC ATC CTT, Ddr2_R, ATG TGG CTG AGC GGT AGG TCT T; trans-acting transcription factor 4, Sp4_F, TTG TCA CAG TTG CCG CCA TT, Sp4_R, TGA CCA GCC CAT TTC CAG ATT T; melanoma-associated antigen, Mg50_F, TGC CAC ATC AGT CAC CCA TGA, Mg50_R, AGC CGA GAC TCC AGG CTG TTT A;18S rRNA_F: TCC CAG TAA GTG CGG GTC ATA, 18s rRNA_R: CGA GGG CCT CAC TAA ACC ATC. The real-time PCR thermal conditions for all 14 genes listed above were 95°C 15 min, followed by 40 cycles of 95°C for 30 sec, 60°C for 30 sec, 72°C for 30 sec, and 77°C for 35 sec. To eliminate experimental variations, all genes were amplified in the same plate, each with 6 organ cDNA samples from one rat (totally 84 wells for organ samples, other wells for negative controls). Three plates were used for the three biological replications. Data were analyzed using relative real-time PCR quantification based on the delta delta Ct method [34]. The endogenous reference gene was 18S rRNA, and the control organ was lung. One-way ANOVA tests were performed for statistical significance (p < 0.05).


  1. 1.

    Wheeler DL, Church DM, Edgar R, Federhen S, Helmberg W, Madden TL, Pontius JU, Schuler GD, Schriml LM, Sequeira E, Suzek TO, Tatusova TA, Wagner L: Database resources of the National Center for Biotechnology Information: update. Nucleic Acids Res. 2004, 32 Database issue: D35-D40. 10.1093/nar/gkh073.

    Article  Google Scholar 

  2. 2.

    Sandberg R, Yasuda R, Pankratz DG, Carter TA, Del Rio JA, Wodicka L, Mayford M, Lockhart DJ, Barlow C: Regional and strain-specific gene expression mapping in the adult mouse brain. Proc Natl Acad Sci U S A. 2000, 97: 11038-11043. 10.1073/pnas.97.20.11038.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  3. 3.

    Miki R, Kadota K, Bono H, Mizuno Y, Tomaru Y, Carninci P, Itoh M, Shibata K, Kawai J, Konno H, Watanabe S, Sato K, Tokusumi Y, Kikuchi N, Ishii Y, Hamaguchi Y, Nishizuka I, Goto H, Nitanda H, Satomi S, Yoshiki A, Kusakabe M, DeRisi JL, Eisen MB, Iyer VR, Brown PO, Muramatsu M, Shimada H, Okazaki Y, Hayashizaki Y: Delineating developmental and metabolic pathways in vivo by expression profiling using the RIKEN set of 18,816 full-length enriched mouse cDNA arrays. Proc Natl Acad Sci U S A. 2001, 98: 2199-2204. 10.1073/pnas.041605498.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  4. 4.

    Warrington JA, Nair A, Mahadevappa M, Tsyganskaya M: Comparison of human adult and fetal expression and identification of 535 housekeeping/maintenance genes. Physiol Genomics. 2000, 2: 143-147.

    PubMed  CAS  Google Scholar 

  5. 5.

    Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci U S A. 2004, 101: 6062-6067. 10.1073/pnas.0400782101.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  6. 6.

    Walker JR, Su AI, Self DW, Hogenesch JB, Lapp H, Maier R, Hoyer D, Bilbe G: Applications of a rat multiple tissue gene expression data set. Genome Res. 2004, 14: 742-749. 10.1101/gr.2161804.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  7. 7.

    Quackenbush J: Microarray data normalization and transformation. Nat Genet. 2002, 32: 496-501. 10.1038/ng1032.

    PubMed  CAS  Article  Google Scholar 

  8. 8.

    Kerr MK, Churchill GA: Statistical design and the analysis of gene expression microarray data. Genet Res. 2001, 77: 123-128. 10.1017/S0016672301005055.

    PubMed  CAS  Google Scholar 

  9. 9.

    Pan W: A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments. Bioinformatics. 2002, 18: 546-554. 10.1093/bioinformatics/18.4.546.

    PubMed  CAS  Article  Google Scholar 

  10. 10.

    Konig R, Baldessari D, Pollet N, Niehrs C, Eils R: Reliability of gene expression ratios for cDNA microarrays in multiconditional experiments with a reference design. Nucleic Acids Res. 2004, 32: e29-10.1093/nar/gnh027.

    PubMed  PubMed Central  Article  Google Scholar 

  11. 11.

    Tusher VG, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci U S A. 2001, 98: 5116-5121. 10.1073/pnas.091062498.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  12. 12.

    Chen Z, Liu L: RealSpot: Software validating results from DNA microarray data analysis with spot images. Physiol Genomics. 2005, 21: 284-291. 10.1152/physiolgenomics.00236.2004.

    PubMed  Article  Google Scholar 

  13. 13.

    Jia HP, Mills JN, Barahmand-Pour F, Nishimura D, Mallampali RK, Wang G, Wiles K, Tack BF, Bevins CL, McCray PBJ: Molecular cloning and characterization of rat genes encoding homologues of human beta-defensins. Infect Immun. 1999, 67: 4827-4833.

    PubMed  CAS  PubMed Central  Google Scholar 

  14. 14.

    Wang X, Zhang Z, Louboutin JP, Moser C, Weiner DJ, Wilson JM: Airway epithelia regulate expression of human beta-defensin 2 through Toll-like receptor 2. FASEB J. 2003, 17: 1727-1729.

    PubMed  CAS  Google Scholar 

  15. 15.

    Schaller-Bals S, Schulze A, Bals R: Increased levels of antimicrobial peptides in tracheal aspirates of newborn infants during infection. Am J Respir Crit Care Med. 2002, 165: 992-995.

    PubMed  Article  Google Scholar 

  16. 16.

    Ross DJ, Cole AM, Yoshioka D, Park AK, Belperio JA, Laks H, Strieter RM, Lynch JPIII, Kubak B, Ardehali A, Ganz T: Increased bronchoalveolar lavage human beta-defensin type 2 in bronchiolitis obliterans syndrome after lung transplantation. Transplantation. 2004, 78: 1222-1224. 10.1097/01.TP.0000137265.18491.75.

    PubMed  CAS  Article  Google Scholar 

  17. 17.

    Crouch EC: Collectins and pulmonary host defense. Am J Respir Cell Mol Biol. 1998, 19: 177-201.

    PubMed  CAS  Article  Google Scholar 

  18. 18.

    Wright JR: Immunomodulatory functions of surfactant. Physiol Rev. 1997, 77: 931-962.

    PubMed  CAS  Google Scholar 

  19. 19.

    Beck-Schimmer B, Madjdpour C, Kneller S, Ziegler U, Pasch T, Wuthrich RP, Ward PA, Schimmer RC: Role of alveolar epithelial ICAM-1 in lipopolysaccharide-induced lung inflammation. Eur Respir J. 2002, 19: 1142-1150. 10.1183/09031936.02.00236602.

    PubMed  CAS  Article  Google Scholar 

  20. 20.

    Flower DR: The lipocalin protein family: structure and function. Biochem J. 1996, 318: 1-14.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  21. 21.

    Yanagisawa R, Takano H, Inoue K, Ichinose T, Yoshida S, Sadakane K, Takeda K, Yoshino S, Yamaki K, Kumagai Y, Yoshikawa T: Complementary DNA microarray analysis in acute lung injury induced by lipopolysaccharide and diesel exhaust particles. Exp Biol Med (Maywood ). 2004, 229: 1081-1087.

    CAS  Google Scholar 

  22. 22.

    Hu MC, Wang YP, Mikhail A, Qiu WR, Tan TH: Murine p38-delta mitogen-activated protein kinase, a developmentally regulated protein kinase that is activated by stress and proinflammatory cytokines. J Biol Chem. 1999, 274: 7095-7102. 10.1074/jbc.274.11.7095.

    PubMed  CAS  Article  Google Scholar 

  23. 23.

    Dupret JM, L'Horset F, Perret C, Bernaudin JF, Thomasset M: Calbindin-D9K gene expression in the lung of the rat. Absence of regulation by 1,25-dihydroxyvitamin D3 and estrogen. Endocrinology. 1992, 131: 2643-2648. 10.1210/en.131.6.2643.

    PubMed  CAS  Google Scholar 

  24. 24.

    Yao SY, Ng AM, Muzyka WR, Griffiths M, Cass CE, Baldwin SA, Young JD: Molecular cloning and functional characterization of nitrobenzylthioinosine (NBMPR)-sensitive (es) and NBMPR-insensitive (ei) equilibrative nucleoside transporter proteins (rENT1 and rENT2) from rat tissues. J Biol Chem. 1997, 272: 28423-28430. 10.1074/jbc.272.45.28423.

    PubMed  CAS  Article  Google Scholar 

  25. 25.

    Lu H, Chen C, Klaassen C: Tissue distribution of concentrative and equilibrative nucleoside transporters in male and female rats and mice. Drug Metab Dispos. 2004, 32: 1455-1461. 10.1124/dmd.104.001123.

    PubMed  CAS  Article  Google Scholar 

  26. 26.

    Hashimoto M, Wang DY, Kamo T, Zhu Y, Tsujiuchi T, Konishi Y, Tanaka M, Sugimura H: Isolation and localization of type IIb Na/Pi cotransporter in the developing rat lung. Am J Pathol. 2000, 157: 21-27.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  27. 27.

    Smith GW, Aubry JM, Dellu F, Contarino A, Bilezikjian LM, Gold LH, Chen R, Marchuk Y, Hauser C, Bentley CA, Sawchenko PE, Koob GF, Vale W, Lee KF: Corticotropin releasing factor receptor 1-deficient mice display decreased anxiety, impaired stress response, and aberrant neuroendocrine development. Neuron. 1998, 20: 1093-1102. 10.1016/S0896-6273(00)80491-2.

    PubMed  CAS  Article  Google Scholar 

  28. 28.

    Tantisira KG, Lake S, Silverman ES, Palmer LJ, Lazarus R, Silverman EK, Liggett SB, Gelfand EW, Rosenwasser LJ, Richter B, Israel E, Wechsler M, Gabriel S, Altshuler D, Lander E, Drazen J, Weiss ST: Corticosteroid pharmacogenetics: association of sequence variants in CRHR1 with improved lung function in asthmatics treated with inhaled corticosteroids. Hum Mol Genet. 2004, 13: 1353-1359. 10.1093/hmg/ddh149.

    PubMed  CAS  Article  Google Scholar 

  29. 29.

    Bodenmuller H, Ofenloch-Hahnle B, Lane EB, Dessauer A, Bottger V, Donie F: Lung cancer-associated keratin 19 fragments: development and biochemical characterisation of the new serum assay Enzymun-Test CYFRA 21-1. Int J Biol Markers. 1994, 9: 75-81.

    PubMed  CAS  Google Scholar 

  30. 30.

    Appert A, Fridmacher V, Locquet O, Magre S: Patterns of keratins 8, 18 and 19 during gonadal differentiation in the mouse: sex- and time-dependent expression of keratin 19. Differentiation. 1998, 63: 273-284.

    PubMed  CAS  Article  Google Scholar 

  31. 31.

    Shirasawa M, Fujiwara N, Hirabayashi S, Ohno H, Iida J, Makita K, Hata Y: Receptor for advanced glycation end-products is a marker of type I lung alveolar cells. Genes Cells. 2004, 9: 165-174. 10.1111/j.1356-9597.2004.00712.x.

    PubMed  CAS  Article  Google Scholar 

  32. 32.

    Zarrinkar PP, Mainquist JK, Zamora M, Stern D, Welsh JB, Sapinoso LM, Hampton GM, Lockhart DJ: Arrays of arrays for high-throughput gene expression profiling. Genome Res. 2001, 11: 1256-1261. 10.1101/gr.GR-1748R.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  33. 33.

    Chen Z, Jin N, Narasaraju T, Chen J, McFarland LR, Scott M, Liu L: Identification of two novel markers for alveolar epithelial type I and II cells. Biochem Biophys Res Commun. 2004, 319: 774-780. 10.1016/j.bbrc.2004.05.048.

    PubMed  CAS  Article  Google Scholar 

  34. 34.

    Pfaffl MW: A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001, 29: e45-10.1093/nar/29.9.e45.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  35. 35.

    Millino C, Sarinella F, Tiveron C, Villa A, Sartore S, Ausoni S: Cardiac and smooth muscle cell contribution to the formation of the murine pulmonary veins. Dev Dyn. 2000, 218: 414-425. 10.1002/1097-0177(200007)218:3<414::AID-DVDY1002>3.0.CO;2-H.

    PubMed  CAS  Article  Google Scholar 

  36. 36.

    Murphy AM, Thompson WR, Peng LF, Jones L: Regulation of the rat cardiac troponin I gene by the transcription factor GATA-4. Biochem J. 1997, 322: 393-401.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  37. 37.

    Martin AF, Orlowski J: Molecular cloning and developmental expression of the rat cardiac-specific isoform of troponin I. J Mol Cell Cardiol. 1991, 23: 583-588. 10.1016/0022-2828(91)90050-V.

    PubMed  CAS  Article  Google Scholar 

  38. 38.

    Mell OC, Seibel P, Kadenbach B: Structural organisation of the rat genes encoding liver- and heart-type of cytochrome c oxidase subunit VIa and a pseudogene related to the COXVIa-L cDNA. Gene. 1994, 140: 179-186. 10.1016/0378-1119(94)90542-8.

    PubMed  CAS  Article  Google Scholar 

  39. 39.

    Lenka N, Basu A, Mullick J, Avadhani NG: The role of an E box binding basic helix loop helix protein in the cardiac muscle-specific expression of the rat cytochrome oxidase subunit VIII gene. J Biol Chem. 1996, 271: 30281-30289. 10.1074/jbc.271.47.30281.

    PubMed  CAS  Article  Google Scholar 

  40. 40.

    Mohapatra SS, Lockey RF, Vesely DL, Gower WRJ: Natriuretic peptides and genesis of asthma: an emerging paradigm?. J Allergy Clin Immunol. 2004, 114: 520-526. 10.1016/j.jaci.2004.05.028.

    PubMed  CAS  Article  Google Scholar 

  41. 41.

    Cayli S, Ustunel I, Celik-Ozenci C, Korgun ET, Demir R: Distribution patterns of PCNA and ANP in perinatal stages of the developing rat heart. Acta Histochem. 2002, 104: 271-277. 10.1078/0065-1281-00648.

    PubMed  Article  Google Scholar 

  42. 42.

    Matsuzaki T, Tajika Y, Suzuki T, Aoki T, Hagiwara H, Takata K: Immunolocalization of the water channel, aquaporin-5 (AQP5), in the rat digestive system. Arch Histol Cytol. 2003, 66: 307-315. 10.1679/aohc.66.307.

    PubMed  CAS  Article  Google Scholar 

  43. 43.

    Wallace TJ, Ghosh S, McLean GW: Molecular cloning and expression of rat lung carboxylesterase and its potential role in the detoxification of organophosphorus compounds. Am J Respir Cell Mol Biol. 1999, 20: 1201-1208.

    PubMed  CAS  Article  Google Scholar 

  44. 44.

    Lian X, Yan C, Yang L, Xu Y, Du H: Lysosomal acid lipase deficiency causes respiratory inflammation and destruction in the lung. Am J Physiol Lung Cell Mol Physiol. 2004, 286: L801-L807. 10.1152/ajplung.00335.2003.

    PubMed  CAS  Article  Google Scholar 

  45. 45.

    Ghosh S, Mallonee DH, Hylemon PB, Grogan WM: Molecular cloning and expression of rat hepatic neutral cholesteryl ester hydrolase. Biochim Biophys Acta. 1995, 1259: 305-312.

    PubMed  Article  Google Scholar 

  46. 46.

    Lacroix D, Desrochers M, Lambert M, Anderson A: Alternative splicing of mRNA encoding rat liver cytochrome P450e (P450IIB2). Gene. 1990, 86: 201-207. 10.1016/0378-1119(90)90280-5.

    PubMed  CAS  Article  Google Scholar 

  47. 47.

    Daugherty BL, Mateescu M, Patel AS, Wade K, Kimura S, Gonzales LW, Guttentag S, Ballard PL, Koval M: Developmental regulation of claudin localization by fetal alveolar epithelial cells. Am J Physiol Lung Cell Mol Physiol. 2004, 287: L1266-L1273. 10.1152/ajplung.00423.2003.

    PubMed  CAS  Article  Google Scholar 

  48. 48.

    Rahner C, Mitic LL, Anderson JM: Heterogeneity in expression and subcellular localization of claudins 2, 3, 4, and 5 in the rat liver, pancreas, and gut. Gastroenterology. 2001, 120: 411-422. 10.1053/gast.2001.21736.

    PubMed  CAS  Article  Google Scholar 

  49. 49.

    Harrison CA, Raftery MJ, Walsh J, Alewood P, Iismaa SE, Thliveris S, Geczy CL: Oxidation regulates the inflammatory properties of the murine S100 protein S100A8. J Biol Chem. 1999, 274: 8561-8569. 10.1074/jbc.274.13.8561.

    PubMed  CAS  Article  Google Scholar 

  50. 50.

    Buhling F, Ittenson A, Kaiser D, Tholert G, Hoffmann B, Reinhold D, Ansorge S, Welte T: MRP8/MRP14, CD11b and HLA-DR expression of alveolar macrophages in pneumonia. Immunol Lett. 2000, 71: 185-190. 10.1016/S0165-2478(00)00164-4.

    PubMed  CAS  Article  Google Scholar 

  51. 51.

    Bruggemann M, Free J, Diamond A, Howard J, Cobbold S, Waldmann H: Immunoglobulin heavy chain locus of the rat: striking homology to mouse antibody genes. Proc Natl Acad Sci U S A. 1986, 83: 6075-6079. 10.1073/pnas.83.16.6075.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  52. 52.

    Petitprez K, Grzych JM, Pierrot C, Godin C, Capron A, Khalife J: Molecular cloning and expression of an anti-idiotype antibody mimicking a protective oligosaccharide of the parasite Schistosoma mansoni. Parasitol Res. 1998, 84: 38-40. 10.1007/s004360050353.

    PubMed  CAS  Article  Google Scholar 

  53. 53.

    van Spriel AB, Puls KL, Sofi M, Pouniotis D, Hochrein H, Orinska Z, Knobeloch KP, Plebanski M, Wright MD: A regulatory role for CD37 in T cell proliferation. J Immunol. 2004, 172: 2953-2961.

    PubMed  CAS  Article  Google Scholar 

  54. 54.

    Li X, Rayford H, Shu R, Zhuang J, Uhal BD: Essential role for cathepsin D in bleomycin-induced apoptosis of alveolar epithelial cells. Am J Physiol Lung Cell Mol Physiol. 2004, 287: L46-L51. 10.1152/ajplung.00442.2003.

    PubMed  CAS  Article  Google Scholar 

  55. 55.

    Ueno T, Linder S, Na CL, Rice WR, Johansson J, Weaver TE: Processing of pulmonary surfactant protein B by napsin and cathepsin H. J Biol Chem. 2004, 279: 16178-16184. 10.1074/jbc.M312029200.

    PubMed  CAS  Article  Google Scholar 

  56. 56.

    Nakazono E, Kamata Y, Yamafuji K: Determination of the mRNA sequence of cathepsin Y, a cysteine endopeptidase from rat spleen, 1 and confirmation of its ubiquitous expression. Biol Chem. 2002, 383: 1971-1975. 10.1515/BC.2002.223.

    PubMed  CAS  Article  Google Scholar 

  57. 57.

    Sakamoto E, Sakao Y, Taniguchi Y, Yamafuji K: Cathepsin Y (a novel thiol enzyme) produces kinin potentiating peptide from the component protein of rat plasma. Immunopharmacology. 1999, 45: 207-214. 10.1016/S0162-3109(99)00079-X.

    PubMed  CAS  Article  Google Scholar 

  58. 58.

    Kuwano K, Hagimoto N, Nakanishi Y: The role of apoptosis in pulmonary fibrosis. Histol Histopathol. 2004, 19: 867-881.

    PubMed  CAS  Google Scholar 

  59. 59.

    French LE, Tschopp J: Constitutive Fas ligand expression in several non-lymphoid mouse tissues: implications for immune-protection and cell turnover. Behring Inst Mitt. 1996, 97: 156-160.

    PubMed  CAS  Google Scholar 

  60. 60.

    Crowell JA, Korytko PJ, Morrissey RL, Booth TD, Levine BS: Resveratrol Associated Renal Toxicity. Toxicol Sci. 2004, 82: 614-619. 10.1093/toxsci/kfh263.

    PubMed  CAS  Article  Google Scholar 

  61. 61.

    Nyberg A, Fagerberg A, Ahlqvist M, Jern C, Seeman-Lodding H, Aneman A: Pulmonary net release of tissue-type plasminogen activator during porcine primary and secondary acute lung injury. Acta Anaesthesiol Scand. 2004, 48: 845-850. 10.1111/j.1399-6576.2004.00432.x.

    PubMed  CAS  Article  Google Scholar 

  62. 62.

    Hrafnkelsdottir T, Ottosson P, Gudnason T, Samuelsson O, Jern S: Impaired endothelial release of tissue-type plasminogen activator in patients with chronic kidney disease and hypertension. Hypertension. 2004, 44: 300-304. 10.1161/01.HYP.0000137380.91476.fb.

    PubMed  CAS  Article  Google Scholar 

  63. 63.

    Sun CX, Robb VA, Gutmann DH: Protein 4.1 tumor suppressors: getting a FERM grip on growth regulation. J Cell Sci. 2002, 115: 3991-4000. 10.1242/jcs.00094.

    PubMed  CAS  Article  Google Scholar 

Download references


This work was supported by NIH R01 HL-52146, and R01 HL-071628 (to LL). ZC and NJ were supported by an AHA predoctoral fellowship 0315260Z, and 0315256Z. The authors thank Patricia Ayoubi for her help in printing microarray slides, Narendranath Reddy Chintagari and Dr. Narasaraju Telugu for their help in collecting organ samples, Keyu He for data management, Lindsey Tanner for image analysis, and Candice Marsh and Tisha Posey for editorial assistance.

Author information



Corresponding author

Correspondence to Lin Liu.

Additional information

Authors' contributions

ZC performed the microarray experiments and data analysis and drafted the manuscript. JC and TW participated in the DNA microarray hybridization. NJ contributed to real time PCR. LL conceived of the study, and participated in its design and coordination and helped to draft the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Chen, Z., Chen, J., Weng, T. et al. Identification of rat lung – prominent genes by a parallel DNA microarray hybridization. BMC Genomics 7, 47 (2006).

Download citation


  • Loop Design
  • Solute Carrier Family
  • Spot Signal
  • Biological Replication
  • Parallel Hybridization