  • Research article
  • Open access

Characterization of the global profile of genes expressed in cervical epithelium by Serial Analysis of Gene Expression (SAGE)



Serial Analysis of Gene Expression (SAGE) is a new technique that provides detailed quantitative and qualitative knowledge of a gene expression profile without prior knowledge of the sequences of the genes analyzed. We applied a modification of the SAGE methodology (microSAGE), suited to the analysis of limited quantities of tissue, to normal human cervical tissue obtained from a donor without histopathological lesions. Cervical epithelium consists mainly of cervical keratinocytes, which are the targets of human papillomavirus (HPV); persistent HPV infection of the cervical epithelium is associated with an increased risk of developing cervical carcinoma (CC).


We report here a transcriptome analysis of cervical tissue by SAGE, derived from 30,418 sequenced tags. These tags provide a wealth of information about the gene products involved in normal cervical epithelium physiology, as well as about genes involved in epidermal differentiation that had not previously been found in uterine cervix tissue.


This first comprehensive and profound analysis of the uterine cervix transcriptome should be useful for identifying genes involved in normal uterine cervix function, as well as candidate genes associated with cervical carcinoma.


Uterine cervical carcinoma (CC) is one of the most frequent malignancies in women worldwide, in both incidence and mortality, and is the leading cause of death among the Mexican female population [1]. Persistent infection with high-risk human papillomavirus (HPV) is considered the most important risk factor associated with the development of this tumor [2, 3]. Although HPV is a necessary cause of CC, it is not sufficient to trigger all the changes required for its development [4].

A number of recent studies of gene expression profiles in HPV-infected keratinocytes cultured in vitro and in CC clinical samples have provided an initial picture of the changes in gene expression induced by HPV and occurring in early CC [5–10]. Moreover, some studies have compared gene expression in normal versus tumor cervical samples with the aim of identifying potential tumor markers of clinical value [11–13].

At present, there are SAGE-based reports of genes expressed by keratinocytes derived from normal human epidermis and by mouse uterus [14–17]. However, no such study exists for the human cervix. Therefore, the aim of our study was to describe the first compendium of genes expressed in normal cervical epithelium, which is composed mainly of keratinocytes strongly influenced by hormones. To achieve this we used SAGE as the main methodology, because it can produce an accurate molecular picture of cervical tissue based on expressed genes. As SAGE does not depend on preexisting databases of expressed genes, it provides an unbiased view of gene expression profiles within mRNA populations [18]. SAGE allows the simultaneous quantitative and qualitative analysis of thousands of gene transcripts based on two principles: first, a 14-mer is sufficient to uniquely identify 95% of cell transcripts [19]; and second, cloning these 14 bp tags serially, with a restriction enzyme recognition sequence inserted as an anchor, considerably increases throughput. To obtain a catalog of expressed genes and their relative frequencies, we performed database analysis to relate each tag to its corresponding gene [20]. An important drawback of SAGE is that it requires a large amount of messenger RNA (2.5–5 μg polyA RNA); because our tissue supply was limited (a punch biopsy), we employed the MicroSAGE protocol [21]. The present report describes a partial transcriptome of a sample derived from normal cervical epithelium, which was used to construct a SAGE library with 30,418 sequenced tags.

Results and discussion

SAGE library derived from one normal uterine ectocervical sample

Our SAGE library, designated SAGE_cervix_normal_B_1, was obtained from ectocervical tissue of a healthy, sexually active 38-year-old woman who was not taking hormonal therapy or any other drug that could potentially alter cervical physiology. Histological analysis of this sample by H&E revealed normal ectocervical tissue, approximately 80% epithelium and 20% stroma, without evidence of glands. There were minimal inflammatory infiltrates in the periphery of the sample, which is considered normal for this type of tissue.

The SAGE library yielded 30,418 sequenced tags, which were used to generate a table of the genes expressed in normal human cervix. A complete list of the expressed genes is available at the SAGEmap website [22, 23]. The derived catalog of expressed genes represents the first attempt to generate a comprehensive and profound analysis of the cervical epithelium expression profile. The wealth of information obtained allows detection of genes involved in normal epithelium physiology, as well as possible target genes of HPV infection. In general, tag frequency in a typical SAGE experiment follows a normal distribution [24, 25]. Table 1 summarizes the general statistics of this library. As seen there, the distribution is normal: only a limited number of tags were either highly expressed or present at an extremely low frequency (4.4 and 4.9%, respectively). Tags with a frequency of 1 were not considered for quantitative purposes, because they are likely to represent artifacts of sequencing or of the SAGE procedure [26].
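The exclusion of single-copy tags described above can be sketched in a few lines of Python. This is a minimal illustration only; the function name, the `min_count` parameter, and the toy tag counts are assumptions made for the example and are not data from the library.

```python
from collections import Counter


def quantifiable_tags(tag_counts, min_count=2):
    """Return only the tags observed at least `min_count` times.

    With the default of 2, tags with a frequency of 1 are dropped,
    mirroring the exclusion of singletons from quantitative analysis.
    """
    return Counter({tag: n for tag, n in tag_counts.items() if n >= min_count})


# Toy counts for illustration (hypothetical tags, not library data).
toy_counts = Counter({"AAAAAAAAAA": 515, "CCCCCCCCCC": 3, "GGGGGGGGGG": 1})
kept = quantifiable_tags(toy_counts)
```

Raising `min_count` gives a stricter filter, at the cost of discarding genuinely rare transcripts.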

Table 1 SAGE data general statistics.

Representativity of the data

According to Zhang et al. [27], who mined SAGE data comprising 300,000 tags, 75% of mRNA consists of transcripts expressed at more than five copies per cell, and, in general, transcripts are expressed over a range from one to 5,300 copies per cell. With this in mind, our ~30,000-tag library represents 10% of the total tags analyzed in that study. The most frequently represented tag in the current report had a frequency of 515 (16,930 tags per million, TPM). From these data we estimate that this tag corresponds to an expression level of ~5,150 copies per cell, similar to what is observed in digital northerns of other top expressed tags in SAGE libraries (Figure 1A). We have to keep in mind, however, that in certain tissues some genes are expressed at much higher levels, such as growth hormone, with 149,630 TPM in pituitary gland [28]. Because SAGE analysis is a qualitative and quantitative assay of messenger RNA abundance that is not biased by cloning or polymerase chain reaction efficiency [29], our data provide an estimate of the genes normally expressed by normal uterine cervix.
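The tags-per-million normalization used for these comparisons (tag frequency × 1,000,000 / total number of sequenced tags) can be checked with a short Python sketch. The function name is an assumption made for this example; the two input values are those reported for the most abundant tag and the library size.

```python
def tags_per_million(tag_frequency, total_tags):
    """Normalize a raw tag count to tags per million (TPM)."""
    if total_tags <= 0:
        raise ValueError("total_tags must be positive")
    return tag_frequency * 1_000_000 / total_tags


# Most abundant tag in the cervical library: 515 counts out of 30,418 tags.
top_tag_tpm = tags_per_million(515, 30_418)
```

For these inputs the result is approximately 16,930 TPM, in line with the value quoted in the text.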

Figure 1

A) Comparison of the most highly expressed tags among different SAGE libraries. Normalized expression levels (TPM) are similar between libraries with different total numbers of sequenced tags, indicating comparable messenger abundance among top expressed genes. Expression levels were obtained from the SAGEmap website, http://www.ncbi.nlm.nih.gov/projects/SAGE/index.cgi?cmd=expsetup. (a) TPM: tags per million, a normalization used to compare libraries with different numbers of sequenced tags; TPM = (tag frequency × 1,000,000) / total number of sequenced tags. (b) DN: digital northern, indicating the expression level of a specific gene in a library. B) Graphical representation of expression levels (TPM) for three constitutive genes in several normal (N) and tumoral (T) tissues. Brain tissue libraries: SAGE BB542 whitematter (N) and SAGE Brain medulloblastoma B 98 04 P117 (T). Breast: SAGE Breast normal organoid (N) and B SAGE Breast carcinoma epithelium AP DCIS6 (T). Gastric: SAGE normal gastric body epithelial (N) and SAGE Hiroshima GC W246T (T). Liver: SAGE normal liver (N) and SAGE Liver cholangiocarcinoma B K2D (T). Kidney: SAGE Duke Kidney (N) and SAGE_Kidney_carcinoma_B_D2 (T). Colon: SAGE NC2 (N) and SAGE Tu98 (T). Prostate: SAGE PR317 normal prostate (N) and SAGE PR317 prostate tumor (T). Lung: SAGE normal lung (N) and SAGE Lung adenocarcinoma MD L10 (T). Expression levels are indicated as tags per million.

Among the most frequently expressed tags in our library (Table 2), some corresponded to ubiquitously expressed transcripts (GRIN2C, FTH1, GNS, RPLP2, RPL21). The presence of this type of gene is a common result in SAGE experiments, with an expected heterogeneity in expression levels [14, 15, 17, 19], and indicates a possible role as housekeeping genes (Figure 1B). By means of database analysis of SAGE libraries, Velculescu et al. found that ~1,000 genes are present at over five copies per cell in all normal and tumor tissues analyzed [30]. This list of genes identified by data mining is termed the minimal transcriptome (i.e., the set of genes expressed by every cell) and represents constitutively expressed genes. A search of our library against the minimal transcriptome listed in the supplementary information of Velculescu et al. [30] found >95% of these housekeeping genes (data not shown), further validating the cervical library.

Table 2 Top 20 expressed genes in normal cervical tissue.

Spectrum of genes expressed by normal cervical tissue

To better understand the functional categories represented in the global gene expression profile, we used the FatiGO data mining website [31, 32]. Figure 2 shows the distribution of expressed genes by functional categories defined by the Gene Ontology Consortium. As seen, the most frequent individual transcripts correspond to genes involved in maintenance and basic metabolism. On the other hand, genes corresponding to other processes, such as cell growth regulation, morphogenesis, cell differentiation, or death, were not as frequently expressed.

Figure 2

Functional categories assigned to individual genes identified in the normal cervical SAGE library. A gene can be assigned to more than one functional category. (a) The percentage was calculated from 3,764 initial genes, of which 2,720 had a Gene Ontology classification.

Top expressed non-ubiquitous genes in normal cervical tissue mainly correspond to epithelial growth and differentiation

It was important to determine which non-ubiquitous genes were predominantly expressed in normal cervix. As seen in Table 2, genes related to epithelial differentiation and squamous architectural maintenance are abundantly represented in our library. These include S100A8, S100A9, and SPRR3, which belong to a complex of genes that are coordinately regulated during keratinocyte differentiation. This complex is called the epidermal differentiation complex (EDC) and is located on chromosome 1q21 [33, 34]. These genes share spatial and temporal expression patterns and interrelated functions, and are grouped into three related gene families: cornified envelope precursor proteins (involucrin, loricrin, and the small proline-rich proteins [SPRRs]); intermediate filament-associated proteins (profilaggrin and trichohyalin); and calcium binding proteins (the S100As) (reviewed in [35]). Approximately 30 genes belonging to the EDC are clustered together in a region of about 2 Mb, of which 20 are expressed in the cervical SAGE library (Table 3).

Table 3 Genes belonging to 1q21 epidermal differentiation complex (EDC) expressed in cervical tissue

End point RT-PCR analysis confirms expression of genes detected by SAGE

It was important to confirm, by a different technique, the expression of some representative EDC genes in different normal cervical tissues. For this, we chose end point reverse transcriptase polymerase chain reaction (RT-PCR) analysis. Figure 3 shows the expression of five EDC genes in HPV-negative tissue samples with no histopathologic lesion. As expected, the majority of cases expressed these genes. However, there were some differences in the level of expression among the different normal samples. This could be because samples were taken on different days of the menstrual cycle (hormonal influence) or because of unknown physiological differences among individuals.

Figure 3

Expression of genes clustered in 1q21 in normal cervical tissues. One hundred nanograms of total RNA purified from each sample was used in one RT-PCR reaction with gene-specific primers; one tenth of each RT-PCR reaction was then subjected to agarose gel electrophoresis. MW: molecular weight marker; C1–C6: six different normal cervical samples.

Minor expression of fibroblast-related genes in cervical tissue

The gene expression catalog reported here was obtained from a heterogeneous population of cells composed mainly of epithelial keratinocytes at different stages of differentiation (basale, spinosum and granulosum strata). Nevertheless, these tissues also contain fibroblasts associated with connective tissue, in addition to other minor cell populations. To identify which genes are related to fibroblasts, we compared our library with a SAGE library derived from neonatal foreskin primary fibroblasts (Agnes Baross, British Columbia Genome Sciences Centre). We found 923 gene tags shared by both libraries, which could be due to the presence of fibroblasts in the cervix SAGE library (supplementary information). The shared genes with known biological function are mainly involved in processes such as signal transduction, regulation of transcription and cell adhesion. We consider it important to identify minor contributions to the global gene expression profile in a heterogeneous cell population; however, it should be noted that unknown differences between cervical and neonatal foreskin fibroblasts could exist.
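The library comparison described above amounts to a set intersection after ubiquitous tags have been removed from both libraries. The following Python sketch illustrates the idea under stated assumptions: the function name and the toy tag sets are hypothetical, and each library is represented simply as a collection of tag sequences.

```python
def shared_nonubiquitous_tags(library_a, library_b, ubiquitous):
    """Return tags present in both libraries, excluding ubiquitous tags.

    Each argument is any iterable of tag sequences (strings).
    """
    ubiquitous = set(ubiquitous)
    return (set(library_a) - ubiquitous) & (set(library_b) - ubiquitous)


# Toy example with hypothetical tags (not data from either library).
cervix = {"AAAAAAAAAA", "CCCCCCCCCC", "GGGGGGGGGG"}
fibroblast = {"CCCCCCCCCC", "GGGGGGGGGG", "TTTTTTTTTT"}
housekeeping = {"GGGGGGGGGG"}
shared = shared_nonubiquitous_tags(cervix, fibroblast, housekeeping)
```

A shared tag only suggests a fibroblast contribution; as noted above, the two fibroblast populations may differ, so tags found by this comparison are candidates rather than confirmed fibroblast transcripts.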


To our knowledge, this is the first effort to obtain a global profile of gene expression in normal cervical tissue. This was accomplished by means of a methodology that produced an accurate catalog of the genes expressed in this tissue. Analysis of gene expression revealed genes involved in keratinocyte differentiation. These genes have not previously been detected in cervical epithelium by traditional methodologies such as RT-PCR or in situ hybridization. Although our SAGE library was derived from a single donor, the majority of the normal samples analyzed by RT-PCR expressed the selected genes, indicating reproducibility in human samples. SAGE is a complex and expensive methodology, mainly because of the large sequencing effort required to produce SAGE libraries. Nevertheless, the wealth of information derived from these libraries justifies the effort and provides better knowledge of cervical biology and physiology. In the near future, it could also provide insight into cervical physiology, HPV infection, and other pathologies affecting cervical tissue.



Normal cervices were obtained from women attending the Dysplasia Clinic at the General Hospital of Mexico, SS, who had negative Pap smears confirmed by histopathological analysis and had undergone hysterectomy due to uterine myomatosis. All patients were of reproductive age and none had received hormonal therapy or contraceptives. All the described procedures were evaluated and approved by the local ethics committee of the Mexican Institute of Social Security. Written informed consent was obtained from all patients. All tissue samples were divided longitudinally into three sections: the central part was snap-frozen in liquid nitrogen and stored at -70°C until nucleic acid extraction, and the other two were fixed overnight in 70% ethanol and paraffin embedded at the Department of Pathology, Oncology Hospital, National Medical Center SXXI, Mexico. Serial sections from these fractions, stained with hematoxylin/eosin, were inspected to confirm that the tissue was representative.

HPV detection and typing

Genomic DNA was extracted from the phenol phase left by the TRIzol reagent (Gibco BRL, USA) RNA isolation protocol and amplified by PCR with MY11/MY09 primers [36] (Table 4). PCR products were separated by electrophoresis on 1% agarose gel. Only HPV negative samples were included in this study.

Table 4 Oligonucleotides sequences used in this work.

Micro SAGE protocol

Micro SAGE was performed according to Datson et al. [21] with minor modifications, using the I-SAGE kit (Invitrogen, San Diego, CA, USA). RNA was isolated in TRIzol according to the manufacturer's instructions. Five μg of total RNA was used as input material. A heating step at 65°C for 10 minutes followed by 2 minutes on ice was introduced to allow better separation of concatemers [37]. Products greater than 300 bp and smaller than 2,000 bp were excised, extracted and cloned into the Sph I site of the pZero vector. Clones were selected and screened for inserts by PCR. The cervix library was sequenced by Agencourt through the SAGE sequencing service (CGAP collaboration, GR). Sequence files were analyzed with the SAGE300 software [18, 20], which identifies the anchoring enzyme sites and extracts the two tags flanked by Nla III sites. Gene identity and UniGene cluster assignment of each SAGE tag were obtained using the tag-to-gene "reliable" map from the SAGEmap NCBI site [22, 23]. The extracted tags were uploaded to SAGEmap and the corresponding accession numbers were retrieved using the H. sapiens NCBI-GenBank database.
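As an illustration of the tag-extraction step, the following Python sketch splits a concatemer sequence at the Nla III anchoring site (CATG) and takes one tag from each end of every ditag. It is not the SAGE300 implementation; the function names, the 10-base tag length following the anchor (which together with CATG gives the 14 bp tag), and the 20–26 base ditag length window are assumptions made for this example.

```python
from collections import Counter

ANCHOR = "CATG"        # Nla III recognition site
TAG_LEN = 10           # assumed number of bases kept after the anchor
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_tags(concatemer, min_ditag=20, max_ditag=26):
    """Extract tags from one concatemer read.

    The read is split at every anchor site. The first and last pieces are
    flanking sequence and are skipped; each inner piece within the assumed
    length window is treated as a ditag and yields two tags, one read
    forward from its left end and one read as the reverse complement of
    its right end.
    """
    pieces = concatemer.upper().split(ANCHOR)
    tags = []
    for ditag in pieces[1:-1]:
        if min_ditag <= len(ditag) <= max_ditag:
            tags.append(ditag[:TAG_LEN])
            tags.append(reverse_complement(ditag[-TAG_LEN:]))
    return tags


def count_tags(concatemers):
    """Tally tag frequencies over an iterable of concatemer reads."""
    counts = Counter()
    for read in concatemers:
        counts.update(extract_tags(read))
    return counts
```

A production pipeline would also need to discard duplicate ditags and low-quality reads; those steps are omitted here to keep the sketch short.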

Reverse Transcription-Polymerase Chain Reaction (RT-PCR) analysis

Total RNA was extracted from six normal cervical tissues using TRIzol and quantified by densitometric analysis, and its quality was evaluated by denaturing gel electrophoresis. Contaminating DNA was digested and removed with RNase-free DNase (Promega). Expression analysis was performed using 100 ng of total RNA in an RT-PCR reaction (Access RT-PCR System, Promega). The mRNA was reverse-transcribed at 48°C for 45 min. After an initial denaturation at 94°C for 2 minutes, the double-stranded cDNA was amplified for 40 cycles of denaturation at 94°C for 30 seconds, annealing at 54–60°C for 1 minute and extension at 70°C for 2 minutes, with specific oligonucleotides (Table 4), in a Perkin Elmer 480 Thermocycler.

Sense and antisense oligonucleotide sequences for the S100A8, S100A9, SPRR3, NICE-3 and NICE-4 genes were designed with the Primerquest program [38]. GAPDH gene expression was used as an internal control.


  1. Mexican Ministry of Health: Registro Histopatológico de Neoplasias Malignas, Compendio de mortalidad y morbilidad. 1998, Secretaría de Salud, México

  2. Schiffman MH, Bauer HM, Hoover RN, Glass AG, Cadell DM, Rush BB, Scott DR, Sherman ME, Kurman RJ, Wacholder S: Epidemiologic evidence showing that human papillomavirus infection causes most cervical intraepithelial neoplasias. J Natl Cancer Inst. 1993, 85: 958-964.

  3. Villa L: Human papillomaviruses and cervical cancer. Adv Cancer Res. 1997, 71: 321-341.

  4. Franco EL, Rohan TE, Villa LL: Epidemiologic evidence, human papillomavirus infection as a necessary cause of cervical cancer. J Natl Cancer I. 1999, 91: 506-511. 10.1093/jnci/91.6.506.

  5. Ruutu M, Peitsaro P, Johansson B, Syrjanen S: Transcriptional profiling of a human papillomavirus 33-positive squamous epithelial cell line which acquired a selective growth advantage after viral integration. Int J Cancer. 2002, 100: 318-326. 10.1002/ijc.10455.

  6. Duffy CL, Phillips SL, Klingelhutz AJ: Microarray analysis identifies differentiation-associated genes regulated by human papillomavirus type 16 E6. Virology. 2003, 314: 196-205. 10.1016/S0042-6822(03)00390-8.

  7. Thomas JT, Oh ST, Terhune SS, Laimins LA: Cellular changes induced by low-risk human papillomavirus type 11 in keratinocytes that stably maintain viral episomes. J Virol. 2001, 75: 7564-7571. 10.1128/JVI.75.16.7564-7571.2001.

  8. Garner-Hamrick PA, Fostel JM, Chien WM, Banerjee NS, Chow LT, Broker TR, Fisher C: Global effects of human papillomavirus type 18 E6/E7 in an organotypic keratinocyte culture system. J Virol. 2004, 78: 9041-9050. 10.1128/JVI.78.17.9041-9050.2004.

  9. Toussaint-Smith E, Donner DB, Roman A: Expression of human papillomavirus type 16 E6 and E7 oncoproteins in primary foreskin keratinocytes is sufficient to alter the expression of angiogenic factors. Oncogene. 2004, 23: 2988-2995. 10.1038/sj.onc.1207442.

  10. Nees M, Geoghegan JM, Hyman T, Frank S, Miller L, Woodworth CD: Papillomavirus type 16 oncogenes downregulate expression of interferon-responsive genes and upregulate proliferation-associated, NF-kappaB-responsive genes in cervical keratinocytes. J Virol. 2001, 75: 4283-4296. 10.1128/JVI.75.9.4283-4296.2001.

  11. Chen Y, Miller C, Mosher R, Zhao X, Deeds J, Morrissey M, Bryant B, Yang D, Meyer R, Cronin F, Gostout BS, Smith-McCune K, Schlegel R: Identification of cervical cancer markers by cDNA and tissue microarrays. Cancer Res. 2003, 63: 1927-1935.

  12. Shim C, Zhang W, Rhee CH, Lee JH: Profiling of differentially expressed genes in human primary cervical cancer by complementary DNA expression array. Clin Cancer Res. 1998, 4: 3045-3050.

  13. Vazquez-Ortiz G, Pina-Sanchez P, Vazquez K, Duenas A, Taja L, Mendoza P, Garcia JA, Salcedo M: Overexpression of cathepsin f, matrix metalloproteinases 11 and 12 in cervical cancer. BMC Cancer. 2005, 5: 68-10.1186/1471-2407-5-68.

  14. van Ruissen F, Jansen BJ, de Jongh GJ, Zeeuwen PL, Schalkwijk J: A partial transcriptome of human epidermis. Genomics. 2002, 79: 671-678. 10.1006/geno.2002.6756.

  15. van Ruissen F, Jansen BJ, de Jongh GJ, van Vlijmen-Willems IM, Schalkwijk J: Differential gene expression in premalignant human epidermis revealed by cluster analysis of serial analysis of gene expression (SAGE) libraries. FASEB J. 2002, 16: 246-248.

  16. Jansen BJ, van Ruissen F, de Jongh G, Zeeuwen PL, Schalkwijk J: Serial analysis of gene expression in differentiated cultures of human epidermal keratinocytes. J Invest Dermatol. 2001, 116: 12-22. 10.1046/j.1523-1747.2001.00218.x.

  17. Larose M, St-Amand J, Yoshioka M, Belleau P, Morissette J, Labrie C, Raymond V, Labrie F: Transcriptome of mouse uterus by serial analysis of gene expression (SAGE): comparison with skeletal muscle. Mol Reprod Dev. 2004, 68: 142-148. 10.1002/mrd.20065.

  18. Velculescu VE, Zhang L, Zhou W, Vogelstein J, Basrai MA, Bassett DE, Hieter P, Vogelstein B, Kinzler KW: Characterization of the yeast transcriptome. Cell. 1997, 88: 243-251. 10.1016/S0092-8674(00)81845-0.

  19. Dinel S, Bolduc C, Belleau P, Boivin A, Yoshioka M, Calvo E, Piedboeuf B, Snyder EE, Labrie F, St-Amand J: Reproducibility, bioinformatic analysis and power of the SAGE method to evaluate changes in transcriptome. Nucleic Acids Res. 2005, 33: e26-10.1093/nar/gni025.

  20. Velculescu V, Zhang L, Vogelstein B, Kinzler K: Serial analysis of gene expression. Science. 1995, 270: 484-487.

  21. Datson N, Perk-de Jong J, van den Berg M, de Kloet E, Vreugdenhil E: MicroSAGE: a modified procedure for serial analysis of gene expression in limited amounts of tissue. Nucleic Acids Res. 1999, 27: 1300-1307. 10.1093/nar/27.5.1300.

  22. SAGEmap NCBI. []

  23. Lash AE, Tolstoshev CM, Wagner L, Schuler GD, Strausberg RL, Riggins GJ, Altschul SF: SAGEmap: a public gene expression resource. Genome Res. 2000, 10: 1051-1060. 10.1101/gr.10.7.1051.

  24. Audic S, Claverie JM: The Significance of Digital Gene Expression Profiles. Genome Res. 1997, 7: 986-995.

  25. Ruijter JM, Van Kampen AH, Baas F: Statistical evaluation of SAGE libraries: consequences for experimental design. Physiol Genomics. 2002, 11: 37-44.

  26. Yamamoto M, Wakatsuki T, Hada A, Ryo A: Use of serial analysis of gene expression (SAGE) technology. J Immunol Methods. 2001, 250: 45-66. 10.1016/S0022-1759(01)00305-2.

  27. Zhang L, Zhou W, Velculescu VE, Kern SE, Hruban RH, Hamilton SR, Vogelstein B, Kinzler KW: Gene expression profiles in normal and cancer cells. Science. 1997, 276: 1268-1272. 10.1126/science.276.5316.1268.

  28. Nishida Y, Yoshioka M, St-Amand J: The top 10 most abundant transcripts are sufficient to characterize the organs functional specificity: evidences from the cortex, hypothalamus and pituitary gland. Gene. 2005, 344: 133-141. 10.1016/j.gene.2004.09.007.

  29. Polyak K, Riggins GJ: Gene Discovery Using the Serial Analysis of Gene Expression Technique: Implications for Cancer Research. J Clin Oncol. 2001, 19: 2948-2958.

  30. Velculescu VE, Madden SL, Zhang L, Lash AE, Yu J, Rago C, Lal A, Wang CJ, Beaudry GA, Ciriello KM, Cook BP, Dufault MR, Ferguson AT, Gao Y, He TC, Hermeking H, Hiraldo SK, Hwang PM, Lopez MA, Luderer HF, Mathews B, Petroziello JM, Polyak K, Zawel L, Kinzler KW: Analysis of human transcriptomes. Nat Genet. 1999, 23: 387-388. 10.1038/70487.

  31. Fatigo Data mining. []

  32. Al-Shahrour F, Díaz-Uriarte R, Dopazo J: FatiGO: a web tool for finding significant associations of Gene Ontology terms to groups of genes. Bioinformatics. 2004, 20: 578-580. 10.1093/bioinformatics/btg455.

  33. Marenholz I, Volz A, Ziegler A, Davies A, Ragoussis I, Korge BP, Mischke D: Genetic analysis of the epidermal differentiation complex (EDC) on human chromosome 1q21: chromosomal orientation new markers a 6-Mb YAC contig. Genomics. 1996, 37: 295-302. 10.1006/geno.1996.0563.

  34. Elder JT, Zhao X: Evidence for local control of gene expression in the epidermal differentiation complex. Exp Dermatol. 2002, 11: 406-412. 10.1034/j.1600-0625.2002.110503.x.

  35. Williams RE, Broad S, Sheer D, Ragoussis J: Subchromosomal Positioning of the Epidermal Differentiation Complex (EDC) in Keratinocyte Lymphoblast Interphase Nuclei. Exp Cell Res. 2002, 272: 163-175. 10.1006/excr.2001.5400.

  36. Resnick RM, Cornelissen MT, Wright DK, Eichinger GH, Fox HS, ter Schegget J, Manos MM: Detection and typing of human papillomavirus in archival cervical cancer specimens by DNA amplification with consensus primers. J Natl Cancer Inst. 1990, 82: 1477-1484.

  37. Kenzelmann M, Muhlemann K: Substantially enhanced cloning efficiency of SAGE (Serial Analysis of Gene Expression) by adding a heating step to the original protocol. Nucleic Acids Res. 1999, 27: 917-918. 10.1093/nar/27.3.917.

  38. Hsu EM, McNicol PJ, Guijon FB, Paraskevas M: Quantification of HPV-16 E6-E7 transcription in cervical intraepithelial neoplasia by reverse transcriptase polymerase chain reaction. Int J Cancer. 1993, 55: 397-401.

  39. Primerquest program. []



This work was partially supported by grants from CONACyT (F7114 and 34686-M, MS) and FOFOI-IMSS (MS). Funding for the sequencing of the SAGE library was provided by the National Cancer Institute's Cancer Genome Anatomy Project (NIH 23XS073 and 24XS070, GR). During this work CPP, GVO and PPS were recipients of CONACyT, DGEP-UNAM and IMSS fellowships.

This work was submitted in partial fulfilment of the requirements for the D. Sc. degree for PPC at the Doctorado en Ciencias Biomédicas, Universidad Nacional Autónoma de México.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Mauricio Salcedo.

Additional information

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

C.P.P. carried out the microSAGE protocol, the real time RT-PCR validations and the bioinformatics analysis, and wrote the manuscript. G.R. provided sequencing of the SAGE library. J.M. helped to write the manuscript and participated in discussions. H.G., A.H. and P.P.S. helped with the bioinformatics analysis and database comparisons. M.S. is the principal investigator and was involved in the conceptualization, design and writing of the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material


Additional data file 1: Tags shared between fibroblast and cervix. Tags found in both BJ dermal fibroblasts (57,573 tags) and SAGE_cervix_normal_B_1 (30,418 tags). Ubiquitous tags were deleted from both libraries. The BJ dermal fibroblast library was derived from neonatal foreskin primary fibroblasts cultured in Ham's F10 medium supplemented with 10% fetal bovine serum, 100 U/ml penicillin, and 100 μg/ml streptomycin. The library was developed by Agnes Baross at the British Columbia Genome Sciences Centre. Columns are: Gene TAG, Cluster ID and Gene Name (XLS 127 KB)

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Pérez-Plasencia, C., Riggins, G., Vázquez-Ortiz, G. et al. Characterization of the global profile of genes expressed in cervical epithelium by Serial Analysis of Gene Expression (SAGE). BMC Genomics 6, 130 (2005).
