Increased epithelial stem cell traits in advanced endometrial endometrioid carcinoma
BMC Genomics volume 10, Article number: 613 (2009)
It has been recognized that cancer cells acquire characteristics reminiscent of those of normal stem cells, and the degree of stem cell gene expression correlates with patient prognosis. Lgr5(+) or CD133(+) epithelial stem cells (EpiSCs) have recently been identified, and these cells are susceptible to neoplastic transformation. It is unclear, however, whether genes enriched in EpiSCs also contribute to tumor malignancy. Endometrial endometrioid carcinoma (EEC) is the dominant type of endometrial cancer and remains among the most common female cancers. Clinically, endometrial carcinoma is classified into 4 FIGO stages by the degree of tumor invasion and metastasis, and the survival rate is low in patients with higher-stage tumors. Identifying genes shared between advanced tumors and stem cells will not only unmask the mechanisms of tumor malignancy but also provide novel therapeutic targets.
To identify EpiSC genes in late (stages III-IV) EECs, a molecular signature distinguishing early (stages I-II) and late EECs was first identified to delineate late EECs at the genomics level. ERBB2 and CCR1 were activated in late EECs, while the tumor suppressors APBA2 (MINT2) and CDK inhibitor p16 were activated in early EECs. The MAPK pathway was significantly up in late EECs, indicating that drugs targeting this canonical pathway might be useful for treating advanced EECs. A six-gene mini-signature was further identified to differentiate early from advanced EECs in both the training and testing datasets. Advanced, invasive EECs possessed a clear EpiSC gene expression pattern, partly explaining why these tumors are more malignant.
Our work provides new insights into the pathogenesis of EECs and reveals a previously unknown link between adult stem cells and the histopathological traits of EECs. Shared EpiSC genes in late EECs may contribute to the stem cell-like phenotypes shown by advanced tumors and hold the potential of being candidate therapeutic targets and novel prognosis biomarkers.
Tumor development, progression, and prognosis remain at the forefront of medical research. Two hypotheses of the origin of cancer have existed for many decades. One hypothesis postulates that an adult stem or precursor cell is the cell of origin for cancer, whereas the other holds that a somatic cell can be mutated and then dedifferentiated or reprogrammed to regain properties associated with both cancer cells and stem cells [1–3]. The discovery of a subpopulation of tumor stem cells (TSCs) in leukemia and solid cancers has strengthened the stem cell hypothesis. Glioblastomas also possess characteristics and gene expression patterns of local neural stem cells (NSCs), and artificially introducing cancer-associated mutations into stem or lineage-restricted precursor cells can indeed turn them into cancer-initiating cells: all mice that received the mutations developed medulloblastomas [6, 7]. Another example of the adult stem cell as the cell of origin of cancer has recently been reported in chronic myeloid leukemia (CML): restricting BCR-ABLp210 expression to mouse Sca1(+) hematopoietic stem cells is sufficient to induce CML that recapitulates the human disease. This evidence supports the idea that mutations of stem cells may initiate the carcinogenic process of certain, although not necessarily all, tumors.
On the other hand, the importance of somatic or tumor cell mutation and dedifferentiation has not been excluded completely. It has been recognized that during malignant transformation, cancer cells acquire genetic mutations that override the normal mechanisms controlling cellular proliferation. Human tumor cells can be created from healthy somatic cells with defined genetic elements. Even if cancers originate from mutated stem cells, newly acquired mutations in tumors still contribute to cell malignancy and therapy resistance. Cancer cells also acquire characteristics reminiscent of those of normal stem cells. Clinically, cancers with poorly differentiated pathological grading usually respond worse to therapy than those with well-differentiated morphology. The degree of embryonic gene re-expression correlates with pivotal tumor features and patient prognosis [10, 11]. It is known that colon cancers adopt a broad program encompassing embryonic colon development. In poorly differentiated breast cancer, gliomas and bladder carcinoma, an embryonic stem cell (ESC)-like gene expression signature is exhibited, and the degree of ESC program recapitulation correlates with tumor stage and patient survival. Recent studies demonstrated that Snail, a potent oncogene which can induce epithelial-mesenchymal transition (EMT), contributes to the acquisition of stem cell traits in breast cancer cells [14, 15]. Pre-existing cancerous lesions may become more malignant through the accumulation of new oncogenic mutations (such as Snail) that can induce cell dedifferentiation. Identifying genes shared between transformed cells, especially the more malignant ones, and stem cells will help to unmask the pathogenesis of tumors, as well as provide novel therapeutic targets and prognostic biomarkers.
Endometrial carcinoma of the female genital tract can be divided into two forms: endometrial endometrioid carcinoma (EEC; Type I), which accounts for 70-80% of cases and is estrogen-related, and Type II tumors (papillary serous or clear cell tumors), which account for 20% of cases and are unrelated to estrogen stimulation. Clinically, endometrial carcinoma is classified into 4 FIGO stages by the degree of invasion and metastasis: stage I tumors are limited to the uterine body and stage II tumors extend to the uterine cervix. Both stages are considered less invasive, although stage IIB cases are characterized by a less favorable prognosis. In contrast, tumors of stages III-IV are invasive: in stage III there is regional tumor spread and in stage IV there is bulky pelvic disease or distant spread. Approximately 72% of endometrial carcinomas are stage I, 12% are stage II, 13% are stage III, and 3% are stage IV. The survival rate also falls with increasing stage: 80-90% in stage I, 70-80% in stage II, 40-60% in stage III, and 20% in stage IV. Identifying genes abundant in late EECs can not only unmask the mechanisms of tumor malignancy but also provide novel therapeutic targets. Recently, Lgr5- or CD133-positive crypt stem cells of the intestinal tract were identified, and these cells were shown to be one of the cells of origin of intestinal cancer [18, 19]. OLFM4 is also a new, robust marker for stem cells in human intestine and marks a subset of colorectal cancer cells. Disruption of beta-catenin in cells positive for CD133 resulted in a gross disruption of crypt architecture and a disproportionate expansion of CD133(+) cells at the crypt base. It is unclear, however, whether genes highly expressed in epithelial stem cells (EpiSCs) also contribute to tumor invasiveness, malignancy and therapy resistance. A broad description of the stem cell traits retained in EECs is therefore crucial.
In this study we addressed the molecular basis of endometrial cancer and assessed the expression of epithelial precursor genes in advanced EEC. To examine the genes shared between EpiSCs and late EECs, we first needed to determine the gene composition of EECs at different stages. For this purpose we applied gene expression microarrays and machine learning algorithms to filter genes differentially expressed between early (stages I-II) and late (stages III-IV) EECs. After obtaining genes unique to EECs of different stages, we then related the transcriptional programs of EpiSCs and late EECs. This approach identified a total of 217 probe sets differentiating EECs of different stages and, moreover, showed that late EECs possess a clear EpiSC gene expression pattern, partly explaining why these tumors are more malignant and fatal.
Molecular signatures of early and late stage EECs
To identify epithelial stem cell genes in late EECs, we first delineated early (FIGO stages I and II) and late (FIGO stages III and IV) EECs at the genomics level. We explored genes differentially expressed between early and late EEC tissues using the Affymetrix U133 Plus 2.0 array. The demographics of patients in the training and testing cohorts are in Tables 1 and 2, respectively. Tumor samples were compared to each other to minimize stromal and myometrial contamination as well as female-specific genes. A multidimensional scaling (MDS) plot using the whole transcriptome showed that the mRNA profiles of normal and cancerous tissues are different (Figure 1A). We then searched for genes distinguishing early and late EECs according to a statistical pipeline we used [21, 22]. A total of 678 probe sets could differentiate early and late stage samples, as well as discriminate 23 normal endometrium and 33 tumor tissues (Figure 1B; the positive false discovery rate (pFDR) cutoff q values are shown).
The discrimination ability of these 678 probe sets was evaluated by a supervised machine learning strategy that combines the weighted voting algorithm and leave-one-out cross validation (LOOCV) [23–25]. An error rate of 12.1% (2 out of 24 early cancers and 2 out of 9 late samples, i.e. 4 of 33 samples; P < 0.001 by permutation test) was found (Figure 1C and Additional file 1). However, we found that the top 217 features (ranked by the weighted value of each probe set) form the largest panel with better discrimination ability than the 678-probeset signature (error rate 6.1% vs. 12.1%; Figure 1C, upper panel): 2 out of 24 early EEC tissues are classified into the late group while all 9 late ones are classified correctly (Figure 1D). MDS analysis supports the superior classification power of these 217 probe sets: only 2 early samples express the late EEC gene signature and are grouped together with the late cases (Figure 1E). When these 217 probe sets were applied to an independent testing data set containing 15 early EEC cases, 1 out of 15 early tissues (error rate 6.7%; P < 0.001 by permutation test) was misgrouped (Figure 1F).
In-depth exploration of EEC-related genes
To better understand how the filtered genes are distributed between early and late EECs, a gene expression heat map for those 217 probe sets was drawn (Figure 2). This heat map showed the distinct gene expression patterns of early and late EEC tumor tissues. Consistent with the classification data obtained by prediction strength (PS) analysis in Figure 1D, hierarchical clustering showed that only 2 early cases in the training data set are misclassified (indicated by arrows; Figure 2).
Those 217 probe sets correspond to 177 known genes (with gene symbols) and 29 cDNAs to which no gene symbols have been assigned yet (all in Additional file 2). Among them, 58 genes/cDNAs are predominantly up in early EECs while 25 are down (Figure 2). In contrast, 48 genes/cDNAs are particularly high in late EECs while another 75 are low (Figure 2). The details of known genes (especially those with known function) are in Tables 3, 4, 5, 6 and 7. Many of these genes, such as CD163, MSR1 (CD204), ERBB2 oncogene (also known as HER-2/neu) [28, 29], CSTA (stefin A) and CCR1, have been associated with tumor malignancy and poor patient outcomes in EEC or other cancers (Table 3, bold). CD163 and MSR1 (macrophage scavenger receptor 1; CD204) are markers for M2 macrophages, whose infiltration in tumor lesions is correlated with the histological grade of gliomas (Table 3, bold). These consistent findings support the reliability of our gene lists. We also validated our array data by performing immunohistochemical staining on Taiwanese EEC cases. ERBB2 was indeed more abundant in stages III and IV EEC tissues (Figure 3).
To gain more insight into the functional consequences of differential gene expression, we performed gene set enrichment analysis for the filtered genes. Signature probe sets were submitted to a Gene Ontology (GO) database search to find statistically over-represented functional groups within these genes. The biological processes statistically overrepresented (P < 0.05) in late stage-enriched genes are shown in Table 5. These predominant processes include immune system process, second-messenger-mediated signaling (genes also involved in cyclic nucleotide second messenger (P = 0.0306) are bold), MAP kinase activity (genes also involved in the inactivation of MAPK activity (P = 0.0459) are bold), membrane organization and biogenesis, regulation of catalytic activity (genes also involved in the positive regulation of catalytic activity (P = 0.0182) are bold), and cell surface receptor-linked signal transduction (Table 5).
For genes enriched in early EECs, the CDKN2A (P16) tumor suppressor was found to be inversely correlated with EEC prognosis (Table 6, bold). Another tumor suppressor is APBA2 (amyloid beta (A4) precursor protein-binding, family A, member 2; also known as MINT2), which is frequently methylated and silent in colorectal carcinoma and gastric carcinoma. Hypermethylation of GPR37 is also frequently found in acute myeloid leukemia. In terms of oncogenes, ROBO2 (roundabout, axon guidance receptor, 2), a receptor of the SLIT2 axon guidance and cell migration growth factor, is associated with poor prognosis of breast cancer. ESCO2 (establishment of cohesion 1 homolog 2) is tightly correlated with BRCA1-dependent and various cell-type specific carcinogenesis, and the DPPA4 pluripotency factor is enriched in seminomas. VANGL1 (also known as KITENIN or STB2) acts as an executor of cell motility in colon cancer cells and thereby controls cell invasion, which may contribute to promoting metastasis. The abundant expression of known oncogenes in early EECs also suggests that the early EEC cases contain a high percentage of epithelial tumor cells rather than merely stromal and myometrial contamination.
A six-gene signature distinguishing early and late EECs
When evaluating the classification performance of the filtered genes, we noticed that the top 6 genes could already distinguish early and late EECs, and these 6 genes gave the same diagnostic power as the 217 probe sets in the training cohort (Figure 4A). The same two early cases (one Stage IB and one Stage IIB) were misgrouped with the late ones (Figure 4B). When these 6 genes were applied to the testing data set, the lowest error rate was also achieved (Figure 4C, upper panel). Only 1 out of 15 early tissues (error rate 6.7%; P < 0.001 by permutation test) was misgrouped (Figure 4C, lower panel). The same Stage IB sample was misclassified whether only these 6 genes or the entire 217 probe sets were applied (Figure 1F). Thus, these 6 genes hold clinical potential as diagnostic biomarkers. These 6 genes are: (1) ATP-binding cassette, B (MDR/TAP), 11 (ABCB11); (2) archaemetzincin-2 (AMZ2); (3) amyloid beta (A4) precursor protein-binding A2 (APBA2); (4) LIM domain only 4 (LMO4); (5) hypothetical protein LOC647065 (LOC647065); and (6) Homo sapiens mRNA, clone IMAGE:5759975 (cDNA FLJ12258 fis) (Table 8). AMZ2 and APBA2 are up-regulated in early EECs. ABCB11, LOC647065 and cDNA FLJ12258 fis are down in tumors, especially in late EECs, while LMO4 is particularly down in early EECs.
Re-activation of epithelial stem cell genes in advanced EECs
Since our main goal is to identify EpiSC genes in EECs, we compared the gene expression profiles of EEC tissues of all 4 stages to that of normal CD133+ EpiSCs. When the 217 probe sets distinguishing early and late EECs were used to compare EECs and EpiSCs, EpiSCs were clearly most closely related to late EECs (Figure 5A). This impression is strengthened by calculating the average linkage distances between sample groups. Compared with early EECs, EECs of both stages III and IV are closer to EpiSCs, and to a similar extent (Figure 5B), suggesting the re-expression of EpiSC features in late EECs. A total of 26 EpiSC genes are overexpressed in advanced EECs (Figure 5C). Also, genes down-regulated in late EECs (the 77 probe sets in Figure 2) are absent in EpiSCs (Figure 5D). Most early EECs clustered together and expressed an intermediate level of EpiSC genes (Figure 5C-D), consistent with the distance analysis in Figure 5B.
EEC still ranks among the most fatal female cancers worldwide, and disease progression is very often accompanied by worse clinical outcomes and treatment failure. Identifying genes or canonical pathways associated with advanced cancer can help to unmask the mechanisms of tumor malignancy as well as provide novel drug targets. It has been recognized clinically that cancer cells, especially advanced and metastatic ones, possess characteristics reminiscent of those of normal stem cells. The degree of stem cell gene expression correlates with pivotal tumor features and patient prognosis [10, 11, 13]. Hence, identifying genes shared between late EECs and stem cells will provide new insights into cancer biology, as well as new prognostic markers and therapeutic targets. In this study, we identified a 217-probeset signature which could distinguish late (stages III-IV) from early (stages I-II) EECs (Figure 1). More array data were obtained for low-stage than for high-stage disease, which may be partly due to the fact that almost 90% of EECs are diagnosed early in clinical practice. We combined primary and metastatic late EEC samples in one group since their molecular profiles are indistinguishable (not shown). Prostate EpiSCs were used as a comparative group since array data for endometrial stem cells are not yet available. Nevertheless, prostate CD133+ cells are still epithelial stem cells and therefore good controls. Other EpiSC data should reproduce part of our findings.
Our results reveal a previously unrecognized link between genes associated with EpiSC identity and the histopathological traits of EECs. It is possible that these genes contribute to the stem cell-like phenotypes of late EECs. A total of 26 EpiSC genes were found to be overexpressed in late EECs (Figure 5C), and genes down-regulated in late EECs (Figure 2; 77 probe sets) are also absent in EpiSCs (Figure 5D). Among those 26 overexpressed genes are well-known oncogenes or stemness genes (Figure 5C, underlined). ADAM17 (A Disintegrin and A Metalloproteinase 17), also known as tumor necrosis factor-alpha converting enzyme (TACE) or less commonly CD156b, is a therapeutic target in multiple diseases, since major contemporary pathologies like cancer, inflammatory and vascular diseases seem to be connected to its cleavage abilities. CAP1 (adenylate cyclase-associated protein 1), overexpressed in pancreatic cancers, is involved in cancer cell motility. CAPG (capping protein (actin filament), gelsolin-like) also contributes to the motility of pancreatic cancer cells. PDCD10 (CCM3) is involved in cerebral cavernous malformations (CCM) and interacts with the Ste20-related kinase MST4 to promote cell growth and transformation via modulation of the ERK pathway. PSEN1 (presenilin 1) is involved in apoptosis, is overexpressed in high-risk patients with stage I non-small cell lung cancer (NSCLC), and is part of a prognostic signature for NSCLC patients. SENP2 (SUMO-specific protease 2) is highly expressed in trophoblast cells that are required for placentation, and targeted disruption of SENP2 in mice reveals its essential role in the development of all three trophoblast layers via modulation of the Mdm2-p53 pathway. The appearance of these known oncogenes or stemness genes in our data supports the reliability of our gene lists. The roles of EpiSC genes in both epithelial stem cell biology and EEC malignancy will be addressed further.
Several genes were previously suggested to be tumor suppressors. CSTA (cystatin A, or stefin A), a cysteine proteinase inhibitor, is implicated in preventing local and metastatic tumor spread. The risk of disease recurrence and disease-related death was thus higher in patients with squamous cell carcinoma of the head and neck who had low CSTA. NPAS2 (neuronal PAS domain protein 2) is a circadian gene as well as a putative tumor suppressor involved in the DNA damage response. PHC3 (polyhomeotic homolog 3), a component of the hPRC-H complex, associates with E2F6 during G0 and is lost in osteosarcoma tumors. Validating their expression in different stages of EECs by further immunohistochemistry studies may not only reveal novel malignancy mechanisms but also present new drug targets.
In the past few years, much effort has been devoted to exploring the mechanisms of EEC and additional molecular markers for predicting its prognosis using high-throughput genomics technology. Gene expression microarray (GEM) is a popular platform among these techniques. In this study we applied GEM and machine learning algorithms to filter out a 217-probeset signature for disease diagnosis. Many of the filtered genes have been linked to tumor progression and malignancy, supporting the reliability of our array data. Moreover, we narrowed this 217-probeset profile down to a six-gene mini-signature that differentiates early from late EECs in the training set. This signature was validated in an independent testing cohort (Figure 4). Owing to the small number of genes in this signature, it is now feasible to check their mRNA levels in patient tissues by real-time PCR in regular clinical labs. Recently a five-gene profile and a five-microRNA signature were identified for the prediction of clinical outcomes in non-small-cell lung cancer [49, 50]. Whether our six-gene signature correlates with relapse-free and overall survival among patients with EEC is unclear and remains to be elucidated. Also, whether the protein expression levels of these 6 genes correlate with their mRNA levels is unclear. Since most of the patients in both the training and testing data sets were Caucasian (Table 1), whether this gene signature can be applied to patients with various genetic backgrounds should also be studied.
In our datasets we noticed that a few early EEC cases already expressed late EEC genes and therefore could not be classified correctly (Figs. 1, 2). Since patients with late and metastatic EEC tend to have a poor prognosis, whether these unusual early cases have worse clinical outcomes is an interesting issue. It has been suggested that the prognostic potential of human tumors is inherent in early lesions. For example, the gene expression patterns in metastatic colorectal carcinoma are readily distinguishable from those associated with in situ tumors [24, 51]. A subset of primary tumors resembled metastatic tumors with respect to this gene-expression signature [24, 51]. Very recently Varmus and colleagues showed that when untransformed mouse mammary cells were introduced into the systemic circulation of a mouse, those cells could bypass transformation at the primary site and take up long-term residence in the lungs without forming ectopic tumors. Husemann et al. also observed that systemic spread can be an early step in breast cancer. Tumor cells can disseminate systemically from the earliest epithelial alterations and form micrometastases in bone marrow and lungs. Therefore, release from dormancy of early-disseminated cancer cells may frequently account for metachronous metastasis. The metastatic potential of human tumors is encoded in the bulk of a primary tumor and, at least in a subset of patients, metastatic capability is an inherent feature. Our EEC gene signatures therefore hold the potential of being a novel prognostic panel. More advanced therapy and closer clinical follow-up should be considered for early stage patients whose molecular features resemble those of EpiSCs.
In advanced EECs, tumor tissues express more genes abundant in CD133+ EpiSCs and have acquired a stem cell trait (Figure 5). The expression of these EpiSC genes in late EECs may be due to the re-expression of EpiSC features in late stage EECs, i.e., further mutations and stem cell gene reactivation in certain early EECs. The intermediate EpiSC gene expression level in early EECs supports this point (Figure 5A and 5C-D). Recent studies demonstrated that EMT contributes to the acquisition of stem cell traits in cancer cells and that induction of the EMT inducer Snail results in stemness gene expression [14, 15]. Whether EMT also contributes to EEC progression and metastasis is an interesting issue to follow. However, we cannot rule out the possibility that certain late EECs arise from an independent, rapidly progressing cancer utilizing stemness molecular pathways. According to the tumor stem cell theory, cancer cells may originate from different cancer stem cells that acquire distinctive oncogenic mutations. Certain early EECs may have the capacity to progress to late stage disease because they arose from the same mutated progenitor cells as late EECs. The observation that several early EEC cases already express EpiSC genes (Figure 1D and 5C) favors the latter hypothesis. These 2 situations may both exist in vivo, and our profiling work cannot yet distinguish between them. Nevertheless, the genes filtered here will provide clinicians with novel prognostic markers and therapeutic targets.
In summary, we reveal distinct epithelial stem cell traits and gene expression patterns in late EECs, and some of these genes hold the potential of being novel drug targets. Drugs targeting the MAP kinase pathway, for example, may be applied to the treatment of late EEC since this canonical pathway is significantly up in late EECs (Table 5). Since statistical analysis of gene ontology terms relies on prior knowledge of the biological activity of each differentially expressed gene, the enrichment of genes associated with specific pathways may be a consequence of intense research in such areas. Hence, new canonical pathways may still exist and may serve as candidate therapeutic targets. The function of the filtered KIAA (such as KIAA0323, Figure 5C) and LOC series of anonymous ESTs (such as C20orf24, Figure 5C) in Tables 3, 4, 5, 6, 7 should be studied, and their roles in tumor malignancy, chemoresistance and EpiSC stemness remain to be elucidated. Further studies to establish the prognostic value and therapeutic potential of the identified genes, especially those also present in epithelial stem cells, should lead to a better understanding of EEC and EpiSC biology and of the susceptibility of late EECs to treatment.
Microarray data sets
All array data were generated with the Affymetrix™ HG-U133 Plus 2.0 GeneChip. Array data of normal CD133+ epithelial stem cells isolated from benign prostatic hyperplasia, which were used as a normal counterpart of cancer stem cells, were downloaded from the ArrayExpress database at the European Bioinformatics Institute (http://www.ebi.ac.uk/microarray-as/ae/; Accession No. E-MEXP-993; array data files 1325504978.cel, 1325505459.cel and 1325505089.cel were used).
The gene expression profiles of EEC tissues of different stages were generated by the International Genomics Consortium (IGC) under the expO (Expression Project for Ontology) project and were downloaded from Gene Expression Omnibus (GEO http://www.ncbi.nlm.nih.gov/geo/; GSE2109). EEC array data were divided into training (n = 33; incl. all 4 stages) and testing cohorts (n = 15) (details in Table 1). Array data of normal endometrium controls were from a Human body index dataset in GEO (GSE7307).
Array data processing
Feature selection was performed as previously described . Briefly, the default robust multichip average (RMA) settings were used to background correct, normalize and summarize all expression values using the 'affy' package of the Bioconductor suite of software http://www.bioconductor.org/ for the R statistical programming language. A t-statistic was calculated as normal for each gene and a p-value then calculated using a modified permutation test in the "LIMMA" package . To control the multiple testing errors, a false discovery rate (FDR) algorithm was then applied to these p-values to calculate a set of q-values: thresholds of the expected proportion of false positives, or false rejections of the null hypothesis [22, 54]. Gene annotation was performed by the ArrayFusion web tool http://microarray.ym.edu.tw/tools/arrayfusion/. Gene enrichment analysis was performed by the Gene Ontology (GO) database using the DAVID Bioinformatics Resources 2008 interface http://david.abcc.ncifcrf.gov/, a graph theory evidence-based method to agglomerate gene or protein identifiers [56, 57].
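The multiple-testing step above can be illustrated with a short sketch. The study reports q-values from a positive false discovery rate procedure computed in R; the Python function below swaps in the simpler Benjamini-Hochberg adjustment as a stand-in, and the function name `bh_adjust` and the toy p-values are assumptions made for illustration only.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values.

    Assumption: this is a stand-in for the q-value calculation cited in the
    text, not the authors' exact pFDR procedure.
    """
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy p-values for four hypothetical probe sets.
toy_p = [0.01, 0.04, 0.03, 0.005]
toy_q = bh_adjust(toy_p)
```

Probe sets whose adjusted value falls below a chosen cutoff would then be retained as differentially expressed.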
The discrimination power of filtrated genes was evaluated by a machine-learning approach combining the weighted voting algorithm  and leave-one-out cross-validation (LOOCV). This approach has been integrated in our Java tool http://microarray.ym.edu.tw/tools/set/. In brief, the uploaded genes are ranked according to the absolute values of corresponding signal-to-noise scores  in a descending order. Genes are included into a signature one at a time based on the order of ranking. The error rate for each new signature is estimated by the weighted voting algorithm and LOOCV and can be monitored by an error rate distribution plot . Based on the error rate information, we then selected an appropriate composition of discriminating genes with the lowest error rate. Once a signature is defined, the result of prediction strength (PS) analysis for each sample was shown. The PS values range from -1 to +1, where higher absolute values reflect stronger predictions . An overview of the results for samples in different groups was then illustrated by a PS plot .
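A minimal sketch of this evaluation is given below, assuming the usual formulation of weighted voting: each gene votes with weight equal to its signal-to-noise score times the distance of the sample from the midpoint of the two class means, and the prediction strength is the normalized margin between the two vote totals. The function names, the toy expression values and the choice to keep the gene list fixed across folds are assumptions for illustration, not the authors' Java implementation.

```python
from statistics import mean, stdev


def train(samples, labels, pos, neg):
    """Return one (signal-to-noise weight, class-mean midpoint) pair per gene.

    Each class needs at least two training samples so stdev is defined.
    """
    model = []
    for g in range(len(samples[0])):
        a = [s[g] for s, y in zip(samples, labels) if y == pos]
        b = [s[g] for s, y in zip(samples, labels) if y == neg]
        spread = stdev(a) + stdev(b)
        weight = (mean(a) - mean(b)) / spread if spread > 0 else 0.0
        model.append((weight, (mean(a) + mean(b)) / 2))
    return model


def prediction_strength(model, sample):
    """Signed PS in [-1, 1]; positive values vote for the `pos` class."""
    votes = [w * (x - mid) for (w, mid), x in zip(model, sample)]
    v_pos = sum(v for v in votes if v > 0)
    v_neg = -sum(v for v in votes if v < 0)
    total = v_pos + v_neg
    return (v_pos - v_neg) / total if total > 0 else 0.0


def loocv_error_rate(samples, labels, pos="late", neg="early"):
    """Leave each sample out once, retrain on the rest, count mistakes."""
    errors = 0
    for i in range(len(samples)):
        model = train(samples[:i] + samples[i + 1:],
                      labels[:i] + labels[i + 1:], pos, neg)
        predicted = pos if prediction_strength(model, samples[i]) > 0 else neg
        errors += predicted != labels[i]
    return errors / len(samples)


# Toy data: two hypothetical genes measured in three early and three late cases.
toy_x = [[1.0, 5.0], [1.2, 5.1], [0.9, 4.8],
         [3.0, 2.0], [3.1, 2.2], [2.9, 1.9]]
toy_y = ["early"] * 3 + ["late"] * 3
toy_error = loocv_error_rate(toy_x, toy_y)
```

Repeating `loocv_error_rate` while adding ranked genes one at a time would give an error rate curve of the kind described above.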
Classical multidimensional scaling (MDS) was performed with the standard function of the R program to provide a visual impression of how the various sample groups are related. The distance between two samples is calculated as the Pearson correlation subtracted from unity, which gives bounded distances in the range [0, 2], as described in our previous study. The distance between two groups of samples is calculated using the average linkage measure (the mean of all pair-wise distances (linkages) between members of the two groups concerned). The standard error of the average linkage distance between two groups (the standard deviation of pair-wise linkages divided by the square root of the number of linkages) is quoted when inter-group distances are compared in the text.
Staining was performed on formalin-fixed, paraffin-embedded specimens using anti-ERBB2 primary antibody (DAKO, Carpinteria, CA, USA). Scoring was performed as follows. 0: undetectable staining or membrane staining in <10% of the tumor cells; 1+: faint and incomplete membrane staining in >10% of the tumor cells; 2+: weak to moderate complete membrane staining in >10% of the tumor cells; 3+: strong complete membrane staining in >10% of the tumor cells. ERBB2 protein expression was categorized as negative (scores 0 and 1+) or positive (scores 2+ and 3+).
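The distance calculation described here can be written out as a short sketch. The Python code below computes one minus the Pearson correlation for every cross-group pair of samples, then the mean and its standard error; the function names and the toy profiles are hypothetical and only illustrate the arithmetic.

```python
from itertools import product
from math import sqrt
from statistics import mean, stdev


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def average_linkage(group_a, group_b):
    """Mean pair-wise distance (1 - r) between two groups and its standard error."""
    linkages = [1.0 - pearson(a, b) for a, b in product(group_a, group_b)]
    se = stdev(linkages) / sqrt(len(linkages)) if len(linkages) > 1 else 0.0
    return mean(linkages), se


# Toy profiles: one sample in the first group, two in the second.
toy_a = [[1.0, 2.0, 3.0, 4.0]]
toy_b = [[2.0, 4.0, 6.0, 8.0], [4.0, 3.0, 2.0, 1.0]]
toy_distance, toy_se = average_linkage(toy_a, toy_b)
```

In the toy example one pair is perfectly correlated (distance 0) and the other perfectly anti-correlated (distance 2), so the two ends of the distance scale are both exercised.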
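The scoring rule can be expressed as a small lookup, sketched below in Python. The encoding of staining intensity as the strings "none", "faint", "moderate" and "strong", the handling of exactly 10% stained cells, and the fallback to 1+ for combinations the text does not define are all assumptions made for illustration.

```python
def erbb2_score(percent_stained, intensity, complete):
    """Map staining features to a 0-3 score.

    Assumptions: intensity is one of 'none', 'faint', 'moderate', 'strong';
    combinations not listed in the text (e.g. strong but incomplete) fall
    back to 1+; exactly 10% is treated as meeting the threshold.
    """
    if intensity == "none" or percent_stained < 10:
        return 0
    if intensity == "strong" and complete:
        return 3
    if intensity == "moderate" and complete:
        return 2
    return 1


def erbb2_status(score):
    """Scores 0 and 1+ are negative; 2+ and 3+ are positive."""
    return "positive" if score >= 2 else "negative"
```

For example, a specimen with strong, complete membrane staining in half of the tumor cells would score 3 and be called positive under these assumptions.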
The authors acknowledge the efforts of IGC and expO. This work is supported by the Mackay Memorial Hospital (MMH-HB-97-05), the National Health Research Institute (NHRI-EX97-9704BI), the National Science Council (NSC97-3111-B-010-004 and NSC98-2320-B-010-020-MY3), the Taipei Veterans General Hospital Research Fund, the VGHUST Joint Research Program, Tsou's Foundation (V98ER2-003), and Yang-Ming University (a grant from the Ministry of Education, Aim for the Top University Plan).
SJC, TYW, and HWW designed the study. SJC and TYW collected the microarray data sets and EEC materials. SJC, TYW, CYT, and TFW executed the project plan and performed the data analysis. SJC, TYW, MDC, and HWW carried out data interpretation and discussion. SJC wrote the manuscript, and HWW revised it. All authors read and approved the final manuscript.
Cite this article
Chang, S., Wang, T., Tsai, C. et al. Increased epithelial stem cell traits in advanced endometrial endometrioid carcinoma. BMC Genomics 10, 613 (2009). https://doi.org/10.1186/1471-2164-10-613
- Endometrial Carcinoma
- Normal Stem Cell
- Epithelial Stem Cell
- Stem Cell Gene
- Filtrate Gene