Tissue-specific gene expression templates for accurate molecular characterization of the normal physiological states of multiple human tissues with implication in development and cancer studies
© Hwang et al; licensee BioMed Central Ltd. 2011
Received: 22 February 2011
Accepted: 1 September 2011
Published: 1 September 2011
To elucidate the molecular complications in many complex diseases, we argue for the priority to construct a model representing the normal physiological state of a cell/tissue.
By analyzing three independent microarray datasets on normal human tissues, we established a quantitative molecular model GET, which consists of 24 tissue-specific G ene E xpression T emplates constructed from a set of 56 genes, for predicting 24 distinct tissue types under disease-free condition. 99.2% correctness was reached when a large-scale validation was performed on 61 new datasets to test the tissue-prediction power of GET. Network analysis based on molecular interactions suggests a potential role of these 56 genes in tissue differentiation and carcinogenesis.
Applying GET to transcriptomic datasets produced from tissue development studies the results correlated well with developmental stages. Cancerous tissues and cell lines yielded significantly lower correlation with GET than the normal tissues. GET distinguished melanoma from normal skin tissue or benign skin tumor with 96% sensitivity and 89% specificity.
These results strongly suggest that a normal tissue or cell may uphold its normal functioning and morphology by maintaining specific chemical stoichiometry among genes. The state of stoichiometry can be depicted by a compact set of representative genes such as the 56 genes obtained here. A significant deviation from normal stoichiometry may result in malfunction or abnormal growth of the cells.
It has been well-recognized that within a cell, not only genes participate in cascades of biochemical events (pathways), but also the pathways themselves cross-talk with each other as a delicate and intriguing network system. Such complexity was reflected in the normal biological processes (tissue development, for example) as well as in the complex disease processes such as autism, cancer, rheumatoid arthritis and coronary artery disease [1, 2]. In addition, the genetic interactions of oncogenes and tumor suppressor genes may perturb the normal network system through a variety of altered molecular properties of the normal genes, magnifying the difficulties encountered in the cancer biology study . Because of this, it is important to develop quantitative molecular models which can represent different physiological or pathological states of a complex biological system and can be used to predict the related states, using high throughput molecular data. In line with this viewpoint, we argue for the priority to construct models for the normal physiological states first. This is because (1) normal cells/tissues are endowed with the most stable biochemical homeostasis and (2) such models may serve as general references for contrasting with various pathological or altered physiological states.
Up to now, in part due to the limitation in sample availability, few studies on normal human tissues have been reported. Through the transcriptome study of the disease-free human samples via microarray analysis, gross patterns of tissue-gene relationships have been observed by several teams [4–7]. A recent study which applied statistical and network analysis to transcriptomic data from 31 normal human tissue types has resulted in putative tissue-specific networks for nine tissues. These putative tissue-specific networks were suggested as potential drug targets . However, it still awaits a deeper investigation to find out what molecular signatures can best represent the normal state of a specific tissue and offer the most transparent and systematic elucidation on tissue differences (regarding anatomy, pathology and development). In this study, by re-analyzing some of the transcriptomic datasets produced from normal human tissues in the Gene Expression Omnibus (GEO), we identified a set of 56 genes whose transcript profiles are endowed with strong tissue-specific properties for 24 different tissue types under the disease-free condition. These genes present significant variation of expression amongst tissues. From the expression level of these 56 genes, we constructed 24 tissue-specific G ene E xpression T emplates (GETs), one for each of the 24 tissues. We first validated that these GETs can differentiate tissue types under the normal physiological condition. Then we demonstrated how GET can be applied under other conditions, including development and cancerous conditions. Our results suggest that homeostasis among various molecules in a cell/tissue may play a key role in maintaining its normal functioning and the homeostasis state can be characterized by the 56 genes.
Characterization of 24 tissue types by the 56 genes
We searched for a set of genes whose expression profile could best represent normal state of a specific tissue type. We used three large-scale microarray datasets as our training datasets to identify a group of 56 genes with high variation in expression across different tissue types (Additional file 1: Table S1). Briefly, we selected the probe sets with coefficient of variation (CV) ranked within the top 2.5% of the entire transcriptome across all samples from each of the three training datasets. After intersecting the three groups of highly variably expressed probe sets, we removed redundant probe sets that share similar expression patterns. Our procedure yielded a set of 56 genes. [see methods for more details].
Taking a closer look at the outcome of the tissue classification obtained from the hierarchical clustering analysis, similar to the previous reports published by Shyamsundar et al and Ge et al on tissue classification, our results also showed that the tissues were grouped largely by their physiological functions, anatomical locations or cellular composition. For example, ovary and uterus, the organs from the female productive system, were clustered together; the gland tissues/organs including thyroid, pancreas, and salivary gland also aggregated and the testis as previously reported [4, 7, 9] was grouped with the tissues from the central nervous system like cerebellum, amygdala, and thalamus. However, it should be noted that we obtained the similar tissue classification result with far smaller set of genes (56) as the classifiers, in comparison with the two previous reports where they used 7396  genes and 5592  genes on 36 tissue types (from 36 samples) and 45 tissue types (from 115 specimens), respectively.
Accurate prediction of normal tissues
Large-scale prediction of normal human tissues with GETs
To show how stable and robust the relative expression level of the 56 genes is, we used Spearman's rank correlation to replace the standard Pearson correlation in the tissue-prediction analysis and obtained 96.2% (767/797) of accuracy (Additional file 1: Table S5). It indicates that the relative expression of the 56 genes is a robust feature for characterizing the normal state of a specific human tissue.
So far, we have chosen Affymetrix as the gene expression platform to test the performance of GET. Our reasons are (1) Affymetrix is the common platform used in our three training datasets which ensures the inclusion of the entire set of our 56 genes; (2) the data preprocessing procedure is standardized; (3) existence of many datasets in the public domain for validation and (4) high reproducibility. However, it remains questionable whether GET can be applied to other platforms of measuring gene expression or not. To address this issue, we searched GEO for datasets containing enough samples from norm human tissues. We found an ABI array generated dataset, GSE7905 , which contains many normal human tissue samples. Among them, there are 60 samples (20 tissue types in triplicates) matching our 24 tissue types. We treated the data generated by these ABI array as if they were from the Affymetrix and applied GET to make tissue prediction. Strikingly, we found that our Affymetrix-based GETs yield a perfect result, 100% (60/60) (Additional file 1: Table S6). This demonstrates that GET is platform-independent.
Network analysis reveals involvement of the 56 genes in development and tumorigenesis
Detection of developmental stages in cultured cells and embryonic tissue
As the network analysis above suggests possible roles of the 56 genes in tissue development and tumorigenesis, we intended to test whether the degree of similarity to our GETs may also reflect the physiological or developmental states of a given tissue. To proceed, we were able to obtain datasets for two embryonic tissues, skin and lung, from GEO.
To confirm that this trend was specific to the lung template only, not other tissue templates, we also performed the same correlation computation with each of the remaining 23 training tissues and calculated the regression coefficients of all correlation plots (Additional file 1: Table S7). The result agreed with our main finding that lung gave the highest regression coefficient among the twenty-four tissues.
Deviation of normal expression profiles from cancerous tissue
Tissue prediction on GSE5364 using as template the 56-gene profiles constructed from the 24 normal tissue types
Livera (n = 8)
0.85 ± 0.03
Lunga (n = 12)
0.81 ± 0.03
Thyroida (n = 16)
0.81 ± 0.03
Hepatomab (n = 9)
0.40 ± 0.16
Lung cancerb (n = 18)
0.61 ± 0.13
Thyroid cancerb (n = 35)
0.73 ± 0.06
The 56-gene profiles in cancerous cells/tissues strongly deviate from that in the normal counterpart
NCI60 cell lines
Lung (n = 8)
0.21 ± 0.12
Ovary (n = 8)
0.15 ± 0.08
Prostate (n = 2)
0.05 ± 0.04
Kidney (n = 8)
0.43 ± 0.15
Skin ((n = 10)
0.017 ± 0.05
Normal skin (n = 7)
0.78 ± 0.05
Benign nevus (n = 18)
0.73 ± 0.06
Melanoma (n = 45)
0.36 ± 0.14
Specificity and sensitivity of distinction of melanoma from nevi and normal skin using the 56-gene profile template.
Positive predict: 29
Negative predict: 41
Sensitivity = 96%
Specificity = 89%
Distinguishing normal skin from skin substitutes
Distinction of native human skin from skin substitutes by the skin GET
Best matched tissue type
NHS (native human skin)
0.88 ± 0.03
Skin, skin, skin, skin
CSS (cultured skin substitutes)
0.74 ± 0.04
Skin, skin, skin, skin
CK (cultured keratinocytes)
0.61 ± 0.02
Skin, skin, skin, skin
CF (cultured fibroblasts)
0.35 ± 0.02
Uterus, lung, lung, lung
We have attempted to address the fundamental issue of "Can the normal physiological states of various human tissue types be quantified at the molecular level faithfully and succinctly?" In the biomedical literature, the phrase "normal physiological state" is often brought up to contrast the phrase "pathological or disease state". In physics or engineering, a "state" of a system must be quantified by well-defined variables. Can we do the same in the biological world? We conceptualized the issue by arguing that one way to describe a biological state at the molecular level is to present a template consisting of (a) a list of molecule species and (b) their relative abundance levels. To be useful, three properties should be possessed- compactness, repeatability and discrimination ability. The list should be reasonably short and the template should be able to predict the state accurately for as many sets of data generated by as many different labs as possible. Taking full advantage of the rich data resource provided by GEO (Gene Expression Omnibus), here we offered the characterization of normal physiological state a bench mark solution.
This report is the first to present a multi-purposed, molecule-based molecular model that can characterize as many as 24 different human tissue types. The success of our tissue-specific GETs in accurately predicting the tissue types from various sources and in discriminating tissues/cells at different developmental stages indicates that (A) a tissue under the disease-free condition constantly maintains certain stoichiometry among many gene products; (B) the same tissue type from different disease-free individuals shares very similar gene-product stoichiometry; (C) the gene-product stoichiometry can be expressed as the relative transcription levels of a set of representative genes, a gene expression template (GET) (the combinatorial expression levels of the 56 genes in this study); (D) When the physiological or developmental state of a cell shifts, the gene-product stoichiometry may change accordingly. (E)
Severe alteration from the normal state gene-product stoichiometry, possibly caused by multiple mutations in genes or dramatic shifts of the overall biochemical environment of a cell, may lead to abnormal growth like cancer, if not death of a cell. In support of this notion, we also demonstrated that the 56-gene expression patterns in cancerous cells/tissues significantly deviate from normal GET and that our tissue-specific GETs can be used to discern melanoma from benign nevi and from normal skin. Potential applications of our results to tissue engineering, cancer diagnosis and development studies are therefore inferred.
Our approach to constructing a gene signature for predicting tissue types is simpler than existing classification methods . We first identified those genes showing a similar and reproducible trend in all three large datasets, then used the full gene group to perform tissue classification, and finally applied the group behavior (that is, the expression profiles of the compact 56-gene group) as predictors to characterize tissue types under various conditions. Without complicated modeling, our 56-gene signature provides high prediction power on numerous public datasets. As far as we are aware of, this is the most compact gene set capable of classifying the largest number of tissues. The use of multiple datasets which served as biological replicates allowed us to reduce the number of false positives and to find the genes with most variable expression across various tissues with better confidence. Note, however, that because of the high accuracy already achieved by the 56 genes, we did not explore the issue of possible existence of other gene sets that could serve as GETs and accomplish the same or even better rate of prediction - perhaps with aids of additional statistical tools such as one-way ANOVA for gene selection.
With the abundance of interplaying gene and pathway activities in a tissue, one may ask how the group behavior of these 56 genes can represent the states of various human tissue types. Our functional study of the 56 genes reveals a variety of functional categories including cytoskeleton (desmin, nebulin), signal transduction proteins (protein kinase C beta1, CDC28 protein kinase regulatory subunit 2), neural transmitter regulator (4-aminobutyrate aminotransferase), energy homeostasis regulator (insulin-like growth factor binding protein 1), and immunity (CD24 molecule) etc. It should be emphasized, however, that the high precision of large-scale validation on tissue prediction was not achieved through the combinatorial on/off states of a collection of tissue-specific markers because only 4 of the 56 genes appeared as tissue-specific genes which highly express in one particular tissue but minimally in others. They are TFPI2 specific for placenta, ANKRD7 for testis, ELA2A for pancreas and APOC3 for liver. However, the expressions of all 56 genes together as a template, did present distinctive patterns varying from tissue to tissue. Therefore, this gene set may be considered as the representative genes of the key biochemical pathways functioning distinctively across tissues, and the combinatorial transcription levels of the 56 genes, the GETs, may reflect the net sum of the relative activities of these pathways.
Despite that the feature of tissue-characterization of the 56 genes may not be exerted through collection of the so-called "tissue-specific" genes as discussed above, it would be interesting to find out how each of the 56 genes may contribute to tissue characterization. One of our on-going projects in reducing the gene set without compromising its power in defining the normal physiological state of a specified human tissue may help to answer this question.
The network analysis provided additional clues to the biological implications of the signature in development and carcinogenesis. Positive correlation of the 56- gene profile to developmental stages revealed in both in vitro and in vivo studies indicates that systematic shifts of the global gene expression through the complicated developmental process can be characterized with our signature genes. Hence it is possible that the 56 genes may be good candidates for modeling the human developmental process. Further, the capability of the 56- gene profiles in correlating quality of the engineered skin to the similarities to normal skin template brought up a potential application of the signature to serve as the quality index for engineered tissue.
The network analysis also helped to link our model to the current understanding of tumorigenesis. We showed that the c.f.s of the 56-gene profiles in malignant tumors were significantly lower than the normal tissues to the corresponding template, indicating changes of expression in multiple genes in a cancer tissue. It coincides with the findings that at least 4-5 mutations are required to initiate tumor [29, 30]. In our network analysis, more than half of the 56 genes were found to interact with those well-known cancer-related transcription factors or signaling receptors such as STAT3, TP53, ESR1 and EGFR which have been shown to interact with a great number of gene products involved in varieties of pathways. Therefore, it is possible that mutations occurring in such genes (i.e. EGFR, STAT3 etc.) may simultaneously affect expression of a number of the target genes which may ultimately lead to changes in the profile of our signature. Further, significant change in the profile of the 56 genes indicates alterations in relative activities of the pathways represented by these signature genes, reflecting a dramatic shift of the cellular homeostasis which may lead to cell necrosis or anomalous growth like tumorigenesis. Alternatively, accumulated mutations in the genes which affect the activities of those pathways represented by our signature may also affect the expression profile in one hand and lead to similar outcome as described above on the other hand. Taken together, despite that severe shift of the 56-gene profile from normal may not be the initial cause of many cancers, it could have the potential to serve as an indicator for the cancerous state of a cell/tissue. Whether our signature can be applied to cancer staging awaits further investigation. Nonetheless, this knowledge provides a new aspect in understanding the complex process of carcinogenesis.
These results strongly suggest that a normal tissue or cell may uphold its normal morphology and functioning by maintaining specific chemical stoichiometry among genes. The stoichiometry of a physiological state of a normal human tissue can be depicted by the relative expression levels of a compact set of representative genes such as the 56 genes obtained here. A significant deviation from such quantitative relationship may result in malfunction or abnormal growth of the cells.
Microarray data used in this study were obtained from the Gene Expression Omnibus (GEO) database at NCBI by Nov. 2nd of 2009. GEO series with accession numbers GSE2361, GSE1133(2004 version of the Gene Atlas) and GSE7307 (the "human body index") were used to find molecular features in normal tissues and to derive the 56-gene template profiles. (Additional file 1: Table S1) Datasets GSE14334, GSE3204, GSE5364 and GSE6932 were used as testing data to further explore the biological implications of GETs. Datasets GSE1133, GSE2361, GSE5364 and GSE6932 were hybridized on the Affymetrix GeneChip HG-U133A and GSE7307 on the HG-U133plus2.0. The Affymetrix GeneChip HG-U133plus2.0 contained 54,675 probe sets (representing around 38,572 unique UniGene clusters) which cover all the 22283 probe sets (representing 14,593 unique UniGene clusters) synthesized on the HG-U133A. The additional 62 datasets used for large-scale tissue prediction had all been hybridized on either HG-U133A or HG-U133plus2.0. The accession identification as well as the associated information are summarized in Additional file 1: Tables S1 and S3.
Molecular annotation for selected genes
The gene sets were annotated by searching the databases at the DAVID server (http://david.abcc.ncifcrf.gov/home.jsp) with Entrez Gene  identifier as input. Cellular location and biological processes were searched against Gene Ontology (GO) . The molecular functions were searched against PANTHER, since PANTHER gave a more complete set of biologically-relevant results for our gene set than GO. Pathways were searched against KEGG .
For those datasets whose CEL files are available at GEO, the data were first subjected to quality assessment by AffyQualityReport to remove the poor quality arrays and then to RMA processing for data normalization.
For identification of the 56 signature genes, this preprocessing procedure resulted in 143, 35 and 473 arrays for GSE1133, GSE2361, and GSE7307, respectively. Gene filtration was carried out by firstly selecting from each of the three training datasets the genes whose coefficients of variation ranked at top 2.5% of the entire transcriptome across different tissue types. The resulted highly variably expressed genes were then intersected to generate a set of candidate tissue-classifier genes which were later subjected to data redundancy elimination through hierarchical clustering against the 24 tissues commonly present in the three sets of training data. Following the hierarchical cluster analysis, one representative gene for each cluster was selected and additional genes with highly similar expression profiles got removed. This procedure resulted in 56 genes.
For tissue classification, the probe set intensities of the 56 genes or an equivalent number of random probe sets of the 24 selected tissues were extracted from each of the three GEO datasets using the programs Microsoft Access and Excel. The extracted probe intensities from the three datasets were then combined into a 56 × 72 matrix which was then subjected to hierarchical clustering with the GenePattern package  using Pearson correlation for similarity computing and average for clustering. Ten sets of 56 random probe sets were produced by a random number generation program written in C. Each set was used for a separate hierarchical clustering analysis.
Both AffyQualityReport and RMA were obtained from the Bioconductor package  in the R package (http://www.r-project.org/). Descriptive statistical analyses were computed using Excel while hierarchical clustering with the GenePattern package.
Tissue prediction using the 56 genes
Tissue prediction was performed following the KNN method (k-nearest neighbor) with k = 1. It compares the c.f. of the 56-gene profiles between a test tissue and each of our 24 tissue-specific GET profiles, one for each tissue type. The tissue type with highest correlation was nominated as our prediction. A computer program in R language was implemented to accomplish this task.
Dataset retrieval from GEO for large-scale tissue-prediction
Text The entire GEO database (2009-11-2 freeze) was searched with the following criteria: platform as GPL96 (Affymetrix HG-U133A) or GPL570 (HG-U133plus2.0), sample source containing one of the 24 distinguishable human organ/tissues and key word in the sample-related fields containing "normal". Two bioinformatics strategies were used to carry out the search: one was to apply SQL commands to the local MySQL database housing the data from the soft files of GPL96 and GPL570 which were imported from GEO website. The other strategy was to directly query the GEO database with Entrez keywords through the NCBI web interface. The union of both searching results was taken, followed by manual filtration to exclude irrelevant datasets that, for example, came from cell lines or specific cell types. Those datasets which had been contributed by the same research group as the three source datasets, GSE3526 for instance, were also removed from our test set. Expression profiles of the 56 genes were then extracted from the 61 resulting datasets.
Datasets of 56 gene expression values were organized into RMA-like or MAS-like according to the data preprocessing methods. For those datasets that had been normalized with MAS5 or equivalent method, logarithmic transformation was carried out prior to tissue-prediction analysis. For three datasets (GSE13355, GSE14951, GSE17539) it was hard to judge whether logarithm transformation was necessary and their CEL files were therefore preprocessed with AffyQualityReport followed by RMA normalization before tissue-prediction analysis.
Gene network construction
Gene networks were constructed with the MetaCore package using the algorithms "network analysis" and "receptor targets modeling". The algorithms are variants of the shortest paths algorithm where the main parameters are: 1) relative enrichment with the uploaded data (the 56 genes in this study), and 2) relative saturation of networks with canonical pathways. As a control for this network analysis, a set of 56 genes randomly selected from the Affymetrix microarray HG-U133A was entered as a query and no network was produced by either of the algorithms. The control experiments were repeated twice.
List of Abbreviations
tissue-specific gene expression templates.
The authors want to thank Adam Yao, Chen-Hsiang Yeang, and Ching-Feng Cheng for professional discussion regarding manuscript editing and critical discussion in statistics and biomedicine. Thanks are also given to Shi-Hsien Yang, Zhi-Shun Chen and Ying-Fu Ho for their technical support in random number generation and computing assistance. We are grateful to Harry Wilson and Karri Alston for the professional editing of our manuscript.
This work was supported in part by the National Science Council (grant NSC95-3114-P-002-005-Y, NSC97-2627-P-001-003) and also by a startup funding for Mathematics in Biology from Academia Sinica.
- Amberger J, Bocchini CA, Scott AF, Hamosh A: McKusick's Online Mendelian Inheritance in Man (OMIM®). Nucleic Acids Res. 2009, 37 (suppl 1): D793-D796.PubMed CentralPubMedView Article
- McClellan J, King M-C: Genetic Heterogeneity in Human Disease. Cell. 2010, 141 (2): 210-217. 10.1016/j.cell.2010.03.032.PubMedView Article
- Ashworth A, Lord Christopher J, Reis-Filho Jorge S: Genetic Interactions in Cancer Progression and Treatment. Cell. 2011, 145 (1): 30-38. 10.1016/j.cell.2011.03.020.PubMedView Article
- Ge X, Yamamoto S, Tsutsumi S, Midorikawa Y, Ihara S, Wang SM, H A: Interpreting expression profiles of cancers by genome-wide survey of breadth of expression in normal tissues. Genomics. 2005, 86 (2): 127-141. 10.1016/j.ygeno.2005.04.008.PubMedView Article
- Hsiao L-L, Dangond F, Yoshida T, Hong R, Jensen RV, Misra J, Dillon W, Lee KF, Clark KE, Haverty P, et al: A compendium of gene expression in normal human tissues. Physiol Genomics. 2001, 7 (2): 97-104.PubMedView Article
- Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, et al: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci USA. 2004, 101 (16): 6062-6067. 10.1073/pnas.0400782101.PubMed CentralPubMedView Article
- Shyamsundar R, Kim YH, Higgins JP, Montgomery K, Jorden M, Sethuraman A, van de Rijn M, Botstein D, Brown PO, JR P: A DNA microarray survey of gene expression in normal human tissues. Genome Biology. 2005, 6 (3): R22-10.1186/gb-2005-6-3-r22.PubMed CentralPubMedView Article
- Dezso Z, Nikolsky Y, Sviridov E, Shi W, Serebriyskaya T, Dosymbekov D, Bugrim A, Rakhmatulin E, Brennan R, Guryanov A, et al: A comprehensive functional analysis of tissue specificity of human gene expression. BMC Biology. 2008, 6 (1): 49-10.1186/1741-7007-6-49.PubMed CentralPubMedView Article
- Guo J, Zhu P, Wu C, Yu L, Zhao S, Gu X: In silico analysis indicates a similar gene expression pattern between human brain and testis. Cytogenetic and Genome Research. 2003, 103 (1-2): 58-62. 10.1159/000076290.PubMedView Article
- Yu K, Ganesan K, Tan LK, Laban M, Wu J, Zhao XD, Li H, Leung CHW, Zhu Y, Wei CL, et al: A Precisely Regulated Gene Expression Cassette Potently Modulates Metastasis and Survival in Multiple Solid Cancers. PLoS Genet. 2008, 4 (7): e1000129-10.1371/journal.pgen.1000129.PubMed CentralPubMedView Article
- Ekins SNY, Bugrim A, Kirillov E, Nikolskaya T: Pathway mapping tools for analysis of high content data. Methods in Molecular Biology. 2007, 356: 319-350.PubMed
- Nagase H, Woessner JF: Matrix Metalloproteinases. Journal of Biological Chemistry. 1999, 274 (31): 21491-21494. 10.1074/jbc.274.31.21491.PubMedView Article
- Ortega N, Behonick D, Stickens D, Werb Z: How Proteases Regulate Bone Morphogenesis. Annals of the New York Academy of Sciences. 2003, 995 (1): 109-116. 10.1111/j.1749-6632.2003.tb03214.x.PubMedView Article
- Kisseleva T, Bhattacharya S, Braunstein J, Schindler CW: Signaling through the JAK/STAT pathway, recent advances and future challenges. Gene. 2002, 285 (1-2): 1-24. 10.1016/S0378-1119(02)00398-0.PubMedView Article
- Hankey PA: Regulation of hematopoietic cell development and function by Stat3. Front Biosci. 2009, 14: 5273-5290. 10.2741/3597.View Article
- Satto S, Liu B, Yokoyama K: Animal Embryonic Stem (ES) Cells: Self-renewal, Pluripotency, Transgenesis and Nuclear Transfer. Human Cell. 2004, 17 (3): 107-116.View Article
- Takada I, Kouzmenko AP, Kato S: Wnt and PPAR[gamma] signaling in osteoblastogenesis and adipogenesis. Nat Rev Rheumatol. 2009, 5 (8): 442-447. 10.1038/nrrheum.2009.137.PubMedView Article
- Cimini A, Cerù M: Emerging Roles of Peroxisome Proliferator-Activated Receptors (PPARs) in the Regulation of Neural Stem Cells Proliferation and Differentiation. Stem Cell Reviews and Reports. 2008, 4 (4): 293-303. 10.1007/s12015-008-9024-2.PubMedView Article
- Kwon CY, Kim KR, Choi HN, Chung MJ, Noh SJ, Kim DG, Kang MJ, Lee DG, WS M: The role of serum response factor in hepatocellular carcinoma: implications for disease progression. Int J Oncol. 2010, 37 (4): 837-844.PubMed
- Lynch TJ, Bell DW, Sordella R, Gurubhagavatula S, Okimoto RA, Brannigan BW, Harris PL, Haserlat SM, Supko JG, Haluska FG, et al: Activating Mutations in the Epidermal Growth Factor Receptor Underlying Responsiveness of Non-Small-Cell Lung Cancer to Gefitinib. New England Journal of Medicine. 2004, 350 (21): 2129-2139. 10.1056/NEJMoa040938.PubMedView Article
- Walker F, Abramowitz L, Benabderrahmane D, Duval X, Descatoire V, Hénin D, Lehy T, Aparicio T: Growth factor receptor expression in anal squamous lesions: modifications associated with oncogenic human papillomavirus and human immunodeficiency virus. Human pathology. 2009, 40 (11): 1517-1527. 10.1016/j.humpath.2009.05.010.PubMedView Article
- Kuan C, Wikstrand C, Bigner D: EGF mutant receptor vIII as a molecular target in cancer therapy. Endocr Relat Cancer. 2001, 8 (2): 83-96. 10.1677/erc.0.0080083.PubMedView Article
- Hong J, Lee J, Min KH, Walker JR, Peters EC, Gray NS, Cho CY, Schultz PG: Identification and Characterization of Small-Molecule Inducers of Epidermal Keratinocyte Differentiation. ACS Chemical Biology. 2007, 2 (3): 171-175. 10.1021/cb600435t.PubMedView Article
- Kho AT, Bhattacharya S, Tantisira KG, Carey VJ, Gaedigk R, Leeder JS, Kohane IS, Weiss ST, TJ M: Transcriptomic Analysis of Human Lung Development. Am J Respir Crit Care Med. 2010, 181: 54-63. 10.1164/rccm.200907-1063OC.PubMed CentralPubMedView Article
- Reinhold WC, Reimers MA, Lorenzi P, Ho J, Shankavaram UT, Ziegler MS, Bussey KJ, Nishizuka S, Ikediobi O, Pommier YG, et al: Multifactorial Regulation of E-Cadherin Expression: An Integrative Study. Molecular Cancer Therapeutics. 2010, 9 (1): 1-16. 10.1158/1535-7163.MCT-09-0321.PubMed CentralPubMedView Article
- Talantov D, Mazumder A, Yu JX, Briggs T, Jiang Y, Backus J, Atkins D, Y W: Novel genes associated with malignant melanoma but not benign melanocytic lesions. Clinical Cancer Research. 2005, 11 (20): 7234-7242. 10.1158/1078-0432.CCR-05-0683.PubMedView Article
- Smiley AK, Klingenberg JM, Aronow BJ, Boyce ST, Kitzmiller W, Supp DM: Microarray Analysis of Gene Expression in Cultured Skin Substitutes Compared with Native Human Skin. J Investig Dermatol. 2005, 125 (6): 1286-1301. 10.1111/j.0022-202X.2005.23971.x.PubMedView Article
- Duval B, Hao J-K: Advances in metaheuristics for gene selection and classification of microarray data. Briefings in Bioinformatics. 2010, 11 (1): 127-141. 10.1093/bib/bbp035.PubMedView Article
- Fearon ER, Vogelstein B: A genetic model for colorectal tumorigenesis. Cell. 1990, 61 (5): 759-767. 10.1016/0092-8674(90)90186-I.PubMedView Article
- Hanahan D, Weinberg RA: The Hallmarks of Cancer. Cell. 2000, 100 (1): 57-70. 10.1016/S0092-8674(00)81683-9.PubMedView Article
- Lee J, Hever A, Willhite D, Zlotnik A, Hevezi P: Effects of RNA degradation on gene expression analysis of human postmortem tissues. FASEB J. 2005, 04-3552fje.
- Maglott D, Ostell J, Pruitt KD, Tatusova T: Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 2005, D54-58. 33 Database
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25 (1): 25-29. 10.1038/75556.PubMed CentralPubMedView Article
- Mi H, Guo N, Kejariwal A, Thomas PD: PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathways. Nucl Acids Res. 2007, 35 (suppl_1): D247-252.PubMed CentralPubMedView Article
- Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, et al: KEGG for linking genomes to life and the environment. Nucl Acids Res. 2008, 36 (suppl_1): D480-484.PubMed CentralPubMed
- Irizarry R, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, Normalization, and Summaries of High Density Oligonucleotide Array Probe Level Data. Biostatistics. 2003, 4 (2): 249-264. 10.1093/biostatistics/4.2.249.PubMedView Article
- Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP: GenePattern 2.0. Nat Genet. 2006, 38 (5): 500-501. 10.1038/ng0506-500.PubMedView Article
- Gentleman R, Carey V, Bates D, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, et al: Bioconductor: open software development for computational biology and bioinformatics. Genome Biology. 2004, 5 (10): R80-10.1186/gb-2004-5-10-r80.PubMed CentralPubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.