Profiling and metaanalysis of epidermal keratinocytes responses to epidermal growth factor

Background One challenge of systems biology is the integration of new data into the preexisting, and then re-interpretation of the integrated data. Here we use readily available metaanalysis computational methods to integrate new data on the transcriptomic effects of EGF in primary human epidermal keratinocytes with preexisting transcriptomics data in keratinocytes and in EGF-treated non-epidermal cell types. Results We find that EGF promotes keratinocyte proliferation, attachment and motility and, surprisingly, induces DUSPs that attenuate the EGF signal. Our metaanalysis identified overlapping effects of EGF with those of IL-1 and IFNγ, activators of keratinocyte in inflammation and wound healing. We also identified the genes and pathways suppressed by EGF but induced by agents promoting epidermal differentiation. Metaanalysis comparison with the EGF effects in other cell types identified extensive similarities between responses in keratinocytes and in other epithelial cell types, but specific differences with the EGF effects in endothelial cells, and in transformed, oncogenic epithelial cell lines. Conclusions This work defines the specific transcriptional effects of EGF on human epidermal keratinocytes. Our approach can serve as a suitable paradigm for integration of new omics data into preexisting databases and re-analysis of the integrated data sets.


Background
Systems biology aims to integrate different levels and types of information to understand how biological organisms function and responds to perturbations. It uses a range of computational methods and relies extensively on massive amounts of data provided by high throughput methods and parallel large-scale measurements. A significant and urgent problem of molecular systems biology is the integration of new data into the preexisting, accumulated data, and then re-interpretation of the newly integrated total information. This is particularly acute in transcriptomics, where microarrays and RNA sequencing generate and daily append large amount of new data into the established databanks. Individual researchers and groups can relatively easily, although not inexpensively, produce and submit new transcriptomics data, but are often stymied trying to incorporate their contributions into the established, preexisting knowledge. Information technology core facilities, on the other hand, while providing data storage and access to analysis programs, seldom understand the specifics of individual investigator's research areas and such cores usually find it difficult to customize their analysis for a particular, distinct project.
Here, we use readily available, web-based, free computational metaanalysis methods to integrate the transcriptional consequences of treating human epidermal keratinocytes with EGF with the related, existing transcriptomics data in public databanks.
EGF was among the first signalling molecules isolated, and was named for its capacity to accelerate epidermal differentiation and eye opening in newborn mice [1]. EGF has become a paradigm for studies of extracellular and intracellular signalling [2][3][4][5]. EGF binds to its receptor, EGFR, causing receptor dimerization and consequent activation of its cytoplasmic kinase [6]. The activated kinase initiates several signal transduction cascades of protein phosphorylation, resulting in activation of transcription factors that regulate expression of many genes [7]. EGF promotes cell survival, proliferation, chemotactic migration, and suppresses terminal differentiation, inhibits apoptosis etc. [8]. EGFR is a member of the ErbB protein family and binds several ligands in addition to EGF, e.g., TGFα, and HB-EGF [9].
The transcriptome changes caused by EGF have not been extensively explored in its eponymous tissue, although EGF is very important in the healthy epidermal homeostasis and in several diseases [10]. Specifically in the epidermis, EGF is known to contribute to wound healing, regulate barrier function, suppress terminal differentiation, cause loss of adhesion, induce secreted proteases etc. [10,11]. EGFR is over-expressed in squamous cell carcinomas [12]. Perhaps understandably, the published microarray analyses of the effects of EGF mainly focused on breast cancer and glioblastoma cell lines, the two types of cancer with EGFR-associated aetiology and have provided valuable data on the molecular effect of its action in several cell lines [13][14][15]. Detailed molecular understanding of the EGFR kinase domain led to development of specific inhibitors, Gefitinib, Erlotinib, Herceptin, Cetuximab, currently used to treat breast, lung, ovarian, prostate, head-and-neck and other cancers [8,16].
To define systematically and comprehensively the transcriptional changes caused by EGF in primary human epidermal keratinocytes, we treated these cells with EGF and compared parallel treated and control cultures using Affymetrix microarrays. We then used metaanalysis programs to integrate the observed changes with a large set of already existing data on transcriptional profiling in epidermal keratinocytes and with data of EGF-treated nonepidermal cell types in GEO database. We found that EGF affects keratinocyte proliferation, attachment and motility and, surprisingly induces DUSPs, phosphatases that attenuate the EGF signal transduction in a feed-back loop. Metaanalysis identified specific overlapping effects of EGF with those of IL-1 and IFNγ, which play a role in keratinocyte activation, inflammation and wound healing. Conversely, we identified the specific genes and processes suppressed by EGF, but induced by pro-differentiation agents. Metaanalysis comparison of EGF effects in other cell types identified extensive similarities between the responses in keratinocytes and in other epithelial cell types, but differences with e.g., endothelial cells. We also found characteristic differences with transformed, oncogenic epithelial cell lines. Our approach can serve as a paradigm for integration of new omics data into preexisting databases and analysis of the integrated data.

Early time points, 1 and 4 hrs
Approximately 520 genes were induced by 50% or more at any of the time points, and 580 were suppressed by the EGF treatment in human epidermal keratinocytes. The regulated genes are listed in the Additional file 1: Table S1. A small number of genes, 44, are both induced and suppressed by EGF, at different time points; of these, 34 are induced early, but suppressed late and 10 have just the opposite regulation pattern.
Intriguingly, among the most prominent induced genes are 5 dual specificity phosphatases, DUSP-1, -2, -4, -5 and −6 ( Table 1 and Additional file 2: Table S3a, Additional file 3: Table S4a). These enzymes de-phosphorylate, and thereby inactivate, the kinases of the ERK pathways [17,18]. As a result of this de-phosphorylation, the signal transduction cascades activated by the EGF are attenuated. Apparently, the induction of the DUSPs is one of the important feed-back mechanisms that limits the intensity and duration of the EGF effects in keratinocytes.
As expected, EGF, being a fundamental and widereaching signal, affects a fairly large number of signalling pathways. Immediately after addition, i.e., in the first hour, EGF suppressed production of 67 genes in the 'signalling' ontological category, including 3 insulin-like growth factor binding proteins, mucins, cytokines and their receptors as well as secreted lectins, enzymes with signalling function, such as kallikrein peptidases, and structural proteins with signalling function, such as cadherins, desmocollins and desmogleins. Presumably, their suppression inactivates the corresponding active signalling pathways, and prepares the cells for new biological processes and reactions. At the 4 h time point, 75 signalling genes are induced, again comprising a wide range of functions and pathways. For example, proteases with signalling function, ADAM8, ADAM22, cathepsins, thrombospondin, coagulation factors and MMPs are induced, but so are protease inhibitors, such as serpins, TIMP1 etc. Also induced at this time point are integrins and components of the extracellular matrix, e.g., Laminin V and tenascin. Particularly noteworthy is the induction of amphiregulin and TGFα (Additional file 2: Table S3b); both are EGFR activating ligands that can sustain or extend the initial signal by EGF [19,20].
Among the regulators of proliferation and cell cycle, which are induced after 1 h of EGF treatment, we find CDK7, the protein thought to serve as a direct link between the regulation of transcription and the cell cycle [21,22]. Transcription factors including BTG2, cyclin G2, GADDα, Jun, Jun B, v-fos etc. are also induced (Additional file 2: Table S3c). Interestingly, regulation of apoptosis is also one of the ontological categories induced early after EGF treatment (Additional file 2: Table S3d). Parenthetically, we note that the negative regulators of apoptosis are suppressed at the 48 h time point (see below, Additional file 3: Table S4k).
EGF strongly affects cell adhesion and motility through induction of focal adhesion, basement membrane and  integrin binding protein categories. Cell adhesion and motility are complex processes that require a large set of coordinated protein functions; EGF affects these processes by inducing certain genes, e.g., α-actinin, cadherin 13, thrombin receptor, SMAD3 and SMAD7, VEGF-A and VEGF-C (Additional file 2: Table S3b, 3e), while suppressing others, e.g., desmocollins 1 and 2, desmoglein 1, cadherins, etc. (Additional file 2: Table S3h). In addition, EGF induced the expression of several proteins that, when mutated, cause in Epidermolysis bullosa, such as Integrin α2, laminin α3, (Additional file 2: Table S3e). It is unclear at present whether EGF-targeted treatment may be beneficial in certain Epidermolysis bullosa patients.
To confirm the proliferative and pro-migratory effects of EGF in human epidermal keratinocytes, we counted the cells in EGF-treated and control cultures, and we examined the re-epithelialization of scratches in culture ( Figure 1). As expected from previous studies and from the results described above, the addition of EGF profoundly improved both the proliferation and migration of keratinocytes.
Importantly, EGF induces defence and wounding response genes in keratinocytes. These include angiogenic signalling proteins, such as VEGF-A, VEGF-C, PTK2B, activin A receptor, cadherin 13, CD59 and CD55, placental growth factor, plasminogen activator, urokinase, TGFα, (Additional file 2: Table S3f ). Additionally, proinflammatory cytokines, including IL-1b and IL-6, are induced. The induction of defence and wounding response genes by EGF is congruent with the known protective role of EGF during skin wound healing [19]. Curiously, certain genes in the ontological categories of defence and wounding responses, such as fibrinogen α, PDGF receptor, lysozyme etc., are simultaneously suppressed by EGF (Additional file 2: Table S3g). However, in general, the function of the suppressed genes in the defence and wounding responses ontological categories is different from the function of the induced ones: the suppressed genes often include immune system modulators, e.g., IL-4, and interferon-α5, as well as the infection fighting proteins e.g., defensin b-1, lysozyme, TLR1, S100A8, etc. The ontological category of wounding response is broad and apparently EGF promotes certain aspects thereof, such as re-epithelialization and wound closure, while repressing others, such as antimicrobial processes and certain aspects of the immune response.
Interestingly, many proteins known as markers of keratinocyte differentiation are suppressed by EGF (Additional file 2: Table S3i). These include keratins KRT1 and KRT10, filaggrin, desmosomal proteins, SPRR and also members of the Ca-binding, S100 family genes.

Late time points, 24 and 48 hrs
Importantly, the DUSPs remain induced in the EGFtreated keratinocytes at the late time points as well, specifically DUSP-4, -5, -6 and −7 (Additional file 3: Table  S4a). At 24 h EGF induces expression of several secreted proteins, such as proteases, including MMP1, 9 and 10, as well as protease inhibitors, such as serpins; presumably, these play a role in matrix remodelling, given that matrix components fibronectin and laminin are also induced (Additional file 3: Table S4b). Among the secreted growth factors we particularly note TGFα and HB-EGF, the EGFR ligands, feed-back extenders of the original signal. We also note the induction of VEGF-A and VEGF-C, two angiogenic factors (see below). Genes promoting proliferation are induced at 24 h, including HB-EGF and TGFα, as well as additional secreted signals and their receptors (Additional file 3: Table S4c). Wound healing and angiogenesis proteins are induced, including CD73, ADAM8, DKK1, endothelin receptor A, fibronectin 1, integrin α2, interferon-γ receptor 2, MMP1, 9 and 10 urokinase, urokinase receptor, plasminogen activator and serpins-B2 and -E2, (Additional file 3: Table S4d). Extracellular matrix structural and remodelling proteins TIMP1, annexin A2, fibronectin 1, laminin γ2, MMPs etc. are induced after 48 h (Additional file 3: Table S4d).
Interestingly, and in contrast with the early time points, certain keratinocyte-specific genes are induced by EGF, including some markers of keratinocyte differentiation. These include S100 calcium binding proteins, SPRRs, involucrin, serpin-B1 etc. (Additional file 3: Table  S4e). Lipid and steroid biosynthesis enzymes, tangential markers of epidermal differentiation, are also induced; these include ATP citrate lyase, sterol isomerase, hydroxysteroid-17-β-dehydrogenase 2 and pyruvate carboxylase. Unexpectedly, 48 h after EGF addition, 8 different metallothioneins are induced (Additional file 3: Table  S4f ); their role in keratinocytes in response to EGF is unknown.
The ontological categories suppressed at 24 and 48 hrs include cell cycle and apoptosis. The cell cycle regulation 48 h after EGF addition continues primarily by suppression of cyclins A2, B1, B2 and CDK6, although notably, CDK inhibitors p19, p21 and p57 are also suppressed (Additional file 3: Table S4k). Desmosome components and cadherins are suppressed at 24 and 48 hrs, as are several keratin genes (Additional file 3: Table  S4h). Unexpectedly, we find KRT16 in the suppressed gene set; KRT16 is a known marker of proliferation and is induced in the early times after EGF addition [23]. The suppression of these cytoskeletal proteins may play a role in keratinocyte adhesion and locomotion.
A set of genes that respond to steroids is suppressed by EGF at the late time points (Additional file 3: Table S4i). This we find curious, because, as noted above, the steroid biosynthesis enzymes are induced (Additional file 2: Table S3e). Perhaps the locally produced steroids accumulate at an even later time point. Alternatively, these steroid-responsive genes are suppressed specifically to modulate the steroid response; this interesting metabolic issue deserves further examination.

Transcription factors
The transcriptional changes in response to EGF are, presumably, effected by changes in the activity of transcription factors (TF). To identify the putative transcription factors responsible, we searched for TF binding sites statistically overrepresented in the regulated genes using DAVID [24]. Interestingly, we find that the induced genes contain an overabundance of BACH1 and BACH2 binding sites (Table 2a). Statistically, the BACH binding sites are the most prominent ones in the regulated genes at all time points. The BACH proteins are basic leucine zipper transcription factors, can function both as transcription activators and as transcriptional repressors and regulate transcription of genes involved in G1/S and G2/ M phases of the cell cycle; they form heterodimers with MafK proteins and are known to be sensitive to oxidative stress [25,26]. Inhibition of BACH1 reduces UV light-caused damage in keratinocytes, apparently through regulating the expression of heme oxygenases [27]. In vitro, BACHs bind to AP1/NF-E2 binding sites, TGA(GC)TCA [28]. We note that transcriptionally functional AP1 sites have been found in a large proportion of keratinocyte differentiation marker genes [29][30][31][32][33]. The expression of the individual members of the AP1 protein family in different layers of the epidermis is controversial. It is, therefore, possible that the BACH proteins perform functions until now attributed to the AP1 proteins.
Our results suggest that the BACH proteins play important roles in keratinocyte proliferation and differentiation, in particular in response to EGF. The genes induced at early time points additionally contain a statistical overabundance of IK3, LMO2COM, PAX3 and SOX5 binding sites. While these results suggest that these four TFs mediate the EGF response, it is also possible that the overrepresented consensus binding sites could be recognized by other, related transcription factors in epidermal keratinocytes.
Much fewer TF binding sites reached statistical significance in the promoters of suppressed genes (Table 2b). We note that CEBPβ sites have been identified at both early and late time point genes. CEBPβ regulates genes involved in immune and inflammatory responses and has been shown to bind to regulatory regions of several acutephase and cytokine genes [34][35][36]. Additionally, binding sites for POU3F2 (a.k.a. BRN2, OCT7) appear overrepresented in the suppressed genes. This transcription factor has not been noted in keratinocytes before (although it is important in melanocytes and melanoma [37]).
As a final point, we note that AP1 sites are common in the suppressed genes as well (marked with asterisks in Table 2), although only at 48 hrs do the AP1 sites

Metaanalysis, part 1: Intersections of genes regulated by EGF and by other extracellular agents in keratinocytes
We compared the lists of genes regulated by EGF with lists of genes identified previously as regulated by extracellular signals, namely Interferon-γ (IFNγ), Interleukin 1 (IL-1), Retinoic acid (RA), Ephrin As and SP600125, a specific inhibitor of JNK [38][39][40][41][42]. The numbers of genes present in each list analyzed are given in Additional file 4: Table S5. We used the Lists2Networks program for comparisons and collected both the matrix of p-values for overlapping of related lists, as well as Bonferonicorrected lists of gene ontology categories 'biological process' and 'molecular function' [43]. We also examined GenMAPP and KEGG pathways, KEA kinase targets, predicted promoter sites and OMIM disease-associated genes. The gene ontology biological process comparisons gave the most complete and detailed results and these will be presented here (Figure 2, Tables 3 and 4); the other categories gave qualitatively similar, if sparser results (not shown).
Comparing lists of EGF-regulated genes revealed very interesting patterns: EGF, IFNγ and IL-1 induce and suppress related sets of genes, while EphrinAs and SP600125 have antiparallel effects: genes induced by EGF are suppressed by EphrinAs and SP600125, and vice versa ( Figure 2). Even more intriguing is the comparison of the EGF-and RA-regulated genes: parallel at 1 h after RA addition, the relationship is antiparallel after 4 and 24 h and lost after 48 hrs (Figure 2).
In comparisons of regulated genes, one finds many redundant or overly general ontological categories, which have to be parsed in order to detect the specific and important ontological categories regulated in parallel. For example, comparing lists of EGF-and IFNγ-regulated  genes (Table 3a), we find in common very general categories, such as 'biological process, ' 'metabolic process' or 'cell communication.' However, we also find specific categories: at early time points both EGF and IFNγ induce common regulators of transcription and, importantly, response to stress and response to wounding. Conversely, both EGF and IFNγ suppress keratinocyte differentiation and epidermis development genes. Another set of stress response genes is suppressed in common by EGF and by IFNγ. At the later time points, 24 and 48 h, proliferation and cell cycle genes are suppressed by both (Table 3a).
A somewhat different picture emerges in the comparisons of the EGF-and IL-1-regulated genes (Table 3b). Like IFNγ, IL-1 induces regulators of transcription and responses to stress and wounding. Importantly, unlike IFNγ, IL-1 also induces sets of genes responsible for cell motility and chemotaxis. Again distinctly from IFNγ, IL-1 induces inhibitors of apoptosis. Similar to IFNγ, IL-1 suppresses keratinocyte differentiation and epidermis development genes. This suppression continues in IL-1 treated keratinocytes longer than in the IFNγ-treated ones, and is still noticeable at 48 h (Table 3b).
Using the Lists2Networks program [43] for comparisons of predicted transcription factor binding sites in the promoters of regulated genes, we see again characteristic overlaps between the genes co-regulated by EGF and IFNγ and those co-regulated by EGF and IL-1 (Figure 3). For example, both EGF and IFNγ activate many transcription factors of the AP1 family, Fos and Jun, and the Rel NFκB proteins after 24 h (Figure 3a). The overlap of transcription factors targets for EGF and IL-1 is sparser Figure 2 Overlaps of the lists of genes regulated by EGF with those regulated by other extracellular agents. We show overlaps with genes regulate by Interferon-γ, IL-1, Retinoic acid, the five EphrinA proteins and the JNK inhibitor SP600125. UP denotes induced, DN suppressed genes. The overlaps with p-value better than 10 -20 are marked with gray. The downward arrows mark the high overlaps between both the induced and the suppressed gene lists. The upward arrows mark the antiparallel overlaps, i.e., between the lists of genes that are induced by EGF, but are suppressed by other agents, and vice versa.  than for EGF and IFNγ; in particular, fewer common AP1 protein binding sites were identified in the genes regulated by both EGF and IL-1.
We also used Lists2Networks to identify the kinases whose targets are overrepresented in the regulated genes [43]. Using the KEA kinase targets analysis, we found that both EGF and IFNγ induce targets of MAPK-1, -3 and −14, as well as of the receptor kinases EGFR, INSR,  a) The EGF and Interferon-γ overlaps b) The EGF and IL-1 overlaps. UP denotes induced, DN suppressed genes. The numbers represent the p-values. While many ontological categories are redundant, closely related or overly general, note that keratinocyte differentiation is suppressed in all three sets, the EGF-, Interferon-γ-, and IL-1-regulated genes. Also note that processes related to cell motility are induced by EGF and IL-1, but not by IFNγ.
JAK-1 and −2. However, only ROCK1 targets are induced already at 1 h, the targets for other kinases are overrepresented in parallel at later times ( Figure 3b). EGF, IFNγ and IL-1 induce a similar family of kinase targets, but this induction occurs earlier in the IL-1-treated cells than in the IFNγ-treated ones (Figure 3b). Despite larger sets of common kinase targets induced by EGF and IL-1, the set of transcription factors activated in common by EGF and IL-1 is sparserfewer AP1 proteins were identified. We note, parenthetically, that the kinases, and even more the transcription factors activated solely by IFNγ or by IL-1 (i.e., those not in common with the EGF-activated set) are very distinct. This suggests that the effects of EGF overlap the distinct subset of the effects of IFNγ and of IL-1. Certain genes induced by EGF are suppressed by EphrinAs, and vice versa (Figure 2). We reported that EphrinAs induce keratinocyte differentiation markers [38,44]; here we show that these markers are suppressed by EGF. The antiparallel effects are evident at both early and late time points (Table 4). SP600125 also induces keratinocyte differentiation [40]; correspondingly, similar antiparallel effects are seen in the EGF-and SP600125treated keratinocytes (Table 4). Moreover, SP600125 induced inhibitors of proliferation, another effect antiparallel to EGF. Interestingly, the antiparallel effects of EphrinAs and SP600125 are distinct and seem to proceed via different mechanisms: of the kinases targets induced by EGF, EphrinAs seem to suppress the targets of MAPK-1, -3 and −14, while SP600125 none at all. Conversely the targets of CDK1 and CDK2 suppressed by EGF at late time point, seem induced by SP600125, but not by EphrinAs (Table 5). Again, the effects of EGF overlap the distinct effects of EphrinAs and of the JNK inhibitor SP600125.
An interesting paradox emerges from the comparison of genes regulated by EGF and by RA (Table 6). At the earliest time point, 1 h after addition of RA, the effects are largely antiparallel; for example, cell motility genes are inhibited by RA, while EGF induces those (Table 6a). At 4 h, both RA and EGF in parallel induce transcription factors and signal transduction and in parallel suppress apoptosis genes (  motion seems induced by both EGF and RA. Interestingly, while MAPKs seem suppressed by RA after 1 h, they appear activated after 4 h. Their activation is not maintained at later time points, e.g., at 24 h, when targets of kinases EGFR, ErbB2 and Insulin receptor appear specifically induced in the EGF treated keratinocytes (Table 6c). The only transcription factors regulated by both EGF and RA to a statistically significant level are the NFκB proteins REL/RELA (Table 6d).

Metaanalysis, part 2: EGF effects in keratinocytes vs. EGF effects in other cell types
EGF is a catholic regulator affecting many different cell types. Curiously, transcriptional responses to EGF have been described in several established cell lines, but not in primary cultures. We compared our data with EGFregulated genes from 4 published studies, GSE6783, GSE6784, GSE10778 and GSE13009 [13,14,45]. These studies were selected because they compare directly EGF-treated vs. untreated cells and, for ease of comparison, because they used Affymetrix microarrays. Interestingly, we find that in epidermal keratinocytes, as well as in HeLa and MCF10A cell lines, EGF regulates many genes in parallel at the early time points, 1 and   (Figure 4). While in keratinocytes there is a very significant overlap between genes regulated at early and at late time points (Figure 4 top), the parallel with HeLa and MCF10A is clear for the induced genes only, not for the suppressed ones. Similarly, the parallel between keratinocytes and HUVEC holds for the induced genes only.
In contrast, MCF7 cell line shows transcriptional effects antiparallel to those of keratinocytes, particularly at the later time points, at 48 h. We find important parallels in the biological processes regulated by EGF in different cell types (Table 7). Specifically, regulation of transcription is affected in all cell types compared, at all time points (Table 7a). Similarly, negative regulators of apoptosis are induced in all cell types, although perhaps a little delayed in HUVEC (Table 7e). On the other hand, regulation of cell motility, signal transduction and cell proliferation are regulated in all four epithelial cell types, but not in the endothelial cells, HUVEC (Table 7b, c, d). Please also note that in MCF7, transformed cells, the regulators of transcription are downregulated at 48 hrs (marked with ! in Table 7).

Discussion
To define comprehensively the molecular effects of EGF in human epidermal keratinocytes we used transcriptional profiling and identified the regulated genes. We    The rightmost column shows the antiparallel overlap, i.e., with the genes suppressed by EGF in MCF7 cells. * Letters ae indicate the fully or largely redundant biological processes.
compared these with genes regulated in keratinocytes by other extracellular signals, and with genes regulated by EGF in other cell types. We found both expected and unexpected classes of regulated genes. Because EGF regulates keratinocyte proliferation, the cell cycle genes are prominently regulated, as expected; conversely, the apoptotic signal genes are suppressed. Similarly, numerous genes related to motility and substrate attachment are induced by EGF, whereas those promoting rigid cellto-cell contacts are suppressed. As expected from such a wide-ranging regulator, EGF induced many signalling proteins, both intra-and extracellular. Unexpectedly, EGF induced multiple members of the DUSP family; an important function of the DUSPs is to attenuate and dampen the signalling via ERKs, kinase members of the EGF-responsive signalling cascades. Thus, EGF induces its own feed-back mechanism, presumably to restrain uncontrolled proliferation, motility etc. At the same time, EGF induces production of HB-EGF and TGFα, ligands of EGFR, which are expected to augment and extend the signalling. A similar incongruity is seen at the late time points, 24 and 48 h, when both the cell cycle and the apoptosis genes are suppressed by EGF. Apparently, both positive and negative feedback are induced by EGF, and which will prevail may depend on additional, EGFindependent signals affecting the keratinocytes. While keratinocytes in healthy epidermis divide in the basal layer and differentiate as they move outward to the surface of the skin, in wound healing and in various pathological conditions keratinocytes become 'activated'they hyper-proliferate, migrate and elaborate complex defence mechanisms to protect the tissues and the organism beneath [46]. EGF is one such keratinocyte activating signal and induces production of proinflammatory and angiogenic cytokines and growth factors. Unexpectedly, EGF suppresses the production of antibacterial defensins and certain immunomodulators. Perhaps, then, the primary function of EGF in keratinocyte activation is in wound healing and re-epithelialization, rather than in immune responses and in fighting infection. We find that some of the activation-associated subsets of genes are coregulated by EGF and IFNγ, some by EGF and IL-1 and yet other gene subsets by all there. This reflects the different basic functions of proliferative, inflammatory and immunomodulating cytokines and growth factors.
The regulation of keratinocyte differentiation markers by EGF is complex: while the expression of one subset of markers is suppressed, another subset is induced. Proliferation and differentiation are incompatible: promoting cell proliferation diverts the cells from differentiating and consequently, certain differentiation markers are suppressed by EGF. Unexpectedly however, lipid and steroid biosynthesis enzymes are induced by EGF. Lipids, including steroids, are essential for epidermal barrier function and are necessary for cornification, the final stage of epidermal differentiation. Lipid biosynthesis enzymes would not be expected to interfere with cell proliferation, unlike e.g., large keratins or filaggrin, and thus may not be incompatible with the proliferative effects of EGF in keratinocytes. While both EGF and RA generally inhibit keratinocyte differentiation, paradoxically, at the earliest time points many of the genes induced by one are suppressed by the other and vice versa. Only at later time points do EGF and RA regulate parallel sets of genes. Conversely, SP600125 and EphrinAs promote keratinocyte differentiation. However, the differentiationassociated genes affected by SP600125 and those affected by EphrinAs seem to be targets of different signalling pathways. These results suggest that epidermal differentiation is 'modular' , not a fully integrated all-or-nothing process; for example, while subsets of differentiation markers are suppressed by EGF, other subsets are induced.
Importantly, metaanalysis of EGF regulation in keratinocytes and other cell types shows important parallel and antiparallel effects. Keratinocytes, HeLa and MCF10A cells show the highest parallelism. All three are epithelial and non-tumorigenic, and in these three cell types signal transduction, proliferation, motility etc. are regulated in parallel. In contrast, in HUVEC, which are not epithelial but endothelial cells, motility, proliferation and signal transduction are regulated differently. MCF7 cell line presents an interesting paradox: it is epithelial but also tumorigenic, migratory and invasive. While some of the early responses to EGF are parallel in MCF7 cells to those in other epithelial cell types, at late time points they become antiparallelgenes and processes induced by EGF in keratinocytes, HeLa and MCF10A are suppressed in MCF7, and vice versa. These conclusions agree with previous results demonstrating characteristic cytoskeleton proteins, attachment and cell spreading in the MCF7 cells [47,48].

Conclusions
In conclusion, we present here a paradigm for systems biology approach for integration of disparate sources of transcriptional data. We used the effects of EGF in its eponymous tissue, defined the transcriptional profile changes and compared these with transcriptional profile changes in the same cell type by different signalling, and by the same signal in different cell type. The approach can be widely generalized for similar studies.
Specifically, we find that EGF induces characteristic aspects of keratinocyte activation, comprising proliferation and motility, but not activation of the immune response. These aspects partially overlap the effects of IFNγ and IL-1. At the same time, EGF suppresses certain aspects of keratinocyte differentiation, namely expression of many known differentiation markers, but not of others, e.g., lipid and steroid biosynthesis enzymes. We conclude that both activation and differentiation of keratinocytes are multifaceted, modular processes, rather than all-or-nothing events, and that distinct, characteristic and overlapping subsets of genes are regulated by different extracellular signals.

Human keratinocyte cultures
The growth and treatment of keratinocytes has been described [38,42,49]. Briefly, normal human neonatal foreskin epidermal keratinocytes were obtained from Dr. M. Simon (Living Skin Bank, Burn Unit, State University of New York, Stony Brook, NY) and grown in a defined serum-free keratinocyte growth medium (Keratinocyte-SFM, Invitrogen) supplemented with 2.5 ng/ml epidermal growth factor, 0.05 mg/ml bovine pituitary extract, and 1% penicillin/streptomycin (keratinocyte growth medium) at 37°C in 5% CO2 incubator. Generally, third passage cells were used at 70-80% confluence. We changed the medium to EGF-free and pituitary extract-free medium 24 h before the EGF treatment to starve the cells for EGF and to avoid the effects of the supplements in growth medium. Keratinocytes, grown on 100-mm plates, were then treated with human EGF (20 ng/ml, Invitrogen) or left untreated as controls. Parallel cultures of treated and control cells were harvested by scraping 1, 4, 24 and 48 h after treatment.
In our previous studies we used the same protocol for the effects of Interferon-γ, Interleukin-1, and Ephrin As, namely, the keratinocytes were pre-incubated in EGF-free, pituitary extract-free medium for 24 h before the addition of these agents [38,41,42]. In our experiments using SP600125 and retinoic acid we did not pre-incubate the cells but added the drug directly to the complete medium, i.e., containing both EGF and pituitary extract [39,40].

Cell count and the migration assay
Keratinocytes were grown on 12-well plates to 80% confluency. The cells in un-supplemented medium were treated with EGF, or left untreated as controls. After 24 and 48 h the cells were trypsinized and counted using a hemocytometer. Three wells were measured at each time point in each treatment. The error bars represent the highest and lowest measured values.
Keratinocytes were grown on 12-well plates up to 80-90% confluency as above, and treated with 8 μg/ml mitomycin-C for 1 hour to prevent cells from dividing. After washing, the keratinocyte monolayers were scratched with a 200 μl pipette tip in each well to form a cross. Then EGF (20 ng/ml) was added with the fresh medium. Media were changed every 24 hours. At the 0, 24, and 48 h time points after the treatment, pictures were taken under a microscope.

Preparation of labelled cRNA and geneChip hybridization
We isolated total RNA from treated and untreated keratinocytes using RNeasy kits (Qiagen) according to the manufacturer's instructions. Approximately 5 μg of total RNA was reverse transcribed, amplified, and labelled as described [49]. Fifteen micrograms of labelled cRNA was fragmented and hybridized to HG-U95Av2 arrays (Affymetrix). Arrays were washed, stained with anti-biotin streptavidin-phycoerythrin-labelled antibody, and scanned using the Agilent GeneArray scanner system (Hewlett- Figure 5 Flowchart of analysis. The data flow from rectangle to rectangle; the programs are named in bold italics and the analyses to be performed by the investigator using spreadsheet are in ovals. a) An example of large spreadsheets is given in Additional file 1: Table S1. b) An example of Tables is given in Additional file 2: Table  S3, of Charts in Table 1 and of Clusters in Additional file 5: Table S2. c) An example of Matrices is given in Figure 2, of Kinase Targets in Table 5 and of Ontological Categories in common in Table 7.