Global gene expression analyses of hematopoietic stem cell-like cell lines with inducible Lhx2 expression
© Richter et al. 2006
Received: 25 January 2006
Accepted: 06 April 2006
Published: 06 April 2006
Expression of the LIM-homeobox gene Lhx2 in murine hematopoietic cells allows for the generation of hematopoietic stem cell (HSC)-like cell lines. To address the molecular basis of Lhx2 function, we generated HSC-like cell lines where Lhx2 expression is regulated by a tet-on system and hence dependent on the presence of doxycyclin (dox). These cell lines efficiently down-regulate Lhx2 expression upon dox withdrawal leading to a rapid differentiation into various myeloid cell types.
Global gene expression of these cell lines cultured in dox was compared to different time points after dox withdrawal using microarray technology. We identified 267 differentially expressed genes. The majority of the genes overlapping with HSC-specific databases were those down-regulated after turning off Lhx2 expression and a majority of the genes overlapping with those defined as late progenitor-specific genes were the up-regulated genes, suggesting that these cell lines represent a relevant model system for normal HSCs also at the level of global gene expression. Moreover, in situ hybridisations of several genes down-regulated after dox withdrawal showed overlapping expression patterns with Lhx2 in various tissues during embryonic development.
Global gene expression analysis of HSC-like cell lines with inducible Lhx2 expression has identified genes putatively linked to self-renewal / differentiation of HSCs, and function of Lhx2 in organ development and stem / progenitor cells of non-hematopoietic origin.
A small number of hematopoietic stem cells (HSCs) are responsible for the continuous production of mature blood cells throughout life. This production depends on the capability of the HSC to replenish itself through self-renewal [1–3] and to differentiate into all hematopoietic lineages. Consequently, analysis of the mechanisms underlying HSC self-renewal and differentiation is fundamental for understanding the maintenance of the normal hematopoietic system. At present, our knowledge of these processes at the molecular and cellular level is limited, since HSCs are rare in hematopoietic organs and are thus difficult to obtain in sufficiently large quantities for direct studies. An increase in the number of HSCs occurs under normal physiological conditions in the liver during embryonic development, indicating that the microenvironment in the fetal liver efficiently promotes self-renewal of HSCs. Elucidation of the mechanisms responsible for the expansion of the hematopoietic system during embryonic development might therefore offer insights into the mechanisms of self-renewal in the hematopoietic system.
The expansion of the hematopoietic system is intimately connected with the development of the liver, suggesting overlapping molecular mechanisms of these processes. Liver development in the mouse is initiated at embryonic day 8 (E8) when a distinct region of the ventral foregut endoderm receives inductive signals from two adjacent tissues, the septum transversum mesenchyme and the pre-cardiac mesoderm. Ventral foregut endodermal cells committed to hepatic fate proliferate and form a liver bud from which hepatoblasts migrate and intermingle with cells of the septum transversum mesenchyme. The mesenchymal cells originating from the septum transversum thereby contribute to the mesenchymal part of the liver, and development into a functional organ relies on continuous interactions between the mesenchymal and endodermal portions of the liver [6–8]. At E10 the liver has become a distinct organ with discernible lobes and is infiltrated by numerous hematopoietic cells, and cells with HSC properties can be detected in the liver by E11.
Members of the LIM-homeodomain transcription factor family play critical roles during embryonic development in both vertebrates and invertebrates by controlling processes such as asymmetric cell division, tissue specification and differentiation of specific cell types. One member of this family, Lhx2, is of particular interest, based on its function in the development of several different tissues via mesenchymal-epithelial interactions and regulation of stem/progenitor cells [11–17]. Lhx2 is expressed in the liver-associated septum transversum mesenchyme that becomes an integral part of the liver, and its expression is maintained in hepatic stellate cells throughout liver development until the adult stage [12, 16]. Lhx2-/- embryos display a decreased liver size that is manifest already at E10.5, suggesting that Lhx2 is required for expansion of the fetal liver [12, 17]. The mutant phenotype is due to the presence of activated hepatic stellate cells causing a fibrotic and disorganized liver containing phenotypically abnormal endodermal cells [12, 16]. The mesenchymal defect in the liver of Lhx2-/- mice causes a lethal anemia, which is cell non-autonomous since the Lhx2-/- hematopoietic cells appear to be normal, suggesting that the mutant microenvironment is unable to support hematopoietic development. These observations indicate that Lhx2 expression in hepatic stellate cells is involved in mesenchymal-epithelial cell interactions important for liver expansion, organization, differentiation and formation of the hematopoietic microenvironment in the fetal liver.
To further elucidate the putative role of Lhx2 in the hematopoietic system we ectopically expressed Lhx2 in hematopoietic progenitor/stem cells derived from embryonic stem (ES) cells differentiated in vitro and from adult bone marrow (BM) cells. This approach allowed for the generation of immortalized multipotent and Steel factor-dependent hematopoietic progenitor cell (HPC) lines [18, 19]. The HPC lines share several characteristics with normal HSCs such as response to specific cytokines/growth factors, expression of transcription factors and interactions with stromal cells [18, 20]. The pattern of cell surface markers expressed by HPC lines derived from ES cells and adult BM is similar to that of early fetal and adult HSCs, respectively. The HPC lines derived from adult BM can generate erythroid, myeloid and lymphoid cells following transplantation into lethally irradiated recipients, and can long-term engraft stem cell-deficient mice. The cells engrafting the stem cell-deficient mice maintain a high level of Lhx2 expression in vivo, which eventually leads to a chronic myeloproliferative disorder resembling human chronic myeloid leukemia. Thus, ectopic Lhx2 expression in hematopoietic cells allows for the generation of HSC-like cell lines, and molecular analyses of these HSC-like cell lines would provide information on Lhx2-induced self-renewal of HSCs, and hence novel insights into stem cell physiology and pathology.
To elucidate the molecular basis of Lhx2-induced self-renewal of HSCs we generated HSC-like cell lines from ES cells differentiated in vitro with Lhx2 expression controlled by a tetracycline-responsive element, and hence dependent on the presence of the tetracycline analogue doxycyclin (dox) in the culture media. These dox-dependent hematopoietic progenitor cell (DoxHPC) lines down-regulate Lhx2 expression by almost two orders of magnitude within 24 h after dox withdrawal, leading to rapid differentiation into various myeloid cell types. We used the DoxHPC lines to analyse Lhx2 function by comparing global gene expression in the presence of dox to different time points after dox withdrawal using cDNA array technology. This approach identified 267 genes differentially expressed at all time points, and thus putatively involved in Lhx2 function in stem cell self-renewal and/or differentiation, and during organ development.
Number of array features and corresponding genes that are differentially expressed in the various comparisons. A false-discovery rate of 0.01 is used, with and without an M-value cut-off of 0.4.
p < 0.01 (fdr)
p < 0.01 (fdr) and |M| > 0.4
A (36 vs. 0 h)
B (72 vs. 0 h)
C (96 vs. 0 h)
D (0 h vs. 0 h)
HPC1 vs. HPC7 (0 h)
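The two selection criteria in this table (FDR-adjusted p < 0.01, with or without |M| > 0.4) can be sketched as a simple filter. This is only an illustration: the text does not state which FDR procedure was used, so the Benjamini-Hochberg adjustment below is an assumption, and the function names and toy feature values are hypothetical rather than data from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order.

    Assumption: the source only says "fdr"; BH is used here as a common choice.
    """
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def select_features(features, fdr=0.01, m_cutoff=None):
    """Select feature names passing the FDR threshold and optional |M| cut-off.

    features: list of (name, raw_p_value, m_value) tuples.
    """
    adjusted = benjamini_hochberg([p for _, p, _ in features])
    selected = []
    for (name, _, m_value), q in zip(features, adjusted):
        if q < fdr and (m_cutoff is None or abs(m_value) > m_cutoff):
            selected.append(name)
    return selected


# Hypothetical toy input, not values from the arrays.
toy_features = [
    ("feature_a", 0.00001, -3.0),
    ("feature_b", 0.0002, 0.3),
    ("feature_c", 0.004, 1.2),
    ("feature_d", 0.5, 0.1),
]

fdr_only = select_features(toy_features)
fdr_and_m = select_features(toy_features, m_cutoff=0.4)
```

In this toy case three features pass the FDR threshold alone, and the small-effect feature is dropped once the |M| cut-off is applied, which mirrors the two columns of the table.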
The top 25 down-regulated genes in the DoxHPC lines after dox withdrawal. The columns 36 h, 72 h and 96 h refer to hours after dox withdrawal and include the M-values (=log2 [expression in differentiated/expression in undifferentiated]). The M-value of Lhx2 down-regulation is equivalent to a fold-change of approximately 8.
LIM homeobox protein 2
Serine (or cysteine) proteinase inhibitor, clade A, member 3G
Cysteine and glycine-rich protein 2
Phospholipid scramblase 1
Myelin and lymphocyte protein, T-cell differentiation protein
Sorbitol dehydrogenase 1
Uridine phosphorylase 1
Synaptic nuclear envelope 2
DNA segment, KIST 4
UDP-N-acetyl-alpha-D-galactosamine: polypeptide N-acetylgalactosaminyl transferase 2
Ets variant gene 5
RIKEN cDNA 5930433N17 gene
Lysosomal-associated protein transmembrane 4B
RIKEN cDNA B230104P22 gene
RIKEN cDNA 1810009M01 gene
SRY-box containing gene 4
RIKEN cDNA 2810408E11 gene
Milk fat globule-EGF factor 8 protein
Serine (or cysteine) proteinase inhibitor, clade A, member 3N
Chemokine-like factor super family 4
The top 25 up-regulated genes in the DoxHPC lines after dox withdrawal. The columns 36 h, 72 h and 96 h refer to hours after dox withdrawal and include the M-values (=log2 [expression in differentiated/expression in undifferentiated]).
Solute carrier family 2 (facilitated glucose transporter), member 3
BCL2/adenovirus E1B 19kDa-interacting protein 1, NIP3
Carbonic anhydrase 2
Enolase 1, alpha non-neuron
Aldolase 1, A isoform
RIKEN cDNA 2410129H14 gene
N-myc downstream regulated gene 1
Neutrophil cytosolic factor 1
PERP, TP53 apoptosis effector
Phosphoglycerate kinase 1
Triosephosphate isomerase 1
Minichromosome maintenance deficient 10 (S. cerevisiae)
Thioredoxin interacting protein
Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha 1 polypeptide
RIKEN cDNA 2610034M16 gene
RIKEN cDNA 9830134C10 gene
RIKEN cDNA 2610034H20 gene
Lactate dehydrogenase 1, A chain
RIKEN cDNA A830010M20 gene
Cytoplasmic tyrosine kinase, Dscr28C related (Drosophila)
Twenty-six of the 337 probes that were differentially expressed at all three time points were not included in UniGene build 144. We used Blast sequence similarity searches to annotate the corresponding ESTs and the results are provided in Additional file 1. Several of the unknown probes map to either 5' or 3' UTRs, and some map partly to known genes such as Wbscr5, Map3k7, and Eifay/Eifax. In addition, one of the probes [GenBank:CX207886] has no matches in any public sequence repository. Whether these transcripts correspond to novel genes or novel splice variants of already defined genes remains to be determined.
The most regulated groups of genes. Enriched biological processes (level 3), molecular functions and chromosome distribution as defined by Gene Ontology, and pathways as defined by KEGG, of the differentially expressed genes.
REGULATION OF SIGNAL TRANSDUCTION
COENZYME AND PROSTHETIC GROUP METABOLISM
TRANSFERASE ACTIVITY, TRANSFERRING GLYCOSYL GROUPS
OXIDOREDUCTASE ACTIVITY, ACTING ON PEROXIDE AS ACCEPTOR
PROTEASE INHIBITOR ACTIVITY
CARBOHYDRATE KINASE ACTIVITY
TRANSFERASE ACTIVITY, TRANSFERRING PHOSPHORUS-CONTAINING GROUPS
CARBON-OXYGEN LYASE ACTIVITY
PURINE NUCLEOTIDE BINDING
PENTOSE PHOSPHATE PATHWAY
GLYCOLYSIS / GLUCONEOGENESIS
FRUCTOSE AND MANNOSE METABOLISM
STARCH AND SUCROSE METABOLISM
Collectively, these results show that Lhx2 expression partly maintains the HSC signature and hence that the DoxHPC lines with inducible Lhx2 expression are a relevant model system for normal HSCs also at the level of global gene expression.
The fifty genes that show the largest fold-change (up- or down-regulated) after dox withdrawal are presented in Tables 2 and 3. The list of down-regulated genes contains two members of the serpin family of protease inhibitors, Serpina3g and Serpina3n, where the former is the most highly differentially expressed gene. In general, serpins are involved in diverse processes such as coagulation, extracellular matrix degradation, complement activation, fibrinolysis and apoptosis. The expression pattern of Serpina3n has not been well characterized, but expression of Serpina3g is highly enriched in normal HSCs, and in the hematopoietic precursor cell line FDCP-Mix. The exact biological function of Serpina3g is not known, but it has been shown to involve inhibition of caspase-independent cell death. This is in agreement with the observed increase in apoptosis after turning Lhx2 expression off, which is also reflected by the observation that two pro-apoptotic genes, Bnip3 and Perp [35, 36], are among the 25 most up-regulated genes (Table 3). Over-expression of Serpina3g in FDCP-Mix cells caused a delay in differentiation and increased the clonogenic potential of these cells, suggesting a similar function in the Lhx2-expressing HPC lines and normal HSCs.
Most genes encoding enzymes within the glycolytic pathway were up-regulated after Lhx2 expression was turned off, but one highly differentially expressed gene within this pathway, Bpgm (2,3-bisphosphoglycerate mutase), was an exception as it showed a decrease of 75%. This enzyme is known to be present in erythrocytes where it is involved in the reaction generating the metabolite 2,3-bisphosphoglycerate, which is important for the oxygen-hemoglobin interaction. However, Bpgm is also expressed during early developmental stages (egg and pre-implantation stages), suggesting that it may affect other cellular processes.
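Because expression changes are reported both as M-values (log2 of the expression ratio, differentiated over undifferentiated) and as fold-changes or percentages, a short conversion sketch may help relate the two. The helper names are hypothetical; the inputs are the figures quoted in the text (an approximately 8-fold down-regulation of Lhx2, a 75% decrease of Bpgm, and the |M| > 0.4 cut-off).

```python
import math


def ratio_to_m(ratio):
    """M-value for an expression ratio (differentiated / undifferentiated)."""
    return math.log2(ratio)


def m_to_ratio(m_value):
    """Expression ratio corresponding to an M-value."""
    return 2.0 ** m_value


# An 8-fold down-regulation is a ratio of 1/8, i.e. M = -3.
m_lhx2 = ratio_to_m(1 / 8)

# A 75% decrease leaves 25% of the original level, i.e. M = -2.
m_bpgm = ratio_to_m(1 - 0.75)

# The |M| > 0.4 cut-off corresponds to a ratio of roughly 1.32 in either direction.
cutoff_ratio = m_to_ratio(0.4)
```

Read this way, the |M| > 0.4 threshold admits changes of about 32% or more, which is far smaller than the changes seen for the top-ranked genes.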
Genes linked to both normal and abnormal hematopoietic progenitor cell function were identified among the top 50 most differentially expressed genes. For example, among the most down-regulated genes, Sox4 and Plscr1 (Phospholipid scramblase 1) have both been reported to be enriched in HSCs [29, 30]. Although the normal function of Sox4 in HSCs is not known, over-expression of Sox4 in mouse HSCs has recently been shown to cause myeloid leukemia, and Sox4 is also a frequent insertional mutagenesis target in murine myeloid leukemias [38, 39]. Plscr1-deficient mice have no major hematopoietic defects, but Plscr1-/- hematopoietic progenitor cells show decreased colony formation and impaired differentiation to neutrophils in response to Steel factor and G-CSF. Examples of the most up-regulated genes were Ndrg1 (N-myc downstream regulated gene 1) and Ncf1 (Neutrophil cytosolic factor 1). Up-regulation of these genes would be expected since the DoxHPC lines under the culture conditions used herein differentiate towards both neutrophil and mast cell lineages, and these genes are linked to the development and/or function of these cell types [41, 42].
Etv5 and Csrp2 (cysteine and glycine-rich protein 2), which also showed a significant decrease in expression after turning Lhx2 expression off, have been implicated in mesenchymal-epithelial interactions during embryogenesis [43, 44], similar to Lhx2. Interestingly, both Csrp2 and Lhx2 are specifically expressed in hepatic stellate cells of the liver [12, 45]. Chronic liver injury causes activation of hepatic stellate cells leading to hepatic fibrosis, and both Lhx2 and Csrp2 have been negatively implicated in this process [16, 45], suggesting overlapping function of these genes also in non-hematopoietic cells. In this respect it is interesting to note that one of the top 25 up-regulated genes is P4ha1 (proline-4-hydroxylase), which encodes an enzyme important for post-translational modifications of collagen, leading to increased stability of collagen fibrils. Expression of P4ha1 is also up-regulated in the fibrotic Lhx2-/- fetal liver as compared to the normal fetal liver. Thus, analyses of differentially expressed genes identified in this hematopoietic-based system might also give insights into Lhx2 function (and lack of function) in non-hematopoietic cells.
We have generated HSC-like cell lines by differentiating ES cells with inducible Lhx2 expression in vitro. Self-renewal of these DoxHPC lines is strictly dependent on Lhx2 expression, since down-regulation of Lhx2 expression leads to loss of cobblestone area-forming cells (CAFCs) and rapid and efficient differentiation into a variety of myeloid cells. To elucidate the putative function of Lhx2 in stem cells, we used microarray technology to compare global gene expression in these cell lines in the presence of dox (Lhx2 expression on) to different time points after dox withdrawal (Lhx2 expression off). We identified 267 genes (141 down-regulated and 126 up-regulated) that showed differential expression at all time points (36, 72 and 96 hours) after Lhx2 expression was turned off. Gene Ontology classification revealed that genes related to 'regulation of signal transduction', 'organogenesis' and 'cell death' were over-represented among the down-regulated genes (i.e. "stem cell-specific"), and that metabolism-related genes were over-represented among the up-regulated genes (i.e. "differentiation-specific").
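The requirement that a gene be differentially expressed at all three time points amounts to a set intersection across the 36, 72 and 96 hour comparisons, followed by a split into down- and up-regulated genes. The sketch below illustrates this with hypothetical gene identifiers and toy M-values; requiring a consistent sign at every time point is an assumption made for the illustration, not a rule stated in the text.

```python
def classify_consistent_genes(significant_by_timepoint):
    """Split genes significant at every time point into down- and up-regulated sets.

    significant_by_timepoint: dict mapping a time label to a dict of
    {gene: M-value} for the genes called significant at that time point.
    Genes whose sign differs between time points are left unclassified
    (an assumption made for this sketch).
    """
    tables = list(significant_by_timepoint.values())
    shared = set(tables[0])
    for table in tables[1:]:
        shared &= set(table)

    down, up = set(), set()
    for gene in shared:
        values = [table[gene] for table in tables]
        if all(v < 0 for v in values):
            down.add(gene)
        elif all(v > 0 for v in values):
            up.add(gene)
    return down, up


# Hypothetical toy data, not measurements from the study.
toy = {
    "36 h": {"gene_a": -3.0, "gene_b": -1.0, "gene_c": 1.2, "gene_d": 0.8},
    "72 h": {"gene_a": -3.1, "gene_c": 1.5, "gene_d": -0.6, "gene_e": 2.0},
    "96 h": {"gene_a": -2.9, "gene_b": -1.1, "gene_c": 1.4, "gene_d": 0.9},
}

down_genes, up_genes = classify_consistent_genes(toy)

# The reported totals are consistent: 141 down + 126 up = 267 genes.
reported_total = 141 + 126
```

Here `gene_b` is dropped because it is missing at 72 h, and `gene_d` is dropped because its direction changes between time points.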
We have previously shown that the HPC lines generated by Lhx2 expression resemble normal HSCs in many basic molecular, cellular and biochemical aspects [18–20, 23, 47]. Moreover, the HPC lines derived from bone marrow that engrafted stem cell-deficient mice maintained Lhx2 expression in vivo and caused myeloproliferation. We therefore anticipated identifying genes involved in both physiological and pathological processes affecting HSCs by analyses of global gene expression of the DoxHPC lines. A comparison of the differentially expressed genes to previously defined HSC gene expression signatures revealed that a majority of the overlapping genes expressed in HSC-containing populations were those down-regulated after Lhx2 expression was turned off. Thus, although we have used a different microarray platform, the genes defined as stem cell-specific in our system (i.e. the 144 Unigene clusters that belonged to the group of genes down-regulated after dox withdrawal) show a 21% and 23% overlap with genes defined as HSC-specific in the two different stem cell databases, respectively (see Additional file 4). Furthermore, thirteen of the 144 Unigene clusters (9%) defined as stem cell-specific in our study overlapped with both HSC databases (see Additional file 4). In interpreting the results from the comparison between our stem cell-specific genes and the HSC-enriched genes in Ivanova et al. and Ramalho-Santos et al., it should be noted that the probes on the arrays in those studies only have an overlap with our array of 50% and 57%, respectively. The overlap from our model system for HSCs could be further viewed in relation to a study showing that the overlap between three different HSC-specific datasets varied between 19% and 32%.
It should be noted that, although these studies were carried out using the same Affymetrix GeneChips, identical analysis approaches and essentially the same cell populations, there is still considerable variability between the different HSC-specific datasets. Moreover, a majority of the genes that were up-regulated after Lhx2 expression was turned off were preferentially expressed in differentiated cell populations. These results suggest that there is molecular overlap in the self-renewal and differentiation process between the DoxHPC lines and normal HSCs also at the level of global gene expression. We could also identify genes linked to malignant transformation of hematopoietic cells. One such example is Sox4, which was recently shown to cause acute leukemia if over-expressed in HSCs. Sox4 has also been identified as a frequent insertional mutagenesis target in murine myeloid leukemias, further supporting a role of this gene in malignant transformation. Since Lhx2 is always integrated into the same position, close to the Hprt1 gene, in the DoxHPC lines, expression of insertional mutagenesis target genes is not due to insertional mutation, as has been suggested for retroviral integration. Hence, these genes are more likely to be linked to Lhx2 function and/or stem cell function. Human chronic myeloid leukemia cells have been reported to express Lhx2, although its function in the disease is not clear. Another gene down-regulated after Lhx2 expression is turned off is Prkd2 (Protein kinase D2, see Additional file 1), which has also been shown to be a substrate for the BCR-ABL fusion protein present in chronic myeloid leukemia cells. These results reveal a putative functional link between Lhx2 and the BCR-ABL fusion protein. However, both Sox4 and Prkd2 have been shown to be preferentially expressed in HSCs [29, 30], which suggests overlapping molecular mechanisms regulating both HSC physiology and pathology.
By using our inducible system it would be possible to dissect these processes on the molecular, cellular and biochemical level.
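The overlap figures quoted earlier in this discussion (for example, 13 of 144 Unigene clusters, about 9%, shared with both HSC databases) follow from simple set arithmetic. The sketch below shows one way such a percentage could be computed, with an optional restriction to clusters represented on both array platforms, since the text notes that probe coverage between platforms was only partial. All identifiers are hypothetical placeholders, and the restriction step is an assumption about how one might correct for coverage, not a description of the analysis performed.

```python
def overlap_percent(our_clusters, reference_clusters, measurable=None):
    """Percentage of our clusters that also occur in a reference signature.

    measurable: optional set of clusters represented on both platforms; if
    given, both lists are restricted to it first (assumed correction step).
    """
    ours = set(our_clusters)
    reference = set(reference_clusters)
    if measurable is not None:
        ours &= set(measurable)
        reference &= set(measurable)
    if not ours:
        return 0.0
    return 100.0 * len(ours & reference) / len(ours)


# Placeholder identifiers standing in for the 144 down-regulated Unigene clusters.
stem_cell_specific = {f"cluster_{i}" for i in range(144)}

# Placeholder reference set sharing 13 clusters with ours, plus unrelated entries.
in_both_databases = {f"cluster_{i}" for i in range(13)} | {"other_1", "other_2"}

shared_percent = overlap_percent(stem_cell_specific, in_both_databases)
```

With 13 shared clusters out of 144 the function returns roughly 9.0, matching the rounded value given in the text.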
It has been suggested that the Notch pathway and pathways activated by different morphogens such as Wnt, Bone morphogenetic proteins (BMP) and hedgehog are important regulators of stem cell function [52–55]. We have previously shown that the Lhx2-induced HPC lines self-renew by a cell non-autonomous mechanism, and the anemia in Lhx2-/- mouse embryos is due to a cell non-autonomous mechanism, suggesting that Lhx2 regulates genes encoding proteins involved in cell-cell interactions. However, we have not found any evidence for differential expression of major mediators of the Notch or Wnt pathways in the DoxHPC lines after turning Lhx2 expression off. Although the microarrays we used do not contain all mediators of these pathways, the results suggest that the mechanism whereby Lhx2 expression generates HPC lines is partly or completely independent of at least the Wnt and Notch pathways. We did find consistent down-regulated expression of the hedgehog receptor Smo (Smoothened) after Lhx2 expression was turned off (see Additional file 1), and Smo has also been shown to be preferentially expressed in normal HSCs (see Additional file 4) [29, 30]. However, expression of the ligands (Sonic, Indian and Desert hedgehog) was very low to undetectable by real-time PCR analyses of the cell lines (data not shown). Thus, the functional role, if any, of hedgehog signalling in these cells remains to be determined. Furthermore, Tsg (Twisted gastrulation), which is a soluble modulator of BMP signalling, was identified among the genes down-regulated after Lhx2 expression was turned off (see Additional file 1), and is also defined as an HSC-specific gene (see Additional file 4) [29, 30]. The functional relevance of this observation remains to be determined, as Tsg protein has been shown to act as a BMP antagonist as well as a BMP agonist [56, 57].
Lhx2 appears to play an important role during embryonic development in the formation of several different tissues via mesenchymal-epithelial interactions and/or regulation of stem/progenitor cells [11–17]. One of our aims was therefore to identify gene(s) directly or indirectly regulated by Lhx2, possibly providing insights into the function of Lhx2 in these processes. A caveat to this assumption is that it is not known whether Lhx2 functions as a transcriptional activator, repressor, or both. The only reported target genes putatively regulated by Lhx2 in a direct manner are those encoding olfactory receptors, and these receptors are not expressed in Lhx2-/- olfactory epithelia [11, 13]. These observations suggest that genes activated by Lhx2 are down-regulated in Lhx2-/- animals, and conversely, genes repressed by Lhx2 are up-regulated in Lhx2-/- animals. A putative example of the latter is P4ha1, as this gene was up-regulated after Lhx2 was turned off in DoxHPC lines, and the expression of this gene was also up-regulated in the liver of Lhx2-/- embryos compared to wild-type embryos. Although our analysis was made in hematopoietic cells, almost 50% of the genes analysed among those down-regulated after dox withdrawal (Lhx2 expression turned off) showed an expression pattern similar to that of Lhx2 in different non-hematopoietic organs/cells during embryonic development. Preliminary data suggest, however, that several of these genes show a maintained expression pattern in Lhx2-/- embryos, suggesting that these genes are not direct transcriptional targets for Lhx2. A plausible explanation for this observation is that different genes exert their function via parallel pathways in the regulation, formation or differentiation of a certain tissue but are not dependent on each other for their expression in that particular tissue.
An illustrative example of such parallel but equally important pathways during organ formation can be observed during eye development, where both Lhx2 and Pax6 null mutant (Sey) embryos show a developmental arrest at the optic vesicle stage [17, 59]. However, the expression of these genes appears to be independent of each other, since Pax6 is expressed in the arrested optic vesicle in the Lhx2-/- embryos and Lhx2 is expressed in the arrested optic vesicle in Pax6-/- embryos. Whether the genes identified in this study play an equally important role as Lhx2 in the development of the respective organ remains to be determined. Our approach may therefore have identified both genes that are directly or indirectly regulated by Lhx2, and genes that play an important role together with Lhx2 via parallel pathways but are independent of Lhx2 expression, similar to the Lhx2/Pax6 example during eye formation. These results suggest that elucidation of Lhx2 function in various organs would give further insights into Lhx2 function in the DoxHPC lines and hence into both physiological and pathological processes regulating normal HSCs. Conversely, elucidation of Lhx2 function in DoxHPC lines would give insights into Lhx2 function in various basic processes during embryonic development, such as mesenchymal-epithelial interactions and regulation of stem/progenitor cells in non-hematopoietic tissues. We are at present exploring all alternatives of Lhx2 function presented above.
Expression of the LIM-homeobox gene Lhx2 in hematopoietic cells derived both from ES cells differentiated in vitro and from bone marrow allows for the generation of HSC-like cell lines. To address the molecular basis of Lhx2 function, we have analysed global gene expression in HSC-like cell lines with inducible Lhx2 expression. This approach identified 267 differentially expressed genes, of which 141 (144 Unigene clusters) were down-regulated and 126 (131 Unigene clusters) were up-regulated when Lhx2 expression was turned off. The relevance of this model system for normal HSC function was revealed by the fact that the majority of the differentially expressed genes overlapping with HSC-specific datasets were defined as stem cell-specific in our system, and the majority of the differentially expressed genes overlapping with more differentiated hematopoietic cell populations were defined as differentiation- or commitment-specific in our system. Moreover, gene expression analyses of the down-regulated genes in various tissues during embryonic development revealed that almost half of these genes showed overlapping expression patterns with Lhx2, suggesting a functional link to the development and regulation of various tissues and stem/progenitor cells. Thus, this approach has identified genes putatively linked to self-renewal/differentiation of HSCs, and to the function of Lhx2 in organ development and stem/progenitor cells of non-hematopoietic origin.
The ES cell line Ainv15 was maintained on irradiated mouse embryonic feeder (MEF) cells in Dulbecco's modified Eagle medium (DMEM) (Gibco-BRL, United Kingdom) supplemented with 15% fetal calf serum (FCS) (Boehringer, Germany), 1.5 × 10⁻⁴ M monothioglycerol (MTG) (Sigma, Germany) and leukaemia inhibitory factor (LIF) (Chemicon, Ca, USA). Lhx2 cDNA, or Lhx2 cDNA linked to a green fluorescent protein cDNA preceded by an internal ribosomal entry site (Lhx2-ires-GFP), was inserted into the plox vector and transfected into the Ainv15 ES cells together with Cre recombinase cDNA as previously described, and the cells were subsequently cultured in 200 μg/ml G418 (Gibco-BRL). Clones of ES cells resistant to G418 were isolated, pooled and expanded.
ES cells used for in vitro differentiation were made feeder-independent in serum-free medium as previously described [60, 61]. Briefly, ES cells cultured on MEF cells were trypsinized and transferred to gelatinized culture flasks in N2B27 medium (Gibco-BRL) supplemented with 10⁴ U/ml LIF and 10 ng/ml BMP4 (R&D-systems, United Kingdom). The ES cells were trypsinized and transferred into Iscove's modified Dulbecco's media (IMDM) (Gibco-BRL) supplemented with 15% FCS (Integro Inc., The Netherlands), 4.5 × 10⁻⁴ M MTG and 25 μg/ml ascorbic acid (Sigma) at 10³ cells/ml. Embryoid bodies (EBs) were collected after five days of differentiation, resuspended in trypsin-EDTA and incubated for three minutes. Two ml of FCS was added and the cells were gently passaged through a syringe with a 20-gauge needle. Ten ml of IMDM medium was added; the cells were spun down and resuspended in fresh IMDM medium.
The progenitor assays were carried out in IMDM containing 1% methylcellulose (Fluka, Switzerland) and supplemented with L-glutamine, 300 μg/ml iron-saturated transferrin (Boehringer), 5% protein-free hybridoma medium II (Gibco-BRL), 10% plasma-derived serum (Antech Inc., Tx, USA), 100 ng/ml murine Steel factor (SF) (R&D-systems), 10 ng/ml human IL-6 (R&D-systems), 4 IU/ml erythropoietin (Eprex Janssen-Cilag, Sweden) and with or without 2 μg/ml doxycyclin (dox) (Sigma). 10⁵ or 2 × 10⁵ EB cells were plated in triplicate in a final volume of 1.25 ml in 35-mm Petri dishes (Falcon 1008). Primitive erythroid colonies were scored after 4–6 days of incubation and definitive hematopoietic colonies (e.g. SF/IL-6-responsive) were scored after 10–12 days of incubation.
Individual colonies were randomly picked from the clonal assays after 12 days of incubation, transferred to 96-well plates and expanded in IMDM supplemented with 5% FCS, 1.5 × 10⁻⁴ M MTG, 100 ng/ml SF, 10 ng/ml IL-6 and 2 μg/ml dox. Stable cell lines could be established from all cultures containing cells with blast-like morphology after three weeks in liquid culture. These cell lines were subsequently maintained in this medium at cell densities between 5 × 10⁵ and 2 × 10⁶ cells/ml as previously described. The efficiency of the system was verified by removing dox and analysing Lhx2 transcription by real-time PCR or GFP expression by flow cytometry at different time points after dox withdrawal (Figure 1A and 1B). No cell lines could be established from the control cell line expressing GFP alone or from colonies expanded in the absence of dox, as such cells differentiated into mast cells under these culture conditions. Two cell lines called DoxHPC1 and DoxHPC7 were selected for further gene expression analysis. The DoxHPC7 line was established from the ES cells where GFP was linked to Lhx2 expression.
The gene expression changes in DoxHPC1 and DoxHPC7 cell lines were analysed using the hybridization scheme in Figure 2A. Total RNA prepared from Lhx2 expressing cells was used as reference and compared to total RNA prepared from three time-points (36, 72 and 96 hours) after dox withdrawal. Two independent cell cultures were carried out for both DoxHPC1 and DoxHPC7 lines, providing a first level of replication. Each comparison was further analysed using two replicated hybridizations with the dye assignments reversed, providing a second level of replication. Finally, each probe was printed in duplicate on the arrays, which for each comparison results in eight measurements using three different levels of replication. Additional hybridizations carried out included replicated comparisons between the two separate cultures of each cell line, and a direct replicated comparison between Lhx2-expressing DoxHPC1 and DoxHPC7 cell lines.
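The three levels of replication described above can be enumerated to confirm the count of eight measurements per comparison. The sketch below is an illustration only: the labels are hypothetical, and it assumes that each of the two cultures was hybridized twice (once per dye orientation) for every time-point comparison, which is how the stated total of eight is read here.

```python
from itertools import product

# Labels are placeholders for illustration.
CULTURES = ["culture_1", "culture_2"]             # independent cell cultures
DYE_ORIENTATIONS = ["Cy3/Cy5", "Cy5/Cy3"]         # dye-swap hybridizations
SPOTS = ["spot_1", "spot_2"]                      # duplicate printed probes
TIME_POINTS = ["0 h", "36 h", "72 h", "96 h"]     # 0 h is the dox-treated reference


def measurements_per_comparison():
    """All (culture, dye orientation, spot) combinations for one comparison."""
    return list(product(CULTURES, DYE_ORIENTATIONS, SPOTS))


def hybridizations_per_cell_line():
    """Dye-swap hybridizations for the three comparisons against 0 h."""
    comparisons = [t for t in TIME_POINTS if t != "0 h"]
    return list(product(comparisons, CULTURES, DYE_ORIENTATIONS))


def rna_samples_per_cell_line():
    """One RNA sample per culture and time point."""
    return list(product(CULTURES, TIME_POINTS))


n_measurements = len(measurements_per_comparison())
n_hybridizations = len(hybridizations_per_cell_line())
n_samples = len(rna_samples_per_cell_line())
```

Under these assumptions each probe yields 2 × 2 × 2 = 8 measurements per comparison, each cell line requires 12 time-course hybridizations, and two cultures sampled at four time points give eight RNA samples per cell line, which would be consistent with the eight DoxHPC1 and eight DoxHPC7 samples mentioned in the RNA extraction protocol.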
The cDNA arrays were produced at the KTH Microarray Center and contain 14,121 in-house sequenced cDNA clones originating from a mouse brain lateral ventricle wall library, a normalized neurosphere library, an adult bone-marrow derived hematopoietic stem cell line expressing Lhx2 and additional control features including the Lucidea Universal ScoreCard probes (Amersham Biosciences, Sweden). A detailed description of the libraries is available elsewhere. Details regarding the probes and cDNA amplification, purification and printing are available through the ArrayExpress microarray data repository using the array accession number A-MEXP-175. In brief, the cDNA inserts were amplified using Platinum Taq DNA polymerase (Invitrogen, Sweden), purified using the Multiscreen-PCR 384-well plates (Millipore, Sweden) automated on the Biorobot 8000 platform (Qiagen, Germany) and printed in either 30 or 50% DMSO. Probes were printed with a feature-to-feature distance of 175 μm into two identical fields consisting of 24 blocks each. The printed DNA was attached to the reactive surface of the Ultra-GAPS slides (Corning B.V., The Netherlands) using 250 mJ/cm² UV light. All cDNA sequences on the chip are deposited in the dbEST sequence database and are accessible with the GenBank accession numbers listed in the ArrayExpress database. All clone annotations are derived from Mus musculus UniGene build 144, unless otherwise stated.
Total RNA was extracted from eight DoxHPC1 and eight DoxHPC7 samples using the RNeasy technology (Qiagen), and its quality and quantity were determined using the RNA Nano LabChip kit on the Agilent 2100 bioanalyzer (Agilent Technologies, CA, USA) and the Nanodrop ND-1000 spectrophotometer (Nanodrop Technologies, DE, USA), respectively. For each cDNA synthesis reaction, 20 μg of total RNA was mixed with 5 μg of random hexamer primer (Invitrogen), incubated at 70°C for 10 minutes and then placed on ice for at least 5 minutes. Reverse-transcription reaction mixture and 400 units of Superscript II RT-polymerase were added to yield a final volume of 30 μl containing first-strand buffer (Invitrogen), 10 mM DTT (Invitrogen) and 0.5 mM dNTPs (Sigma). The ratio of aminoallyl-modified dUTP to dTTP was 4:1 in the dNTP mixture. The reaction was carried out at 42°C for two hours, followed by 15 minutes of hydrolysis of the RNA strand at 70°C in the presence of 16 mM EDTA (Sigma) and 150 mM NaOH. The reaction was neutralized using 150 mM HCl and purified using the MinElute Reaction Cleanup system (Qiagen), with the provided wash and elution buffers replaced by 80% ethanol and 100 mM NaHCO3, pH 9.0, respectively. A repeated elution from the column was carried out, generating a total volume of 20 μl. The eluate was transferred to an aliquot containing one tenth of a tube of monofunctional NHS-ester Cy3 or Cy5 dye (Amersham Biotech), which had been dissolved in DMSO and subsequently dried in a vacuum centrifuge. After a 30-minute incubation in darkness at room temperature, the samples to be co-hybridized on a slide were pooled and purified using the MinElute columns.
To avoid unspecific hybridization to the surface, the slides were pre-hybridized for 30 minutes in a solution consisting of 5xSSC, 0.1% SDS (Sigma) and 1% BSA (Sigma). Slides were subsequently washed in water and isopropanol (Sigma) and dried using a slide centrifuge. The labeled, pooled and denatured (3 minutes at 95°C) samples in a hybridization mixture containing 25 μg mouse Cot-1 DNA (Invitrogen), 40 μg poly-(dA) DNA (Operon), 25% formamide (Sigma), 5xSSC and 0.1% SDS were applied under a LifterSlip cover (Erie Scientific Company, NH, USA) and hybridized for 16–20 hours at 42°C in a water bath. After hybridization the slides were washed with increasing stringency using 2xSSC and 0.1% SDS at 42°C, followed by 0.1xSSC and 0.1% SDS at room temperature and finally five times with 0.1xSSC at room temperature. Scanning was carried out at 10-μm resolution using the G2565BA DNA microarray scanner (Agilent Technologies), for which the photomultiplier tube was set to 100. The obtained TIFF images were analysed using the GenePix Pro 5.1 software (Axon Instruments, CA, USA). For each slide the foreground and background signal intensities were separated using the irregular feature-finding algorithm implemented in the software. A manual inspection was carried out to verify the results. The raw data, including several quality parameters, is available from the ArrayExpress data repository using the experiment accession number E-MEXP-431.
Data processing and identification of differentially expressed genes was carried out in the R environment for statistical computing and programming using the Bioconductor package bundle, Limma, the aroma package and the kth-package. As a first step, the Cy3 and Cy5 intensities without background subtraction were converted to M (= log2[red channel/green channel]) and A (= log2[red channel × green channel]/2) format using the median intensity values for both channels. A feature was considered unreliable and removed from further analysis if GenePix flagged it as "Not Found" or if it was manually flagged as "Bad" during the image analysis step. Furthermore, a feature was considered unreliable if it satisfied one of the following criteria: a) both channels were saturated (above 65190 intensity units), b) the percentage of foreground pixels above the local background plus two of its standard deviations was below 70 for both channels of the feature, or c) the signal-to-noise ratio (defined as [mean foreground-mean background]/background standard deviation) was below 3 for both channels. The remaining data were normalized using the intensity-dependent print-tip lowess method, and differentially expressed genes were identified using the empirical Bayes moderated t-test implemented in the Limma package. Before model fitting, the signal intensities for duplicate features on each array were averaged. Replicated hybridizations are expected to have a higher correlation than the repeated cell cultures for each cell line. This was considered in the linear model by first estimating the between-replicates correlation, using restricted maximum likelihood methods to estimate a common correlation for all probes. Secondly, a linear model was fit for each probe using generalized least squares, which takes the between-replicates correlation into account.
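The conversion to M and A values and the feature filters described above can be sketched as follows. This is an illustrative Python sketch, not the R code used in the analysis. The dictionary field names are hypothetical, and applying the saturation threshold to the median intensity is an assumption.

```python
import math

SATURATION = 65190  # intensity units, the threshold quoted in the text


def ma_values(red, green):
    """Return (M, A) with M = log2(R/G) and A = log2(R*G)/2."""
    m = math.log2(red / green)
    a = math.log2(red * green) / 2
    return m, a


def is_unreliable(feature):
    """Apply the flag and quality filters to one feature.

    `feature` is a dict with hypothetical field names: a "flag" string
    and per-channel dicts under "red" and "green".
    """
    if feature["flag"] in ("Not Found", "Bad"):
        return True
    channels = (feature["red"], feature["green"])
    # Each criterion must hold for BOTH channels to reject the feature.
    both_saturated = all(ch["median"] > SATURATION for ch in channels)
    both_few_pixels = all(ch["pct_above_bg_2sd"] < 70 for ch in channels)
    both_low_snr = all(
        (ch["mean_fg"] - ch["mean_bg"]) / ch["sd_bg"] < 3 for ch in channels
    )
    return both_saturated or both_few_pixels or both_low_snr


# Hypothetical example values.
strong = {"median": 4096, "pct_above_bg_2sd": 95,
          "mean_fg": 4000, "mean_bg": 100, "sd_bg": 50}
weak = {"median": 1024, "pct_above_bg_2sd": 40,
        "mean_fg": 120, "mean_bg": 100, "sd_bg": 50}

print(ma_values(4096, 1024))                                        # (2.0, 11.0)
print(is_unreliable({"flag": "", "red": strong, "green": weak}))    # False
print(is_unreliable({"flag": "", "red": weak, "green": weak}))      # True
print(is_unreliable({"flag": "Bad", "red": strong, "green": strong}))  # True
```

A feature that is weak in only one channel is retained, since criteria a) to c) apply only when both channels fail. Such a feature may reflect a gene that is expressed in only one of the two co-hybridized samples.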
In the third step, an empirical Bayes approach was used to shrink the gene-wise standard errors towards a common value and a moderated t-statistic was calculated [66, 71]. To compensate for multiple testing, the p-values associated with these t-statistics were adjusted using a false-discovery rate approach implemented in R. Finally, probabilities for differential expression (B-values) were calculated for each gene. For these calculations the a priori proportion of differentially expressed genes was assumed to be 0.01.
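The multiple-testing adjustment and the interpretation of the B-values can be illustrated with a short Python sketch. It assumes that the false-discovery rate approach is the Benjamini-Hochberg step-up procedure, which is what R's p.adjust applies for the method "fdr", and it treats B as the natural-log odds of differential expression, as reported by Limma. The p-values in the example are made up.

```python
import math


def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    n = len(pvalues)
    # Walk from the largest p-value down, keeping a running minimum so
    # that the adjusted values are monotone in the raw p-values.
    order = sorted(range(n), key=lambda i: pvalues[i], reverse=True)
    adjusted = [0.0] * n
    running_min = 1.0
    for position, idx in enumerate(order):
        rank = n - position  # 1-based rank among ascending p-values
        running_min = min(running_min, pvalues[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


def b_to_probability(b):
    """Convert a log-odds B-value into a probability of differential expression."""
    return 1.0 / (1.0 + math.exp(-b))


def prior_log_odds(proportion):
    """Log-odds corresponding to an a priori proportion of changed genes."""
    return math.log(proportion / (1.0 - proportion))


print(bh_adjust([0.01, 0.04, 0.03, 0.005]))
print(b_to_probability(0.0))            # 0.5
print(round(prior_log_odds(0.01), 2))   # -4.6
```

Under these assumptions a B-value of 0 corresponds to a 50% probability of differential expression, and the prior of 0.01 corresponds to starting each gene at log-odds of about -4.6 before the data are considered.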
Classification into Gene Ontology functional groups and analysis of over-represented themes was carried out using the EASE-package available at [73, 74]. The complete mouse transcriptome was used for calculation of the expected frequencies in the over-representation analysis, and a Gene Ontology theme (detail level 3) was considered over-represented if the calculated EASE score was below 0.1.
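The idea behind the EASE score can be shown with a small example. The sketch below is a simplified Python illustration, not the EASE software itself: it computes a one-sided hypergeometric (Fisher exact) tail probability after removing one category member from the gene list, which makes the score more conservative than the plain Fisher probability. The exact contingency table used by EASE may differ, and the counts in the example are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(k, n, K, N):
    """P(X >= k) when n genes are drawn from N, of which K are in the category."""
    upper = min(n, K)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)


def ease_score(list_hits, list_total, pop_hits, pop_total):
    """Simplified EASE-style score: Fisher tail with one list hit removed."""
    if list_hits == 0:
        return 1.0
    return hypergeom_upper_tail(list_hits - 1, list_total - 1, pop_hits, pop_total)


# Hypothetical counts: 4 of 6 list genes fall in a category that
# contains 5 of the 20 genes in the background population.
fisher = hypergeom_upper_tail(4, 6, 5, 20)
ease = ease_score(4, 6, 5, 20)
print(round(fisher, 4))  # 0.0139
print(round(ease, 4))    # 0.0726
print(ease < 0.1)        # True
```

In this toy example the category would pass the threshold of 0.1 used above, although the penalised score is about five times larger than the unpenalised Fisher probability.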
Total RNA was extracted from cell pellets with TRIzol reagent (Invitrogen). cDNA was synthesized using the First-strand cDNA synthesis kit (Amersham Biosciences). Real-time PCR reactions were carried out in triplicate using SYBR green PCR master mix (Applied Biosystems, CA, USA) and PCR products were detected with an ABI Prism 7000 instrument (Applied Biosystems). The expression levels of the genes tested were normalized to the expression levels of Gapdh and confirmed with two additional housekeeping genes: Hprt1 (hypoxanthine phosphoribosyltransferase 1) and Tbp (TATA box binding protein) (data not shown). The following primers were used in the PCR analyses: Lhx2 Forward (F)- GCC GAG AAA GCG CAA GAG T and Reverse (R)- TGT TCA GCA TCG TTC TCG TTA CA; Plscr1 F- AGC TGC TGT TCC GAC ATT GA and R- GGA ACT GGA TCC CAA AAT TGT CT; Bpgm F- CTT AAA GGG CAA AAG CAT TCT GAT and R- TGG GCA GAG TGA TGT TGA TAA TAT C; Galnt2 F- TGC GGG TCC TCA GAA ATG A and R- TCA GCA ACC CGC TCC AA; Serpina3g F- CCT ACA GAT CCT GGC AGA GTT CA and R- GAT CTT CCC CTG GGT GTG ATT; Syne2 F- GGA GGT GTT CGG CAG AGT GT and R- TCT TCT ATG TCC GTC TCA TTC TCA GA; Csrp2 F- CCG TGT ATG CTG CGG AGA A and R- TTG GCA CAC CGG AAA CAG T; Upp1 F- TGA CCG CTA CGC CAT GTA TAA A and R- CAT GAT GCC GAT GGA AGG A; MyoIb F- ATC AGG TCA AGG AAC AGC TTC TG and R- TCA TTC CTC ACA GTC TTG GCA TT; Etv5 F- GAG CCG CTC TCT CCG CTA T and R- CCG GGT CAC ACA CAA ATT TG; Perp F- CTT GTT TTC CTG AGA GTC ATT GGA and R- GGT TAT CGT GAA GCC TGA AGG T; 2610034M16Rik F- GAG GAC TCA CTC AGG TTT TGT GAA and R- AAT GTC AAC TTC TGC TCC TTC TAA TTT TA; Ncf1 F- CAA AGA TGG CAA GAA TAA CGT AGC T and R- AGT CAG CAA TGG CCC GAT AG; Slc2a3 F- ACG ATC GGC TCT TTC CAG TTT and R- TTC TAA CCG CTC TTC CAA AGT GTA; Aldoa F- CCC TTC CCC CAA GTT ATC AAG and R- GGC ACC ACA CCC TTA TCT ACC T; Bnip3 F- GGT TTT CCT TCC ATC TCT GTT ACT GT and R- GTT GTC AGA CGC CTT CCA ATG; Tpi1 F- ACC GAG AAG GTC GTG TTC GA and R- GGC CAG GAC CAC CTT GCT; Ndrg1 F- CAT CGG CAT GAA CCA CAA GA and R- AAA ATG TTG TGT GAT CTC CTG CAT; Pgk1 F- GGA AGC GGG TCG TGA TGA and R- GCC TTG ATC CTT TGG TTG TTT G; Eno1 F- GGC ACC CTC TTT CCT TGC TT and R- GGC GTG GAT CCT GAG AAT AGA C; Gapdh F- CGT GTT CCT ACC CCC AAT GT and R- TGT CAT CAT ACT TGG CAG GTT TCT; Hprt F- GCA GTA CAG CCC CAA AAT GG and R- AAC AAA GTC TGG CCT GTA TCC AA; Tbp F- GAA TTG TAC CGC AGC TTC AAA A and R- AGT GCA ATG GTC TTT AGG TCA AGT T.
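Normalization of real-time PCR data to a housekeeping gene such as Gapdh is commonly done with the comparative Ct method. The text does not state which calculation was used, so the following Python sketch is an assumption for illustration only. It further assumes 100% amplification efficiency, and the Ct values are hypothetical.

```python
from statistics import mean


def relative_expression(target_ct, reference_ct,
                        target_ct_calibrator, reference_ct_calibrator):
    """Fold change of a target gene relative to a calibrator sample.

    Each argument is a list of replicate Ct values (e.g. triplicates).
    The reference gene (e.g. Gapdh) corrects for differences in input
    cDNA; the calibrator is the condition used as baseline.
    """
    delta_sample = mean(target_ct) - mean(reference_ct)
    delta_calibrator = mean(target_ct_calibrator) - mean(reference_ct_calibrator)
    delta_delta = delta_sample - delta_calibrator
    return 2 ** (-delta_delta)


# Hypothetical triplicate Ct values: Lhx2 and Gapdh in cells cultured
# with dox (calibrator) and after dox withdrawal (sample).
lhx2_with_dox = [22.0, 22.0, 22.0]
gapdh_with_dox = [18.0, 18.0, 18.0]
lhx2_without_dox = [27.0, 27.0, 27.0]
gapdh_without_dox = [18.0, 18.0, 18.0]

fold = relative_expression(lhx2_without_dox, gapdh_without_dox,
                           lhx2_with_dox, gapdh_with_dox)
print(fold)  # 0.03125
```

With these made-up values, a shift of five cycles at constant Gapdh signal corresponds to a 32-fold reduction of the target transcript. Repeating the calculation with Hprt1 or Tbp as the reference gene would mirror the confirmation step mentioned above.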
Embryos were fixed in 4% paraformaldehyde, cryoprotected (30% sucrose in PBS), embedded in Tissue-Tek OCT compound and cryosectioned (8 μm). In situ hybridization using digoxigenin (DIG)-labeled probes was performed as described, with some modifications. Briefly, sections from E13.5 embryos were treated with 5 μg/ml proteinase K (Roche) in 0.1 M PBS for 15 minutes at room temperature prior to hybridization. The DIG signal was visualized with NBT/BCIP (Roche). The following probes were used: Lhx2 ([GenBank:NM_010710], probe spanning the region 460–1750), Nuak1 ([GenBank:NM_001004363], 2382–3143), Tmem2 ([GenBank:BC076570], 296–1262), Enc1 ([GenBank:NM_007930], 920–1793), and Etv5 ([GenBank:NM_023794], 661–1609).
We thank Annelie Waldén for valuable assistance with the microarray printing and Peter Nilsson for discussions. This work was supported by grants from the Knut and Alice Wallenberg Foundation, the Wallenberg Consortium North, the Swedish Scientific Research Council, the Foundation for Strategic Research, the Swedish Cancer Society and the Västerbotten County. LC is supported by the Tobias Foundation and by a grant from the European Union Regional Fund (Objective 1).
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.