Alternative splicing is frequent during early embryonic development in mouse
- Timothée Revil1Email author,
- Daniel Gaffney1Email author,
- Christel Dias1, 2,
- Jacek Majewski1, 2 and
- Loydie A Jerome-Majewska†1, 3
© Revil et al; licensee BioMed Central Ltd. 2010
Received: 20 January 2010
Accepted: 23 June 2010
Published: 23 June 2010
Alternative splicing is known to increase the complexity of mammalian transcriptomes since nearly all mammalian genes express multiple pre-mRNA isoforms. However, our knowledge of the extent and function of alternative splicing in early embryonic development is based mainly on a few isolated examples. High throughput technologies now allow us to study genome-wide alternative splicing during mouse development.
A genome-wide analysis of alternative isoform expression in embryonic day 8.5, 9.5 and 11.5 mouse embryos and placenta was carried out using a splicing-sensitive exon microarray. We show that alternative splicing and isoform expression is frequent across developmental stages and tissues, and is comparable in frequency to the variation in whole-transcript expression. The genes that are alternatively spliced across our samples are disproportionately involved in important developmental processes. Finally, we find that a number of RNA binding proteins, including putative splicing factors, are differentially expressed and spliced across our samples suggesting that such proteins may be involved in regulating tissue and temporal variation in isoform expression. Using an example of a well characterized splicing factor, Fox2, we demonstrate that changes in Fox2 expression levels can be used to predict changes in inclusion levels of alternative exons that are flanked by Fox2 binding sites.
We propose that alternative splicing is an important developmental regulatory mechanism. We further propose that gene expression should routinely be monitored at both the whole transcript and the isoform level in developmental studies
Developmental processes require precise spatial and temporal regulation of gene expression. Accordingly, developmental biologists have always been at the forefront of gene expression analysis, and recombinant DNA techniques such as transgenic and knockout models have greatly contributed to elucidation of developmental pathways and networks. Traditionally, these studies have focused on transcription factors and repressors that regulate the timing and strength of transcription. Recently, new regulatory mechanisms have emerged, such as post-transcriptional regulation by microRNAs and co-transcriptional regulation by alternative pre-mRNA splicing.
Alternative splicing is a pre-mRNA maturation process that consists of the removal or inclusion of certain alternative exons to produce different transcripts from one genomic locus [1, 2]. Alternative splicing is now known to be prevalent in advanced eukaryotes. In humans, recent reports show that more than 98% of multi-exonic pre-mRNAs are alternatively spliced [3, 4]. The mouse genome has been sequenced and, similarly to that of humans, a surprisingly low number of less than 30,000 genes have been identified [5, 6]. It has been widely hypothesized that the great complexity of higher eukaryotic organisms stems from processes such as alternative splicing [7, 8]. The distinct proteins translated from identical pre-mRNAs produced by this process can have different, even antagonistic activities. Thus, alternative splicing can play a major role in the activity of various important cellular mechanisms, such as cell differentiation, cell migration, cell growth and apoptosis. This wide range of cellular processes is required during mammalian embryogenesis to generate a viable organism from a single cell.
Several studies have suggested the importance of alternative splicing during development. In C. elegans, it was shown that 18% of the 352 verified alternative exons showed a larger than fourfold change in alternative splicing during its development from embryo to adult, including larval stages . In humans, mice, chickens and Xenopus, a well-known example is provided by the fibroblast growth factor 8 (FGF8), which can produce many different isoforms . Two of these, FGF8A and FGF8B, which differ by only eleven amino acids, have been shown to have different activities during development [11–14].
It has recently been shown that the levels of certain splicing factors, MBNL and CELF, are regulated and vary several-fold during pre- and postnatal heart development . This variation in CELF and MBNL expression levels affects the splicing modulation of a large quantity of other alternative splicing events, suggesting the existence of a regulatory cascade at the splicing level. Some of the most interesting examples of alternative splicing and its regulation by splicing factors have been carried out in neural tissues. As neuronal precursor cells differentiate into neurons, there is a switch from the ubiquitous PTB to the highly similar, but neuron-specific, neural PTB (nPTB) . As these two proteins modulate alternative splicing of specific subsets of pre-mRNAs, there is an associated switch of a large number of mRNA isoforms.
Since splicing factors may regulate alternative splicing of many different pre-mRNAs [17, 18], knocking out known splicing factors in mice generally has profound effects on embryo or young pup viability. The majority of germ-line loss of function mutations in splicing factors result in embryonic arrest early in development, before embryonic day (E) 7.5, e.g. Ptb , SC35 [20, 21], Asf2/Sf2  and SRp20 . In two cases, Ptb and SRp20, homozygous mutant embryos arrest at the morula stage [19, 23]. Prfp3 mutant embryos also exhibit embryonic arrest, although it is not clear if these embryos die early or late in embryogenesis . Germ-line loss of function mutations in splicing factors are also associated with organ specific abnormalities. For example, most SRp38 knockout embryos die before E15.5 with multiple cardiac defects , whereas a small number of mutant mice are born only to die soon after birth . Mbnl1 and Mbnl2 are required in the skeletal muscle and eye [26, 27], and Nova1 is required in motor neurons . These studies indicate that there are stage-specific and tissue-specific requirements for splicing factors.
Although there have been a number of indications of the importance of alternative splicing in development, past studies have been limited largely to individual experiments focused on known candidate genes. In recent years, technological advances have paved the way to genome-wide analyses of mRNA processing, and have enabled hypothesis-free approaches. In this study, we take advantage of a splicing-sensitive exon microarray to investigate genome-wide variation in alternative splicing during development of the mouse embryo and its associated placenta. We focus on early developmental stages, E8.5-E11.5, in order to capture the isoform differences occurring during organogenesis. In mouse embryos, organogenesis begins at E8.5 and is mostly completed by E11.5 when most organs of the foetus can be recognized. The placenta is one of the first organs to form and function in the developing foetus. During organogenesis the precursors of the adult organs undergo a series of morphogenetic movements and differentiation events which have been shown to be controlled by changes in gene expression. We postulate that alternative splicing provides an additional mechanism for increasing the repertoire of transcripts from a limited number of developmentally relevant genes and confers additional specificity to individual developing tissues. Our results show that alternative splicing is frequent during organogenesis, and that different tissues (i.e. the placenta and embryo) as well as different developmental stages express specific gene isoforms. We confirm some previously described splicing events but also report numerous new time- and tissue-specific alternative isoforms. In addition, we find that the mRNA expression levels of some known splicing factors are modulated during organogenesis, suggesting that changes in expression levels of splicing factors may be responsible for alternative isoform expression.
Data quality control and filtering
We next carried out the splicing index analysis which identified a large number of candidate exons predicted to be alternatively spliced across the two tissues and three stages. Since it is instrumental to distinguish between truly alternatively spliced exons, and exons expressed at very low levels (indistinguishable from background noise) or very highly (beyond the sensitivity levels of the microarray); we employed a number of filtering steps, as described in Materials and Methods and previously in . The Affymetrix Mouse Exon Array contains over 23,000 core meta probe sets (genes) and 280,000 probe sets (exons). Our filtering criteria reduced this number to 13,366 meta probe sets containing approximately 133,000 probe sets.
We investigated the efficacy of our data filtering steps by estimating the percentage overlap between candidate alternatively spliced exons in our study, with those predicted to be alternatively spliced based on independent EST and mRNA evidence such as those annotated by the "AltEvents" track from build 37 of the mouse genome on the UCSC Genome browser [5, 29, 30]. The Alt-Events track annotates alternative splice forms in UCSC "known" genes, which are assembled by combining evidence from multiple independent experimental sources. Application of our data filtering steps increases the percentage overlap with alternative events predicted from independent sources (Additional File 1 Figure S1). In addition, by increasing the stringency of the false discovery rate (FDR) cutoff we further increased the percentage overlap. This analysis demonstrates that the application of data filtering as well as the use of a stringent q-value cutoff increases the agreement between our results and alternatively spliced events predicted from independent experimental evidence. We note, however, that while our data filtering likely increases the sensitivity of our analysis, it is likely that the filters concurrently reduce specificity.
Comparison of differentially expressed and spliced genes
To assess patterns of gene expression and splicing change over our entire data set, we defined a significant difference as a P-value which was less than that expected at an FDR of 0.05. Significant stage effects were defined using the combined P-values from the embryo and placenta: that is, a significant developmental stage effect reflects a difference in expression between different embryonic stages, placental stages or both. For brevity, we refer to differences in expression between embryo and placenta as "tissue-specific" and differences in expression between developmental stages as "stage-specific".
We next investigated the overlap between whole gene expression change and alternative splicing. For clarity, in this section we focused on the 10,796 genes which contained at least one filtered (clearly expressed) probe set. We find that genes whose expression is significantly different between tissues or stages are highly enriched for alternatively spliced exons (Figure 3D). Genes that are differentially expressed between tissue are primarily enriched (compared to all genes) in tissue-specific alternative exons, while genes that are differentially expressed between developmental stages are likewise enriched in stage-specific alternative exons (Figure 3D). This analysis suggests that a large proportion of genes that are differentially expressed in early embryonic development concurrently exhibit significant isoform changes.
Developmental processes and functions are overrepresented in alternatively spliced genes
Overrepresented PANTHER "Biological Process" terms.
PANTHER Biological Process
Cell adhesion-mediated signaling
Overrepresented PANTHER "Molecular Function" terms.
PANTHER Molecular Function
Extracellular matrix structural protein
Cell adhesion molecule
Classification and visualization of alternative isoforms
Microarray results are confirmed by RT-PCR and qRT-PCR
Validation of cassette exons by RT-PCR.
Integrin α 6
ELKS/RAB6-interacting/CAST family member 1
Kinesin Family Member 2A
Erythrocyte Protein Band 4.1-like 3
NUMB Gene Homolog
DEP Domain Containing 5
WNK Lysine Deficient Protein Kinase 1
MYC Binding Protein 2
Alpha Glucosidase 2 Alpha Neutral Subunit
Validation of different alternative splicing events by real-time PCR.
engulfment and cell motility 1, ced-12 homolog (C. elegans)
ankyrin 3, epithelial
glucosaminyl (N-acetyl) transferase 2, I-branching enzyme
transcription factor 4
kinesin family member 1B
intersectin 1 (SH3 domain protein 1A)
MYC binding protein 2
ELKS/RAB6-interacting/CAST family member 1
WNK lysine deficient protein kinase 1
kinesin family member 2A
DEP domain containing 5
RAB6, member RAS oncogene family
In order to gain further insight into temporal variation in isoform expression, the qRT-PCR was performed on additional embryonic and placental stages, ranging from E6 to E11. The results of these additional stages were almost all predictable by extrapolation of the trends observed in the microarrays. A few demonstrated some unexpected behaviour, which may correspond to rapid changes in splicing patterns corresponding to developmental switches. As mice embryos develop extremely rapidly, with a gestation period of only 19-21 days, it is possible that the alternative splicing of certain pre-mRNAs can also be modified extremely rapidly.
The seventeen validated candidates are implicated in various cellular and organismal processes, and some have previously been shown to be involved in embryogenesis. For example, Numb is involved in neuronal differentiation during development. There are four known isoforms of Numb, two of which contain an alternative exon of 147 nt in the proline rich region and two without this exon . These two classes have been shown to be differentially spliced during embryogenesis from E10 to E17, as well as in adults, to regulate cortical development . This shift in alternative splicing is confirmed by our microarray results and we also find that this starts as soon as E8.5, the earliest data we have for this alternative splicing event.
Multiple RNA-binding proteins are alternatively expressed and spliced during development
Top 20 gene expression candidates in RNA binding proteins.
Proposed functions (GO)
P-value of changes of gene expression
P-value of changes of alternative splicing
Heat shock protein binding
Ribosome biogenesis (yeast)
RNA splicing, RNA helicase
Transcription regulator (during development)
Cell differentiation, development
rRNA processing, ribosome biogenesis
Because some aspects of the function of Fox2 and its binding sites within RNA sequences have been previously characterized, we were able to predict the effect of variation of Fox2 mRNA expression on a subset of exons that contain its putative binding sites. We compared the alternative splicing of cassette exons between E8.5 and E11.5 embryos as well as between E11.5 embryo and E11.5 placenta (P11.5), the two stages we observed the highest differences for Fox2 expression. The results obtained for these ratios (Figure 7D) demonstrate that inclusion rates of exons that are flanked by putative binding sites for Fox2 are highly correlated with the expression of this gene. The candidate exons were divided into two sets: those that contained a putative Fox2 binding site within the upstream intron, and those with a binding site in the downstream intron. Previous studies suggest that binding of Fox2 upstream of the exon should result in increased exon skipping, while downstream binding results in exon inclusion . Our results indeed demonstrate that when the introns downstream of a cassette exon contained UGCAUG, a hexamer representing the most specific binding site for this protein, exon inclusion levels were positively correlated to Fox2 expression levels, while the opposite was true for exons with upstream binding sites. There is nearly a ten-fold difference between the means of exon inclusion ratios for these two categories, when comparing E11.5 and E8.5 (P = 0.0204). We also used a more relaxed criterion for identifying putative Fox2 binding sites, including all the binding sites previously described . This analysis increased the sample size from 22 to 58 candidate regulated cassette exons (Additional file 4 Table S2), and resulted in increased statistical significance levels of the regulatory effect (P = 0.0012; Figure 7D, bars labelled "All"), suggesting that Fox2 binding sites may allow a level of degeneracy in vivo, and that there may be a large number of splicing events regulated by Fox2 during early embryonic development.
Alternative splicing is frequent during early embryonic development
Alternative splicing is becoming a widely studied process that can increase the transcriptome complexity using a finite set of genes. However, little is known on the regulation of alternative splicing during development, where time and tissue specific regulation of protein levels is fundamental for cell differentiation, apoptosis and migration, all key regulators of organogenesis. In this work, we show that splicing is extensively regulated during organogenesis. We compile a list of 5,500 genes that indicate transcript isoform differences between our two tissues of interest - embryonic and placental - or across three developmental stages, E8.5, E9.5 and E11.5. According to a 5% FDR cutoff, we estimate that 9% of all exons are differentially alternatively spliced between the placenta and the embryo, and 2% of exons are differentially alternatively spliced across the three developmental stages. With 5 biological replicates each, and 6 distinct tissue/stage samples, our study has considerable statistical power to detect alternatively spliced transcripts. Our analysis also allows us to compare the fraction of genes that show isoform level changes with genes that are differentially expressed at the whole transcript level. Although we find that the number of differentially expressed genes is larger than differentially spliced, the numbers are of similar orders of magnitude (Figure 3). Interestingly, there is a large overlap between these subsets, and we show that genes with very significant levels of differential expression have a much higher than expected chance of being alternatively spliced.
It is also worth noting that because of the high statistical power of the study, our number of differentially expressed candidates is ten times higher than what was previously demonstrated by a genome-wide expression analysis of placentas and embryos at E12.5, which found only 6.5% of statistically significant differences in gene expression . This could be explained in part by the advances in microarray analyses - i.e. the use of whole-transcript arrays - and the profiling of two placental stages allowing us to target a larger number of developmental events. However, it should be noted that the observed trends in expression are highly concordant between these two studies; most of the genes that had been shown to be highly expressed in placentas in the earlier study displayed the same high expression profile in our analysis.
Although we cannot rule out that a fraction of the genome-wide alternative splicing candidates may represent false positive results, our validation of 17 candidates presented here, as well as the 80% success in validation of previous experiments  suggests that false positives should not significantly affect our qualitative conclusions. It is difficult to estimate false negative rates in these types of studies. Although they are likely to be non-negligible, the limited power of the approach would render our estimates of the true frequency of alternative splicing conservative. We expect that this study exposes only the tip of the iceberg, and that a clearer picture will emerge as the whole-genome technologies mature. In the meantime, we provide a user-friendly database of our results in the additional material online (Additional file 1 Table S1).
Implications for regulation of developmental pathways
In the current paradigm, regulation of developmental pathways is accomplished mainly at the level of transcription, and is mediated by changes in expression of certain key genes. For example, one of the earliest events in embryonic development is the separation of the inner cell mass - which forms the embryo proper - from the trophectoderm lineage. This event is regulated by interaction between three transcription factors: Cdx2, Nanog, and Oct4. This transcriptional regulation is essential for turning on numerous other genes that will drive differentiation of the embryo and its associated placenta. Incidentally, Oct4 is known to express multiple alternatively spliced isoforms with distinct expression patterns and possibly distinct functions . In another well-known example, left-right asymmetry is governed by the Nodal molecular cascade. At E7.5 Nodal is turned on at the node, leading to auto regulation of its own expression in the left lateral plate, and turning on the expression of Lefty and Pitx2 genes. These three key players form the basis of a pathway containing numerous genes that control the specification of left versus right. Like Oct4, Pitx2, (one of our candidates, see Additional file 1 Table S1), expresses several isoforms with distinct transcriptional activities , and only one of those isoforms, Pitx2c, is required for left right asymmetry .
As the above examples demonstrate, many developmental pathways include alternative isoforms of key genes. Our own analysis presented here also indicates that a large proportion of developmentally regulated genes express alternative isoforms. The correct temporal and spatial expression of those isoforms is most likely regulated by specific splicing factors. Hence, we propose that in addition to canonical developmental pathways regulated by transcription factors, there exists a parallel, and yet highly overlapping, set of pathways regulated by splicing factors. A single splicing factor may affect the splicing of numerous exons in a large number of genes [18, 44, 45]. Thus, changes in expression levels of splicing factors can have profound downstream effects. Moreover, many splicing factors are themselves involved in self-regulatory feedback loops and express multiple isoforms [46, 47]. Our analysis indicates differential embryonic expression of a number of RNA binding proteins and putative splicing factors. One of the particularly interesting examples is Fox2 (RBM9 or Fxh), a factor whose expression and promoter usage are both variable across the three developmental stages (Figure 7B). We find that Fox2 expression increases nearly 2 fold between days 8.5 and 11.5 in the embryo, but not in the placenta. We also show that this increase in Fox2 levels has a significant and predictable effect on the alternative splicing of cassette exons which contain putative binding sites within 100 nt in the neighbouring introns (Figure 7D). Furthermore, the use of the proximal promoter, which gives rise to an mRNA encoding a shorter N-terminus, increases between those stages (Figure 7C). Thus, not only is the overall amount of Fox2 product, but also the ratio of distinct isoforms, variable across early embryonic development.
Recently Fox2 has been found to be highly expressed in another developmentally relevant system, pluripotent cell lines, along with other pluripotency markers such as Oct4 and Nanog. Fox2 has been shown to regulate the splicing of numerous alternative exons in human embryonic stem cells, and to be necessary for maintaining viability of the cells. Moreover, Fox2 pre-mRNA contains active binding sites for its own protein, indicating a degree of autoregulation. Finally, many Fox2 targets are in turn splicing regulators, suggesting the existence of more extensive splicing networks . The properties of such networks have been recently investigated in detail across a number of tissues and species .
Our study, along with a few recent publications [38, 48–50] demonstrates that alternative splicing is a frequent event and is likely to have a significant role in development. With the promising results presented here, the future step will be to improve detection rates by using additional developmental stages, and a finer resolution of embryonic tissues. Indeed, the use of whole embryo mRNAs does not permit discovery of small-scale tissue-specific alternative splicing, which is likely to be extremely significant to tissue differentiation. Finally, most of our knowledge pertaining to genome-wide variation in alternative splicing has been acquired thanks to recent developments in microarray technologies, using either exon or splice-junction microarrays. Even more recent advances in high throughput sequencing will soon make it feasible to carry out whole-genome expression and isoform profiling experiments using mRNA sequencing. This approach allows digital monitoring of expression by counting the number of reads that map to mRNA segments of interest and comparing them across samples. These fragments may be genes, individual exons, or specific splice junctions. mRNA sequencing also allows the discovery of novel, as of yet unannotated, isoforms and transcripts. Preliminary studies carried out on a limited number of tissues and with limited sequencing coverage, [3, 4] have already significantly enlarged our catalogue of alternatively spliced genes. With the rapidly increasing throughput, along with dropping costs and accessibility of sequencing, these technologies will soon allow us to routinely view gene expression at the sub exon level resolution and to decipher the role of splicing regulation in development and other systems.
In this study, we have demonstrated that changes in alternative splicing are frequent during early mouse development between tissues and stages. Indeed, the numbers of differentially spliced genes are similar to those of differentially expressed genes. In these two categories, there is an overrepresentation of genes that have been implicated in development. In addition, many RNA-binding proteins are differentially spliced and/or expressed at the mRNA level. We can see a direct correlation between expression of Fox2, a known splicing factor, and the alternative splicing of cassette exons containing binding sites for this protein in the neighbouring introns. Thus, our results suggest that the effects of alternative mRNA isoforms should now be systematically verified in developmental gene studies.
To generate embryos, C57BL/6J females were placed with C57BL/6J males overnight and checked for the presence of a vaginal plug in the morning. The day that a plug was detected was considered embryonic day (E) 0.5. All mice breeding and manipulations were performed in accordance with the Canadian Council on Animal Research. Embryos were collected between E6.5 - E11.5 for RNA isolation. Embryos for RNA isolation were stored in RNAlater (Ambion) for microarray analysis or TRIzol (Invitrogen) for RT-PCR, and isolated according to the manufacturers' protocol.
The microarray hybridization and analysis was done as previously described , but using the Affymetrix GeneChip® Mouse Exon 1.0 ST microarray. One microarray was used for each of the five biological replicates used in every stage and tissue analysed. All the microarray data has been deposited online in NCBI's Gene Expression Omnibus  with the accession number GSE21971. http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE21971.
Normalization and summarization of ExonArray hybridization data
The Affymetrix Mouse ExonArray contains approximately 1.2 million probe sets which target roughly 1 million known and predicted exons. The annotations used to design these probe sets are derived from a variety of sources and vary dramatically in the strength of experimental evidence which supports their existence. These annotations are divided, in order of decreasing quality of experimental support into "core", "extended", "full", "free" and "ambiguous" annotations. Our analysis was restricted to the approximately 2.2 × 105 core probe sets on the ExonArray. These probe sets interrogate exons derived from RefSeq transcripts and/or full-length GenBank mRNAs. There are a number of issues to consider when analysing GeneChip data from an experiment with multiple arrays. These include background correction, normalization, non-specific hybridization and probe summarization. We background-corrected probe expression levels for non-specific binding based the distribution of binding intensities of a set of "anti-genomic" probes of a similar GC content to the probe of interest. These anti-genomic probes are designed so as to not hybridize with any sequence in the mouse genome and so provide an estimate of the level of non-specific binding for a given GC content. Probe intensities were then quantile normalized across all samples. Finally, individual probe expression levels were summarized into exon- and gene-level expression using the Probe Logarithmic Intensity Error (PLIER) algorithm. All analyses were performed using the Affymetrix Power Tools suite of command line programs .
Data Quality Control
A major problem with inferring alternative splicing of exons between different biological samples using ExonArrays is differentiating such variation from changes in whole-gene expression level. The combination of changes in whole gene expression level with misleading probe set results can introduce potentially artefactual signals of alternative splicing in three main ways: via (i) the inclusion of probe sets from genes that are not expressed in a subset of samples (ii) the inclusion of cross-hybridizing probes or (iii) the inclusion of nonresponsive or "dead" probes [, Figure eight]. With these problems in mind we employed a number of quality control protocols to our expression data. Our objective was to minimize the number of false positive signals of differential splicing between our samples. Our filters are based upon the standards described by Affymetrix . Firstly, in order to differentiate genic from exonic expression changes, all exon expression levels were normalized by the whole gene expression level estimated by the PLIER algorithm. Secondly, to minimize errors introduced by unexpressed genes, we also excluded all meta-probe sets and their constituent probe sets that were not expressed in all 25 of our samples. A gene was defined as "not expressed" if its mean expression level was lower than the quartile of the distribution of all core meta probe set intensities for the chip in question. Thirdly, we attempted to minimize the influence of cross hybridizing probe sets in our analysis by removing all probe sets in which the probe set/meta-probe set intensity ratio was greater than 5, indicating a high level of cross-hybridization. Finally, in order to remove "dead" probe sets we used the detection above background (DABG) P-value, as estimated by the apt-probeset-summarize program. The DABG P-value describes the probability that an intensity value at least as extreme as the observed could have been drawn from the null distribution, in this case the background distribution of intensity values. In order to account for chip-to-chip variation, we set a False Discovery Rate-corrected P-value threshold at 0.05 for each chip, based on the distribution of DABG values for that chip. In our case, a "dead" or unresponsive probe set was defined as any probe set in which the DABG P-value exceeded the chip-specific FDR threshold value in all samples. In order to stabilize the variance data values were log-transformed.
Each probe set in our filtered dataset retained 25 estimates of expression level. These estimates were divided by tissue (embryonic and placental) and developmental stage (days E8.5, E9.5 and E11.5 embryonic, days E9.5 and E11.5 placental) with 5 biological replicates in each tissue-stage. For simplicity, we refer to embryonic samples as a single "tissue" throughout. The absence of placental samples from day E8.5, due to insufficient RNA quantities, means that our dataset is unbalanced and therefore, not amenable to analysis by standard two-way ANOVA. Instead, we implemented the following approach. In order to test for significant differences in expression level between tissues we use a two-sample t-test, combining samples from day E9.5 and day E11.5 from both tissues. We tested for differences in expression level between developmental stages using a two sample t-test in placenta to compare expression level on day E9.5 and day E11.5, and using a one-way ANOVA in embryo to compare expression levels on days E8.5, E9.5 and E11.5. The p-values for these latter two tests were then combined using Fisher's method  under the common null hypothesis of no significant variation in expression level across developmental stages in either tissue. All p-values were adjusted for multiple testing by using the Benjamini and Hochberg  FDR procedure, and a false discovery rate of 0.05 was used as a cutoff for reporting the results.
Fox2 effect on alternative cassette exon inclusion
In order to confidently identify cassette exons, of the top 2830 significant probe sets (Additional file 1 Table S1), we selected those that were annotated as "cassetteExon" in the "knownAlt" track on the UCSC Genome Browser . As binding sites for Fox2 are found predominantly in the surrounding introns within 100 nt of the splicing sites , upstream and downstream introns were scanned for the presence of several different putative binding sites previously published , using sequences extracted with the UCSC Table Browser data retrieval tool . We then calculated E11.5/E8.5 and E11.5/P11.5 ratios for each probe set value, normalized to the corresponding meta probe set. This ratio corresponds to the predicted ratio of exon inclusion levels. The candidate exons were divided into two sets: those that contained a putative Fox2 binding site within the upstream intron, and those with a binding site in the downstream intron. The data was log-transformed in order to stabilize the variance of the ratios, and the differences between the two sets were analysed using a one-tailed, unpaired Welch's t test . The results were considered significant if the p-values for the difference between the means were less than 0.05 (P < 0.05).
End-point PCR validation
Standard PCR validation was done using HotStarTaq PCR mix (Qiagen) according to the manufacturer's instructions. 8 ng of cDNA were used in a 10 μl reaction with conditions set as follows: 15 minutes at 95°C, followed by 30 cycles of 30 seconds at 95°C, 30 seconds at 58°C and 45 seconds at 72°C. The resulting products were analyzed on a 2% agarose gel (Additional file 3 Figure S2). Primer sequences used are available in additional table (Additional file 4 Table S3).
Real-time PCR validation
The validation of the results by qPCR was done in a 384-well format using the 7900HT Fast Real-Time PCR System (Applied Biosystems) and the Power SYBR® Green PCR Master Mix (Applied Biosystems) according to the manufacturer's protocol. The concentrations of the reagents used were 8 ng of cDNA was in a 10 μl reaction with 320 nM of each primer. The parameters of the qRT-PCR machine were 10 minutes at 95°C, followed by 40 cycles of 20 seconds at 95°C, 30 seconds at 58°C and 45 seconds at 72°C. The primers used are shown in additional table (Additional file 4 Table S4).
Detection Above BackGround
Differentially Alternatively Spliced
False Discovery Rate
Meta Probe Set
Principal Component Analysis
We would like to thank Kevin Lam, Vonda Koka, Kevin Ha and Tony Kwan for their useful advice. This work was supported by funding from FQRNT (LJM), CIHR and Genome Canada/Genome Quebec (JM). LAJM is a member of the Research Institute of the McGill University Health Centre, which is supported in part by the FRSQ.
- Chen M, Manley JL: Mechanisms of alternative splicing regulation: insights from molecular and genomics approaches. Nat Rev Mol Cell Biol. 2009, 10: 741-754.PubMed CentralPubMed
- Black DL: Mechanisms of alternative pre-messenger RNA splicing. Annu Rev Biochem. 2003, 72: 291-336. 10.1146/annurev.biochem.72.121801.161720.PubMedView Article
- Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ: Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet. 2008, 40: 1413-1415. 10.1038/ng.259.PubMedView Article
- Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF, Schroth GP, Burge CB: Alternative isoform regulation in human tissue transcriptomes. Nature. 2008, 456: 470-476. 10.1038/nature07509.PubMed CentralPubMedView Article
- Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, Agarwala R, Ainscough R, Alexandersson M, An P: Initial sequencing and comparative analysis of the mouse genome. Nature. 2002, 420: 520-562. 10.1038/nature01262.PubMedView Article
- Guenet JL: The mouse genome. Genome Res. 2005, 15: 1729-1740. 10.1101/gr.3728305.PubMedView Article
- Kim E, Magen A, Ast G: Different levels of alternative splicing among eukaryotes. Nucleic Acids Res. 2007, 35: 125-131. 10.1093/nar/gkl924.PubMed CentralPubMedView Article
- Graveley BR: Alternative splicing: increasing diversity in the proteomic world. Trends Genet. 2001, 17: 100-107. 10.1016/S0168-9525(00)02176-4.PubMedView Article
- Barberan-Soler S, Zahler AM: Alternative splicing regulation during C. elegans development: splicing factors as regulated targets. PLoS Genet. 2008, 4: e1000001-10.1371/journal.pgen.1000001.PubMed CentralPubMedView Article
- Gemel J, Gorry M, Ehrlich GD, MacArthur CA: Structure and sequence of human FGF8. Genomics. 1996, 35: 253-257. 10.1006/geno.1996.0349.PubMedView Article
- Fletcher RB, Baker JC, Harland RM: FGF8 spliceforms mediate early mesoderm and posterior neural tissue formation in Xenopus. Development. 2006, 133: 1703-1714. 10.1242/dev.02342.PubMedView Article
- Guo Q, Li JY: Distinct functions of the major Fgf8 spliceform, Fgf8b, before and during mouse gastrulation. Development. 2007, 134: 2251-2260. 10.1242/dev.004929.PubMed CentralPubMedView Article
- Liu Y, Ray SK, Yang XQ, Luntz-Leybman V, Chiu IM: A splice variant of E2-2 basic helix-loop-helix protein represses the brain-specific fibroblast growth factor 1 promoter through the binding to an imperfect E-box. J Biol Chem. 1998, 273: 19269-19276. 10.1074/jbc.273.30.19269.PubMedView Article
- Olsen SK, Li JY, Bromleigh C, Eliseenkova AV, Ibrahimi OA, Lao Z, Zhang F, Linhardt RJ, Joyner AL, Mohammadi M: Structural basis by which alternative splicing modulates the organizer activity of FGF8 in the brain. Genes Dev. 2006, 20: 185-198. 10.1101/gad.1365406.PubMed CentralPubMedView Article
- Kalsotra A, Xiao X, Ward AJ, Castle JC, Johnson JM, Burge CB, Cooper TA: A postnatal switch of CELF and MBNL proteins reprograms alternative splicing in the developing heart. Proc Natl Acad Sci USA. 2008, 105: 20333-20338. 10.1073/pnas.0809045105.PubMed CentralPubMedView Article
- Boutz PL, Stoilov P, Li Q, Lin CH, Chawla G, Ostrow K, Shiue L, Ares M, Black DL: A post-transcriptional regulatory switch in polypyrimidine tract-binding proteins reprograms alternative splicing in developing neurons. Genes Dev. 2007, 21: 1636-1652. 10.1101/gad.1558107.PubMed CentralPubMedView Article
- Blanchette M, Green RE, Brenner SE, Rio DC: Global analysis of positive and negative pre-mRNA splicing regulators in Drosophila. Genes Dev. 2005, 19: 1306-1314. 10.1101/gad.1314205.PubMed CentralPubMedView Article
- Venables JP, Koh CS, Froehlich U, Lapointe E, Couture S, Inkel L, Bramard A, Paquet ER, Watier V, Durand M: Multiple and specific mRNA processing targets for the major human hnRNP proteins. Mol Cell Biol. 2008, 28: 6033-6043. 10.1128/MCB.00726-08.PubMed CentralPubMedView Article
- Shibayama M, Ohno S, Osaka T, Sakamoto R, Tokunaga A, Nakatake Y, Sato M, Yoshida N: Polypyrimidine tract-binding protein is essential for early mouse development and embryonic stem cell proliferation. FEBS J. 2009, 276: 6658-68. 10.1111/j.1742-4658.2009.07380.x.PubMedView Article
- Ding JH, Xu X, Yang D, Chu PH, Dalton ND, Ye Z, Yeakley JM, Cheng H, Xiao RP, Ross J: Dilated cardiomyopathy caused by tissue-specific ablation of SC35 in the heart. EMBO J. 2004, 23: 885-896. 10.1038/sj.emboj.7600054.PubMed CentralPubMedView Article
- Wang HY, Xu X, Ding JH, Bermingham JR, Fu XD: SC35 plays a role in T cell development and alternative splicing of CD45. Mol Cell. 2001, 7: 331-342. 10.1016/S1097-2765(01)00181-2.PubMedView Article
- Xu X, Yang D, Ding JH, Wang W, Chu PH, Dalton ND, Wang HY, Bermingham JR, Ye Z, Liu F: ASF/SF2-regulated CaMKIIdelta alternative splicing temporally reprograms excitation-contraction coupling in cardiac muscle. Cell. 2005, 120: 59-72. 10.1016/j.cell.2004.11.036.PubMedView Article
- Jumaa H, Wei G, Nielsen PJ: Blastocyst formation is blocked in mouse embryos lacking the splicing factor SRp20. Curr Biol. 1999, 9: 899-902. 10.1016/S0960-9822(99)80394-7.PubMedView Article
- Graziotto JJ, Inglehearn CF, Pack MA, Pierce EA: Decreased levels of the RNA splicing factor Prpf3 in mice and zebrafish do not cause photoreceptor degeneration. Invest Ophthalmol Vis Sci. 2008, 49: 3830-3838. 10.1167/iovs.07-1483.PubMedView Article
- Feng Y, Valley MT, Lazar J, Yang AL, Bronson RT, Firestein S, Coetzee WA, Manley JL: SRp38 regulates alternative splicing and is required for Ca(2+) handling in the embryonic heart. Dev Cell. 2009, 16: 528-538. 10.1016/j.devcel.2009.02.009.PubMed CentralPubMedView Article
- Kanadia RN, Johnstone KA, Mankodi A, Lungu C, Thornton CA, Esson D, Timmers AM, Hauswirth WW, Swanson MS: A muscleblind knockout model for myotonic dystrophy. Science. 2003, 302: 1978-1980. 10.1126/science.1088583.PubMedView Article
- Ladd AN, Stenberg MG, Swanson MS, Cooper TA: Dynamic balance between activation and repression regulates pre-mRNA alternative splicing during heart development. Dev Dyn. 2005, 233: 783-793. 10.1002/dvdy.20382.PubMedView Article
- Bemmo A, Benovoy D, Kwan T, Gaffney DJ, Jensen RV, Majewski J: Gene expression and isoform variation analysis using Affymetrix Exon Arrays. BMC Genomics. 2008, 9: 529-10.1186/1471-2164-9-529.PubMed CentralPubMedView Article
- Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D: The human genome browser at UCSC. Genome Res. 2002, 12: 996-1006.PubMed CentralPubMedView Article
- Rhead B, Karolchik D, Kuhn RM, Hinrichs AS, Zweig AS, Fujita PA, Diekhans M, Smith KE, Rosenbloom KR, Raney BJ: The UCSC genome browser database: update 2010. Nucleic Acids Res. 2010, 38: D613-D619. 10.1093/nar/gkp939.PubMed CentralPubMedView Article
- Mi H, Lazareva-Ulitsky B, Loo R, Kejariwal A, Vandergriff J, Rabkin S, Guo N, Muruganujan A, Doremieux O, Campbell MJ: The PANTHER database of protein families, subfamilies, functions and pathways. Nucleic Acids Res. 2005, 33: D284-288. 10.1093/nar/gki078.PubMed CentralPubMedView Article
- Rozario T, Desimone DW: The extracellular matrix in development and morphogenesis: A dynamic view. Dev Biol. 2010, 341: 126-40. 10.1016/j.ydbio.2009.10.026.PubMed CentralPubMedView Article
- Verdi JM, Bashirullah A, Goldhawk DE, Kubu CJ, Jamali M, Meakin SO, Lipshitz HD: Distinct human NUMB isoforms regulate differentiation vs. proliferation in the neuronal lineage. Proc Natl Acad Sci USA. 1999, 96: 10472-10476. 10.1073/pnas.96.18.10472.PubMed CentralPubMedView Article
- Bani-Yaghoub M, Kubu CJ, Cowling R, Rochira J, Nikopoulos GN, Bellum S, Verdi JM: A switch in numb isoforms is a critical step in cortical development. Dev Dyn. 2007, 236: 696-705. 10.1002/dvdy.21072.PubMedView Article
- McKee AE, Minet E, Stern C, Riahi S, Stiles CD, Silver PA: A genome-wide in situ hybridization map of RNA-binding proteins reveals anatomically restricted expression in the developing mouse brain. BMC Dev Biol. 2005, 5: 14-10.1186/1471-213X-5-14.PubMed CentralPubMedView Article
- Ladd AN, Charlet N, Cooper TA: The CELF family of RNA binding proteins is implicated in cell-specific and developmentally regulated alternative splicing. Mol Cell Biol. 2001, 21: 1285-1296. 10.1128/MCB.21.4.1285-1296.2001.PubMed CentralPubMedView Article
- Nakahata S, Kawamoto S: Tissue-dependent isoforms of mammalian Fox-1 homologs are associated with tissue-specific splicing activities. Nucleic Acids Res. 2005, 33: 2078-2089. 10.1093/nar/gki338.PubMed CentralPubMedView Article
- Yeo GW, Coufal NG, Liang TY, Peng GE, Fu XD, Gage FH: An RNA code for the FOX2 splicing regulator revealed by mapping RNA-protein interactions in stem cells. Nat Struct Mol Biol. 2009, 16: 130-137. 10.1038/nsmb.1545.PubMed CentralPubMedView Article
- Tanaka TS, Jaradat SA, Lim MK, Kargul GJ, Wang X, Grahovac MJ, Pantano S, Sano Y, Piao Y, Nagaraja R: Genome-wide expression profiling of mid-gestation placenta and embryo using a 15,000 mouse developmental cDNA microarray. Proc Natl Acad Sci USA. 2000, 97: 9127-9132. 10.1073/pnas.97.16.9127.PubMed CentralPubMedView Article
- Kwan T, Benovoy D, Dias C, Gurd S, Provencher C, Beaulieu P, Hudson TJ, Sladek R, Majewski J: Genome-wide analysis of transcript isoform variation in humans. Nat Genet. 2008, 40: 225-231. 10.1038/ng.2007.57.PubMedView Article
- Atlasi Y, Mowla SJ, Ziaee SAM, Gokhale PJ, Andrews PW: OCT4 Spliced Variants Are Differentially Expressed in Human Pluripotent and Nonpluripotent Cells. Stem Cells. 2008, 26: 3068-3074. 10.1634/stemcells.2008-0530.PubMedView Article
- Cox CJ, Espinoza HM, McWilliams B, Chappell K, Morton L, Hjalt TA, Semina EV, Amendt BA: Differential regulation of gene expression by PITX2 isoforms. J Biol Chem. 2002, 277: 25001-25010. 10.1074/jbc.M201737200.PubMedView Article
- Schweickert A, Campione M, Steinbeisser H, Blum M: Pitx2 isoforms: involvement of Pitx2c but not Pitx2a or Pitx2b in vertebrate left-right asymmetry. Mech Dev. 2000, 90: 41-51. 10.1016/S0925-4773(99)00227-0.PubMedView Article
- Martinez-Contreras R, Cloutier P, Shkreta L, Fisette JF, Revil T, Chabot B: hnRNP proteins and splicing control. Adv Exp Med Biol. 2007, 623: 123-147. full_text.PubMedView Article
- Yeo GW, Van Nostrand EL, Liang TY: Discovery and analysis of evolutionarily conserved intronic splicing regulatory elements. PLoS Genet. 2007, 3: e85-10.1371/journal.pgen.0030085.PubMed CentralPubMedView Article
- Ni JZ, Grate L, Donohue JP, Preston C, Nobida N, O'Brien G, Shiue L, Clark TA, Blume JE, Ares M: Ultraconserved elements are associated with homeostatic control of splicing regulators by alternative splicing and nonsense-mediated decay. Genes Dev. 2007, 21: 708-718. 10.1101/gad.1525507.PubMed CentralPubMedView Article
- Lareau LF, Inada M, Green RE, Wengrod JC, Brenner SE: Unproductive splicing of SR genes associated with highly conserved and ultraconserved DNA elements. Nature. 2007, 446: 926-929. 10.1038/nature05676.PubMedView Article
- Zhang C, Zhang Z, Castle J, Sun S, Johnson J, Krainer AR, Zhang MQ: Defining the regulatory network of the tissue-specific splicing factors Fox-1 and Fox-2. Genes Dev. 2008, 22: 2550-2563. 10.1101/gad.1703108.PubMed CentralPubMedView Article
- Johnson MB, Kawasawa YI, Mason CE, Krsnik Z, Coppola G, Bogdanovic D, Geschwind DH, Mane SM, State MW, Sestan N: Functional and evolutionary insights into human brain development through global transcriptome analysis. Neuron. 2009, 62: 494-509. 10.1016/j.neuron.2009.03.027.PubMed CentralPubMedView Article
- Salomonis N, Nelson B, Vranizan K, Pico AR, Hanspers K, Kuchinsky A, Ta L, Mercola M, Conklin BR: Alternative splicing in the differentiation of human embryonic stem cells into cardiac precursors. PLoS Comput Biol. 2009, 5: e1000553-10.1371/journal.pcbi.1000553.PubMed CentralPubMedView Article
- Edgar R, Domrachev M, Lash AE: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30: 207-210. 10.1093/nar/30.1.207.PubMed CentralPubMedView Article
- Affymetrix Powertools. [http://www.affymetrix.com/partners_programs/programs/developer/tools/powertools.affx]
- Affymetrix: Identifying and Validating Alternative Splicing Events. Affymetrix Technical Report. 2006
- Fisher RA: Combining independent tests of significance. Amer Stastician. 1948, 2: 30-10.2307/2681650.
- Benjamini Y, Hochberg Y: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J Roy Statist Soc B. 1995, 57: 289-300.
- Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ: The UCSC Table Browser data retrieval tool. Nucleic Acids Res. 2004, 32: D493-496. 10.1093/nar/gkh103.PubMed CentralPubMedView Article
- Welch BL: The generalization of "Student's" problem when several different population variances are involved. Biometrika. 1947, 34: 28-35.PubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.