Alternative splicing is frequent during early embryonic development in mouse
© Revil et al. 2010
Received: 20 January 2010
Accepted: 23 June 2010
Published: 23 June 2010
Skip to main content
© Revil et al. 2010
Received: 20 January 2010
Accepted: 23 June 2010
Published: 23 June 2010
Alternative splicing is known to increase the complexity of mammalian transcriptomes since nearly all mammalian genes express multiple pre-mRNA isoforms. However, our knowledge of the extent and function of alternative splicing in early embryonic development is based mainly on a few isolated examples. High throughput technologies now allow us to study genome-wide alternative splicing during mouse development.
A genome-wide analysis of alternative isoform expression in embryonic day 8.5, 9.5 and 11.5 mouse embryos and placenta was carried out using a splicing-sensitive exon microarray. We show that alternative splicing and isoform expression is frequent across developmental stages and tissues, and is comparable in frequency to the variation in whole-transcript expression. The genes that are alternatively spliced across our samples are disproportionately involved in important developmental processes. Finally, we find that a number of RNA binding proteins, including putative splicing factors, are differentially expressed and spliced across our samples suggesting that such proteins may be involved in regulating tissue and temporal variation in isoform expression. Using an example of a well characterized splicing factor, Fox2, we demonstrate that changes in Fox2 expression levels can be used to predict changes in inclusion levels of alternative exons that are flanked by Fox2 binding sites.
We propose that alternative splicing is an important developmental regulatory mechanism. We further propose that gene expression should routinely be monitored at both the whole transcript and the isoform level in developmental studies
Developmental processes require precise spatial and temporal regulation of gene expression. Accordingly, developmental biologists have always been at the forefront of gene expression analysis, and recombinant DNA techniques such as transgenic and knockout models have greatly contributed to elucidation of developmental pathways and networks. Traditionally, these studies have focused on transcription factors and repressors that regulate the timing and strength of transcription. Recently, new regulatory mechanisms have emerged, such as post-transcriptional regulation by microRNAs and co-transcriptional regulation by alternative pre-mRNA splicing.
Alternative splicing is a pre-mRNA maturation process that consists of the removal or inclusion of certain alternative exons to produce different transcripts from one genomic locus [1, 2]. Alternative splicing is now known to be prevalent in advanced eukaryotes. In humans, recent reports show that more than 98% of multi-exonic pre-mRNAs are alternatively spliced [3, 4]. The mouse genome has been sequenced and, similarly to that of humans, a surprisingly low number of less than 30,000 genes have been identified [5, 6]. It has been widely hypothesized that the great complexity of higher eukaryotic organisms stems from processes such as alternative splicing [7, 8]. The distinct proteins translated from identical pre-mRNAs produced by this process can have different, even antagonistic activities. Thus, alternative splicing can play a major role in the activity of various important cellular mechanisms, such as cell differentiation, cell migration, cell growth and apoptosis. This wide range of cellular processes is required during mammalian embryogenesis to generate a viable organism from a single cell.
Several studies have suggested the importance of alternative splicing during development. In C. elegans, it was shown that 18% of the 352 verified alternative exons showed a larger than fourfold change in alternative splicing during its development from embryo to adult, including larval stages . In humans, mice, chickens and Xenopus, a well-known example is provided by the fibroblast growth factor 8 (FGF8), which can produce many different isoforms . Two of these, FGF8A and FGF8B, which differ by only eleven amino acids, have been shown to have different activities during development [11–14].
It has recently been shown that the levels of certain splicing factors, MBNL and CELF, are regulated and vary several-fold during pre- and postnatal heart development . This variation in CELF and MBNL expression levels affects the splicing modulation of a large quantity of other alternative splicing events, suggesting the existence of a regulatory cascade at the splicing level. Some of the most interesting examples of alternative splicing and its regulation by splicing factors have been carried out in neural tissues. As neuronal precursor cells differentiate into neurons, there is a switch from the ubiquitous PTB to the highly similar, but neuron-specific, neural PTB (nPTB) . As these two proteins modulate alternative splicing of specific subsets of pre-mRNAs, there is an associated switch of a large number of mRNA isoforms.
Since splicing factors may regulate alternative splicing of many different pre-mRNAs [17, 18], knocking out known splicing factors in mice generally has profound effects on embryo or young pup viability. The majority of germ-line loss of function mutations in splicing factors result in embryonic arrest early in development, before embryonic day (E) 7.5, e.g. Ptb , SC35 [20, 21], Asf2/Sf2  and SRp20 . In two cases, Ptb and SRp20, homozygous mutant embryos arrest at the morula stage [19, 23]. Prfp3 mutant embryos also exhibit embryonic arrest, although it is not clear if these embryos die early or late in embryogenesis . Germ-line loss of function mutations in splicing factors are also associated with organ specific abnormalities. For example, most SRp38 knockout embryos die before E15.5 with multiple cardiac defects , whereas a small number of mutant mice are born only to die soon after birth . Mbnl1 and Mbnl2 are required in the skeletal muscle and eye [26, 27], and Nova1 is required in motor neurons . These studies indicate that there are stage-specific and tissue-specific requirements for splicing factors.
Although there have been a number of indications of the importance of alternative splicing in development, past studies have been limited largely to individual experiments focused on known candidate genes. In recent years, technological advances have paved the way to genome-wide analyses of mRNA processing, and have enabled hypothesis-free approaches. In this study, we take advantage of a splicing-sensitive exon microarray to investigate genome-wide variation in alternative splicing during development of the mouse embryo and its associated placenta. We focus on early developmental stages, E8.5-E11.5, in order to capture the isoform differences occurring during organogenesis. In mouse embryos, organogenesis begins at E8.5 and is mostly completed by E11.5 when most organs of the foetus can be recognized. The placenta is one of the first organs to form and function in the developing foetus. During organogenesis the precursors of the adult organs undergo a series of morphogenetic movements and differentiation events which have been shown to be controlled by changes in gene expression. We postulate that alternative splicing provides an additional mechanism for increasing the repertoire of transcripts from a limited number of developmentally relevant genes and confers additional specificity to individual developing tissues. Our results show that alternative splicing is frequent during organogenesis, and that different tissues (i.e. the placenta and embryo) as well as different developmental stages express specific gene isoforms. We confirm some previously described splicing events but also report numerous new time- and tissue-specific alternative isoforms. In addition, we find that the mRNA expression levels of some known splicing factors are modulated during organogenesis, suggesting that changes in expression levels of splicing factors may be responsible for alternative isoform expression.
We next carried out the splicing index analysis which identified a large number of candidate exons predicted to be alternatively spliced across the two tissues and three stages. Since it is instrumental to distinguish between truly alternatively spliced exons, and exons expressed at very low levels (indistinguishable from background noise) or very highly (beyond the sensitivity levels of the microarray); we employed a number of filtering steps, as described in Materials and Methods and previously in . The Affymetrix Mouse Exon Array contains over 23,000 core meta probe sets (genes) and 280,000 probe sets (exons). Our filtering criteria reduced this number to 13,366 meta probe sets containing approximately 133,000 probe sets.
We investigated the efficacy of our data filtering steps by estimating the percentage overlap between candidate alternatively spliced exons in our study, with those predicted to be alternatively spliced based on independent EST and mRNA evidence such as those annotated by the "AltEvents" track from build 37 of the mouse genome on the UCSC Genome browser [5, 29, 30]. The Alt-Events track annotates alternative splice forms in UCSC "known" genes, which are assembled by combining evidence from multiple independent experimental sources. Application of our data filtering steps increases the percentage overlap with alternative events predicted from independent sources (Additional File 1 Figure S1). In addition, by increasing the stringency of the false discovery rate (FDR) cutoff we further increased the percentage overlap. This analysis demonstrates that the application of data filtering as well as the use of a stringent q-value cutoff increases the agreement between our results and alternatively spliced events predicted from independent experimental evidence. We note, however, that while our data filtering likely increases the sensitivity of our analysis, it is likely that the filters concurrently reduce specificity.
To assess patterns of gene expression and splicing change over our entire data set, we defined a significant difference as a P-value which was less than that expected at an FDR of 0.05. Significant stage effects were defined using the combined P-values from the embryo and placenta: that is, a significant developmental stage effect reflects a difference in expression between different embryonic stages, placental stages or both. For brevity, we refer to differences in expression between embryo and placenta as "tissue-specific" and differences in expression between developmental stages as "stage-specific".
We next investigated the overlap between whole gene expression change and alternative splicing. For clarity, in this section we focused on the 10,796 genes which contained at least one filtered (clearly expressed) probe set. We find that genes whose expression is significantly different between tissues or stages are highly enriched for alternatively spliced exons (Figure 3D). Genes that are differentially expressed between tissue are primarily enriched (compared to all genes) in tissue-specific alternative exons, while genes that are differentially expressed between developmental stages are likewise enriched in stage-specific alternative exons (Figure 3D). This analysis suggests that a large proportion of genes that are differentially expressed in early embryonic development concurrently exhibit significant isoform changes.
Overrepresented PANTHER "Biological Process" terms.
PANTHER Biological Process
Cell adhesion-mediated signaling
Overrepresented PANTHER "Molecular Function" terms.
PANTHER Molecular Function
Extracellular matrix structural protein
Cell adhesion molecule
Validation of cassette exons by RT-PCR.
Integrin α 6
ELKS/RAB6-interacting/CAST family member 1
Kinesin Family Member 2A
Erythrocyte Protein Band 4.1-like 3
NUMB Gene Homolog
DEP Domain Containing 5
WNK Lysine Deficient Protein Kinase 1
MYC Binding Protein 2
Alpha Glucosidase 2 Alpha Neutral Subunit
Validation of different alternative splicing events by real-time PCR.
engulfment and cell motility 1, ced-12 homolog (C. elegans)
ankyrin 3, epithelial
glucosaminyl (N-acetyl) transferase 2, I-branching enzyme
transcription factor 4
kinesin family member 1B
intersectin 1 (SH3 domain protein 1A)
MYC binding protein 2
ELKS/RAB6-interacting/CAST family member 1
WNK lysine deficient protein kinase 1
kinesin family member 2A
DEP domain containing 5
RAB6, member RAS oncogene family
In order to gain further insight into temporal variation in isoform expression, the qRT-PCR was performed on additional embryonic and placental stages, ranging from E6 to E11. The results of these additional stages were almost all predictable by extrapolation of the trends observed in the microarrays. A few demonstrated some unexpected behaviour, which may correspond to rapid changes in splicing patterns corresponding to developmental switches. As mice embryos develop extremely rapidly, with a gestation period of only 19-21 days, it is possible that the alternative splicing of certain pre-mRNAs can also be modified extremely rapidly.
The seventeen validated candidates are implicated in various cellular and organismal processes, and some have previously been shown to be involved in embryogenesis. For example, Numb is involved in neuronal differentiation during development. There are four known isoforms of Numb, two of which contain an alternative exon of 147 nt in the proline rich region and two without this exon . These two classes have been shown to be differentially spliced during embryogenesis from E10 to E17, as well as in adults, to regulate cortical development . This shift in alternative splicing is confirmed by our microarray results and we also find that this starts as soon as E8.5, the earliest data we have for this alternative splicing event.
Top 20 gene expression candidates in RNA binding proteins.
Proposed functions (GO)
P-value of changes of gene expression
P-value of changes of alternative splicing
Heat shock protein binding
Ribosome biogenesis (yeast)
RNA splicing, RNA helicase
Transcription regulator (during development)
Cell differentiation, development
rRNA processing, ribosome biogenesis
Because some aspects of the function of Fox2 and its binding sites within RNA sequences have been previously characterized, we were able to predict the effect of variation of Fox2 mRNA expression on a subset of exons that contain its putative binding sites. We compared the alternative splicing of cassette exons between E8.5 and E11.5 embryos as well as between E11.5 embryo and E11.5 placenta (P11.5), the two stages we observed the highest differences for Fox2 expression. The results obtained for these ratios (Figure 7D) demonstrate that inclusion rates of exons that are flanked by putative binding sites for Fox2 are highly correlated with the expression of this gene. The candidate exons were divided into two sets: those that contained a putative Fox2 binding site within the upstream intron, and those with a binding site in the downstream intron. Previous studies suggest that binding of Fox2 upstream of the exon should result in increased exon skipping, while downstream binding results in exon inclusion . Our results indeed demonstrate that when the introns downstream of a cassette exon contained UGCAUG, a hexamer representing the most specific binding site for this protein, exon inclusion levels were positively correlated to Fox2 expression levels, while the opposite was true for exons with upstream binding sites. There is nearly a ten-fold difference between the means of exon inclusion ratios for these two categories, when comparing E11.5 and E8.5 (P = 0.0204). We also used a more relaxed criterion for identifying putative Fox2 binding sites, including all the binding sites previously described . This analysis increased the sample size from 22 to 58 candidate regulated cassette exons (Additional file 4 Table S2), and resulted in increased statistical significance levels of the regulatory effect (P = 0.0012; Figure 7D, bars labelled "All"), suggesting that Fox2 binding sites may allow a level of degeneracy in vivo, and that there may be a large number of splicing events regulated by Fox2 during early embryonic development.
Alternative splicing is becoming a widely studied process that can increase the transcriptome complexity using a finite set of genes. However, little is known on the regulation of alternative splicing during development, where time and tissue specific regulation of protein levels is fundamental for cell differentiation, apoptosis and migration, all key regulators of organogenesis. In this work, we show that splicing is extensively regulated during organogenesis. We compile a list of 5,500 genes that indicate transcript isoform differences between our two tissues of interest - embryonic and placental - or across three developmental stages, E8.5, E9.5 and E11.5. According to a 5% FDR cutoff, we estimate that 9% of all exons are differentially alternatively spliced between the placenta and the embryo, and 2% of exons are differentially alternatively spliced across the three developmental stages. With 5 biological replicates each, and 6 distinct tissue/stage samples, our study has considerable statistical power to detect alternatively spliced transcripts. Our analysis also allows us to compare the fraction of genes that show isoform level changes with genes that are differentially expressed at the whole transcript level. Although we find that the number of differentially expressed genes is larger than differentially spliced, the numbers are of similar orders of magnitude (Figure 3). Interestingly, there is a large overlap between these subsets, and we show that genes with very significant levels of differential expression have a much higher than expected chance of being alternatively spliced.
It is also worth noting that because of the high statistical power of the study, our number of differentially expressed candidates is ten times higher than what was previously demonstrated by a genome-wide expression analysis of placentas and embryos at E12.5, which found only 6.5% of statistically significant differences in gene expression . This could be explained in part by the advances in microarray analyses - i.e. the use of whole-transcript arrays - and the profiling of two placental stages allowing us to target a larger number of developmental events. However, it should be noted that the observed trends in expression are highly concordant between these two studies; most of the genes that had been shown to be highly expressed in placentas in the earlier study displayed the same high expression profile in our analysis.
Although we cannot rule out that a fraction of the genome-wide alternative splicing candidates may represent false positive results, our validation of 17 candidates presented here, as well as the 80% success in validation of previous experiments  suggests that false positives should not significantly affect our qualitative conclusions. It is difficult to estimate false negative rates in these types of studies. Although they are likely to be non-negligible, the limited power of the approach would render our estimates of the true frequency of alternative splicing conservative. We expect that this study exposes only the tip of the iceberg, and that a clearer picture will emerge as the whole-genome technologies mature. In the meantime, we provide a user-friendly database of our results in the additional material online (Additional file 1 Table S1).
In the current paradigm, regulation of developmental pathways is accomplished mainly at the level of transcription, and is mediated by changes in expression of certain key genes. For example, one of the earliest events in embryonic development is the separation of the inner cell mass - which forms the embryo proper - from the trophectoderm lineage. This event is regulated by interaction between three transcription factors: Cdx2, Nanog, and Oct4. This transcriptional regulation is essential for turning on numerous other genes that will drive differentiation of the embryo and its associated placenta. Incidentally, Oct4 is known to express multiple alternatively spliced isoforms with distinct expression patterns and possibly distinct functions . In another well-known example, left-right asymmetry is governed by the Nodal molecular cascade. At E7.5 Nodal is turned on at the node, leading to auto regulation of its own expression in the left lateral plate, and turning on the expression of Lefty and Pitx2 genes. These three key players form the basis of a pathway containing numerous genes that control the specification of left versus right. Like Oct4, Pitx2, (one of our candidates, see Additional file 1 Table S1), expresses several isoforms with distinct transcriptional activities , and only one of those isoforms, Pitx2c, is required for left right asymmetry .
As the above examples demonstrate, many developmental pathways include alternative isoforms of key genes. Our own analysis presented here also indicates that a large proportion of developmentally regulated genes express alternative isoforms. The correct temporal and spatial expression of those isoforms is most likely regulated by specific splicing factors. Hence, we propose that in addition to canonical developmental pathways regulated by transcription factors, there exists a parallel, and yet highly overlapping, set of pathways regulated by splicing factors. A single splicing factor may affect the splicing of numerous exons in a large number of genes [18, 44, 45]. Thus, changes in expression levels of splicing factors can have profound downstream effects. Moreover, many splicing factors are themselves involved in self-regulatory feedback loops and express multiple isoforms [46, 47]. Our analysis indicates differential embryonic expression of a number of RNA binding proteins and putative splicing factors. One of the particularly interesting examples is Fox2 (RBM9 or Fxh), a factor whose expression and promoter usage are both variable across the three developmental stages (Figure 7B). We find that Fox2 expression increases nearly 2 fold between days 8.5 and 11.5 in the embryo, but not in the placenta. We also show that this increase in Fox2 levels has a significant and predictable effect on the alternative splicing of cassette exons which contain putative binding sites within 100 nt in the neighbouring introns (Figure 7D). Furthermore, the use of the proximal promoter, which gives rise to an mRNA encoding a shorter N-terminus, increases between those stages (Figure 7C). Thus, not only is the overall amount of Fox2 product, but also the ratio of distinct isoforms, variable across early embryonic development.
Recently Fox2 has been found to be highly expressed in another developmentally relevant system, pluripotent cell lines, along with other pluripotency markers such as Oct4 and Nanog. Fox2 has been shown to regulate the splicing of numerous alternative exons in human embryonic stem cells, and to be necessary for maintaining viability of the cells. Moreover, Fox2 pre-mRNA contains active binding sites for its own protein, indicating a degree of autoregulation. Finally, many Fox2 targets are in turn splicing regulators, suggesting the existence of more extensive splicing networks . The properties of such networks have been recently investigated in detail across a number of tissues and species .
Our study, along with a few recent publications [38, 48–50] demonstrates that alternative splicing is a frequent event and is likely to have a significant role in development. With the promising results presented here, the future step will be to improve detection rates by using additional developmental stages, and a finer resolution of embryonic tissues. Indeed, the use of whole embryo mRNAs does not permit discovery of small-scale tissue-specific alternative splicing, which is likely to be extremely significant to tissue differentiation. Finally, most of our knowledge pertaining to genome-wide variation in alternative splicing has been acquired thanks to recent developments in microarray technologies, using either exon or splice-junction microarrays. Even more recent advances in high throughput sequencing will soon make it feasible to carry out whole-genome expression and isoform profiling experiments using mRNA sequencing. This approach allows digital monitoring of expression by counting the number of reads that map to mRNA segments of interest and comparing them across samples. These fragments may be genes, individual exons, or specific splice junctions. mRNA sequencing also allows the discovery of novel, as of yet unannotated, isoforms and transcripts. Preliminary studies carried out on a limited number of tissues and with limited sequencing coverage, [3, 4] have already significantly enlarged our catalogue of alternatively spliced genes. With the rapidly increasing throughput, along with dropping costs and accessibility of sequencing, these technologies will soon allow us to routinely view gene expression at the sub exon level resolution and to decipher the role of splicing regulation in development and other systems.
In this study, we have demonstrated that changes in alternative splicing are frequent during early mouse development between tissues and stages. Indeed, the numbers of differentially spliced genes are similar to those of differentially expressed genes. In these two categories, there is an overrepresentation of genes that have been implicated in development. In addition, many RNA-binding proteins are differentially spliced and/or expressed at the mRNA level. We can see a direct correlation between expression of Fox2, a known splicing factor, and the alternative splicing of cassette exons containing binding sites for this protein in the neighbouring introns. Thus, our results suggest that the effects of alternative mRNA isoforms should now be systematically verified in developmental gene studies.
To generate embryos, C57BL/6J females were placed with C57BL/6J males overnight and checked for the presence of a vaginal plug in the morning. The day that a plug was detected was considered embryonic day (E) 0.5. All mice breeding and manipulations were performed in accordance with the Canadian Council on Animal Research. Embryos were collected between E6.5 - E11.5 for RNA isolation. Embryos for RNA isolation were stored in RNAlater (Ambion) for microarray analysis or TRIzol (Invitrogen) for RT-PCR, and isolated according to the manufacturers' protocol.
The microarray hybridization and analysis was done as previously described , but using the Affymetrix GeneChip® Mouse Exon 1.0 ST microarray. One microarray was used for each of the five biological replicates used in every stage and tissue analysed. All the microarray data has been deposited online in NCBI's Gene Expression Omnibus  with the accession number GSE21971. http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE21971.
The Affymetrix Mouse ExonArray contains approximately 1.2 million probe sets which target roughly 1 million known and predicted exons. The annotations used to design these probe sets are derived from a variety of sources and vary dramatically in the strength of experimental evidence which supports their existence. These annotations are divided, in order of decreasing quality of experimental support into "core", "extended", "full", "free" and "ambiguous" annotations. Our analysis was restricted to the approximately 2.2 × 105 core probe sets on the ExonArray. These probe sets interrogate exons derived from RefSeq transcripts and/or full-length GenBank mRNAs. There are a number of issues to consider when analysing GeneChip data from an experiment with multiple arrays. These include background correction, normalization, non-specific hybridization and probe summarization. We background-corrected probe expression levels for non-specific binding based the distribution of binding intensities of a set of "anti-genomic" probes of a similar GC content to the probe of interest. These anti-genomic probes are designed so as to not hybridize with any sequence in the mouse genome and so provide an estimate of the level of non-specific binding for a given GC content. Probe intensities were then quantile normalized across all samples. Finally, individual probe expression levels were summarized into exon- and gene-level expression using the Probe Logarithmic Intensity Error (PLIER) algorithm. All analyses were performed using the Affymetrix Power Tools suite of command line programs .
A major problem with inferring alternative splicing of exons between different biological samples using ExonArrays is differentiating such variation from changes in whole-gene expression level. The combination of changes in whole gene expression level with misleading probe set results can introduce potentially artefactual signals of alternative splicing in three main ways: via (i) the inclusion of probe sets from genes that are not expressed in a subset of samples (ii) the inclusion of cross-hybridizing probes or (iii) the inclusion of nonresponsive or "dead" probes [, Figure eight]. With these problems in mind we employed a number of quality control protocols to our expression data. Our objective was to minimize the number of false positive signals of differential splicing between our samples. Our filters are based upon the standards described by Affymetrix . Firstly, in order to differentiate genic from exonic expression changes, all exon expression levels were normalized by the whole gene expression level estimated by the PLIER algorithm. Secondly, to minimize errors introduced by unexpressed genes, we also excluded all meta-probe sets and their constituent probe sets that were not expressed in all 25 of our samples. A gene was defined as "not expressed" if its mean expression level was lower than the quartile of the distribution of all core meta probe set intensities for the chip in question. Thirdly, we attempted to minimize the influence of cross hybridizing probe sets in our analysis by removing all probe sets in which the probe set/meta-probe set intensity ratio was greater than 5, indicating a high level of cross-hybridization. Finally, in order to remove "dead" probe sets we used the detection above background (DABG) P-value, as estimated by the apt-probeset-summarize program. The DABG P-value describes the probability that an intensity value at least as extreme as the observed could have been drawn from the null distribution, in this case the background distribution of intensity values. In order to account for chip-to-chip variation, we set a False Discovery Rate-corrected P-value threshold at 0.05 for each chip, based on the distribution of DABG values for that chip. In our case, a "dead" or unresponsive probe set was defined as any probe set in which the DABG P-value exceeded the chip-specific FDR threshold value in all samples. In order to stabilize the variance data values were log-transformed.
Each probe set in our filtered dataset retained 25 estimates of expression level. These estimates were divided by tissue (embryonic and placental) and developmental stage (days E8.5, E9.5 and E11.5 embryonic, days E9.5 and E11.5 placental) with 5 biological replicates in each tissue-stage. For simplicity, we refer to embryonic samples as a single "tissue" throughout. The absence of placental samples from day E8.5, due to insufficient RNA quantities, means that our dataset is unbalanced and therefore, not amenable to analysis by standard two-way ANOVA. Instead, we implemented the following approach. In order to test for significant differences in expression level between tissues we use a two-sample t-test, combining samples from day E9.5 and day E11.5 from both tissues. We tested for differences in expression level between developmental stages using a two sample t-test in placenta to compare expression level on day E9.5 and day E11.5, and using a one-way ANOVA in embryo to compare expression levels on days E8.5, E9.5 and E11.5. The p-values for these latter two tests were then combined using Fisher's method  under the common null hypothesis of no significant variation in expression level across developmental stages in either tissue. All p-values were adjusted for multiple testing by using the Benjamini and Hochberg  FDR procedure, and a false discovery rate of 0.05 was used as a cutoff for reporting the results.
In order to confidently identify cassette exons, of the top 2830 significant probe sets (Additional file 1 Table S1), we selected those that were annotated as "cassetteExon" in the "knownAlt" track on the UCSC Genome Browser . As binding sites for Fox2 are found predominantly in the surrounding introns within 100 nt of the splicing sites , upstream and downstream introns were scanned for the presence of several different putative binding sites previously published , using sequences extracted with the UCSC Table Browser data retrieval tool . We then calculated E11.5/E8.5 and E11.5/P11.5 ratios for each probe set value, normalized to the corresponding meta probe set. This ratio corresponds to the predicted ratio of exon inclusion levels. The candidate exons were divided into two sets: those that contained a putative Fox2 binding site within the upstream intron, and those with a binding site in the downstream intron. The data was log-transformed in order to stabilize the variance of the ratios, and the differences between the two sets were analysed using a one-tailed, unpaired Welch's t test . The results were considered significant if the p-values for the difference between the means were less than 0.05 (P < 0.05).
Standard PCR validation was done using HotStarTaq PCR mix (Qiagen) according to the manufacturer's instructions. 8 ng of cDNA were used in a 10 μl reaction with conditions set as follows: 15 minutes at 95°C, followed by 30 cycles of 30 seconds at 95°C, 30 seconds at 58°C and 45 seconds at 72°C. The resulting products were analyzed on a 2% agarose gel (Additional file 3 Figure S2). Primer sequences used are available in additional table (Additional file 4 Table S3).
The validation of the results by qPCR was done in a 384-well format using the 7900HT Fast Real-Time PCR System (Applied Biosystems) and the Power SYBR® Green PCR Master Mix (Applied Biosystems) according to the manufacturer's protocol. The concentrations of the reagents used were 8 ng of cDNA was in a 10 μl reaction with 320 nM of each primer. The parameters of the qRT-PCR machine were 10 minutes at 95°C, followed by 40 cycles of 20 seconds at 95°C, 30 seconds at 58°C and 45 seconds at 72°C. The primers used are shown in additional table (Additional file 4 Table S4).
Detection Above BackGround
Differentially Alternatively Spliced
False Discovery Rate
Meta Probe Set
Principal Component Analysis
We would like to thank Kevin Lam, Vonda Koka, Kevin Ha and Tony Kwan for their useful advice. This work was supported by funding from FQRNT (LJM), CIHR and Genome Canada/Genome Quebec (JM). LAJM is a member of the Research Institute of the McGill University Health Centre, which is supported in part by the FRSQ.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.