Transcriptomic changes arising during light-induced sporulation in Physarum polycephalum

Background Physarum polycephalum is a free-living amoebozoan protist displaying a complex life cycle, including alternation between single- and multinucleate stages through sporulation, a simple form of cell differentiation. Sporulation in Physarum can be experimentally induced by several external factors, and Physarum displays many biochemical features typical for metazoan cells, including metazoan-type signaling pathways, which makes this organism a model to study cell cycle, cell differentiation and cellular reprogramming. Results In order to identify the genes associated to the light-induced sporulation in Physarum, especially those related to signal transduction, we isolated RNA before and after photoinduction from sporulation- competent cells, and used these RNAs to synthesize cDNAs, which were then analyzed using the 454 sequencing technology. We obtained 16,669 cDNAs that were annotated at every computational level. 13,169 transcripts included hit count data, from which 2,772 displayed significant differential expression (upregulated: 1,623; downregulated: 1,149). Transcripts with valid annotations and significant differential expression were later integrated into putative networks using interaction information from orthologs. Conclusions Gene ontology analysis suggested that most significantly downregulated genes are linked to DNA repair, cell division, inhibition of cell migration, and calcium release, while highly upregulated genes were involved in cell death, cell polarization, maintenance of integrity, and differentiation. In addition, cell death- associated transcripts were overrepresented between the upregulated transcripts. These changes are associated to a network of actin-binding proteins encoded by genes that are differentially regulated before and after light induction.


Background
Physarum polycephalum, commonly known as "slime mold", belongs to the mycetozoan group of Amoebozoa. Physarum follows a complex life cycle with haploid and diploid cell types, formed in temporal order as triggered by environmental stimuli [1]. The plasmodium is a multinucleate single cell whose nuclei display natural synchrony with respect to cell cycle and differentiation status. During several days of starvation, a plasmodium grown to macroscopic size becomes competent to sporulate. Sporulation can then be triggered experimentally by exposing a competent plasmodium to a pulse of blue or far-red light, or to heat shock. As plasmodial nuclei are synchronous, and because the timing of the differentiation program in individual plasmodia can be reproduced experimentally, the stage-specific gene expression program that leads to sporulation can be analyzed at high resolution [2][3][4]. Recent large scale cDNA surveys [5,6] provide a basis for studying the phenomenon at the transcriptomic level.
In order to identify the differentially expressed genes associated with the commitment to sporulation, we characterized and compared two cDNA libraries prepared from competent and light-induced plasmodia using massive parallel sequencing technology [7]. We employed this method because it does not rely on reference transcripts for quantitation, previous cloning steps are not required, it does not have an upper limit for quantitation, and it is a relatively unbiased approach [8]. From the comparison of annotations and transcript quantitations, we found that most differentially expressed genes encode proteins associated to a network of actin-binding proteins. Components of this putative interaction network are associated to development, DNA repair, cell division, calcium release, cell death, and maintenance of cell integrity.

Results
Sequencing and Profiling of cDNAs expressed in competent and light-induced plasmodia Separate cDNA libraries were constructed from polyA + RNA isolated from two sources: (i) competent plasmodia; and (ii) sporulation-induced plasmodia (competent plasmodia harvested six hours after exposure to far-red light). The cDNAs libraries were then analyzed using massive parallel sequencing [7,8]. Transcripts were annotated at every bioinformatic level, and the annotation data was used to infer hypothetical interaction networks from differentially regulated genes. The whole approach is summarized in the [Additional File 1: Figure S1].
From the pyrosequencing, we obtained a total output of 61.9 Mb from two runs, corresponding to the starved (26.1 Mb) and light-induced (35.8 Mb) plasmodia libraries. Considering that Physarum possess a 300 Mb genome, and assuming that 10% is encoding genes, we estimate a 2.06× coverage of protein coding sequences. The assembled sequencing output consisted of 26,037 sequences, and large cDNAs from this assembly (>500 bp) were then joined to a previously available sequence dataset [6], to form a comprehensive set of representative transcripts. This analysis produced 16,669 sequences, 13,169 of these containing transcript abundance data: 125,456 reads from competent and 99,632 reads from light-induced plasmodia, respectively [Additional File 2]. We used this abundance data (number of reads for each assembled transcript) as a measure of expression, which we defined as "hit counts". The remaining contigs without hit counts consisted of previously sequenced clones from a normalized cDNA library prepared from competent plasmodia [6], indicating that the normalization produced a broader coverage of transcripts. From 11,399 cDNA contigs detected in the competent plasmodia library (10,689 in lightinduced plasmodia), over 4,227 were represented with at least five hits (3,553 in light-induced plasmodia [Additional File 3: Figure S2]). Conversely, 8,711 transcripts (52,3%) were found with 5 or less sequence hits in both samples. For statistical reasons, no statement on the differential expression from this fraction could be made. Between contigs with lowest hit counts, 2,437 cDNA species were represented by just one hit (competent plasmodia), and 2,621 from light-induced sample [Additional File 3: Figure S2].
We compared the transcript hit counts between different libraries as a measure of differential gene expression. As most contig species were represented by low hit counts, we normalized the number of hits. To this end, we first obtained the relative frequency (number of hits divided by the total hits on a given condition), and later we calculated the relative frequencies for each contig in the two cDNA samples compared to each other. Given that each EST belongs to a single gene, the significance of its differential expression depends only on the number of hits, respect to the total number of hits on each library [9]. Following these assumptions, we found 2,772 cDNAs that displayed significant differential expression (P-value < 0.05). All contig species, regardless of whether differentially expressed or not were submitted to the Sequence Read Archive subset of GenBank [10] (Accession numbers SRX012830 and SRX012831).
The newly assembled contigs were compared against sequence databases using BLASTX [11]. This analysis revealed that 3,310 sequences have significant similarity (≤ 1 × 10-15) to existing sequences in SwissProt [12], 3,651 to the protozoa subset from RefSeq [13], and 3,345 to proteins of the related model organism Dictyostelium discoideum (dictyBase; [14]). From the 13,169 sequences with hit counts data, orthologs were identified for 5,544 transcripts (1,287 of these with significant differential expression; [Additional File 4]).
Later, in order to identify differentially regulated genes, we clustered the contig species into expression groups according to their relative frequencies in both conditions. As a result, we found contigs encoding orthologs related to cell division (meiosis-related protein MEI2; DNA polymerase beta; actin) and protein synthesis and degradation (elongation factor 1-alpha; cathepsin-L cysteine protease) with higher relative frequencies in the competent plasmodial library. Similarly, we found orthologs related to the cytoskeleton (spire; actophorin; cell wall integrity and stress response component, WSC1) and cell differentiation genes (CudA) with greater relative frequencies in the light-induction library (Figure 1; [Additional File 5: Table S1]).

Gene Ontology Annotation of the Transcriptome
The Gene Ontology (GO) project [15] is an annotation framework that provides a standardized vocabulary that is used to assign function to uncharacterized sequences, based on three main categories: biological processes (BP), molecular functions (MF) and cellular components (CC). We employed BLAST2GO [16], a tool that associates GO terms to sequences based in several annotation evidences, to classify gene function in our dataset. Using the BLASTX hits (annotation e-value cutoff < 1 × 10-6), together with GO terms previously extracted from InterPro domain searches [17], we inferred 13,068 GO annotations for 3,304 (20%) cDNAs, with 11,446 annotations belonging to 2,459 sequences with hit counts data. Transcripts were associated to biological processes (n = 2,437; 15%), molecular functions (n = 2,801; 17%), and cellular components (n = 2,023; 12%). As many as 2,136 (13%), 1,663 (10%) and 1,645 (10%) sequences were annotated with a combination of MF and BP terms, MF and CC, and BP and CC terms respectively, and 1,487 cDNAs were annotated with MF, BP and CC terms altogether. Later, in order to analyze the differences between the two condition groups with respect to the GO annotations, we carried out Fisher exact tests using the Gossip module [18] from BLAST2GO. We found that the 'cell development' (GO:0048468), 'cell death' (GO:0008219) and 'death' (GO:0016265) GO terms were overrepresented in cDNAs with higher relative frequencies in light-induced plasmodia (false discovery rate < 0.01), as compared to competent plasmodia [Additional File 6: Table S2].

Pathway classification of transcripts
Functional annotation can also be classified using the pathway-based definition of ortholog genes from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [19]. In order to categorize the transcripts in KEGG pathways, we employed the KAAS server [20], a tool that uses similarity information to assign a sequence to a KEGG ortholog (KO) identifier, with default parameters for ESTs. We mapped 2,716 (16%) Figure 1 Relative frequencies of transcripts in libraries prepared from competent and photoinduced plasmodia. Each circle represents a single cDNA, plotted according to its relative frequencies (number of hits per transcript divided by the total number of hits) on each cDNA library. relL and relD represent the relative frequencies in the libraries prepared from light-induced and competent plasmodia, respectively. Transcripts more abundant in light-induced (above the diagonal) or in competent, not light-induced plasmodia (below the diagonal) are shown, and SwissProt orthologs are indicated for 10 contigs with relative frequencies greater than 0.005. Red dots represent those contigs which are more abundant in the light-induced plasmodial library, and black those which are more abundant in the library prepared from competent plasmodia. Detailed descriptions for these transcripts can be found on the Table S1. transcripts to 114 reference metabolic pathways, 1,904 including hit counts data, from which 770 correspond to cDNAs with higher relative frequencies in the library prepared from competent plasmodia, and 743 cDNAs in the library prepared from light-induced plasmodia respectively. In addition, 496 sequences in total were assigned to the KEGG BRITE hierarchies. Transcripts associated to the nucleotide metabolism (n = 110) and citrate cycle (n = 40) had the highest representation for the reference metabolic pathways, and the Wnt, TGFbeta and Jak-STAT signaling pathways were also depicted for the whole dataset (n = 49, 42 and 32 matches respectively). In the whole dataset we identified 420 cDNAs with potential roles in cell differentiation, with molecular entities associated to kinases (n = 140) and GTP binding (n = 110) having the highest representation in the BRITE hierarchies. In addition, we mapped 1,159 total enzyme commission (EC) numbers (418 unique) with 380 unique enzyme names in 851 transcripts, using the EC-module of BLAST2GO [16]. Later, in order to assess the global metabolic changes that occur after light induction, transcripts with KO identifiers were mapped using the KEGG Atlas tool [21]. For transcripts with higher relative frequencies in the competent plasmodia library, we mapped enzymes for the lipid biosynthesis (map00061) and oxidative phosphorylation (map00190) pathways. Conversely, we identified enzymes for the N-glycan biosynthesis (map00510), urea cycle (map00220) and fatty acid metabolism (map00071) pathways in transcripts with higher relative frequencies in the light-induced plasmodial library ( Figure 2). In the end, we obtained 2,567 contigs annotated for GO terms, KEGG orthologs, and InterPro hits together [Additional File 7]. A summary of sequencing annotations and statistics is listed on the Table 1.

Inference of Interaction Networks
In order to identify the functional relationships between the annotated cDNAs, we searched for known interactions in the literature. First, we used the cDNAs that were previously clustered according to their relative frequencies (Figure 1; [Additional File 5: Table  S1]), and included additional proteins whose interactions have been observed in the literature for Physarum. Using the "guilt by association" heuristic to link coexpressed transcripts into functional groups [22,23], we inferred an interaction network between those transcripts. This network is primarily based on actin-binding activities ( Figure 3).
Later, to identify genes with similar regulation, we listed those transcripts with highest rates of relative frequencies, counted in both cDNA libraries ([Additional File 8: Table S3]; [Additional File 9: Table S4]). As most of these highly differentially regulated transcripts did not show any sequence similarity to previously annotated genes, we clustered the subset of cDNAs with similarity to annotated genes according to two parameters: (i) their rate of relative frequencies; and (ii) their statistical significance of differential expression [9]. In this way we listed those 20 transcripts with annotations that were most up-or most downregulated in light-induced plasmodia, based on the statistical significance of their differential expression (P < 0.05; Tables 2  and 3). Despite the apparent diversity in biochemical functions, we searched for known interactions between these two groups of transcripts. From annotations of the top up-and down-regulated transcripts (Tables 2 and 3), and including the transcripts from the above mentioned analysis (Figure 3), we extended the initial putative network using Cytoprophet [24]. This Cytoscape [25] plugin predicts networks based on information from interaction databases, associated to SwissProt matches of newly annotated genes [26]. Accordingly, we found that most of these genes encoded proteins predicted to interact in a network of actin-binding proteins (coaA, ABP120, actobindin, FRGP, AFK, PROP; Figure 4). These genes encoding proteins orthologs of which are associated to cell division (MEI2, PUM2, CDC16), DNA repair (POLB, FEN1), signal transduction (PP2C, CDC16), and calcium-binding (LAV1-2, KCNIP2, GAD) are downregulated in light-induced plasmodia ( Table 2; [Additional File 5: Table S1]). In turn, a different group of developmentally regulated genes is preferentially expressed after photoinduction, including genes the products of which are involved in signaling (DCR2, RGS2, YPTC6, pakA), protein processing (FKBP70, sequestosome-1, PSMA7, RR7), cell integrity (WSC1, CDC31), calcium-binding (MLR1, TRHY, PAT1), and developmentally regulated actin-binding, such as the elongation factor 1α (EF1A), spire, and actophorin (Table 3; [Additional File 5: Table  S1]; Figures 3 and 4).

Discussion
The development of plasmodia competent for sporulation includes growth arrest, condensation of cellular constituents, and mitosis [27]. Sporulation of competent plasmodia can be triggered by a light pulse. Some proteins associated with the light-induced pathways that lead to sporulation have been described [2][3][4], suggesting that several signaling mechanisms are involved, but there are no studies that describe changes at the level of the whole transcriptome. In the present study we identified the most up-and downregulated transcripts, which are associated to a network of putative interactions (Figure 4). The network is hypothetical as interactions used for inference are based on data obtained from different organisms. For the sake of simplicity, the discussion will Figure 2 Metabolic Atlas of Physarum polycephalum. All P. polycephalum cDNAs (Refs [5,6] and our results) were sent for KEGG Ortholog [19] (KO) prediction using the KAAS server [20]. The output list of orthologs was used to plot this atlas with the KEGG mapping tool [21]. Nodes represent metabolites and edges (lines) correspond to enzymatic reactions. Colors are assigned to either down-(light blue) or upregulated (green) transcripts. Transcripts with equivalent relative frequencies in both novel cDNA libraries (relL/relD = 1) are also depicted (yellow); white represent those cDNAs with no expression data. After photoinduction, most enzymes from the N-glycan biosynthesis (A) and the urea cycle (D) pathways are upregulated. In contrast, cDNAs mapped to the oxidative phosphorylation (C) had higher relative frequencies in competent plasmodia, whereas a change from fatty acid synthesis to degradation is seen after photoinduction (B). be focused on genes with predicted significant interactions (P > 0.9).

A network of actin-binding proteins is associated to changes during light-induced sporulation in Physarum
The actin cytoskeleton of Physarum is essential for locomotion, division, and other biological processes [27]. Assembly and disassembly of actin filaments is controlled by a group of actin-binding proteins, whose activities in turn are regulated by specific signaling pathways. Physarum cell types differ in actin organization but express the same actin genes, suggesting that changes in actin-binding proteins are responsible for the differences in actin organization [28]. Physarum possesses several classes of actin-binding proteins, and most of these proteins display cell type-specific patterns of expression, but their precise roles are not known [29,30]. Nevertheless, expression changes in genes coding for actin-binding proteins correlate with modifications in cell organization and behavior [27]. In the present study, we found that some actin-binding genes were linked specifically to stages before and after photoinduction in the starved Physarum plasmodium.
Transcripts linked to cell division and DNA repair are downregulated in the light-induced plasmodium After several days of starvation, cell processes must be limited in order to save energy. Coordination of several biological processes is then required, and thus regulation of these phenomena needs a pleiotropic transducer like the cAMP, which targets several signaling pathways, including those that limit cell proliferation [37]. Cell differentiation pathways regulated by cAMP levels have been described in Dictyostelium, a closely related protist [38]. For Physarum, we found that MEI2, a transcript controlled via cAMP levels, is downregulated in the light-induced plasmodium (Figure 3; [Additional File 5: Table S1]). MEI2 is an RNA-binding protein that encodes a cAMP-regulated positive regulator of meiosis in the yeast S.pombe [39,40]. This gene product is functionally related to the actin cytoskeleton via the cAMPdependent protein kinase A (PKA) [37,38]. Other transcripts downregulated in light-induced plasmodia associated to cell division and DNA repair comprised FEN1, CDC16 and PUM2. First, the Flap endonuclease 1 (FEN1) appears in several processes linked to the maintenance of the genome integrity, such as the UVinduced DNA repair [41], as well as in DNA replication and DNA recombination [42]. Second, the yeast cell division control protein 16 (CDC16), constitutes the catalytic subunit of the spg1p GTPase-activating protein, that is involved in the signal transduction controlling septum formation. CDC16 is involved in cytokinesis [43]  and is essential for proliferation, as spores lacking a functional CDC16 gene complete mitosis without undergoing cell cleavage [44]. Finally, PUM2 (Pumilio 2) encodes a RNA-binding protein associated to the control of meiosis during development [45]. Consequently, starvation seems to be the signal that regulates cell division while protecting the cells from oxidative stress, through cAMP-regulated pathways (Figure 3). Other downregulated transcripts in the light-induced plasmodium comprised orthologs of transducers, such as FBL5, a leucine-repeat protein linked to phosphorylation-dependent ubiquitination [46], PARP1, an Oryza poly ADP-ribose polymerase, a phospholipase D from Phytophtora (PLD1), and the Arabidopsis phosphatase 2C (PP2C, also known as Poltergeist). In plants, G-proteins are involved in phospholipase D activation, and this also seems to be the case for Phytophtora [47]; on the other hand, PP2C operates in several signaling pathways that regulate stem cell differentiation [48]. It is then reasonable to consider that the differential expression of these transducers is also associated with the control of signaling mechanisms for differentiation, but more profound studies are needed to establish precise causal relationships.

Calcium-binding proteins exhibit diverse regulation patterns in the light-induced plasmodium
Transcripts identified as calcium-binding proteins displayed different patterns of expression regulation. These were either down-(LAV1-2, KCNIP2 and GAD) or upregulated (MLR1, TRHY, and PAT1) after light induction. LAV1-2 is a plasmodium-specific RNA of unknown function that encodes a protein containing an EF-hand type domain whose calcium-binding activity has been observed in vitro in Physarum [49]. LAV1-2 seems to act as a sensor of cell damage, releasing Ca 2+ Figure 3 Interactions with the Actin Cytoskeleton of Transcripts with Higher Relative Frequencies. The network was hypothesized from interaction data reported in the literature, using transcripts previously clustered according to their relative frequencies ( Figure 1 and Table S1). The transcripts shown are a subset of those from Figure 1, except for certain gene products (FRGP, AFK, and PROP) which were also included as their interactions have been previously observed in Physarum. cDNAs are displayed in colors corresponding to their expression status: down-(black) or up-regulated (red) upon photoinduction, as separated by the dotted vertical gray reference line. Each contig is shown with its hit number counts in both libraries (D: competent plasmodia, L: light-induced plasmodia). Transcripts with unambiguous annotations, significant differential expression (P < 0.05), and that possess the highest levels of downregulation (relD/relL > 1.0), are listed. BLAST2GO [15] automatic annotations were used, and manual corrections of annotations were included in some cases. A list of transcripts with unambiguous annotations, significant differential expression (P < 0.05) with the highest levels of upregulation (relL/relD > 1.0), is shown. Annotations, SwissProt accessions, hit counts, and probability values follow the same convention as in Table 2.
that leads to the activation of a plasmodium-specific transglutaminase, which separates damaged areas of a plasmodium [50]. Other transcripts encoding orthologs of calcium-binding proteins, such as KCNIP2 and GAD, were also downregulated in the photoinduced plasmodium and have not been previously described for in Physarum. KCNIP2 encodes a potassium channelinteracting protein that probably modulates channels density in a Ca 2+ -dependent manner. In turn, the activation of glutamate decarboxylase (GAD) by calciumbound calmodulin (CaM) is required for normal growth in plants [51]. Previous studies have shown that the intracellular increase of calcium levels is correlated with increased concentrations of cAMP and with sporulation and differentiation in both Physarum and Dictyostelium [52,53]. Moreover, actin filament crosslinking is affected by changes in intracellular calcium levels, which ultimately influences the cell contractility [54]. Therefore it seems possible that these calcium-binding proteins coordinate the Ca 2+ release as a means to influence the cell contractility through the interaction with the actin cytoskeleton ( Figure 4).
Furthermore, the upregulated subset of calcium-binding proteins included MLR1, which inhibits cytokinesis in yeasts; trychohyalin (TRHY), which is involved in its own calcium-dependent processing during differentiation; and the Dictyostelium PAT1 ATPase. PAT1 is localized in the membrane of contractile vacuoles, and is a component of a calcium sequestration and excretion pathway, which functions to help maintain homeostasis, especially under conditions of Ca 2+ stress [55]. Thus these are candidates to control the intracellular calcium levels after light induction of starved plasmodia.
Actin-binding proteins associated to development are overexpressed in the light-induced plasmodium After photoinduction, a group of actin-binding proteins is upregulated including the elongation factor 1α (EF1A), Spire, and actophorin (Figures 3 and 4; Table 3; [Additional File 5: Table S1]). Spire is a Drosophila gene involved in development through actin assembly. This gene is also widely distributed across the metazoan genomes. Spire mammalian isoforms are MAP kinase substrates, and data suggest that Spire evolved as an Figure 4 Interaction of the Most Upregulated and Downregulated Transcripts with the Actin Cytoskeleton. The conceptual network was predicted using the Cytoprophet module of Cytoscape [24][25][26], and therefore is solely based on information included on specialized interaction databases. Input transcripts included those from the top up-and down-regulated transcripts (Tables 2 and 3), and cDNAs taken from the previous interaction network (Figure 3). A significant probability of interaction (P-value > 0.9) is indicated as a thick edge. Node colors follow the same convention as in Figure 3, and hit count data can be found in Tables 2 and 3. This network includes 64 interactions (33 with P > 0.9) between 38 gene products. Genes without Cytoprophet-predicted interactions are not included, except for two interactions with Actin-P that were not predicted by Cytoprophet but that can be found in the literature (indicated with arrows). Interestingly, the previously featured network (Figure 3) connects the two groups of up-and downregulated transcripts in this figure. However, as Cytoprophet gathers experimental interaction data from specialized databases, some interactions depicted in Figure 3 are not shown (for example between POLB and ACTINP), because this data is not present on those source databases used by Cytoprophet for prediction.
alternative independent mechanism of actin polymerization, necessary for cell polarization in multicellular organisms [56]. Actophorin, in turn, binds actin monomers and separates actin filaments in a dose-dependent manner. Phosphorylation of actophorin blocks actin binding [57]. In turn, EF1A, aside from its role in the protein synthesis, has a separate conserved actin-binding activity in eukaryota, initially observed in Dictyostelium [58], where it is predominantly found in actin-bound form [59]. EF1A regulates the stoichiometry of cytoskeletal components, and the conservation of the EF1A-actin interaction across eukaryotes suggests its importance for cytoskeletal maintenance [60]. Overexpression of EF1A in yeast results in effects on cell growth, and influences the actin distribution, morphology and budding in a dosage-dependent manner, although this increase of EF1A has no effect over the protein synthesis [61]. In addition, changes in cytoskeletal redistribution of EF1A seem to be linked to the differentiation status, where the association between EF1A and microtubules gradually increases in differentiating cultures [62]. Furthermore, EF1A stimulates actin remodeling and induces the formation of filopodia [63], and possibly connects these processes with signaling pathways [64].
We noticed that two coexpressed transcripts (the cysteine proteinase CYSP2 and the developmentally regulated gene CudA) are related to EF1A. First, cysteine proteinases are believed to participate in protein cleavage during the differentiation of Dictyostelium as a response to starvation [65], and these peptidases were copurified with EF1A in yeasts [66]. CudA, on the other hand, is associated to the transition from slug migration to culmination in Dictyostelium. CudA expression levels depend on local cAMP concentration [67]. Recent evidences show that CudA contains a novel DNA-binding site that is distantly related to the metazoan STAT domains, which participate in the regulation of developmentally controlled genes [68], and whose orthologs coexpress with EF1A [69]. Yamada et al. [68] also proved a relationship between Dictyostelium CudA and a cDNA from Physarum, which corresponds to the contig reported here as a CudA ortholog. For these reasons, EF1A could work as a link between regulation of the protein synthesis, cytoskeletal maintenance, and signal transduction in slime molds (Figure 3).
Other developmentally regulated transcripts associated to the actin cytoskeleton included the cell wall integrity and stress response component (WSC1), which is a yeast membrane protein that acts as a sensor of cell wall damage [70], and CDC31, a constituent of the nuclear pore complex that is also involved in the maintenance of cell morphology (Table 3 and Figure 4). WSC1 is essential to keep the cell integrity, behaving like a stress-specific signal transducer that is involved in the reorganization of the actin cytoskeleton in response to osmotic shock [71,72]. WSC1 is involved in the depolarization of the actin cytoskeleton [72], and, like CDC16 (downregulated in light-induced plasmodia), is entailed in cytokinesis [73].
GTP signaling genes involved in different processes are upregulated in the light-induced plasmodium Orthologs of certain genes highly upregulated in lightinduced plasmodia are involved in signal transduction. These include transcripts linked to the GTP signaling (AGD8, YPTC6, RGS2), kinases (pakA) and phosphatases (DCR2). The serine/threonine-kinase pakA is a regulator of the myosin component of the cytoskeleton, required for cytokinesis and the regulation of the cytoskeleton during chemotaxis in Dictyostelium [74]. In turn, the yeast dosage-dependent cell cycle regulator 2 (DCR2), is a phosphatase whose increased dosage alters cell cycle progression, while its loss delays the progression in the G1 phase [75]. In addition, upregulated GTP signaling transducers included a putative GTPase-activating protein from Arabidopsis (AGD8); a Chlamydomonas GTPbinding protein (YPTC6); and RGS2, which acts as a negative regulator of G-protein signaling, a function that is evolutionarily conserved in yeast, C. elegans and mammals. Increased RGS2 expression is primarily mediated by the cAMP/PKA pathway [76], therefore it is possible that RGS2 is carrying out similar tasks in slime molds, where it could work in coordination with the other transducers, as hypothesized in Figure 4.

Transcripts annotated for cell death are overrepresented in the light-induced plasmodium
Comparison of GO terms between up-and downregulated groups showed that transcripts annotated for 'cell development' (GO:0048468), 'cell death' (GO:0008219) and 'death' (GO:0016265) were overrepresented in the upregulated group [Additional File 6: Table S2]. However, all these ontologies belong to the same hierarchy, meaning that 'cell death' can be the product of either development or organismal death, and hence 'cell death' is the only difference between both expression groups. One of these cDNAs annotated for 'cell death' is Sequestosome 1, which is also included on the list of upregulated transcripts (Table 3). Sequestosome 1, also known as p62, is a multifunctional protein that targets polyubiquitinated proteins to degradation by proteasomes and autophagy [77]. p62 knockouts significantly increased cell death [78], and this is probably linked to the interaction with atypical protein kinase C isoforms that are involved in pathways that control differentiation and apoptosis [79]. Therefore it is likely that this gene product regulates cell death pathways linked to the commitment for sporulation. Furthermore, other highly upregulated genes are also functionally linked to the protein turnover. These include the FKBP70 rotamase, which accelerates the folding of proteins during synthesis; the PSMA7 proteasome subunit, which together with the other subunits, suffer changes during the meiotic cell cycle [80]; and the endosome-lysosome vesicle traffic-related RR7 [81]. It is likely then that these gene products, together with Sequestosome 1, are linked to the control of differentiation through post-transcriptional regulation.

Conclusions
The gain of sporulation-competence of Physarum plasmodia involves growth arrest, condensation of constituents, and mitosis and is a prerequisite before sporulation can be induced by light [27]. Physarum gene expression has been shown to be cell type-specific, but existing studies have been focused only on individual genes [2][3][4]. Previously, we reported a library of 5,856 sequences obtained from plasmodia competent for the induction of sporulation [6]. In the present study we used the massive parallel sequencing technology at the level of the whole transcriptome [7,8] in order to identify global changes in expression that occur during light-induced sporulation of Physarum. We integrated the differentially expressed cDNAs into networks using interaction information from orthologs and the literature. We found that after light induction of a plasmodium the expression of transcripts linked to cell division and DNA repair is downregulated. In contrast, lightinduction stimulated the expression of genes associated with protein turnover (proteases and proteasome transcripts), genes related to cell cycle progression, and genes involved in the maintenance of cell integrity and cytokinesis. These latter gene products might protect the cell against osmotic shock. Additionally, different groups of calcium-binding proteins are either down-or upregulated after light exposure. These gene products are candidates to control the intracellular calcium levels during sporulation. We postulate that these changes are associated with a network of actin-binding proteins (Figures 3 and 4), the components of which are differentially regulated upon plasmodial photoinduction. It seems that these gene products accomplish different tasks in each stage: the reorganization of the subcellular compartments in order to inhibit migration during starvation on one hand, and cell polarization and cytoskeletal redistribution after photoinduction mediated by a group of actin-binding proteins on the other. We expect that the precise representation of the proposed interaction networks may become available as gene knockout experiments, proteomic data, and comparative interactomics are integrated in future studies of this organism.

Culture and light-induction of plasmodial cells
Physarum plasmodia of the white strain (LU897 × LU898) were hatched from spherules, and grown as microplasmodial suspensions for four days. The plasmodial mass was then applied to starvation agar plates. Microplasmodia spontaneously fused to give a single plasmodium on each plate. Plasmodia were then starved for six days in the dark at 22°C to obtain maximal competence for sporulation. To verify the sporulation-competent state, plasmodia were cut into two halves. One half was immediately frozen in liquid nitrogen for RNA extraction, and the other half was returned to the dark and incubated until the next day to verify that the plasmodium had not been induced to sporulation. To obtain light-induced plasmodia, competent plasmodia were irradiated for 30 min with far red light and then returned to the dark. Six hours after the start of irradiation, plasmodia were cut into two halves. One half was frozen in liquid nitrogen for RNA extraction. The other half was returned to the dark and incubated until the next day to verify the sporulation status [3,4].

cDNA Library Construction and Sequencing
Transcript poly(A) + RNAs were isolated by oligo-dT chromatography. cDNAs were prepared from these RNAs by the full-length enriched synthesis method (vertis Biotechnologie, Freising-Weihenstephan, Germany). First strand cDNA was synthesized using oligo(dT) adapter primers and MMLV H-reverse transcriptase. Following RNA hydrolysis, an adapter primer was annealed to the 3' end, and the produced fragments were PCR-amplified for 22 cycles with a proofreading enzyme. The cDNA libraries were then directly sequenced using the 454 GS FLX system (Roche Diagnostics, Mannheim, Germany) [7]. Chromatograms were scored for quality, and the produced sequences were trimmed of adapter sequences, and coassembled into contigs using previously available transcriptomic data [6]. For expression comparisons we obtained for each contig: (i) the number of reads (which we define as "hit counts") in both libraries; (ii), their relative frequencies (reads of a given contig divided by the total number of reads); and (iii) their relative frequencies. Statistical significance between the two hit counts for each contig species was then assessed [9].

Sequence Annotation and Network Inference
Similarity searches against protein databases were performed using BLASTX implemented in the MIGenAS tool [82] (e-value 1 × 10-3). We utilized nine protein databases in this comparison: Swiss-Prot and TrEMBL (versions 56.3 and 39.3) [12], dictyBase [14] and RefSeq database subsets: mammalian, other vertebrates, invertebrate, protozoa, plant and microbial (release 31) [13]. Functional annotation was carried out using BLAS-T2GO (version 2.2.3) [16]. This procedure consisted of a similarity search against the non-redundant GenBank database [10], using BLASTX (e-value 1 × 10-3), followed by Gene Ontology (GO) [15] mappings extracted from similarity results and InterPro domain matches (InterPro release 18.0) [17]. Annotation of sequences (cutoff value 1 × 10-6) was followed by their validation, and these annotations were extended using ANNEX [83]. Statistical analysis of GO annotations between differentially expressed cDNAs was carried out using the Fisher exact test, as implemented in the GOSSIP module [18] of BLAST2GO. Sequences were also categorized in metabolic and signaling pathways, via similarity search against orthologs present in the KEGG database [19] using the KAAS server [20]. In this case, we employed default parameters for ESTs. KEGG orthologs (KOs) were then plotted into the whole metabolic atlas, utilizing the KEGG mapping tool [21]. Putative networks of correlated genetic interactions were generated from annotation information, using the MLE algorithm [26], as implemented in the Cytoprophet plugin [24] of Cytoscape [25].
Additional file 1: Figure S1. Overview of the Experimental Design. A summary of experiments and computational analyses is depicted. RNA samples were taken from competent plasmodia after six days of starvation in the dark, and from competent plasmodia at six hours after exposition to a 30 minutes pulse of red light (≥ 700 nm) (1) [3]. cDNAs were synthesized from extracted RNAs (2), and sequenced and quantitated using the 454 Life Sciences platform (3) [78]. Contigs generated were then annotated at every bioinformatic level (4), and network interactions (5)  Additional file 3: Figure S2. Hits Distribution of Transcript Species. The distribution of pyrosequencing hit counts respect to the number of transcript species on each library (starvation and light-induced) is depicted on a semi-logarithmic scale. Hit counts are included in the adjacent upper ranges to the right; for example, transcripts with 2 hits are present in the 2-5 range. Similar distributions of contig species were found on both libraries, and most transcripts were represented by 1 to 5 hits only (Powerpoint file  Table S1. Annotated transcripts with relative frequencies higher than 0.005. A list of transcripts obtained from the scatterplot of relative frequencies (Figure 1) is depicted. Annotations, hit counts, and probability values follow the same convention as in  Table S2. Overrepresented Gene Ontology terms in Upregulated Transcripts. Full lists of GO terms from up-and downregulated contigs were compared against each other using the Fisher's exact test from the GOSSIP program [18], as implemented in BLAST2GO [16]. A two-tailed test with the false discovery rate (FDR) filter was employed. The number of GO-annotated transcripts used for comparison between up-(Test) and downregulated (Ref) groups of cDNAs is shown. All overrepresented GO terms belong to the biological process (BP) category (Word document  Table S3. Top 20 Transcripts Downregulated in Light-induced Plasmodia. Transcripts with the highest rates of downregulation (relD/relL > 1.0), are listed. BLAST2GO [15] automatic annotations were used, and manual corrections were included in some cases. Transcripts with unknown orthologs are described with "-NA-." Annotations, SwissProt accessions, hit counts, and probability values follow the same convention as in  Table S4. Top 20 Transcripts Upregulated in Lightinduced Plasmodia. Transcripts with the highest rates of upregulation (relL/relD > 1.0), are listed. BLAST2GO [15] automatic annotations were used, and manual corrections were included in some cases. Columns follow the same convention as in