Exploring temporal transcription regulation structure of Aspergillus fumigatus in heat shock by state space model

Background The thermotolerance of Aspergillus fumigatus plays a critical role in mammalian and avian infections. Thus, the identification of its adaptation mechanism to higher temperature is very important for an efficient anti-fungal drug development as well as fundamental understanding of its pathogenesis. We explored the temporal transcription regulation structure of this pathogenic fungus under heat shock conditions using the time series microarray data reported by Nierman et al. (Nature 2005, 438:1151-1156). Results The estimated transcription regulation structure of A. fumigatus shows that the heat shock proteins are strongly negatively associated with central metabolic pathway genes such as the tricarboxylic acid cycle (TCA cycle) and carbohydrate metabolism. It was 60 min and 120 min, respectively, after the growth temperature changes from 30°C (corresponding to environments of tropical soil) to 37°C and 48°C (corresponding to temperatures in the human body and compost, respectively) that some of genes in TCA cycle were started to be upregulated. In these points, most of heat shock proteins showed lowest expression level after heat shocks. Among the heat shock proteins, the HSP30 (AFU6G06470), a single integral plasma membrane heat shock protein, presented most active role in transcription regulation structure in both heat shock conditions of 37°C and 48°C. The metabolic genes associated with multiple genes in the gene regulation network showed a tendency to have opposite expression patterns of heat shock proteins. The role of those metabolic genes was second regulator in the coherent feed-forward loop type of regulation structure having heat shock protein as its first regulator. This type of regulation structure might be very advantageous for the thermal adaptation of A. fumigatus under heat shock because a small amount of heat shock proteins can rapidly magnify their regulation effect on target genes. However, the coherent feed-forward loop type of regulation of heat shock proteins with metabolic genes became less frequent with increasing temperature. This might be the reason for dramatic increase in the expression of heat shock proteins and the number of heat shock response genes at heat shock of 48°C. Conclusion We systemically analysed the thermal adaption mechanism of A. fumigatus by state space model with times series microarray data in terms of transcription regulation structure. We suggest for the first time that heat shock proteins might efficiently regulate metabolic genes using the coherent feed-forward loop type of regulation structure. This type of regulation structure would also be efficient for adjustment to the other stresses requiring rapid change of metabolic mode as well as thermal adaptation.


Background
Aspergillus fumigatus is both a primary and opportunistic pathogen as well as a major allergen associated with severe asthma and sinusitis [1]. The host respiratory system, including that of human, is constantly exposed to its spore because of prolific production of spores and nearly ubiquitous distribution in the environment, at a density of 1 to 100 conidia/m 3 [2]. The spores can be easily eliminated by the innate immune system from lung epithelial tissue in immunocompetent vertebrates. However, immunocompromised individuals are at risk of invasive aspergillosis as a consequence of proliferation on pulmonary or other tissues via mycelial growth. Invasive aspergillosis has an associated mortality rate ranging from 50 % to 90 % depending on the patient population [3]. The prosperity of A. fumigatus in mammalian and avian infection depends basically on its thermotolerance [4]. Therefore, the identification of the adaptation mechanism of this fungus to higher temperature is very important for efficient anti-fungal drug development as well as fundamental understanding of pathogenesis of A. fumigatus.
Nierman et al. [4] examined the gene expression change throughout a time course upon shift of growth temperatures from 30°C to 37°C and 48°C for investigation of the metabolic adaptation of A. fumigatus. They suggested that host temperature alone (37°C) is insufficient to turn on many virulence-related genes because no known genes implicated in pathogenicity showed higher expression at 37°C than that at 48°C. Lamarre et al. reported another genome-wide expression study of this fungus at 37°C with different age of conidia [5]. Their microarray data suggested that dormancy is associated with fermentation and reduced metabolism. In both studies, genome-wide expression profiles were well analysed from a simple level including identification of differentially expressed genes to some complex level including classification of genes with similar profiles, but the transcription regulation structure between genes in a system level has been never studied. Actually, the regulation of gene expression is achieved through genetic regulatory systems structured by networks of interactions between DNA, RNA, proteins, and small molecules [6]. Thus, it is important to understand how genes in A. fumigatus are transcriptionally regulated for its thermal adaptation.
Several methods have been proposed for modelling gene networks including Boolean networks [7,8], Bayesian networks [9], differential equations [10] and state space model (SSM) [11,12]. Especially, the temporal network structure of gene regulatory mechanism can be facilitated with time-course microarray data. However, a wide variety of statistical models proposed for estimation of temporal gene network structure are frequently limited because the length of the time-course data is fairly short, e.g. typically less than 10 whereas the number genes involved ranges from 10 2 to 10 4 . To overcome such a limitation, Hirose et al. [12] proposed a module-based gene network estimation which explores genetic networks of the transcriptional modules which are sets of genes sharing a common function or involved in the same path way rather than the use of gene level networks. The transcriptional modules may be considered as the groups of transcriptionally coexpressed genes. These transcriptional modules can be again mapped onto the gene-level networks. Here, we employed this module-based approach for the estimation of temporal transcription regulation structure of A. fumigatus in heat shock with microarray data consisting of very short length of time point, i.e., 6. The time series microarray data reported by Nierman et al. [4] were used in this study. We also compared the temporal transcription regulation structures in two different heat shock conditions, i.e., shift from 30°C to 37°C and 48°C, respectively.

Metabolic response of A. fumigatus to heat shock
To reduce the effect of missing values in time series microarray data of A. fumigatus in heat shock condition, the genes with over 11 % of missing values in total observations were excluded (see Methods section for detail). Thus, 4027 and 4771 genes from 9,516 genes on the array were first chosen in this study for heat shock of 37°C and to 48°C, respectively. From these two gene lists, we selected the significantly differentially expressed genes showing over two fold change in expression level and below 0.05 of p-value in t-test for at least a time point. Here, we will call these genes heat shock response genes. Finally, the number of heat shock response gene became 726 and 2200 for heat shock of 37°C and 48°C, respectively. Our study was focused on these heat shock response genes. Eighty two percent (596 genes) of heat response genes at 37°C were also observed in the set of heat response genes at 48°C. Almost half these common genes were involved in the central metabolic pathways such as carbohydrate, amino acid and lipid metabolism ( Figure 1).
For the more detail analysis, we examined the response of each metabolic pathway in each time point for these common heat response genes (see the additional file 1). The metabolic pathways showing the high sensitivity at the first time point (15 min) after heat shock were nucleotide metabolism including purine and pyrimidine and ribosome involved protein synthesis. Especially, the response genes involved in ribosome were almost down-regulated throughout the time course in both heat shocks of 37°C and 48°C. The citrate cycle (TCA cycle) genes were initially downregulated, but many of these genes were upregulated from 60 min and 120 min, respectively, after the heat shock of 37°C and 48°C. There are many metabo-The metabolic distribution of common genes (596 genes) between heat shock response genes of 37°C and 48°C Figure 1 The metabolic distribution of common genes (596 genes) between heat shock response genes of 37°C and 48°C.  lisms with positively regulated genes at 60 min after the heat shock of 37°C including glycolysis/gluconeogenesis, fructose/mannose metabolism, fatty acid metabolism, and amino acid metabolism including glycin, serine, threonine, histidine, phenylalanine, tyrosine and tryptophan. Similar trend was also observed at 120 min after the heat shock of 48°C for metabolisms including pyruvate and amino acids such as valine, leucine and isoleucine as well as TCA cycle. This might be related to the decrease of expression level of heat shock proteins other than AFU8G03930 (Hsp70) at 60 min and 120 min, respectively, after heat shock of 37°C and 48°C ( Figure 2). Considering overall metabolic response to heat shock, the The heat map of estimated temporal relationship between top-ranked 20 genes in each sub-module and temporal relationship between modules AFU3G08580 Glycine-rich RNA-binding protein, putative AFU8G05990 hypothetical protein 2(-) AFU2G00320 Sterol delta5,6-desaturase, putative AFU4G14250 hypothetical protein 2(-) AFU3G07230 Phosphopantothenate-cysteine ligase, putative AFU5G02330 Ribonuclease mitogillin precursor 2(-) AFU3G03580 Transferase family protein AFU7G05490 hypothetical protein 2(-) AFU3G10750 Acetate kinase, putative AFU5G13730 invasion associated protein p60 2(-) AFU6G07900 Rod1 protein AFU5G06240 Alcohol dehydrogenase, putative 2(-) AFU4G06890 14-alpha sterol demethylase Cyp51A AFU1G16190 Probable glycosidase crf1 precursor 2(-) AFU3G08990 unnamed protein product AFU5G03800 High-affinity ironpermease 2(-) AFU7G04730 major facilitator protein homolog, putative AFU7G05500 Glutathione S-transferase, putative The heat shock proteins with similar expression pattern are assigned to same module. The average expression profiles of all heat response genes are shown at additional file 2. heat shock of 48°C seems to bring more severe metabolic disturbance to the cells of A. fumigatus than that of 37°C.

Identification of transcriptional module in heat shock response of A. fumigatus using state space model
We first identified the potential transcriptional modules, sets of genes that are co-regulated under heat shock condition, to map them on the gene-level network by state space model (SSM) (see Methods section for detail). Each module is assumed to follow the states of a first-order Markov chain and a gene may belong to more than one module. The dimension of the state variable x n was chosen by k = 2 in this study (see Methods section). The regulation relationship between sub-modules are more clear than that between modules (the upper part of Figure 3). We selected top-ranked 15 genes from all sub-modules including M 1+ , M 1-, M 2+ and M 2- (Table 1). The plus (+) and minus (-) module in each module usually have opposite expression pattern for each other. In heat shock of 37°C, four heat shock proteins including AFU6G06470 (Hsp30), AFU5G07340 (Hsp40), AFU3G14540 (Hsp30/ Hsp42) and AFU1G15270 (Hsp98/Hsp104) appeared in M 1+ while the other heat shock protein AFU8G03930 (Hsp70) was in M 1-. This assignment of heat shock proteins to each sub-module shows a good agreement with their expression patterns ( Figure 2). The expression pattern of AFU8G03930 (Hsp70) is unusual compared with those of other Hsp70s including AFU1G07440 (Hsp70) and AFU2G04620 (Hsp70) (see the additional file 2 for expression profiles of AFU1G07440 and AFU2G04620). This suggests that the AFU8G03930 might have different function from general Hsp70 which plays an important role in normalizing proteins degenerated by stress, and in maintaining the physiological functions of cells.
The regulation relationships among modules are shown in the lower part of Figure 3. The number of above arrow represents the strength of regulation while the positive and negative value indicates positive and negative regulation, respectively. Module M1 containing heat shock proteins shows strong positive association with module M2 including many pathway genes in heat shock of 37°C while the module M2 including many pathway genes in heat shock of 48°C represents the strong positive self-regulation. This indicates that the regulation of metabolic gene by heat shock proteins might be stronger at heat shock of 37°C than at heat shock of 48°C.

Transcription regulation structure of heat shock response genes in A. fumigatus
We estimated the transcription regulation structure by mapping the identified transcriptional modules in the above section onto the gene-level networks via the first order vector autoregression form (see Methods section for detail). The edge density in the network of heat shock 37°C is always higher than that of 48°C over the threshold range surveyed ( Figure 4). The reduction of edge density in higher temperature was reported by Takemoto et al. [13]. They have suggested that with increasing temperature the metabolic networks undergo a change from heterogeneous and high-modular structures to homogeneous and low-modular structures like random networks. This suggests that the cells of A. fumigatus in the heat shock of 48°C might have less dense metabolic network than those in heat shock of 37°C. In our model, each element of the autoregressive coefficient matrix stands for interaction strength of corresponding gene pair (see Methods section for detail). Thus, the higher value it has, the stronger interaction the gene pair has. We can construct a sparse biological network by choosing a threshold. In this study, we have chosen a threshold of 0.015 from the region showing almost constant edge density in both heat shock conditions of 37°C and 48°C (Figure 4). Therefore, we obtained the two networks, one is composed of 418 edges and 99 genes at heat shock of 37°C and another consists of 308 edges and 132 genes at the heat shock of 48°C ( Figure 5). The gene information corresponding to each node is shown at additional file 3. The red and blue arrows represent positive and negative regulation, respectively. The direction of arrow indicates the regulation direction, i.e., from regulator gene to regulated or target gene. Except for heat shock protein, the gene with more than twenty edges is called hub gene in this study which is indicated by grey and green node, respectively, in the networks of 37°C and 48°C. Three The dependence of edge density on threshold in networks of 37°C and 48°C Figure 4 The dependence of edge density on threshold in networks of 37°C and 48°C.
The visualization of estimated transcriptional network at the threshold of 0.015  Hsp78) is observed only at network of 48°C. The gene number associated with the 1808, a single integral plasma membrane heat shock protein, dramatically increased at network of 48°C. This suggests that the 1808 might have more critical role in the high temperature than in low temperature.
From the network, we can estimate possible regulation between heat shock response genes. For example, the regulation of 561 (AFU2G04010, α-α trehalose phosphate synthase subunit) by 1808 in network of 48°C shows the possible regulation of trehalose production during heat shock condition. Trehalose is known to accumulate at high levels to stress conditions [14]. The positive regula-tion of trehalose metabolism by 1808 shows a good agreement with upregulation of AFU3G12100 (trehalose synthase) during time course at both heat shock conditions (see additional file 2). Many metabolic genes are targeted by heat shock proteins ( Table 2). It is interesting to notice that some of metabolic genes are playing the role as key regulator such as heat shock protein 1808 ( Figure 5). Most of these metabolic genes represent clear contrast in their expression patterns with heat shock proteins and play role as hub nodes in the network ( Figure 6). In terms of regulation, these metabolic genes seem to help the regulation of heat shock proteins. That is, the metabolic genes such as 1933 (AFU6G10650), 1934 (AFU6G10660), 2038 (AFU7G00170) and 2039 (AFU7G00200) tend to positively regulate the genes negatively regulated by heat shock protein 1808 at network of 37°C ( Figure 5A). The positive regulation of these meta-  The underlined genes are metabolic genes and the signs '+' and '-' in the parenthesis represent positive and negative associations with heat shock protein, respectively.
bolic genes towards target genes brings out the same effect as negative regulation by heat shock proteins towards the same target genes. This is mainly due to the opposite expression pattern between the metabolic gene and heat shock protein 1808. This coherent type of regulation could make it possible to maximize the regulation efficiency of heat shock proteins. The metabolic genes with similar function also appeared at the network of 48°C including 2312 (AFU8G06350) and 2078 (AFU7G01930) ( Figure 5B).
To more clarification of the regulation structure between heat shock proteins and hub metabolic genes, we extracted a small network including heat shock proteins, hub metabolic genes and transcription factors from network of Figure 5. The heat shock protein 1808 and a metabolic gene 2038 show coherent feed-forward loop (FFL) type of regulation against metabolic gene 1933 ( Figure  7A). The FFL is composed of first regulator that regulates a second regulator, and both regulators regulate the same target. In this network, the 1808 negatively regulates both 1933 and 2036 where 2036 again positively regulates 1933. The indirect regulation of 1808 towards 1933 through 2036 gives finally the same effect of negative reg-ulation by 1808 towards 1933. Thus, the expression of 1933 will be easily downregulated by both regulations of 1808 and 2036. The coherent FFL type of regulation is also shown in the network of 48°C ( Figure 7B). There are four types of coherent FFL in network including 3 nodes (Figure 8). The network in Figure 7 shows that the heat shock protein 1808 employs the type 2 of coherent FFL which can act as a response accelerator for the regulation of target genes. This indicates that heat shock protein rapidly and efficiently regulates target genes by the type 2 of coherent FFL with hub metabolic genes. Considering a FFL including first regulator 1808, second regulator 2036 and target 1933 ( Figure 7A), the second regulator 2036 shows positive auto-regulation. A positive auto-regulation gives slow response time in expression change, but the state of once changed expression can be maintained, even after its regulator has vanished [15]. This type of positive auto-regulation might help the cells of A. fumigatus to keep the heat shock mode for a time after removal of heat shock.
The regulation of transcription factor 723 (AFU3G02000, cutinase transcription factor 1β) at network of 37°C towards other genes may be result of indirect regulation The average expression profiles of hub nodes negatively regulated by heat shock proteins transcription factor (oefC)), 1060 (AFU3G14550, RNA polymerase II transcription factor B subunit 5), 1208 (AFU4G09710, C6 sexual development transcription factor (NosA)) and 1957 (AFU6G12160, C6 transcription factor) ( Figure 5B). 909 and 1208 were downregulated while 1060 and 1957 were strongly upregulated during time course at the heat shock of 48°C (see additional file 2). The relationship of these transcription factors with other metabolic genes might be the reflection of indirect interaction rather than direct regulation through other intermediate genes hidden. The expression patterns of the C6 sexual development transcription factor 1208 and heat shock protein 1808 show clear contrast. This suggests that the positive regulation by 1208 towards a target gene brings the same effect as the negative regulation by 1808 towards the same target gene. In fact, half of metabolic genes negatively regulated by 1808 are positively regulated by 1208 ( Figure 7B).
The schematic diagram for pre-processing of microarray data Figure 9 The schematic diagram for pre-processing of microarray data. Types of three-node coherent feed-forward loop Figure 8 Types of three-node coherent feed-forward loop. Arrow and symbol denote positive and negative regulation, respectively. Type 1  Type 2   Type 3  Type 4 The absence of heat shock transcription factor in the networks is due to its lower interaction value with other genes than threshold. Both heat shock conditions, the heat shock transcription factor (AFU5G01900) showed upregulation like most of heat shock proteins during time course (see additional file 2). The metabolic genes in two extracted networks of 37°C and 48°C are different, but the FFL type 2 of regulation structure including heat shock proteins and metabolic genes is well-conserved in both networks. This suggests that the FFL type 2 of regulation structure is important network motif in the thermal adaptation process. The frequency of this network motif decreased with increasing temperature. That is, the FFL type 2 of regulation structure was more frequent at network of 37°C than at network of 48°C. This indicates that the regulation efficiency of heat shock proteins would be reduced at heat shock of 48°C, which might be related to dramatic increase in the expression level of heat shock proteins and the number of heat shock response genes at heat shock of 48°C.

Discussion
It is widely accepted that biological systems are composed of functional modules with specific combinations of genes and proteins responsible for different biological functions [16,17]. The module can be considered as the temporal activity of a group of genes/proteins that controls a specific function in different environmental conditions. In this study, we identified the transcriptional module of heat response genes of A. fumigatus using SSM with the state dimension of k = 2. The genes with similar expression profile are successfully assigned to the same module. The estimated gene regulation structure based on identified module showed clearly the negative relationship between heat shock proteins and metabolic genes. Our analysis suggests that the metabolic genes as well as heat shock proteins might be used as key regulator in the thermal adaptation process of A. fumigatus. Especially, the metabolic genes with opposite expression pattern of heat shock proteins have a propensity to be involved in coherent FFL type 2 of regulation with heat shock proteins The BIC curves for the model-based clustering of microarray data including heat response genes VEI: diagonal, varying volume, equal shape; EVI: diagonal, equal volume, varying shape; VVI: diagonal, varying volume and shape; EEE: ellipsoidal, equal volume, shape, and orientation; EEV: ellipsoidal, equal volume and equal shape; VEV: ellipsoidal, equal shape; VVV: ellipsoidal, varying volume, shape, and orientation. in the transcriptional network ( Figure 5, Figure 6 and Figure 7).
The coherent FFL type 2 of regulation could make it possible to rapidly and efficiently adapt the cells of A. fumigatus to the change of temperature. For example, small amount of heat shock proteins can efficiently regulate the target genes. However, this type of regulation at heat shock of 48°C seems to be weak due to the reduction of metabolic genes showing coherent regulation with heat shock proteins. This might be related to dramatic increase in the amount of heat shock proteins and the number of heat shock response genes at heat shock of 48°C. Takemoto et al. [13] surveyed the dependence of the structure of metabolic networks in prokaryotes on their growth temperature. They found that the edge density becomes small, the clustering-coefficient-based modularity of the networks become small, and the frequency of recurring sub-graphs decays with increasing growth temperature. Although these findings are concerned with metabolic network, they show a good agreement with our results in terms of network structure. Besides edge density, the decrease of coherent FFL type 2 of regulation including heat shock protein at 48°C could be related to the fact that the frequency of recurring sub-graphs or network motifs decays with increasing temperature. Thus, the thermal adaptation process of A. fumigatus could be considered as change of network structure such as transition from metabolic network of mesophile into that of thermophile. This suggests that the network of 48°C might be optimal in the temperature for the survival of A. fumigatus even if it looks like less efficiently than the network of 37°C. Especially, in heat shock of 48°C many pathogenic genes such as AFU4G09580 (major allergen Asp F2) and AFU6G02280 (Asp F3) are appeared as heat response genes and these genes are upregulated at all time points except for 120 min (see additional file 2). This indicates that the cells of A. fumigatus at heat shock of 48°C might be more pathogenic than those at heat shock of 37°C.
Our results suggest for the first time that the thermal tolerance of A. fumigatus might be due to the efficient regulation of metabolic genes by heat shock proteins taking coherent FFL type 2 of regulation structure. We analysed the network at a fixed threshold of 0.015, but this type of regulation was well conserved at various thresholds. Thus, the network analysed in this study would be sufficient for the capturing transcriptional regulation characteristic of A. fumigatus in the thermal adaptation process.

Conclusion
The understanding of thermal adaptation mechanism in the opportunistic pathogen A. fumigatus is important in fungal infectiology. We estimated the transcriptional regulation structure of A. fumigatus under heat shock condi-tions using time series microarray data with very short length of time point, i.e., 6. We propose a thermal adaptation mechanism of A. fumigatus through heat shock proteins, which might efficiently regulate their target genes using the coherent FFL type 2 of regulation structure. This type of regulation structure would also be efficient cellular strategy for other stressful conditions requiring rapid change of metabolic mode.  (Figure 9). For the successful application of SSM to this short timeseries microarray data, we utilized all the replicates on the array for the parameter estimation of SSM. Thus, total 36 observations were obtained for each gene (triplicate on each array by two array by six time point) and the genes with more than 4 missing values were excluded ( Figure 9). We selected the significantly differentially expressed genes showing over two fold change in expression level and below 0.05 of p-value in t-test for at least a time point, called heat shock response genes here. Finally, the number of heat shock response gene became 726 and 2200 for heat shock of 37°C and 48°C, respectively. The regulation structure of these heat shock response genes was estimated by our SSM. The assignment of A. fumigatus genes to KEGG metabolic pathways was done using the Bioconductor package KEGGSOAP http://www.biocon ductor.org. To visualize the estimated network, we used Cytoscape 2.6.0 http://cytoscape.org.

State space model
The SSM employed in this study has been reported in our previous paper [12]. Here, we give brief description of this model. Let y n ∈ R p , n ∈ N obs ⊆ N be a series of vectors con-  Figure 10 and additional file 4). This supports our choice of k = 2.
We consider a linear-Gaussian SSM: where F ∈ R k × k is the state transition matrix, H ∈ R p × k is the observation matrix, v n ~ N k (0 k , Q) and w n ~ N p (0 p , R) are the system noise and the observation noise, respectively. The initial state vector x 0 is assumed to be a Gaussian random vector with mean vector μ 0 and covariance matrix Σ 0 . For the inference of gene regulatory system underlying the observation data, the parameters θ = {H, F, Q, R, μ 0 } ∈ Θ were estimated by EM algorithm with a fixed k. For the identifiability of the estimated model, we used following constraints on the parameters: Imposement of an arbitrary sign on the elements of the first row of H.
Thus, the parameters are reduced to θ = {H, F, R, μ 0 }. The estimation of these parameters is limited when the length of time series is very short, e.g., less than 10. This limitation can be overcome if replicates of time-course gene expression profiles are available. The dataset in each heat shock condition consists of six time points with six replicates (see pre-processing of time series microarray data in Methods section). Thus, we incorporate all replicates into parameter estimation. For this, we assume that each of the replicated time-courses is independently and identically distributed according to where and represent the gene expression vector which is measured by the l-th replicate and the corresponding state vector at time n, respectively. The total number of replicates is denoted by m (l = 1,...,m). Given this generative model, the parameter estimation amounts to maximizing the likelihood function l(θ ) over θ: where and . The maximum likelihood is calculated by modified EM algorithm [12]. After parameter estimation, gene expression vectors can be mapped onto the state space R k with the projection matrix D ∈ R k × p by transforming the observation model (equation 4) under above constraints as follows: where the projection matrix is parameterized as If the state dimension k is less than p, the dimensionality of the noise-removed gene expression vectors is reduced by the semi-orthogonal projection matrix D.
During the parameter estimation process, the reducedrank data (equation 6) are possibly constructed such that they are likely follow first-order Markov process of equation 3. This process automatically discovers k modules of genes that are relevant to the temporal structure of gene expressions. We can choose groups of genes, i.e., modules using the projection matrix D of which element d ij represents the contribution of the jth gene to the ith coordinate of the state variable, . The rank of |d ij | for a fixed i can be used for selection of genes belonging to the ith module from gene list. That is, genes with high contribution to the ith coordinate of state variable would be top-ranked and selected as the ith module. A module can also be divided two sub-module by the signs of d ij and the sub-module is determined using a similar approach to the module determination. The dynamic interactions between modules can be obtained through the system model (equation 3).
Because it describes effects from to , the relationship can be seen causal relationship. Thus, we can expect to obtain module-regulatory networks by using the estimated parameter, i.e., F. For the estimation of gene regulatory network using parameters obtained above, we converted the SSM (equations 3 and 4) to a parsimonious representation of the first order vector autoregressive form as below where the autoregressive coefficient matrix is given by Ψ ≡ D T ΛFD. The gene regulatory network was estimated based on Ψ which represents magnitude of interactions between genes.