- Research article
- Open Access
- Published:

# Network motifs: structure does not determine function

*BMC Genomics***volume 7**, Article number: 108 (2006)

## Abstract

### Background

A number of publications have recently examined the occurrence and properties of the feed-forward motif in a variety of networks, including those that are of interest in genome biology, such as gene networks. The present work looks in some detail at the dynamics of the bi-fan motif, using systems of ordinary differential equations to model the populations of transcription factors, mRNA and protein, with the aim of extending our understanding of what appear to be important building blocks of gene network structure.

### Results

We develop an ordinary differential equation model of the bi-fan motif and analyse variants of the motif corresponding to its behaviour under various conditions. In particular, we examine the effects of different steady and pulsed inputs to five variants of the bifan motif, based on evidence in the literature of bifan motifs found in Saccharomyces cerevisiae (commonly known as baker's yeast). Using this model, we characterize the dynamical behaviour of the bi-fan motif for a wide range of biologically plausible parameters and configurations. We find that there is no *characteristic* behaviour for the motif, and with the correct choice of parameters and of internal structure, very different, indeed even opposite behaviours may be obtained.

### Conclusion

Even with this relatively simple model, the bi-fan motif can exhibit a wide range of dynamical responses. This suggests that it is difficult to gain significant insights into biological function simply by considering the connection architecture of a gene network, or its decomposition into simple structural motifs. It is necessary to supplement such structural information by kinetic parameters, or dynamic time series experimental data, both of which are currently difficult to obtain.

## Background

The concept of a *network motif*, introduced by Alon and co-workers [1], has rapidly become one of the central topics of interest in the analysis of complex networks. These networks promise to provide a framework for the understanding of biological processes involving many components such as intra- or inter-cellular networks of interacting genes or proteins. The analysis of such frameworks is one of the key techniques in the rapidly emerging field of systems biology, which makes extensive use of protein interaction, metabolic and gene regulatory networks. Several authors have argued that knowing the structure of these networks, that is knowing the pattern of interactions, will allow us to understand how combinations of genes or proteins interact to achieve specific functional outcomes, and to predict new functional relationships.

A network motif in the sense introduced by Alon and co-workers is a pattern or small sub-graph that occurs more often (at some statistically significant level) in the true network than in an ensemble of networks generated by randomly rewiring the edges in the true network, where the number of nodes and the degree of each node is kept fixed. Of interest are the differences in the frequencies with which network motifs occur in real (biological as well as technological) networks. The recurrent presence of certain motifs has been linked to systematic differences [2] in the functional properties required from networks. In analogy to electric circuits which are built up of smaller modules, such as logic gates, it has been suggested that the motifs in biological networks reflect functional or computational units which combine to regulate the cellular behaviour as a whole. Recently the work of Prill *et al*. [3] has looked at how one aspect of motifs, their stability, influences biological network organisation and specifically the abundance of different motifs in the network.

The detection and enumeration of network motifs has now been followed up by studying the dynamics of corresponding mathematical models of these motifs, especially in the context of transcription regulation networks. These networks aim to describe the links between those genes which code for transcription factors and the genes whose products they control. At the moment, due to the diversity of stimuli a cell/organism can experience, our understanding of the complete sets of regulatory relationships is only preliminary and because of the apparent importance of post-transcriptional regulation, captures only one aspect of the regulatory machinery. Additionally, it must be recalled that these motifs do not exist in isolation within the network, and their behaviour will be heavily influenced by both global and local changes in the cellular environment and the state of the network as a whole. These considerations alone may make attempts to draw positive conclusions about how a motif will behave overly optimistic.

### Network motifs and transcriptional regulation

To a great extent, the control of gene transcription is performed by regulatory proteins known as transcription factors, which bind to specific sites on the DNA. Transcription factors may regulate a gene in isolation, but more commonly there are multiple transcription factors acting in concert. The transcription factors are of course themselves products of other (or possibly the same) genes, resulting in a network of interacting regulatory genes. Milo [1] and others have recently looked at ways in which such networks can be broken down into smaller functional units in order to more easily identify structures within the network. It is hoped that the appearance of such smaller units may be indicative of modular structure and efficient design. One of the most important motifs that has hitherto been identified is the feed-forward motif (see Figure 1(a)). A number of recent papers have examined the dynamics of mathematical models of the feed-forward motif [4–6]. Recently, however, it was noted that while the feed-forward loop motif is unusually common, other motifs may be even more prevalent [7]. In particular it was emphasized that when determining the relative statistical significance of the abundance of various motifs, it was important to use an appropriate null "random" model [8]. It was suggested that previously the background structure, that is, physical distance and compartmentalization, had not been adequately taken into account when generating random networks. By using a more sophisticated null model which took into account spatial separation when considering whether nodes would be connected it was found that the "bi-fan" motif was the most prevalent in the transcription regulation networks of both *E. coli* and *S. cerevisiae*. Thus far, however, there has been no detailed study of the dynamics of this motif.

### Bi-fan motifs in S. cerevisiae

In Table 1 we list bi-fan motifs extracted from the TRANSFAC database, and in Figure 2 highlight the regulatory relationships reported in the literature for some of these motifs. As is apparent from Table 1, several genes are involved in more than one bi-fan motif. Also, the regulatory interactions documented in the database have been ascertained in a non-uniform way, this simply reflects the non-exhaustive nature of present molecular interaction data-sets.

In Figure 2 repressive interactions are shown in red, while promoting interactions are depicted in green. Even in the small subset of genes for which interaction data was available we were able to find exemplars for a range of distinct bi-fan architectures. We consider and contrast the dynamics of each of these variants.

## Dynamical models of motifs

Given a particular transcription regulation motif, such as those in Figure 1, and straightforward assumptions about the binding kinetics of its constituent molecular species (genes and proteins), we can derive a mathematical model for its dynamics. This then allows analysis of the characteristic responses of these constituents of a motif following an external stimulus. Both deterministic and stochastic models are possible. A number of recent papers have constructed and analyzed models for the feed-forward motif. In particular Mangan and Alon [4] have shown that this single simple motif can exhibit a vast range of different dynamical behaviours.

We believe that these attempts to link structure (of motifs) to function in terms of mathematical models raise a number of interesting questions and problems. The present work looks in some detail at the dynamics of the motifs identified in Table 1. Our analysis follows the approach of [5] for the feed-forward motif and in particular we model the bi-fan motif using a system of ordinary differential equations (ODEs).

We use the number of molecules of the two second tier proteins as a measure of motif behaviour. We look in turn at the four variants of the motif identified in yeast under a series of example dynamical scenarios which reflect the diversity of behaviour that can be demonstrated by this simple motif, starting with coherent motifs, that is, motifs in which every transcription factor acts to promote transcription.

## Results

The motifs in Figure 2 can broadly be separated into two categories – A, B and D are generally referred to as "incoherent motifs", while C is a coherent motif. This nomenclature is due to the arrangement in C in which both inputs act as promoters, in comparison to A and D which are both fully incoherent (both second tier proteins have both promoter and repressor inputs), whilst B may be considered to be partially incoherent, as one of the second tier proteins has incoherent inputs, whilst the other has coherent inputs (the model used in this case is given in equation (1), in which we consider co-operative binding in the case that the order of binding is unimportant). In addition we also add a derivative of the C model which we denote as C', in which both inputs still act as promoters, however the assumption about the way the promoters act is different – we no longer require both promoters to bind for *P*_{
W
}to be expressed (in practise it can be seen that these operate as an AND and an OR gate respectively). Further discussion of the many ways in which the transcription factors operates can be modelled can be found in the methods section.

### Constant inputs

We first consider the effect of providing the motif with steady inputs, a biological scenario which corresponds to continued exposure to a either an environmental condition triggering independent factors, or to a constant signal which is split into two signals by the network structure. To examine the various responses, we consider the effect of both signals being turned on continuously at a high level (which we denote in the in Table 3 as a green square), the result of one of the factors occurring at a high level whilst the other is either low (denoted by a yellow square) or off (denoted by red).

In this way one can see the variation in the responses of the two output proteins, which we denote here as *P*_{
Z
}and *P*_{
W
}for all motifs. In this table, we have simply provided the transcription factors at a reasonably high concentration and quantified the response of each protein after the system has stabilized. All of the simulations are performed with the same kinetic parameters, as detailed in the methods section, and the qualification of a high or low response is based on comparison with the response of other motifs and the response of the other protein. Again, these results are colour-coded for easy comparison, with green, yellow and red again corresponding to high, low or no expression.

We can see that there is significant variation in the characterization of the response of the variants of the motif.

### Response to simultaneous pulses

We next look at the responses of the motif in the case in which both transcription factors are turned on simultaneously for 3600 seconds (an hour), and then turned off again. Again, the precise amplitudes and durations of the protein responses varies greatly, and are dependant on the precise values of the kinetic parameters. Two illustrative cases can be seen in Figure 5, again corresponding to variants A and B of the motifs however. The qualitative results are summarized in Figure 4.

### Staggered pulses

Finally we look at the effect of offset pulses in the levels of transcription factors. In the first case, providing the first transcription factor (denoted by *I*_{
X
}) for 3600 seconds, then turning that off, and turning the second factor (*I*_{
Y
}) on for 3600 seconds. This is then reversed. The results can be see in Figure 6, again using the colour coding scheme described above.

Two illustrative cases can be seen in Figure 7, corresponding to variants A and B, in the case in which first *I*_{
X
}and then *I*_{
Y
}are activated.

## Discussion

We have seen that the the bi-fan motif exhibits a rich variety of dynamical behaviour and has the ability to perform a number of potentially useful functions, considering for example Figure 3, the results of steady inputs, we can see that the motif can pass through the inputs as outputs (variant A), act as a logical AND gate (variant C) or an OR gate (variant C'). From this table of responses to the simplest dynamical stimulation we can see already that it is very hard to draw any firm conculsions about the role of a bifan motif without knowing a great deal about its internal structure. Furthermore the simple distinction of coherent or incoherent is also insufficient, as we can see from the opposing behaviours of C and C'. Furthermore, there is great variation in the detailed dynamical response as we have seen from Figures 5 and 7, with differences in total expression levels, steepness of response and timing of peak expression. It must be noted that the majority of the variation in behaviour is not unexpected, and arises as a consequence of the parameters used, however this only exacerbates the difficulties of trying to think of, and use motifs as higher level "functional modules".

## Conclusion

Analysis of many networks in a large number of scenarios, from biological and social networks to technological networks has revealed the presence of motifs, simple patterns which occur with a greater than expected frequency. In the context of biological networks, and specifically transcription regulatory networks, it has been argued [9] that motifs have evolved independently, indicating optimal design. It has been suggested that such motifs may represent "computational elements", and in the case of the feedforward motif, the possibility of the motif acting as a Boolean AND or OR gate has been investigated [4].

Here we have systematically studied a range of dynamical behaviours possible for bi-fan motifs. It had previously been demonstrated that even for the simpler feed-forward motif there is already a vast range of possible dynamical behaviours. For the bi-fan, which is only slightly more complex, we found again that there is a large range of possible response behaviours. Most notably we observed that entirely opposing behaviours (for example, in the case of the responses of variants C and C') can be elicited depending on the nature and strengths of individual interactions within the motif. We were able to identify a variety of combinations of such interactions in the bi-fan motifs found in the transcriptional network of *S. cerevisiae*. In particular, we found examples of both coherent and incoherent architectures. This suggests that simply identifying the presence of particular motifs, without a detailed experimental evaluation of their respective dynamics, is unlikely to offer much insight into the functional properties of real transcriptional networks. In essence this means that knowing the structure of a network, or an inventory of the discrete modules making up that structure, doesn't provide enough information to predict how functional processes occur or how biochemical reactions proceed in a biological system.

Admittedly, our analysis does not take into account the full complexity of real biological bi-fan motifs. This, however, makes the interpretation of bi-fan motif occurrences in nature even more difficult: if simple mathematical models can already demonstrate such different types of behaviour then it is likely that real bi-fan motifs exhibit an even richer repertoire of behaviour. One should also remember that motifs are themselves generally only artificially identified local structures, there is no good reason to believe that their dynamics can necessarily lead to a modularization in understanding the behaviour of transcription networks. Many of the databases used to mine for motifs are based on yeast-2-hybrid experimental data which gives no indication of whether elements of identified structures are active at the same time.

Furthermore we usually lack information about the behaviour of the input signals, essential to understand the relationship of the motif to the network as a whole. Thus, we can conclude that simply knowing the connection structure of this motif is insufficient to give much insight into its function or even its dynamical response. In order to do this, we would need much more detailed experimental information about binding and unbinding rates and other kinetic information. Currently, such experimental data is difficult to determine in a wholesale fashion, making the large scale analysis of the function of transcription networks very problematic.

## Methods

Following [5] we only consider deterministic models and hence use systems of ordinary differential equations (ODEs) to describe the time evolution of the number of molecules of the various constituent species of a bi-fan motif. In many cases the copy numbers of some of these (*e.g*. transcription factors) will be too low for ODEs to capture the dynamics of the real system. In such cases we can use a stochastic model which can be simulated using for instance the Gillespie algorithm [10]. However [5] found that for feed-forward motif the average behaviour of an ensemble of such simulations is well characterised by ODEs describing the mean behaviour. We confirmed this for the bi-fan motif (data not shown). Following the custom in the literature for feed-forward motifs, we refer to the bi-fan network in which all interactions act directly to promote expression as "coherent", and use the term "incoherent" otherwise. We analyse the dynamical response of the motif by observing the expression levels of the proteins in the motif. We assume that the motif is initially at equilibrium with zero input so that in the absence of basal transcription none of the proteins in the motif are initially expressed. We then stimulate the system at one or both inputs for varying periods and at varying strengths, corresponding to different dynamical situations. Our models of transcription regulation were translated into systems of approximately 30 ordinary differential equations [see Additional File 1], which were subsequently solved using Mathematica (Wolfram Research, Urbana Champagne Illinois).

### The bi-fan model

The bi-fan motif, see Figure 1, consists of four regulatory systems, denoted as *X*, *Y*, *Z* and *W*. It is necessary to represent each of the systems in a biologically relevant way and with realistic parameter choices. The model used here follows that of [5], in that each of the systems is composed of a transcriptional part whereby one or more transcription factors bind to promoter regions and regulate the production of mRNA, as well as a translational part whereby the mRNA is translated into protein, which may act as a transcription factor for another regulatory system.

The model is certainly not the simplest available – we could instead have modelled the motif as a Boolean network or with weighted funtions between nodes, however we believe that to do so does not take advantage of the knowledge that has been gained in study of the physical mechanism of gene regulation, and futhermore the model we do use has the advantage that there is considerable experimental data available to justify the choice of rate constants, and not to have used this would have been to not take full advantage of the experimental data. Equally, this model does not attempt to model the many hundreds of intermediate steps involved in each process such as transcription, as the steps introduced would necessarily have been arbitrary, and without experimental data to justify rate constants.

The model attempts to describe the process of gene regulation from transcription binding to protein production in a physically reasonable way. Each system, for example *X*, is represented as having a section of DNA *D*_{
X
}which codes for the mRNA *M*_{
X
}. First, the transcription factors, which may either be proteins produced by one of the systems in the motif, or by an external signal, bind to the promoter region to form a complex *Q*_{
X
}. RNA polymerase molecules *R*_{
X
}then bind to this complex as they read the DNA, forming a secondcomplex ${Q}_{X}^{*}$. This complex then breaks down as the reading process completes, releasing in the process *Q*_{
X
}, *R*_{
X
}and the newly formed mRNA *M*_{
X
}. The mRNA molecules are then translated, and copies of the protein *P*_{
X
}are produced. The initial numbers of molecules used in the model were: one copy of the DNA for each system and 30 copies of RNA polymerase, all others were set to zero. These reactions are modelled using the law of mass action.

### Modelling the interactions

The detailed modelling of the interactions between the two tiers of proteins can be carried out in a number of ways, all of which will have an effect on the resultant dynamics.

#### Co-operative binding – order unimportant

In this case, considered in the paper [5], both transcription factors act to promote transcription, however they are both needed for transcription to take place. This is modelled by introducing an intermediate species, *T*. The equations for this model are then as follows:

$\begin{array}{lll}{D}_{W}+{P}_{Y}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{W}\hfill \\ {D}_{W}+{P}_{X}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {T}_{W}\hfill \\ {T}_{W}+{R}_{W}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{W}^{*}\hfill \\ {Q}_{W}^{*}\hfill & \stackrel{{k}_{8}}{\to}\hfill & {T}_{W}+{M}_{W}+{R}_{W}\hfill \end{array}\left(1\right)$

#### Co-operative binding – order important

$\begin{array}{lll}{D}_{W}+{P}_{X}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{W}\hfill \\ {Q}_{W}+{P}_{Y}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {{Q}^{\prime}}_{W}\hfill \\ {{Q}^{\prime}}_{W}+{R}_{W}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{W}^{*}\hfill \\ {Q}_{W}^{*}\hfill & \stackrel{{k}_{8}}{\to}\hfill & {T}_{W}+{M}_{W}+{R}_{W}\hfill \end{array}\left(2\right)$

#### Independent promoters – order unimportant

This is a simpler case than the above, and is simply the case in which both transcription factors act to promote transcription independently. The equations are then as follows:

$\begin{array}{lll}{D}_{W}+{P}_{Y}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{W}\hfill \\ {D}_{W}+{P}_{X}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{W}\hfill \\ {Q}_{W}+{R}_{W}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{W}^{*}\hfill \\ {Q}_{W}^{*}\hfill & \stackrel{{k}_{8}}{\to}\hfill & {T}_{W}+{M}_{W}+{R}_{W}\hfill \end{array}\left(3\right)$

#### Promoter/repressor combination

In this case the repressor sequesters DNA, making it unavailable for the promoter to bind.

$\begin{array}{lll}{D}_{W}+{P}_{Y}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {T}_{W}\hfill \\ {D}_{W}+{P}_{X}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{W}\hfill \\ {Q}_{W}+{R}_{W}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{W}^{*}\hfill \\ {Q}_{W}^{*}\hfill & \stackrel{{k}_{8}}{\to}\hfill & {T}_{W}+{M}_{W}+{R}_{W}\hfill \end{array}\left(4\right)$

#### Promoter/repressor combination – binding order important

In this case, the binding order of *P*_{
X
}and *P*_{
Y
}to the DNA for gene W now plays a role. If *P*_{
Y
}binds first it will block the production of *P*_{
W
}, perhaps by altering the conformation of the binding site; if, however, *P*_{
X
}binds first then *P*_{
Y
}is still required for production of the protein. This has the effect of making *P*_{
Y
}a repressor when it binds first to the regulation site. Examples of where this order specific behaviour may occur include the effect of the transcription factor p53 on chromatin structure [11]. Other papers which discuss the importance of binding order include [12] and [13]. We see the effect that this can have on motif behaviour in Figure 8, where the two graphs correspond to high *I*_{
X
}and low *I*_{
Y
}in (a), and the reverse in (b). Here the motif is apparently able to differentiate between signal combinations.

$\begin{array}{lll}{D}_{W}+{P}_{Y}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{W}\hfill \\ {D}_{W}+{P}_{X}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {T}_{W}\hfill \\ {T}_{W}+{P}_{Y}\hfill & \underset{{{k}^{\prime}}_{-7}}{\overset{{{k}^{\prime}}_{7}}{\rightleftharpoons}}\hfill & {{Q}^{\prime}}_{W}\hfill \\ {{Q}^{\prime}}_{W}+{R}_{W}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{W}^{*}\hfill \\ {Q}_{W}^{*}\hfill & \stackrel{{k}_{8}}{\to}\hfill & {{Q}^{\prime}}_{W}+{M}_{W}+{R}_{W}\hfill \end{array}\left(5\right)$

#### Illustrative case

With these considerations in mind, we give the full model for variant C, the coherent motif (the other cases are similar). The basic model for *X* and *Y*, where *I*_{
X
}and *I*_{
Y
}represent the amounts of externally produced transcription factors:

For protein *X*:

$\begin{array}{lll}{D}_{X}+{I}_{X}\hfill & \underset{{k}_{-1}}{\overset{{k}_{1}}{\rightleftharpoons}}\hfill & {Q}_{X}\hfill \\ {Q}_{X}+{R}_{X}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{X}^{*}\hfill \\ {Q}_{X}^{*}\hfill & \stackrel{{k}_{3}}{\to}\hfill & {Q}_{X}+{M}_{X}+{R}_{X}\hfill \\ {M}_{X}\hfill & \stackrel{{k}_{4}}{\to}\hfill & {M}_{X}+{P}_{X}\hfill \\ {M}_{X}\hfill & \stackrel{{k}_{5}}{\to}\hfill & \varnothing \hfill \\ {P}_{X}\hfill & \stackrel{{k}_{X6}}{\to}\hfill & \varnothing \hfill \end{array}\left(6\right)$

For protein *Y*:

$\begin{array}{lll}{D}_{Y}+{I}_{Y}\hfill & \underset{{k}_{-1}}{\overset{{k}_{1}}{\rightleftharpoons}}\hfill & {Q}_{Y}\hfill \\ {Q}_{Y}+{R}_{Y}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{Y}^{*}\hfill \\ {Q}_{Y}^{*}\hfill & \stackrel{{k}_{3}}{\to}\hfill & {Q}_{Y}+{M}_{Y}+{R}_{Y}\hfill \\ {M}_{Y}\hfill & \stackrel{{k}_{4}}{\to}\hfill & {M}_{Y}+{P}_{Y}\hfill \\ {M}_{Y}\hfill & \stackrel{{k}_{5}}{\to}\hfill & \varnothing \hfill \\ {P}_{Y}\hfill & \stackrel{{k}_{Y6}}{\to}\hfill & \varnothing \hfill \end{array}\left(7\right)$

In the case in which both transcription factors act as promoters, and in which binding order is unimportant, that is, Equation (1), then the equations for proteins *Z* and *W* then become:

$\begin{array}{lll}{D}_{Z}+{P}_{X}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{Z}\hfill \\ {Q}_{Z}+{P}_{Y}\hfill & \underset{{{k}^{\prime}}_{-7}}{\overset{{{k}^{\prime}}_{7}}{\rightleftharpoons}}\hfill & {{Q}^{\prime}}_{Z}\hfill \\ {D}_{Z}+{P}_{Y}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {T}_{Z}\hfill \\ {T}_{Z}+{P}_{X}\hfill & \underset{{{k}^{\prime}}_{-7}}{\overset{{{k}^{\prime}}_{7}}{\rightleftharpoons}}\hfill & {{Q}^{\prime}}_{Z}\hfill \\ {{Q}^{\prime}}_{Z}+{R}_{Z}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{Z}^{*}\hfill \\ {Q}_{Z}^{*}\hfill & \stackrel{{k}_{8}}{\to}\hfill & {{Q}^{\prime}}_{Z}+{M}_{Z}+{R}_{Z}\hfill \\ {M}_{Z}\hfill & \stackrel{{k}_{4}}{\to}\hfill & {M}_{Z}+{P}_{Z}\hfill \\ {M}_{Z}\hfill & \stackrel{{k}_{5}}{\to}\hfill & \varnothing \hfill \\ {P}_{Z}\hfill & \stackrel{{k}_{Z6}}{\to}\hfill & \varnothing \hfill \end{array}\left(8\right)$

$\begin{array}{lll}{D}_{W}+{P}_{Y}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {Q}_{W}\hfill \\ {Q}_{W}+{P}_{Y}\hfill & \underset{{{k}^{\prime}}_{-7}}{\overset{{{k}^{\prime}}_{7}}{\rightleftharpoons}}\hfill & {{Q}^{\prime}}_{W}\hfill \\ {D}_{W}+{P}_{X}\hfill & \underset{{k}_{-7}}{\overset{{k}_{7}}{\rightleftharpoons}}\hfill & {T}_{W}\hfill \\ {T}_{W}+{P}_{Y}\hfill & \underset{{{k}^{\prime}}_{-7}}{\overset{{{k}^{\prime}}_{7}}{\rightleftharpoons}}\hfill & {{Q}^{\prime}}_{W}\hfill \\ {{Q}^{\prime}}_{W}+{R}_{W}\hfill & \underset{{k}_{-2}}{\overset{{k}_{2}}{\rightleftharpoons}}\hfill & {Q}_{W}^{*}\hfill \\ {Q}_{W}^{*}\hfill & \stackrel{{k}_{8}}{\to}\hfill & {{Q}^{\prime}}_{W}+{M}_{W}+{R}_{W}\hfill \\ {M}_{W}\hfill & \stackrel{{k}_{4}}{\to}\hfill & {M}_{W}+{P}_{W}\hfill \\ {M}_{W}\hfill & \stackrel{{k}_{5}}{\to}\hfill & \varnothing \hfill \\ {P}_{W}\hfill & \stackrel{{k}_{W6}}{\to}\hfill & \varnothing \hfill \end{array}\left(9\right)$

The parameters used for the rates of transcription and translation are based on the derivations in [14], and were obtained from experimentally determined rates [15]. The values used for the basic model are shown in Table 2.

## Appendix: motifs and networks

A *network* or a *graph* is a mathematical object consisting of a set *V* of vertices, and a set *E* of edges connecting vertices. If the edges have arrows or directions then the network is called a directed network. Many different types of data and relationships may be represented in this way. Examples of undirected networks are networks of pairs of proteins which are known to interact. The networks considered in this paper have as their vertices genes which regulate or are regulated by the product of other genes. The edges then represent the relationship of control, and therefore the network is a directed network, with the direction of the edges indicating the direction of control. Figure A1 is an example of a directed network. Motifs, introduced in [1] are small sub-networks or patterns of vertices and edges which occur with in the network. A motif is considered to be interesting if it occurs unusually frequently in the network. In Figure A1 a (feed-forward) motif is highlighted in colour.

## Appendix figure

A directed network with a feed-forward motif highlighted in colour. In the case of a transcription regulation network the arrows indicate the direction of transcriptional control.

## References

- 1.
Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U: Network motifs: simple building blocks of complex networks. Science. 2002, 298 (5594): 824-827. 10.1126/science.298.5594.824.

- 2.
Milo R, Itzkovitz S, Kashtan N, Levitt R, Shen-Orr S, Ayzenshtat I, Sheffer M, Alon U: Superfamilies of evolved and designed networks. Science. 2004, 303 (5663): 1538-1542. 10.1126/science.1089167.

- 3.
Prill R, Iglesias P, Levchenko A: Dynamic Properties of Network Motifs Contribute to Biological Network Organization. PLoS Biol. 2005, 3 (11): e343-10.1371/journal.pbio.0030343.

- 4.
Mangan S, Alon U: Structure and function of the feed-forward loop network motif. Proc Natl Acad Sci U S A. 2003, 100 (21): 11980-11985. 10.1073/pnas.2133841100.

- 5.
Hayot F, Jayaprakash C: A feedforward loop motif in transcriptional regulation: induction and repression. J Theor Biol. 2005, 234: 133-143. 10.1016/j.jtbi.2004.11.010.

- 6.
Mangan S, Zaslaver A, Alon U: The coherent feedforward loop serves as a sign-sensitive delay element in transcription networks. J Mol Biol. 2003, 334 (2): 197-204. 10.1016/j.jmb.2003.09.049.

- 7.
Artzy-Randrup Y, Fleishman SJ, Ben-Tal N, Stone L: Comment on "Network motifs: simple building blocks of complex networks" and "Superfamilies of evolved and designed networks". Science. 2004, 305 (5687): 1107-10.1126/science.1099334.

- 8.
Huang S: Back to the biology in systems biology: what can we learn from biomolecular networks?. Brief Fund Genomic Proteomic. 2004, 2 (4): 279-297. 10.1093/bfgp/2.4.279.

- 9.
Conant GC, Wagner A: Convergent evolution of gene circuits. Nat Genet. 2003, 34 (3): 264-266. 10.1038/ng1181.

- 10.
Gillespie D: Exact stochastic simulation of coupled chemical reactions. J Phys Chem. 1977, 81: 2340-2361. 10.1021/j100540a008.

- 11.
Ogden SK, Lee KC, Wernke-Dollries K, Stratton SA, Aronow B, Barton MC: p53 targets chromatin structure alteration to repress alpha-fetoprotein gene expression. J Biol Chem. 2001, 276 (45): 42057-42062. 10.1074/jbc.C100381200.

- 12.
Hiroi H, Christenson LK, Chang L, Sammel MD, Berger SL, Strauss JFr: Temporal and spatial changes in transcription factor binding and histone modifications at the steroidogenic acute regulatory protein (stAR) locus associated with stAR transcription. Mol Endocrinol. 2004, 18 (4): 791-806. 10.1210/me.2003-0305.

- 13.
Cosma MP: Ordered recruitment: gene-specific mechanism of transcription activation. Mol Cell. 2002, 10 (2): 227-236. 10.1016/S1097-2765(02)00604-4.

- 14.
Bundschuh R, Hayot F, Jayaprakash C: The role of dimerization in noise reduction of simple genetic networks. J Theor Biol. 2003, 220 (2): 261-269. 10.1006/jtbi.2003.3164.

- 15.
Ptashne M, Jeffrey A, Johnson AD, Maurer R, Meyer BJ, Pabo CO, Roberts TM, Sauer RT: How the lambda repressor and cro work. Cell. 1980, 19: 1-11. 10.1016/0092-8674(80)90383-9.

- 16.
Mai B, Breeden L: CLN1 and its repression by Xbp1 are important for efficient sporulation in budding yeast. Mol Cell Biol. 2000, 20 (2): 478-487. 10.1128/MCB.20.2.478-487.2000.

- 17.
Althoefer H, Schleiffer A, Wassmann K, Nordheim A, Ammerer G: Mcm1 is required to coordinate G2-specific transcription in Saccharomyces cerevisiae. Mol Cell Biol. 1995, 15 (11): 5917-5928.

- 18.
Kratzer S, Schuller HJ: Transcriptional control of the yeast acetyl-CoA synthetase gene, ACS1, by the positive regulators CAT8 and ADR1 and the pleiotropic repressor UME6. Mol Microbiol. 1997, 26 (4): 631-641. 10.1046/j.1365-2958.1997.5611937.x.

- 19.
Kratzer S, Schuller HJ: Carbon source-dependent regulation of the acety1-coenzyme A synthetase-encoding gene ACS1 from Saccharomyces cerevisiae. Gene. 1995, 161: 75-79. 10.1016/0378-1119(95)00289-I.

- 20.
Schuller HJ, Schutz A, Knab S, Hoffmann B, Schweizer E: Importance of general regulatory factors Rap1p, Abf1p and Reb1p for the activation of yeast fatty acid synthase genes FAS1 and FAS2. Eur J Biochem. 1994, 225: 213-222. 10.1111/j.1432-1033.1994.00213.x.

- 21.
Herrero P, Flores L, de la Cera T, Moreno F: Functional characterization of transcriptional regulatory elements in the upstream region of the yeast GLK1 gene. Biochem J. 1999, 343 (Pt 2): 319-325. 10.1042/0264-6021:3430319.

- 22.
Moreno-Herrero F, Herrero P, Colchero J, Baro AM, Moreno F: Analysis by atomic force microscopy of Med8 binding to cis-acting regulatory elements of the SUC2 and HXK2 genes of saccharomyces cerevisiae. FEBS Lett. 1999, 459 (3): 427-432. 10.1016/S0014-5793(99)01289-2.

- 23.
Bu Y, Schmidt MC: Identification of cis-acting elements in the SUC2 promoter of Saccharomyces cerevisiae required for activation of transcription. Nucleic Acids Res. 1998, 26 (4): 1002-1009. 10.1093/nar/26.4.1002.

- 24.
Estruch F, Carlson M: Two homologous zinc finger genes identified by multicopy suppression in a SNF1 protein kinase mutant of Saccharomyces cerevisiae. Mol Cell Biol. 1993, 13 (7): 3872-3881.

- 25.
Nehlin JO, Carlberg M, Ronne H: Yeast SKO1 gene encodes a bZIP protein that binds to the CRE motif and acts as a repressor of transcription. Nucleic Acids Res. 1992, 20 (20): 5271-5278.

- 26.
Coffman JA, Rai R, Loprete DM, Cunningham T, Svetlov V, Cooper TG: Cross regulation of four GATA factors that control nitrogen catabolic gene expression in Saccharomyces cerevisiae. J Bacteriol. 1997, 179 (11): 3416-3429.

- 27.
Cunningham TS, Svetlov VV, Rai R, Smart W, Cooper TG: G1n3p is capable of binding to UAS(NTR) elements and activating transcription in Saccharomyces cerevisiae. J Bacteriol. 1996, 178 (12): 3470-3479.

- 28.
Talibi D, Grenson M, Andre B: Cis- and trans-acting elements determining induction of the genes of the gamma-aminobutyrate (GABA) utilization pathway in Saccharomyces cerevisiae. Nucleic Acids Res. 1995, 23 (4): 550-557.

- 29.
Andre B, Talibi D, Soussi Boudekou S, Hein C, Vissers S, Coornaert D: Two mutually exclusive regulatory systems inhibit UASGATA, a cluster of 5'-GAT(A/T)A-3' upstream from the UGA4 gene of Saccharomyces cerevisiae. Nucleic Acids Res. 1995, 23 (4): 558-564.

- 30.
Cunningham TS, Cooper TG: The Saccharomyces cerevisiae DAL80 repressor protein binds to multiple copies of GATAA-containing sequences (URSGATA). J Bacteriol. 1993, 175 (18): 5851-5861.

- 31.
Proft M, Serrano R: Repressors and upstream repressing sequences of the stress-regulated ENA1 gene in Saccharomyces cerevisiae: bZIP protein Sko1p confers HOG-dependent osmotic regulation. Mol Cell Biol. 1999, 19: 537-546.

- 32.
Huie MA, Scott EW, Drazinic CM, Lopez MC, Hornstra IK, Yang TP, Baker HV: Characterization of the DNA-binding activity of GCR1: in vivo evidence for two GCR1-binding sites in the upstream activating sequence of TPI of Saccharomyces cerevisiae. Mol Cell Biol. 1992, 12 (6): 2690-2700.

- 33.
Brindle PK, Holland JP, Willett CE, Innis MA, Holland MJ: Multiple factors bind the upstream activation sites of the yeast enolase genes ENO1 and ENO2: ABFI protein, like repressor activator protein RAP1, binds cis-acting sequences which modulate repression or activation of transcription. Mol Cell Biol. 1990, 10 (9): 4872-4885.

- 34.
Machida M, Jigami Y, Tanaka H: Purification and characterization of a nuclear factor which binds specifically to the upstream activation sequence of Saccharomyces cerevisiae enolase 1 gene. Eur J Biochem. 1989, 184 (2): 305-311. 10.1111/j.1432-1033.1989.tb15020.x.

- 35.
Machida M, Uemura H, Jigami Y, Tanaka H: The protein factor which binds to the upstream activating sequence of Saccharomyces cerevisiae ENO1 gene. Nucleic Acids Res. 1988, 16 (4): 1407-1422.

- 36.
Willett CE, Gelfman CM, Holland MJ: A complex regulatory element from the yeast gene ENO2 modulates GCRl-dependent transcriptional activation. Mol Cell Biol. 1993, 13 (4): 2623-2633.

- 37.
Holland JP, Brindle PK, Holland MJ: Sequences within an upstream activation site in the yeast enolase gene ENO2 modulate repression of ENO2 expression in strains carrying a null mutation in the positive regulatory gene GCR1. Mol Cell Biol. 1990, 10 (9): 4863-4871.

- 38.
Zaragoza O, Vincent O, Gancedo JM: Regulatory elements in the FBP1 promoter respond differently to glucose-dependent signals in Saccharomyces cerevisiae. Biochem J. 2001, 359 (Pt 1): 193-201. 10.1042/0264-6021:3590193.

- 39.
Vincent O, Carlson M: Sip4, a Snf1 kinase-dependent transcriptional activator, binds to the carbon source-responsive element of gluconeogenic genes. EMBO J. 1998, 17 (23): 7002-7008. 10.1093/emboj/17.23.7002.

- 40.
Scholer A, Schuller HJ: A carbon source-responsive promoter element necessary for activation of the isocitrate lyase gene ICL1 is common to genes of the gluconeogenic pathway in the yeast Saccharomyces cerevisiae. Mol Cell Biol. 1994, 14 (6): 3613-3622.

- 41.
Roth S, Schuller HJ: Cat8 and Sip4 mediate regulated transcriptional activation of the yeast malate dehydrogenase gene MDH2 by three carbon source-responsive promoter elements. Yeast. 2001, 18 (2): 151-162. 10.1002/1097-0061(20010130)18:2<151::AID-YEA662>3.0.CO;2-Q.

## Acknowledgements

We thank the Wellcome Trust for a research studentship (PJI) and a research fellowship (MPHS). We would also like to thank the reviewers for their helpful comments.

## Author information

## Additional information

### Authors' contributions

PJI performed the analysis. PJI, MPHS and JS designed the study and jointly wrote the paper.

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

## About this article

#### Received

#### Accepted

#### Published

#### DOI

### Keywords

- Directed Network
- Network Motif
- Ordinary Differential Equation Model
- Transcription Regulation Network
- Binding Order