Skip to main content
  • Research article
  • Open access
  • Published:

Diverse biological processes coordinate the transcriptional response to nutritional changes in a Drosophila melanogaster multiparent population



Environmental variation in the amount of resources available to populations challenge individuals to optimize the allocation of those resources to key fitness functions. This coordination of resource allocation relative to resource availability is commonly attributed to key nutrient sensing gene pathways in laboratory model organisms, chiefly the insulin/TOR signaling pathway. However, the genetic basis of diet-induced variation in gene expression is less clear.


To describe the natural genetic variation underlying nutrient-dependent differences, we used an outbred panel derived from a multiparental population, the Drosophila Synthetic Population Resource. We analyzed RNA sequence data from multiple female tissue samples dissected from flies reared in three nutritional conditions: high sugar (HS), dietary restriction (DR), and control (C) diets. A large proportion of genes in the experiment (19.6% or 2471 genes) were significantly differentially expressed for the effect of diet, and 7.8% (978 genes) for the effect of the interaction between diet and tissue type (LRT, Padj. < 0.05). Interestingly, we observed similar patterns of gene expression relative to the C diet, in the DR and HS treated flies, a response likely reflecting diet component ratios. Hierarchical clustering identified 21 robust gene modules showing intra-modularly similar patterns of expression across diets, all of which were highly significant for diet or diet-tissue interaction effects (FDR Padj. < 0.05). Gene set enrichment analysis for different diet-tissue combinations revealed a diverse set of pathways and gene ontology (GO) terms (two-sample t-test, FDR < 0.05). GO analysis on individual co-expressed modules likewise showed a large number of terms encompassing many cellular and nuclear processes (Fisher exact test, Padj. < 0.01). Although a handful of genes in the IIS/TOR pathway including Ilp5, Rheb, and Sirt2 showed significant elevation in expression, many key genes such as InR, chico, most insulin peptide genes, and the nutrient-sensing pathways were not observed.


Our results suggest that a more diverse network of pathways and gene networks mediate the diet response in our population. These results have important implications for future studies focusing on diet responses in natural populations.


Individuals can withstand changing nutritional conditions by flexibly adjusting the allocation of resources to competing life history traits, allowing populations to adapt and thrive. Individual ability to partition available nutrients and optimize fitness gains requires complex cooperation at multiple levels of functional and structural organization in tandem with prevailing conditions dictating nutrient availability. Changes in diet are associated with many phenotypic changes across the tree of life. For example, in many metazoan species, moderate nutrient limitation extends lifespan and delays age-related physiological decline [1,2,3,4]. In fluctuating resource conditions, this effect, in which the individual often shifts nutrients away from reproduction and towards somatic maintenance and repair may be adaptive, ensuring survival in bad conditions and reproduction when good conditions return [5, 6]. On the other hand, constant dietary excess such as diets high in sugar, promote hyperglycemia in many genetic backgrounds, accelerate the rate of aging, and reduce lifespan [7,8,9,10].

A large and growing body of literature points to endocrine pathways being involved in nutrient perception and balance in order to coordinate organismal response to diet change. Nutrient sensing pathways are associated with aging and longevity from yeast to mammals [11,12,13,14], reviewed in [15,16,17,18,19]. The insulin/insulin-like signaling (IIS) together with the target of rapamycin (TOR) are among the most studied pathways. These pathways jointly regulate multiple metabolic processes affecting growth, reproduction, lifespan, and resistance to stress [20,21,22]. In insects, IIS/TOR signaling determines body size by coordinating nutrition with cell growth, and steroid and neuropeptide hormones to cede feeding when the target mass is attained [23]. Mutations, including experimental gene knockouts, that reduce IIS/TOR signaling reduce growth and reproduction, and increase stress resistance and lifespan [12, 24, 25], and apparently coordinates nutrient status with metabolic processes. For example, lack of nutrients blocks insulin production [26] and mimics the effects of a down-regulated IIS/TOR [27], while a hyperactivated IIS/TOR pathway can exclude the necessity for nutrients [27]. Fruit flies raised on excess sugar diets as larvae become hyperglycemic, fat and insulin resistant, and show increased expression of genes associated with gluconeogenesis, lipogenesis, β-oxidation, and FOXO effectors [8, 9]. Additionally, modulating TOR signaling slows aging by affecting downstream processes including mRNA translation, autophagy, endoplasmic reticulum stress signaling, and metabolism (reviewed in [28]) .

Specific examples on the role of nutrient sensors abound in literature. Briefly, the forkhead transcription factor foxo in Drosophila melanogaster (D. melanogaster) and foxo orthologs in the nematode Caenohabditis elegans (daf-16) and vertebrates (FoxO) is the main transcription factor target of IIS/TOR, and is required for lifespan extension by a reduced IIS, reviewed in [18]. An activated foxo represses production of insulin-like peptides (ILPs) which in turn reduces IIS signaling [29, 30]. In a related mechanism, resveratrol-mediated activation of sirtuin genes mimic the effect of dietary restriction and promote lifespan in many metazoan species [1]. For example, in the cotton bollworm Helicoverpa armigera, Sirt2 extends lifespan by its role in cellular energy production and amino acid metabolism [31, 32]. Further, the regulation of appetite which has a major effect on plastic nutrient allocation (reviewed in [33]), depends on leptin signaling together with the AMP-activated protein kinase (AMPK), influencing nutrient intake and subsequent production of ILPs [34,35,36]. Lastly, the hormones ecdysone and juvenile hormone also bear on the IIS to regulate ovary size and influence dispersal-reproduction trade-offs in D. melanogaster and sand crickets, Gryllus firmus, respectively [21, 37,38,39,40], reviewed in [33]. In spite of these and other examples that demonstrate the effect of genetic variation on the metabolic response to nutrition, the underlying genetic basis diet effects in natural populations remain elusive [41].

Much of the current focus on how endocrine mechanisms affect phenotypic response to nutrition proceed in one-gene-at-a-time knockout strategies to elucidate function. This approach has been informative, largely in model species, but also supported to some extent in wild species. Endocrine pathways have been shown to affect plastic and adaptive resource allocation in wild D. melanogaster [42, 43], sexual selection of horn size in rhinoceros beetles [44], sex-specific mandible development in staghorn beetles [45, 46] and morph determination in wing dimorphic sand crickets [38, 47,48,49], leading to the conclusion that endocrine pathways mediate the evolution of resource allocation strategies [50,51,52]. However, natural populations have not consistently revealed these same genetic mechanisms [53,54,55,56] suggesting that large effect studies in mutants capture only the tails of effect distributions that occur in the wild [57], or that different mechanisms overlapping with endocrine pathways may be involved [58, 59], reviewed in [33]. This disconnect means that our understanding of the specific genetic mechanisms that govern the response to diet in natural populations remains limited.

The majority of the studies that have characterized changes in gene expression in response to diet have controlled for the genetic background by using one or a few inbred lines [60,61,62]. However, previous studies have shown that different inbred lines can vary widely in how they respond to diet changes [61, 63, 64], meaning that the findings from a single genotype could represent a highly specific response and thus not be broadly applicable. One approach to improve the chances that ecologically relevant mechanisms are identified is to start with experimental panels that include greater levels of standing genetic diversity available in a species in the wild. Multi-way advanced-intercross populations founded from multiple geographical inbred lines (i.e. multiparent populations - MPPs) typically integrate a greater subset of genetic diversity, and increase the ability to identify genetic variants underlying complex traits. These resources have gained traction in the past two decades in both plants and animals for the purposes of genetic mapping [65,66,67,68,69,70]. A study characterizing the overall transcriptional response to diet in a multiparent population would better capture the average response of the population and have the potential to be more broadly applicable than those characterized by only a few genotypes. In addition, MPPs are being used widely to map different complex traits, including responses to nutrition, and gaining a more complete picture of the changes in gene expression with diet could help identify possible candidate genes underlying mapped QTL in those studies.

In this study, our goal is to understand the transcriptional response in different nutritional environments in an outbred multiparent population of D. melanogaster. We use an admixed population derived from the Drosophila Synthetic Population Resources (DSPR). The DSPR is a large two-replicate set of advanced recombinant inbred lines (RILs), each derived from 8 inbred lines originating from several continents. The promise of this resource over traditional laboratory populations for characterizing the genetic mechanisms for complex traits is discussed in depth elsewhere [71, 72]. We analyze RNA-seq data sequenced from pooled samples of female D. melanogaster exposed to multiple diet conditions differing in the proportion of protein and carbohydrate sources: dietary restriction (DR), control (C) and high sugar (HS). Here, we profile gene expression for three tissues: heads (H), bodies (B) and ovaries (O), in high replication, and ask:

  1. 1)

    How does gene expression change in response to nutritional environment?

  2. 2)

    What specific biological processes and pathways are significantly perturbed by diet treatment?

  3. 3)

    Which sets of genes show similar expression patterns across diets and tissues, and what biological processes are involved in these specific patterns?


Global expression patterns

We use a replicate population of the DSPR comprising >800 RILs. This population was developed from eight inbred founder lines that have been fully genetically characterized (full sequences, the haplotype structure inferred, ~1.2 million SNPs identified, and the RILs genotyped at >10,000 SNPs). We generated a single outbred panel from 835 RILs by intercrossing the lines for five generations. Resulting flies were reared on three experimental diets (DR, C, and HS) for 10 days post-eclosion before isolation of total RNA from pools of 100 female fly tissues (head, body and ovary pair) in six replicates for each tissue-diet combination (Fig. 1). These 54 RNA samples (18 for each diet) were sequenced single end, generating a total of 35,572 transcripts, out of which 18,678 remained for analysis after removal of transcripts with a variance across samples of less than one [73]. Overall expression levels were generally consistent across diet treatments and tissues (Fig. 2). One sample (bodies, B2) in the DR treatment showed slightly lower median expression compared to the rest, but was similar enough to the others and was retained in the analysis.

Fig. 1
figure 1

Study design. Flies drawn from 835 RILs of the DSPR were bred together for 5 generations to create an outbred panel. Eggs were collected from this homogenous population and resulting flies reared on dietary restriction (DR), control (C) and high sugar (HS) diets in six replicates for 10 days from day 12 post-oviposition. Heads, ovaries and bodies were dissected over 100 female flies from each treatment replicate for total mRNA extraction

Fig. 2
figure 2

Principal components analysis (PCA) to visualize the overall effect of diet and tissue. Different colors denote different diets and different shapes correspond to the different tissues. Two dimensions are shown (PC1 and PC2)

To assess global expression patterns over tissues and diets we performed principal components analysis (PCA) on all samples using an expression matrix from which batch effects had been removed (Fig. 2). A similar figure prior to batch removal is shown in Additional file 1. As expected, tissue effects strongly dominated variance in the first two components which jointly accounted for 94% of the total variance. PC1 which explains 65% of the variance in expression presents non-overlapping separation of tissue expression, although body and head expression appear somewhat similar compared to the ovaries. PC2 (29%) distinguishes expression in bodies from that in heads and ovaries.

Differential gene expression in response to diet

We used DESeq2 to quantify differential gene expression in head, ovary and body samples obtained from adult flies held on C, DR, and HS diet treatments. We obtained lists of genes significantly differentially expressed due to the main effect of diet. After filtering out genes with a low overall count, a total of 12,614 genes remained in the experiment based on which we report all subsequent results. Of these, 2475 genes (19.6%, Additional file 2) were differentially expressed in response to diet treatment, and 978 (7.8%, Additional file 3) for the interaction between diet and tissue (LRT, Padj < 0.05). The overall expression differences are visualized for each tissue and diet pair in Fig. 3. Overall, relative to the C diet, many genes in all organs were expressed in the same direction in the DR and HS diets, meaning that the genes that have increased expression in the DR diet tend to also have increased expression in HS, and vice versa. This is indicated by the positive relationship between the fold changes for each of these diets (bodies: r = 0.64; heads: r = 0.59; ovaries: r = 0.59) and the proportion of genes that trend in the same direction for these two diets (i.e. number upregulated in both + number downregulated in both divided by the total number of genes; bodies: 0.70; heads: 0.82; ovaries: 0.66). However, this observed relationship between fold changes could be a result of comparing two ratios that are both calculated relative to the same reference diet (C), as randomly generated data will produce a positive relationship between these quantities and greater than 50% would be expected to show a fold change in the same direction. Several lines of evidence suggest this trend is biologically meaningful and not simply a result of comparing ratios. First, PCAs performed for each tissue separately show that clusters for DR and HS diets overlap for both bodies and heads, while the C diet forms its own cluster (Fig. 4). For ovaries, all three diets form separate clusters. Second, we calculated fold changes using both other diets as the reference diet and compared the correlation and proportion of genes trending in the same direction. In all cases, the correlation we observe between the DR and HS fold changes relative to C are higher than the correlations we observe for the other pairs of diets (Additional file 4). This also held true when comparing the proportions of genes that trend in the same direction for bodies and heads. In ovaries, the highest proportion trending in the same direction was observed for HS and C relative to DR (Additional file 4). Third, we performed 100 permutations of our expression data randomizing across the diets but constraining this to two randomly selected samples from each diet to ensure we obtained null datasets with no expectation of a diet effect and calculated pairwise fold changes, which allowed us to calculate empirical p-values (see Methods for details; Additional file 1). Only the comparison between DR and HS showed significant relationships, with no other comparison yielding a p-value less than 0.1 for either the correlation or the proportion trending in the same direction (Additional file 4. For heads, the proportion trending in the same direction is significantly greater than expected by chance (empirical p = 0.01). For ovaries, the correlation is significantly greater (empirical p = 0.04) and for bodies, the correlation is marginally significant (empirical p = 0.08). This general trend suggests a similar change in global transcription pattern in response to both the DR and HS diets relative to the C diet, despite their very different compositions by weight and subsequently their caloric content. Further, the 2475 DEGs for the main treatment effect were distributed across all diet-tissue combinations (Fig. 5), making it challenging to narrow down to a smaller list of genes for further examination.

Fig. 3
figure 3

Comparison between DR and HS fold changes. Horizontal and vertical lines at 0 show when gene expression in the two diets is the same relative to the C diet. Diagonal dashed line is the 1:1 line. Points in the quadrants above 0 for one diet and below 0 for the other are genes that trend in different directions in the HS vs. DR diet relative to C (top-left and bottom-right). Points falling above the 1:1 line in the top-right quadrant and below the 1:1 line in the bottom-left quadrant show a similar effect in the HS diet as in the DR diet. Points are colored according to their mean expression. Labels a., b., and c., correspond to tissues: bodies, heads and ovaries, respectively

Fig. 4
figure 4

PCA plots on each tissue performed separately, showing the pattern in which diet treatments cluster. Different colors denote different diets and different shapes correspond to the different tissues: (a) bodies, (b) heads, and (c) ovaries

Fig. 5
figure 5

Volcano plots (a-i) for differentially expressed genes showing genes with large fold changes that are also statistically significant. Horizontal lines indicate -log10(Padj.) = 0.05, and points above the line represent genes with statistically significant differential expression. Vertical lines differential expression with the value of log2 fold change of 1 (i.e. absolute fold change = 2) and FDR = 0.05. Upregulated and downregulated genes are on the right side and left side of the vertical lines, respectively, and statistically significant genes are above horizontal lines. Rows in the panel top to bottom are bodies, heads, and ovaries; columns left to right are DR vs C, HS vs C, DR, vs HS; color of points represent log10 of base mean expression

Gene set enrichment analysis

We performed gene set enrichment analysis (GSEA) on the significantly differentially expressed genes (i.e. 2475 DEGs) for the main effect of diet, using the fold changes for each diet-tissue combination to identify pathways and gene sets which were significantly perturbed relative to all DEGs in the model. Of these pairwise comparisons, only DR versus HS in bodies and DR versus C in bodies showed evidence for significantly enriched gene sets/pathways at an FDR Padj. < 0.05 (Benjamini & Hochberg procedure). We identified four pathways showing gene set level changes for bodies in DR relative to HS: Metabolic pathways (two-sample t-test, mean change = 5.38, FDR = 2.94e− 06), Carbon metabolism (two-sample t-test, mean change = 3.31, FDR = 2.26e− 02), Oxidative phosphorylation (two-sample t-test, mean change = 2.95, FDR = 4.52e− 02), and Protein processing in endoplasmic reticulum (two-sample t-test, mean change = 2.83, FDR = 4.52e− 02, Additional file 1). Notably, metabolic pathways (dme01100), which was most significantly enriched, is a large group of pathways in the KEGG database ( At the default threshold (FDR Padj. < 0.1) in GAGE, ten more pathways appeared for DR relative to HS in bodies (Additional file 5). These additional pathways encompass three main metabolic themes: carbohydrate, amino acid and protein, and drug/xenobiotics. For the comparison of DR vs C in bodies, oxidative phosphorylation (dme00190) was significantly enriched (two-sample t-test, mean change = 3.2, FDR Padj. = 7.36e− 02).

Further, we examined GO term gene set enrichment for biological process (BP) to capture significant diet-related differences occurring below the level of pathway. Four terms were enriched at an FDR Padj < 0.01. Small molecule metabolic process was enriched for the DR vs HS comparison in bodies (mean change = 4.49; Padj = 5.84e− 3). Cell communication (mean change = 5.10; Padj = 1.83e− 4), signaling (mean change = 5.06; Padj = 1.83e− 4), and signal transduction (mean change = 4.56; Padj = 1.37e− 3) were all enriched for the HS vs C comparison in heads. At an FDR Padj. < 0.05, 41 unique enriched terms were observed, of these, 34 terms were enriched for HS relative to C diet in heads (Additional file 5). These terms highlighted a broad range of themes including signaling, metabolism, growth, cytoskeleton, gene expression and development. Three terms were enriched for HS relative to C in bodies, including cell communication, signaling, and system process. The remaining six terms were all for the HS diet relative to DR in bodies, all within one theme of metabolism (acid, small molecule, carbohydrate). No terms were enriched for the comparisons in ovaries. To understand broader inclusive processes represented by these GO terms, we evaluated our list for ancestral terms using QuickGO (EMBL-EBI Nine ancestral terms at the same hierarchical level immediately below category BP were observed (metabolic process, biological regulation, cellular process, localization, response to stimulus, cellular component organization, multicellular organismal process, growth, and developmental process). Among these, metabolic process, cellular process, and developmental process had the most connections to child terms. Our GSEA analysis therefore highlights multiple pathways and biological processes triggered by diet changes, especially in bodies and heads, and encompassing broad themes from metabolism to signaling to homeostasis, but none of the canonical nutrient sensing pathways such as IIS/TOR and FOXO signaling pathways. Notably, our results do not show particular enrichment of diet-associated terms in ovaries, at least for biological processes.

Diet-induced gene coexpression

Next, we asked how diet treatment affected the correlation patterns among genes (i.e. co-expression) across samples. To identify sets of genes that are highly correlated in their expression patterns (or modules), we performed hierarchical clustering on a batch-controlled, rlog transformed expression data including all replicate samples over all treatments using WGCNA [74]. We first computed a matrix of pairwise correlations for all genes on which we performed hierarchical clustering to produce module assignments. We then used a resampling procedure to determine if genes were correctly assigned to modules (see Methods for details and literature). Setting the minimum module size to 30 genes, a total of 31 modules were detected (range gene number 39–3240), with 1049 unassigned genes (grey module). After merging highly similar modules (i.e. eigengene correlation r > 0.9, see methods), 21 modules were identified with an additional module holding all unassigned genes (Additional file 5).

To appreciate module-level effects of diet and tissue on coexpression, we visualized eigengene expression across diets (Fig. 6, Additional file 6). It is clear from these plots that some modules showed greater diet by tissue interaction effects than others (e.g. e, f, m, q, s and v). These modules show either reduced or increased expression for one or two tissues in one or two diets. To gain better insight into these intra-modular effects of diet and diet-tissue interaction, we fit an analysis of variance model (ANOVA) to module eigengenes. For the main effect of diet, all modules turned up significant (FDR Padj. < 0.05), except modules c (Fig. 6). Similarly, for the effect of the interaction between diet and tissue, all modules showed a significant effect (FDR Padj. < 0.05), except module a.

Fig. 6
figure 6

Eigenegene expression across diet treatment for each module (a-v) identified in hierarchical clustering. Color scheme represents the three tissues; each filled circle represents a sample; the open triangle marks the mean eigene expression for a given diet in a given tissue. In all cases except those noted on (a) and (c), the main effect of treatment and the tissue by treatment interaction are significant

Focusing on the modules showing a statistically significant interaction effect, and divergent expression profiles in one or more diets for a given tissue (), several distinct patterns became apparent: 1) generally reduced expression in the DR diet for ovaries and bodies unlike the rest of diets (Fig. 6e, f, k and s), 2) increased expression in the DR diet for ovaries and bodies (i, m), 3) elevated expression in bodies in both DR and HS diets (v), and 4) different responses in all three diets (g, r). An attempt to isolate specific diet-tissue combinations driving the interaction effect using post hoc tests revealed large numbers of highly significant combinations. We therefore explored the modules further via functional enrichment to identify the processes driving these coexpression patterns.

We conducted functional analysis on all modules to identify enriched GO terms (Bonferroni corrected enrichment P values, Additional file 7). Of 12,614 Entrez identifiers in our experiment, 10,334 mapped in current GO categories (see methods), and therefore used as a background list for enrichment analysis in WGCNA. A large number of terms were obtained across CC, MF and BP categories: 658 terms (P < 0.01), and 791 terms (Bonferroni corrected P < 0.05) (Additional file 7). A visual inspection of enriched terms in the 21 robustly assigned modules confirmed a large diversity of highly significantly enriched biological processes in most modules, ranging from nuclear processes to membrane and cytosolic processes; from structural to signaling and immune response processes; and from pigmentation to homeostatic processes (Additional file 7).

The first module (Fig. 6a) which included 2956 showed 291 GO terms (Bonferroni corrected, Padj. < 0.01), and had the most significantly enriched terms (i.e. > 60 terms ranged between Padj. < e− 156 - < e− 15). This module was characterized by greater eigengene expression in ovaries compared to heads and bodies, although the diet effect was subtle but significant. Nuclear and intracellular organelle processes including gene expression, and RNA processing were key tissue (ANOVA, P < 2e-16) and diet (ANOVA, P < 0.002) effects independently regulated (i.e. no interaction effect). With reference to the trends described above (Fig. 6), those modules showing generally reduced expression in the DR diet for ovaries and bodies (e, f, k and s), are associated with biological processes including signaling (e, Padj. < 1.1e− 10), cellular component organization (k, Padj. <5.8e− 09), nervous system development (f, Padj. <1.3e− 14), signaling and protein localization on Golgi apparatus (s, Padj. <3.0e− 06). Interestingly, expression increase in DR in bodies and heads compared to ovaries is related to ubiquitin-dependent proteolytic processes in the proteasome (i, Padj. <1.8e− 08), and cytosolic vesicle transport/mitochondrial activities (m, Padj. <8.9e− 156). Module (v, Padj. <1.1e− 21) was interesting because bodies show monotonic increase in expression from C to DR to HS, a trend that may relate to the GO term chitin-based cuticle structure development (Padj. = 5.78e− 30), indicating cuticular remodeling in stressful diets (DR and HS), presumably to accommodate gain or loss of body mass.

Analysis of our modules therefore revealed a large number of biological processes (BP), molecular function (MF) and cellular components (CC) (Additional file 7), suggesting that response to diet changes in natural D. melanogaster involves a multi-system response rather than one or a few signaling pathways that can be very different in different tissues.

Previously implicated pathways

Several pathways have been well-studied in the context of diet-induced phenotypic changes (see Background). We specifically examined these pathways in order to characterize the transcriptional response. We obtained a precomputed list of genes for key metabolic pathways (fbgn_annotation_ID_fb_2019_02.tsv): IIS (55 genes), TOR (152 genes), and FOXO (110 genes, which includes the sirtuins). Of 317 pathway genes, 47 (14.8%) were significantly differentially expressed for the diet effect (Additional file 8), representing fewer pathway genes compared to 2475 of 12,614 of all genes, genome-wide (i.e. 19.6%). However, our GSEA results presented above did not show pathway level enrichment of any of these pathways as defined in KEGG Pathway Database (, although as mentioned above, a handful of member genes were differentially expressed. These genes that were differentially expressed however, showed mixed modular membership in our clustering results. Similarly, GO analysis on module genes did not enrich categories specifically including ‘insulin’, ‘TOR’ and ‘FoxO’ in the term. Figure 7 shows pattern of expression across diets, of the insulin peptide and sirtuin genes that are widely cited to act regulate IIS/TOR and AMPK. From these results we could not link any of these pathways with particular modules within our data. Importantly, we did not find evidence for significant whole pathway perturbation of IIS/TOR and the downstream FOXO signaling in our co-expressed modules.

Fig. 7
figure 7

Individual gene plots (a-m) representing a sample of key genes hypothesized to underlie canonical nutrient sensing pathways, particularly the IIS/TOR and sirtuin deacetylases which are thought to regulate growth, reproduction, lifespan and stress resistance in many species including D. melanogaster. Ilp - insulin peptide gene, Sirt - sirtuin gene. There are 8 insulin peptides and 5 sirtuins known in D. melanogaster

Parsing previously identified QTL for the response to diet

A useful application of genome-wide expression data is to identify possible regulatory variants underlying QTL. In a previous study, we used a multivariate approach to identify a global expression QTL for the response to diet treatment of 52 genes in the IIS/TOR pathways [64] that we hypothesized was influencing the expression of many of the genes in these pathways. After performing a discriminant function analysis predicting diet (DR or C) from expression measured on whole flies of 52 genes in the IIS/TOR pathways, we mapped the difference in discriminant function to identify this eQTL. The eQTL interval, as defined by the Bayesian credible interval, contains 327 genes, making it challenging to narrow to possible candidates. We therefore searched for differentially expressed genes identified in this study that fall within this eQTL region of interest. Of these, we find 49 genes are differentially expressed in the different diet treatments and 13 show a diet by tissue interaction, with 5 of these genes showing both a main treatment effect and an interaction effect (i.e. Odc1, Dgat2, CG12822, CG12159, and Obp44a, Additional file 9). The patterns of expression across tissues for this set of genes is shown in Fig. 8. While our expression results don’t allow us to narrow to a single candidate, they do significantly reduce the list and provide detailed expression data for the possible causal genes.

Fig. 8
figure 8

Differentially expressed genes under a previously identified eQTL interval [64]. The patterns of expression of the genes within the Bayesian credible interval that are differentially expressed in different diet treatments in this study are shown here


We sequenced pooled RNA samples from a three-diet by three-tissue by six-replicate experiment of outbred mated female D. melanogaster in the DSPR. Our aim was to understand diet-induced patterns of gene expression influencing plastic nutrient allocation in different diet conditions in a multiparent population resource. Our results suggest that: 1) global expression patterns are dominated by tissue and diet-tissue interaction effects, while the effects of diet alone are subtle but significant, 2) patterns of gene expression are generally similar in low-protein and carbohydrate-rich diets relative to the control diet, 3) multiple pathways, co-expressed gene modules, and biological processes are invoked that affect transcription in different diet conditions, especially in the head and body tissues, and 4) expression results did not suggest a single regulatory variant underlying QTL, but narrowed down to a few possible causal genes. Overall, our results suggest that multiple networks are involved in phenotypic changes in response to nutrient availability, rather than just a few key genes. We advocate a broader, genome-wide approach to studying the genetic mechanisms underlying diet effects on phenotypic change.

It is established that nutrient signaling pathways and hub genes in those pathways play a crucial role in how organisms adjust to changing conditions in availability and quality of nutrients to optimize fitness traits [50,51,52]. Analysis of differential gene expression in a population presented with diets differing in nutritional richness provides an ‘omic’ alternative to study intermediate processes that connect genetic architectures to phenotypic outcomes such as allocation patterns. In fruit flies, studies typically focus on whole-body or head tissue transcription (e.g. [75]); one or a few gene pathways known to affect diet responses at a time (e.g. [64]); one or two diet manipulations (e.g. [75,76,77]), but scarcely integrate over more than two organs and conditions at a time, or explore expression outside a few gene pathways. Further, despite costs trending downwards recently, sequencing of more highly replicated experiments remains unaffordable for many laboratories. By and large, studies in model organisms focus on genes in a few endocrine pathways, so called nutrient sensors, as critical players in coordinating growth, reproduction, stress resistance and somatic maintenance responses to changing diet conditions. Components of the IIS/TOR, growth, and ecdysone hormones; and sirtuin deacetylases are deemed some of the major players in this respect. Our results suggest an expanded scope of mechanisms underlying flexible responses to nutrient limitation (DR studies) or oversupply (high sugar and high fat diet studies) in natural populations, which has also been suggested by [78,79,80] and reviewed by [58, 81]. We discuss our major results and their implications below.

First, we observed a large global effect of tissue type and a more subtle, but significant, effect of diet treatment. Previous studies in flies have also found relatively small effects of diet on transcription. Previously, we characterized the genetic basis of standing genetic variation for 55 genes of the IIS/TOR pathway following treatment with the same diets used here [64], and found only small changes in gene expression associated with diet treatment although most of those genes were differentially expressed. Similarly, Reed et al., [82] measured transcriptional and metabolomic changes for 20 inbred lines (North Carolina and Maine population) of D. melanogaster treated with four diets varying in sugar and fat content and observed a small dietary component in gene expression profiles, with much larger contributions of genotype by diet interactions. Musselman et al. [83] investigated expression differences in D. melanogaster fed with two different forms of sugar and found small but significant changes in gene expression. Overall, diet seems to produce fairly small magnitude changes in expression in many genes across the genome, which in concert presumably can lead to large phenotypic changes.

Secondly, comparisons between DR and HS diets relative to C revealed a similar pattern of expression. This result mirrors our earlier finding in an eQTL mapping experiment using DSPR lines in which gene expression in DR and HS relative to C generally trended in the same direction [64] . This result is in spite of the fact that the DR and HS diets lead to very different outcomes in median lifespan and fecundity in our population [64, 84]. Nutritional geometry studies which measure traits in a series of concentrations of liquid media suggest that traits such as lifespan and reproduction (which differ significantly across our diets) are influenced primarily by the diet protein to carbon (P:C) ratio, not its caloric content (e.g., [8, 85, 86]. Thus, calorie limitation alone does not drive phenotypic patterns in these studies. While our HS diet has a high concentration of sugar per liter of food, the P:C ratio is a lot lower (i.e. ~1:2 yeast to sugar ratio). Lifespan and fecundity in nutritional geometry studies are maximized at a much higher P:C ratio (i.e. 2- to 4-fold higher than our HS diet) [85, 86]. Thus, our results are consistent with similarity in expression pattern between DR and HS. It is also possible that diet macromolecules serve only as a cue that lead to optimal allocation of resources in natural populations. Thus, it is possible for expression measures to trend in the same direction in DR and HS treatments. Further, Dobson et al. [87] found that excess sugar diets in young adult flies inhibited foxo and reduced survival in middle and old age. While we did not measure lifespan and fecundity here, our data showed mixed pattern of foxo activation across diet tissue combinations, and these mapped to many co-expression modules which may be due to the relatively young age (i.e. 21 days) of our experimental flies.

Thirdly, an overall take away from our GSEA on the full dataset, and also on gene modules coming from hierarchical clustering was that diet effects could not be attributed to a particular genetic mechanism. GSEA highlighted metabolism, oxidative phosphorylation and protein processing at pathway level, but showed a broad spectrum of processes in GO term enrichment encompassing metabolism, cell signaling, structural development and organization, and defence. Similarly, GO analysis of gene modules yielded a broad range of biological processes. From both these analyses, IIS, TOR, and FOXO signaling were not significantly enriched. However, several genes had significantly reduced expression in the IIS/TOR (e.g. Ilp5, Rheb, Atg1, Myc, and eIF4E1) and significantly higher expression in FoxO downstream effectors (e.g. AMPK, orct2, Gadd45, cdk2 and p38) in most DR-tissue combinations consistent with indicating canonical activity. With the exception of Ilp5 [41] however, undetectable differential expression of hub genes (such as Ilp2, S6K, chico, InR, Akt1, Torc, Thor, and foxo) suggest that diet induced effects may involve many more pathways/genes than have been traditionally studied in this context. This argument is further strengthened in this study by relatively fewer genes belonging to nutrient sensing pathways, (IIS, TOR and AMPK) (i.e. 14.8%) compared to genes differentially expressed for the effect of diet (i.e. 19.6%). However, this last point should be interpreted with caution because sets of genes defined in different databases as belonging to these pathways, and their relative importance in those pathways may differ and we may not have included all genes.

Evidence in C. elegans suggest that the worm ortholog of foxo, daf-16 is not required in the DR response [88,89,90,91]. On the other hand, sir-2.1 is a worm ortholog of the fruit fly Sirt 2 (which was significantly differentially expressed in this study), and is required for lifespan extension in adult worms by diet deprivation was independent of the daf-2/insulin-like signaling [91]. In D. melanogaster, similar evidence is emerging that suggest that foxo is not required for the response to DR [92], but is involved in the normal response to DR [93]. When dFOXO was removed, DR treatment still resulted in significant lifespan extension in null flies [92]. Another study testing a novel DR assay in C. elegans found that DAF-16, but not DAF-2 (the worm lnR) was required when DR was performed on solid media, and concluded that AMPK-FOXO signaling resulted in lifespan extension on solid food [94]. Our data provides further evidence supporting these studies in suggesting a broader mechanism in which IIS, TOR and FOXO play a role, but in concert with other pathways.

A potential limitation of our study is the heterogeneity in tissue types present in our samples, which may affect the level and nature of gene expression [95]. For instance, assuming fewer cell types are available in the ovaries or head samples compared to body samples, we could expect the range of biological processes triggered by nutrient levels to scale with cell types to some degree. In addition, our use of an outbred population also means that our samples encompass heterogeneity stemming from genetic variation for the response to diet, and averaging over this heterogeneity might obscure some patterns. Ideally, to fully capture the genetic variation for the transcriptional response to diet, one would perform RNA-seq separately in hundreds of the DSPR lines reared in multiple diets, but this would obviously be a much larger experiment requiring a much larger investment of experimental resources. Further, DR and HS protocols vary tremendously across laboratories which can result in different studies detecting only subsets of the gene network distribution which responds to nutrient change [92]. Patterns of phenotypic expression (fecundity and lifespan) and patterns of gene expression have held stable across several studies using the same set of diets in our population, suggesting that the effects we find are biologically relevant in the DSPR. However, in addition to a broader view of the potential mechanisms causing phenotypic changes in response to diet, a broader consideration of different diets would also benefit the field.


We have characterized the genome-wide transcriptional response to diet composition in multiple tissues in D. melanogaster, providing a comprehensive picture of potential genetic mechanisms underlying phenotypic changes. We found that the general pattern of expression was similar in DR and HS diets relative to C diet, probably reflecting specific nutrient ratios. In addition, we identified a large set of co-expression networks, pathways and gene ontology terms that were enriched in response to diet. Our data did not show enrichment of canonical nutrient sensing pathways and key genes, although some genes in those pathways were significantly perturbed. Our results therefore support the hypothesis that in natural populations, multiple gene networks and pathways are invoked to respond to environmental differences in nutrient availability, and not just a few pathways as it is often assumed. Our study has potential implications for future studies focusing on the effects of stressful diets in natural populations, including our own. We therefore urge future studies to look beyond traditional genetic mechanisms governing diet effects and to move beyond the use of just a single genotype when characterizing these responses.


Experimental population

We used 835 recombinant inbred lines (RILs) of the B sub-population of the DSPR as source of our experimental population. Five young females that had mated intra-line were randomly selected from each RIL and placed in six custom-made cages each measuring 20.3 cm × 20.3 cm × 20.3 cm. Eggs were collected (twice on successive days) after 22 h of oviposition in milk bottles (250 ml) containing 40 ml media. Resulting flies were introduced to six cages in large population sizes (i.e. > 2000 per cage) and allowed to mate for five generations (21-day cycle). To ensure a genetically homogenous base population was generated, eggs from each cage equally seeded each of the 5 other cages at each generation. We refer to these six cages as the base population. All life stages during generation of the base population were reared on a cornmeal-dextrose-yeast maintenance diet. Proportions of ingredients in our maintenance diet are presented in Additional file 10. Additional culture practices including equipment and supplies are described in [96].

Study design

We used one random cage from the base population for phenotyping to generate RNA-seq data we present here. This population is referred to as the “synthetic population”. We adopted a factorial design comprising three dietary treatments, three tissues, and six replicates per each treatment-tissue combination. To obtain flies for each diet treatment, we placed two plates (100 mm × 15 mm, Fisherbrand®) of maintenance diet in the synthetic population and allowed for 24 h of oviposition, repeating egg collection three times. From the two egg plates, a thin slice of media bearing 50–90 eggs (visually estimated) was cut out and grafted to each of 60 new vials (25 mm × 95 mm, Polystyrene Reload, cat. no. 32—109RL, Genesee Scientific, USA) of maintenance food containing 10 ml media. At 12 days post-oviposition, eclosed flies were released into 9 cages, 3 cages for each of three diet treatments, such that treatments are equally represented across egg collection dates. For each cage, we used 20 vials to seed the cage with adult flies. Flies were held on treatment diet for 10 days, and provided with new food every 2–3 days. Each replicate was started on a new week to provide for time that would be needed to dissect a large number of flies per replicate later on. All flies were reared in a growth chamber at 23 °C, ≥50% relative humidity, and a 24:0 light:dark cycle, which are the typical maintenance conditions for the DSPR flies. Details of our study design are schematically shown in Fig. 1.

Experimental diets

We extracted RNA for sequencing from 22 day-old (post-oviposition) female flies held on three experimental diets adapted from Bass et al, [97] and Skorupa et al, [8]: C, DR, and HS for 10 days, as described above. These are all cornmeal-sucrose-yeast diets which differ in two ways: relative to C, DR contains 50% less yeast, and HS contains about seven times sucrose (Additional file 10). We have used these diets in past studies to map expression QTL in the DSPR RILs [64], and to estimate quantitative genetic parameters in an outbred DSPR population [84]. To prevent desiccation and preserve quality, diets were covered with multipurpose sealing wrap (Press’n Seal®) sealed and stored at 4 °C and used within 2 weeks of preparation. To prevent food degradation, individuals were moved to vials with fresh food three times per week.

Sample preparation and sequencing

At day 22, we collected 100–104 females per diet treatment under mild CO2 anesthesia. We replicated this process six times over 6 days, thus creating six same-age sample batches with each batch including all three treatments (Additional file 11). This yielded assays of at least 100 females per treatment diet (HS, C and DR), three tissues within each diet (heads, ovaries and bodies), biologically replicated 6 times. Our experiment therefore comprised 54 samples (18 per diet). We immediately dissected tissues from these females and promptly flash-froze them in liquid nitrogen and held them in a closed styrofoam box on dry ice before storage at − 80 °C. Precision scissors (RS-5604, Roboz Surgical Instrument Co., Inc.) were used for fly dissection in a bath of pH 7.4, 1% PBS (Life Technologies™) containing 2 drops (80–100 μl) Triton × 100 (SIGMA) under dissecting stereoscopes (Leica S6E and Leica M216) in 100 mm × 15 mm polystyrene petri dishes (Fisherbrand®). Scissors and forceps were rinsed with 70% ethanol and wiped dry with Kimwipes between samples. To minimize time-of-day effects, dissection was done across treatments (e.g. [C, HS, DR], [DR, HS, C], etc) rather than one treatment at a time (e.g. [C, C, C], [HS, HS, HS], etc).

We purified whole RNA from each of 54 samples using a protocol modified from Life Technologies TRIzol RNA extraction protocol. Tissues were homogenized in a tissue lyser using steel beads in 1 ml TRIzol reagent, and subsequent RNA extraction by following the TRIzol protocol. RNA quality was evaluated on a Nanodrop 2000 spectrophotometer (Thermo Scientific) and concentration on a Qubit 2.0 fluorometer (Invitrogen, Life Technologies). An additional clean-up step with a QIAGEN RNeasy Plus Mini Kit (Hilden, Germany) was used for gDNA elimination following manufacturer’s protocol to achieve high quality RNA for library preparation. After the RNA cleaning step, yields typically ranged between 400 and 800 ng/μl; and absorbance values (260/280 and 260/230) above 2.0 (Additional file 12). Fifty-four libraries were each prepared from 1.5 μg total mRNA, RNA integrity (RIN) ≥ 8.0 using Illumina’s TruSeq Stranded mRNA (poly A enrichment) and single-end read sequencing on Illumina NextSeq 500. Barcoded libraries were combined in a single 54-plex library, which we sequenced on three lanes of a NextSeq 1 × 75 bp run for an average of 23 million reads per sample. The resulting reads were trimmed of adaptor sequences and FASTQC was used to assess quality. Sequencing was performed at the University of Missouri DNA Core Facility.

Data analysis

Read alignment and quantification of expression

We employed the ‘new Tuxedo’ suite analysis pipeline [73] for read assembly and transcript quantification. We mapped single-end reads to the reference genome of D. melanogaster, bdgp6_tran (, updated March, 2016,), using HISAT2 (version 2.1.0, Kim et al, [98]. Alignments were converted to BAM file format and runs combined using SAMtools [99]. Then, StringTie [73, 100] was used to assemble RNA-seq alignment into full-length transcripts, and to quantify levels transcript expression.

We intended our downstream analysis to focus on gene-level differential expression of known genes in the D. melanogaster transcriptome. Therefore we used the alternative workflow given by Pertea et al. [73] skipping steps 3–5 in that protocol, (i.e. we skipped the individual assembly of each sample and the merge step, documented here, last accessed May 20, 2019). To do this, we ran ‘stringtie -eB’ directly on the output of ‘samtools -sort’. In order to extract more gene (FBgn) ids from reference gene annotations into StringTie output, we ran a Perl post-processing script, ‘’ (Pertea, that appends reference gene ids to the MSTRG gene ids used in StringTie. In order to perform differential expression analysis on row counts using DESeq2 [101], we processed the output from StringTie with a Python script ‘’ (Pertea M, with the -e parameter to extract read count matrices (one for transcripts, one for genes) directly from the files generated by StringTie.

Controlling for batch effects

As described earlier, our samples were processed in six groups on separate days (all treatments and tissues represented equally in each) including setup, dissection, and RNA extraction and preparation (Additional file 11). We sought to control for this obvious batch effect and any unknown underlying batch effects. We therefore used surrogate variable analysis (SVA, [102] implemented in the R library svaseq [103]. We used the DEseq2 package [101] to obtain counts that are normalized for library size (i.e. counts divided by size factors). Eliminating genes with zero counts, we used the svaseq function comparing the full model with a known batch to a null model with batch only. A single surrogate variable (SV) was identified and included in all subsequent models.

Differential gene expression

Next, we estimated differential gene expression with DESeq2 for DR and HS treatment conditions relative to the C treatment. We fit two generalized linear models (GLM), with parameter fitType = ‘local’ and only included genes with at least 10 reads in at least 2 samples. We compared the following GLMs:

  1. (1)

    Expression ~ SV + batch + tissue

  2. (2)

    Expression ~ SV + batch + treatment

  3. (3)

    Expression ~ SV + batch + tissue + treatment

  4. (4)

    Expression ~ SV + batch + tissue + treatment + tissue*treatment

where SV is the surrogate variable identified earlier. For all tests, we used a likelihood ratio test, comparing a more complex model [101] with a reduced model in the following way: 1) main effect of tissue: model 3 vs. model 2, 2) main effect of treatment: model 3 vs. model 1, 3) treatment by tissue interaction: model 4 vs. model 3. From these model comparisons, we identified sets of significantly differentially expressed genes using an FDR threshold of 0.05. To visualize and rank the genes we used the function lfcShrink, which performs shrinkage on log2(Fold Changes), which have been shown to produce better estimates. All log2(Fold Changes) reported here are the shrinkage estimated values using the “normal” estimator [101]. To identify global patterns of expression across diets and tissues, we performed principal components analysis (PCA) on expression values before and after removal of batch effects. We also performed PCAs for each tissue separately to better identify the global diet effects and the relationships between each diet.

Interpreting fold changes can be challenging when trying to compare the diet treatments because they are essentially a ratio. Thus, when comparing two-fold changes that are both calculated with the same diet as the reference diet (in our case the C diet), we expect to see a positive relationship even for randomly generated data, making it challenging to interpret these patterns. To determine if our observed patterns were significantly different than those expected by chance, we performed 100 permutations of our dataset to eliminate the true diet effect. For each permutation, we constructed each of our three diet categories by randomly choosing two samples from each diet, thereby eliminating the diet effect and allowing us to obtain a set of null fold changes for each diet pair. We performed the permutation with a random set of two from each diet instead of simply permuting across all samples because with only 6 samples per diet in each tissue, it is not unlikely to randomly construct a treatment that contains samples from mostly one diet. For each permuted dataset, we calculated fold changes as described above. We then calculated the correlation between each pair of fold changes (e.g. fold change in HS relative to C vs. fold change in DR relative to C) and the proportion of genes that changed in the same direction in each pair (i.e. number upregulated in both + number downregulated in both/total number of genes). We then compared these values to our observed data to calculate empirical p-values [104].

Gene set enrichment in relation to diet

We considered our list of DEGs for the effect of diet treatment and used the R Bioconductor GAGE package [105] to infer gene sets and pathways that were significantly perturbed relative to all DEGs. Briefly, GAGE takes an expression matrix with log-based fold changes as input. It assumes that a particular gene set or canonical pathway (from literature or database) comes from a different distribution than the experiment-derived background list. Therefore, a two-sample t-test is applied to account for the gene set specific variance and the background variance and derives 1) a global p-value using a meta-test on p-values from gene set comparisons, and 2) a FDR q-value adjustment. Using GAGE to access the Fly annotation database ‘org.’, we generated current KEGG (Kyoto Encyclopedia of Genes and Genomes [106]) pathway gene sets, limiting our search to metabolic and signaling functional annotations. Similarly, we obtained up-to-date gene ontology (GO) gene sets specifying biological processes. We mapped our FlyBase gene (FBgn) IDs to ENTREZ ids, and performed pathway and gene set enrichment, and GO analysis on the resulting gene list within GAGE with the gage() function. We tested for perturbation of each gene set relative to all genes in the experiment by calculating the mean individual statistics (stat.mean) from multiple gene set tests using a two-sample t-test implemented in GAGE as gs.tTest(), and obtain a global p-value from individual p-values. The global p-values were then adjusted for multiple testing using the Benjamini & Hochberg procedure [107], and refer to these as FDR [105].

Gene co-expression analysis

Next, we sought to identify sets of genes (modules) showing high correlation in their pattern of expression across tissues with respect to treatment condition. We used the removeBatchEffect() in the R limma package to correct for known (batch) and identified (SV) effects. We applied a regularized log transformation (rlog) to the batch-corrected matrix to minimize differences between samples for rows with low counts as well as normalize to library size. The rlog transformation is recommended if, as in this study (0.52–1.92), size factors vary widely [101]. The resulting expression matrix was used as input in the R package, Weighted Gene Co-expression Network Analysis, WGCNA, [74] to identify co-expressed gene modules showing similar patterns of expression across tissues and treatments. We built the initial network from samples over all treatments (N = 54) using a signed adjacency matrix with power 23 (i.e. function pickSoftThreshold()) to construct the topological overlap matrix (TOM) from all 12,614 genes (Additional file 1). This power represents the lowest power for which the scale-free topology fit index curve flattens out after reaching a high value of r2 = 0.90. We performed hierarchical clustering of the TOM using the flashClust() function (method = “average”) which implements hclust() clustering more efficiently for large datasets [108]. We used the cutTreeDynamic() function to identify initial the initial set of modules at the following thresholds: cutHeight = 0.95 (default), deepSplit = 2, minClusterDendro = 30, pamRespectsDendro = FALSE. A module eigengene is defined as the first principal component of a given set of co-expressed genes, and can be considered as a representative of the gene expression profiles in that set [74]. By convention, modules are referred to by their color labels, and, the grey module is used by default to contain genes not assigned to any module [109].

We then evaluated module membership using a resampling strategy described in Sikkink et al., [110]. Randomly, we chose four of six replicates for each diet-sample combination to produce 100 new datasets, each with 36 samples. Using the same parameters as in the full dataset, WGCNA was re-run for each new dataset and resulting modules examined for gene membership. First, a resampled module was accepted if it included at least 10% of the genes in the corresponding module in the full dataset. Secondly, a gene was considered to be strongly supported to belong to a module in the full dataset if it appeared in that module in at least 50% of resampled networks. We identified genes that failed to meet these criteria and placed them in the unassigned module. e merged highly correlated modules (r 0.9) during network construction for each resampled dataset to be consistent with how the full dataset was analyzed.

To summarize major patterns of within-module expression with respect to diet condition, we extracted the first principal component of expression of genes in each module, called the module eigengene. We then performed an ANOVA for each module eigengenes to assess the effects of diet, tissue, and diet-tissue interactions on total module expression. Because all modules had a significant effect of either diet or diet-tissue interaction, we examined module enrichment for diet-related functional annotations. We therefore accessed Bioconductor annotation libraries AnnotationDbi and org. using the GOenrichmentAnalysis() function within WGCNA, and calculated the Fisher’s Exact test with the Bonferroni correction to identify significantly enriched GO terms in each module, providing all genes available in our experiment as a background list. We reviewed all terms significantly enriched (Padj. < 0.01) for the BP category (to view terms for CC, MF, see Additional file 7). For this discussion, we restricted the analysis to the BP category to focus only on biological function and not biochemical activities (MF) or subcellular location where a gene product is active (CC), at the same time reducing the number of tests for enrichment as a way to limit the number of terms for interpretation.

Availability of data and materials

Trimmed read data are available in the Sequence Read Archive (SRA) database ( under the accession number PRJNA557551. Sample data, scripts and intermediate files are available here



Adenosine monophosphate-activated protein kinase


Analysis of variance


Biological process




Cellular component


Dietary restriction


Drosophila Synthetic Population Resources


Expression quantitative trait locus


False discovery rate


Generally applicable gene-set enrichment for pathway analysis


Gene ontology


Gene set enrichment analysis


High sugar




Kyoto Encyclopedia of Genes and Genomes


Likelihood ratio test


Molecular function


Messenger ribonucleic acid


Principal components analysis


Surrogate variable analysis


Topological overlap matrix


Target of rapamycin


Weighted Gene Co-expression Network Analysis


  1. Wood JG, Rogina B, Lavu S, Howitz K, Helfand SL, Tatar M, et al. Sirtuin activators mimic caloric restriction and delay ageing in metazoans. Nature. 2004;430:686–9.

    Article  CAS  PubMed  Google Scholar 

  2. Sohal R, Weindruch R. Oxidative stress, caloric restriction, and aging. Science. 1996;;273/5271/59.

  3. Shanley DP, Kirkwood TB. Calorie restriction and aging: a life-history analysis. Evolution. 2000;54:740–50.

    Article  CAS  PubMed  Google Scholar 

  4. Sinclair DA. Toward a unified theory of caloric restriction and longevity regulation. Mech Ageing Dev. 2005;126:987–1002.

    Article  CAS  PubMed  Google Scholar 

  5. Neel JV. Diabetes mellitus: a “thrifty” genotype rendered detrimental by “progress”? Am J Hum Genet. 1962;14:353–62.

    CAS  PubMed  PubMed Central  Google Scholar 

  6. Wells JCK. Thrift: a guide to thrifty genes, thrifty phenotypes and thrifty norms. Int J Obes. 2009;33:1331–8.

    Article  CAS  Google Scholar 

  7. May CM, van den Heuvel J, Doroszuk A, Hoedjes KM, Flatt T, Zwaan BJ. Adaptation to developmental diet influences the response to selection on age at reproduction in the fruit fly. J Evol Biol. 2019;32:425–37.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Skorupa DA, Dervisefendic A, Zwiener J, Pletcher SD. Dietary composition specifies consumption, obesity, and lifespan in Drosophila melanogaster. Aging Cell. 2008;7:478–90.

    Article  CAS  PubMed  Google Scholar 

  9. Musselman LP, Fink JL, Narzinski K, Ramachandran PV, Hathiramani SS, Cagan RL, et al. A high-sugar diet produces obesity and insulin resistance in wild-type Drosophila. Dis Model Mech. 2011;4:842–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Na J, Musselman LP, Pendse J, Baranski TJ, Bodmer R, Ocorr K, et al. A Drosophila model of high sugar diet-induced cardiomyopathy. PLoS Genet. 2013;9:e1003175.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Tatar M, Post S, Yu K. Nutrient control of Drosophila longevity. Trends Endocrinol Metab. 2014;25(10):509-17.

    Article  CAS  Google Scholar 

  12. Kapahi P, Zid BM, Harper T, Koslover D, Sapin V, Benzer S. Regulation of lifespan in Drosophila by modulation of genes in the TOR signaling pathway. Curr Biol. 2004;14:885–90.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Bolukbasi E, Khericha M, Regan JC, Ivanov DK, Adcott J, Dyson MC, et al. Intestinal fork head regulates nutrient absorption and promotes longevity. Cell Rep. 2017;21:641–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Essers P, Tain LS, Nespital T, Goncalves J, Froehlich J, Partridge L. Reduced insulin/insulin-like growth factor signaling decreases translation in Drosophila and mice. Sci Rep. 2016;6:30290.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Giannakou ME, Partridge L. Role of insulin-like signalling in Drosophila lifespan. Trends Biochem Sci. 2007;32:180–8.

    Article  CAS  PubMed  Google Scholar 

  16. Kaletsky R, Murphy CT. The role of insulin/IGF-like signaling in C. elegans longevity and aging. Dis Model Mech. 2010;3(7-8):415-9.

    Article  CAS  Google Scholar 

  17. Teleman AA. Molecular mechanisms of metabolic regulation by insulin in Drosophila. Biochem J. 2010;425:13–26.

    Article  CAS  Google Scholar 

  18. Partridge L, Alic N, Bjedov I, Piper MDW. Ageing in Drosophila: the role of the insulin/Igf and TOR signaling network. EXG. 2011;46:1–6.

    Google Scholar 

  19. Kapahi P, Kaeberlein M, Hansen M. Dietary restriction and lifespan: lessons from invertebrate models. Ageing Res Rev. 2017;39:3–14.

    Article  PubMed  Google Scholar 

  20. Kenyon C, Chang J, Gensch E, Rudner A, Tabtiang R. A C. elegans mutant that lives twice as long as wild type. Nature. 1993;366:461–4.

    Article  CAS  PubMed  Google Scholar 

  21. Tatar M, Kopelman A, Epstein D, Tu MP, Yin CM, Garofalo RS. A mutant Drosophila insulin receptor homolog that extends life-span and impairs neuroendocrine function. Science. 2001;292:107–10.

    Article  CAS  PubMed  Google Scholar 

  22. Clancy DJ, Gems D, Harshman LG, Oldham S, Stocker H, Hafen E, et al. Extension of life-span by loss of CHICO, a Drosophila insulin receptor substrate protein. Science. 2001;292:104–6.

    Article  CAS  PubMed  Google Scholar 

  23. Edgar BA. How flies get their size: genetics meets physiology. Nat Rev Genet. 2006;7:907–16.

    Article  CAS  PubMed  Google Scholar 

  24. Vellai T, Takacs-Vellai K, Zhang Y, Kovacs AL, Orosz L, Müller F. Genetics: influence of TOR kinase on lifespan in C. elegans. Nature. 2003;426:620.

    Article  CAS  PubMed  Google Scholar 

  25. Hansen M, Taubert S, Crawford D, Libina N, Lee S-J, Kenyon C. Lifespan extension by conditions that inhibit translation in Caenorhabditis elegans. Aging Cell. 2007;6:95–110.

    Article  CAS  PubMed  Google Scholar 

  26. Géminard C, Rulifson EJ, Léopold P. Remote control of insulin secretion by fat cells in Drosophila. Cell Metab. 2009;10:199–207.

    Article  PubMed  CAS  Google Scholar 

  27. Britton JS, Lockwood WK, Li L, Cohen SM, Edgar BA. Drosophila’s insulin/PI3-kinase pathway coordinates cellular metabolism with nutritional conditions. Dev Cell. 2002;2:239–49.

    Article  CAS  PubMed  Google Scholar 

  28. Kapahi P, Chen D, Rogers AN, Katewa SD, Li PW-L, Thomas EL, et al. With TOR, less is more: a key role for the conserved nutrient-sensing TOR pathway in aging. Cell Metab. 2010;11:453–65.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Giannakou ME, Goss M, Jünger MA, Hafen E, Leevers SJ, Partridge L. Long-lived Drosophila with overexpressed dFOXO in adult fat body. Science. 2004;305:361.

    Article  CAS  PubMed  Google Scholar 

  30. Hwangbo DS, Gersham B, Tu MP, Palmer M, Tatar M. Drosophila dFOXO controls lifespan and regulates insulin signaling in brain and fat body. Nature. 2004;429:562–6.

    Article  CAS  PubMed  Google Scholar 

  31. Rahman M, Nirala NK, Singh A, Zhu LJ, Taguchi K, Bamba T, et al. Drosophila Sirt2/mammalian SIRT3 deacetylates ATP synthase β and regulates complex V activity. J Cell Biol. 2014;206:289–305.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Wang T, Geng S-L, Guan Y-M, Xu W-H. Deacetylation of metabolic enzymes by Sirt2 modulates pyruvate homeostasis to extend insect lifespan. Aging. 2018;10:1053–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Ng’oma E, Perinchery AM, King EG. How to get the most bang for your buck: the evolution and physiology of nutrition-dependent resource allocation strategies. Proc Biol Sci. 2017;284.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  34. Hardie DG, Ross FA, Hawley SA. AMPK: a nutrient and energy sensor that maintains energy homeostasis. Nat Rev Mol Cell Biol. 2012;13:251–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. French SS, Denise Dearing M, Demas GE. Leptin as a physiological mediator of energetic trade-offs in ecoimmunology: implications for disease. Integr Comp Biol. 2011;51:505–13.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Rajan A, Perrimon N. Drosophila cytokine unpaired 2 regulates physiological homeostasis by remotely controlling insulin secretion. Cell. 2012;151:123–37.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Zera AJ. Intermediary metabolism and life history trade-offs: lipid metabolism in lines of the wing-polymorphic cricket, Gryllus firmus, selected for flight capability vs. early age reproduction. Integr Comp Biol. 2005;45:511–24.

    Article  CAS  PubMed  Google Scholar 

  38. Zhao Z, Zera AJ. Biochemical basis of specialization for dispersal vs. reproduction in a wing-polymorphic cricket: morph-specific metabolism of amino acids. J Insect Physiol. 2006;52:646–58.

    Article  CAS  PubMed  Google Scholar 

  39. Toivonen JM, Partridge L. Endocrine regulation of aging and reproduction in Drosophila. Mol Cell Endocrinol. 2009;299:39–50.

    Article  CAS  PubMed  Google Scholar 

  40. Efeyan A, Comb WC, Sabatini DM. Nutrient-sensing mechanisms and pathways. Nature. 2015;517:302–10.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Min K-J, Yamamoto R, Buch S, Pankratz M, Tatar M. Drosophila lifespan control by dietary restriction independent of insulin-like signaling. Aging Cell. 2008;7:199–206.

    Article  CAS  PubMed  Google Scholar 

  42. Paaby AB, Blacket MJ, Hoffmann AA, Schmidt PS. Identification of a candidate adaptive polymorphism for Drosophila life history by parallel independent clines on two continents. Mol Ecol. 2010;19:760–74.

    Article  CAS  PubMed  Google Scholar 

  43. Paaby AB, Bergland AO, Behrman EL, Schmidt PS. A highly pleiotropic amino acid polymorphism in the Drosophila insulin receptor contributes to life-history adaptation. Evolution. 2014;68:3395–409.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Emlen DJ, Warren IA, Johns A, Dworkin I, Lavine LC. A mechanism of extreme growth and reliable signaling in sexually selected ornaments and weapons. Science. 2012;337:860–4.

    Article  CAS  PubMed  Google Scholar 

  45. Gotoh H, Miyakawa H, Ishikawa A, Ishikawa Y, Sugime Y, Emlen DJ, et al. Developmental link between sex and nutrition; doublesex regulates sex-specific mandible growth via juvenile hormone signaling in stag beetles. PLoS Genet. 2014;10:e1004098.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  46. Gotoh H, Cornette R, Koshikawa S, Okada Y, Lavine LC, Emlen DJ, et al. Juvenile hormone regulates extreme mandible growth in male stag beetles. PLoS One. 2011;6:e21139.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Zera AJ, Harshman LG. Laboratory selection studies of life-history physiology in insects. In: Experimental evolution: concepts, methods, and applications of selection experiments. Berkeley: University of California Press; 2009. p. 236–81.

    Google Scholar 

  48. King EG, Roff DA, Fairbairn DJ. The evolutionary genetics of acquisition and allocation in the wing dimorphic cricket, Gryllus firmus. Evolution. 2011;65:2273–85.

    Article  PubMed  Google Scholar 

  49. King EG, Roff DA, Fairbairn DJ. Trade-off acquisition and allocation in Gryllus firmus: a test of the Y model. J Evol Biol. 2011;24:256–64.

    Article  CAS  PubMed  Google Scholar 

  50. Flatt T, Tu M-P, Tatar M. Hormonal pleiotropy and the juvenile hormone regulation of Drosophila development and life history. Bioessays. 2005;27:999–1010.

    Article  CAS  PubMed  Google Scholar 

  51. Tatar M, Bartke A, Antebi A. The endocrine regulation of aging by insulin-like signals. Science. 2003;299:1346–51.

    Article  CAS  PubMed  Google Scholar 

  52. Zera AJ, Harshman LG, Williams TD. Evolutionary endocrinology: the developing synthesis between endocrinology and evolutionary genetics. Annu Rev Ecol Evol Syst. 2007;38:793–817.

    Article  Google Scholar 

  53. Hughes KA, Reynolds RM. Evolutionary and mechanistic theories of aging. Annu Rev Entomol. 2005;50:421–45.

    Article  CAS  PubMed  Google Scholar 

  54. Remolina SC, Chang PL, Leips J, Nuzhdin SV, Hughes KA. Genomic basis of aging and life-history evolution in Drosophila melanogaster. Evolution. 2012;66:3390–403.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Burke MK, King EG, Shahrestani P, Rose MR, Long AD. Genome-wide association study of extreme longevity in Drosophila melanogaster. Genome Biol Evol. 2014;6:1–11.

    Article  PubMed  Google Scholar 

  56. Magwire MM, Yamamoto A, Carbone MA, Roshina NV, Symonenko AV, Pasyukova EG, et al. Quantitative and molecular genetic analyses of mutations increasing Drosophila life span. PLoS Genet. 2010;6:e1001037.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  57. Rockman MV. The QTN program and the alleles that matter for evolution: all that’s gold does not glitter. Evolution. 2012;66:1–17.

    Article  PubMed  Google Scholar 

  58. Briga M, Verhulst S. What can long-lived mutants tell us about mechanisms causing aging and lifespan variation in natural environments? Exp Gerontol. 2015;71:21–6.

    Article  PubMed  Google Scholar 

  59. Savory FR, Benton TG, Varma V, Hope IA, Sait SM. Stressful environments can indirectly select for increased longevity. Ecol Evol. 2014;4:1176–85.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Ehrich TH, Kenney-Hunt JP, Pletscher LS, Cheverud JM. Genetic variation and correlation of dietary response in an advanced intercross mouse line produced from two divergent growth lines. Genet Res. 2005;85:211–22.

    Article  CAS  PubMed  Google Scholar 

  61. Cheverud JM, Ehrich TH, Kenney JP, Pletscher LS, Semenkovich CF. Genetic evidence for discordance between obesity- and diabetes-related traits in the LGXSM recombinant inbred mouse strains. Diabetes. 2004;53:2700–8.

    Article  CAS  PubMed  Google Scholar 

  62. Jehrke L, Stewart FA, Droste A, Beller M. The impact of genome variation and diet on the metabolic phenotype and microbiome composition of Drosophila melanogaster. Sci Rep. 2018;8:6215.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  63. Martínez-Micaelo N, González-Abuín N, Terra X, Ardévol A, Pinent M, Petretto E, et al. Identification of a nutrient-sensing transcriptional network in monocytes by using inbred rat models on a cafeteria diet. Dis Model Mech. 2016;9:1231–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  64. Stanley PD, Ng’oma E, O’Day S, King EG. Genetic dissection of nutrition-induced plasticity in insulin/insulin-like growth factor signaling and median life span in a Drosophila multiparent population. Genetics. 2017;206:587–602.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  65. Kover PX, Valdar W, Trakalo J, Scarcelli N, Ehrenreich IM, Purugganan MD, et al. A multiparent advanced generation inter-cross to fine-map quantitative traits in Arabidopsis thaliana. PLoS Genet. 2009;5:e1000551.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  66. McMullen MD, Kresovich S, Villeda HS, Bradbury P, Li H, Sun Q, et al. Genetic properties of the maize nested association mapping population. Science. 2009;325:737–40.

    Article  CAS  PubMed  Google Scholar 

  67. Huang X, Paulo M-J, Boer M, Effgen S, Keizer P, Koornneef M, et al. Analysis of natural allelic variation in Arabidopsis using a multiparent recombinant inbred line population. Proc Natl Acad Sci U S A. 2011;108:4488–93.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Aylor DL, Valdar W, Foulds-Mathes W, Buus RJ, Verdugo RA, Baric RS, et al. Genetic analysis of complex traits in the emerging collaborative cross. Genome Res. 2011;21:1213–22.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Threadgill DW, Churchill GA. Ten years of the collaborative cross. G3: genes, genomes. Genetics. 2012;2:153–6.

    Google Scholar 

  70. Cubillos FA, Parts L, Salinas F, Bergström A, Scovacricchi E, Zia A, et al. High-resolution mapping of complex traits with a four-parent advanced intercross yeast population. Genetics. 2013;195:1141–55.

    Article  PubMed  PubMed Central  Google Scholar 

  71. King EG, Merkes CM, McNeil CL, Hoofer SR, Sen S, Broman KW, et al. Genetic dissection of a model complex trait using the Drosophila synthetic population resource. Genome Res. 2012;22:1558–66.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  72. King EG, Macdonald SJ, Long AD. Properties and power of the Drosophila synthetic population resource for the routine dissection of complex traits. Genetics. 2012;191:935–49.

    Article  PubMed  PubMed Central  Google Scholar 

  73. Pertea M, Kim D, Pertea GM, Leek JT, Salzberg SL. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc. 2016;11:1650–67.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  74. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  75. Stobdan T, Sahoo D, Azad P, Hartley I, Heinrichsen E, Zhou D, et al. High fat diet induces sex-specific differential gene expression in Drosophila melanogaster. PLoS One. 2019;14:e0213474.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  76. Rivera O, McHan L, Konadu B, Patel S, Sint Jago S, Talbert ME. A high-fat diet impacts memory and gene expression of the head in mated female Drosophila melanogaster. J Comp Physiol B. 2019;189:179–98.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  77. Heinrichsen ET, Zhang H, Robinson JE, Ngo J, Diop S, Bodmer R, et al. Metabolic and transcriptional response to a high-fat diet in Drosophila melanogaster. Mol Metab. 2014;3:42–54.

    Article  CAS  PubMed  Google Scholar 

  78. Linnen C, Tatar M, Promislow D. Cultural artifacts: a comparison of senescence in natural, laboratory-adapted and artificially selected lines of Drosophila melanogaster. Evol Ecol Res. 2001;3:877–88.

    Google Scholar 

  79. Sgrò, Sgrò, Partridge. Evolutionary responses of the life history of wild-caught Drosophila melanogaster to two standard methods of laboratory culture. Am Nat. 2000;156:341.

    Article  Google Scholar 

  80. Sgrò CM, van Heerwaarden B, Kellermann V, Wee CW, Hoffmann AA, Lee SF. Complexity of the genetic basis of ageing in nature revealed by a clinal study of lifespan and methuselah, a gene for ageing, in Drosophila from eastern Australia. Mol Ecol. 2013;22:3539–51.

    Article  CAS  PubMed  Google Scholar 

  81. Harshman LG, Hoffmann AA. Laboratory selection experiments using Drosophila: what do they really tell us? Trends Ecol Evol. 2000;15:32–6.

    Article  CAS  PubMed  Google Scholar 

  82. Reed LK, Lee K, Zhang Z, Rashid L, Poe A, Hsieh B, et al. Systems genomics of metabolic phenotypes in wild-type Drosophila melanogaster. Genetics. 2014;197:781–93.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  83. Musselman LP, Fink JL, Baranski TJ. Similar effects of high-fructose and high-glucose feeding in a Drosophila model of obesity and diabetes. PLoS One. 2019;14:e0217096.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  84. Ng’oma E, Fidelis W, Middleton KM, King EG. The evolutionary potential of diet-dependent effects on lifespan and fecundity in a multi-parental population of Drosophila melanogaster. Heredity. 2019;122:582-94.

    Article  PubMed  PubMed Central  Google Scholar 

  85. Lee KP, Simpson SJ, Clissold FJ, Brooks R, Ballard JWO, Taylor PW, et al. Lifespan and reproduction in Drosophila: new insights from nutritional geometry. Proc Natl Acad Sci U S A. 2008;105:2498–503.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  86. Jensen K, McClure C, Priest NK, Hunt J. Sex-specific effects of protein and carbohydrate intake on reproduction but not lifespan in Drosophila melanogaster. Aging Cell. 2015;14:605–15.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  87. Dobson AJ, Ezcurra M, Flanagan CE, Summerfield AC, Piper MDW, Gems D, et al. Nutritional programming of lifespan by FOXO inhibition on sugar-rich diets. Cell Rep. 2017;18:299–306.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  88. Lakowski B, Hekimi S. The genetics of caloric restriction in Caenorhabditis elegans. Proc Natl Acad Sci U S A. 1998;95:13091–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  89. Houthoofd K, Braeckman BP, Johnson TE, Vanfleteren JR. Life extension via dietary restriction is independent of the Ins/IGF-1 signaling pathway in Caenorhabditis elegans. Exp Gerontol. 2003;38:947–54.

    Article  CAS  PubMed  Google Scholar 

  90. Kaeberlein TL, Smith ED, Tsuchiya M, Welton KL, Thomas JH, Fields S, et al. Lifespan extension in Caenorhabditis elegans by complete removal of food. Aging Cell. 2006;5:487–94.

    Article  CAS  PubMed  Google Scholar 

  91. Lee GD, Wilson MA, Zhu M, Wolkow CA, de Cabo R, Ingram DK, et al. Dietary deprivation extends lifespan in Caenorhabditis elegans. Aging Cell. 2006;5:515–24.

    Article  CAS  PubMed  Google Scholar 

  92. Giannakou ME, Goss M, Partridge L. Role of dFOXO in lifespan extension by dietary restriction in Drosophila melanogaster: not required, but its activity modulates the response. Aging Cell. 2008;7:187–98.

    Article  CAS  PubMed  Google Scholar 

  93. Gershman B, Puig O, Hang L, Peitzsch RM, Tatar M, Garofalo RS. High-resolution dynamics of the transcriptional response to nutrition in Drosophila: a key role for dFOXO. Physiol Genomics. 2007;29:24–34.

    Article  CAS  PubMed  Google Scholar 

  94. Greer EL, Dowlatshahi D, Banko MR, Villen J, Hoang K, Blanchard D, et al. An AMPK-FOXO pathway mediates longevity induced by a novel method of dietary restriction in C. elegans. Curr Biol. 2007;17:1646–56.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  95. Newberg LA, Chen X, Kodira CD, Zavodszky MI. Computational de novo discovery of distinguishing genes for biological processes and cell types in complex tissues. PLoS One. 2018;13:e0193067.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  96. Ng’oma E, King EG, Middleton KM. A model-based high throughput method for fecundity estimation in fruit fly studies. Fly. 2018.

    Article  PubMed  PubMed Central  Google Scholar 

  97. Bass TM, Grandison RC, Wong R, Martinez P, Partridge L, Piper MDW. Optimization of dietary restriction protocols in Drosophila. J Gerontol A Biol. 2007;62:1071–81.

    Article  Google Scholar 

  98. Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12:357–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  99. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  100. Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33:290–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  101. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  102. Leek JT, Storey JD. Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet. 2007;3:e161.

    Article  PubMed Central  CAS  Google Scholar 

  103. Leek JT. svaseq: removing batch effects and other unwanted noise from sequencing data. Nucleic Acids Res. 2014;42.

    Article  PubMed Central  CAS  Google Scholar 

  104. North BV, Curtis D, Sham PC. A note on the calculation of empirical P values from Monte Carlo procedures. Am J Hum Genet. 2002;71:439–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  105. Luo W, Friedman MS, Shedden K, Hankenson KD, Woolf PJ. GAGE: generally applicable gene set enrichment for pathway analysis. BMC Bioinformatics. 2009;10:161.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  106. Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  107. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Stat Methodol. 1995;57:289–300.

    Google Scholar 

  108. Langfelder P, Horvath S. Fast R functions for robust correlations and hierarchical clustering. J Stat Softw. 2012;46.

  109. Dong J, Horvath S. Understanding network concepts in modules. BMC Syst Biol. 2007;1:24.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  110. Sikkink KL, Reynolds RM, Ituarte CM, Cresko WA, Phillips PC. Environmental and evolutionary drivers of the modular gene regulatory network underlying phenotypic plasticity for stress resistance in the Nematode Caenorhabditis remanei. G3. 2019;9:969–82.

    CAS  PubMed  PubMed Central  Google Scholar 

Download references


We would like to acknowledge Elizabeth Lo Presti, Michael Reed, and Kevin Middleton for help with experimental setup and fly husbandry. High-throughput sequencing services were performed at the University of Missouri DNA Core Facility.


Funding for this study was provided by NIH R01 GM117135 and NSF IOS 1654866 to EGK. Computational work was performed on the high-performance computing infrastructure provided by Research Computing Support Services and in part by the National Science Foundation under grant number CNS-1429294 at the University of Missouri, Columbia Mo. Funding bodies had no role in the design of the study; collection, analysis, and interpretation of data; and in the writing and revising of the manuscript.

Author information

Authors and Affiliations



EGK conceived and planned the project. EN and EGK designed, set up, and managed the experiment, AR isolated RNA and prepared samples for sequencing, EN, PAW-S, EGK analyzed the data, EN and EGK wrote the manuscript, EN, EGK, AR and PAW-S commented on the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to E. Ng’oma.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Figure S1.

Module detection by hierarchical clustering of 12,614 genes showing co-expressed sets of gene clusters. Figure S2. Global patterns of gene expression in 54 RNA samples. Figure S3. Histograms of the correlation between fold changes for each pair of diets, calculated with each diet as the reference diet obtained from permuted data. Figure S4. Histograms of the proportion of genes that trend in the same direction, calculated with each diet as the reference diet obtained from permuted data. Figure S5. Gene set enrichment analysis (GSEA).

Additional file 2.

Differentially expressed genes for the main effect of diet.

Additional file 3.

Differentially expressed genes for the interaction effect between diet and tissue.

Additional file 4.

Correlation of pairwise fold changes relative to the third diet as reference.

Additional file 5.

Pathways and GO terms from Gene set enrichment analysis using R GAGE package.

Additional file 6.

Eigenegene expression. Modules are assigned color names which correspond with letter names (a - v, second row below) reported in the manuscript.

Additional file 7.

Gene Ontology (GO) term enrichment on each module after performing a module assignment consistency test via resampling.

Additional file 8.

Nutrient sensing pathway (IIS, TOR, and FOXO) genes, 47 of which were significantly differentially expressed for the diet effect.

Additional file 9.

49 DEGs were located within the BC interval of a QTL for the difference in median lifespan between flies on DR and C diets [64].

Additional file 10.

Composition of the four diets used in this experiment.

Additional file 11.

Husbandry and sample preparation batches.

Additional file 12.

Sample information and RNA quality.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ng’oma, E., Williams-Simon, P.A., Rahman, A. et al. Diverse biological processes coordinate the transcriptional response to nutritional changes in a Drosophila melanogaster multiparent population. BMC Genomics 21, 84 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: