Skip to main content

Container-aided integrative QTL and RNA-seq analysis of Collaborative Cross mice supports distinct sex-oriented molecular modes of response in obesity



The Collaborative Cross (CC) mouse population is a valuable resource to study the genetic basis of complex traits, such as obesity. Although the development of obesity is influenced by environmental factors, underlying genetic mechanisms play a crucial role in the response to these factors. The interplay between the genetic background and the gene expression pattern can provide further insight into this response, but we lack robust and easily reproducible workflows to integrate genomic and transcriptomic information in the CC mouse population.


We established an automated and reproducible integrative workflow to analyse complex traits in the CC mouse genetic reference panel at the genomic and transcriptomic levels. We implemented the analytical workflow to assess the underlying genetic mechanisms of host susceptibility to diet induced obesity and integrated these results with diet induced changes in the hepatic gene expression of susceptible and resistant mice. Hepatic gene expression differs significantly between obese and non-obese mice, with a significant sex effect, where male and female mice exhibit different responses and coping mechanisms.


Integration of the data showed that different genes but similar pathways are involved in the genetic susceptibility and disturbed in diet induced obesity. Genetic mechanisms underlying susceptibility to high-fat diet induced obesity are different in female and male mice. The clear distinction we observed in the systemic response to the high-fat diet challenge and to obesity between male and female mice points to the need for further research into distinct sex-related mechanisms in metabolic disease.


The Collaborative Cross (CC) is a multiparent genetic reference panel (GRP) of recombinant inbred lines (RIL) of mice derived from eight different founder strains. The CC resource was developed to facilitate the study of the genetic basis of complex traits, and serve as a uniquely powerful resource for the mapping and integration of various phenotypic and genotypic data [1]. CC lines have been shown to be highly variable for traits related to both normal physiology and disease and have been used successfully in multiple system genetics studies. The CC panel has been the catalyst for the development of a variety of bioinformatics tools for haplotype inference and reconstruction, and genetic mapping [2,3,4].

Based on a recent report, people with overweight or obese phenotype account for almost two thirds of the population in the USA [5]. The latest estimates in European Union countries show that 30–70% of the adult population are overweight and 10–30% are obese. Some medical associations classify obesity (defined as BMI ≥ 30) as a disease. The development of obesity is affected by various environmental factors such as excess high fat/carbohydrate enriched food consumption and sedentary lifestyle. However, underlying genetic mechanisms are involved in determining the host response to these factors with the rate of heritability of body mass index (BMI) ranging from 40 to 70% in various studies. The CC panel is an experimental population that aids with the dissection of the genetic mechanisms underlying susceptibility to complex traits, such as obesity. Chromosomal regions that are involved in the susceptibility or resistance to the trait can be mapped as quantitative trait loci (QTL) with high precision. QTL mapping results can be strengthened and enriched through integration of RNA-seq data to identify gene expression differences in susceptible versus resistant individuals.

High-throughput sequencing technologies produce millions of reads in a relatively short time, overcoming the limitations of previous technologies and unravelling previously inaccessible complexities in the transcriptome. However, the overall high complexity of the produced datasets, due to their large size and low signal-to-noise ratio, hinders the interpretation of the underlying information. Most recent analytical methods depend on individual tools that users must download, install and use on their physical drives. The process of deciding which bioinformatic tool accommodates the needs of researchers (depending on the experimental approach, the scientific question of interest, as well as the computational needs) can be time-consuming and requires expertise.

Our main aim, was to develop an easy-to-use, scalable and cost-effective workflow for the integrative analysis of genotyping and RNA-seq data from CC mice, offering cross-platform portability to different high-end computing configurations. We then opted to use this workflow to assess the underlying QTL that influence genomic susceptibility to high-fat diet induced obesity in mice and the hepatic gene expression response of the mice to high-fat diet and obesity.

The tools used in each step of the developed workflow are connected through Python scripts, are archived in our public GitHub repository and can be fully modified. The Python scripts are constructed in a user-friendly manner, so that inexperienced users are required to input only the basic parameters (such as the paths to input files, indexes and annotation files) and the whole process is executed automatically, assuming default parameters. More advanced users can apply different sets of input commands or fully modify the scripts according to their requirements.

All bioinformatic tools comprising this workflow have been containerised using Docker containers, resulting in a portable infrastructure that can be executed on physical drives or cloud servers. The main advantage of containerisation is that it is no longer required to install numerous pieces of software, with complex dependencies, downloading instead a pre-built and ready-to-run image file, containing all necessary software and their required dependencies. Another advantage is that applications are run in an isolated and sanitised container environment, where all dependencies are configured for optimal performance, preventing conflicts with other installed programs in the hosting environment. Containerisation ensures the standard operation and performance of applications, not affected by system updates or programming errors from the host end, making the process transparent to the end-user and consistenly fully reproducible. Furthermore, each Docker container can be used as a stand-alone version of the tool making the workflow scalable and adaptable to the individual needs of the researcher.

A question that arises is to what extend the start-up, deployment and instantiation of containers performed by the Docker daemon affect the overall performance and the computational cost of a workflow. An IBM research study suggests that the overhead for CPU and memory performance introduced by Docker containers is negligible, and that containerised applications perform equally or better when compared to virtual machine technology [6]. Regarding the containerisation of genomic pipelines, it has been suggested that Docker containerisation has a negligible impact on the execution performance of common genomic pipelines, especially when tasks (such as the alignment of reads to a reference genome) are generally very time consuming [7].



Breeding and housing

The Collaborative Cross mouse lines are novel and highly genetically diverse mouse resource population derived from a genetically diverse set of eight founder mouse strains (A/J, C57BL/6 J, 129S1/SvImJ, NOD/LtJ, NZO/HlLtJ, CAST/EiJ, PWK/PhJ, and WSB/EiJ), designed specifically for complex trait analysis, with the aim to overcome the limitations of previously available resources [8]. A cohort of CC lines was developed and currently is housed at conventional environmental conditions at the small animal facility of Tel-Aviv University (TAU) between generations of G27 to G64 of inbreeding by full-sib mating [9]. The CC lines have been maintained and bred by our team at TAU since 2006 as described in the 2008 paper by Iraqi and colleagues [10]. The study was approved by the Institutional Animal Care and Use Committee (IACUC), with the approval numbers M-12-025 and M-14-007.

Mice were weaned at 3 weeks old, housed separately by sex, with a maximum number of five mice per cage, fed with standard rodents’ chow diet (TD.2018SC, Teklad Global, Harlan Inc., Madison, WI, USA, containing % Kcal from Fat 18%, Protein 24%, and Carbohydrates 58%) and water ad libitum. All animals were housed at the TAU animal facility at conventional open environment conditions, in clean polycarbonate cages with stainless metal covers, and bedded with wood shavings, at light:dark cycles of 12:12 h, and constant room temperature of 220c (±2). Due to genetic variations between CC lines, breeding rate, the number and sex of litters in each cycle might vary. Therefore, the CC lines were assessed based on litter availability, while making our best efforts to scan as many of the CC lines as possible with representation of both sexes.

Study cohort

The study cohort consisted of 540 mice, from 60 different CC lines that were generated, maintained and studied at the TAU small animal facility. Forty-three of lines had representation of both sexes. A subgroup of 84 mice from 43 CC lines, with 20 lines with representation of both sexes, was selected for RNA sequencing.

Dietary challenge

At the age of 8 weeks old, the baseline body weight was recorded. The average body weight (±SE) of females was 18.92 g (±0.27) and of males 22.92 g (±0.24) [11]. Thereafter, the mice were switched to a high-fat diet (HFD) (TD 88137 Harlan Teklad, Madison, WI, USA; containing 42% of calories from fat and 34.1% from carbohydrate, primarily sucrose) starting the dietary challenge for a period of 12 weeks with free access to food and water. During the experiment, mice welfare and health status were monitored daily. The experiment was terminated (euthanasia) for mice that showed deteriorating health, manifested in phenomena such as limited movement, heavy weight loss (about 10% between the weekly weigh measures and over 20% of the initial body weight), apathy and lack of physical activity, interrupted equilibrium (physical instability), breathing difficulty, exceptional behaviour (high aggressiveness / loneliness), and extremely high glucose levels (> 400 mg/dL).


Mice were weighed at the beginning of the HFD challenge and bi-weekly thereafter for the following 12 weeks. Mice that gained less than 10 g to their initial body weight over the dietary challenge period were considered “Normal”, while those that gained over 10 g of body weight were considered “Obese”. The median weight at the beginning of the dietary challenge and the median body weight gain for each CC-line are presented in Supplementary Table 1. At the end of the 12-week dietary challenge, an intraperitoneal glucose tolerance test (IPGTT) was performed after 6 h of fasting to evaluate the diabetic stage of the mice. By using the method of glucose IP injection, the gut effect was bypassed and the glucose stimulated insulin secretion was lower compared to oral administration of glucose. The IPGTT lasted 180 min, with glucose levels measured at different time points before and after the glucose administration. Glucose tolerance was calculated as the total area under the curve between the initial and end time point of the test, for each CC line, separately for females and males. Mice that reached the end time-point of IPGTT with glucose levels under 400 mg/dL were considered “Nondiabetic”, while mice with glucose levels over 400 mg/dL were labelled as “Diabetic”. The complete CC line, sex, baseline body weight at the start of the dietary challenge, body weight gain after 12 weeks of dietary challenge, and blood glucose levels at 180 min after the IPGTT challenge data included in the analysis are presented in Supplementary Table 2.

RNA extraction

After overnight recovery from the IPGTT, mice were sacrificed by cervical dislocation, and their livers were collected and stored in liquid nitrogen (− 80 °C). RNA extraction was performed using the QIAGEN commercial kit (Cat.No.73404). Quality control of RNA samples was performed with 2100 BioAnalyzer (Agilent). The RNA Integrity Number (RIN) was used to estimate the integrity of the total RNA sample, samples with RIN above 7.0 passed the quality control test.

RNA-seq libraries

RNA-seq libraries were prepared using the TruSeq Stranded mRNA library preparation kit (Illumina). Libraries were pooled and sequenced on the Illumina HiSeq 2000 and 2500 sequencers with Illumina v3 sequencing chemistry. Paired-end sequencing was performed by reading 50 bases at each end of a fragment. Overall, each sample consisted of 24 M to 37.5 M RNA-sequencing fragments with an average of 31.5 M fragments.

RNA-seq analysis

The base images for the Docker containers were pulled from the repositories of Biocontainers and Rocker [12]. For the purposes of differential expression testing and QTL analysis we developed Docker containers with integrated R v.3.4.2 and all the required packages installed (such as Bioconductor, EdgeR and HAPPY). The Docker images also contained ready-to-run R scripts set on the default parameters so that users unfamiliar with R programming can perform the analyses using R packages otherwise unavailable as stand-alone versions (EdgeR and HAPPY), while advanced users can modify the scripts in a sanitised Docker container environment. In order to run the workflow we developed, the user is required to install the docker engine locally and then pull the images from the Docker Hub repository. This task is relatively fast and requests minimal programming knowledge in comparison to the skills needed for downloading and installing a combination of several pieces of third-party software, while configuring their implicit dependencies and libraries.

Quality control

For quality assessment of the reads we used the FastQC tool [13], which is the golden standard for quality control workflows. We did not incorporate a trimming tool in our workflow as aggressive trimming of reads has been suggested to alter RNA-seq expression estimates and the soft-clipping performed by HISAT2 makes read trimming not strictly necessary [14]. However, we provide a docker container with Trim Galore!, which is a tool that makes use of the publicly available adapter trimming tool Cutadapt [15] and FastQC for optional quality control [16].

Mapping of reads

Mapping of the reads was performed using the HISAT2 tool, which is a fast and sensitive alignment program for mapping RNA-seq reads [17]. The high efficiency of HISAT2 is based on the indexing scheme it utilises (employing two types of indexes based on Burrows - Wheeler transform and the Ferragina-Manzini index), allowing the tool to perform alignments very fast and with equal or better accuracy than any other method currently available. HISAT2 supports genomes of any size and has low memory requirements (approximately 4.3 GB of RAM for the human genome).

Assigning sequence reads to genomic features

For the purpose of assigning reads to genomic features (such as genes, exons, promoters and genomic bins) we used the software program FeatureCounts (v1.5.0-p1), from the Subread stand-alone package [18]. FeatureCounts is considerably faster than existing methods and has exceptionally low memory requirements, while being one of the top-ranking software in accuracy.

DE testing

The DE testing was performed with the R package EdgeR [19] using a modified Rscript from Su [20]. The Rscript was run through a Docker container that employs R version 3.3.0 and has been built with Bioconductor version 3.4 and all the required packages installed. The normalisation (for library size, gene length and sequencing depth) was performed on the raw count matrix produced by FeatureCounts using the Trimmed Mean of M-values (TMM) [21], which is set as the default normalisation method. A negative binomial generalized linear model (GLM) was fitted to the data and the testing procedure for determining differential expression was performed using quasi-likelihood (QL) F-test. The P-value adjustment was performed using the Benjamini-Hochberg method and in order to restrict the false discovery rate (FDR) to 0.05, all the genes with adjusted P-values less than 0.05 were selected. The filtering of the gene list (threshold default values: adjusted P-value 0.05 and |log2FC| ≥ 1) was performed with a custom python script.

Functional analysis

The functional pathway analysis was performed with the BioInfoMiner platform [22]. BioInfoMiner exploits biological hierarchical vocabularies through statistical and network analysis to detect and rank significantly altered processes and the hub linker genes involved. For our analysis we utilised Gene Ontology (GO) [23] and MGI Mammalian Phenotype Ontology [24]. BioInfoMiner maps the genes on a genomic network created from semantic data and prioritises them based on topological properties after minimising the impact of semantic noise (bias) through different types of correction. This analysis accomplishes the systemic interpretation of the complex cellular mechanisms described in the input gene lists, while at the same time it prioritises genes with central functional and regulatory roles in important cellular processes, underlying the studied phenotype. The correction for potential semantic inconsistencies of the selected vocabulary is implemented by linking the annotation of each gene with the ancestors of every direct correlated ontological term, consequently restoring the sound structure of an ontological tree. The BioInfoMiner platform is available online at the website

Genetic mapping


All CC lines were genotyped with high-density SNP markers using the MDA (620 K SNP markers), MUGA (7.7 K markers) and MegaMUGA (77 K markers) genotyping arrays based on the Illumina infinium platform. All SNPs with heterozygous or missing genotypes in the 8 CC founders or not common between the arrays were filtered out, leaving 170,935 SNPs. The SNPs were mapped onto build 37 of the mouse genome. A descent probability distribution was computed using the HAPPY HMM for each of the 170,935 SNPs intervals. The genotype status of the CC lines using the MUGA SNP was presented in CTC 2012 report [9, 25].

QTL mapping

The QTL mapping was based on the haplotype mosaic reconstruction with HAPPY. We developed two automated containerised R scripts using the mapping methodology proposed by Durrant and colleagues [26]. The first script uses the probability distribution of descent as calculated by the HAPPY algorithm to test for association between the reconstructed haplotype for each CC line at each locus and the median body weight gain at different time points. The second script estimates confidence intervals (CI) for each QTL through simulation of a QTL with a similar logP and strain effects in the neighbourhood of the observed QTL peak. We used the SNPtools R package to extract the genes inside the 95% CI for each QTL [27].


Data stratification

It is well documented that obesity and obesity-related health complications are affected by gender, and that sex-specific differences have a genetic basis and cannot be solely attributed to differential hormonal regulation [28]. Gender-specific differences in adiposity as well as fat distribution, in addition to the distinctive genetic basis and hormonal regulation of men and women, may result in sex-specific patterns.

In order to assess the presence of confounding factors, we performed a multi-dimensional scaling analysis, using the EdgeR R package. According to the MDS plot (Fig. 1) the points cluster into two groups, not on the basis of the differences between the two conditions (“Normal” vs “Obese”) but on the basis of sex. This finding led to the conclusion that the observed separation in the principal component analysis performed, was the result of gender-specific differences, meaning that a straightforward approach would be prone to confounding biases.

Fig. 1

Multidimensional scaling (MDS) plot visualising the level of similarity of individual cases in the dataset. Male samples are represented with red, while female samples are represented with black labels

In order to partition the samples into non-overlapping groups, we performed a stratified sampling dividing the total sample population into four strata, considering the sex and diabetes status of the mice. Of the total 85 samples, 21 samples were categorised in the Male-Nondiabetic group, 21 samples were categorised in the Male-Diabetic group, 36 samples were categorised in the Female-Nondiabetic group and 5 samples in the Female-Diabetic group. One sample (DT 123) was not used in the analysis due to missing data.

QTL analysis

We performed haplotype association analysis, using the median value of the weight gain in the 12 weeks of the HFD challenge. We mapped one QTL on chr3 for female mice and one QTL on chr5 for male mice, designated as ObFL and ObML for obesity female locus and obesity male locus, respectively. Details on the position and size of each QTL are given in Table 1. We performed functional analysis with BioInfoMiner on the genes extracted from each of the QTLs using the Gene Ontology and MGI Mammalian Phenotype Ontology vocabularies.

Table 1 Positions of QTLs associated with obesity

Female mice body weight gain

In female mice, prioritised genes included Sgms2 and Hadh that were found to be involved in various processes, such as increased energy expenditure (MP:0004889), decreased susceptibility to diet-induced obesity (MP:0005659) and increased circulating free fatty acid level (MP:0001554). Interestingly, Sgms2 deficiency in mice increases insulin sensitivity and ameliorates high-fat diet-induced obesity and Hadh−/− mice, while having a disrupted β-oxidation pathway, are also protected from diet-induced obesity [29, 30]. Other prioritised genes are Lef1, Dkk2 and Egf, which are all involved in the Wnt signaling pathway (GO:0016055). Non-canonical Wnt signaling has been shown to contribute to obesity-associated metabolic dysfunction by increasing adipose tissue inflammation [31].

Male mice body weight gain

In male mice the most highly prioritised gene is Ppargc1a, which is found to be involved in various enriched processes both in GO and in MGI Mammalian Phenotype Ontology, including decreased muscle weight (MP:0004232), regulation of muscle tissue development (GO:1901863) and lipid modification (GO:0030258). Ppargc1a is a transcriptional co-activator that regulates genes involved in energy metabolism through its interaction with Pparγ. In human GWAS analysis a single nucleotide variant of Ppargc1a (rs8192678) has been associated with susceptibility to obesity and insulin resistance [32]. Another important gene prioritised through both vocabularies, is Cckar, found involved in processes such as abnormal small intestinal transit time (MP:0006002) and abnormal intestinal cholesterol absorption (MP:0002645). Rats with a naturally occurring mutation in Cckar (Otsuka Long-Evans Tokushima Fatty (OLETF) rat) develop diabetes and obesity [33]. Prioritised gene Sod3 is involved along with Ppargc1a in: response to reactive oxygen species (GO:0000302), increased susceptibility to injury (MP:0005165), and abnormal cytokine secretion (MP:0003009). Over-expression of Sod3 in high-fat diet fed mice has been shown to block diet induced obesity [34]. Med28 is involved in the regulation of muscle cell differentiation (GO:0051147). Med28 is one of the subunits of the Mediator complex, which acts as a transcription factor co-activator and plays an important role in muscle metabolism by enhancing the transcriptional activity of Ppargc1a and Pparα [35]. Lastly, Slit2 is also prioritised and shares enriched terms with Ppargc1a, such as regulation of smooth muscle cell migration (GO:0014910) and response to organonitrogen compound (GO:0010243). Slit2 has been shown to regulate metabolic function and thermogenic activity and improve glucose homeostasis in diet-induced obese mice, mainly through Ucp1 [36], which has been found highly under-expressed in male obese mice, in comparison to female obese mice in our analysis.

RNA-seq analysis

DE analysis was performed on the Male-Nondiabetic and Female-Nondiabetic groups separately. In order to perform the statistical differential expression tests we divided the samples into two subgroups, 10 samples were categorised in the Male-Nondiabetic-Obese group (the male case group), 11 samples were categorised in the Male-Nondiabetic-Normal (the male control group) 12 samples were categorised in the Female-Nondiabetic-Obese group (the female case group) and 24 samples in the Female-Nondiabetic-Normal group (the female control group). Differences in library size were addressed by TMM normalisation.

Female nondiabetic normal vs female nondiabetic obese

As a first step we tried to assess the DE genes within the two genders, performing the DE tests between control and case groups of the same gender. The Female-Nondiabetic-Normal samples (control group) versus the Female-Nondiabetic-Obese samples (case group) DE genes list consisted of 1382 DE genes. Using BioInfoMiner for the enrichment analysis we obtained two prioritised gene list of 23 DE genes from GO and 21 DE genes from MGI Mammalian Phenotype. Seven genes were common in both gene lists. Processes that were highly enriched in DE genes in female mice include digestion (GO:0007586), regulation of insulin secretion (GO:0050796), response to lipid (GO:0033993), fatty acid biosynthetic process (GO:0006633), increased energy expenditure (MP:0004889), decreased susceptibility to diet-induced obesity (MP:0005659), abnormal glucose tolerance (MP:0005291) and decreased circulating leptin level (MP:0005668). Top prioritised, linker genes in female mice include Lepr and Ppargc1a that were under-expressed in obese females and Pnlip, Pyy, and Ins2 that were over-expressed in obese females. Lepr functions as a receptor for leptin, an adipose secreted hormone that regulates energy expenditure, satiety, lipid and glucose metabolism and immune system activation. Pancreatic lipase is normally secreted from the pancreas and is efficient in the digestion of dietary fats. Inhibition of Pnlip may prevent HFD induced obesity in mice. Orlistat, an inhibitor of Pnlip was the first FDA-approved anti-obesity therapeutic drug in treating diet-induced obesity [37]. Increased pancreatic expression of Pnlip has been observed in mice induced by fasting via the PPARα-FGF21 signalling pathway [38]. PYY is synthesised and released from specialised cells found predominantly within the distal gastrointestinal tract and regulates appetite. Transgenic mice with increased circulating PYY are resistant to diet-induced obesity [39]. Ins2 ectopic expression in the liver has been observed before in mice subjected to HFD [40]. We hypothesise that Pyy may be expressed ectopically in the liver in the same way as Ins2 in response to the HFD.

Male nondiabetic normal vs male nondiabetic obese

The DE genes list for the Male-Nondiabetic-Normal samples (control group) versus the Male-Nondiabetic-Obese samples (case group) consisted of 1589 DE genes. We performed functional analysis on this list and obtained two prioritised gene lists of 24 DE genes from GO and 22 genes from MGI Mammalian Phenotype. Ten of the genes from the two prioritised gene lists were common in both. Enriched pathways in male mice are overwhelmingly related to muscle and cardiac muscle processes: impaired muscle contractility (MP:0000738), abnormal skeletal muscle mass (MP:0004817), myopathy (MP:0000751), cardiac muscle hypertrophy (GO:0003300). A second category of enriched processes involves ion transport (GO:0006811). Top prioritised linker genes in male mice include Nos1, Ryr1, Des, Ttn all under-expressed in obese males while Ryr2 is over-expressed. Ryr1 is mainly expressed in skeletal muscle. The encoded protein functions as a calcium release channel in the sarcoplasmic reticulum. However, there is a number of studies suggesting that RyRs are widely expressed and have been linked with inositol 1,4,5-trisphosphate receptors in hepatocytes [41]. Downregulation of Ryr1 could lead to reduced release of Ca2+ from the sarcoplasmic (muscle cells) and the endoplasmic reticulum (hepatic cells) into the cytoplasm and therefore hinder muscle contraction as well as the regulation of mitochondrial metabolism and glycogen degradation [42, 43]. Des encodes a muscle-specific class III intermediate filament, which is important to help maintain the structure of sarcomeres and has been linked with hepatic fibrosis due to obesity [44]. Titin is an essential component of skeletal and cardiac muscles, but it has been suggested in recent studies that titin isoforms are expressed in non-muscle tissues including the liver, with an essential role in maintaining cellular organisation and contributing to signal transduction [45]. Ryr2 is primarily expressed in cardiac muscle. Ryr2 channels are associated with mitochondrial metabolism, gene expression regulation and cell survival, in addition to their role in cardiomyocyte contraction. A recent study links Ryr2 to insulin release and glucose homeostasis, suggesting that the upregulation of Ryr2 might be a coping mechanism in response to stress [46]. Nos1 produces nitric oxide (NO), which has multiple biological functions. Downregulation of Nos1 in obesity and diabetes is largely attributed to insulin resistance [47].

Male vs female comparisons

Finally, in order to assess the DE genes between the two genders we performed DE tests between groups that belonged to different genders. The DE genes list for the Male-Nondiabetic-Normal samples versus the Female-Nondiabetic-Normal samples comparison, consisted of 1923 genes. Functional analysis produced two prioritised gene lists consisting of 28 genes from GO and 25 genes from MGI Mammalian Phenotype, with 9 common genes. As for the DE genes for the Male-Nondiabetic-Obese samples versus the Female-Nondiabetic-Obese samples, the genes list consisted of 1940 DE genes. Functional analysis produced two prioritised gene lists consisting of 40 genes from GO and 22 genes from MGI, with 8 common genes. Between the two groups 482 DE genes are shared while 1441 genes are uniquely DE in the Male-Nondiabetic-Normal versus the Female-Nondiabetic-Normal samples and 1458 genes are uniquely DE in the Male-Nondiabetic-Obese versus the Female-Nondiabetic-Obese samples.

The 482 DE genes that are common in the two comparisons are enriched in genes that are involved in processes such as the epoxygenase P450 pathway (GO:0019373), long-chain fatty acid metabolic process (GO:0001676), negative regulation of gluconeogenesis (GO:0045721), increased circulating gonadotropin level (MP:0003362), and regulation of NF-κB import into nucleus (GO:0042345). Functional analysis of these 482 DE expressed genes produced 21 prioritised genes from GO and 20 prioritised genes from MGI Mammalian Phenotype. Of these 8 genes were common in both lists. Cav1, Egfr, Gstp1, Gcg, and Nox4 are consistently over-expressed in male versus female mice both in the non-obese and obese comparisons. Caveolin-1 regulates hepatic lipid accumulation and glucose metabolism and plays an important role in metabolic adaptation [48]. Hepatic glucagon action has been associated with elevated fatty acid oxidation and might act protectively against non-alcoholic fatty liver disease (NAFLD) [49]. Esr1, Il1b and Ptgs2 are consistently over-expressed in female versus male mice in both comparisons. Ptgs2 is involved in inflammation in fat and drives obesity-linked insulin resistance and fatty liver [50]. Tph1 is under-expressed in non-obese males and over-expressed in obese males in comparison to females. In contrast, Ryr2 is over-expressed in non-obese males and under-expressed in obese males in comparison to females. Tph1 is involved in the synthesis of serotonin, which is known to modulate appetite, energy expenditure and thermogenesis. Tph1-deficient mice fed a high-fat diet are protected from obesity, insulin resistance and NAFLD [51].

Functional analysis of the 1441 genes that were uniquely DE in the Male-Nondiabetic-Normal versus the Female-Nondiabetic-Normal comparison produced enriched processes like abnormal bone volume (MP:0010874), increased mesenteric fat pad weight (MP:0009298), unsaturated fatty acid and lipid metabolic process (GO:0033559, GO:0006629), regulation of hormone levels (GO:0010817), and a number of terms related to inflammatory processes, such as abnormal IgG2a level (MP:0020176), increased susceptibility to autoimmune diabetes (MP:0004803) and small intestinal inflammation (MP:0003306). The list of prioritised genes consisted of 21 genes from GO and 23 genes from MGI, with 9 overlapping genes. Genes involved in inflammatory processes, such as Il2, Il4, Il10, and Il1r1, were predominantly over-expressed in female mice. Vdr and Lepr were also over-expressed in females. Males had over-expressed Mrap2 and Oprm1. Mrap2 encodes a protein that modulates melanocortin receptor signalling. Mice deficient in Mrap2 exhibit severe obesity and a mutation in this gene may be associated with severe obesity in human patients [52].

Finally, functional analysis of the 1458 unique DE genes from the Male-Nondiabetic-Obese versus the Female-Nondiabetic-Obese comparison resulted again in enriched immune response related processes (GO:0006955) and positive regulation of inflammatory response (GO:0050729) with related genes also predominantly over-expressed in obese females. The list of prioritised genes consisted of 34 genes from GO and 20 genes from MGI Mammalian Phenotype, with 4 overlapping genes. Genes related to immunological processes include Ifng, Il6, and Tnf. Other enriched processes were muscle system processes (GO:0003012) and abnormal muscle contractility (MP:0005620), abnormal adrenaline level (MP:0003962), abnormal blood pH regulation (MP:0003027), digestion (GO:0007586) and decreased glucagon secretion (MP:0002711). Genes involved in these processes were predominantly under-expressed in male mice, for example, the Mc2r and Pyy. Notably, both Ins1 and Ins2 are also highly over-expressed in obese female mice compared to an almost zero level of expression in male mice. Mc2r belongs to the MCR family that plays an important role in appetite and energy regulation via leptin signalling. Increased MCRs expression in the liver has been associated with muscle tissue damage and potentially exerts a protective effect through metabolic regulation [53]. The common prioritised genes identified by both GO and MGI Mammalian Phenotype ontologies have been used to produce two heatmaps of male versus female comparisons which are presented in Fig. 2.

Fig. 2

Heatmap graphical representation of Male Obese mice versus Female Obese (right). The genes (y axis) are derived from the union of the prioritised lists produced


Obesity encompasses a complex array of traits closely related to the development of type 2 diabetes, metabolic syndrome and associated with comorbidities such as cardiovascular disease, hypertension, atherosclerosis and various cancers. In this study we have used a systems biology approach to examine genetic susceptibility and hepatic gene response to obesity in CC mice. The CC panel is a unique resource with wide genetic diversity that enables complex trait genomic mapping at a very high resolution. Moreover, it provides the opportunity to study differences in genetic expression in a diverse genetic reference population under controlled environmental conditions.

To facilitate this analysis, we developed a unified, automated workflow using Docker containers that can be simultaneously deployed on different computation infrastructures and can facilitate cross-platform collaboration. This is achieved by creating an identical insulated and stable operating environment that can be precisely controlled resulting in consistency overtime, without the worry of conflicting dependencies, UNIX compatibility issues and system updates. Docker containers can be used effectively to address some of the major setbacks of microarray and next generation sequencing technologies. The fast start-up time of Docker containers opens up the ability to chain together the execution of multiple containers and the development of sophisticated workflows for genomic analyses.

This work takes into consideration the overall analytical performance of the workflow as well as computational cost. Our aim was to develop an automated, accurate, and computationally efficient analytical process, while keeping the computational requirements at a minimum. Exploiting the minimal performance loss introduced by the Docker engine we developed a modular architecture for the integrative analysis of QTL and RNA-seq data that can be run on a personal computer (with approximately 4 GB of RAM or higher). For the aforementioned purpose the bioinformatic tools have been picked according to their overall effectiveness and their computational cost as described in the methods. The workflow we propose requires no more than 4.3 GB of RAM, 10 GB of physical drive and can run a top-down analysis of 20 GB of input data in less than 3 h (these values may change depending on whether multiprocessing or multi-threading are used).

Using our workflow, we mapped two QTLs related to the body weight gain phenotype, one on chromosome 5 for male mice and one on chromosome 3 for female mice. The QTL on chromosome 5 has already been described as significant in relation to the percentage of body fat, in response to an atherogenic diet, where Ppargc1a was suggested to be a candidate gene through comparative genomics and haplotype analysis [54]. There has also been evidence that suggests Ppargc1a activity may be significantly influenced by sex, although results to that direction have been contradictory with the rs8192678 allele influencing the risk of developing obesity in men but not in women, while in female mice Ppargc1a expression is estrogen regulated and has a protective effect against obesity-induced oxidative damage [55, 56]. In our analysis Ppargc1a is under-expressed in obese versus normal females. Additional genes involved in pathways closely related to the obesity phenotype highlighted by our analysis as potential candidates that play a role in male mice susceptibility to obesity are Cckar, Sod3, Med28, and Slit2.

QTLs on chromosome 3 have been previously described to be involved in the percentage of body fat and heat loss, without the introduction of a high fat diet [57, 58]. To our knowledge, this is the first time that the two mapped QTL are described in a sex specific context. It is notable that while the QTL and DE do not overlap significantly, they converge at the functional level, pointing to potentially common regulatory mechanisms. For example, in female mice there is a single differentially expressed gene located inside the QTL on chr3, Cyp2u1, a hydroxylase that is involved in the metabolism of long chain fatty acids. However, the two gene lists derived from the QTL and differential expression analysis share a high number of common relevant enriched pathways, such as increased energy expenditure (MP:0004889), decreased susceptibility to diet-induced obesity (MP:0005659) and increased circulating free fatty acid level (MP:0001554).

The comparison of male and female DE results (presented in supplementary Tables 3 and 4) showed that male and female mice have completely different mechanisms of response to HFD induced obesity. Sex differences in glucose metabolism have been previously described in mouse strains. Males have been found to be more prone to developing insulin resistance and obesity than females after being fed a high-fat / high carbohydrate diet. In addition, the same study showed that estrogen contributes to insulin sensitivity in females, and testosterone exacerbates insulin resistance in C57BL/6 J mice [59].

In our results, ectopic expression of Pyy and Ins2 in the liver appear to be coping mechanisms utilised by female obese mice but not males. We also observed extrapancreatic expression of Pnlip in female obese mice. A hepatic to pancreatic switch that occurs in a compensatory mode under stress conditions has been previously described [60]. Under-expression of Lepr indicates liver resistance to leptin, which leads to impaired hepatic insulin sensitivity, regulation of lipid metabolism and glucose homeostasis in female obese mice [61, 62]. Our findings regarding the cases of male nondiabetic obese mice when compared with male nondiabetic normal mice indicate that obesity in the male population heavily affects genes related to the skeletal and cardiac tissues, which agrees with previous studies connecting obesity with impaired muscular structure and function, as well as abnormal regulation of insulin secretion and glucose homeostasis [46, 63, 64]. Diet induced obesity has been shown to alter skeletal muscle fiber types in male but not in female mice [65].

A direct comparison of DE genes in female versus male mice subjected to HFD (the complete lists of DE genes of Male-Nondiabetic-Normal vs Female-Nondiabetic-Normal and Male-Nondiabetic-Obese vs Female-Nondiabetic-Obese, are available in Supplementary Tables 5 and 6, respectively) confirms gender specific differences in the responses of both resistant and susceptible individuals. Male mice appear to undergo and respond to oxidative stress induced by the HFD by over-expressing antioxidant enzymes such as Gstp1 and Nox4 (Fig. 3). They also respond to the metabolic stress they undergo by over-expressing metabolism modulating genes, such as Cav1 (Fig. 3). Additionally, as illustrated in Fig. 3, female mice over-express genes involved in the regulation of the inflammatory response, such as Il1b, in comparison to male mice. A previous study of the sex-dependent inflammatory response of HFD fed mice has shown that male mice tend to develop low-grade systemic inflammation, while female mice are protected through expansion of their population of anti-inflammatory T lymphocytes [66]. As systemic low-grade inflammation caused by obesity is believed to play a role in insulin resistance, it is of great interest to further look into sex-dependent inflammatory responses and coping mechanisms.

Fig. 3

Heatmap graphical representation of the most highly prioritised genes that are common in Male vs Female comparisons and their respective ontologies based on GO produced by BioInfoMiner. Genes marked blue are overexpressed in Female mice while genes marked red are overexpressed in Male mice

Our work addresses data integration from analytical platforms at the genomic and trancsriptomic level, but we expect that our findings can be extended to accommodate broader integration schemes and result in a variety of computational genomic analysis pipelines. Moreover, our findings highlight the potential of Docker technologies for the development of prototype analytical workflows that suit the individual needs of different research objectives. As a future step in this project, we plan to develop a fully automated, dockerised workflow to perform eQTL analysis on Collaborative Cross mice, which is a significantly more computationally demanding task and will greatly benefit from the possibility of fast and reliable remote deployment.


We observed that the genetic mechanisms which underlie susceptibility and response to HFD induced obesity differ in female and male mice. This clear distinction in the systemic response to the HFD challenge and obesity between male and female mice points to the need for further research into distinct sex-related mechanisms in metabolic disease in humans as well.

Moreover, the integration of data using ontological functional analysis, which showed that different genes but similar pathways are involved in the genetic susceptibility and are disturbed in diet induced obesity, demonstrate the importance of integrating genomic and transcriptomic data at the level of signaling pathways rather than focusing on genes. This way the collective background biological knowledge is meaningfully utilised to systemically interpret results at the genomic scale.

Availability of data and materials

The dataset(s) supporting the conclusions of this article is (are) available in the GEO (and SRA) repositories, under the accession number GSE126490 at

All docker images used in this analysis are freely available on Docker Hub at enios/rnaseq-qtl repository, Accompanying Python scripts and documentation are available on Github at e-nios/RNA-seq-QTL repository,

The genotype data of the CC mouse lines are available on the UNC Systems Genetics website,



Collaborative Cross


Genetic reference panel


High-fat diet


Recombinant inbred lines


Body mass index


Quantitative trait loci


Central processing unit


Tel-Aviv University


Standard error


Intraperitoneal glucose tolerance test




RNA Integrity Number


log2 fold change


Gene ontology


Mouse genome informatics


Confidence interval


Multidimensional scaling


Differential expression / differentially expressed


Genome-wide association study


Sphingomyelin synthase 2


Hydroxyacyl-coenzyme A dehydrogenase


Lymphoid enhancer binding factor 1


Dickkopf WNT signaling pathway inhibitor 2


Epidermal growth factor


Peroxisome proliferative activated receptor, gamma, coactivator 1 alpha


Cholecystokinin A receptor


Superoxide dismutase 3


Mediator complex subunit 28


Slit homolog 2


Uncoupling protein 1


Leptin receptor


Pancreatic lipase


Peptide YY


Insulin I


Insulin II


Nitric oxide synthase 1


Ryanodine receptor 1


Ryanodine receptor 2






Caveolin 1


Epidermal growth factor receptor


Glutathione S-transferase




NADPH oxidase 4


Non-alcoholic fatty liver disease


Estrogen receptor 1


Interleukin 1 beta


Prostaglandin-endoperoxide synthase 2


Tryptophan hydroxylase 1


Interleukin 2


Interleukin 4


Interleukin 6


Interleukin 10


Interleukin 1 receptor


Vitamin D receptor


Melanocortin 2 receptor accessory protein 2


Opioid receptor, mu 1


Interferon gamma


Tumour necrosis factor


Melanocortin 2 receptor


Melanocortin receptor


Cytochrome P450 family 2 subfamily U member 1


  1. 1.

    Churchill, G. A., Airey, D. C., Allayee, H., et al., and Consortium, C. T. The collaborative cross, a community resource for the genetic analysis of complex traits. Nat Genet. 2004;36:1133–1137.

  2. 2.

    Mott R, Talbot CJ, Turri MG, Collins AC, Flint J. A method for fine mapping quantitative trait loci in outbred animal stocks. Proc Natl Acad Sci. 2000;97(23):12649–54.

    Article  PubMed  Google Scholar 

  3. 3.

    Valdar W, Holmes CC, Mott R, Flint J. Mapping in structured populations by resample model averaging. Genetics. 2009;182(4):1263–77.

    Article  PubMed  PubMed Central  Google Scholar 

  4. 4.

    Fu C-P, Welsh CE, de Villena FP-M, McMillan L. Inferring ancestry in admixed populations using microarray probe intensities. In: Proceedings of the ACM conference on bioinformatics, computational biology and biomedicine - BCB '12: ACM Press; 2012.

  5. 5.

    Yang L, Colditz GA. Prevalence of overweight and obesity in the United States, 2007-2012. JAMA Intern Med. 2015;175(8):1412–3.

    Article  PubMed  PubMed Central  Google Scholar 

  6. 6.

    Felter, W., Ferreira, A., Rajamony, R., and Rubio, J. An updated performance comparison of virtual machines and linux containers. Technical report, RC25482 (AUS1407–001), IBM Research Division. 2014.

    Google Scholar 

  7. 7.

    Tommaso PD, Palumbo E, Chatzou M, Prieto P, Heuer ML, Notredame C. The impact of Docker containers on the performance of genomic pipelines. PeerJ. 2015;3:e1273.

    Article  PubMed  PubMed Central  Google Scholar 

  8. 8.

    Soller M, Iraqi F. The collaborative cross - A next generation mouse genetic resource population for high resolution genomic analysis of complex traits. Livest Sci. 2014;166:19–25.

    Article  Google Scholar 

  9. 9.

    Iraqi FA, Mahajne M, Salaymah Y, Sandovski H, Tayem H, Vered K, Balmer L, Hall M, Manship G, Morahan G, Pettit K, Scholten J, Tweedie K, Wallace A, Weerasekera L, Cleak J, Durrant C, Goodstadt L, Mott R, Yalcin B. The genome architecture of the collaborative cross mouse genetic reference population. Genetics. 2012;190(2):389–401.

    CAS  Article  Google Scholar 

  10. 10.

    Iraqi FA, Churchill G, Mott R. The collaborative cross, developing a resource for mammalian systems genetics: a status report of the Wellcome Trust cohort. Mamm Genome. 2008;19(6):379–81.

    Article  PubMed  Google Scholar 

  11. 11.

    Atamni HJ, Mott R, Soller M, Iraqi FA. High-fat-diet induced development of increased fasting glucose levels and impaired response to intraperitoneal glucose challenge in the collaborative cross mouse genetic reference population. BMC Genet. 2016;17:10.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. 12.

    Boettiger C, Eddelbuettel D. An introduction to rocker: Docker containers for R; 2017.

    Google Scholar 

  13. 13.

    Andrews, S.. Fastqc: a quality control tool for high throughput sequence data. 2010. Available online at:

    Google Scholar 

  14. 14.

    Williams CR, Baccarella A, Parrish JZ, Kim CC. Trimming of sequence reads alters RNA-seq gene expression estimates. BMC Bioinformatics. 2016;17(1):103.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. 15.

    Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17(1):10.

    Article  Google Scholar 

  16. 16.

    Krueger, F.. Trim galore: a wrapper tool around Cutadapt and FastQC to consistently apply quality and adapter trimming to FastQ files, with some extra functionality for MspI-digested RRBS-type libraries. 2014. Available online at

    Google Scholar 

  17. 17.

    Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12(4):357–60.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  18. 18.

    Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2013;30(7):923–30.

    Article  CAS  PubMed  Google Scholar 

  19. 19.

    Robinson MD, McCarthy DJ, Smyth GK. edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009;26(1):139–40.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. 20.

    Su, S.. Edger. R script. 2014. Available at

    Google Scholar 

  21. 21.

    Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11(3):R25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. 22.

    Koutsandreas T, Binenbaum I, Pilalis E, Valavanis I, Papadodima O, Chatziioannou A. Analyzing and visualizing genomic complexity for the derivation of the emergent molecular networks. Int J Monit Surveill Technol Res. 2016;4(2):30–49.

    Google Scholar 

  23. 23.

    Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Smith CL, Eppig JT. The mammalian phenotype ontology as a unifying standard for experimental and high-throughput phenotyping data. Mamm Genome. 2012;23(9–10):653–68.

    Article  PubMed  PubMed Central  Google Scholar 

  25. 25.

    Yang H, Ding Y, Hutchins LN, Szatkiewicz J, Bell TA, Paigen BJ, Graber H, de Villena FP-M, Churchill GA. A customized and versatile high-density genotyping array for the mouse. Nat Methods. 2009;6(9):663–6.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  26. 26.

    Durrant C, Tayem H, Yalcin B, Cleak J, Goodstadt L, de Villena FP-M, Mott R, Iraqi FA. Collaborative cross mice and their power to map host susceptibility to Aspergillus fumigatus infection. Genome Res. 2011;21(8):1239–48.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  27. 27.

    Gatti, D. SNPtools: accessing, subsetting and plotting mouse SNPs. 2015.

    Google Scholar 

  28. 28.

    Link JC, Reue K. Genetic basis for sex differences in obesity and lipid metabolism. Annu Rev Nutr. 2017;37(1):225–45.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Li Z, Zhang H, Liu J, Liang C-P, Li Y, Li Y, Teitelman G, Beyer T, Bui HH, Peake DA, Zhang Y, Sanders PE, Kuo M-S, Park T-S, Cao G, Jiang X-C. Reducing plasma membrane sphingomyelin increases insulin sensitivity. Mol Cell Biol. 2011;31(20):4205–18.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  30. 30.

    Schulz N, Himmelbauer H, Rath M, van Weeghel M, Houten S, Kulik W, Suhre K, Scherneck S, Vogel H, Kluge R, Wiedmer P, Joost H-G, Schürmann A. Role of medium- and short-chain l-3-hydroxyacyl- CoA dehydrogenase in the regulation of body weight and thermogenesis. Endocrinology. 2011;152(12):4641–51.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  31. 31.

    Fuster JJ, Zuriaga MA, Ngo DT-M, Farb MG, Aprahamian T, Yamaguchi TP, Gokce N, Walsh K. Noncanonical Wnt signaling promotes obesity-induced adipose tissue inflammation and metabolic dysfunction independent of adipose tissue expansion. Diabetes. 2014;64(4):1235–48.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. 32.

    Vandenbeek R, Khan NP, Estall JL. Linking metabolic disease with the PGC-1α Gly482Ser polymorphism. Endocrinology. 2017;159(2):853–65.

    Article  CAS  Google Scholar 

  33. 33.

    Funakoshi A, Miyasaka K, Jimi A, Kawanai T, Takata Y, Kono A. Little or no expression of the cholecystokinin-a receptor gene in the pancreas of diabetic rats (Otsuka Long-Evans Tokushima Fatty = OLETF rats). Biochem Biophys Res Commun. 1994;199(2):482–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  34. 34.

    Cui R, Gao M, Qu S, Liu D. Overexpression of superoxide dismutase 3 gene blocks high-fat diet-induced obesity, fatty liver and insulin resistance. Gene Ther. 2014;21(9):840–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  35. 35.

    Youn DY, Xiaoli AM, Pessin JE, Yang F. Regulation of metabolism by the mediator complex. Biophys Rep. 2016;2(2–4):69–77.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  36. 36.

    Svensson KJ, Long JZ, Jedrychowski MP, Cohen P, Lo JC, Serag S, Kir S, Shinoda K, Tartaglia JA, Rao RR, Chédotal A, Kajimura S, Gygi SP, Spiegelman BM. A secreted Slit2 fragment regulates adipose tissue thermogenesis and metabolic function. Cell Metab. 2016;23(3):454–66.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  37. 37.

    Weibel EK, Hadvary P, Hochuli E, Kupfer E, Lengsfeld H. Lipstatin, an inhibitor of pancreatic lipase, produced by Streptomyces toxytricini. Producing organism, fermentation, isolation and biological activity. J Antibiot. 1987;40:1081–5.

    CAS  Article  Google Scholar 

  38. 38.

    Inagaki T, Dutchak P, Zhao G, Ding X, Gautron L, Parameswara V, Li Y, Goetz R, Mohammadi M, Esser V, Elmquist JK, Gerard RD, Burgess SC, Hammer RE, Mangelsdorf DJ, Kliewer SA. Endocrine regulation of the fasting response by PPARα-mediated induction of fibroblast growth factor 21. Cell Metab. 2007;5(6):415–25.

    CAS  Article  Google Scholar 

  39. 39.

    Boey D, Lin S, Enriquez RF, Lee NJ, Slack K, Couzens M, Baldock PA, Herzog H, Sainsbury A. PYY transgenic mice are protected against diet-induced and genetic obesity. Neuropeptides. 2008;42(1):19–30.

    CAS  Article  Google Scholar 

  40. 40.

    Kojima H, Fujimiya M, Matsumura K, Nakahara T, Hara M, Chan L. Extrapancreatic insulin-producing cells in multiple organs in diabetes. Proc Natl Acad Sci U S A. 2004;101:2458–63.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  41. 41.

    Pierobon N, Renard-Rooney DC, Gaspers LD, Thomas AP. Ryanodine receptors in liver. J Biol Chem. 2006;281(45):34086–95.

    CAS  Article  PubMed  Google Scholar 

  42. 42.

    Hajnóczky G, Robb-Gaspers LD, Seitz MB, Thomas AP. Decoding of cytosolic calcium oscillations in the mitochondria. Cell. 1995;82:415–24.

    Article  PubMed  Google Scholar 

  43. 43.

    Assimacopoulos-Jeannet FD, Blackmore PF, Exton JH. Studies on alpha-adrenergic activation of hepatic glucose output. Studies on role of calcium in alpha-adrenergic activation of phosphorylase. J Biol Chem. 1977;252:2662–9.

    CAS  PubMed  PubMed Central  Google Scholar 

  44. 44.

    Siersbæk M, Varticovski L, Yang S, Baek S, Nielsen R, Mandrup S, Hager GL, Chung JH, Grøntved L. High fat diet-induced changes of mouse hepatic transcription and enhancer activity can be reversed by subsequent weight loss. Sci Rep. 2017;7:40220.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. 45.

    Ackermann MA, Shriver M, Perry NA, Hu L-YR, Kontrogianni- Konstantopoulos, A. Obscurins: goliaths and davids take over non-muscle tissues. PLoS One. 2014;9(2):e88162.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. 46.

    Santulli G, Pagano G, Sardu C, Xie W, Reiken S, D’Ascia SL, Cannone M, Marziliano N, Trimarco B, Guise TA, Lacampagne A, Marks AR. Calcium release channel RyR2 regulates insulin release and glucose homeostasis. J Clin Investig. 2015;125(5):1968–78.

    Article  PubMed  Google Scholar 

  47. 47.

    Sansbury BE, Hill BG. Regulation of obesity and insulin resistance by nitric oxide. Free Radic Biol Med. 2014;73:383–99.

    CAS  Article  PubMed  Google Scholar 

  48. 48.

    Fernandez-Rojo MA, Ramm GA. Caveolin-1 function in liver physiology and disease. Trends Mol Med. 2016;22(10):889–904.

    CAS  Article  PubMed  Google Scholar 

  49. 49.

    Berglund ED, Lustig DG, Baheza RA, Hasenour CM, Lee-Young RS, Donahue EP, Lynes SE, Swift LL, Charron MJ, Damon BM, Wasserman DH. Hepatic glucagon action is essential for exercise-induced reversal of mouse fatty liver. Diabetes. 2011;60(11):2720–9.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  50. 50.

    Hsieh P-S, Jin J-S, Chiang C-F, Chan P-C, Chen C-H, Shih K-C. COX mediated inflammation in fat is crucial for obesity-linked insulin resistance and fatty liver. Obesity. 2009;17:1150.

    CAS  Article  PubMed  Google Scholar 

  51. 51.

    Crane JD, Palanivel R, Mottillo EP, Bujak AL, Wang H, Ford RJ, Collins A, Blümer RM, Fullerton MD, Yabut JM, Kim JJ, Ghia J-E, Hamza SM, Morrison KM, Schertzer JD, Dyck JRB, Khan WI, Steinberg GR. Inhibiting peripheral serotonin synthesis reduces obesity and metabolic dysfunction by promoting brown adipose tissue thermogenesis. Nat Med. 2014;21(2):166–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. 52.

    Asai M, Ramachandrappa S, Joachim M, Shen Y, Zhang R, Nuthalapati N, Ramanathan V, Strochlic DE, Ferket P, Linhart K, Ho C, Novoselova TV, Garg S, Ridderstrale M, Marcus C, Hirschhorn JN, Keogh JM, O'Rahilly S, Chan LF, Clark AJ, Farooqi IS, Majzoub JA. Loss of function of the melanocortin 2 receptor accessory protein 2 is associated with mammalian obesity. Science. 2013;341(6143):275–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  53. 53.

    Malik IA, Triebel J, Posselt J, Khan S, Ramadori P, Raddatz D, Ramadori G. Melanocortin receptors in rat liver cells: change of gene expression and intracellular localization during acute-phase response. Histochem Cell Biol. 2011;137(3):279–91.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. 54.

    Su Z, Korstanje R, Tsaih S-W, Paigen B. Candidate genes for obesity revealed from a C57BL/6J × 129S1/SvImJ intercross. Int J Obes. 2008;32(7):1180–9.

    CAS  Article  Google Scholar 

  55. 55.

    Ridderstråle M, Johansson LE, Rastam L, Lindblad U. Increased risk of obesity associated with the variant allele of the PPARGC1a Gly482Ser polymorphism in physically inactive elderly men. Diabetologia. 2006;49(3):496–500.

    Article  CAS  PubMed  Google Scholar 

  56. 56.

    Besse-Patin A, Léveillé M, Oropeza D, Nguyen BN, Prat A, Estall JL. Estrogen signals through peroxisome proliferator-activated receptor-γ coactivator 1α to reduce oxidative damage associated with diet-induced fatty liver disease. Gastroenterology. 2017;152(1):243–56.

    CAS  Article  PubMed  Google Scholar 

  57. 57.

    Moody DE, Pomp D, Nielsen MK, Van Vleck LD. Identification of quantitative trait loci influencing traits related to energy balance in selection and inbred lines of mice. Genetics. 1999;152:699–711.

    CAS  PubMed  PubMed Central  Google Scholar 

  58. 58.

    Brockmann GA, Haley CS, Renne U, Knott SA, Schwerin M. Quantitative trait loci affecting body weight and fatness from a mouse line selected for extreme high growth. Genetics. 1998;150:369–81.

    CAS  PubMed  PubMed Central  Google Scholar 

  59. 59.

    Parks BW, Sallam T, Mehrabian M, Psychogios N, Hui ST, Norheim F, Castellani LW, Rau CD, Pan C, Phun J, Zhou Z, Yang W-P, Neuhaus I, Gargalovic PS, Kirchgessner TG, Graham M, Lee R, Tontonoz P, Gerszten RE, Hevener AL, Lusis AJ. Genetic architecture of insulin resistance in the mouse. Cell Metab. 2015;21(2):334–47.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  60. 60.

    Shanmukhappa K, Mourya R, Sabla GE, Degen JL, Bezerra JA. Hepatic to pancreatic switch defines a role for hemostatic factors in cellular plasticity in mice. Proc Natl Acad Sci. 2005;102(29):10182–7.

    CAS  Article  PubMed  Google Scholar 

  61. 61.

    Amitani M, Asakawa A, Amitani H, Inui A. The role of leptin in the control of insulin-glucose axis. Front Neurosci. 2013;7.

  62. 62.

    Huynh FK, Neumann UH, Wang Y, Rodrigues B, Kieffer TJ, Covey SD. A role for hepatic leptin signaling in lipid metabolism via altered very low density lipoprotein composition and liver lipase activity in mice. Hepatology. 2012;57(2):543–54.

    Article  CAS  PubMed  Google Scholar 

  63. 63.

    Tomlinson DJ, Erskine RM, Morse CI, Winwood K, Onambélé-Pearson G. The impact of obesity on skeletal muscle strength and structure through adolescence to old age. Biogerontology. 2015;17(3):467–83.

    Article  PubMed  PubMed Central  Google Scholar 

  64. 64.

    Choi KM. Sarcopenia and sarcopenic obesity. Korean J Intern Med. 2016;31(6):1054–60.

    Article  PubMed  PubMed Central  Google Scholar 

  65. 65.

    DeNies MS, Johnson J, Maliphol AB, Bruno M, Kim A, Rizvi A, Rustici K, Medler S. Diet-induced obesity alters skeletal muscle fiber types of male but not female mice. Phys Rep. 2014;2(1):e00204.

    Article  CAS  Google Scholar 

  66. 66.

    Pettersson US, Waldén TB, Carlsson P-O, Jansson L, Phillipson M. Female mice are protected against high-fat diet induced metabolic syndrome and increase the regulatory T cell population in adipose tissue. PLoS One. 2012;7(9):e46057.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

Download references


Not applicable


This work was funded by the Operational Program” Competitiveness, Entrepreneurship and Innovation 2014–2020″ (co-funded by the European Regional Development Fund) and managed by the General Secretariat of Research and Technology, Ministry Of Education, Research & Religious Affairs, under the project” Innovative Nanopharmaceuticals: Targeting Breast Cancer Stem Cells by a Novel Combination of Epigenetic and Anticancer Drugs with Gene Therapy (INNOCENT)” (7th Joint Translational Call - 2016, European Innovative Research and Technological Development Projects In Nanomedicine) of the ERA-NET EuroNanoMed II and by “BioS: Digital Skills on Computational Biology”, ΕRASMUS+, EACEA/04/2017, “KA2: Cooperation for innovation and the exchange of good practices - Sector Skills Alliances” for the development of the bioinformatic analysis pipeline, Hendrech and Eiran Gotwert Fund for studying diabetes, Wellcome Trust grants 085906/Z/08/Z, 075491/Z/04 and 090532/Z/09/Z; for financially supporting the initiation, breeding, genotyping and maintaining the CC mouse colony at TAU, core funding by Tel-Aviv University (TAU) for supporting staff at Iraqi’s lab, Israeli Science foundation (ISF) grants 429/09 and 1085/18, Binational Science Foundation (BSF) grant number 2015077 and German Israeli Science Foundation (GIF) grant I-63-410.20-2017 for performing the mouse assessment for T2D and technical staff, and European Sequencing and Genotyping Infrastructure (ESGI) consortium grant for conducting the RNAseq analysis.

Ilona Binenbaum’s PhD thesis was supported by a scholarship from the State Scholarship Foundation in Greece (IKY) (Operational Program “Human Resources Development - Education and Lifelong Learning” Partnership Agreement (PA) 2014–2020).

Author information




IB was involved with data analysis and writing the MS; HJAA was involved with project design, data collection and analysis, and writing the MS; GF and GK were involved with data analysis and writing the MS; TK and EP were involved with data analysis; RM and HH were involved with project design; FAI is the co-corresponding author, involved with project design, data analysis and writing the MS. AC was involved with data analysis and writing the MS. All authors have read and approved the manuscript.

Corresponding authors

Correspondence to Fuad A. Iraqi or Aristotelis A. Chatziioannou.

Ethics declarations

Ethics approval and consent to participate

The Institutional Animal Care approved all experiments in this study and Use Committee (IACUC) at TAU, which adheres to the Israeli guidelines, which follow the NIH/USA animal care and use protocols (approved experiment number M-12-025 and M-14-007).

Consent for publication

Not applicable

Competing interests

Dr. Himmelbauer is an Associate Editor of BMC Genomics and was blinded from the review process.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Table S1. Median values of baseline body weight of mice at the start of the dietary challenge and body weight gain after 12 weeks of dietary challenge for each CC-line.

Additional file 2:

Table S2. CC line, sex, baseline body weight (BW0) at the start of the dietary challenge, body weight gain after 12 weeks (ΔBW0-12) of dietary challenge, aNA blood glucose levels at T180 after IPGTT challeng (BGM-T180) for all mice included in the analysis.

Additional file 3:

Table S3. List of differentially expressed genes from the female non-diabetic normal vs. female non-diabetic obese mice with the logFC and adjusted p-values.

Additional file 4:

Table S4. List of differentially expressed genes from the male non-diabetic normal vs. male non-diabetic obese mice with the logFC and adjusted p-values.

Additional file 5:

Table S5. List of differentially expressed genes from the male non-diabetic normal vs. female non-diabetic normal mice with the logFC and adjusted p-values.

Additional file 6:

Table S6. List of differentially expressed genes from the male non-diabetic obese vs. female non-diabetic obese mice with the logFC and adjusted p-values.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Binenbaum, I., Atamni, H.AT., Fotakis, G. et al. Container-aided integrative QTL and RNA-seq analysis of Collaborative Cross mice supports distinct sex-oriented molecular modes of response in obesity. BMC Genomics 21, 761 (2020).

Download citation


  • Collaborative Cross
  • Obesity
  • Sex-differences
  • High-fat diet
  • QTL
  • RNAseq