Determination of dosage compensation and comparison of gene expression in a triploid hybrid fish
BMC Genomics volume 18, Article number: 38 (2017)
Polyploidy and hybridization are both recognized as major forces in evolution. Most of our current knowledge about differences in gene regulation in polyploid hybrids comes from plant studies. The gene expression of diverged genomes and regulatory interactions are still unclear in lower vertebrates.
We generated 229 million cleaned reads (42.23 Gbp) from the triploid offspring of maternal grass carp (Ctenopharyngodon idellus, Cyprininae, 2n = 48) × paternal blunt snout bream (Megalobrama amblycephala, Cultrinae, 2n = 48) and from their diploid parents using next-generation sequencing. In total, 157,878 contigs were assembled and 15,444 genes were annotated. We examined changes in gene expression levels among the parents and their triploid offspring. In the triploid fish, we identified dosage compensation mechanisms that reduced triploid expression levels to the diploid state, and we also observed novel gene expression and gene silencing. We then established a model to determine the extent and direction of expression level dominance (ELD) and homoeolog expression bias (HEB) based on the relative expression levels among the parents and their triploid offspring.
Our results showed that genome-wide ELD was biased toward the maternal genome in the triploid. Extensive alterations in homoeolog expression suggested a combination of regulatory and epigenetic interactions through the transcriptome network. Additionally, the expression patterns of growth genes provided insights into the relationship between growth characteristics and the underlying mechanisms in triploids. The regulation patterns of the triploid state suggest that the various expression levels arising from the initial genomic merger have important roles in adaptation.
Polyploid hybrids, which play a role in the origin of plant and animal species, have been studied for many years. Hybridization has often been viewed as a destructive process that counteracts speciation and delays evolution. However, biologists increasingly find new examples in which hybridization appears to facilitate speciation and adaptive radiation in animals and plants. Although polyploidy and hybridization can be viewed separately, the two processes often occur together in the form of allopolyploidy. Allotriploidy has rarely been found in lower vertebrates; exceptions include the triploid edible frog Rana esculenta, the triploid cyprinid fish of the Squalius alburnoides complex, the triploid of Ctenopharyngodon idellus × Megalobrama amblycephala, and the triploid of Carassius auratus red var. and Cyprinus carpio [6, 7]. The coexistence of divergent parental genomes begins with heterozygosity and heterosis in F1 hybrids, whereas gene redundancy shields hybrids from the deleterious effects of mutations [2, 8].
The molecular mechanisms of gene expression regulation in allotetraploids are well studied in plants. However, only a few animal species, mostly insects and fish, have been recognized as the result of hybridization and polyploidy. Consequently, little has been done to understand the effects of ploidy increases on gene regulation and their impact on the evolutionary potential of populations. Both the Squalius alburnoides complex and triploid Chinook salmon are appropriate systems for studying gene copy silencing, which is attributed to complex dosage-compensation mechanisms [5, 9–11]. Although the responsible molecular mechanisms have not been determined, some hypotheses have been proposed to explain this fundamental biological phenomenon. In cyprinid fishes, a few reports have described the dosage effect of the house-keeping gene β-actin between triploids and diploids, in which the ratio of absolute expression levels was estimated to be 1:1. This gene could therefore be used as an internal control in studies of mRNA and microRNA expression levels in triploids [12–15]. Additionally, a dosage effect of functional genes, including growth hormone, was detected in triploid salmon. Although triploids also exhibited higher narrow-sense heritability values than diploid salmon, maternal effects were estimated to be generally lower in triploids than in diploids. The dosage effects resulting from adding an extra set of chromosomes to the maternal genome are primarily additive.
Hybridization can produce a stable hybrid that is distinct from either parent if reproductive isolation is weak. Therefore, hybrid species are usually considered a third cluster of genotypes. However, evolution normally occurs by small adjustments rather than by saltation. The expression pattern of homologous genes is the focus of our attention. Recent reports show that duplicate gene pairs in hybrids may display homoeolog expression bias (HEB), in which the two homoeologs are expressed unequally and their expression often varies among tissues [19, 20]. Epigenetic remodeling, including nuclear enlargement and the increased complexity of processes during cell division, results in both the activation and suppression of gene expression in polyploids. In addition to HEB, a second phenomenon was described more recently: expression silencing of parental homoeologs and the formation of novel genes are among the consequences that a new polyploid genome may experience [21, 22]. Unlike genome diploidization in autotetraploids, the merger of the A and D genomes in hybrids often resulted in a variety of expression regulation changes in either parental homoeolog, and differential homoeolog expression and homoeolog silencing patterns were reported in allopolyploid cotton and fungi [23, 24].
The molecular mechanisms, and even the specific biological processes, involved in changes in gene expression levels in polyploids are largely unknown. Differences in growth and survival are commonly observed at early stages in allopolyploids. Triploids of Ctenopharyngodon idellus × Megalobrama amblycephala are reported to have significantly higher growth rates than their diploid parents. Hybrid growth disorders, also known as growth dysplasia, refer to the decreased growth or overgrowth observed in hybrid individuals. A study of hybrid mice that investigated the possible causes of hybrid growth disorders revealed that gene imprinting had a major effect. At the same time, the increased amount of DNA may result in a larger cell volume in polyploids relative to their diploid progenitors [27, 28]. However, comparisons of inbred diploid and polyploid salamanders and mice indicate that the larger cells of polyploids do not necessarily result in larger bodies. Instead, a developmental mechanism regulates organ growth to compensate for cell size. Another hypothesis holds that the larger cells of polyploids are associated with high metabolic rates and therefore result in high growth rates. After triploidization, changes in growth function in triploids would be determined by a variety of growth regulation mechanisms.
In this study, we investigated the liver transcriptome in diploid parents (Ctenopharyngodon idellus, ♀ × Megalobrama amblycephala, ♂) and their triploid offspring. The three sets of chromosomes allowed us to analyze the global expression level in the triploid. Compared with the expression levels of the diploid parents, we detected a negative dosage effect in triploids. The genomic constitution of two sets of maternal homoeologs and one set of paternal homoeologs then allowed us to investigate the expression patterns in the triploid offspring. We characterized gene expression patterns according to 12 possible categories, including mid-parents, up- and down-parent, maternal-dominance, and paternal-dominance [22, 32]. The aim of this study was to assess the magnitude and directionality of ELD and HEB in triploids. Furthermore, we examined the expression patterns of growth-related genes in the triploid offspring and the inbred parents, and we discuss their relationship with the trait of rapid growth. These results provide a novel perspective on expression regulation in triploids and hint at the underlying mechanism of triploidy.
To examine the changes in the global transcriptomic profile in the triploid of Ctenopharyngodon idellus and Megalobrama amblycephala (GB), we obtained nine liver transcriptomes from maternal Ctenopharyngodon idellus (GC), paternal Megalobrama amblycephala (BSB), and triploid offspring GB (Fig. 1).
Paired-end sequencing (PE × 90) was performed on the nine libraries from the two parents and their triploid offspring. The basic information is summarized in Table 1. After the initial adapter trimming and quality filtering, we collected a total of 299.03 million cleaned reads from the nine libraries (Table 1). We then assembled the 100.14 (BSB), 96.77 (GC) and 102.12 (GB) million cleaned reads (42.23 Gb) separately using Trinity. Among the 157,878 contigs assembled for the three fish, the numbers of contigs ≥1000 bp were 11,190 in paternal BSB, 9,873 in maternal GC, and 11,005 in triploid GB (Table 1).
Using BLASTX (e-value ≤ 1e−6) against the NCBI-NR, Swiss-Prot, Kyoto Encyclopedia of Genes and Genomes (KEGG), Clusters of Orthologous Groups (COG) and Gene Ontology (GO) databases (alignment length ≥100 bp), 28,950 sequences from paternal BSB, 29,110 sequences from maternal GC, and 29,255 sequences from triploid GB were identified as annotated sequences. The distribution of annotated sequences across these five public databases and the e-value distribution of annotated genes are shown in Additional file 1. After BLASTX alignment, we performed GO analysis (level 2). The distribution of gene annotations showed the functional differences between the parents and their hybrids (Additional file 2). To obtain more accurate information about gene expression in the three fish, our subsequent analysis focused on the 13,893 shared genes (Additional file 3).
Differential expression between diploid and triploid species
To investigate expression levels in the two diploid parents and their triploid offspring, a total of 157,878 contigs from the nine individuals were clustered with CD-HIT, yielding 95,702 reference transcript contigs (Additional file 4). The total reads from the nine samples were then mapped to the 95,702 reference transcripts using the BLAST-like alignment tool (Blat) (Additional file 5). Based on the read counts from the mapping results, we identified silenced genes (GB = 0, GC > 10, and BSB > 10) and novel genes (GB > 10, GC = 0, and BSB = 0) in the triploid offspring: 27 genes appeared to be silenced, and two genes exhibited a novel expression pattern (Additional file 6).
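The read-count rule for silenced and novel genes can be illustrated with a minimal sketch. The function name, the gene labels, and the read counts below are hypothetical and are not taken from the study; only the thresholds (0 and 10 reads) come from the text.

```python
def classify_presence(gb, gc, bsb, min_reads=10):
    """Label a gene from its read counts in triploid GB and parents GC and BSB."""
    # Silenced in the triploid: no reads in GB, more than 10 in each parent.
    if gb == 0 and gc > min_reads and bsb > min_reads:
        return "silenced"
    # Novel in the triploid: more than 10 reads in GB, none in either parent.
    if gb > min_reads and gc == 0 and bsb == 0:
        return "novel"
    return "other"


# Hypothetical read counts per gene, ordered as (GB, GC, BSB).
counts = {
    "gene_a": (0, 35, 22),
    "gene_b": (48, 0, 0),
    "gene_c": (12, 15, 9),
}
labels = {gene: classify_presence(*c) for gene, c in counts.items()}
print(labels)
```

How the counts were pooled across the three replicates per fish is not stated in this section, so the sketch takes one count per fish as given.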
To detect significant differential expression, a false discovery rate (FDR) < 0.001 and an absolute log2 ratio > 1 were used as thresholds in comparisons between the two parents and their triploid offspring. In all comparisons, the percentage of genes showing differential expression between the F1 triploids and the two parents was asymmetric (P < 0.05; Fisher’s exact test). Comparison of expression levels between the two parents revealed that 2,446 genes were up-regulated in paternal BSB and 2,376 genes were up-regulated in maternal GC (Fig. 2a and d). Between paternal BSB and triploid GB, 2,138 genes were up-regulated in BSB and 1,257 genes were up-regulated in GB (Fig. 2b and d). Between maternal GC and triploid GB, 2,483 genes were up-regulated in GC and 1,516 genes were up-regulated in GB (Fig. 2c and d).
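As a rough illustration of how the two thresholds combine, the following sketch calls a gene differentially expressed only when both conditions hold. The function name and the input values are hypothetical; the FDR is assumed to have been computed beforehand, and both expression values are assumed to be positive.

```python
import math


def call_differential(expr_a, expr_b, fdr, fdr_cutoff=0.001, lfc_cutoff=1.0):
    """Return 'up_in_a', 'up_in_b' or 'not_significant' for one gene.

    expr_a and expr_b are positive normalized expression values and fdr is
    a precomputed false discovery rate for the comparison of a with b.
    """
    log2_ratio = math.log2(expr_a / expr_b)
    if fdr < fdr_cutoff and abs(log2_ratio) > lfc_cutoff:
        return "up_in_a" if log2_ratio > 0 else "up_in_b"
    return "not_significant"


# Hypothetical genes: a 4-fold difference passes only with a small FDR.
print(call_differential(40.0, 10.0, 1e-5))   # up_in_a
print(call_differential(10.0, 40.0, 1e-5))   # up_in_b
print(call_differential(40.0, 10.0, 0.01))   # not_significant
print(call_differential(15.0, 10.0, 1e-6))   # not_significant
```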
To detect whether a dosage effect occurred in the triploids, we compared the predicted triploid expression level value (PT-ELV, also known as the in silico mid-parent value, C2 + B) with the actual triploid expression level value (AT-ELV) of GB (see Methods). In total, 4,048 genes (29.1%) were expressed at higher levels in the PT-ELV than in the AT-ELV of GB, whereas only 81 genes (0.6%) were expressed at higher levels in the AT-ELV (Fig. 3a and c). These results indicated that a negative dosage effect on the maternal GC-homoeologous chromosomes had occurred in the triploid offspring. Given this dosage effect, we calculated the predicted diploid expression level value (PD-ELV, also known as the in silico mid-parent value, C + B) and compared it with the AT-ELV. The 2,441 genes that were significantly differentially expressed included 2,232 (16.1%) with higher values in the PD-ELV and 209 (1.5%) with higher values in the AT-ELV of GB (Fig. 3b and c). These results suggest that both the negative dosage effect and another, unknown mechanism reduce triploid expression levels to the diploid state.
Expression patterns under dosage effect
Taking the dosage effect found in the triploid as a premise, we examined the expression levels arising from one paternal and one maternal set of chromosomes in the triploid. To better understand ELD and HEB under dosage effects, we established 12 categories to assess differential gene expression, comprising mid-parents (XI and XII), up/down expression (I, II, III, IV, V, and VI), and ELD (VII, VIII, IX and X) (see Methods). Among the 13,893 shared genes, 2,749 genes (19.8%) fell into the ELD categories (Fig. 4a). Maternal GC-ELD (1,645 genes, 11.8% of all genes, categories IX and X) had more influence than paternal BSB-ELD (1,104 genes, 7.9% of all genes, categories VII and VIII) in the triploid (Fig. 4a). Categories VII and X (GC vs BSB = 1.8 vs 1) represented up-regulated ELD, while down-regulated ELD (GC vs BSB = 1.3 vs 1) was detected in categories VIII and IX (Fig. 4a). The results showed that the number of HEB genes in the triploid was unbalanced with respect to the parent of origin and was inclined toward the maternal GC genome (paternal BSB bias vs maternal GC bias = 1,104 vs 1,645) (Fig. 4a). Compared with paternal BSB, triploid GB had 1,536 up-regulated genes (IV, V, VI, X, and XII) and 2,170 down-regulated genes (I, II, III, IX, and XI). Compared with maternal GC, the triploid had 1,144 up-regulated genes (IV, V, VI, VII, and XI) and 2,021 down-regulated genes (I, II, III, VIII, and XII) (Fig. 4a). Among genes that were up- or down-regulated relative to both parents, there was a global preference toward down-regulation (up-regulation vs down-regulation = 70 vs 586). In addition, 65.4% of genes (9,083 genes, “no change” category) showed expression levels similar to those in the parents.
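The category lists above imply a simple lookup from the direction of change relative to each parent. The sketch below is a simplified illustration, not the authors' implementation: it uses a fold-change cutoff alone (no FDR), the function names and expression values are hypothetical, and categories I-III and IV-VI are kept as two groups because their finer subdivision is not described in this section.

```python
def direction(triploid, parent, fold=2.0):
    """+1, -1 or 0 if the triploid is higher than, lower than, or similar to a parent."""
    if triploid > fold * parent:
        return 1
    if triploid * fold < parent:
        return -1
    return 0


# Keys are (direction vs maternal GC, direction vs paternal BSB),
# following the up/down category lists given in the text.
GROUPS = {
    (-1, -1): "I-III: down-regulated vs both parents",
    (1, 1): "IV-VI: up-regulated vs both parents",
    (1, 0): "VII: paternal BSB-ELD, up",
    (-1, 0): "VIII: paternal BSB-ELD, down",
    (0, -1): "IX: maternal GC-ELD, down",
    (0, 1): "X: maternal GC-ELD, up",
    (1, -1): "XI: mid-parent",
    (-1, 1): "XII: mid-parent",
    (0, 0): "no change",
}


def classify(gb, gc, bsb, fold=2.0):
    """Assign a triploid expression value to a category group."""
    return GROUPS[(direction(gb, gc, fold), direction(gb, bsb, fold))]


# Hypothetical expression values (GB, GC, BSB).
print(classify(100.0, 100.0, 10.0))  # X: maternal GC-ELD, up
print(classify(10.0, 100.0, 100.0))  # I-III: down-regulated vs both parents
print(classify(50.0, 10.0, 200.0))   # XI: mid-parent
```

A gene is placed in an ELD category when the triploid matches one parent but differs from the other, which is why categories VII and VIII carry a zero for BSB and categories IX and X carry a zero for GC.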
The expression level of growth genes in the hybrid
Using the 12-category model, comparison of GB with both parents indicated that hybridization and triploidization not only resulted in the up-regulation of some genes (70 genes, 0.5%, categories IV-VI) but also led to the down-regulation of a larger number of genes (586 genes, 4.2%, categories I-III). To study growth regulation in the triploid, we obtained 57 growth genes shared among the triploid offspring and their parents for the following analysis (Fig. 4a). Analysis of differential expression among these shared growth genes revealed that 7.0% (4 genes, categories IV-VI) were up-regulated and 10.5% (6 genes, categories I-III) were down-regulated (Table 2, Fig. 4a). The proportion of up-regulated genes among the growth genes was higher than that among all genes (P < 0.05; Fisher’s exact test).
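The enrichment of up-regulated genes among growth genes can be checked with a small hypergeometric calculation. This is a sketch under stated assumptions: the test is taken to be one-sided, the 57 growth genes are assumed to be a subset of the 13,893 shared genes, and the 4 up-regulated growth genes are assumed to be counted among the 70 up-regulated genes. The text does not confirm these details, and the function name is hypothetical.

```python
from math import comb


def fisher_one_sided(hits_in_subset, subset_size, hits_total, population):
    """Upper-tail hypergeometric probability P(X >= hits_in_subset)."""
    denom = comb(population, subset_size)
    upper = min(subset_size, hits_total)
    tail = sum(
        comb(hits_total, k) * comb(population - hits_total, subset_size - k)
        for k in range(hits_in_subset, upper + 1)
    )
    return tail / denom


# 4 of 57 growth genes up-regulated vs 70 of 13,893 shared genes overall.
p_value = fisher_one_sided(4, 57, 70, 13893)
print(f"one-sided P = {p_value:.2e}")
```

Under these assumptions the probability falls well below 0.05, in line with the significance level reported in the text.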
In the analysis of ELD among growth-regulated genes in the triploid, eleven genes exhibited paternal BSB-ELD and 13 genes showed maternal GC-ELD (Fig. 4a). The percentage of maternal GC-ELD among growth genes (22.8%) was higher than that among all genes (11.8%). The percentage of paternal BSB-ELD among growth genes (19.3%) was also higher than that among all genes (7.9%). Thus, the proportion of genes showing parental ELD was higher among growth-related genes than among other genes in the triploid. Eleven genes were considered to be mid-parent genes, and the remaining 12 growth-related genes showed no change in expression levels (Fig. 4a). The 21.1% of growth-related genes in the “No Change” category was lower than the 65.4% of total genes in that category (Additional file 7). These results suggest that expression changes in the triploid are more frequent in growth-related genes than in genes with other functions.
Real-time quantitative PCR (qPCR) validation
To validate the quality of RNA sequencing (RNA-Seq) data and the reliability of triploid expression level compared to both parents, we chose 10 representative differentially expressed genes (igfbp2b, igfbp5a, smad7, gdf6a, igf1, ctnnb1, igf2b, ppm1bb, gdf2, and insra) and performed qPCR on biological replicates in triplicate. The same trends in expression levels of these genes were detected by qPCR as were obtained from the RNA-Seq data analysis (Fig. 5). These results indicate that RNA-Seq data and associated analysis methods can be used to accurately detect differentially expressed genes.
Dosage effect in triploid fish
To investigate whether a regulatory mechanism acts on gene dosage in the triploid genome, we used the 13,893 shared genes in our analysis, because other genes may reflect assembly errors or differential transcript expression among the nine individual livers. We compared the AT-ELV and PT-ELV across all genes, and most of the differentially expressed genes (4,048; 29.1% of the shared genes) were down-regulated. This result (number of up-regulated vs down-regulated genes = 1 vs 49.9) suggests that dosage effects occur in 4,048 genes of the triploid (Fig. 3). A recent report suggested that silencing of one of the three sets of alleles would reduce transcript levels in triploid fish to the diploid state. In contrast to Caenorhabditis elegans hermaphrodites (XX), in which both X chromosomes are partially repressed, the dosage effect in vertebrates results in the silencing of one X chromosome. The existence of two identical sets of chromosomes in the nucleus may induce dosage compensation, which could result in the silencing of one set of maternal chromosomes (GC) in the triploid (GB). A similar result was reported in salmon [17, 35]. We also compared the AT-ELV with the PD-ELV to assess the percentages of up- and down-regulated genes (see Methods). The expression level (AT-ELV) of 2,232 genes was lower than that in the diploid state (PD-ELV), while only 209 genes showed higher expression than in the diploid state. This suggests that other mechanisms in the triploid offspring, such as changes in methylation levels accompanying triploidization, might down-regulate the expression of some alleles, so that expression levels in the triploid fall below those in the diploid state. These results helped us to understand the maternal GC-dosage effect on a subset of genes in the triploid offspring.
Homoeolog expression bias and expression level dominance
After characterizing the dosage effect in triploid fish, we used a novel method to analyze expression levels in triploids relative to their diploid parents. This is the first report of this phenomenon in triploid fish (previously referred to as genomic dominance in plants [22, 32]). The 12 categories of expression level patterns were described above. Our results showed that the percentage of genes showing ELD was 19.8% (2,749 genes) (Fig. 4a). Our method for analyzing the negative dosage effect was feasible. Although we used the PD-ELV as the normalization state, the down- vs up-regulated ratio was still 10.7:1 when the AT-ELV was compared with the PD-ELV (Fig. 4a). This suggested that the expression level of homoeologous genes was regulated after the genomes merged, which is a potential force behind the differential epigenetic regulation of the hybrid [22, 36]. Therefore, compared with the “no change” pattern in the initial predictions of triploid expression levels, the number of genes in the “no change” category was reduced because of the more feasible and detailed classification method that we used based on expression levels.
To assess global gene expression, we incorporated the negative dosage effect, that is, the silencing of one set of maternal GC homoeologs, into our analysis. Further analysis of triploid expression levels compared with either diploid parent demonstrated the preferential transcription of maternal GC homoeologs in the triploid (Fig. 4a). This phenomenon has commonly been described in polyploids and refers to the pattern of redundant genes being silenced. Approximately 25% of genes showed evidence of ELD in four allopolyploid cottons based on RNA-Seq data. In triploid Squalius alburnoides, the vasa gene illustrated genome ELD in the gonad, and the β-actin gene exhibited the same phenomenon in the gonad and liver. In addition to ELD, a second phenomenon has also been described: intermediate expression levels in the polyploid relative to the expression levels of the two parents. In our study, 10,488 genes (75.5%; XI, XII, and No Change categories) showed expression levels that were regulated by homoeologs from both parents (Fig. 4a). These phenomena have often been described in hybrids and polyploids based on total gene analysis [22, 39, 40]. However, intermediate expression levels can be organ-specific. For example, the rpl8 and gapdh genes only show a mid-parent expression level in the liver of triploid individuals.
Expression patterns of growth-related genes
The liver plays a major role in metabolism and has a number of functions, including the regulation of growth and development in fish. The study of the expression level of growth-related genes in triploid individuals is central to understanding the mechanism of the hybrid system. Here, we applied next-generation sequencing technology to study the relationship between growth rate and gene expression in a triploid. The dosage effect was evident in the global gene expression in the liver as we showed above. Genome-wide ELD shows maternal GC-HEB in the growth genes of triploid individuals.
In our study, 57 growth-related genes were screened from the categories of global gene expression (Fig. 4a). Four genes (7%) were up-regulated, a higher percentage than among all genes (0.5%) (P < 0.05; Fisher’s exact test). For example, igfbp2b and igfbp5a serve as carrier proteins for igf-1; they bind igf-1 in the liver, allowing growth hormone to act continuously on the liver to produce more igf-1. Up-regulated expression of these genes will help organisms to accumulate insulin-like growth factors and prolong their half-life (Table 2; Fig. 5f and g). Another up-regulated gene, igf-1, and a paternal BSB-ELD gene, igf2b, were shown to play roles in the promotion of cell proliferation (Table 2; Fig. 5a and b). The up-regulated expression of igf in the triploid was considered to play a crucial role in its faster growth rate relative to diploids. The last up-regulated growth-related gene, insra, encodes a transmembrane receptor that is activated by insulin and igf and belongs to the class of tyrosine kinase receptors (Table 2; Fig. 5h). The up-regulation of insra resulted in enhanced regulation of glucose homeostasis. Additionally, the down-regulated gene ppm1bb is known to be a negative regulator of cell stress response pathways, and overexpression of this phosphatase is reported to cause cell-growth arrest (Table 2; Fig. 5i).
The other expression patterns were the dominance and mid-parent expression level patterns, which included 35 growth-related genes (Fig. 4a); these patterns provided insight into new expression level patterns in triploids. For example, paternal BSB-ELD was evident for the gdf6 and gdf2 genes (Table 2; Fig. 5c and e), which are members of the BMP family and the TGF-β superfamily that regulate cell growth and differentiation in both embryonic and adult tissues. These genes also promote bone and joint formation. The hybrid individuals had higher expression levels than maternal GC. These small changes in expression level contributed to changes in growth regulation. In addition, compared with the diploid GC, the triploid had a significantly higher growth rate. Therefore, these mechanisms might play important roles in the regulation of growth by changing the expression levels of some growth-related genes. The maternal GC-ELD gene smad7 enhances muscle differentiation and plays a role in the negative feedback of TGF-β signaling (Table 2; Fig. 5j). These observations agreed with some previous reports on polyploid fish [46, 47]. The expression of the mid-parent gene ctnnb1 appeared to be positively regulated by paternal BSB, resulting in up-regulated expression in the triploid.
The differences in expression levels in triploid and the inbred diploid parents gave us a platform to investigate the rapid growth in triploid individuals. However, we should also investigate the gene expression changes that indirectly result in a change in growth traits. More research on these subjects will help us understand how the growth-related function was regulated in triploids. However, the observed results suggested that the rapid growth in triploids could be regulated by genes with a negative dosage effect.
Mechanism of various expression patterns
Recent evidence showed that dosage compensation resulted in novel epigenetic regulation in triploids. The current challenge is to determine which changes in regulatory mechanisms explain the observed differences in gene expression levels and the evolution of complex phenotypes [35, 48]. Epigenetic instability in polyploids was described recently [49, 50]. Increased gene copy numbers from different species usually lead to changes in gene expression, and such changes usually disrupt the steady state of the regulatory adaptations that were selected in the parents. However, these abundant expression level patterns in polyploids provide important material for adapting to various situations. Hybrids are likely to display regulatory alterations. These changes involve the silencing or activation of genes and DNA transposition of the Spm/CACTA family, as described in allopolyploids of Arabidopsis thaliana [51, 52]. Our study also showed the activation of two genes in the liver transcriptome (Additional file 6). Possible mechanisms include small inhibitory RNA and epigenetic pathways that mediate expression levels together with dosage compensation in triploids [35, 53].
The hypothesis that differences in expression levels have an important role in speciation and adaptation has been generally accepted. The mechanism of dosage compensation may be an extremely relevant factor contributing to the success and perpetuation of polyploidy in lower vertebrates. Our results reveal a dosage effect in triploid fish. To further analyze the expression regulated by dosage compensation, we used 12 expression patterns, including up-/down-regulation, homoeolog dominance, and mid-parents, to help us understand the speciation of triploid fish. The slightly up-regulated growth genes and the preferential transcription of maternal homoeologs provided insight into the regulatory mechanisms that may contribute to the relationship between heterosis and growth expression in triploid fish. At present, we are trying to elucidate how these transcriptomic dynamics affect function and mediate phenotypes. In addition, the genes with changes in expression levels that were conferred by gene abundance are available for evolutionary experimentation. However, more studies using various species, tissues, and environmental conditions are needed to describe the various expression level patterns in hybrids and polyploids.
For this study, all experiments were approved by the Animal Care Committee of Hunan Normal University and followed the guidelines of the Administration of Affairs Concerning Animal Experimentation of China. Experimental individuals were reared in a pool with suitable illumination, water temperature, dissolved oxygen content, and adequate forage for 19 months in the Engineering Center of Polyploidy Fish Breeding of the National Education Ministry located at Hunan Normal University, China. Triploid hybrids of female grass carp (Ctenopharyngodon idellus, GC, Cyprininae, 2n = 48) × male blunt snout bream (Megalobrama amblycephala, BSB, Cultrinae, 2n = 48) were successfully obtained by distant hybridization as a result of human selection (Fig. 1b and c). The 5S rDNA locus has been used to identify triploid hybrids that possess 72 chromosomes, with two sets from maternal GC and one set from paternal BSB. The triploid hybrid of GC (♀) × BSB (♂) is abbreviated as the GB hybrid. Nine individuals (three hybrids and six parents) were collected for our study. Information about the fish samples, including body traits (body length, body height, and weight) and DNA content, was obtained at the time of the experiment (Additional file 8).
The ploidy levels of the nine individuals were determined by a metaphase chromosome assay of cultured blood cells (Fig. 1a). After the fish were anesthetized with 2-phenoxyethanol, liver tissue was excised carefully to avoid gut contamination. The fish were treated humanely. All experiments were approved by the Animal Care Committee of Hunan Normal University and followed the Administration of Affairs Concerning Animal Experimentation guidelines, as approved by the Science and Technology Bureau of China. Samples were cut into small pieces, immediately placed in RNALater (Ambion, AM7021, USA), and stored at −80 °C following the manufacturer’s instructions. Total RNA was extracted from liver tissue of the BSB, GC, and GB samples. After the RNALater was removed, the samples were homogenized using a pestle and mortar. RNA was isolated according to the standard TRIzol protocol, and agarose gel electrophoresis and the ratio of optical density at 260 nm to that at 280 nm (OD260/OD280) were used to assess RNA quality. A TURBO DNA-free kit was used to remove DNA contamination.
Illumina sequencing and assembly of the Illumina contigs
Poly (A) mRNA isolation was performed using oligo (dT) beads after total RNA collection. Fragmentation buffer was added to generate short fragments of mRNA. Using these short fragments as templates, first-strand cDNA was synthesized with random hexamer primers. Second-strand cDNA was then synthesized using buffer, dNTPs, RNaseH, and DNA polymerase I. Short fragments were purified with the QiaQuick PCR extraction kit (Qiagen) and resolved with elution buffer. These fragments were separated by agarose gel electrophoresis after sequencing adapters were added, and suitable fragments were selected as templates for PCR amplification. During the quality control steps, the Agilent 2100 Bioanalyzer and ABI StepOnePlus Real-Time PCR System were used to qualify and quantify the sample libraries. Finally, the nine libraries from the nine individuals (six parents and three triploids) were sequenced using an Illumina HiSeq™ 2000/2500.
After raw reads were produced by sequencing, the read adaptors and low-quality reads were removed. Transcriptome de novo assembly was carried out with a short-read assembly program (Trinity), which uses three independent software modules called Inchworm, Chrysalis, and Butterfly. Principal component analysis (PCA) of the nine liver transcriptomes was applied to examine the contribution of each transcript to the separation of the classes [55, 56] (Additional file 9).
Contig annotation was performed using the five public databases. BLASTX alignment (e-value ≤ 1e−6) between contigs and protein databases was performed, and the best-aligned results were used to decide the sequence direction of contigs (Additional file 1). After retaining sequences with an alignment length ≥ 100 bp, accession numbers of the genes were obtained from the BLASTX results. Then, GO terms of the annotated sequences were obtained through Ensembl BioMart. WEGO software was used to analyze the GO annotation (Additional file 2). For pathway enrichment analysis, we mapped all differentially expressed genes to terms in the KEGG database and looked for significantly enriched KEGG terms (Additional file 10).
Mapping and differential expression
To obtain the transcripts shared among the three species, the BSB, GC, and GB contigs were merged into a set of reference transcripts using CD-HIT with a 95% sequence identity threshold. We used the merged sequences as the reference because this database was built from transcripts of both parents and the hybrid offspring. All clean reads were aligned against the merged sequences using Blat. The expression level in each of the three species was then estimated from the number of aligned reads.
Mapped, filtered, and sorted reads were analyzed with the DEGseq package in R software version 2.13 (R Foundation for Statistical Computing, Vienna, Austria). Differential expression between triploids and their diploid parents was assessed using Fisher’s exact tests. The abundance or coverage of each transcript was determined from read counts and normalized as reads per kilobase exon per million mapped reads (RPKM). The RPKM value of the read density reflects the molar concentration of a transcript in the starting sample after normalizing for the RNA length and the total read number in the measurement. This facilitates a transparent comparison of transcript levels within and between samples. Herein, gene expression was defined as the average expression of the sequences assigned to a gene, and comparisons between species are shown (Fig. 3a, b, and c).
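The RPKM normalization described above can be illustrated with a minimal Python sketch. The function name and the example numbers are hypothetical and are not taken from the study; only the standard RPKM definition (read count scaled by transcript length in kilobases and by millions of mapped reads) is assumed.

```python
def rpkm(read_count: int, transcript_length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of transcript per million mapped reads.

    RPKM = read_count * 1e9 / (transcript_length_bp * total_mapped_reads)
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("transcript length and total mapped reads must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical example: 500 reads on a 2,000 bp transcript
# in a library with 25 million mapped reads.
print(rpkm(500, 2000, 25_000_000))  # 10.0
```

Because both transcript length and library size appear in the denominator, values computed this way can be compared across transcripts of different lengths and across libraries of different depths.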
Dosage compensation in triploid fish
An absolute value of the log2 ratio ≥ 1 was used as the threshold to judge the significance of a gene expression difference. Genes with expression above the threshold were described as upregulated and those below it as downregulated.
To analyze the dosage effects in triploids, we first set the predicted triploid expression level value (PT-ELV, χtriploid) according to the composition of the genome: two sets of genomes from maternal GC and one set from paternal BSB. The value was constructed from two parts, half of the BSB gene expression value (χBSB) and the full GC gene expression value (χGC): χtriploid = 1/2 χBSB + χGC. If no dosage effect occurs in triploids, the gene expression level of triploids should track χtriploid. However, comparing the actual triploid expression level value (AT-ELV) with the PT-ELV revealed that most genes were down-regulated. We therefore assumed that the dosage effect occurred in the maternal GC homoeolog of triploids, as in other triploid individuals, and set up the predicted diploid expression level value (PD-ELV): χdiploid = 1/2 χBSB + 1/2 χGC. Comparing the AT-ELV with the PD-ELV, the differentially expressed genes showed both up- and down-regulation in triploid fish.
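As a small worked illustration of the two predicted values, the Python sketch below computes χtriploid and χdiploid from parental expression values and compares an observed triploid value against each on a log2 scale. The function names and the example RPKM values are hypothetical; only the two formulas come from the text.

```python
import math


def predicted_triploid(x_bsb: float, x_gc: float) -> float:
    """PT-ELV: one BSB genome set plus two GC sets (1/2 * chi_BSB + chi_GC)."""
    return 0.5 * x_bsb + x_gc


def predicted_diploid(x_bsb: float, x_gc: float) -> float:
    """PD-ELV: expression reduced to the diploid state (1/2 * chi_BSB + 1/2 * chi_GC)."""
    return 0.5 * x_bsb + 0.5 * x_gc


def log2_ratio(actual: float, predicted: float) -> float:
    """log2(actual / predicted); both values must be positive."""
    if actual <= 0 or predicted <= 0:
        raise ValueError("expression values must be positive")
    return math.log2(actual / predicted)


# Hypothetical RPKM values for a single gene.
x_bsb, x_gc, actual_triploid = 20.0, 30.0, 24.0

pt_elv = predicted_triploid(x_bsb, x_gc)  # 40.0
pd_elv = predicted_diploid(x_bsb, x_gc)   # 25.0
print(pt_elv, pd_elv)
print(round(log2_ratio(actual_triploid, pt_elv), 3))
print(round(log2_ratio(actual_triploid, pd_elv), 3))
```

In this made-up case the observed value sits well below the PT-ELV but close to the PD-ELV, which is the pattern the text describes for most genes.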
Analyses of expression level dominance and homoeolog expression bias
We explored the data to identify candidate novel expression (new expression of a gene in liver) and homoeolog silencing patterns (no expression of one homoeolog) in the hybrids. Novel expression was inferred when both parental species had no reads for a gene, yet the hybrids displayed more than 10 RPKM. Conversely, if both parental species had more than 10 RPKM but the hybrids had zero counts for the same gene, this was considered silencing. These two cases were excluded from further analysis, and we focused on genes that were expressed in both diploid parents and in the triploid offspring.
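The two filtering rules above can be written as a short Python sketch. The function name, the label strings, and the example values are hypothetical; the 10 RPKM cutoff and the zero-read conditions follow the text, and a zero RPKM is treated here as equivalent to zero reads.

```python
RPKM_CUTOFF = 10.0  # cutoff stated in the text


def classify_presence(bsb_rpkm: float, gc_rpkm: float, hybrid_rpkm: float) -> str:
    """Label a gene as 'novel', 'silenced', or 'other'.

    'novel'    : no reads in either parent, more than 10 RPKM in the hybrid
    'silenced' : more than 10 RPKM in both parents, zero in the hybrid
    'other'    : every remaining case (not a category defined in the text)
    """
    if bsb_rpkm == 0 and gc_rpkm == 0 and hybrid_rpkm > RPKM_CUTOFF:
        return "novel"
    if bsb_rpkm > RPKM_CUTOFF and gc_rpkm > RPKM_CUTOFF and hybrid_rpkm == 0:
        return "silenced"
    return "other"


# Hypothetical examples.
print(classify_presence(0.0, 0.0, 15.2))    # novel
print(classify_presence(12.0, 30.5, 0.0))   # silenced
print(classify_presence(12.0, 30.5, 18.0))  # other
```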
In triploid offspring, liver gene expression as a whole was affected by a negative dosage effect. Genes identified as differentially expressed in the hybrid relative to the diploid parents were binned into 12 possible classes of differential expression (Fig. 4a), covering ELD, mid-parent expression, and up/down expression (outside the range of either parent), according to Rapp et al. (2009). Briefly, genes were parsed into these 12 categories (labeled with Roman numerals; see Fig. 4a) depending on the relative expression levels between the triploid and the diploid parents. Examined in this manner, genes may display mid-parent expression (XI and XII), paternal BSB-ELD (VII and VIII), maternal GC-ELD (IX and X), expression lower than both parents (I, II, and III), or expression higher than both parents (IV, V, and VI). For each of the 12 categories above (which are based on joint expression levels for both homoeologs), we calculated the RPKM value of reads to examine the gene expression for each homoeolog pair. The FDR was used to determine the threshold P value in multiple tests and analyses. FDR < 0.001 and an absolute value of the log2 ratio ≥ 1 were used as thresholds to judge the significance of gene expression differences between two species. For each gene, the expression levels of the two diploid parents were estimated and classified into three situations; the expression level of the triploid hybrid for the same gene was then shown for each of the three situations (Fig. 4b, c, and d).
Based on the transcriptome data, β-actin expression was detected in BSB, GC, and GB. The expression level of β-actin in the liver of triploids was also reduced to the diploid level, so β-actin was used as the reference gene for qPCR. Total RNA extracted from liver tissue was used for qPCR analysis. qPCR analysis was performed using the Prism 7500 Sequence Detection System (Applied Biosystems) with a miScript SYBR Green PCR kit (Qiagen). qPCR was performed on triplicate biological replicates, each with triplicate technical replicates. The amplification conditions were as follows: 50 °C for 5 min and 95 °C for 10 min, followed by 40 cycles at 95 °C for 15 s and 60 °C for 45 s. The average threshold cycle (Ct) was calculated for each sample, and relative expression was determined using the 2^−ΔΔCt method with normalization to β-actin. Lastly, a melting curve analysis was performed to confirm specific amplification of the expected product.
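The grouping logic can be sketched in Python in a deliberately simplified form. This is not the 12-class scheme of Fig. 4a: it collapses the classes into the five broad groups named above, and it decides whether two values differ using only an assumed two-fold cutoff (|log2 ratio| ≥ 1), omitting the FDR test. All names and example values are hypothetical.

```python
import math

LOG2_CUTOFF = 1.0  # assumed two-fold threshold; the FDR criterion is omitted here


def differs(a: float, b: float) -> bool:
    """True if two positive expression values differ by at least two-fold."""
    return abs(math.log2(a / b)) >= LOG2_CUTOFF


def classify_expression(bsb: float, gc: float, hybrid: float) -> str:
    """Assign a gene to one of five broad groups (plus 'no change')."""
    diff_bsb = differs(hybrid, bsb)
    diff_gc = differs(hybrid, gc)
    if not diff_bsb and not diff_gc:
        return "no change"
    if not diff_bsb:
        return "BSB-ELD"  # hybrid matches the paternal BSB level only
    if not diff_gc:
        return "GC-ELD"   # hybrid matches the maternal GC level only
    if hybrid > max(bsb, gc):
        return "higher than both parents"
    if hybrid < min(bsb, gc):
        return "lower than both parents"
    return "mid-parent"


# Hypothetical RPKM values (BSB, GC, triploid).
print(classify_expression(10.0, 100.0, 95.0))  # GC-ELD
print(classify_expression(10.0, 100.0, 12.0))  # BSB-ELD
print(classify_expression(10.0, 100.0, 30.0))  # mid-parent
print(classify_expression(10.0, 12.0, 40.0))   # higher than both parents
```

A full implementation would also record whether the two parents differ from each other, which is what separates the 12 classes within each broad group.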
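For readers unfamiliar with the ΔΔCt calculation, a minimal Python sketch follows. It assumes the standard form of the method (ΔCt = Ct of the target gene minus Ct of the reference gene; ΔΔCt = ΔCt of the sample minus ΔCt of the calibrator) and assumes that technical replicates are averaged before the calculation. The function name and all Ct values are hypothetical.

```python
from statistics import mean


def fold_change(ct_target_sample: float, ct_ref_sample: float,
                ct_target_calibrator: float, ct_ref_calibrator: float) -> float:
    """Relative expression by the 2^-ddCt method.

    Each argument is a mean Ct; the reference gene here would be beta-actin.
    """
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_calibrator = ct_target_calibrator - ct_ref_calibrator
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical technical triplicates for the target gene in one sample.
ct_target = mean([24.1, 23.9, 24.0])

# Hypothetical mean Ct values for the remaining three measurements.
print(round(fold_change(ct_target, 18.0, 22.0, 18.0), 3))
```

With these made-up numbers the ΔΔCt is about 2 cycles, corresponding to roughly a four-fold lower relative expression in the sample than in the calibrator.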
Abbreviations
Blat: BLAST-like alignment tool
AT-ELV: The value of actual triploid expression level
ELD: Expression level dominance
FDR: False discovery rate
HEB: Homoeolog expression bias
PCA: Principal component analysis
PD-ELV: The value of predicted diploid expression level
PT-ELV: The value of predicted triploid expression level
RPKM: Reads per kilobase exon per million mapped reads
qPCR: Real-time quantitative polymerase chain reaction
Mayr E. Animal species and evolution. 1963.
Comai L. The advantages and disadvantages of being polyploid. Nat Rev Genet. 2005;6(11):836–46.
Vrijenhoek RC. Polyploid hybrids: multiple origins of a treefrog species. Curr Biol. 2006;16(7):R245–247.
Tunner HG, Nopp H. Heterosis in the common European Water Frog. Naturwissenschaften. 1979;66(5):268–9.
Alves M, Coelho M, Collares-Pereira M. Evolution in action through hybridisation and polyploidy in an Iberian freshwater fish: a genetic review. Genetica. 2001;111(1–3):375–85.
He W, Xie L, Li T, Liu S, Xiao J, Hu J, Wang J, Qin Q, Liu Y. The formation of diploid and triploid hybrids of female grass carp x male blunt snout bream and their 5S rDNA analysis. BMC Genet. 2013;14(1):110.
Shen JM, Liu SJ, Sun YD, Zhang C, Luo KK, Tao M, Zeng C, Liu Y. A new type of triploid crucian carp-red crucian carp (female) x allotetraploid (male). Prog Nat Sci. 2006;16(12):1348–52.
Adams KL, Wendel JF. Novel patterns of gene expression in polyploid plants. Trends Genet. 2005;21(10):539–43.
Larsen PA, Marchan-Rivadeneira MR, Baker RJ. Natural hybridization generates mammalian lineage with species characteristics. Proc Natl Acad Sci U S A. 2010;107(25):11447–52.
Pala I, Coelho MM, Schartl M. Dosage Compensation by Gene-Copy Silencing in a Triploid Hybrid Fish. Curr Biol. 2008;18(17):1344–8.
Ching B, Jamieson S, Heath JW, Heath DD, Hubberstey A. Transcriptional differences between triploid and diploid Chinook salmon (Oncorhynchus tshawytscha) during live Vibrio anguillarum challenge. Heredity. 2009;104(2):224–34.
Zhong H, Zhou Y, Liu S, Tao M, Long Y, Liu Z, Zhang C, Duan W, Hu J, Song C, et al. Elevated expressions of GH/IGF axis genes in triploid crucian carp. Gen Comp Endocr. 2012;178(2):291–300.
Zhou Y, Zhong H, Liu S, Yu F, Hu J, Zhang C, Tao M, Liu Y. Elevated expression of Piwi and piRNAs in ovaries of triploid crucian carp. Mol Cell Endocrinol. 2014;383(1–2):1–9.
Yu F, Xiao J, Liang X, Liu S, Zhou G, Luo K, Liu Y, Hu W, Wang Y, Zhu Z. Rapid growth and sterility of growth hormone gene transgenic triploid carp. Chinese Sci Bull. 2011;56(16):1679–84.
Xu K, Wen M, Duan W, Ren L, Hu F, Xiao J, Wang J, Tao M, Zhang C, Wang J, et al. Comparative Analysis of Testis Transcriptomes from Triploid and Fertile Diploid Cyprinid Fish. Biol Reprod. 2015;92(4):95.
Devlin RH, Sakhrani D, Biagi CA, Smith JL, Fujimoto T, Beckman B. Growth and endocrine effect of growth hormone transgene dosage in diploid and triploid coho salmon. Gen Comp Endocr. 2014;196:112–22.
Johnson RM, Shrimpton JM, Cho GK, Heath DD. Dosage effects on heritability and maternal effects in diploid and triploid Chinook salmon (Oncorhynchus tshawytscha). Heredity. 2007;98(5):303–10.
Mallet J. Hybrid speciation. Nature. 2007;446(7133):279–83.
Long Y, Tao M, Liu S, Zhong H, Chen L, Tao S, Liu Y. Differential expression of Gnrh2, Gthβ, and Gthr genes in sterile triploids and fertile tetraploids. Cell Tissue Res. 2009;338(1):151–9.
Tao M, Liu S, Long Y, Zeng C, Liu J, Liu L, Zhang C, Duan W, Liu Y. The cloning of Dmc1 cDNAs and a comparative study of its expression in different ploidy cyprinid fishes. Sci China Ser C. 2008;51(1):38–46.
Tate JA, Joshi P, Soltis KA, Soltis PS, Soltis DE. On the road to diploidization? Homoeolog loss in independently formed populations of the allopolyploid Tragopogon miscellus (Asteraceae). BMC Plant Biol. 2009;9(1):1–10.
Yoo MJ, Szadkowski E, Wendel JF. Homoeolog expression bias and expression level dominance in allopolyploid cotton. Heredity. 2013;110(2):171–80.
Cox MP, Dong T, Shen G, Dalvi Y, Scott DB, Ganley AR. An interspecific fungal hybrid reveals cross-kingdom rules for allopolyploid gene expression patterns. PLoS Genet. 2014;10(3):e1004180.
Chaudhary B, Flagel L, Stupar RM, Udall JA, Verma N, Springer NM, Wendel JF. Reciprocal silencing, transcriptional bias and functional divergence of homeologs in polyploid cotton (Gossypium). Genetics. 2009;182(2):503–17.
Vrana PB, Fossella JA, Matteson P, del Rio T, O'Neill MJ, Tilghman SM. Genetic and epigenetic incompatibilities underlie hybrid dysgenesis in Peromyscus. Nat Genet. 2000;25(1):120–4.
Brennecke J, Malone CD, Aravin AA, Sachidanandam R, Stark A, Hannon GJ. An epigenetic role for maternally inherited piRNAs in transposon silencing. Science. 2008;322(5906):1387–92.
Olmo E. Nucleotype and cell size in vertebrates: a review. Basic Appl Histochem. 1983;27(4):227–56.
Melaragno JE, Mehrotra B, Coleman AW. Relationship between Endopolyploidy and Cell Size in Epidermal Tissue of Arabidopsis. Plant Cell. 1993;5(11):1661–8.
Fankhauser G. Maintenance of normal structure in heteroploid salamander larvae, through compensation of changes in cell size by adjustment of cell number and cell shape. J Exp Zool. 1945;100:445–55.
Henery CC, Bard JB, Kaufman MH. Tetraploidy in mice, embryonic cell number, and the grain of the developmental map. Dev Biol. 1992;152(2):233–41.
Cavalier-Smith T. Nuclear volume control by nucleoskeletal DNA, selection for cell volume and cell growth rate, and the solution of the DNA C-value paradox. J Cell Sci. 1978;34:247–78.
Rapp RA, Udall JA, Wendel JF. Genomic expression dominance in allopolyploids. BMC Biol. 2009;7:18.
Baxter RC, Binoux M, Clemmons DR, Conover C, Drop SL, Holly JM, Mohan S, Oh Y, Rosenfeld RG. Recommendations for nomenclature of the insulin-like growth factor binding protein (IGFBP) superfamily. J Clin Endocr Metab. 1998;8(3):273–4.
Casci T. Dosage compensation: What dosage compensation? Nat Rev Genet. 2011;12(1):2–2.
Matos I, Machado MP, Schartl M, Coelho MM. Gene expression dosage regulation in an allopolyploid fish. PLoS One. 2015;10(3):e0116309.
Verhoeven KJ, Van Dijk PJ, Biere A. Changes in genomic methylation patterns during the formation of triploid asexual dandelion lineages. Mol Ecol. 2010;19(2):315–24.
Chen XL, Yue PQ, Lin RD. Major groups within the family cyprinidae and their phylogenetic relationships. Acta Zootaxonomica Sinica. 1984;4:022.
Werth CR, Windham MD. A model for divergent, allopatric speciation of polyploid pteridophytes resulting from silencing of duplicate-gene expression. Am Nat. 1991;137(4):515–26.
Yoo MJ, Liu X, Pires JC, Soltis PS, Soltis DE. Nonadditive gene expression in polyploids. Annu Rev Genet. 2014;48:485–517.
Straub T, Becker PB. Dosage compensation: the beginning and end of generalization. Nat Rev Genet. 2007;8(1):47–57.
Hwa V, Oh Y, Rosenfeld RG. The insulin-like growth factor-binding protein (IGFBP) superfamily. Endocr Rev. 1999;20(6):761–87.
Ward CW, Lawrence MC. Ligand-induced activation of the insulin receptor: a multi-step process involving structural changes in both the ligand and the receptor. Bioessays. 2009;31(4):422–34.
Tasdelen I, van Beekum O, Gorbenko O, Fleskens V, van den Broek Niels JF, Koppen A, Hamers N, Berger R, Coffer Paul J, Brenkman Arjan B, et al. The serine/threonine phosphatase PPM1B (PP2Cβ) selectively modulates PPARγ activity. Biochem J. 2013;451(1):45–53.
Settle SH, Rountree RB, Sinha A, Thacker A, Higgins K, Kingsley DM. Multiple joint and skeletal patterning defects caused by single and double mutations in the mouse Gdf6 and Gdf5 genes. Dev Biol. 2003;254(1):116–30.
Ishisaki A, Yamato K, Hashimoto S, Nakao A, Tamaki K, Nonaka K, ten Dijke P, Sugino H, Nishihara T. Differential inhibition of Smad6 and Smad7 on bone morphogenetic protein-and activin-mediated growth arrest and apoptosis in B cells. J Biol Chem. 1999;274(19):13637–42.
Shrimpton JM, Sentlinger AM, Heath JW, Devlin RH, Heath DD. Biochemical and molecular differences in diploid and triploid ocean-type chinook salmon (Oncorhynchus tshawytscha) smolts. Fish Physiol Biochem. 2007;33(3):259–68.
Beckman BR, Shearer KD, Cooper KA, Dickhoff WW. Relationship of insulin-like growth factor-I and insulin to size and adiposity of under-yearling Chinook salmon. Comp Biochem Phys A. 2001;129(2):585–93.
Romero IG, Ruvinsky I, Gilad Y. Comparative studies of gene expression and the evolution of gene regulation. Nat Rev Genet. 2012;13(7):505–16.
Tirosh I, Reikhav S, Sigal N, Assia Y, Barkai N. Chromatin regulators as capacitors of interspecies variations in gene expression. Mol Syst Biol. 2010;6(1):435.
Matzke M, Scheid OM, Matzke A. Rapid structural and epigenetic changes in polyploid and aneuploid genomes. Bioessays. 1999;21(9):761–7.
Kudapa H, Azam S, Sharpe AG, Taran B, Li R, Deonovic B, Cameron C, Farmer AD, Cannon SB, Varshney RK. Comprehensive transcriptome assembly of chickpea (Cicer arietinum L.) using Sanger and next generation sequencing platforms: development and applications. PLoS One. 2014;9(1):e86039.
Xu C, Bai Y, Lin X, Zhao N, Hu L, Gong Z, Wendel JF, Liu B. Genome-wide disruption of gene expression in allopolyploids but not hybrids of rice subspecies. Mol Biol Evol. 2014;31(5):1066–76.
Lu J, Zhang C, Baulcombe DC, Chen ZJ. Maternal siRNAs as regulators of parental genome imbalance and gene expression in endosperm of Arabidopsis seeds. Proc Natl Acad Sci U S A. 2012;109(14):5529–34.
Dion-Cote AM, Renaut S, Normandeau E, Bernatchez L. RNA-seq Reveals Transcriptomic Shock Involving Transposable Elements Reactivation in Hybrids of Young Lake Whitefish Species. Mol Biol Evol. 2014;31(5):1188–99.
Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11(10):R106.
Reeb PD, Steibel JP. Evaluating statistical analysis models for RNA sequencing experiments. Front Genet. 2013;4:178.
Flicek P, Ahmed I, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P, Coates G, Fairley S, et al. Ensembl 2013. Nucleic Acids Res. 2013;41(Database issue):D48–55.
Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, Wang J, Li S, Li R, Bolund L. WEGO: a web tool for plotting GO annotations. Nucleic Acids Res. 2006;34 suppl 2:W293–7.
Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22(13):1658–9.
Wang L, Feng Z, Wang X, Wang X, Zhang X. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics. 2010;26(1):136–8.
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5(7):621–8.
We thank Pengcheng Yan (Beijing Computing Center) for bioinformatics analysis. We also thank Hui Zhang for helpful comments on a previous version of the manuscript.
This research was supported by the National Natural Science Foundation of China (Grant No. 31430088), the Key Research and Development Project of Hunan Province (Grant No. 2016NK2128), the educational scientific research of Hunan Province (Grant No. 16C0974), the Natural Science Foundation of Hunan Province (Grant No. 14JJ6008), the Training Program of the Major Research Plan of the National Natural Science Foundation of China (Grant No. 91331105), the National Key Basic Research Program of China (Grant No. 2012CB722305), the National High Technology Research and Development Program of China (Grant No. 2011AA100403), the Cooperative Innovation Center of Engineering and New Products for Developmental Biology (Grant No. 20134486), and the construct program of the key discipline in Hunan Province and China.
Availability of data and materials
Sequences of the genes analyzed in this work are available through GenBank.
LR, Jun W, YZ, XJT, and YFX carried out bioinformatics analyses and wrote the manuscript. SJL and LR contributed to the conception and design of the study. WHL, CCT, JC and JX provided assistance extracting the raw material. JLC, Jing W, MT and CZ modified the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
For this study, all experiments were approved by the Animal Care Committee of Hunan Normal University and followed the guidelines of the Administration of Affairs Concerning Animal Experimentation of China.
The transcriptome data were submitted to NCBI (accession numbers: SRP022247, SRP040125 and SRP040126).
In addition, datasets further supporting the conclusions of this article are included within the article and its additional files.
Summary information of assembled sequences blasted against the five databases. (A) Distribution of GC, BSB, GB, and merged-sequence contigs aligned to NCBI-NR, Swiss-Prot, KEGG, COG and GO, respectively. (B) E-value distribution of BLASTX hits with a threshold of 1.0E-6. (TIF 647 kb)
Gene ontology (GO) assignments for the GC, BSB and GB. GO assignments (level 2) were used to predict the functional distribution including cellular component ontology, molecular function ontology and biological processes ontology. (TIF 3893 kb)
Venn diagram where the area of each circle (and intersections) is proportional to the number of unigenes from GC, BSB and GB after GO annotation. Numbers are indicated in each section. (TIF 561 kb)
Distribution of contigs of GC, BSB and GB and of the merged sequences. The 95,702 contigs were merged using CD-HIT. (TIF 215 kb)
The basic information of mapping data. (DOCX 17 kb)
The basic information of gene silencing and novel genes in triploid offspring. (DOCX 19 kb)
Basic information of the categories of “No change” in growth genes as comparison of the triploid hybrids with its parents. (DOCX 17 kb)
Comparison of the measurable traits among the hybrid offspring and their parents. (DOCX 20 kb)
(Color online) Symmetric heatmap of attribute correlations among nine individuals. Blue (red) indicates perfect correlation (anti-correlation). White exhibits the intermediate case of no correlation. The small amount of clustering along the diagonal attests to the relative independence of the attributes. (TIF 182 kb)
The pathway information in three species. (DOCX 30 kb)
Cite this article
Ren, L., Tang, C., Li, W. et al. Determination of dosage compensation and comparison of gene expression in a triploid hybrid fish. BMC Genomics 18, 38 (2017). https://doi.org/10.1186/s12864-016-3424-5
- Dosage compensation
- Genomic dominance
- Biased expression