Small RNA-seq analysis of single porcine blastocysts revealed that maternal estradiol-17beta exposure does not affect miRNA isoform (isomiR) expression
BMC Genomics volume 19, Article number: 590 (2018)
The expression of microRNAs (miRNAs) is essential for the proper development of the mammalian embryo. A maternal exposure to endocrine disrupting chemicals during preimplantation bears the potential for transgenerational inheritance of disease through the epigenetic perturbation of the developing embryo. A comprehensive assembly of embryo-specific miRNAs and respective isoforms (isomiR) is lacking to date. We aimed at revealing the sex-specific miRNA expression profile of single porcine blastocysts developing in gilts orally exposed to exogenous estradiol-17 β (E2). Therefore we analyzed the miRNA profile specifically focusing on isomiRs and potentially embryo-specific miRNAs.
Deep sequencing of small RNA (small RNA-seq) result in the detection of miRNA sequences mapping to known and predicted porcine miRNAs as well as novel miRNAs highly conserved in human and cattle. A set of highly abundant miRNAs and a large number of rarely expressed miRNAs were identified by using a small RNA analysis pipeline, which was integrated into a novel Galaxy workflow specifically benefits incompletely annotated species. In particular, orthologue species information increased the total number of annotated miRNAs, while mapping to other non-coding RNAs avoided falsely annotated miRNAs. Neither the low nor the high dose of E2 treatment (10 and 1000 µE2/kg body weight daily, respectively) affected the miRNA profile in blastocysts despite the distinct differential mRNA expression and DNA methylation found in previous studies. The high number of generated sequence reads enabled a comprehensive analysis of the isomiR repertoire showing various templated and non-templated modifications. Furthermore, potentially blastocyst-specific miRNAs were identified.
In pre-implantation embryos, numerous distinct isomiRs were discovered indicating a high complexity of miRNA expression. Neither the sex of the embryo nor a maternal E2 exposure affected the miRNA expression profile of developing porcine blastocysts. The adaptation to the continuous duration of the E2 treatment might explain the lack of an effect.
Endocrine disrupting chemicals (EDCs) are natural and synthetic substances that may impact on health by deteriorating the endogenous hormone system [1, 2]. During the preimplantation phase the embryo is specifically sensitive towards EDC acting as transcriptional modulators [3, 4]. EDC may perturb the uterine milieu and thereby also indirectly impose adverse transgenerational effects on the developing embryo . Depending on the time, dose and EDC substance, the effects may be either detrimental or only lead to minor effects in the targeted individual or the subsequent offspring. Estradiol-17 β (E2) is the most potent endogenous estrogen in the body. Because it is present in the environment and can adversely act on the developing organisms, it is considered as an EDC [6, 7].
In pigs, embryo development is under maternal control until the 4-cell stage. While embryonic genome activation occurs at around 3 days post fertilization , both compact morula and the subsequent blastocyst are formed in the uterus at around 4–5 days after fertilization, respectively . At Day 7 of development, the embryo hatches from the zona pellucida, increases in size until Day 10 of development  and starts to rapidly elongate by Day 11. At the same time, the embryo secretes E2 as primary pregnancy recognition signal .
We have only recently shown that gestational low-dose E2 treatment in pigs altered body composition and bone development in male and female offspring, respectively [6, 11]. Specifically, the oral application of very low doses of E2 left a lasting epigenetic fingerprint in both the mother and her offspring, revealed by differential gene expression and DNA methylation of cell cycle regulation and tumor suppressor genes in a number of different tissues . An intergenerational effect of gestational exposure was revealed as blastocysts not only showed differential gene expression, but also differential local DNA methylation patterns largely similar to the maternal tissues  (Flöter et al, under review). In embryos of several species including mice, cattle, and pigs, a dynamic developmental stage-specific microRNA (miRNA) expression has been described [13–15]. In addition, it has been hypothesized that embryonic E2 in pigs is also associated with changes in the expression of miRNAs in the embryo , which can lead to pregnancy disruption .
In this study we focus on non-coding RNA (ncRNA) which are transcripts that are not translated into proteins and usually have structural or regulatory roles . The corresponding genes are located in intergenic and intronic regions or on the reverse strand of protein-coding genes, but can also have similar structures as protein-coding genes located within protein-coding sequences . Many known ncRNAs are assigned to different functional classes  and play a pivotal role in gene expression regulation. Well-known long ncRNAs present in very high amounts in the cell are rRNAs and tRNAs. Other small ncRNA groups, such as long piRNAs and miRNAs with sizes below 50 nt, have only lately been studied intensively . Due to the small size, the latter had been difficult to discriminate from partially degraded longer transcripts . While piRNAs have been shown to mediate transposon silencing and epigenetic gene regulation in gametes , miRNAs comprise a large class of small ncRNAs involved in post-transcriptional repression of genes, inhibition of messenger RNA (mRNA) translation and/or enhanced mRNA degradation [23, 24]. MiRNAs are produced from a stem-loop-containing primary precursor (pri-miRNA)  that is processed by the Drosha-DGCR8 complex into a precursor hairpin (pre-miRNA) of ∼70nt length in the nucleus. The pre-miRNA is transported into the cytoplasm where it is converted by Dicer to a 17–27 nucleotides long double stranded miRNA/miRNA* duplex. One of the strands or sometimes both represents the functional mature miRNA form that is incorporated in the RNA-induced silencing complex (RISC). This complex is able to target specific mRNAs . MiRBase is one of the databases for miRNA sequences. It permanently increases driven by the deposition of results of small RNA deep sequencing experiments and provides a number of precursor and mature miRNAs that are confirmed based on mapping of small RNA deep sequencing reads . Small RNA sequencing (RNA-seq) data can be analyzed similar to other transcriptome sequencing data based on basic analysis pipelines including quality control, filtering, trimming, and adapter clipping followed by mapping to a reference genome or transcriptome. However, for small RNA-seq data it is necessary to modify the analysis pipeline. The first universal steps of the RNA-seq analysis starting with quality control up to adapter clipping are needed for every FastQ file analysis. For mapping the short miRNA sequences, a specialized mapping strategy is required, e.g., the BWA aligner which works best for well-annotated genomes  or the National Center for Biotechnology Information (NCBI) BLASTn-short specifically for short sequences . The universal steps as well as sequence alignment can be performed in Galaxy [30–32].
In human and mouse, a great variation of ncRNA is known compared to other mammalian species. A serious limitation appears when working with incompletely annotated species such as livestock. Since the number of annotated small ncRNAs is relatively low, it is very difficult to comprehensively and accurately annotate small RNA data sets. IsomiRs are miRNA variants that originate from imprecise and alternative cleavage during the pre-miRNA processing and post-transcriptional modifications [33, 34]. This results in different miRNA stability, sub-cellular localization and different target sites . Mature miRNA sequences can have variations at the 3’ or 5’ end or both ends such as additions or deletions. Modifications that do not match the precursor are so-called “non-template” isomiRs and are of particular interest because these isomiRs might indicate active miRNA function, i.e., were used for repression of target mRNAs [36, 37]. Five-prime end isomiRs can have an interesting role because this modification affects the seed region of the miRNA which can lead to a different set of targets [38, 39].
We used small RNA-seq analysis to investigate isomiR expression, expression of blastocyst-specific miRNAs, and effects of a maternal estradiol-17 β exposure from fertilization onwards until the day of analysis (Day 10) in single porcine blastocysts.
Animal trial and sampling
The experimental trial has been previously described [6, 40]. In brief, estrous cycle synchronized German Landrace sows (n=4-6/treatment) were inseminated with sperm of the same single Pietrain boar. Starting from insemination until Day 10, sows were fed with different doses of estradiol-17 β (E2; 1, 3, 5(10)-ESTRATRIEN-3, 17 β-DIOL, Steraloids, Newport, USA), namely with 10 and 1000 µg E2/kg body weight twice daily, respectively, or with 2 ml ethanol carrier only (control group). The E2 concentrations were selected according to reference values for humans referring to oral uptake levels  and have been reported earlier . The low dose corresponds to the no observed effect level (NOEL). One hour after ingestion of the last dosing, sows were slaughtered on Day 10 of pregnancy. The uterus was removed and embryos were flushed from the uterus using 10 ml phosphate-buffered saline (PBS) (autoclaved, pH 7.4) per horn. All embryos were transferred into a petri-dish containing PBS, washed twice and then single embryos were snap-frozen in liquid nitrogen and stored at -80 °C. Animals were only included in the RNA-seq analysis if embryos were at the hatched blastocyst stage. Experiments with sows were performed in accordance with the International Guiding Principles for Biomedical Research Involving Animals, as proposed by the Society for the Study of Reproduction, with the European Convention on Animal Experimentation and with the German Animal Welfare Act.
Extraction of RNA and DNA
Total RNA and DNA from single embryos were extracted as reported recently  using the AllPrep RNA/DNA Micro Kit (Qiagen, Hilden, Germany) following the manufacturers protocol for cells with slight modifications. In brief, 700 µl of Buffer RLT Plus supplemented with 1% β-mercaptoethanol was added to the frozen embryos. Disruption was achieved by pipetting up and down and by a single brief vortexing. Homogenization was performed using a syringe and needle (20 G). After centrifugation of the lysate using a DNA spin column, the column was stored at 4 °C, while the flow-through was processed following the protocol for “purification of total RNA containing small RNAs from cells”. In order to improve RNA purity, the column was incubated with Buffer RPE at step D3 and D4 before centrifugation for 4 min and 2 min, respectively. RNA elution was repeated using the first eluate to increase the final concentration. The DNA was purified subsequently. Samples were immediately put on ice. RNA and DNA samples were stored at -80 °C and -20 °C, respectively. Purity and quantity was assessed spectrophotometrically using the NanoDrop 1000 (peqLab, Erlangen, Germany). Additionally, RNA quantity of embryos was determined using the Qubit (Invitrogen) with the Qubit™ RNA BR Assay. RNA integrity was measured by means of the Bioanalyzer 2100 (Agilent Technologies, Waldbronn, Germany) with the RNA 6000 Nano Kit (Agilent). The mean RNA Integrity Number of the embryo samples was 9.7 ± 0.3 (± SD).
RNA and DNA, extracted from at least four embryos per sow from four sows per treatment group (control, NOEL, high dose), were used (n = 65). The sex of the embryos was determined by means of quantitative real-time polymerase chain reaction using the DNA with primers specific for the y- chromosomal gene SRY in addition to primers for the autosomal histone gene. Primers were design using NCBI primer-Basic Local Alignment Search Tool (BLAST) . The SuperScript®; III Platinum®;SYBR®;Green One-Step qRT-PCR Kit (Invitrogen) was used on the LightCycler 2.0 (Roche Diagnostics GmbH, Mannheim, Germany). One microliter of DNA was added to 9 µl of the master mix (5 µl of 2X SYBR ®; Green Reaction Mix (includes 0.4 mM of each dNTP and 6 mM MgSO4), 2.4 µl of nuclease free water, 1 µl of 20x Bovine Serum Albumin (ultrapure, non-acetylated) (1 mg/ml), 0.2 µl of forward primer [20 µM], 0.2 µl of reverse primer [20 µM], and 0.2 µl of SuperScript ®; III RT/Platinum ®; Taq Mix). PCR was performed with the following thermal cycler program: 50 °C for 10 min, 95 °C for 2 min, followed by 50 cycles for 5 s at 95 °C, 10 s at 60 °C, and 15 s at 72 °C. Melting curve analysis was performed from 55 to 95 °C with 0.1 °C/sec, then samples were cooled to 40 °C. Reactions were run in duplicates for each sample. A positive control DNA was included in each run. Each primer pair was checked for specificity by sequencing the amplification product. The melting point was used for confirmation of the identity of the product for each reaction.
Small RNA sequencing
NEBNext ®; Small RNA libraries were prepared starting from 50–100 ng total RNA from individual embryos (n=6 per group, 6 groups in total) and were sequenced as one pool of 36 barcode-tagged samples on an Illumina HiSeq 4000 (126 bp single-end reads) on one lane. These six groups consists of a control group (ethanol carrier only), low dose (10 µg E2/kg), and a high dose (1000 µg E2/kg) for male and female embryos, respectively. Sequencing was performed at the Functional Genomics Center Zurich and results uploaded as FastQ files to our local Galaxy data storage (Galaxy-Platform version 15.10). NGS experiments have been deposited in EBI ArrayExpress  with accession number (E-MTAB-6201).
Analysis of small RNA datasets derived from porcine endometrium
Two small RNA-seq datasets derived from porcine endometrium were used for comparison of miRNA expression, namely one small RNA dataset for Day 10 of pregnancy generated in a previous study in our group , and another dataset for Day 12 of pregnancy downloaded from Gene Expression Omnibus (GSE64863, SRP052027) .
Processing of Fastq files
The data analysis was performed on our locally installed Galaxy system . In the first step of the pipeline, adapter sequences were removed (Illumina Universal Adapter: AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC) with the tool “clip adapter” (Version 1.0.1 FASTX-toolkit ). This tool allows to keep clipped sequences and discard non-clipped sequences. Next, Trimmomatic , (Version 0.36) was used to remove N bases (any nucleotide, not a gap) at the first position of the 5’ end which affected 15% of all sequences due to a known sequencing issue of the Illumina HiSeq 4000 instrument. All Fastq files were quality checked after each processing step with FastQC (v0.11.2) to control the performance of the processing steps. The data analysis strategy was to generate a count table for all obtained unique sequences. This count table was the basis for the subsequent statistical analysis and to annotate the sequences. For this, a series of standard Galaxy tools were used as well as additional converting tools from the ToolShed  (Additional file 1: Figure S1). At first, each FastQ file was collapsed into the sequence and number of appearance (using the tool “Collapse” - FastxTools  ranked by the counts (Additional file 2: Figure S2: no. 1). These FASTA files were converted into Galaxy data file type ‘tabular’ (tab-separated text files) with the tool “FASTA-to-Tabular” (Version 1.1.0) (Additional file 2: Figure S2: no. 2). In the following step of the workflow (Additional file 2: Figure S2: no. 3-4) the rank was removed from the table of each identifier generated by the “Collapse” tool followed by two tools (Convert and Cut) to extract the counts and sequence information. With the tool “Convert” dashes were converted into tabs and the columns of interest (counts and sequences) were obtained using “Cut”. In the next step, all separate files corresponding to the individual samples were joined into one table based on the column with the sequences. To join these tabular files, an in-house tool termed “Join datasets by identifier column” was implemented (available in the “ToolShed” Galaxys app store  “join_files_by_id”) (Additional file 2: Figure S2: no. 5). The resulting table contained the unique sequences and the number of reads per sample. This count table was filtered to remove sequences with negligible read counts, comprising sequencing errors or sequences with very low evidence for potential expression by using counts per million (CPM) per sample . The mean library size and potential CPM cutoff (Count table statistics, in house tool) was calculated and the cutoff set to 2.64 CPM (corresponding to an average of 20 reads per library) for at least 5 out of 36 libraries.
Annotation of filtered unique sequences
Filtered sequences were extracted “Tabular to FASTA” and mapped with NCBI BLAST+  blastn-short to align them to all transcripts of Sus scrofa including non-coding RNAs and related well-annotated species. The collection of BLAST databases contained sequences from miRBase (precursor and canonical mature miRNAs), transcript sequences from NCBI and Ensembl, including non-coding RNAs, as well as tRNA and piRNA cluster sequences retrieved from NCBI Sus scrofa 10.2 GFF3 file [48, 49] (files: mirbase 21: mature/precursor, human, pig, and cattle; Ensembl: predicted precursor sequences, human, pig, cattle, plus other ncRNAs from pig and human; NCBI: all RefSeq transcripts and other non-coding RNAs, for human, pig, and cattle (E-MTAB-6201)). Finally, all BLAST results were filtered and joined by removing all duplicated hits. The annotated sequences were further filtered for potential miRNAs and used for analysis of differential expression. Therefore, we checked if all potential miRNAs also had a significant hit to tRNAs (from pig) or rRNAs (from pig and human) and removed these from the potential miRNAs. Filtering of the alignments was done rather conservative by not allowing any mismatch neither for mapping to porcine nor to the other species (for detailed filtering options see Additional file 3: Table S1).
Analysis of differential expression
The analysis of differential miRNA and isoform expression was performed with the BioConductor package edgeR . To estimate trended dispersions, first the dataset was normalized on library size (TMM normalized)  and the GLM robust (estimateGLMRobustDisp)  function was used to dampen the effect of biological outliers. For comparison of the experimental groups, the contrasts (1) male versus female for all estradiol-17 β doses and control in separate and (2) low (10) and high (1000) dose versus control separately for male and female embryos were set. An adjusted p-value (false discovery rate (FDR)) of 10% and 5% was used as threshold for significance of differentially expressed miRNA sequences. Expression values (CPM) of identified differentially expressed sequences (DES) were further processed with an R script to perform hierarchical cluster (HCL) analysis by samples and DES .
IsomiRs were detected with a pipeline of several R packages from BioConductor . First, all annotated sequences were assigned to their annotated miRNA precursor and aligned with ClustalOmega used by the R package msa [54, 55]. The alignment was used to calculate added and deleted nucleotides and compared to the annotated canonical mature miRNA found in miRBase. In addition, the BLAST result was used to characterize miRNA forms containing non-template nucleotides. Finally, the rank of all isomiRs in their mature miRNA group was calculated according to the individual read counts (expression levels, CPM).
MiRDeep2  was performed on our locally installed Galaxy system. It allows the user to map small RNA-seq reads against mature miRNA, precursor miRNA and in addition to align all sequences to the corresponding genome. It is also capable of predicting pre-miRNA folding structures of known and novel miRNA genes. Prefiltered Fastq files (quality control and adapter clipping) from our pipeline were used to run first the miRDeep2 mapper (version 2.0.0) followed by the main program to identify novel and known miRNAs. For the comparison, the ARF output file of the miRDeep2-mapper was used which provides all mapped sequences. Additionally, the same miRNA annotation files (see above) were used to map all miRNA sequences.
Expression database search
The MiRmine - Human miRNA Expression Database was used to search for the expression of human ortholog miRNAs in different tissues . This database provides tissue-specific expression profiles and relative abundance of miRNAs identified in different human miRNA studies.
Processing and annotation of small RNA-seq reads
The sequencing of 36 small RNA-seq libraries for the Day 10 blastocysts resulted in 8 to 32 million raw reads per library. After joining all unique sequences and read counts into one count table for all samples, ∼ 24,000,000 unique sequences were obtained. The count table was reduced to ∼68,000 unique sequences by filtering on >2.64 CPM corresponding to an average of ≥20 reads per sample. Of these sequences, ∼45,000 sequences could be assigned to known RNAs in the annotation files (Additional file 4: Table S2). The majority of the sequences ∼74% (∼70%) (percentage of read counts indicated in parentheses, meaning the number of mapped sequences per unique sequence) mapped to rRNAs, ∼6% (∼13%) to miRNAs, and ∼8% (∼10%) to tRNAs, less than 1% (<1%) of the sequences were assigned to piRNA clusters and ∼1% (<1%) to mRNAs (Fig. 1a). The remaining 11% (∼6%) of sequences were mapped to other groups of ncRNAs. With respect to read counts per RNA type we found large differences between samples (Fig. 1b, Additional file 5: Figure S3). For example, libraries varied from ∼ 6% to ∼42% in miRNA sequence content with respect to read counts.
Analysis of miRNA using a galaxy pipeline
The predominant length of all annotated miRNAs was ∼23 nt ranging from 16 nt for small isomiRs and 62 nt long sequences mapping to precursor miRNAs. In total, 2949 unique sequences were assigned to miRNAs of miRBase21 using all mature and precursor miRNAs from Sus scrofa, Bos taurus and Homo sapiens (Additional file 6: Table S2). Of these, 1842 (62%) sequences mapped to known miRNAs of Sus scrofa. In addition, we detected 1107 (38%) sequences for novel porcine miRNAs, 443 (15%) mapping to Bos taurus and 623 (21%) Homo sapiens, and 41 (∼1%) to predicted Ensembl porcine novel miRNAs (Additional file 7: Figure S4). Overall, 71 of these novel miRNAs had also been assigned to a porcine pre-miRNA, which indicates that they might not be novel miRNA genes, but novel mature miRNAs. One-hundred-and-nine sequences were filtered out because although annotated as porcine miRNAs in miRBase, they were highly similar to human rRNAs. Another 17 sequences were removed due to their positive hit to porcine tRNA sequences (Additional file 3: Table S2). Altogether, the sequences reliably assigned to miRNAs represented 257 mature miRNAs. Of these miRNAs 162 were detected as known porcine miRNAs (miRBase21) and 95 were novel miRNAs for the pig which were assigned to known miRNAs from human and cattle (Additional file 6: Table S3).
Comparison of galaxy pipeline to miRDeep2
The focus of the comparison of our pipeline to miRDeep2 was on mapping all reads to mature and precursor miRNAs and compare the differences in isomiR content and expression levels. Therefore a pre-result file including all mapped miRNA isoforms was used within the miRDeep2 pipeline (Result file of miRDeep2.pl “Text output of MiRDeep2”). MiRDeep2 detected 2862 potential miRNA sequences after filtering on low expressed miRNAs (5 out of 36 libraries > 20 reads each which is equal to the overall expression of equal or greater 180 reads in total). Of those, 1788 were overlapping with our pipeline. Exactly 1074 sequences were assigned to miRNAs by miRDeep2 and identified as false positives, 644 mapped to tRNAs, 193 to rRNAs, and 38 to mRNAs. In addition, miRDeep found 199 miRNAs, which could not be mapped to known miRNAs but to other ncRNAs. Of the miRNA sequences, 1161 were uniquely annotated by our pipeline (Fig. 2). With respect to expression levels, no significant differences in reads counts were detected in comparison to miRDeep2.
Analysis of differential miRNA expression in response to maternal estradiol-17 β treatment and between female and male embryos
For the statistical analysis of miRNA expression, a filtered count table was used containing 2892 annotated miRNA sequences. None of the generated multiple scaling plot (MDS) plots showed a grouping of embryo samples neither by treatment nor by sex (Fig. 3a, c, d, Additional file 8: Figure S5). A grouping of embryos collected from the same sow was observed in the MDS plots (Fig. 3b) and in the pairwise distance heatmap (Additional file 9: Figure S6). The comparison of male versus female embryos did not reveal DES (FDR 5%). The comparison of low dose estradiol-17 β feeding versus control resulted in 20 DES for male embryos (FDR 5%), which represented 10 different mature miRNAs. The miRNA with the highest number of DES was ssc-miR-34c. All other comparisons did not detect DES (Additional file 10: Table S4). A HCL analysis was performed for these 20 DES to characterize the homogeneity of the expression within the treatment groups (Fig. 4). The cluster analysis did not reveal a clear grouping according to treatments and control. In addition, the read counts of isomiRs were summarized for each miRNA and also female and male embryo samples were joined for each treatment and the control group to increase the number of replicates. Furthermore, the embryos nested in the sow and further technical batch effects were included in the statistical model. None of the various analyses revealed more reliable significant miRNA expression differences.
Analysis of tissue-specific miRNA expression
To identify embryo-specific miRNAs potentially involved in regulation of development, the embryo miRNA expression results were compared to two datasets from porcine endometrium and to a miRNA expression database (MiRmine). The data sets were analyzed with the same pipeline and follow-up scripts. The expression levels of the top 20 most frequently expressed miRNAs in the embryo were compared. All three datasets had mir-21 (miR-21-5p) as the most highly expressed miRNA with 39% of all reads in Day 12 endometrium, 18% in Day 10 endometrium and 22% in Day 10 embryos (Additional file 11: Table S5, Fig. 5). In addition to commonly expressed miRNAs, potential tissue-specific miRNAs were identified. For this, we compared the top 10 expressed miRNAs of each study, which resulted in 22 different miRNAs. For endometrium Day 10 or Day 12, mir-143-3p (∼12% of all reads), let-7i, let-7f, and let-7g (each ∼4% of all reads), mir-10b and mir-10d (each ∼5%) were classified as potential embryo-underrepresented miRNAs. Five miRNAs were identified as potential embryo-enriched miRNAs (mir-371-5p 16%, mir-302b 7%, mir-378 ∼5%, mir-7 ∼3%, and mir-302d ∼3%) (Fig. 6). These results were further validated by a search in the miRmine miRNA expression database, which contains expression data for human miRNA for 15 different tissues collected from a variety of deep sequencing studies. The results showed that miR-371-5p is also highly expressed in placenta and umbilical plasma, and at considerable levels in testis, semen/sperm, and salivary exosomes. Furthermore, high expression was found in tumor tissues. MiR-302b-5p and miR-302d-5p were not found as expressed in any dataset contained in miRmine. MiR-378-5p is highly expressed in umbilical plasma, blood macrophages, and other plasma and serum samples. Lower expression was also found in testis, placenta, and a variety of tumors. MiR-7-5p expression in the miRmine data sets was very high in pancreas beta cells and high in various plasma samples, testis, sperm, placenta, serum, salivary exosomes, CD4+ T cells and CD19+ B cells from blood, and different tumor tissues. The most dominant potential endometrium-specific miRNAs (mir-148a-3p, let-7i, mir-30a-5p, let-7g, mir-143-3p) were all equally high expressed in all 15 tissue types (Additional file 12: Table S6). MiR-21-5p was highly expressed in all tissues, which confirmed our findings from the comparison of embryo and endometrium.
IsomiR expression analysis
The top 30 highly expressed miRNAs represented ∼89% of all read counts and the top 4 miRNAs ∼51%. Of those, miR-21 was the most prominent, accounting for ∼22% of all miRNA read counts (Additional file 13: Table S7, Fig. 5). On average, 5 isomiRs per mature miRNA were detected (from 1 to 138). The greatest variety of different isomiRs were expressed by ssc-miR-371-5p (138) and ssc-miR-21 (67) (Additional file 14: Table S8). Modification only at the 3’ end occurred with 54% (1214), while 29% (649) had a modification at both sides and 10% (215) only at the 5’ end. The canonical form was found for 178 miRNAs (Fig. 7). The variations from the canonical form found in miRBase were most frequently added bases rather than deletions (Fig. 8a). At the 5’ end, most frequently one deletion or one addition were found (Fig. 8b). At the 3’ end, one or two added nucleotides per isomiR occurred most frequently (Additional file 14: Table S8, Fig. 8a). Regarding the expression levels per isomiR, mostly the canonical form was most abundant (∼68%), closely followed by isomiRs with 3’ end modifications (25%) (Fig. 9a). For miRNAs where the canonical form was not present (89 miRNAs), the highest expression level was detected for 3’ end modifications (∼ 61%), followed by modifications at both sides (25%) and modifications at the 5’ end (∼14%). Furthermore, the expression levels of isomiRs were analyzed. For the expression analysis on isomiR level, low (on average less than 20 reads per sample) and moderate up to high expressed isomiRs were defined (more than 20 reads per sample). In total, 1003 isomiRs were identified with low expression (Additional file 15: Table S9) and 1946 sequences as highly expressed. The isomiRs containing non-template nucleotides sequences could be detected at 5’ end (398), at 3’ end (1169) and or sequences that had modifications on both sides (381) (Fig. 7b). The analysis of expression levels of all non-templated miRNAs revealed that 28 sequences with ∼10% of the 3’ end non-template isomiRs and ∼ 3% of the 5’ non-template isomiRs showed the highest expression of the corresponding miRNA (Additional file 14: Table S8, Fig. 9b). The top 5 most frequently expressed miRNAs were hsa-miR-200a-3p, bta-miR-21-5p, bta-miR-378d, bta-miR-1246, and ssc-miR-210 ranging from ∼6000 till ∼62,000 read counts equally distributed through all samples.
In this study, we used core Galaxy tools complemented with customized scripts to make the analysis of porcine small RNA as straightforward as possible. We specifically included sequences of well-annotated species from various sequence databases to improve the annotation of the sequences found in the small RNA-seq results. This strategy allowed the identification of sequence fragments derived from rRNAs and tRNAs as well as other ncRNAs or mRNAse. Furthermore, a strategy to analyze variant isoforms of miRNAs (isomiRs) was developed [13, 58], increasing the percentage of annotated sequences and thereby adding additional information to the subsequent statistical data analysis of each expressed small RNA.
Earlier studies aimed at standing the role of differential miRNA expression patterns in embryonic and fetal preimplantation development and in embryo/feto-maternal interactions in pigs [13, 15, 59–62] (Flöter et al, under review). In contrast, the present study focused on the hatched blastocyst prior to conceptus elongation and specifically explored effects of a maternal estradiol-17 β exposition on embryonic miRNA patterns. To our knowledge, this is the first study investigating sex-specific miRNA profiles including isomiRs of single preimplantation embryos.
Development of a small RNA-seq analysis pipeline for incompletely annotated species
The pig served as large animal model for the effect of endocrine disrupting chemicals in humans. Compared to rodents, it better resembles the women regarding the endogenous placental estrogen production during fetal development . However, working with incompletely annotated species such as the pig makes the analysis of small RNA-seq data sets much more difficult compared to human or mouse. In pig, the miRNA annotation is rather incomplete with respect to the relatively low number of annotated miRNAs (342) that can be found in miRBase (version 21). Therefore, a corresponding analysis pipeline requires an approach that is focused on annotation of new miRNAs. The approach used in the present study is based on sequence comparison (BLAST) of the sequences obtained from small RNA-seq to various sequences derived from the pig, next to different types of transcripts from cattle and human. Since the generation of small RNA-seq libraries does not specifically select miRNA sequences , further potential source sequences have to be included such as rRNAs, tRNAs, and other ncRNAs in addition to the available precursor (pre-) and mature miRNAs. Particularly, fragments of tRNAs, rRNAs or other small ncRNAs have been falsely identified as miRNAs based on their respective size [64, 65]. The comparison of the results of the new analysis pipeline to a standard miRNA analysis pipeline (miRDeep2) revealed differences and some limitations, mainly on the side of miRDeep2. In particular, the mapping to other ncRNAs including tRNAs in the new pipeline turned out to be a great improvement to prevent misannotation of miRNAs. Both tools detected similar numbers of miRNA sequences, but only 62% of the sequences assigned to miRNAs were overlapping. The analysis of the non-overlapping sequences revealed that 1074 potential miRNA sequences from the miRDeep2 tool were derived from other ncRNAs such as rRNAs, tRNAs or other ncRNA classes. Other studies also noticed the problem of wrongly annotated miRNAs that were actually derived from tRNAs [64, 65]. Venkatesh et al. stated that such sequences would represent two new classes of regulatory non-coding small RNAs that are derived from the cleavage of pre-existing tRNAs, namely tRFs and tiRNAs. Schopman et al. (2010) concluded that such small RNA fragments described as miRNAs in miRBase are likely derived from tRNA processing. In our study, the respective sequences were present in a size range of miRNAs up to about 100 nt and were all covering the same tRNA sequence. Schopman et al. (2010) also concluded that the rapid release of data from small RNA-seq projects would lead to a misannotation of miRNAs in databases such as miRBase. Especially sequences derived from fragments of rRNAs and tRNAs are often highly abundant and present in varying percentages in small RNA libraries. This might introduce a severe bias in estimating miRNA expression levels and thereby affecting the detection of differentially expressed miRNAs.
Effects of a maternal estradiol-17 β treatment and of the sex of the embryo on miRNA expression in blastocysts
We recently demonstrated that the maternal low-dose estradiol-17 β treatment led to a perturbed endometrial mRNA expression profile of sows as well as their developing blastocysts  (Flöter et al, under review). This is of particular note, as low estradiol-17 β doses, considered to exert no effect, were applied . Because we found a shift in body composition  and bone density  in male and female offspring, respectively, we considered a sex-specific intergenerational epigenetic epigenetic reprogramming imposed as early as the preimplantation phase. Recently, it has been shown that the biogenesis of uterine miRNAs is steroidal regulation . Therefore, we hypothesized that the maternal oral E2 exposure could lead to an inference with normal miRNA expression in the early embryo. This could be attributed to a direct estradiol-17 β impact on the embryo on the one hand or through perturbations of the endometrial secretions due to the estrogen treatment leading to changes in the uterine milieu (Flöter et al, under review), that could in turn have effects on the developing embryo.
Similar to the analysis of miRNA expression in the endometrium samples (corresponding to the samples of the present study), where miRNA expression was not affected by E2 treatment, only small effects were found on male Day 10 embryos. Since only a very low number of miRNA sequences showed significant differences in male embryos of the low dose group compared to controls and the biological replicates did not group very well according to treatment or sex in the cluster and principal component analysis, it is difficult to say if these observations are true expression differences in response to the E2 feeding. Furthermore, despite some expression differences were found in male embryos, differential miRNA expression between female and male embryos was not detected. The observed clustering of embryos collected from the same sow is most likely attributed to a maternal effect on sibling embryos. In addition, a similar developmental stage for siblings can be assumed, since although the time point of fertilization varies between sows, it is almost the same for siblings within the same sow . However, considering the sow as batch effect in the statistical analysis did not change the results. It is possible that not every sow and/or embryo was affected by the treatment in the same way. However, we did not find a difference in variation among the control and the treatment groups. Overall, from the obtained results we can say that there is only a slight effect of the maternal estradiol-17 β feeding on miRNA expression only in male embryos.
With respect to the experimental model, the time point of the analysis may limit a generalization of the finding. The sample collection was performed one hour after the last estrogen dose, which might have been too late to observe a very rapid miRNA response. On the contrary, it is possible that the former ten days of treatment might have led to an adaption of miRNA transcription. Further studies are needed to verify the present findings. This adaption might account for the difference to an earlier study in which the singular exposure to estradiol-17 β at days 9 or 10 of pregnancy resulted in pregnancy failure . The pregnancy rate in our study, giving a continuous treatment twice daily from fertilization until Day 10, was not affected, not even in the high dose group . The treatment mode, namely i.m. injection versus oral application in our study, most likely lead to a different steroid metabolization possibly exerting the particular lack of an effect in our case. It thus remains to be further unraveled which circumstances estrogen treatment does not affect the miRNA expression of a target tissue.
Interestingly, a sex-specific differential miRNA expression between embryos was likewise excluded. The HCL analysis showed a partial clustering of embryos collected from the same sow, which is most likely attributed to a maternal effect on siblings embryos. In addition, a similar developmental stage for siblings can be assumed, since although the time point of fertilization varies between sows, it is almost the same for siblings within the same sow .
Identification of potential embryo-specific miRNAs
The comparison of the embryo miRNA profiles to small RNA-seq studies in porcine endometrium collected on Day 12  and Day 10 of pregnancy  identified not only miRNAs with similar expression levels for some of the most abundant miRNAs, but particularly also tissue-specific miRNAs. Both endometrium and embryo samples are mixed tissues containing many different cell types such as epithelial cells, fibroblasts, endothelial cells, and various immune cells in the endometrium, and trophectoderm and embryoblast/hypoblast cells in the Day 10 blastocyst . MiR-21 was highly expressed in both tissues and the most abundant miRNA in the embryo as well in Day 10 endometrium. A high expression of miR-21 was also reported in other studies in porcine embryos , oocytes , embryos of other species , and in a variety of somatic tissues as well as cancer cells [70, 71].
Four miRNAs (miR-371-5p, miR-302a, miR-302b, and miR-302d) were expressed only in blastocysts and not in endometrium. MiR-371-5p is located within a pregnancy-related miRNA cluster , and its increased expression has been found in the first compared to the third trimester human placenta . A database (miRmine) search revealed its high expression in 9 (bladder, brain, breast, placenta, plasma, saliva, semen, sperm, and testis) out of 15 tissues including placenta, which indicates a tissue-specific expression. Mir-302b was not expressed in any tissue contained in the database, which supports our finding from the comparison to the endometrium that miR-302b could be embryo-specific. MiR-371 and miR-302a,b,d have been found specifically expressed in human embryonic stem cells . In the mouse, for the homolog miRNA gene cluster to the human mir-371-373 cluster (mir-290-295) and the mir-302 cluster, specific expression in early embryos and embryonic germ cells was found, and an important role in pluripotency and embryonic development has been suggested [75–77]. An upregulation of miR-302a, miR-302-b, and miR-371-5p was found in porcine induced pluripotent stem cells in comparison to porcine embryonic fibroblasts . In the same study, the mir-302 gene cluster has been found to improve reprogramming efficiency. On average we detected 5 isomiRs per miRNA. Interestingly, for mir-302b, mir-302d, and mir-302a, which are not yet contained in the miRBase for the pig (miRBase 21), more different isoforms than the average were found (mir-302b (23), mir-302d (33), and mir-302a (42). All potential endometrium-specific miRNAs (mir-148a-3p, let-7i, mir-30a-5p, let-7g, mir-143-3p) were highly expressed in all 15 tissue types included in the miRmine database. This indicates that these miRNAs are not truly endometrium-specific. As the endometrium is a complex tissue comprising epithelial cells, fibroblasts, endothelial cells, and various immune cells, it remains to be determined in which of the endometrial cell types these miRNAs are actually expressed.
Analysis of isomiR expression in blastocysts
The analysis of isomiRs was performed according to a previous study investigating retina samples  to identify templated and non-templated variations at the 5’ and 3’ end. Therein, more variations in form of deletions in comparison to the canonical form rather than added bases have been found. Mostly, the isoforms were one base shorter than the canonical which occurred more often at the 5’ end. In contrast, the embryos displayed more added bases and in general more variations at the 3’ end. These could play a role in the effectiveness of miRNA targeting. Specifically, 3’ non-templated adenylation of the mature miRNA has been shown to interfere with incorporation into the RISC , whereas monouridylation of the pre-miRNA is required for processing by Dicer for group II let-7 miRNAs  and oligouridylation, in contrast, blocks uptake into Dicer [81, 82]. Adenylation at 3’ non-templated miRNAs tend to be higher with ∼35% compared to the other nucleotides C, G, and U with ∼20% whereas modifications of uridine could not be detected.
In addition, the distribution of expression level of different isomiR types (canonical, templated, 3’ and 5’ modifications) was analyzed for the highest expressed miRNAs. In accordance with Karali et al. (2016), the isomiR showing the highest expression was mostly the canonical or templated form. Furthermore, in studies of human embryonic stem cells  and human hepatocellular carcinoma  different isomiRs resulting from deletions and additions at the 5’ and 3 ’ side were identified and characterized. Consistently, the authors found more modifications at the 3’ end compared to the 5’ end and concluded that this was due to the greater constraint of the 5’ side. This finding is also supported by the possibility of shifting the seed region [84, 85], which can change the target mRNA spectrum.
In our analysis, we found a high number of non-templated isomiRs of rather low expression level. Non-templated isomiRs at the 3’ and 5’ end were only highly abundant in ∼11% of all cases (rank 1). Due to the possible target change they are often not dominantly expressed . As a limitation of our data set, most of the deletions of one nucleotide at the 5’ end were caused by the removal of the first position of the obtained sequence reads during quality trimming, since a “N” occurred at the first position in 15% of all sequences. This issue was derived from running the samples on the Illumina HiSeq 4000 instrument. However, five-prime non-templated events may turn out to be of special interest in case they can be shown to have a modified seed sequence and thereby could have a different range of target transcripts.
In conclusion, despite the lack of both sex-specificity and the significancy of effects of a maternal estrogen treatment on miRNA expression in blastocysts, the deep sequencing of small RNAs in porcine blastocysts revealed a high variety of expressed miRNAs and their isomiRs. A catalog of isoforms of known porcine miRNAs and many novel porcine miRNAs was generated indicating the high complexity of miRNA expression and regulation in pre-implantation embryo. Their role in embryo development specifically related to the observed robustness towards the maternal estrogenic treatment requires further exploration.
Basic Local Alignment Search Tool
counts per million
differentially expressed sequences
Endocrine disrupting chemical
false discovery rate
multiple scaling plot
National Center for Biotechnology Information
no observed effect level
RNA-induced silencing complex
reads per million
Colborn T, Vom Saal FS, Soto AM. Developmental effects of endocrine-disrupting chemicals in wildlife and humans. Environ Health Perspect. 1993; 101(5):378.
Bigsby R, Chapin RE, Daston GP, Davis BJ, Gorski J, Gray LE, Kembra L, Zoeller RT, Saal FS. Evaluating the Effects of Endocrine Disruptors on Endocrine Function during Development. Environ Health Perspect. 1999; 107(suppl 4):613–8.
Guillette Jr LJ, Crain DA, Rooney AA, Pickford DB. Organization versus activation: The role of endocrine-disrupting contaminants (EDCs) during embryonic development in wildlife. Environ Health Perspect. 1995; 103(Suppl 7):157.
Rhind SM. Endocrine disrupting compounds and farm animals : their properties, actions and routes of exposure. Domest Anim Endocrinol. 2002; 23:179–87.
Skinner MK. Role of Epigenetics in Developmental Biology and Transgenerational Inheritance. Birth Defects Res. 2011; 55(Part C):51–5. https://doi.org/10.1002/bdrc.20199.
Fürst RW, Kliem H, Meyer HHD, Ulbrich SE. Journal of Steroid Biochemistry and Molecular Biology A differentially methylated single CpG-site is correlated with estrogen receptor alpha transcription. J Steroid Biochem Mol Biol. 2012; 130(1-2):96–104. https://doi.org/10.1016/j.jsbmb.2012.01.009.
Rasier G, Toppari J, Parent A-S, Bourguignon J-P. Female sexual maturation and reproduction after prepubertal exposure to estrogens and endocrine disrupting chemicals: A review of rodent and human data. Mol Cell Endocrinol. 2006; 254-255:187–201. https://doi.org/10.1016/j.mce.2006.04.002. Puberty: A Sensor of Genetic and Environmental Interactions throughout Development.
Jarrell VL, Day BN, Prather RS. The transition from maternal to zygotic control of development occurs during the 4-cell stage in the domestic pig, Sus scrofa: quantitative and qualitative aspects of protein synthesis. Biol Reprod. 1991; 44(1):62–8. https://doi.org/10.1095/biolreprod44.1.62.
Bazer FW, Spencer TE, Johnson GA, Burghardt RC. Uterine receptivity to implantation of blastocysts in mammals Fuller. Front Biosci. 2011; S3(2):745–67.
Geisert RD, Thatcher WW, Roberts RM, Bazer FW. Establishment of Pregnancy in the Pig : III. Endometrial Secretory Response to Estradiol Valerate Administered on Day 11 of the Estrous Cycle. Biol Reprod. 1982; 27:957–65.
Flöter VL, Galateanu G, Fürst RW, Seidlová-wuttke D, Wuttke W, Möstl E, Hildebrandt TB, Ulbrich SE. Sex-speci fi c effects of low-dose gestational estradiol-17 b exposure on bone development in porcine offspring. Toxicology. 2016; 366-367:60–7. https://doi.org/10.1016/j.tox.2016.07.012.
Van der Weijden VA, Flöter VL, Ulbrich SE. Gestational oral low-dose estradiol-17 β induces altered DNA methylation of CDKN2D and PSAT1 in embryos and adult offspring. Sci Rep. 2018. https://doi.org/10.1038/s41598-018-25831-9.
Krawczynski K, Bauersachs S, Reliszko ZP, Graf A, Kaczmarek MM. Expression of microRNAs and isomiRs in the porcine endometrium: implications for gene regulation at the maternal-conceptus interface. BMC Genomics. 2015; 16(1):906. https://doi.org/10.1186/s12864-015-2172-2.
Mondou E, Dufort I, Gohin M, Fournier E, Sirard MA. Analysis of micrornas and their precursors in bovine early embryonic development. Mol Hum Reprod. 2012; 18(9):425–34. https://doi.org/10.1093/molehr/gas015.
Yang CX, Du ZQ, Wright EC, Rothschild MF, Prather RS, Ross JW. Small RNA profile of the cumulus-oocyte complex and early embryos in the pig. Biol Reprod. 2012; 87(5):117. https://doi.org/10.1095/biolreprod.111.096669.
Ross JW, Ashworth MD, White FJ, Johnson GA, Ayoubi PJ, DeSilva U, Whitworth KM, Prather RS, Geisert RD. Premature estrogen exposure alters endometrial gene expression to disrupt pregnancy in the pig. Endocrinology. 2007; 148(10):4761–73. https://doi.org/10.1210/en.2007-0599.
Bushati N, Cohen SM. microRNA functions. Annu Rev Cell Dev Biol. 2007; 23:175–205. https://doi.org/10.1146/annurev.cellbio.23.090506.123406.
Mattick JS, Makunin IV. Non-coding RNA. Hum Mol Genet. 2006; 15(1):17–29. https://doi.org/10.1093/hmg/ddl046.
Amaral PP, Mattick JS. Noncoding RNA in development. Mamm Genome Off J Int Mammal Genome Soc. 2008; 19(7-8):454–92. https://doi.org/10.1007/s00335-008-9136-7.
Basak J, Nithin C. Targeting Non-Coding RNAs in Plants with the CRISPR-Cas Technology is a Challenge yet Worth Accepting. Front Plant Sci. 2015; 6(November):1001. https://doi.org/10.3389/fpls.2015.01001.
Jarvis K, Robertson M. The noncoding universe,. BMC Biol. 2011; 9:52. https://doi.org/10.1186/1741-7007-9-52.
Hale BJ, Yang CX, Ross JW. Small RNA Regulation of Reproductive Function. Mol Reprod Dev. 2014; 159:148–59. https://doi.org/10.1002/mrd.22272.
Bidarimath M, Khalaj K, Wessels JM, Tayade C. MicroRNAs, immune cells and pregnancy. Cell Mol Immunol. 2014:538–47. https://doi.org/10.1038/cmi.2014.45.
Davis-Dusenbery BN, Hata A. Mechanisms of control of microRNA biogenesis. J Biochem. 2010; 148(4):381–92. https://doi.org/10.1093/jb/mvq096.
Krol J, Loedige I, Filipowicz W. The widespread regulation of microRNA biogenesis, function and decay. Nat Rev Genet. 2010; 11(9):597–610. https://doi.org/10.1038/nrg2843.
Kozomara A, Griffiths-Jones S. MiRBase: Annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 2014; 42:68–73. https://doi.org/10.1093/nar/gkt1181.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool,. J Mol Biol. 1990; 215(3):403–10. https://doi.org/10.1016/S0022-2836(05)80360-2.
Blankenberg D, Gordon A, Von Kuster G, Coraor N, Taylor J, Nekrutenko A, Team G. Manipulation of FASTQ data with galaxy. Bioinformatics. 2010; 26(14):1783–5. https://doi.org/10.1093/bioinformatics/btq281.
Goecks J. Galaxy - RNA-seq Analysis Exercise. https://usegalaxy.org/u/jeremy/p/galaxy-rna-seq-analysis-exercise. Accessed 25 Oct 2017.
Cock PJA, Chilton JM, Grüning B, Johnson JE, Soranzo N. NCBI BLAST+ integrated into Galaxy. GigaScience. 2015; 4:39. https://doi.org/10.1186/s13742-015-0080-7.
Guo L, Yang Q, Lu J, Li H, Ge Q, Gu W, Bai Y, Lu Z. A Comprehensive Survey of miRNA Repertoire and 3 9 Addition Events in the Placentas of Patients with Pre- Eclampsia from High-Throughput Sequencing. PLoS ONE. 2011; 6(6). https://doi.org/10.1371/journal.pone.0021072.
Starega-Roslan J, Witkos TM, Galka-Marciniak P, Krzyzosiak WJ. Sequence Features of Drosha and Dicer Cleavage Sites Affect the Complexity of IsomiRs. Molecular Sciences. 2015:8110–27. https://doi.org/10.3390/ijms16048110.
Zhang Y, Zang Q, Xu B, Zheng W, Ban R, Zhang H, Yang Y, Hao Q, Iqbal F, Li A, Shi Q. Data and text mining IsomiR Bank : a research resource for tracking IsomiRs. Bioinformatics. 2016; 32(September):2069–71. https://doi.org/10.1093/bioinformatics/btw070.
de la Mata M, Gaidatzis D, Vitanescu M, Stadler MB, Wentzel C, Scheiffele P, Filipowicz W, Grosshans H. Potent degradation of neuronal miRNAs induced by highly complementary targets. EMBO reports. 2015; 16(4):500–11. https://doi.org/10.15252/embr.201540078.
Martin HC, Wani S, Steptoe AL, Krishnan K, Nones K, Nourbakhsh E, Vlassov A, Grimmond SM, Cloonan N. Imperfect centered miRNA binding sites are common and can mediate repression of target mRNAs. Genome Biol. 2014; 15(3):1–22. https://doi.org/10.1186/gb-2014-15-3-r51.
Tan GC, Nicholas D. IsomiRs have functional importance. Malays J Pathol. 2015; 37(2):73–81.
Song J, Song J, Mo B, Chen X. Uridylation and adenylation of RNAs. Sci China Life Sci. 2015; 58(11):1057–66. https://doi.org/10.1007/s11427-015-4954-9.
Flöter VL, Lorenz AK, Kirchner B, Pfaffl MW, Bauersachs S, Ulbrich SE. Impact of preimplantational oral low-dose estradiol-17 β exposure on the endometrium: the role of mirna. Mol Reprod Dev; 0(ja). https://doi.org/10.1002/mrd.22975. https://onlinelibrary.wiley.com/doi/pdf/10.1002/mrd.22975.
JECFA. Evaluation of Certain Food Additives and Contraminants: Forty-ninth report of the Joint FAO/WHO Expert Committee on Food Additives. Technical report, World Health Organisation, Geneva. 1999. http://apps.who.int/iris/bitstream/10665/42142/1/WHO_TRS_884.pdf.
Ye J, Coulouris G, Zaretskaya I, Cutcutache I, Rozen S, Madden TL. Primer-BLAST : A tool to design target-specific primers for polymerase chain reaction. BMC Bioinformatics. 2012; 13(1):134.
EMBL-EBI. EBI - ArrayExpress. https://www.ebi.ac.uk/arrayexpress/. Accessed 25 Oct 2017.
Gordon A. Fastx toolkit. http://hannonlab.cshl.edu/fastx_toolkit/. Accessed 25 Oct 2017.
Bolger AM, Lohse M, Usadel B. Genome analysis Trimmomatic : a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30(15):2114–20. https://doi.org/10.1093/bioinformatics/btu170.
Intergalactic Utilities Commission. Galaxy - ToolShed. https://toolshed.g2.bx.psu.edu/. Accessed 25 Oct 2017.
Chen Y, Mccarthy D, Robinson M, Smyth GK. edgeR : differential expression analysis of digital gene expression data User’s Guide. UsersGuide. 2014;(March)
Stein L. GENERIC FEATURE FORMAT VERSION 3. 2010. https://github.com/The-Sequence-Ontology/Specifications/blob/master/gff3.md. Accessed 25 Oct 2017.
Rosenkranz D. piRNA cluster database : a web resource for piRNA producing loci. Nucleic Acids Res. 2016; 44(November 2015):223–30. https://doi.org/10.1093/nar/gkv1265.
Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010; 11(3):R25.
Zhou X, Lindsay H, Robinson MD. Robustly detecting differential expression in RNA sequencing data using observation weights. Nucleic Acids Res. 2014; 42(11). https://doi.org/10.1093/nar/gku310. 1312.3382.
Warnes GR, Bolker B, Bonebakker L, Gentleman R, Liaw WHA, Lumley T, Maechler M, Magnusson A, Moeller S, Schwartz M, Venables B, Huber W, Liaw A, Lumley T, Maechler M, Magnusson A, Moeller S, Others. gplots: Various R Programming Tools for Plotting Data. R package version 2.17.0. 2015; 2(4):2015. https://doi.org/10.1111/j.0022-3646.1997.00569.x. arXiv:1510.05677.
Reimers M, Carey VJ. Bioconductor : An Open Source Framework for Bioinformatics and Computational Biology Introduction : Bioconductor in Brief Bioconductor is a project devoted to the development of software and. Methods Enzymol. 2006; 411(2004):119–34. https://doi.org/10.1016/S0076-6879(06)11008-3.
Sievers F, Higgins DG. Clustal Omega. Current Protocols in Bioinformatics. 2014; 2014(December):3–13131316. https://doi.org/10.1002/0471250953.bi0313s48.
Bodenhofer U, Bonatesta E, Horejs C. Sequence analysis msa : an R package for multiple sequence alignment. Bioinformatics. 2015; 31(August):3997–9. https://doi.org/10.1093/bioinformatics/btv494.
Friedlaender MR, Mackowiak SD, Li N, Chen W, Friedla MR, Rajewsky N. miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res. 2011; 40(1):37–52. https://doi.org/10.1093/nar/gkr688.
Panwar B, Omenn GS, Guan Y. miRmine: a database of human miRNA expression profiles. Bioinformatics. 2017; 33(June 2014):1554–60. https://doi.org/10.1093/bioinformatics/btx019.
Zhang Y, Zang Q, Zhang H, Ban R, Yang Y, Iqbal F, Li A, Shi Q. DeAnnIso : a tool for online detection and annotation of isomiRs from small RNA sequencing data. Nucleic Acids Res. 2016; 44(May):166–75. https://doi.org/10.1093/nar/gkw427.
Li M, Xia Y, Gu Y, Zhang K, Lang Q, Chen L, Guan J, Chen H, Li Y, Li Q, Li X, Jiang A-A, Shuai S, Wang J, Zhu Q, Zhou X, Gao X, Li X. MicroRNAome of Porcine Pre- and Postnatal Development. PLoS ONE. 2010; 5(7). https://doi.org/10.1371/journal.pone.0011541.
Wessels JM, Edwards AK, Khalaj K, Kridli RT, Bidarimath M, Tayade C. The MicroRNAome of Pregnancy : Deciphering miRNA Networks at the Maternal-Fetal Interface. PLoS ONE. 2013; 8(11):1–14. https://doi.org/10.1371/journal.pone.0072264.
Krawczynski K, Najmula J, Bauersachs S, Kaczmarek MM. MicroRNAome of porcine conceptuses and trophoblasts: expression profile of micrornas and their potential to regulate genes crucial for establishment of pregnancy. Biol Reprod. 2015; 92(1):21. https://doi.org/10.1095/biolreprod.114.123588.
Su L, Liu R, Cheng W, Zhu M, Li X, Zhao S, Yu M. Expression Patterns of MicroRNAs in Porcine Endometrium and Their Potential Roles in Embryo Implantation and Placentation. PLoS ONE. 2014; 9(2). https://doi.org/10.1371/journal.pone.0087867.
Hoen PAC, Friedländer MR, Almlöf J, Sammeth M, Pulyakhina I, Anvar SY, Laros JFJ, Buermans HPJ, Karlberg O, Brännvall M, Consortium TG, Dunnen JTD, Ommen GJBV, Gut IG, Guigó R. Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories. Nat Biotechnol. 2013; 31(11). https://doi.org/10.1038/nbt.2702.
Schopman NCT, Heynen S, Haasnoot J, Berkhout B, Schopman NCT, Heynen S, Haasnoot J, Berkhout B. A miRNA-tRNA mix-up: tRNA origin of proposed miRNA. RNA Biol. 2010; 6286(September). https://doi.org/10.4161/rna.7.5.13141.
Venkatesh T, Suresh PS, Tsutsumi R. TRFs: miRNAs in disguise. Gene. 2016; 579(2):133–8. https://doi.org/10.1016/j.gene.2015.12.058.
Nothnick WB. NIH Public Access. PMC. 2011; 37(2):265–73. https://doi.org/10.1007/s12020-009-9293-9.Steroidal.
Soede NM, Kemp B. In synchronized pigs, the duration of ovulation is not affected by insemination and is not a determinant for early embryonic diversity. Theriogenology. 1992. 2 8AU6 1992. 1993 May 1:105.
Oestrup O, Hall V, Petkov SG, Wolf XA, Hyldig S, Hyttel P. From Zygote to Implantation : Morphological and Molecular Dynamics during Embryo Development in the Pig. Reprod Domest Anim. 2009; 44:39–49. https://doi.org/10.1111/j.1439-0531.2009.01482.x.
Stowe HM, Curry E, Calcatera SM, Krisher RL, Paczkowski M, Pratt SL. Cloning and expression of porcine Dicer and the impact of developmental stage and culture conditions on MicroRNA expression in porcine embryos. Gene. 2012; 501(2):198–205. https://doi.org/10.1016/j.gene.2012.03.058.
Krichevsky AM, Gabriely G. miR-21: a small multi-faceted RNA. J Cell Mol Med. 2009; 13(1):39–53. https://doi.org/10.1111/j.1582-4934.2008.00556.x.
Yang CH, Li K, Pfeffer SR, Pfeffer LM. The Type I IFN-Induced miRNA, miR-21. Pharmaceuticals. 2015; 8:836–47. https://doi.org/10.3390/ph8040836.
Morales-Prieto DM, Ospina-Prieto S, Chaiwangyen W, Schoenleben M, Markert UR. Pregnancy-associated miRNA-clusters. J Reprod Immunol. 2013; 97(1):51–61. https://doi.org/10.1016/j.jri.2012.11.001.
Gu Y, Sun J, Groome LJ, Wang Y, Gu Y, Sun J, Lj G, Differential WY. Differential miRNA expression profiles between the first and third trimester human placentas. AJP Endocrinol Metab. 2013:836–43. https://doi.org/10.1152/ajpendo.00660.2012.
Suh M-R, Lee Y, Kim Y, Kim S-K, Moon S-H, Lee JY, Cha K-Y, Chung M, Yoon S, Moon Y, Kim VN, Kim K-S. Human embryonic stem cells express a unique set of microRNAs. Dev Biol. 2004; 270:488–98. https://doi.org/10.1016/j.ydbio.2004.02.019.
Spruce T, Pernaute B, Di-gregorio A, Cobb BS, Merkenschlager M, Manzanares M, Rodriguez TA. Article An Early Developmental Role for miRNAs in the Maintenance of Extraembryonic Stem Cells in the Mouse Embryo. Dev Cell. 2010; 19(2):207–19. https://doi.org/10.1016/j.devcel.2010.07.014.
Medeiros LA, Dennis LM, Gill ME, Houbaviy H, Markoulaki S, Fu D. Mir-290 – 295 de fi ciency in mice results in partially penetrant embryonic lethality and germ cell defects. PNAS. 2011; 108(34):1–6. https://doi.org/10.1073/pnas.1111241108.
Wu S, Aksoy M, Shi J, Houbaviy HB. Evolution of the miR-290 – 295 / miR-371 – 373 Cluster Family Seed Repertoire. PLoS ONE. 2014; 9(9):1–15. https://doi.org/10.1371/journal.pone.0108519.
Karali M, Persico M, Mutarelli M, Carissimo A, Pizzo M, Singh Marwah V, Ambrosio C, Pinelli M, Carrella D, Ferrari S, Ponzin D, Nigro V, Di Bernardo D, Banfi S. High-resolution analysis of the human retina miRNome reveals isomiR variations and novel microRNAs. Nucleic Acids Res. 2016; 44(4):1525–40. https://doi.org/10.1093/nar/gkw039.
Burroughs AM, Ando Y, Hoon MJLD, Tomaru Y, Nishibu T, Ukekawa R, Funakoshi T, Kurokawa T, Suzuki H, Hayashizaki Y, Daub CO. A comprehensive survey of 3 9 animal miRNA modification events and a possible role for 3 9 adenylation in modulating miRNA targeting effectiveness. Genome Res. 2010:1398–410. https://doi.org/10.1101/gr.106054.110.
Heo I, Ha M, Lim J, Yoon M. -j., Park J. -e., Kwon SC, Chang H, Kim VN. Mono-Uridylation of Pre-MicroRNA as a Key Step in the Biogenesis of Group II let-7 MicroRNAs. Cell. 2012; 151(3):521–32. https://doi.org/10.1016/j.cell.2012.09.022.
Heo I, Joo C, Kim Y-K, Ha M, Yoon M-J, Cho J, Yeom K-H, Han J, Kim VN. TUT4 in Concert with Lin28 Suppresses MicroRNA Biogenesis through Pre-MicroRNA Uridylation. Cell. 2009; 138(4):696–708. https://doi.org/10.1016/j.cell.2009.08.002.
Ha M, Kim VN. Regulation of microRNA biogenesis. Nature Publishing Group. 2014; 15(8):509–524. https://doi.org/10.1038/nrm3838.
Guo L, Zhao Y, Yang S, Zhang H, Chen F. A genome-wide screen for non-template nucleotides and isomiR repertoires in miRNAs indicates dynamic and versatile microRNAome. Mol Biol Reprod. 2014:6649–58. https://doi.org/10.1007/s11033-014-3548-0.
Marco A, Hui JHL, Ronshaugen M, Griffiths-jones S. GBE Functional Shifts in Insect microRNA Evolution. GBE. 2010; 2:686–96. https://doi.org/10.1093/gbe/evq053.
Wheeler BM, Heimberg AM, Moy VN, Sperling EA, Holstein TW, Heber S, Peterson KJ. The deep evolution of metazoan microRNAs. Evol Dev. 2009; 68:50–68. https://doi.org/10.1111/j.1525-142X.2008.00302.x.
Bardou P, Mariette J, Escudié F, Djemiel C, Klopp C. jvenn: an interactive Venn diagram viewer. BMC Bioinformatics. 2014; 15:293. https://doi.org/10.1186/1471-2105-15-293.
We thank the Functional Genomics Center Zurich for sequencing your provided samples.
Availability of data and materials
The raw reads produced in this study were deposited in EBI ArrayExpress the accession number E-MTAB-6201.
Ethics approval and consent to participate
The experiments with sows were performed in accordance with the International Guiding Principles for Biomedical Research Involving Animals, as proposed by the Society for the Study of Reproduction, with the European Convention on Animal Experimentation and with the German Animal Welfare Act and approved by the Regierung of Oberbayern, Bavaria, Germany, on July 1st, 2009.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Complete analysis pipeline in Galaxy: 1) Quality control and trimming plus adapter clipping. 2) MiRNA pipeline to convert sequences into a count table. 3) Filtering count table and sequence identification. (PNG 828 kb)
Figure S2. Schema of the Galaxy pipeline: 1) to collapse FastQ file into the sequence and number of appearance ranked by the counts. 2) converts FASTA files into the Galaxy data file type “tabular” (tab-separated text files). 3) converts dashes in to tabs and 4) extracts selected columns of a given tabular file. 5) joins files by a selected identifier column. (PNG 4714 kb)
Table S1. Filtering details for potential miRNAs. (XLSX 73 kb)
Table S2. Blast result with all hits to database files found (E-MTAB-6201). (XLSX 2127 kb)
Figure S3. Library sizes of mapped reads per RNA type. (PNG 617 kb)
Table S3. Blast result with only hits related to miRNA features. (XLSX 246 kb)
Figure S4. Known and novel miRNAs and species annotation - unique sequences and read counts. In the outer ring the mapped unique sequences are shown related to mapped species. The inner cycle represents the corresponding read counts per species. (PNG 468 kb)
Figure S5. MDS plots of the top 300 and 2000 isomiRs. The different colors represent the six different treatment groups (three different E2 doses with respective male and female embryos). (PNG 533 kb)
Figure S6. Distance heatmap of all samples. (PNG 1065 kb)
Table S4. EdgeR result of all comparisons on isomiR level. (XLSX 32902 kb)
Table S5. Embryo Day 10 miRNAs expression versus endometrium Day 10 and Day 12. (XLSX 70 kb)
Table S6. MiRmine result of potential embryo- and endometrium-specific miRNAs. (XLSX 67 kb)
Table S7. Expression level embryo Day 10 in percentage. (XLSX 7 kb)
Table S8. IsomiR analysis. (XLSX 202 kb)
Table S9. IsomiR count table. (XLSX 594 kb)
About this article
Cite this article
Bick, J.T., Flöter, V.L., Robinson, M.D. et al. Small RNA-seq analysis of single porcine blastocysts revealed that maternal estradiol-17beta exposure does not affect miRNA isoform (isomiR) expression. BMC Genomics 19, 590 (2018). https://doi.org/10.1186/s12864-018-4954-9
- Small RNA-seq
- Pig embryos
- Estradiol treatment