Skip to main content
  • Methodology article
  • Open access
  • Published:

Increasing gene discovery and coverage using RNA-seq of globin RNA reduced porcine blood samples



Transcriptome analysis of porcine whole blood has several applications, which include deciphering genetic mechanisms for host responses to viral infection and vaccination. The abundance of alpha- and beta-globin transcripts in blood, however, impedes the ability to cost-effectively detect transcripts of low abundance. Although protocols exist for reduction of globin transcripts from human and mouse/rat blood, preliminary work demonstrated these are not useful for porcine blood Globin Reduction (GR). Our objectives were to develop a porcine specific GR protocol and to evaluate the GR effects on gene discovery and sequence read coverage in RNA-sequencing (RNA-seq) experiments.


A GR protocol for porcine blood samples was developed using RNase H with antisense oligonucleotides specifically targeting porcine hemoglobin alpha (HBA) and beta (HBB) mRNAs. Whole blood samples (n = 12) collected in Tempus tubes were used for evaluating the efficacy and effects of GR on RNA-seq. The HBA and HBB mRNA transcripts comprised an average of 46.1% of the mapped reads in pre-GR samples, but those reads reduced to an average of 8.9% in post-GR samples. Differential gene expression analysis showed that the expression level of 11,046 genes were increased, whereas 34 genes, excluding HBA and HBB, showed decreased expression after GR (FDR <0.05). An additional 815 genes were detected only in post-GR samples.


Our porcine specific GR primers and protocol minimize the number of reads of globin transcripts in whole blood samples and provides increased coverage as well as accuracy and reproducibility of transcriptome analysis. Increased detection of low abundance mRNAs will ensure that studies relying on transcriptome analyses do not miss information that may be vital to the success of the study.


Blood is a valuable resource to probe an animal’s physiological and pathological status as well as to obtain repeated samples before harvest, for example, monitoring the dynamic change of gene expression in response to disease, treatment, or aging, for which the onset of gene expression response is not known. However, transcriptomic analysis of blood samples is a challenge since blood is composed of heterogeneous cell types including red blood cells (99%), platelets (1%) and white blood cells (<1%; e.g., neutrophils, monocytes, basophils, lymphocytes and eosinophils) [1, 2]. In human blood, HBA and HBB are the most abundant transcripts (~52-76%) [3, 4]. The high level of globin transcripts in blood was reported to be the most limiting factor for accurate and sensitive detection of gene expression, especially for the less abundant transcripts [35]. This issue is a great concern for sequence-based approaches, in which the globin transcripts will be highly abundant and limit the potential coverage and detection of other transcripts from blood [3].

To date, several globin RNA reduction protocols have been successfully applied to gene expression studies in human [69]. GLOBINclearTM (Ambion, Austin, TX, USA), a commercial product widely used in human clinical research, removes up to 95% of the HBA and HBB transcripts in human whole blood samples and improves the efficacy of gene expression assays [4, 10, 11]. Further approaches developed by Affymetrix (Affymetrix Inc., Santa Clara, CA, USA) [5, 11] or PNA Bio Inc. (Thousand Oaks, CA, USA) [9, 10] also have differential reduction rates of globin transcripts in human blood. Globin RNA reduction improved the sensitivity and reproducibility of high throughput mRNA expression analysis of whole human blood samples [35, 7, 9, 10]. There is, however, neither a commercial GR product available nor any literature demonstrating the efficiency and effects of GR at global level for porcine whole blood [2].

Our objectives were to develop a porcine specific GR protocol and to evaluate the effects of GR treatment on gene discovery and coverage in RNA-seq experiments for swine.

Results and discussion

Comparisons of globin reduction methods

To determine the suitability of the GR process for porcine whole blood samples, we initially evaluated the efficacies of three distinct methods (GLOBINclearTM, biotinylated PNA and RNase H) with whole blood samples drawn from 12 pigs collected in either PAXgeneTM (n = 3) or TempusTM (n = 9) tubes. To evaluate and compare GR efficiency, we performed qPCR analysis of HBA and HBB transcripts with a pooled sample for GLOBINclear and PNA methods and 5 randomly selected samples for the RNase H method (Additional file 1: Table S1). The GLOBINclearTM-Human Kit (Ambion, Austin, TX, USA), commonly used in human samples, seemed to have merit as it employs a non-enzymatic magnetic method but its reduction efficiency in pig barely reached 64% and 67% for HBA and HBB transcripts, respectively (Additional file 1: Figure S1). The manufacturer confirmed that porcine HBA and HBB sequences had low sequence homology to their corresponding human oligonucleotide probes used in the GLOBINclearTM-Human Kit, but the degree of dissimilarity is not known because the human probe sequences used in the GLOBINclearTM Kit are not publicly available. Next, we designed porcine specific biotinylated PNA oligonucleotides and used them with the GLOBINclearTM Kit. This PNA oligo method, however, reduced levels of HBA and HBB transcripts only 40% and 34%, respectively (Additional file 1: Figure S1). Third, we evaluated the RNase H mediated GR method using porcine specific oligonucleotides modified from the Affymetrix GeneChip GR Protocol developed for reduction of human globin transcripts [5]. We examined the sequence similarities of HBA and HBB, especially the oligonucleotide sequences on 3′ UTR, between human and pig using Clustal Omega (Additional file 1: Figure S2) [12]. Due to a lack of consensus, we designed two sets of porcine specific oligonucleotides each for HBA and HBB (Table 1). This revised RNase H mediated GR protocol resulted in an average reduction of 94% of HBA and 92% of HBB transcripts from porcine whole blood samples (Additional file 1: Figure S1). Thus we determined that the RNase H GR method using our custom designed porcine specific oligos was the most efficient of the three GR methods tested here and confirmed its efficacy by RNA-seq (Additional file 1: Table S2).

Table 1 Porcine specific globin oligonucleotides used in RNase H-mediated globin reduction assay

Performance of GR protocol in an RNA-Seq experiment

After determining the valid and highly efficient GR method, we evaluated the effects of the RNase H GR treatment on gene discovery and coverage in an RNA-seq experiment. Since the above study on the GR method included samples collected using different blood collection tubes and RNA isolation methods, we evaluated the effects of the RNase H GR treatment on gene discovery and coverage in an RNA-seq experiment with a different set of 12 porcine blood samples collected in TempusTM tubes and for which the RNA was isolated by a magnetic bead based MagMaxTM kit.

More than 653 million (M) sequence reads generated from 12 pre- and post-GR samples passed Illumina’s CASAVA (v.1.8) filtration (Table 2). These reads were then aligned to the pig genome build 10.2 by TopHat (v. 2.0.8). After GR treatment, total filtered reads and mapped reads were reduced by an average of 6.1 M and 6.8 M reads, respectively, and globin reads were reduced by an average of 11.4 M reads. The percentage of globin reads among all aligned reads averaged 46.1% and of these, 84.7% were removed by GR treatment. The proportion of globin reads to mapped reads were 46.1% and 8.9% in pre- and post-GR samples, respectively, and proportions of HBA and HBB reads to mapped reads were significantly reduced to 5.2% from 26.1% and to 3.7% from 20.1%, respectively (p <0.001, Figure 1). Considering that human globin transcripts constitute 50-70% of the blood RNA [3, 4], the level of pig globin transcripts in pre-GR samples is comparatively low. A possible explanation for the lower level of porcine globin transcripts is that the pigs used in this study were only 1-2 months old, an age associated with rapid decreases in erythrocyte population size and hemoglobin concentration. Although pigs at birth have similar hematological values to adult pigs, by three days of age a 25% reduction in hemoglobin concentration has occurred and hemoglobin concentration then increased gradually from the age of 3 months due to the pig’s tremendous early growth rate, as much as eight times faster than humans [13, 14]. Thus, we expect the GR protocol will reduce more globin transcripts in newborn and adult pig blood RNAs.

Table 2 RNA-seq mapping statistics for pre- and post-globin reduction samples
Figure 1
figure 1

The average proportions of HBA and HBB reads to total mapped reads in pre- and post-GR samples. Solid and pattern bars show the average (± s.d.) proportions of globin reads in pre- and post-GR samples, respectively. RNase H mediated GR protocol decreased both proportions of HBA and HBB reads to mapped reads significantly (Paired Wilcoxon signed rank test, p <0.001).

Classification of samples based on RNA integrity number (RIN)

We examined the RIN changes after GR treatment of pig blood RNA and its effect on sequencing results. The quality of RNA was not changed overall (p >0.1); though 8 samples showed a reduction in RIN after GR treatment, only 3 samples showed a marked decrease of RIN (0.4-0.6) after GR. However, there was a reduction in RNA yield following GR treatment with only 33.3-78.2% of total RNA being recovered. Studies on GR treatment in humans also reported the reduction of RNA yields ranging from 52-95% of total RNA [3, 4, 7, 15]. The reasons for the significant reduction and the wide variation in RNA yield are not clear. To offset the RNA loss accompanying GR treatment, it would be desirable to prepare sufficient amounts of initial RNA. Because we identified possible bias introduced by RIN from the preliminary sequencing results (data not shown), we empirically classified the samples into three categories based on RIN after GR treatment: high (RIN ≥7), moderate (5 ≤ RIN <7), and low (RIN <5) representing ideal, critical and inferior RNA integrity for RNA-seq experiments, respectively.

Increased coverage of non-globin genes in post-GR samples

Following an approach similar to that described by Mastrokolias et al. [3], we investigated the effect of GR on enhancing the coverage of non-globin genes and the sensitivity of gene expression detection. Read count data was normalized by library size and DE genes between pre- and post-GR samples were determined using edgeR (see Methods). Compared to pre-GR samples, 11,046 genes showed higher level of detection (expression) and 34 genes (Table 3), excluding HBA and HBB and ENSSSCG00000014727 (hemoglobin subunit beta-like), showed lower level of detection after GR treatment (FDR <0.05) (Figure 2a). We checked for sequence similarities among these 34 genes and the four globin oligonucleotides for possible non-target specific hybridization, but found none.Figure 2b depicts a heatmap of the normalized log2 transformed expression of the 11,046 genes with higher level of detection in post-GR samples compared to pre-GR samples. It was observed that a large set of genes in the low RIN samples (within the boxes in Figure 2b) was considerably lower expressed than the corresponding set in the high/moderate RIN samples, both pre- and post-GR. We believe that these are the genes with the greatest degradation in the low RIN samples. We then examined the variation in gene body coverage from 5′ to 3′ in high/moderate and low RIN samples, respectively. Low RIN samples showed strong bias toward increased coverage at the 3′ end (Figure 3). Among the low-quality RNA samples, pre- and post-GR treatment showed the same trend of bias, which affirmed that the RNase H treatment was not the determining factor. High quality samples showed better coverage from 5′ to 3′ as well as at the ends in both pre- and post-GR treated samples. All low quality samples were biased toward increased coverage at the 3′ end, possibly due to the degradation of RNA. However, the number of unique genes detected did not differ significantly between low and high RIN samples.

Table 3 The list of 34 genes that showed decreased expression level after globin reduction
Figure 2
figure 2

Differential gene expression in pre- versus post-GR samples. a. MA plot revealed that the majority of the differentially expressed genes showed increased detection (11,046) in post-GR samples and only 34 (excluding globin genes) showed lower detection. b. Heatmap shows the expression profile across all samples for the 11,046 genes with increased detection by GR treatment. A large set of genes in the low RIN samples (within the orange boxes) was considerably lower expressed than the corresponding set in the high/moderate RIN samples.

Figure 3
figure 3

Normalized 5′ to 3′ sequence coverage by position showing lower expression in one representative low RIN sample. For lower expressed 4,792 genes in low RIN samples, the average read numbers of low RIN (<5) and moderate/high RIN (≥5) in pre- and post-GR samples is shown at each relative position.

The lower detection levels of a small number of genes in post-GR samples could also be due to the effect of RIN. We investigated all genes with decreased level of detection after GR (fold change <0) from each sample independently, regardless of statistical significance (Additional file 1: Figure S3). We observed that samples with the most notable RIN change after GR (RIN reduction ≥0.4) had the highest number of genes with decreased expression level (samples 4, 7 and 8; Additional file 1: Figure S3). In addition to the effect of RIN, technical variations or sampling effects could also contribute to differences in detection levels of genes.

Increased number of non-globin genes identified in post-GR samples

The number of detected genes (read counts >5) in post-GR samples was significantly increased compared to pre-GR samples (paired t-test) (Figure 4a). GR treatment increased the gene detection rate by 8.6% in high RIN samples, 2.2% in moderate and 5.4% in low RIN samples. It was also noticed that the number of additional genes identified in post-GR samples was higher for samples with a high RIN (Figure 4b). It may be noted that the detection rate was higher in high RIN samples compared to low RIN samples despite being sequenced at half the depth. Pre-GR, an average of 93 genes were uniquely detected in the high RIN group, whereas 243 genes were uniquely detected in the moderate/low RIN group. Post-GR, the corresponding uniquely detected genes in the two groups were 1,157 and 753, respectively (Additional file 1: Figure S4).

Figure 4
figure 4

Increased gene coverage as a result of globin reduction. GR treatment increased the detection of expressed genes (read >5). 6 high (RIN ≥7) and 2 moderate (5 ≤ RIN <7) RIN samples were sequenced in one lane and 4 low (RIN <5) RIN samples were sequenced in another lane. a) Comparisons of number of expressed genes in pre- and post-GR treatment. b) RIN influence on identifying additional genes post-GR treatment.

We next determined genes expressed in porcine whole blood using all 12 samples, based on the criterion that a gene was detected at read counts above 5 in at least 5 of the 12 samples. We identified 12,588 genes in post-GR samples and 11,826 genes in pre-GR samples with an overlap of 11,773 genes (Figure 5). Excluding the overlap, 815 genes were detected only in post-GR samples, whereas 53 were specific to pre-GR samples. The small number of genes found specific to pre-GR samples may be due to the effect of RIN or technical variations. A comparison of the mean expressions of the set of 11,773 genes detected in both pre- and post-GR samples and the 815 genes detected only in post-GR samples revealed increased expression in post-GR samples (Additional file 1: Figure S5). The mean expression of the 815 additional genes in post-GR samples was well below the lower quartile of the expression levels of genes common to both pre- and post-GR samples. Thus GR treatment increases the ability to detect genes expressed at very low levels.

Figure 5
figure 5

Venn diagram of number of genes identified as expressed in pre- versus post-GR samples. There is significant overlap (11,773) between pre- and post-GR samples. An additional 815 genes were identified in only post-GR samples.


The porcine specific GR protocol described here successfully removed a significant proportion of the HBA and HBB transcripts prior to sequence analysis. The range of gene discovery from RNA sequencing was extended with significant increases in number of identified genes via improved coverage. Our DE analyses using the GR samples showed increased sensitivity, with no observed strong negative effects as a result of the GR protocol. We also demonstrated the effects of RIN on blood RNA-seq analyses. Thus, the GR protocol incorporated into porcine blood transcriptomics will help advance pig physiological, pathological and blood biomarker studies, by providing more targets for drug development and disease resistance research.


Blood samples and RNA isolation

Animal protocols were approved by the Kansas State University and University of Alberta Animal Care and Use Committees. A total of 24 blood samples were used to conduct two independent studies: comparisons of three GR methods to select the best method and evaluating the effects of the selected GR method on an RNA-seq experiment. For the first study, 3 mL of blood samples from 9 pigs of 1-2 months age produced from Landrace x Large White selected from a commercial populations used in the Porcine reproductive and respiratory syndrome Host Genetics Consortium (PHGC) studies [16] were collected in TempusTM Blood RNA tubes (Life Technologies, Carlsbad, CA, USA) and 2.5 mL of blood samples from 3 crossbred pigs of Large White x Landrace were collected in PAXgeneTM Blood RNA tubes (PreAnalytiX, Qiagen, Germany) at the University of Alberta. Total RNA was isolated with PAXgeneTM Blood RNA Kit (PreAnalytiX, Qiagen) for PAXgeneTM tubes and either TempusTM Spin RNA Isolation Kit (Life Technologies, Carlsbad, CA, USA) or magnetic bead based MagMaxTM for Stabilized Blood Tubes RNA Isolation Kit for Tempus tubes (Life Technologies), according to the respective manufacturer’s instructions.

For the second study evaluating the effects of RNase H mediated GR protocol on RNA-seq, another set of 12 blood samples were drawn from crossbred pigs of Duroc x (Landrace x Yorkshire) in a PHGC population. Three mL of blood from each pig at 1-2 months of age was collected into TempusTM Blood RNA Tubes at Kansas State University. Total RNA was isolated using the MagMaxTM for Stabilized Blood Tubes RNA isolation kit according to the manufacturer’s protocol.

RNA concentration was quantified using a NanoDrop ND-1000 spectrophotometer (Nano-Drop Technologies, Wilmington, DE, USA) and RNA quality was assessed using an Agilent Bioanalyzer 2100 (Agilent Technologies, Inc., Santa Clara, CA, USA). To determine an accurate 28S/18S rRNA ratio in the pig, we aligned the human 28S sequence against pig genome build 10.2 using BLAST and identified 97-100% of similarity on pig chromosome 6: 871128-866484 (Ensembl release 73). The sizes of the 28S and 18S genes in pig were estimated to be 4645 bp and 2302 bp, respectively, yielding an rRNA ratio of 2.02, whereas the rRNA ratios in human and mouse are known to be 2.69 and 2.53, respectively (the ratio obtained from Genbank database; M11167 and X03205 in human and NR003279 and NR003278 in mouse).

Design of porcine specific oligonucleotides

We first tested GLOBINclearTM Human Kit (Ambion, Austin, TX, USA) which hybridized biotinylated oligonucleotides with globin transcripts by binding to Streptavidin Magnetic Beads. Second, we designed porcine specific biotinylated Peptide Nucleic Acids (PNA) oligos to inhibit reverse transcription of globin transcripts (HBA: 5′-CGAGGCTCCAGCTTA-3′ and HBB: 5′-CACCAGCCACCACCT-3′). Third, we designed four porcine specific antisense oligonucleotides for HBA and HBB using Primer 3 (v. 0.4.0) (Table 1) to hybridize with globin transcripts prior to digestion with RNase H. To design porcine specific oligonucleotides, we first used Clustal Omega ( to align the porcine HBA (ENSSSCT00000008741) and HBB (ENSSSCT00000016076) transcript sequences in the current assembly of the pig genome (build 10.2) with their orthologues from human, mouse, cow and pig obtained from the Ensembl database ( and then checked the similarity of the 3′ end hybridization sites (Additional file 1: Figure S2).

Globin reduction treatment

GR treatment with porcine specific oligonucleotides was performed using a modified Affymetrix GR protocol [5]. In brief, 10X GR oligonucleotides mix was prepared adding 100 uL each of two HBA Oligos at 30 uM, two HBB Oligos at 120 uM per reaction, yielding a final concentration of 7.5 uM HBA Oligos and 30 uM HBB Oligos. Three ug of denatured total RNA was hybridized in a thermal cycler at 70°C for 2 min with the 400 uL 10X GR oligonucleotides mix in hybridization buffer (100 mM Tris-HCL, pH 7.6; 200 mM KCl) at 70°C for 5 min, then cooled to 4°C. The RNA-DNA hybrids were digested with 2 U RNase H (Ambion) in the reaction buffer (100 mM Tris–HCl, pH 7.6, 20 mM MgCl2, 0.1 mM DTT, SUPERase-In) at 37°C for 10 min and cooled to 4°C. The reaction was stopped by addition of 1.0 ul 0.5 M EDTA. RNase H treated RNA was immediately purified with the RNeasy MinElute Cleanup Kit (Qiagen, Toronto, Canada, Cat. No.: 74204) according to manufacturer’s instructions. RNA quality of GR treated samples was assessed using an Agilent Bioanalyzer 2100 (Agilent Technologies, Inc.).

Quantitative real-time PCR (qPCR) analysis

We quantified the mRNA level of the porcine HBA and HBB transcripts by SYBR Green I based qPCR using a StepOneTM Real-Time PCR System (Applied Biosystems, Foster City, CA, USA). First strand cDNA was synthesized using SuperScript® II reverse transcriptase (Invitrogen) and random hexamer primers in a final volume of 20 μL following the manufacturer’s instruction. SYBR Green I based qPCR was performed in a total volume of 10 μL per reaction comprising 2 μL of template, 1 μL of the assay-specific primer mix, 5 μL of the Fast SYBR® Green Master Mix Bulk Pack (Applied Biosystems) and 2 μL of water. The reaction conditions used were one cycle of 95°C for 3 min for initial denaturation, 23 cycles of 95°C for 30 s and 60°C for 30 s. The primer sequences are shown in Additional file 1: Table S1.

Library preparation for sequencing

Poly-A + fractions from the GR treated samples and respective non GR treated samples (1.5 μg RNA each) were purified by using oligo-dT magnetic beads (Illumina, Inc., San Diego, USA), and used to construct cDNA libraries. The Poly (A+) RNA was primed with random hexamers and fragmented at 94°C for 8 min. Second strand cDNA was synthesized after the first strand cDNA using SuperScript II (Invitrogen). The cDNA fragments were end-repaired and a single ‘A’ nucleotide was added to 3′-ends to prevent them from cross ligation during the adapter ligation step. Then individual RNA adapter index oligos were ligated to the end-repaired cDNA and subsequently amplified using Veriti Thermo cycler (Applied Biosystems). The initial denaturation was performed at 98°C for 30 seconds, followed by 15 cycles at 98°C for 10 seconds, annealing at 60°C for 30 seconds and extension at 72°C for 30 seconds. The final extension was followed at 72°C for 5 minutes, and held at 10°C.

The quality and size (~260 bp) of the resulting cDNA libraries were assessed using the High sensitivity DNA Kit (Agilent Technologies, Inc.) in an Agilent Bioanalyzer 2100 (Agilent Technologies, Inc.). The quantification was performed using StepOneTM Real-Time PCR System (Applied Biosystems), as suggested in the Sequencing Library qRT-PCR Quantification Guide (Illumina, Inc.). The KAPA SYBR® FAST ABI Prism qPCR Kit (Kapa Biosystems, Inc., Woburn, USA) was used for the qPCR reactions. The individual libraries were pooled into 2 nM after quantification.

Sequencing was performed on the HiSeq System (Illumina, Inc.). The pooled 10 μL of the 2 nM libraries were diluted and denatured. The pooled cDNA libraries (12 pM) were loaded on the cBot (Illumina, Inc.) for clustering on a flow cell, and single-read cluster generation proceeded using the TruSeqTM SR Cluster Generation Kit v3 (Illumina, Inc., Cat.: FC-930-3001). A portion of each library was diluted to 10 nM and stored at -20°C. Fifty cycles of sequencing-by-synthesis using the paired-end protocol was performed on a HiSeq (Illumina, Inc.) according to manufacturer’s instructions. Real-time analysis and base calling was performed using the HiSeq Control Software Version 1.4.8 (Illumina, Inc.).

Bioinformatic analysis

Sequence reads with base quality scores were produced by the Illumina sequencer. Raw reads were processed using the Illumina CASAVA (v. 1.8) to filter out the low-quality reads. Sequence reads were then aligned to the pig genome reference assembly (build 10.2; [17]) using TopHat 2.0.8 [18] with default parameters. The number of reads uniquely mapped to each gene (Ensembl 71 annotation) was determined using Htseq-count (v0.5.3.p3; [19]). To determine number of genes identified in each sample, we required a read count >5.

To identify genes detected at decreased or increased levels between the globin reduced and non-reduced samples, the read count data were analysed using edgeR (version 3.0.8) [20] in R (version 2.15.2), as described [3]. Count data was normalized by the library size to account for different numbers of reads obtained from each sample. To determine differences in detection levels between the two groups, an exact test for the negative binomial distribution was used. The genes were considered to be differentially detected at FDR <0.05. RSeQC (v2.3.3) [21] was used for read distribution over gene body to check 5′/3′ bias. We used BlastN (v2.2.25) [22] to perform the alignment between globin oligos and the genes with decreased levels after GR treatment.


  1. Klem TB, Bleken E, Morberg H, Thoresen SI, Framstad T: Hematologic and biochemical reference intervals for Norwegian crossbreed grower pigs. Veterinary clinical pathology/American Society for Veterinary Clinical Pathology. 2010, 39 (2): 221-226.

    Article  PubMed  Google Scholar 

  2. Takahashi J, Misawa M, Iwahashi H: Oligonucleotide microarray analysis of age-related gene expression profiles in miniature pigs. PLoS One. 2011, 6 (5): e19761-10.1371/journal.pone.0019761.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  3. Mastrokolias A, den Dunnen JT, van Ommen GB, 't Hoen PA, van Roon-Mom WM: Increased sensitivity of next generation sequencing-based expression profiling after globin reduction in human blood RNA. BMC Genomics. 2012, 13: 28-10.1186/1471-2164-13-28.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  4. Field LA, Jordan RM, Hadix JA, Dunn MA, Shriver CD, Ellsworth RE, Ellsworth DL: Functional identity of genes detectable in expression profiling assays following globin mRNA reduction of peripheral blood samples. Clin Biochem. 2007, 40 (7): 499-502. 10.1016/j.clinbiochem.2007.01.004.

    Article  CAS  PubMed  Google Scholar 

  5. Wu K, Miyada G, Martin J, Finkelstein D: Globin reduction protocol: A method for processing whole blood RNA samples for improved array results. Affymetrix Technical Note. 2007, Available at:

    Google Scholar 

  6. Tian Z, Palmer N, Schmid P, Yao H, Galdzicki M, Berger B, Wu E, Kohane IS: A practical platform for blood biomarker study by using global gene expression profiling of peripheral whole blood. PLoS One. 2009, 4 (4): e5157-10.1371/journal.pone.0005157.

    Article  PubMed Central  PubMed  Google Scholar 

  7. Vartanian K, Slottke R, Johnstone T, Casale A, Planck S, Choi D, Smith J, Rosenbaum J, Harrington C: Gene expression profiling of whole blood: Comparison of target preparation methods for accurate and reproducible microarray analysis. BMC Genomics. 2009, 10 (1): 2-10.1186/1471-2164-10-2.

    Article  PubMed Central  PubMed  Google Scholar 

  8. Wright C, Bergstrom D, Dai H, Marton M, Morris M, Tokiwa G, Wang Y, Fare T: Characterization of Globin RNA Interference in Gene Expression Profiling of Whole-Blood Samples. Clin Chem. 2008, 54 (2): 396-405. 10.1373/clinchem.2007.093419.

    Article  CAS  PubMed  Google Scholar 

  9. Debey S, Zander T, Brors B, Popov A, Eils R, Schultze JL: A highly standardized, robust, and cost-effective method for genome-wide transcriptome analysis of peripheral blood applicable to large-scale clinical trials. Genomics. 2006, 87 (5): 653-664. 10.1016/j.ygeno.2005.11.010.

    Article  CAS  PubMed  Google Scholar 

  10. Liu J, Walter E, Stenger D, Thach D: Effects of globin mRNA reduction methods on gene expression profiles from whole blood. JMD. 2006, 8 (5): 551-558. 10.2353/jmoldx.2006.060021.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  11. Whitley P, Moturi S, Santiago J, Johnson C, Setterquist R: Improved microarray sensitivity using whole blood RNA samples. Ambion Tech Notes. 2005, 12: 20-23.

    Google Scholar 

  12. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG: Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011, 7 (1): 539-544.

    Article  PubMed Central  PubMed  Google Scholar 

  13. Miller ER, Ullrey DE, Ackermann I, Schmidt DA, Luecke RW, Hoefer JA: Swine hematology from birth to maturity. II. Erythrocyte population, size and hemoglobin concentration. J Anim Sci. 1961, 20: 890-897.

    CAS  PubMed  Google Scholar 

  14. Ramirez CG, Miller ER, Ullrey DE, Hoefer JA: Swine hematology from birth to maturity. II. Erythrocyte population, size and hemoglobin concentration. J Anim Sci. 1963, 22 (4): 1068-1074.

    Google Scholar 

  15. Shin H, Shannon CP, Fishbane N, Ruan J, Zhou M, Balshaw R, Wilson-McManus JE, Ng RT, McManus BM, Tebbutt SJ: Variation in RNA-Seq Transcriptome Profiles of Peripheral Whole Blood from Healthy Individuals with and without Globin Depletion. PLoS One. 2014, 9 (3): e91041-10.1371/journal.pone.0091041.

    Article  PubMed Central  PubMed  Google Scholar 

  16. Lunney J, Steibel J, Reecy J, Fritz E, Rothschild M, Kerrigan M, Trible B, Rowland R: Probing genetic control of swine responses to PRRSV infection: current progress of the PRRS host genetics consortium. BMC proceedings. 2011, 5 (Suppl 4): S30-10.1186/1753-6561-5-S4-S30.

    Article  PubMed Central  PubMed  Google Scholar 

  17. Groenen MAM, Archibald AL, Uenishi H, Tuggle CK, Takeuchi Y, Rothschild MF, Rogel-Gaillard C, Park C, Milan D, Megens H-J, Li S, Larkin DM, Kim H, Frantz LAF, Caccamo M, Ahn H, Aken BL, Anselmo A, Anthon C, Auvil L, Badaoui B, Beattie CW, Bendixen C, Berman D, Blecha F, Blomberg J, Bolund L, Bosse M, Botti S, Bujie Z, et al: Analyses of pig genomes provide insight into porcine demography and evolution. Nature. 2012, 491 (7424): 393-398. 10.1038/nature11622.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  18. Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL: TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013, 14 (4): R36-10.1186/gb-2013-14-4-r36.

    Article  PubMed Central  PubMed  Google Scholar 

  19. Anders S, Pyl PT, Huber W: HTSeq – A Python framework to work with high-throughput sequencing data. Bioinformatics. 2014, ᅟ: ᅟ-Sep 25, doi:10.1093/bioinformatics/btu638. [Epub ahead of print]

    Google Scholar 

  20. Robinson MD, McCarthy DJ: Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010, 26 (1): 139-140. 10.1093/bioinformatics/btp616.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  21. Wang L, Wang S, Li W: RSeQC: quality control of RNA-seq experiments. Bioinformatics. 2012, 28 (16): 2184-2185. 10.1093/bioinformatics/bts356.

    Article  CAS  PubMed  Google Scholar 

  22. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410. 10.1016/S0022-2836(05)80360-2.

    Article  CAS  PubMed  Google Scholar 

Download references


This project was supported by Applied Livestock Genomics Program (ALGP) grants #13 and #29, Genome Canada grant #2209_F, the USDA ARS project #1245-32000-098, and the NRSP-8 Bioinformatics Coordination project. The authors gratefully acknowledge the suggestions and advice of Dr. James Koltes at Iowa State University.

Author information

Authors and Affiliations


Corresponding authors

Correspondence to Joan K Lunney or Le Luo Guan.

Additional information

Competing interests

The authors declare they have no competing interests.

Authors’ contributions

AH, IC, HB, AK, GSP, CKT, SA, JKL, LLG designed the GR strategy; AH, IC, SA, XS, YM performed the RNA purification, GR treatment, RIN and qPCR analyses as well as RNA-sequencing; HB, AK, PS, JMR, EFW designed the RNAseq data analyses pipeline; all authors contributed to writing the final manuscript. All authors read and approved the final manuscript.

Igseo Choi, Hua Bao, Arun Kommadath contributed equally to this work.

Electronic supplementary material


Additional file 1: Table S1.: Primer sequences used in qPCR. Table S2. Blood collection tube, RNA isolation methods, sequence statistics, number of expressed genes and globin reads count in pre- and post-globin reduction samples. Figure S1. qPCR results for HBA and HBB gene expression comparing three different globin reduction methods. Figure S2. Alignment of orthologous HBA and HBB cDNA sequences in human, mouse, cattle and pig. Figure S3. Differential gene expression in pre- and post-GR samples. Figure S4. Individual Venn diagrams showing the number of genes detected by RNA-seq in pre- and post-GR samples. Figure S5. Comparison of the mean expressions of the set of genes detected in both pre- and post-GR samples and the genes detected only in post-GR samples. (DOCX 570 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit

The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Choi, I., Bao, H., Kommadath, A. et al. Increasing gene discovery and coverage using RNA-seq of globin RNA reduced porcine blood samples. BMC Genomics 15, 954 (2014).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: