RNA-seq and microarray complement each other in transcriptome profiling
© Kogenaru et al.; licensee BioMed Central Ltd. 2012
Received: 18 July 2012
Accepted: 5 November 2012
Published: 15 November 2012
RNA-seq and microarray are the two popular methods employed for genome-wide transcriptome profiling. Current comparison studies have shown that transcriptome quantified by these two methods correlated well. However, none of them have addressed if they complement each other, considering the strengths and the limitations inherent with them. The pivotal requirement to address this question is the knowledge of a well known data set. In this regard, HrpX regulome from pathogenic bacteria serves as an ideal choice as the target genes of HrpX transcription factor are well studied due to their central role in pathogenicity.
We compared the performance of RNA-seq and microarray in their ability to detect known HrpX target genes by profiling the transcriptome from the wild-type and the hrpX mutant strains of γ-Proteobacterium Xanthomonas citri subsp. citri. Our comparative analysis indicated that gene expression levels quantified by RNA-seq and microarray well-correlated both at absolute as well as relative levels (Spearman correlation-coefficient, rs > 0.76). Further, the expression levels quantified by RNA-seq and microarray for the significantly differentially expressed genes (DEGs) also well-correlated with qRT-PCR based quantification (rs = 0.58 to 0.94). Finally, in addition to the 55 newly identified DEGs, 72% of the already known HrpX target genes were detected by both RNA-seq and microarray, while, the remaining 28% could only be detected by either one of the methods.
This study has significantly advanced our understanding of the regulome of the critical transcriptional factor HrpX. RNA-seq and microarray together provide a more comprehensive picture of HrpX regulome by uniquely identifying new DEGs. Our study demonstrated that RNA-seq and microarray complement each other in transcriptome profiling.
Transcriptome of an organism represents the entire repertoire of transcripts encoded by the genes as a phenotypic response to the condition in which they exist. The sheer ability to simultaneously quantify the expression levels for a vast number of genes has revolutionized the biomedical research, facilitating the analysis of global gene expression patterns at the genome-wide scale . In the past decade, there has been a tremendous progress in the development of methods to deduce and quantify the gene expression levels at the whole transcriptome level . Among the several transcriptome profiling methods, RNA-seq and DNA microarray stand out as the two widely used genome-wide gene expression quantification methods [1–17].
RNA-seq method involves the conversion of isolated transcripts into the complementary DNA (cDNA), which is then directly sequenced in a massively parallel deep-sequencing-based approach . By mapping the resulting short sequencing reads onto the reference genome, the expression levels of genes relative to the condition of interest or absolute levels can be quantified [9, 11]. This method has been implemented in different platforms like Illumina’s Genome Analyzer, Roche 454 Genome Sequence, and Applied Biosystems’ SOLiD . On the other hand, microarray is based on the hybridization of specimen target strands onto the immobilized complementary probe strands. For example, in a two-color microarray, transcripts extracted from different conditions are labeled with distinct fluorescent dyes while being converted to cDNA. These labeled samples are then hybridized to the immobilized complementary probe strands in an array representing the genes. By measuring the light intensity of the distinct fluorescent dyes, the relative abundance of each transcript in the two different conditions can be measured [8, 12, 13, 17, 19, 20]. Affymetrix and Agilent are the two prevalent platforms in microarray technology [2, 14].
Even though, initially microarray has been instrumental in whole transcriptome analysis, currently RNA-seq is becoming a preferred method of choice, since it is considered to effectively surmount the limitations of microarray [1, 21–23]. RNA-seq technology, unlike microarray, does not depend on the prerequisite knowledge of the reference transcriptome . Further, RNA-seq data contains very low background signal, a higher dynamic range of expression levels, and also relatively small amount of total RNA required for quantification, when compared to microarray [1, 23]. Despite these advantages, the efficiency of RNA-seq is marred with the problem of overwhelming amount of ribosomal RNA (rRNA) in the data, short reads, less base accuracy, and variation of read density along the length of the transcript, posing a challenge for this high-throughput method [21, 25, 26]. However, in spite of their strengths and limitations, RNA-seq and microarray have become the default popular methods of choices for genome-wide transcriptome studies [1, 2, 23].
Currently several studies have been conducted to compare the performance of RNA-seq and microarray in quantifying the expression level of genes, by focusing on various aspects like reproducibility, accuracy, statistical issues, technical and biological variabilities [1, 15, 21, 27–30]. The main conclusion from these studies has been that the expression levels quantified by these two methods correlated to a large extent, and overall favored the RNA-seq because of high reproducibility, accuracy, and dynamic range [27, 29]. However, none of these comparison studies have addressed if these two methods complement each other in transcriptome profiling given the strengths and limitations associated with them. In order to address this question, we require an already well characterized dataset. The HrpX regulome from Xanthomonas citri subsp. citri (Xcc) serves as an ideal data model in this regard [31–33]. Xcc is a causal agent of citrus canker, one of the serious and destructive diseases in citrus that is resulting in significant losses to citrus industry worldwide , while HrpX is a key global transcription factor that regulates the expression of hrp (hypersensitive response and pathogenicity) cluster of genes, which are considered as the major pathogenicity factors [31, 35]. HrpX contains AraC-type of DNA binding domain, which specifically recognizes the plant-inducible promoter (PIP) box (TTCGC-N15-TTCGC) and imperfect PIP box (TTCGC-N8-TTCGT) present in the cis-regulatory regions of hrp gene cluster [36–38]. Since HrpX has a key role in pathogenicity, tremendous progress has been made in cataloguing the target genes of HrpX [39–45]. We therefore assessed the performance of RNA-seq and microarray in their ability to detect known HrpX target genes. We chose Illumina and Agilent as the corresponding platforms for RNA-seq and microarray, as they are the most popular platforms for these technologies [2, 4].
In order to uncover the regulome of HrpX transcription regulator by profiling the wild-type and the hrpX mutant strains transcriptome, we had designed a microarray chip covering the whole genome under Agilent platform in our previous study . Here, we conducted genome-wide transcriptome profiling of these two strains by RNA-seq and compared the results to the previously published microarray data, to assess the performance of these two methods. Further, to avoid technical variation associated with RNA isolation, we used the aliquots from the same total RNA samples used for microarray experiments also for RNA-seq.
We obtained 16,431,283, 17,289,220, 18,124,120 sequence reads for the wild-type and 15,084,955, 17,831,920, and 18,115,115 for the hrpX mutant strain with a median sequence length of 74-base pairs (bp) (Additional file 1: Table S1). Raw reads often have high sequencing errors, especially in the 3′ end where there is a high chance of sequencing errors to occur . We therefore filtered the reads for high quality ones by trimming off the base pairs with low quality score assigned to them during down-line processing of RNA-seq. More than 90% of the reads passed the quality filter, as a result, the median sequence length of quality filtered reads subsequently dropped to 68-bp (Additional file 1: Table S1). We then mapped these high quality trimmed reads on to the Xcc genome. Approximately more than 90% of the reads could be mapped on to the reference genome, indicating good sequence coverage (Additional file 1: Table S1). Overall ~97% of the annotated genes had more than one read mapped, while merely ~3% of the annotated genes had no reads mapped, indicating good sequencing depth. Further, we also observed a difference in the sequence coverage between the chromosome and the two endogenous plasmids of Xcc. Annotated coding genes from the chromosome with a size of 5.18 mega base pairs (Mb) had 98% sequence coverage, whereas, it was 78% for plasmid pXAC64 with a size of 0.06 Mb, and relatively lower with only 62% sequence coverage for plasmid pXAC33 with a size of 0.03 Mb (Additional file 1: Table S2).
Comparison at absolute levels of expression
We further estimated the correlation between all the combinations of biological replicates for the wild-type and the hrpX mutant strains independently. The resulting rs values of these comparisons are represented in the form of heat maps, for the wild-type (Figure 1C) and the hrpX mutant strains (Figure 1D), which provide a global view of these correlations. Overall, on an average the wild-type with rs = 0.76 (p-value < 0.0001) and the hrpX mutant with rs = 0.78 (p-value < 0.0001) were observed for the biological replicates from all the correlation combinations. This level of comparison strongly suggested that not only the absolute level of gene expressions determined by RNA-seq and microarray highly correlated, but were also highly reproducible, in spite of the technical as well as the biological variability associated with the quantifications.
Comparison at relative levels of expression
The correlation coefficient provides an overall estimate of correlation between the expression levels determined by RNA-seq and microarray methods. However, this does not zoom into the data in a detailed manner. For instance, no information is provided about how much of FC magnitude that actually differs between the two methods for a given gene. In order to get an insight into this aspect, we computed the fraction of genes deviating in their FC magnitude values by dividing the FC magnitude value determined by RNA-seq with that of microarray (Figure 2C). Here, the fold difference of one represents the fraction of genes that are determined to have a FC magnitude of ± 0.5 (bin width) by both RNA-seq and microarray methods. When we plotted this frequency as a histogram for the whole 4312 consensus genes, more than 75% of genes were found to have FC magnitude values ± 0.5 by RNA-seq and microarray methods. Since it is a relative expression comparison, genes whose expression values did not change much in the wild-type and the hrpX mutant strains, tend to have FC values = 1. Subsequently, it is more sensible to consider only differentially expressed set of genes for further comparisons.
We therefore applied FDR ≤ 0.05 (5%) in conjunction with FC (absolute log2FC ≥ 0.6) to filter the whole data set. In total, 87 (2%) genes from RNA-seq and 64 (1.5%) from microarray qualified at this cut-off threshold from the 4312 consensus genes (Additional file 1: Table S4). Together, 106 genes satisfied our selection criterion from both the methods (Additional file 1: Table S4). Among them 84 (79.2%) genes were up-regulated, while 22 (20.8%) genes were found to be down-regulated. Further, 45 (~42.45%) genes were common between both the methods, whereas, 42 (39.63%) and 19 (~17.92%) genes were uniquely detected by RNA-seq and microarray respectively (Additional file 1: Table S4; Additional file 2: Figure FS2). We further compared the FC values of the 45 consensus genes both qualitatively and quantitatively. These genes qualitatively agreed 100% by having the same trend of log2 transformed FC values by both RNA-seq and microarray (Figure 2D). Likewise the quantitative comparison was performed by estimating the correlation between the magnitude of log2FC determined by RNA-seq and microarray for the 45 consensus genes as shown in Figure 2E. The magnitude of FC values between the two methods were found to be well correlated (rs = 0.76, p-value < 0.0001), indicating that the same trend of variation was observed in FC values between the two methods without any dispersion. Thereby, the magnitude of FC values determined by RNA-seq and microarray agreed to a large extent for the 45 consensus genes. In order to further pinpoint the deviation in the FC magnitude quantified by the two methods, we plotted the differences in the FC values determined by RNA-seq with respect to microarray, and the percentage of genes with that difference for the 45 consensus genes (Figure 2F). Majority of the genes (~98%) were found to have a magnitude of FC within the range of ≤ 1.5, while for the remaining 2% of the genes, it was 4.7-times higher in RNA-seq than the microarray based quantification. Based on these comparisons, we concluded that the relative gene expression levels quantified by RNA-seq and microarray were consistent to a large extent for the statistically differentially expressed set of consensus genes.
Comparison with qRT-PCR
We further plotted the percentage of genes that deviated in the magnitude of FC quantified by RNA-seq and microarray with respect to qRT-PCR (Figures 3C and 3D). For most of the genes, the magnitude of FC quantified by RNA-seq and microarray were relatively higher, when compared to qRT-PCR (fold difference >1). Overall, the magnitude of FC quantified by RNA-seq was in consistence with qRT-PCR based quantification (Figure 3C). For microarray, the magnitude of FC was observed to be consistent with qRT-PCR for a majority of genes, however, we also noticed outlier genes with a 9-times higher FC magnitude (Figure 3D).
Comparison in terms of detection of genes encoding T3SS and effectors
Extensive and detailed studies have been carried out since past three decades in cataloguing the target genes of HrpX in the genus Xanthomonas using various genetic and biochemical methods [32, 38, 39, 50–55]. HrpX is known to regulate hrp gene cluster that encodes the type III secretion system (T3SS) and effectors [31, 56]. T3SS are specialized macromolecular machinery that act as a nano-injector to translocate the effector proteins into the cytoplasm of host plant cells . These translocated effectors manipulate the host cellular processes by altering signal transduction, transcriptional activities like suppression of basal plant defense responses, and protein turnover in host cells for the benefit of the pathogen . The T3SS machineries are evolutionarily conserved across many Gram-negative animal- and plant-pathogenic bacteria .
Summary of Type III secretion system (T3SS) hrp cluster genes detected by RNA-seq and microarray
Summary of Type III effector genes detected by RNA-seq and microarray
Overall, considering T3SS and effector genes, in total there are 47 genes, from which, 26 genes (55%) were detected by both RNA-seq and microarray (Tables 1 and 2). RNA-seq uniquely detected 1 gene (2%), whereas, microarray detected 9 genes (19%). Remaining 11 genes (23%) were not detected by either one of the methods by failing to pass the cut-off threshold (Tables 1 and 2). Further, considering only the genes that are detected by at least one method, 72% of the known were detected by both methods, while remaining 28% were detected by either one of the methods.
Genes uniquely detected by RNA-seq and microarray
Among the 87 statistically significant differentially expressed genes from RNA-seq, 42 (39.63%) genes were found to be uniquely detected by this method (Additional file 2: Figure FS2). Of these 42 genes, 17 were found to be down-regulated, while 25 were up-regulated (Additional file 1: Table S4). Nearly 98% of these genes (41 of 42 unique) could not pass the FC cut-off threshold by microarray. The only exception is the gene fliO (XAC1945) that encodes a flagellar protein for flagellum apparatus, which passed the FC cut-off, but failed with FDR threshold. The gene XAC0755 encoding KdpF, a component of an integral membrane potassium-transporting system , is down-regulated by a factor of 3 (log2 FC of 1.6) according to RNA-seq, but, microarray could not capture this, as the probes for this gene were missing on the chip. This shows the limitation of microarray, where probes for all the genes need to be defined while designing the chip. Furthermore, four genes uniquely found by RNA-seq are involved in signal transduction and gene regulation, i.e. XAC4116 encoding a serine/threonine kinase, XAC1819 encoding a tryptophan-rich sensory protein, and two regulatory genes XAC3026, and XAC3363, whose function in citrus canker disease development remain to be explored. Furthermore, 21 genes (24%) are currently annotated as hypothetical proteins (Additional file 1: Table S6). Among them, four hypothetical proteins XAC0854, XAC4131, XAC1203, and XACb0064 were predicted to be T3SS secreted while 7 hypothetical proteins, XAC3275, XAC3680, XAC1943, XAC0527, XAC0599, XAC0239, and XAC0755 were predicted to be Type 2 Secretion System (T2SS) substrates (Additional file 1: Table S6) by Effective database . Gram-negative bacteria employ T2SS to transport proteins to the extracellular milieu, where the T2SS exo-proteins containing N-terminal signal peptides are used for inner-membrane translocation through either the Sec translocon or the Tat complex . Genes encoding proteins secreted by T3SS and T2SS have been experimentally proved to be regulated by HrpX [33, 62, 63].
Among the 64 statistically significant differentially expressed genes from microarray, 19 (29.7%) genes were found to be uniquely detected by this method (Additional file 2: Figure FS2). 18 were found to be down-regulated, while one gene was up-regulated (Additional file 1: Table S4). Unlike that of RNA-seq, nearly 63% genes (12 of 19 unique) could pass the FC cut-off threshold, but failed to pass the FDR threshold by RNA-seq. The remaining 37% genes (7 of 19 unique) could not pass both FC and FDR cut-off threshold. Furthermore, six genes were found to be hypothetical. Among them XAC2876, XAC1241, and XAC2370 were predicted as T2SS substrates. XAC1241 predicted as a T2SS substrate, shared 73% identity with a putative secreted protein from X. campestris pv. vesicatoria strain 85–10. Another T2SS candidate XAC2370 shared 95% identity with a secreted protein from X. fuscans subsp. aurantifolii str. ICPB 10535. XAC1124 shared 100% identity with MEKHLA domain protein from X. axonopodis pv. punicae str. LMG 859 . This domain is found in bacteria associated with plants. It further shares similarity with the PAS domain and might be involved in light, oxygen, and redox potential sensation .
Comparison at the level of functional annotations of genes
For comparison based on the biological function for the differentially expressed genes from RNA-seq and microarray, we utilized the ClueGO to integrate the Gene Ontology (GO)  terms and KEGG  pathway terms and create a functionally organised GO/KEGG network. Functional annotation with biological processes category resulted in 13 (14.94%) genes found from cluster for RNA-seq, while for microarray it was 12 (19.35%).
The ClueGO overview pie chart highlighted that significant proportion of the genes differentially regulated are involved in “protein secretion by the T3SS” by both RNA-seq and microarray (Additional file 3: Figure FS3A & D). Additionally, RNA-seq also identified genes involved in “secretion activity by cell” as well as “single organism catabolic process” (Additional file 3: Figure FS3A). On the other hand, microarray highlighted the genes involved in “protein transmembrane transport”, “polycyclic aromatic hydrocarbon degradation” and “establishment of localization in cell” (Additional file 3: Figure FS3D). Majority of the genes are involved in “bacterial secretion system”, as shown by both RNA-seq and microarray. Also the differentially expressed genes are found to be significantly involved in the “transport of monovalent inorganic cation” (Additional file 3: Figure FS3B) and “protein transport” (Additional file 3: FS3E). Genes have also been found uniquely by microarray as significantly involved in “polycyclic aromatic hydrocarbon degradation” (Additional file 3: Figure FS3E). Genes from RNA-seq have been found to be involved in “riboflavin metabolism” as well as “single organism catabolic process” (Additional file 3: Figure FS3B). Further, visualization of the functionally grouped annotation network for the differentially regulated genes derived from RNA-seq (Additional file 3: Figure FS3C) and microarray (Additional file 3: Figure FS3F) methods highlighted the relationships between the terms. RNA-seq highlighted “protein secretion by the T3SS” along with the “small molecule catabolic process”, while microarray reflected “polycyclic aromatic hydrocarbon degradation” and “establishment of localization in cell”, as the most significant terms of the group. This analysis also showed that RNA-seq and microarray together provide more comprehensive functional information than the individual methods.
PIP box detection
Currently, RNA-seq is becoming the preferable choice for gene expression profiling in place of microarrays. Although, all the parameters that influence the various aspects of this method are yet to be understood completely, RNA-seq undoubtedly is playing a very important role in deciphering the complexity of the transcriptome by giving a new direction to isoforms, allelic expression, untranslated regions, splice junctions, antisense regulation and intragenic expression [10, 16, 29, 68–74]. Several studies have begun to investigate on the parameters like sequencing depth, precision, GC bias, length bias, lane effects, and processing artifacts [16, 29, 48, 75–77]. On the other hand, microarrays are in usage for more than two decades. Therefore, most of the biases inherent to this method have become more apparent . For instance, biases in the hybridization of the samples labeled with Cyanine5 (Cy5) and Cyanine3 (Cy3) are sufficiently explored, and currently several approaches are practiced to minimize such effects [79–82]. Further, systematic variability like influence of the image scanner settings on the dye intensity measurements have now been robustly handled by applying various normalization techniques [83–86]. Despite these developments, some inherent genes–specific biases like differential hybridization efficiencies of the labeled target transcript to the same probe are still found to be inevitable in microarrays. In RNA-seq as well as microarray, all these known and unknown parameters influence the final outcome. Therefore, in this study, we focused on the assessment of RNA-seq and microarray based on the final outcome .i.e. statistically significant differentially expressed genes.
In comparison with previous RNA-seq studies, with a sequence coverage of 97% we observed for our data set, is in consistence with the reported 89.5% to 95% coverage observed in other bacterial RNA-seq studies [87–89]. In our study, RNA-seq has identified more significantly differentially expressed genes (82%), when compared to microarray (63%) as in previous studies [18, 29, 30]. The overall correlation (rs 0.76) in the magnitudes of FC for the consensus genes between the two methods was found to be similar or higher than previous studies [18, 29, 30, 72]. Furthermore, our comparison analysis with qRT-PCR suggested that the expression levels were highly reliable for those genes that were determined to be differentially expressed by both RNA-seq and microarray. Hence, confirming the differential expression of genes by multiple methods reduces false positives thereby enhances the biological discovery.
Even though microarray overall outperformed RNA-seq by detecting more known HrpX target genes from the T3SS in hrp cluster by satisfying both FC and FDR cut-off threshold, in principle RNA-seq also detected genes hrpB5, hrcS, hpaP, XAC0395, hrpB7, and hrcT, in terms of FC, but failed to pass FDR threshold. This parameter is more directly influenced by error model considered in the statistical method that is used to infer the differential expression rather than RNA-seq itself. For the same read counts, one can get slightly different FDR values depending on the statistical method . But the implementation of all the statistical methods is not feasible for every dataset. From the T3SS in hrp cluster, three genes namely, hrcC, hpa2, and hpaA were not found to be detected by both RNA-seq and microarray, mainly because they fail to pass FDR threshold. Interestingly, our previous microarray analysis confirmed that all these three genes are regulated by HrpX, but only at a later stage of the growth phase by satisfying both FC and FDR cut-off thresholds . This consolidates the regulation of some of the genes at later stages of the growth phase. Further, in case of Type III effector genes, 8 genes (36.4%) were not detected by both RNA-seq and microarray within considered cut-off threshold limit. However, among them xopL, avrBs2, xopAK and xopZ were found to be regulated by HrpX only at the later stage of the growth phase (OD600 time point 0.5), according to our previous microarray analysis . Further, four genes namely, pthA2, pthA1, pthA3, pthA4 were regulated by another transcription regulator HrpG at early stage of growth phase (OD600 = 0.25 and 0.4) as observed in our previous study, while another undetected gene xopE was found to be also regulated by HrpG, but only at OD600 = 0.25 time point of growth phase . Thereby this study further validated our previous results. Subsequently, both methods detected 100% of the genes known to be regulated by HrpX (at time point OD600 = 0.4) without any false positives. Among them, 72% were detected by both the methods while interestingly 28% of the known target genes were detected by either one of the methods. Hence, both the methods together could complement each other.
In addition 55 genes (~51%) were newly identified as differentially expressed by applying both microarray as well as RNA-seq methods, thereby adding up to the already existing repertoire of HrpX regulated genes. Furthermore, 46 (83.6%) genes among them were uniquely identified by either one of the methods. Overall, 21 newly identified genes were found to have PIP box in their promoter regions, wherein 14 (58.3%) genes were uniquely identified by either RNA-seq or microarray. The presence of the PIP box in the promoter regions of the HrpX-regulated genes uniquely identified by RNA-seq and microarray further not only confirmed that these genes are directly regulated by HrpX, but also that these candidates are not false positives. Consequently, 100% of the known HrpX regulated genes could only be detected together by both the methods, since each method missed out on some of the known genes; hence both the methods together enhance the understanding of HrpX regulome by providing a more comprehensive picture.
This study has significantly advanced our understanding of the regulome of the critical transcriptional factor HrpX and demonstrates that RNA-seq and microarray complement each other in transcriptome profiling. Consequently, our study demonstrates the advantage of applying multiple transcriptome profiling methods to reveal a more comprehensive picture of a transcriptome, rather than relying solely on one method.
Bacterial strains and growth conditions
The wild-type X. citri subsp. citri, and the hrpX mutant strains used in this study were described in our previous study . Both the strains were grown at 28°C in nutrient broth (NB), on nutrient agar (NA), or in NYG medium . Antibiotics rifamycin and kanamycin were added to the media at 50 μg/ml final concentrations.
Total RNA was extracted from the wild-type and the hrpX mutant strains as described in our previous study . Briefly, strains from NA plates were grown in NB medium at 28°C until mid-exponential phase. Cultures were harvested by centrifugation and inoculated in to nutrient-deficient XVM2 medium, after washing the pellet once with the same medium. Cultures were finally harvested for RNA extraction, when the optical density at 600 nm reached the value of 0.4, and mixed immediately with RNAprotect bacterial reagent (Qiagen, Valencia, CA, and U.S.A.). Total RNA was extracted from each replicate separately using RiboPure bacteria kit (Ambion, Austin, TX, USA), according to manufacturer’s instructions. Genomic DNA contamination from the extracted RNA samples was removed using TURBO DNA-free kit (Ambion). Amount and the quality of the RNA samples was initially determined using NanoDrop™ 1000 spectrophotometer (NanoDrop Technologies, Inc., Wilmington, DE). Samples with absorbency at 260/280 and 260/230 nm ratios > 2 were subjected to further processing. Three biological replicates of the wild-type and the hrpX mutant samples were used for RNA-seq analysis.
The microarray data used in this study was generated during our previous study . Three unique 60-mer oligonucleotide probes were designed for each of the 4,427 protein coding genes of X. citri subsp. citri. 8-by-15-K DNA microarray chips covering the whole genome were implemented under the Agilent platform. These microarrays were processed at the Interdisciplinary Center for Biotechnology Research Microarray Core Facility, University of Florida. The raw data is available at National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO) data repository under the accession number GSE24016 .
mRNA enrichment and RNA-seq
Total RNA samples were enriched for mRNA, by depleting rRNA using MICROBExpress kit from Ambion following the manufacturer’s instructions. Enriched samples were checked for integrity using Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). RNA samples that passed the quality control were sequenced using the Illumina Genome Analyzer IIx (GAIIx) system by following the standard protocol at the Center for Genome Analysis at Yale University. Real-time analysis and base calling were performed using the CASAVA v1.6 pipeline. The raw sequence data has been submitted to the NCBI Sequence Read Archive and assigned with an accession number SRA052842.
Reads mapping and statistical analysis
The X. citri subsp. citri whole genome sequence consisting of one chromosome [GenBank: NC_003919.1], and two plasmids [GenBank: NC_003921.3 and NC_003922.1], along with the annotation information were downloaded from NCBI repository (ftp://ftp.ncbi.nih.gov/genomes/Bacteria/). Quality-filtered reads were aligned on to the genome using CLC Genomics Workbench v4.7.2 (CLC bio, Aarhus, Denmark). Reads uniquely aligned to each gene were tabulated from each replicate separately. Differentially expressed genes were estimated using DESeq package , available under the open-source Bioconductor suite of programs . DESeq is a powerful tool to estimate the variance in RNA-seq data and test for differential expression . As an input, DESeq accepts a table of read counts for each gene from different biological replicates, and estimates the differentially expressed genes using negative binomial distribution . Statistically significant differentially expressed genes from both microarray and RNA-seq data were obtained by applying a cut-off threshold of FDR ≤ 0.05 (5%) and an absolute log2 fold-change ≥ 0.6.
Similarity searches were performed online using position-specific iterative BLAST (PSI-BLAST) at NCBI site against non-redundant protein database . T3SS and T2SS predictions were performed using Effective database . The promoter regions of the significantly differentially expressed genes were retrieved manually using NCBI genome browser to look for the presence of PIP boxes. The differentially expressed genes were assigned to the transcriptional units by referring to the MetaCyc database . Biological interpretation of the differentially expressed genes was carried out using the ClueGO v1.5 , a Cytoscape plug-in .
All the qRT-PCR assays were performed as detailed elsewhere . Briefly, gene-specific primers were designed for the selected genes using PrimerQuestSM from Integrated DNA technologies (IDT), Coralville, Iowa (Additional file 1: Table S6). qRT-PCR experiments were performed in triplicates, at least three times for each gene using 7500 fast real-time PCR system (Applied Biosystems, Foster City, CA, USA), using a QuantiTect SYBR green RT-PCR kit (Qiagen) with similar results, by following the manufacturer’s instructions. The relative fold change of target gene expression was calculated using 16S rRNA as an endogenous control with the formula 2–∆∆CT.
The raw RNA-seq data from this study is deposited at the NCBI sequence read archive (http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi), under the accession number SRA052842, while the raw microarray data is available at the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) with the accession number GSE24016.
We wish to acknowledge Gabriela Bindea for providing annotation files for ClueGO software. This work was supported by United States Department of Agriculture - NIFA Special Citrus Canker Grant Project 94677.
- Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009, 10: 57-63. 10.1038/nrg2484.PubMed CentralView ArticlePubMedGoogle Scholar
- Baginsky S, Hennig L, Zimmermann P, Gruissem W: Gene expression analysis, proteomics, and network discovery. Plant Physiol. 2010, 152: 402-410. 10.1104/pp.109.150433.PubMed CentralView ArticlePubMedGoogle Scholar
- Cloonan N, Forrest AR, Kolle G, Gardiner BB, Faulkner GJ, Brown MK, Taylor DF, Steptoe AL, Wani S, Bethel G, et al: Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat Methods. 2008, 5: 613-619. 10.1038/nmeth.1223.View ArticlePubMedGoogle Scholar
- Costa V, Angelini C, De Feis I, Ciccodicola A: Uncovering the complexity of transcriptomes with RNA-Seq. J Biomed Biotechnol. 2010, 2010: 853916-PubMed CentralView ArticlePubMedGoogle Scholar
- DeRisi J, Penland L, Brown PO, Bittner ML, Meltzer PS, Ray M, Chen Y, Su YA, Trent JM: Use of a cDNA microarray to analyse gene expression patterns in human cancer. Nat Genet. 1996, 14: 457-460.View ArticlePubMedGoogle Scholar
- Ekins R, Chu FW: Microarrays: their origins and applications. Trends Biotechnol. 1999, 17: 217-218. 10.1016/S0167-7799(99)01329-3.View ArticlePubMedGoogle Scholar
- Fodor SP, Rava RP, Huang XC, Pease AC, Holmes CP, Adams CL: Multiplexed biochemical assays with biological chips. Nature. 1993, 364: 555-556. 10.1038/364555a0.View ArticlePubMedGoogle Scholar
- Hegde P, Qi R, Abernathy K, Gay C, Dharap S, Gaspard R, Hughes JE, Snesrud E, Lee N, Quackenbush J: A concise guide to cDNA microarray analysis. Biotechniques. 2000, 29: 548-4. 556.PubMedGoogle Scholar
- Marguerat S, Bahler J: RNA-seq: from technology to biology. Cell Mol Life Sci. 2010, 67: 569-579. 10.1007/s00018-009-0180-6.PubMed CentralView ArticlePubMedGoogle Scholar
- Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M: The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008, 320: 1344-1349. 10.1126/science.1158441.PubMed CentralView ArticlePubMedGoogle Scholar
- Nagalakshmi U, Waern K, Snyder M: RNA-Seq: a method for comprehensive transcriptome analysis. Curr Protoc Mol Biol. 2010, Chapter 4: Unit 4.11.1-13.Google Scholar
- Ramsay G: DNA chips: state-of-the art. Nat Biotechnol. 1998, 16: 40-44. 10.1038/nbt0198-40.View ArticlePubMedGoogle Scholar
- Schena M, Shalon D, Davis RW, Brown PO: Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science. 1995, 270: 467-470. 10.1126/science.270.5235.467.View ArticlePubMedGoogle Scholar
- Tan PK, Downey TJ, Spitznagel EL, Xu P, Fu D, Dimitrov DS, Lempicki RA, Raaka BM, Cam MC: Evaluation of gene expression measurements from commercial microarray platforms. Nucleic Acids Res. 2003, 31: 5676-5684. 10.1093/nar/gkg763.PubMed CentralView ArticlePubMedGoogle Scholar
- Toung JM, Morley M, Li M, Cheung VG: RNA-sequence analysis of human B-cells. Genome Res. 2011, 21: 991-998. 10.1101/gr.116335.110.PubMed CentralView ArticlePubMedGoogle Scholar
- Wilhelm BT, Marguerat S, Watt S, Schubert F, Wood V, Goodhead I, Penkett CJ, Rogers J, Bahler J: Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution. Nature. 2008, 453: 1239-1243. 10.1038/nature07002.View ArticlePubMedGoogle Scholar
- Xiang CC, Chen Y: cDNA microarray technology and its applications. Biotechnol Adv. 2000, 18: 35-46. 10.1016/S0734-9750(99)00035-X.View ArticlePubMedGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5: 621-628. 10.1038/nmeth.1226.View ArticlePubMedGoogle Scholar
- Pariset L, Chillemi G, Bongiorni S, Romano SV, Valentini A: Microarrays and high-throughput transcriptomic analysis in species with incomplete availability of genomic sequences. N Biotechnol. 2009, 25: 272-279. 10.1016/j.nbt.2009.03.013.View ArticlePubMedGoogle Scholar
- Schena M, Heller RA, Theriault TP, Konrad K, Lachenmeier E, Davis RW: Microarrays: biotechnology’s discovery platform for functional genomics. Trends Biotechnol. 1998, 16: 301-306. 10.1016/S0167-7799(98)01219-0.View ArticlePubMedGoogle Scholar
- Fu X, Fu N, Guo S, Yan Z, Xu Y, Hu H, Menzel C, Chen W, Li Y, Zeng R, et al: Estimating accuracy of RNA-Seq and microarrays with proteomics. BMC Genomics. 2009, 10: 161-10.1186/1471-2164-10-161.PubMed CentralView ArticlePubMedGoogle Scholar
- Shendure J: The beginning of the end for microarrays?. Nat Methods. 2008, 5: 585-587. 10.1038/nmeth0708-585.View ArticlePubMedGoogle Scholar
- van Vliet AH: Next generation sequencing of microbial transcriptomes: challenges and opportunities. FEMS Microbiol Lett. 2010, 302: 1-7. 10.1111/j.1574-6968.2009.01767.x.View ArticlePubMedGoogle Scholar
- Raz T, Kapranov P, Lipson D, Letovsky S, Milos PM, Thompson JF: Protocol dependence of sequencing-based gene expression measurements. PLoS One. 2011, 6: e19287-10.1371/journal.pone.0019287.PubMed CentralView ArticlePubMedGoogle Scholar
- Martin JA, Wang Z: Next-generation transcriptome assembly. Nat Rev Genet. 2011, 12: 671-682. 10.1038/nrg3068.View ArticlePubMedGoogle Scholar
- Zhou X, Ren L, Meng Q, Li Y, Yu Y, Yu J: The next-generation sequencing technology and application. Protein Cell. 2010, 1: 520-536. 10.1007/s13238-010-0065-3.View ArticlePubMedGoogle Scholar
- ‘t Hoen PA, Ariyurek Y, Thygesen HH, Vreugdenhil E, Vossen RH, de Menezes RX, Boer JM, van Ommen GJ, den Dunnen JT: Deep sequencing-based expression analysis shows major advances in robustness, resolution and inter-lab portability over five microarray platforms. Nucleic Acids Res. 2008, 36: e141-e141.PubMed CentralView ArticlePubMedGoogle Scholar
- Leimena MM, Wels M, Bongers RS, Smid EJ, Zoetendal EG, Kleerebezem M: Comparative Analysis of Lactobacillus plantarum WCFS1 Transcriptomes by Using DNA Microarray and Next-Generation Sequencing Technologies. Appl Environ Microbiol. 2012, 78: 4141-4148. 10.1128/AEM.00470-12.PubMed CentralView ArticlePubMedGoogle Scholar
- Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y: RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008, 18: 1509-1517. 10.1101/gr.079558.108.PubMed CentralView ArticlePubMedGoogle Scholar
- Su Z, Li Z, Chen T, Li QZ, Fang H, Ding D, Ge W, Ning B, Hong H, Perkins RG, et al: Comparing next-generation sequencing and microarray technologies in a toxicological study of the effects of aristolochic acid on rat kidneys. Chem Res Toxicol. 2011, 24: 1486-1493. 10.1021/tx200103b.View ArticlePubMedGoogle Scholar
- Wengelnik K, Bonas U: HrpXv, an AraC-type regulator, activates expression of five of the six loci in the hrp cluster of Xanthomonas campestris pv. vesicatoria. J Bacteriol. 1996, 178: 3462-3469.PubMed CentralPubMedGoogle Scholar
- da Silva AC, Ferro JA, Reinach FC, Farah CS, Furlan LR, Quaggio RB, Monteiro-Vitorello CB, Van Sluys MA, Almeida NF, Alves LM, et al: Comparison of the genomes of two Xanthomonas pathogens with differing host specificities. Nature. 2002, 417: 459-463. 10.1038/417459a.View ArticlePubMedGoogle Scholar
- Guo Y, Figueiredo F, Jones J, Wang N: HrpG and HrpX play global roles in coordinating different virulence traits of Xanthomonas axonopodis pv. citri. Mol Plant Microbe Interact. 2011, 24: 649-661. 10.1094/MPMI-09-10-0209.View ArticlePubMedGoogle Scholar
- Civerolo E: Bacterial canker disease of citrus. J Rio Grande Vall Hortic Soc. 1984, 37: 127-145.Google Scholar
- Astua-Monge G, Freitas-Astua J, Bacocina G, Roncoletta J, Carvalho SA, Mchado MA: Expression profiling of virulence and pathogenicity genes of Xanthomonas axonopodis pv. citri. J Bacteriol. 2005, 187: 1201-1205. 10.1128/JB.187.3.1201-1205.2005.PubMed CentralView ArticlePubMedGoogle Scholar
- Fenselau S, Bonas U: Sequence and expression analysis of the hrpB pathogenicity operon of Xanthomonas campestris pv. vesicatoria which encodes eight proteins with similarity to components of the Hrp, Ysc, Spa, and Fli secretion systems. Mol Plant Microbe Interact. 1995, 8: 845-854. 10.1094/MPMI-8-0845.View ArticlePubMedGoogle Scholar
- Koebnik R, Kruger A, Thieme F, Urban A, Bonas U: Specific binding of the Xanthomonas campestris pv. vesicatoria AraC-type transcriptional activator HrpX to plant-inducible promoter boxes. J Bacteriol. 2006, 188: 7652-7660. 10.1128/JB.00795-06.PubMed CentralView ArticlePubMedGoogle Scholar
- Noel L, Thieme F, Nennstiel D, Bonas U: Two novel type III-secreted proteins of Xanthomonas campestris pv. vesicatoria are encoded within the hrp pathogenicity island. J Bacteriol. 2002, 184: 1340-1348. 10.1128/JB.184.5.1340-1348.2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Alfano JR, Collmer A: The type III (Hrp) secretion pathway of plant pathogenic bacteria: trafficking harpins, Avr proteins, and death. J Bacteriol. 1997, 179: 5655-5662.PubMed CentralPubMedGoogle Scholar
- Bonas U: hrp genes of phytopathogenic bacteria. Curr Top Microbiol Immunol. 1994, 192: 79-98. 10.1007/978-3-642-78624-2_4.PubMedGoogle Scholar
- Buttner D, Bonas U: Regulation and secretion of Xanthomonas virulence factors. FEMS Microbiol Rev. 2010, 34: 107-133. 10.1111/j.1574-6976.2009.00192.x.View ArticlePubMedGoogle Scholar
- Iwamoto M, Oku T: Cloning and molecular characterization of hrpX from Xanthomonas axonopodis pv. citri. DNA Seq. 2000, 11: 167-173.PubMedGoogle Scholar
- Gurlebeck D, Thieme F, Bonas U: Type III effector proteins from the plant pathogen Xanthomonas and their role in the interaction with the host plant. J Plant Physiol. 2006, 163: 233-255. 10.1016/j.jplph.2005.11.011.View ArticlePubMedGoogle Scholar
- Oku T, Alvarez AM, Kado CI: Conservation of the hypersensitivity-pathogenicity regulatory gene hrpX of Xanthomonas campestris and X. oryzae. DNA Seq. 1995, 5: 245-249.PubMedGoogle Scholar
- Lahaye T, Bonas U: Molecular secrets of bacterial type III effector proteins. Trends Plant Sci. 2001, 6: 479-485. 10.1016/S1360-1385(01)02083-0.View ArticlePubMedGoogle Scholar
- Kelley DR, Schatz MC, Salzberg SL: Quake: quality-aware detection and correction of sequencing errors. Genome Biol. 2010, 11: R116-10.1186/gb-2010-11-11-r116.PubMed CentralView ArticlePubMedGoogle Scholar
- Klebanov L, Yakovlev A: How high is the level of technical noise in microarray data?. Biol Direct. 2007, 2: 9-10.1186/1745-6150-2-9.PubMed CentralView ArticlePubMedGoogle Scholar
- Oshlack A, Wakefield MJ: Transcript length bias in RNA-seq data confounds systems biology. Biol Direct. 2009, 4: 14-10.1186/1745-6150-4-14.PubMed CentralView ArticlePubMedGoogle Scholar
- Sahl JW, Rasko DA: Analysis of global transcriptional profiles of enterotoxigenic Escherichia coli isolate E24377A. Infect Immun. 2012, 80: 1232-1242. 10.1128/IAI.06138-11.PubMed CentralView ArticlePubMedGoogle Scholar
- Alfano JR, Collmer A: Type III secretion system effector proteins: double agents in bacterial disease and plant defense. Annu Rev Phytopathol. 2004, 42: 385-414. 10.1146/annurev.phyto.42.040103.110731.View ArticlePubMedGoogle Scholar
- Collmer A, Bauer DW: Erwinia chrysanthemi and Pseudomonas syringae: plant pathogens trafficking in extracellular virulence proteins. Curr Top Microbiol Immunol. 1994, 192: 43-78. 10.1007/978-3-642-78624-2_3.PubMedGoogle Scholar
- Cornelis GR: The type III secretion injectisome. Nat Rev Microbiol. 2006, 4: 811-825. 10.1038/nrmicro1526.View ArticlePubMedGoogle Scholar
- Szczesny R, Jordan M, Schramm C, Schulz S, Cogez V, Bonas U, Buttner D: Functional characterization of the Xcs and Xps type II secretion systems from the plant pathogenic bacterium Xanthomonas campestris pv vesicatoria. New Phytol. 2010, 187: 983-1002. 10.1111/j.1469-8137.2010.03312.x.View ArticlePubMedGoogle Scholar
- Van GF, Genin S, Boucher C: Conservation of secretion pathways for pathogenicity determinants of plant and animal bacteria. Trends Microbiol. 1993, 1: 175-180. 10.1016/0966-842X(93)90087-8.View ArticleGoogle Scholar
- White FF, Potnis N, Jones JB, Koebnik R: The type III effectors of Xanthomonas. Mol Plant Pathol. 2009, 10: 749-766. 10.1111/j.1364-3703.2009.00590.x.View ArticlePubMedGoogle Scholar
- Lindgren PB: The role of hrp genes during plant-bacterial interactions. Annu Rev Phytopathol. 1997, 35: 129-152. 10.1146/annurev.phyto.35.1.129.View ArticlePubMedGoogle Scholar
- Lipscomb L, Schell MA: Elucidation of the regulon and cis-acting regulatory element of HrpB, the AraC-type regulator of a plant pathogen-like type III secretion system in Burkholderia pseudomallei. J Bacteriol. 2011, 193: 1991-2001. 10.1128/JB.01379-10.PubMed CentralView ArticlePubMedGoogle Scholar
- Kim JG, Park BK, Yoo CH, Jeon E, Oh J, Hwang I: Characterization of the Xanthomonas axonopodis pv. glycines Hrp pathogenicity island. J Bacteriol. 2003, 185: 3155-3166. 10.1128/JB.185.10.3155-3166.2003.PubMed CentralView ArticlePubMedGoogle Scholar
- Hu GB, Rice WJ, Drose S, Altendorf K, Stokes DL: Three-dimensional structure of the KdpFABC complex of Escherichia coli by electron tomography of two-dimensional crystals. J Struct Biol. 2008, 161: 411-418. 10.1016/j.jsb.2007.09.006.PubMed CentralView ArticlePubMedGoogle Scholar
- Jehl MA, Arnold R, Rattei T: Effective–a database of predicted secreted bacterial proteins. Nucleic Acids Res. 2011, 39: D591-D595. 10.1093/nar/gkq1154.PubMed CentralView ArticlePubMedGoogle Scholar
- Korotkov KV, Sandkvist M, Hol WG: The type II secretion system: biogenesis, molecular architecture and mechanism. Nat Rev Microbiol. 2012, 10: 336-351.PubMed CentralPubMedGoogle Scholar
- Furutani A, Takaoka M, Sanada H, Noguchi Y, Oku T, Tsuno K, Ochiai H, Tsuge S: Identification of novel type III secretion effectors in Xanthomonas oryzae pv. oryzae. Mol Plant Microbe Interact. 2009, 22: 96-106. 10.1094/MPMI-22-1-0096.View ArticlePubMedGoogle Scholar
- Wang L, Rong W, He C: Two Xanthomonas extracellular polygalacturonases, PghAxc and PghBxc, are regulated by type III secretion regulators HrpX and HrpG and are required for virulence. Mol Plant Microbe Interact. 2008, 21: 555-563. 10.1094/MPMI-21-5-0555.View ArticlePubMedGoogle Scholar
- Mukherjee K, Burglin TR: MEKHLA, a novel domain with similarity to PAS domains, is fused to plant homeodomain-leucine zipper III proteins. Plant Physiol. 2006, 140: 1142-1150. 10.1104/pp.105.073833.PubMed CentralView ArticlePubMedGoogle Scholar
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, et al: Gene ontology: tool for the unification of biology The Gene Ontology Consortium. Nat Genet. 2000, 25: 25-29. 10.1038/75556.PubMed CentralView ArticlePubMedGoogle Scholar
- Kanehisa M, Goto S, Kawashima S, Nakaya A: The KEGG databases at GenomeNet. Nucleic Acids Res. 2002, 30: 42-46. 10.1093/nar/30.1.42.PubMed CentralView ArticlePubMedGoogle Scholar
- Caspi R, Altman T, Dreher K, Fulcher CA, Subhraveti P, Keseler IM, Kothari A, Krummenacker M, et al: The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 2012, 40: D742-D753. 10.1093/nar/gkr1014.PubMed CentralView ArticlePubMedGoogle Scholar
- Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, et al: The transcriptional landscape of the mammalian genome. Science. 2005, 309: 1559-1563.View ArticlePubMedGoogle Scholar
- Graveley BR, Brooks AN, Carlson JW, Duff MO, Landolin JM, Yang L, Artieri CG, van Baren MJ, Boley N, Booth BW, et al: The developmental transcriptome of Drosophila melanogaster. Nature. 2011, 471: 473-479. 10.1038/nature09715.PubMed CentralView ArticlePubMedGoogle Scholar
- Mane SP, Evans C, Cooper KL, Crasta OR, Folkerts O, Hutchison SK, Harkins TT, Thierry-Mieg D, Thierry-Mieg J, Jensen RV: Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing. BMC Genomics. 2009, 10: 264-10.1186/1471-2164-10-264.PubMed CentralView ArticlePubMedGoogle Scholar
- Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ: Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet. 2008, 40: 1413-1415. 10.1038/ng.259.View ArticlePubMedGoogle Scholar
- Sultan M, Schulz MH, Richard H, Magen A, Klingenhoff A, Scherf M, Seifert M, Borodina T, Soldatov A, Parkhomchuk D, et al: A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome. Science. 2008, 321: 956-960. 10.1126/science.1160342.View ArticlePubMedGoogle Scholar
- Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010, 28: 511-515. 10.1038/nbt.1621.PubMed CentralView ArticlePubMedGoogle Scholar
- Van BH, Nislow C, Blencowe BJ, Hughes TR: Most “dark matter” transcripts are associated with known genes. PLoS Biol. 2010, 8: e1000371-10.1371/journal.pbio.1000371.View ArticleGoogle Scholar
- Bullard JH, Purdom E, Hansen KD, Dudoit S: Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinforma. 2010, 11: 94-10.1186/1471-2105-11-94.View ArticleGoogle Scholar
- Labaj PP, Leparc GG, Linggi BE, Markillie LM, Wiley HS, Kreil DP: Characterization and improvement of RNA-Seq precision in quantitative transcript expression profiling. Bioinformatics. 2011, 27: i383-i391. 10.1093/bioinformatics/btr247.PubMed CentralView ArticlePubMedGoogle Scholar
- Risso D, Schwartz K, Sherlock G, Dudoit S: GC-content normalization for RNA-Seq data. BMC Bioinforma. 2011, 12: 480-10.1186/1471-2105-12-480.View ArticleGoogle Scholar
- Draghici S, Khatri P, Eklund AC, Szallasi Z: Reliability and reproducibility issues in DNA microarray measurements. Trends Genet. 2006, 22: 101-109. 10.1016/j.tig.2005.12.005.PubMed CentralView ArticlePubMedGoogle Scholar
- Goryachev AB, Macgregor PF, Edwards AM: Unfolding of microarray data. J Comput Biol. 2001, 8: 443-461. 10.1089/106652701752236232.View ArticlePubMedGoogle Scholar
- Kerr MK, Martin M, Churchill GA: Analysis of variance for gene expression microarray data. J Comput Biol. 2000, 7: 819-837. 10.1089/10665270050514954.View ArticlePubMedGoogle Scholar
- Tseng GC, Oh MK, Rohlin L, Liao JC, Wong WH: Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects. Nucleic Acids Res. 2001, 29: 2549-2557. 10.1093/nar/29.12.2549.PubMed CentralView ArticlePubMedGoogle Scholar
- Wang X, Wang D, Chen X, Hu M, Wang J, Li Y, Guo N, Shen B: cDNA cloning and function analysis of two novel erythroid differentiation related genes. Sci China C Life Sci. 2001, 44: 99-105. 10.1007/BF02882078.View ArticlePubMedGoogle Scholar
- Cleveland WS, Devlin SJ, Grosse E: Regression by Local Fitting: Methods, Properties, and Computational Algorithms. J Econ. 1988, 37: 87-114.View ArticleGoogle Scholar
- Engelen K, Coessens B, Marchal K, De MB: MARAN: normalizing micro-array data. Bioinformatics. 2003, 19: 893-894. 10.1093/bioinformatics/btg085.View ArticlePubMedGoogle Scholar
- Ihaka R, Gentleman R: R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics. Journal of Computational and Graphical Statistics. 1996, 5: 299-314.Google Scholar
- Venet D: MatArray: a Matlab toolbox for microarray data. Bioinformatics. 2003, 19: 659-660. 10.1093/bioinformatics/btg046.View ArticlePubMedGoogle Scholar
- Kumar R, Lawrence ML, Watt J, Cooksey AM, Burgess SC, Nanduri B: RNA-seq based transcriptional map of bovine respiratory disease pathogen “Histophilus somni 2336”. PLoS One. 2012, 7: e29435-10.1371/journal.pone.0029435.PubMed CentralView ArticlePubMedGoogle Scholar
- Wurtzel O, Sapra R, Chen F, Zhu Y, Simmons BA, Sorek R: A single-base resolution map of an archaeal transcriptome. Genome Res. 2010, 20: 133-141. 10.1101/gr.100396.109.PubMed CentralView ArticlePubMedGoogle Scholar
- Yoder-Himes DR, Chain PS, Zhu Y, Wurtzel O, Rubin EM, Tiedje JM, Sorek R: Mapping the Burkholderia cenocepacia niche response via high-throughput sequencing. Proc Natl Acad Sci U S A. 2009, 106: 3976-3981. 10.1073/pnas.0813403106.PubMed CentralView ArticlePubMedGoogle Scholar
- Tarazona S, Garcia-Alcalde F, Dopazo J, Ferrer A, Conesa A: Differential expression in RNA-seq: a matter of depth. Genome Res. 2011, 21: 2213-2223. 10.1101/gr.124321.111.PubMed CentralView ArticlePubMedGoogle Scholar
- Daniels MJ, Barber CE, Turner PC, Sawczyc MK, Byrde RJ, Fielding AH: Cloning of genes involved in pathogenicity of Xanthomonas campestris pv. campestris using the broad host range cosmid pLAFR1. EMBO J. 1984, 3: 3323-3328.PubMed CentralPubMedGoogle Scholar
- Anders S, Huber W: Differential expression analysis for sequence count data. Genome Biol. 2010, 11: R106-10.1186/gb-2010-11-10-r106.PubMed CentralView ArticlePubMedGoogle Scholar
- Reimers M, Carey VJ: Bioconductor: an open source framework for bioinformatics and computational biology. Methods Enzymol. 2006, 411: 119-134.View ArticlePubMedGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, Fridman WH, Pagès F, Trajanoski Z, Galon J: ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009, 25: 1091-1093. 10.1093/bioinformatics/btp101.PubMed CentralView ArticlePubMedGoogle Scholar
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303.PubMed CentralView ArticlePubMedGoogle Scholar
- Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) Method. Methods. 2001, 25: 402-408. 10.1006/meth.2001.1262.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.