Skip to main content


  • Research article
  • Open Access

RNA-seq analysis of the Rhizobium tropici CIAT 899 transcriptome shows similarities in the activation patterns of symbiotic genes in the presence of apigenin and salt

  • 1Email authorView ORCID ID profile,
  • 1,
  • 1,
  • 1,
  • 1,
  • 2,
  • 1 and
  • 1
Contributed equally
BMC Genomics201617:198

  • Received: 15 October 2015
  • Accepted: 25 February 2016
  • Published:



Rhizobium tropici strain CIAT 899 establishes effective symbioses with several legume species, including Phaseolus vulgaris and Leucaena leucocephala. This bacterium synthesizes a large variety of nodulation factors in response to nod-gene inducing flavonoids and, surprisingly, also under salt stress conditions. The aim of this study was to identify differentially expressed genes in the presence of both inducer molecules, and analyze the promoter regions located upstream of these genes.


Results obtained by RNA-seq analyses of CIAT 899 induced with apigenin, a nod gene-inducing flavonoid for this strain, or salt allowed the identification of 19 and 790 differentially expressed genes, respectively. Fifteen of these genes were up-regulated in both conditions and were involved in the synthesis of both Nod factors and indole-3-acetic acid. Transcription of these genes was presumably activated through binding of at least one of the five NodD proteins present in this strain to specific nod box promoter sequences when the bacterium was induced by both apigenin and salt. Finally, under saline conditions, many other transcriptional responses were detected, including an increase in the transcription of genes involved in trehalose catabolism, chemotaxis and protein secretion, as well as ribosomal genes, and a decrease in the transcription of genes involved in transmembrane transport.


To our knowledge this is the first time that a transcriptomic study shows that salt stress induces the expression of nodulation genes in the absence of flavonoids. Thus, in the presence of both nodulation inducer molecules, apigenin and salt, R. tropici CIAT 899 up-regulated the same set of symbiotic genes. It could be possible that the increases in the transcription levels of several genes related to nodulation under saline conditions could represent a strategy to establish symbiosis under abiotic stressing conditions.


  • RNA-seq
  • Rhizobium tropici CIAT 899
  • Nodulation
  • Nod factors
  • Lipochitooligosaccharides
  • Apigenin
  • Salt stress


Rhizobia comprise a group of α- and β-proteobacteria known for their ability to establish symbioses with several leguminous species. The rhizobium-legume interaction, characterized by an exchange of signal molecules from both partners, culminates in the formation of specific structures, called nodules, where biological nitrogen fixation takes place [15]. This molecular dialogue begins with the exudation of flavonoids by the host legume roots that are recognized by a LysR-type transcriptional regulator in the bacterium, the NodD protein, which triggers the expression of the so-called nodulation (nod) genes by binding to specific sequences, nod boxes (NB), located upstream of these genes. Their cognate enzymes are implied in the production of lipochitooligosaccharides, also known as Nod factors (NF), which in turns induce the formation of root nodule primordia and play an essential role in the infection process. A part from flavonoids, other rhizobial nod gene inducers have been identified, such as betaines [6], but they are required at higher concentrations [7].

Rhizobium tropici CIAT 899 (hereafter CIAT 899) is a broad host-range rhizobial strain isolated from tropical acid soils of South America that effectively nodulates several legumes, including Phaseolus vulgaris, Macroptilium atropurpureum, and Leucaena leucocephala [8, 9]. Main characteristics of this strain includes its high tolerance to several environmental stresses such as high temperature, acidity or salinity and its capacity to producing a large variety of NF in the presence of inducer flavonoids, such as apigenin [8, 10, 11]. It is remarkable that under acidity or salt stress conditions the synthesis of NF in CIAT 899 is also induced, resulting in increased diversity and concentration of these molecules in comparison to non-stressing conditions [12, 13]. Interestingly, Guasch-Vidal et al. [14] demonstrated that, even in the absence of flavonoids, CIAT 899 is able of synthesizing NF in the presence of high concentrations of salt, and the biological activity of these NF was confirmed. Moreover, the activation under salt stress is independent of NodD1 [14]. It has been reported that initial steps of rhizobium-legume symbioses are very sensitive to salt stress. However, the ability to form root nodules on their host legume species under saline conditions has been described for many rhizobia [15]. In general, rhizobial strains use distinct mechanisms for osmotic adaptation under salt stressing conditions during the free-lifestyle, such as the intracellular accumulation of osmolytes and specific ions, modification in cell surface polysaccharides or the synthesis of certain ABC membrane transporters [15, 16]. However, the synthesis of salt-induced NF has been only reported in CIAT 899.

Genome sequencing of CIAT 899 revealed five different nodD genes and three different nodA genes in the symbiotic plasmid [17]. NodA catalyzes the transfer of the fatty acyl group from an acyl carrier protein to a terminal n-glucosamine residue previously deacetyled by NodB, on the chitin oligomer [18]. The nodA1 gene is located adjacent to nodD1, whose encoded protein seems to be the major regulator of NF synthesis upon induction with flavonoid [19, 20] and together with nodBC compose an operon responsible for the synthesis of the NF core. The nodA2 gene is part of a gene cluster including hsnT and nodFE, implied in unsaturated fatty acid incorporation into NF molecules and is located close to the nodD2 gene. Curiously, a previous study [20] showed that, apparently, the activation of the expression of the nodC gene under salt stress is lower in a nodD2 mutant than in both a nodD1 mutant and the wild-type strain. Finally, nodA3 is located downstream the nodD3 gene but no other symbiotic-related genes have been identified in its vicinities [21].

The main objective of this study was to identify genes of CIAT 899 that are differentially expressed in the presence of the nod-gene inducer molecules apigenin and salt by RNA-seq analysis. In addition, the promoter regions of the symbiotic genes that were up-regulated in both conditions were studied to determine possible conserved promoter consensus motifs. Our results showed similar patterns of expression for the differentially expressed genes of the symbiotic plasmid replicon in the presence of both apigenin and salt, indicating that the NF synthesis was carried out following the same pathway, independently of the inducer molecule. The biological significance of the CIAT 899 transcriptomic response under salt condition was discussed.

Results and discussion

Identification of the differentially expressed genes

To identify differentially expressed genes in the presence of nod gene inducers six RNA-seq libraries were generated from CIAT 899 grown in the presence of apigenin (3.7 μM), salt (300 mM) or under control conditions. Two independent biological experiments were carried out for each condition, being the general features of each run shown in Additional file 1. Libraries were sequenced and a range of 54 to 210 million reads were obtained in each condition, indicating that similar amounts of data were generated independently of the growth condition. Three different RNA-seq metrics for quality control, such as GC content, duplicate distribution, and the distribution of respective genetic coordinates, were performed (Additional file 1). Besides, before all subsequent analysis, a normalization of the quantitative data was performed to avoid statistical deviations due to differences in library and genetic sizes [22] (Additional file 1). Data set were validated by qRT-PCR (Table 1). In all cases, positive correlation degrees were obtained in fold-change values of the qRT-PCR and the RNA-seq data (Additional file 2).
Table 1

RNA-seq data validation using qRT-PCR. Fold-change values were calculated using the ∆∆Ct method and normalized to the reference gene RNA 16S for 20 differentially expressed genes. HP: gene that codes for a hypothetical protein

Gene name

Locus tag

Nucleotide range



















































































































































R. tropici CIAT 899 genome contains 6289 genes distributed among one chromosome (3672 CDS, GeneBank number CP004015.1) and three different plasmids: pRtrCIAT899a/pA (212 CDS, GeneBank number CP004016.1), pRtrCIAT899b/pB/symbiotic plasmid (500 CDS, GeneBank number CP004017.1), and pRtrCIAT899c/pC (1905 CDS, GeneBank number CP004018.1) [17]. Differentially expressed genes in each condition were obtained using the statistical software R. Results revealed 19 differentially expressed genes when the bacterium was grown in the presence of apigenin (0.3 % of the genome) and 790 genes when the bacterium was grown under salt stress condition (12.5 % of the genome). In the first case, 15 genes were up-regulated (78.9 %) and located in the symbiotic plasmid, while the other 4 were down-regulated and distributed in the other replicons: chromosome (2) and plasmids A (1) and C (3) (Fig. 1a). With respect to the cultures supplemented with salt, the majority of genes were down-regulated (646, 81.6 %); only 144 genes were over-expressed representing 18.4 % of the differentially expressed genes (Fig. 1b). Replicon distribution revealed that most of these genes (723; 95 up-regulated and 628 down-regulated) were located in the chromosome, some in plasmid B (31; 30 up-regulated and 1 down-regulated) and the rest in plasmid C (36; 19 up-regulated and 17 down-regulated) (Fig. 1b).
Fig. 1
Fig. 1

Distribution of differentially expressed genes. Percentage and location of differentially expressed genes (up- and down-regulated) in R. tropici CIAT 899 induced with apigenin (a) or salt (b). Ch: chromosome, pA: pRtrCIAT899a, pB: pRtrCIAT899b, pC: pRtrCIAT899c. The number of differentially expressed genes is indicated under each replicon

Considering all genes, 17 were differentially expressed in both conditions, 15 were up-regulated and located in the symbiotic plasmid and the other two were down-regulated and located in the chromosome (Fig. 2). In addition, many other genes were differentially expressed only under saline conditions. This could be due to a survival strategy when the bacterium faces an abiotic stress. Similar results have been recently reported in R. etli CE3 under saline shock conditions, when 6.57 % of the genes were differentially expressed, most of them were located in the chromosome [16].
Fig. 2
Fig. 2

R. tropici CIAT 899 responses to apigenin and salt stress. Venn diagram showing the overlapping of differentially expressed genes in the presence of inducer molecules. The arrows indicate the number of up-regulated and down-regulated genes in each treatment, being the number of overlapping intersections indicative of genes expressed in both conditions

Functions associated to responses to nod gene-inducing molecules

CIAT 899 is able to synthesize NF not only upon induction with flavonoid but also under salt stress [14]. To evaluate which functions are intrinsic to each inducing condition, a functional enrichment was carried out to assign the statistically over-represented biological processes (activated or repressed) using data available at the Uniprot database (Gene Ontology, GO). Results are summarized in Additional file 3. As expected, when induced with apigenin, some of the differentially expressed genes were involved in nodulation and oligosaccharide transport (Fig. 3a). The same biological processes were also activated under salt stress. However, the presence of salt also induced many other biological processes such as nitrogen fixation, chemotaxis, carbohydrate metabolism, transcription, translation, conjugation and ATP biosynthesis (Fig. 3b). RNA-seq and proteomics studies performed on other rhizobial strains under saline and heat shock conditions revealed that many of these processes are also being altered [16, 23, 24].
Fig. 3
Fig. 3

Number of regulated genes from each over-represented biological process (Gene Onthology, GO). Biological processes and number of genes affected in bacterial cultures supplemented with apigenin (yellow) a or salt (blue) (b). Over-represented functional categories were those with a p value in hypergeometrical test inferior to 0.15

Apigenin activates the transcription of nod gene operons and the synthesis of indole-3-acetic acid

The CIAT 899 genome harbors five different nod genes and three different nodA genes in the symbiotic plasmid [17]. Are the nodA1BC and the nodA2FE operons and the nodA3 gene of CIAT 899 activated by apigenin? Transcriptomic analysis revealed that four sets of genes were significantly up-regulated in the symbiotic plasmid under this condition. Two of them corresponded to the nodA1 and nodA2 operons, the third, located adjacent to the nodD5 gene, was involved in the synthesis of the phytohormone indole-3-acetic acid (IAA) [21], and the last set of genes coded for proteins with unknown functions (Fig. 4, Additional file 4). Therefore, at least two main biological processes were activated in CIAT 899 when induced with apigenin: the synthesis of NF (nodA1BCSUIJHPQ1Q2 and nodA2hsnTnodFE genes) and the production of IAA (y4wEF genes). Of special interest is the elucidation of the symbiotic role of the set of genes with unknown function activated with apigenin (RTCIAT899_PB01550 and RTCIAT899_ PB01545). In summary, these results indicate that when induced with apigenin, CIAT 899 is able to synthesize NF that will induce root nodule primordia formation, as well as phytohormones that will favor root development. This is consistent with a recent study in which an increase in the production of IAA is reported when CIAT 899 is grown in cultures supplemented with both apigenin and salt [20]. A similar regulation cascade involving NodD1 and flavonoids has been reported in Sinorhizobium fredii NGR234 for the synthesis of IAA [25].
Fig. 4
Fig. 4

Circular representation of the complete RNA-seq-based transcriptomic data set of apigenin and salt cultures versus standard cultures for all replicons of R. tropici CIAT 899. Fold-change values of consecutive genes are represented by linked peaks. Each peak corresponds to one gene, being ordered according to their relative position in the replicon. Yellow peaks: differentially expressed genes in the presence of apigenin. Blue peaks: differentially expressed genes under salt stress. The arrows indicate names and related biological processes of some differentially expressed genes. NB: nod box. a Chromosome (3.8 Mb), b pRtrCIAT899a (0.22 Mb), c pRtrCIAT899b (0.55 Mb), d pRtrCIAT899c (2.08 Mb)

Common responses: synthesis of nodulation molecules and identification of nod boxes

As previously mentioned in the introduction, the capacity to nodulate under salt stress conditions has been described for many rhizobial strains [15]. However, the salt-regulated production of NF has been only reported for R. tropici CIAT 899. Are the CIAT 899 genetic pathways to synthesize NF under salt stress similar to those described for apigenin? The RNA-seq analysis shown in this work indicated that although the pattern of symbiotic gene activation in the presence of salt was similar to that obtained with apigenin (Fig. 4, Additional files 4 and 5), slightly higher expressions of nod genes, with an emphasis on the nodA1 operon, was observed when the bacterium was induced with salt (Table 2). Moreover, under salt stress nodM and a gene that codes for a flavine mononucleotide reductase (RTCIAT899_PB02705) were significantly up-regulated. Although the up-regulation of nodM was not detected in the RNA-seq of CIAT 899 grown in the presence of apigenin, it was detected in the qRT-PCR analysis (5.69 fold-change) (Table 1).
Table 2

Fold-change expression values of the R. tropici CIAT 899 nod genes on the RNA-seq analysis. Fold-change values followed by an asterisk (*) are significantly over-expressed

Gene name/Locus tag





































































All these results indicate that the transcription of the CIAT 899 nod genes seems to respond similarly to the presence of either apigenin or salt (Table 2). The question that remains unclear is whether the responses observed follow the common NodD activation pathway. To answer this question, an in silico analysis to detect promoter motifs was conducted using consensus sequences of nod boxes (NB) present in other related rhizobial strains. Thus, ten potential NB were identified, all sharing the AT-N11-AT-N7-AT-N2-A-N4-AT-N2-ATT-N-T sequence consensus (Fig. 5). Four of these NB were situated upstream of the four different sets of genes up-regulated under both inducing conditions, namely, the operons nodA1BCSUIJHPQ1Q2 (NB1), nodA2hsnTnodFE (NB2), y4wEF operon (NB4) and the nodM gene (NB3) (Table 3, Additional files 4 and 5). Interestingly, two of the genes previously identified as up-regulated with apigenin (RTCIAT899_PB01550 and RTCIAT899_ PB01545) were also induced with salt. The expression of one of these genes (RTCIAT899_PB01545) was validated by qRT-PCR in the presence of both apigenin (8.88-fold) and salt (18.11-fold) (Table 1). This gene expression, combined with the fact that both genes were located downstream NB5, indicate that these hypothetical proteins could be playing a role in the symbiotic process (Table 3, Additional files 4 and 5). Finally, NB7 was located upstream a gene (RTCIAT899_PB01055) up-regulated with salt, but not with apigenin (Table 3, Additional file 5).
Fig. 5
Fig. 5

Alignment of NB sequences from R. tropici CIAT 899. Software fuzznuc of the EMBOSS package was used to identify nod box-like sequences. Hits were selected based on their conservation to previously identified NB sequences. Searching of these consensus sequences was carried out on the whole genome of R. tropici CIAT 899. DNA sequences were aligned using the ClustalW program and manipulated with Boxshade at EMBnet. Dark and gray boxes indicate identical and similar nucleotides, respectively. Promoter consensus motifs are marked with an asterisk on the consensus line

Table 3

Up-regulation of the R. tropici CIAT 899 pRtCIAT899b genes located downstream nod boxes. Transcriptional activation (4-fold induction with respect to control cultures) of several nod box controlled operons was demonstrated by RNA-seq data in the presence of both inducer molecules. HP: gene that codes for a hypothetical protein

NB (Distance)

Cds number/Locus tag/Gene name

Putative function

Fold change


Fold change


1 (270 pb)

cds216 to 227/RTCIAT899_ PB01300 to RTCIAT899_ PB01345/nodABCSUIJHPQ1Q2

NF production

8.73 to 1.28

13.66 to 2.54

2 (250 pb)

cds177 to 180/RTCIAT899_ PB01095 to RTCIAT899_ PB01110/nodA2hsnTnodFE

NF production

10.3 to 10.37

9.81 to 11.94

3 (180 pb)

cds450 to 449/RTCIAT899_ PB02710 to RTCIAT899_ PB02705/nodM

NF production

2.43 to 3.18

5.85 to 8.49

4 (680 pb)

cds95 to 93/RTCIAT899_PB00575 to RTCIAT899_PB00565/y4wEF

Synthesis of IAA

8.57 to 1.63

12.17 to 3.27

5 (38 pb)

cds263 to 262/RTCIAT899_PB01550 to RTCIAT899_PB01545/HP


6.75 to 4.26

28.65 to 14.37

6 (70 pb)

cds61 to 62/RTCIAT899_PB00370 to RTCIAT899_PB00375/HP


1.80 to 2.11

2.17 to 1.64

7 (191 pb)





8 (470 pb)





9 (979 pb)


Transcriptional regulation



10 (292 pb)





aThe transcriptional orientation of this gene is opposite to the NB10 sequence orientation

Altogether, these results suggest that when CIAT 899 was induced either by apigenin or salt, genes related to the synthesis of NF and IAA were activated through the recognition of specific nod boxes (Table 2). In S. fredii NGR234, nineteen nod boxes have been identified and eighteen of them were inducible with flavonoids via NodD1. In addition, S. fredii NGR234 carries two copies of the nodD gene and four of these nod boxes are controlled by NodD2 [26].

Salt stress response: a pathway towards symbiosis?

Chromosomal RNA-seq analysis showed, under salt stress, an up-regulation of genes whose products are implied in the formation of ribosomes (both 30 and 50S ribosomal proteins coded by the rpl and rps genes, respectively) and in the utilization of the disaccharide trehalose (thuAB genes), an osmotic stabilizer [27, 28]. Instead, down-regulation was detected in genes that encode proteins involved in chemotaxis (cheXYAWRBYD genes) [29], formation of Flp-type pili (type IVb protein secretion system; tadBCD and cpaABCDEF genes) [30], or ABC-type transport (several genes: e.g. phn genes). In addition, a set of genes located in pRtrCIAT899c and involved in the export of capsular polysaccharide across the inner membrane (genes of the family bexC/ctrB/kpsE) was also down-regulated (Fig. 4, Additional file 5) [31].

Therefore, the global response of CIAT 899 to salt stress indicates that the strain drastically reduces the uptake of molecules and the energy consumption, reducing the ATP-dependent transport of the ABC-type permeases. However, under saline shock conditions R. etli CE3 over-expresses genes that code for many ABC transporters and increases the transcription levels of genes related to the biosynthesis of trehalose [16]. Nevertheless, in CIAT 899 an up-regulation of genes involved in the degradation of trehalose was detected. The explanation could be that this osmolyte tends to accumulate during symbiosis, but it is toxic to plants. To solve this problem, bacteria would produce trehalose-degrading enzymes [32]. These observations in CIAT 899, combined with the production of NF and IAA under salt stress, make us formulate the next question: could the promotion of the symbiosis represent a strategy of CIAT 899 to ensure nodulation under this abiotic stress condition? To shed light on this question we have analyzed the changes in biological processes under saline conditions. First, we observed an enhancement in ribosomal synthesis, which could be related to a translation increase due to an increase in the transcription of nodulation genes (a stronger nod gene up-regulation was detected in salt stress conditions). Second, general protein secretion systems and surface polysaccharides seem to play an important role in bacterial fitness under stressing environmental conditions, such as high salinity or temperature [16, 23, 24]. However, in CIAT 899 we observed inhibition under salt stress. Remarkably, both cellular components can also be considered microbe-associated molecular patterns (MAMP), which trigger the plant immune system [33]. Other rhizobial strains in the presence of inducer flavonoids modify their surface molecules to avoid plant immune responses, thereby promoting nodulation [34]. Finally, when colonizing legume roots many rhizobia form micro-colonies or biofilms. In some of these rhizobia common nod genes are required for the development of these structures, since the biofilm matrix is composed in part by NF [35, 36]. Thus, in CIAT 899, the synthesis of NF and the down-regulation of chemotaxis genes detected under saline conditions could be related to the formation of a symbiotic biofilm. In summary (Fig. 6), these results suggest that in the presence of salt CIAT 899 could strategically increase the transcription of nodulation genes and the synthesis of NF to increase the chances to establish symbiosis even under abiotic stressing conditions.
Fig. 6
Fig. 6

Model of the Rhizobium tropici CIAT 899 transcriptomic response under salt stress. RNA-seq studies indicate that in the presence of high concentration of salt CIAT 899 increase the transcription of nodulation genes and the synthesis of NF. Transcriptional activation (+) or inhibition (−) of other biological processes associated with the presence of salt supports the hypothesis that salt-dependent production of NF could be a special R. tropici CIAT 899 strategy to ensure nodulation under salt stress conditions. HP: gene that codes for a hypothetical protein


Our work shows that in the presence of the nodulation inducing molecules apigenin (3.7 μM) and salt (300 mM), R. tropici CIAT 899 increases the transcription of the same set of genes (slightly higher expression upon salt treatment), whose encoded proteins are involved in the synthesis of symbiotic molecules. Besides, the salt-dependent production of these symbiotic molecules could be a CIAT 899 strategy to ensure nodulation under salt stress.


Culture conditions and RNA extraction

R. tropici CIAT 899 was grown for 72 h at 28 °C on tryptone yeast (TY) medium [37], supplemented with apigenin 3.7 μM or NaCl 300 mM when necessary (induction of the synthesis of NF) [14, 20]. Total RNA was isolated using a High Pure RNA Isolation Kit (Roche), according to the manufacturer’s instructions. Verification of the amount and quality of total RNA samples was carried out using a Nanodrop 1000 spectrophotometer (Thermo Scientific) and a Qubit 2.0 Fluorometer (Invitrogen). Two independent total RNA extractions were obtained for each condition.

Quantitative reverse transcription PCR

Result obtained in the RNA-seq analysis were validated by quantitative reverse transcription PCR (qRT-PCR) of 20 selected genes, which represented differentially and non-differentially expressed genes in the presence of apigenin and salt. Total RNA was isolated using a High Pure RNA Isolation Kit (Roche) and RNAase Free DNA Set (Qiagen), according to the manufacturer’s instructions. This (DNA-free) RNA was reverse transcribed into cDNA using a QuantiTec Reverse Transcription Kit (Qiagen). Quantitative PCR was performed using a LightCycler 480 (Roche) with the following conditions: 95 °C, 10 min; 95 °C, 30 s; 50 °C, 30 s; 72 °C, 20 s; forty cycles, followed by the melting curve profile from 60 to 95 °C to verify the specificity of the reaction. The R. tropici CIAT 899 16S rRNA gene was used as an internal control to normalize gene expression. The fold-changes of two biological samples with three technical replicates of each condition were obtained using the ∆∆Ct method [38]. Selected genes and primers are listed in Additional file 2.

RNA sequencing

Ribosomal RNA was depleted using a MICROB Express Bacterial mRNA Purification kit (Ambion), following the manufacturer’s protocol. Integrity and quality of the ribosomal depleted RNA was checked with Agilent Bioanalyzer 2100 (Agilent Technologies). RNA sequencing was carried out by Sistemas Genómicos ( with the Next Generation Sequence (NGS) platform Illumina using the Illumina HiSeq 2000 sequencing instrument (Illumina). Ribosomal-depleted samples were used to generate whole transcriptome libraries following the manufacturer's recommendations for sequencing on this NGS platform. Amplified cDNA quality was analyzed by the Bioanalyzer 2100 DNA 1000 kit (Agilent Technologies) and quantified using the Qubit 2.0 Fluorometer (Invitrogen).

Mapping of the RNA-seq data

The initial whole transcriptome paired-end reads obtained from sequencing were mapped against the latest version of the R. tropici CIAT 899 genome ( using the the Life Technologies mapping algorithm version 1.3 ( Low-quality reads were eliminated using Picard Tools software version 1.83, remaining only high quality reads.

Assessment of differentially expressed genes

Gene prediction was estimated using the cufflinks method [39] and the expression levels were calculated using the htseq software, version 0.5.4p3 [40]. This method eliminates multimapped reads, considering only unique reads for the gene expression estimation. The edge method version 3.2.4 was applied for differential expression analysis among conditions [41]. This method uses a Poisson model to estimate the variance of the RNA-seq data for differential expressions, and relies on different normalized processes based on depth global samples, CG composition and length of genes. Differentially expressed genes were established in those genes with a fold-change lower or higher than−4 or 4, respectively, with a p value adjust to 0.7.

Functional categorization of genes

In order to assign the statistical over-represented functional categories in the presence of both nod gene-inducing molecules, an enrichment functional study was performed. Thus, genes were annotated using Uniprot databases and a hypergeometrical test using all genes as background and differential gene expression as interesting group of genes was applied [42]. This statistical test calculates the statistical significance using p value [43], being in this case evaluated the significance of functional categories. Those functional categories (biological processes) with a p value inferior to 0.15 were considered over-represented.

Consensus motifs

The program fuzznuc of the EMBOSS package was used to identify nod box-like sequences. Hits were selected based on their conservation to known NB sequences [44, 45]. Thus, the search pattern used was at[ct][cg][ag]n(5)[tc][ga][ga]atn(7)at[ct]caaacaatc[ga]attttncn(2)at, allowing a maximum of 3 mismatches. Searching of these consensus sequences was carried out on the whole genome of CIAT 899. Alignment of the NB sequences was performed using ClustalW at EMBnet.

RNA-seq data accession number

The RNA-seq data discussed in this publication have been deposited in the Sequence Read Archive of NCBI under the accession number SRP067561.



We would like to thank the Ministerio de Economía y Competitividad of the Spanish government (project AGL2012-), the Junta de Andalucía (project P11-CVI-7050) and the University of Seville (VPPI) for funding this work. Mr. Del Cerro is recipient of an FPU fellowship of the Ministerio de Economía y Competitividad. Dr. Pérez-Montaño is recipient of a post-doctoral contract of the VPPI of the University of Seville. Dr. Hungria is a research fellow of CNPq-Brazil. We also would like to acknowledge Dr. Juan Carlos Treviño (Sistemas Genómicos S.L.) for his help during all the bioinformatic analysis and Dr. Jonathan Trow (Sequence Read Archive from NCBI) for his technical assistance during the RNA-seq data submission. We thank the Servicio General de Biología of the CITIUS from the University of Seville for allowing us to use their laboratory equipment.

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Authors’ Affiliations

Departamento de Microbiología, Facultad de Biología, Universidad de Sevilla, Avenida Reina Mercedes n° 6, 41012 Sevilla, Spain
Embrapa Soja, Londrina, Paraná, Brazil


  1. Broughton WJ, Jabbouri S, Perret X. Keys to symbiotic harmony. J Bacteriol. 2000;182:5641–52.View ArticlePubMedPubMed CentralGoogle Scholar
  2. Oldroyd GE. Speak, friend, and enter: signaling systems that promote beneficial symbiotic associations in plants. Nat Rev Microbiol. 2013;11:252–63.View ArticlePubMedGoogle Scholar
  3. Black M, Moolhuijzen P, Chapman B, Barrero R, Howieson J, Hungria M, et al. The genetics of symbiotic nitrogen fixation: comparative genomics of 14 rhizobia strains by resolution of protein clusters. Genes. 2012;3:138–66.View ArticlePubMedPubMed CentralGoogle Scholar
  4. van Hameren B, Hayashi S, Gresshoff PM, Ferguson BJ. Advances in the identification of novel factors required in soybean nodulation, a process critical to sustainable agriculture and food security. J Plant Biol Soil Health. 2013;1:6.Google Scholar
  5. Hungria M, Phillips DA. Effects of a seed color mutation on rhizobial nod-gene-inducing flavonoids and nodulation in common bean. Mol Plant-Microbe Interact. 1993;6:418–22.View ArticleGoogle Scholar
  6. Phillips DA, Joseph CM, Maxwell CA. Trigonelline and stachydrine released from alfalfa seeds activate NodD2 protein in Rhizobium meliloti. Plant Physiol. 1992;99:1526–31.View ArticlePubMedPubMed CentralGoogle Scholar
  7. Cooper JE. Early interactions between legumes and rhizobia: disclosing complexity in a molecular dialogue. J Appl Microbiol. 2007;103:1355–65.View ArticlePubMedGoogle Scholar
  8. Martínez-Romero E, Segovia L, Mercante FM, Franco AA, Graham P, Pardo MA. Rhizobium tropici, a novel species nodulating Phaseolus vulgaris L. Beans and leucaena sp. Trees. Int J Syst Bacteriol. 1991;41:417–26.View ArticlePubMedGoogle Scholar
  9. Gomes DF, Ormeño-Orrillo E, Hungria M. Biodiversity, symbiotic efficiency and genomics of Rhizobium tropici and related species. In: De Bruijn F, editor. Biological nitrogen fixation. vol.2. Chapter 74. New Jersey: Wiley; 2015. p. 747–56.View ArticleGoogle Scholar
  10. Poupot R, Martínez-Romero E, Promé JC. Nodulation factors from Rhizobium tropici are sulfated or nonsulfated chitopentasaccharides containing an N-methyl-N-acylglucosaminyl terminus. Biochemistry. 1993;32:10430–5.View ArticlePubMedGoogle Scholar
  11. Garcia MLJ, Dunlap J, Loh J, Stacey G. Phenotypic characterization and regulation of the nolA gene of Bradyrhizobium japonicum. Mol Plant Microbe Interact. 1996;9:625–35.View ArticlePubMedGoogle Scholar
  12. Estévez J, Soria-Díaz ME, de Córdoba FF, Morón B, Manyani H, Gil A, et al. Different and new Nod factors produced by Rhizobium tropici CIAT899 following Na + stress. FEMS Microbiol Lett. 2009;293:220–31.View ArticlePubMedGoogle Scholar
  13. Morón B, Soria-Díaz ME, Ault J, Verroios G, Noreen S, Rodríguez-Navarro DN, et al. Low pH changes the profile of nodulation factors produced by Rhizobium tropici CIAT899. Chem Biol. 2005;12:1029–40.View ArticlePubMedGoogle Scholar
  14. Guasch-Vidal B, Estévez J, Dardanelli MS, Soria-Díaz ME, de Córdoba FF, Balog CI, et al. High NaCl concentrations induce the nod genes of rhizobium tropici CIAT899 in the absence of flavonoid inducers. Mol Plant Microbe Interact. 2013;26:451–60.View ArticlePubMedGoogle Scholar
  15. Zahran HH. Rhizobium-legume symbiosis and nitrogen fixation under severe conditions and in an arid climate. Microbiol Mol Biol Rev. 1999;63:968–89.PubMedPubMed CentralGoogle Scholar
  16. López-Leal G, Tabche ML, Castillo-Ramírez S, Mendoza-Vargas A, Ramírez-Romero MA, Dávila G. RNA-Seq analysis of the multipartite genome of Rhizobium etli CE3 shows different replicon contributions under heat and saline shock. BMC Genomics. 2014;15:770.View ArticlePubMedPubMed CentralGoogle Scholar
  17. Ormeño-Orrillo E, Menna P, Gonzaga LA, Ollero FJ, Nicolas MF, Rodrigues EP, et al. Genomic basis of broad host range and environmental adaptability of rhizobium tropici CIAT 899 and rhizobium sp. PRF 81 which are used in inoculants for common bean (Phaseolus vulgaris L.). BMC Genomics. 2012;13:735.View ArticlePubMedPubMed CentralGoogle Scholar
  18. John M, Röhring H, Schmidt J, Wienecke U, Schell J. Rhizobium NodB protein involved in nodulation signal synthesis is a chitooligossacharide deacetylase. Proc Natl Acad Sci U S A. 1993;90:625–9.View ArticlePubMedPubMed CentralGoogle Scholar
  19. van Rhijn PJ, Feys B, Verreth C, Vanderleyden J. Multiple copies of nodD in Rhizobium tropici CIAT899 and BR816. J Bacteriol. 1993;175:438–47.PubMedPubMed CentralGoogle Scholar
  20. del Cerro P, Rolla-Santos AAP, Gomes DF, Marks BB, Pérez-Montaño F, Rodríguez-Carvajal MA, et al. Regulatory nodD1 and nodD2 genes of Rhizobium tropici strain CIAT 899 and their roles in the early stages of molecular signaling and host-legume nodulation. BMC Genomics. 2015;16:251.View ArticlePubMedPubMed CentralGoogle Scholar
  21. del Cerro P, Rolla-Santos AAP, Gomes DF, Marks BB, Espuny MR, P Rodríguez-Carvajal MA, et al. Opening the "black box" of nodD3, nodD4 and nodD5 genes of Rhizobium tropici strain CIAT 899. BMC Genomics. 2015;16:864.View ArticlePubMedPubMed CentralGoogle Scholar
  22. Hansen KD, Irizarry RA, Wu Z. Removing technical variability in RNA-seq data using conditional quantile normalization. Biostatistics. 2012;3:204–16.View ArticleGoogle Scholar
  23. Alexandre A, Laranjo M, Oliveira S. Global transcriptional response to heat shock of the legume symbiont mesorhizobium loti MAFF303099 comprises extensive gene downregulation. DNA Res. 2014;21:195–206.View ArticlePubMedPubMed CentralGoogle Scholar
  24. Gomes DF, Batista JSS, Schiavon AL, Andrade DS, Hungria M. Proteomic profiling of Rhizobium tropici PRF 81: identification of conserved and specific responses to heat stress. BMC Microbiol. 2012;12:84.View ArticlePubMedPubMed CentralGoogle Scholar
  25. Theunis M, Kobayashi H, Broughton WJ, Prinsen E. Flavonoids, NodD1, NodD2, and nod-box NB15 modulate expression of the y4wEFG locus that is required for indole-3-acetic acid synthesis in Rhizobium sp. Strain NGR234. Mol Plant Microbe Interact. 2004;17:1153–61.View ArticlePubMedGoogle Scholar
  26. Kobayashi H, Naciri-Graven Y, Broughton WJ, Perret X. Flavonoids induce temporal shifts in gene-expression of nod-box controlled loci in Rhizobium sp. NGR234. Mol Microbiol. 2004;51:335–47.View ArticlePubMedGoogle Scholar
  27. Ampomah OY, Avetisyan A, Hansen E, Svenson J, Huser T, Jensen JB, et al. The thuEFGKAB operon of rhizobia and Agrobacterium tumefaciens codes for transport of trehalose, maltitol, and isomers of sucrose and their assimilation through the formation of their 3-keto derivatives. J Bacteriol. 2013;195:3797–807.View ArticlePubMedPubMed CentralGoogle Scholar
  28. Strøm AR, Kaasen I. Trehalose metabolism in escherichia coli: stress protection and stress regulation of gene expression. Mol Microbiol. 1993;8:205–10.View ArticlePubMedGoogle Scholar
  29. Park SY, Chao X, Gonzalez-Bonet G, Beel BD, Bilwes AM, Crane BR. Structure and function of an unusual family of protein phosphatases: the bacterial chemotaxis proteins CheC and CheX. Mol Cell. 2004;16:563–74.PubMedGoogle Scholar
  30. Zatakia HM, Nelson CE, Syed UJ, Scharf BE. ExpR coordinates the expression of symbiotically important, bundle-forming Flp pili with quorum sensing in Sinorhizobium meliloti. Appl Environ Microbiol. 2014;80:2429–39.View ArticlePubMedPubMed CentralGoogle Scholar
  31. Kroll JS, Loynds B, Brophy LN, Moxon ER. The bex locus in encapsulated Haemophilus influenzae: a chromosomal region involved in capsule polysaccharide export. Mol Microbiol. 1990;4:1853–62.View ArticlePubMedGoogle Scholar
  32. Boboye B. Degradation of trehalose by rhizobia and characteristics of a trehalose-degrading enzyme isolated from Rhizobium species NGR234. J Appl Microbiol. 2004;97:256–61.View ArticlePubMedGoogle Scholar
  33. Jones JD, Dangl JL. The plant immune system. Nature. 2006;444:323–9.View ArticlePubMedGoogle Scholar
  34. Vinardell JM, Ollero FJ, Hidalgo A, López-Baena FJ, Medina C, Ivanov-Vangelov K, et al. NolR regulates diverse symbiotic signals of Sinorhizobium fredii HH103. Mol Plant Microbe-Interact. 2004;17:676–85.View ArticlePubMedGoogle Scholar
  35. Fujishige NA, Lum MR, De Hoff PL, Whitelegge JP, Faull KF, Hirsch AM. Rhizobium common nod genes are required for biofilm formation. Mol Microbiol. 2008;67:504–15.View ArticlePubMedGoogle Scholar
  36. Pérez-Montaño F, Jiménez-Guerrero I, Del Cerro P, Baena-Ropero I, López-Baena FJ, Ollero FJ, et al. The symbiotic biofilm of Sinorhizobium fredii SMH12, necessary for successful colonization and symbiosis of glycine max cv osumi, is regulated by quorum sensing systems and inducing flavonoids via NodD1. PLoS One. 2014;9, e105901.View ArticlePubMedPubMed CentralGoogle Scholar
  37. Beringer JE. R factor transfer in Rhizobium leguminosarum. J Gen Microbiol. 1974;84:188–98.PubMedGoogle Scholar
  38. Pfaffl MW. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001;29, e45.View ArticlePubMedPubMed CentralGoogle Scholar
  39. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28:511–5.View ArticlePubMedPubMed CentralGoogle Scholar
  40. Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11:R106.View ArticlePubMedPubMed CentralGoogle Scholar
  41. Robinson MD, McCarthy DJ, Smyth GK. EdgeR a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–40.View ArticlePubMedPubMed CentralGoogle Scholar
  42. Huang DW, Sherman BT. Lempicki. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13.View ArticlePubMed CentralGoogle Scholar
  43. Rice JA. Mathematical Statistics and Data Analysis. 2007; (Third ed.). In: USA: Duxbury Press. p.42.Google Scholar
  44. Sousa C, Folch JL, Boloix P, Megías M, Nava N, Quinto C. A Rhizobium tropici DNA region carrying the amino-terminal half of a nodD gene and a nod-box-like sequence confers host-range extension. Mol Microbiol. 1993;9:1157–68.View ArticlePubMedGoogle Scholar
  45. Okker RJ, Spaink HP, Lugtenberg BJ, Schlaman HR. Mutants in the nodFEL promoter of Rhizobium leguminosarum bv. Viciae reveal a role of individual nucleotides in transcriptional activation and protein binding. Arch Microbiol. 2001;175:152–60.View ArticlePubMedGoogle Scholar


© Pérez-Montaño et al. 2016