Skip to main content
  • Research article
  • Open access
  • Published:

Detailed transcriptome analysis of the plant growth promoting Paenibacillus riograndensis SBR5 by using RNA-seq technology



The plant growth promoting rhizobacterium Paenibacillus riograndensis SBR5 is a promising candidate to serve as crop inoculant. Despite its potential in providing environmental and economic benefits, the species P. riograndensis is poorly characterized. Here, we performed for the first time a detailed transcriptome analysis of P. riograndensis SBR5 using RNA-seq technology.


RNA was isolated from P. riograndensis SBR5 cultivated under 15 different growth conditions and combined together in order to analyze an RNA pool representing a large set of expressed genes. The resultant total RNA was used to generate 2 different libraries, one enriched in 5′-ends of the primary transcripts and the other representing the whole transcriptome. Both libraries were sequenced and analyzed to identify the conserved sequences of ribosome biding sites and translation start motifs, and to elucidate operon structures present in the transcriptome of P. riograndensis. Sequence analysis of the library enriched in 5′-ends of the primary transcripts was used to identify 1082 transcription start sites (TSS) belonging to novel transcripts and allowed us to determine a promoter consensus sequence and regulatory sequences in 5′ untranslated regions including riboswitches. A putative thiamine pyrophosphate dependent riboswitch upstream of the thiamine biosynthesis gene thiC was characterized by translational fusion to a fluorescent reporter gene and shown to function in P. riograndensis SBR5.


Our RNA-seq analysis provides insight into the P. riograndensis SBR5 transcriptome at the systems level and will be a valuable basis for differential RNA-seq analysis of this bacterium.


Members of Paenibacillus genus are Gram-positive, spore-forming, motile and facultatively anaerobic bacteria [1]. This group is biochemically and morphologically diverse and is found in various environments, such as soil [2], rhizosphere [3], insect larvae [4], and clinical samples [5]. Originally, Paenibacillus belonged to the genus Bacillus, however, in 1993 it was reclassified as a separate genus [6]. The important plant growth promoting (PGP) species P. polymyxa, P. macerans and P. azotofixans were included in the new genus when it was proposed [6]. The genus Paenibacillus currently comprises more than 150 named species; approximately 6% of these are able to fix nitrogen and possess some other plant growth promotion abilities [7].

Paenibacillus riograndensis SBR5 is the type strain of this species and was isolated from rhizosphere of wheat (Triticum aestivum) fields in the south of Brazil (Rio Grande do Sul) [8]. It was shown that P. riograndensis SBR5 is a promising candidate for crop inoculation because of its nitrogen fixation ability and other plant growth promotion characteristics such as production of phytohormones and antimicrobial substances [9, 10]. Furthermore, SBR5 is cellulolytic and xylanolytic, and is able to perform competition against Gram-negative and Gram-positive pathogens such as Pectobacterium carotovorum and Listeria monocytogenes, respectively [11].

A phylogenetic analysis of SBR5 based on the 16S rRNA gene sequence has showed that it is most closely related to Paenibacillus graminis RSA19T (98.1% similarity) [9]. The genome of SBR5 was completely sequenced and annotated; its circular chromosome consists of 7,893,056 base pairs, with GC content of 50.97% [12]. The annotation of the finished genome sequence showed the presence of 6705 protein coding genes, 87 tRNAs and 27 rRNAs genes [12].

Recent research efforts on transcriptome characterization in paenibacilli focused on comparative transcriptomic analysis under different plant-related conditions [13, 14]. Contrary to these differential transcriptomic analyses, comprehensive transcriptome analysis allows to chart the RNA landscape of a particular organism for improvement of the genome annotation, detection of novel transcripts and conserved sequence motifs such as transcription start sites (TSS), promoters and ribosomal biding sites (RBS) [15, 16]. These comprehensive analyses have been performed for bacteria of industrial or public health relevance such as Corynebacterium glutamicum [15] and Salmonella [17]. Although complete genome sequences of several Paenibacillus PGP members have been published [12, 18,19,20], a genome-wide analysis of the transcriptome characterizing the whole transcriptome of a member of Paenibacillus genus is still missing.

In this study, we describe genome-wide TSS mapping and whole transcriptome analysis of P. riograndensis SBR5 cultivated under 15 conditions. Conserved sequence motifs for promoters, ribosome binding sites, riboswitches and other RNA families were determined, and the function of a TPP (thiamine pyrophosphate) riboswitch confirmed by translational fusion with a green fluorescence protein (GfpUV) reporter gene. TPP riboswitches typically bind TPP and regulate expression of genes that are involved in biosynthesis and transport of thiamine in eukaryotes and bacteria making them interesting targets for the study of antibacterial compounds [21].


Cultivation of P. riograndensis SBR5 in different conditions

P. riograndensis SBR5, the bacterial strain used in this study, was obtained from the strain collection of the Department of Genetics at Universidade Federeal do Rio Grande do Sul. Here, we exposed SBR5 to varied growth conditions. In all experiments, the bacterial cells were grown in 500 mL flasks containing 50 mL of medium shaking at 120 rpm and at 30 °C, if not stated otherwise. For each condition tested, four biological replicates were used: one for harvesting of bacterial cells and total RNA isolation, and three for further determination of growth characteristics. The optical density at 600 nm (OD600 nm) of the cultivated cells was measured throughout growth. The initial OD600 nm in all cultivations was approximately 0.05.

The first experiment was performed with lysogeny broth (LB) as growth medium; the cells were grown under 3 different temperatures: 20 °C, 30 °C or 37 °C. Cells were also cultivated at 30 °C for further application of 5 min-cold shock (from 30 °C to 4 °C) or heat shock (from 30 °C to 50 °C) when the middle of the exponential phase was reached. The PbMM (P. riograndensis minimal medium) with 20 mM glucose as carbon source was used for application of the remaining stress conditions. Minimal PbMM medium contained the following, in 1 L of RO-water: K2HPO4, 4.09 g; NaH2PO4, 1.3 g; (NH4)2SO4, 2.11 g; biotin, 0.1 mg; concentrated trace element (TE) solution, 1 mL. The concentrated TE solution contained the following, in 1 L of RO-water: FeSO4*7H2O, 5.56 g; CuCl2*2H2O, 0.027 g; CaCl2*2H2O, 7.35 g; CoCl2*6H2O, 0.04 g; MnCl2*4H2O, 9.90 g; ZnSO4*7H2O, 0.288 g; Na2MoO4*2H2O, 0.048 g; H3BO3, 0.031 g. The growth of SBR5 was carried with addition of 100 mM KCl or NaCl or addition of 2 g L−1 of ethanol or methanol to the medium. Moreover, growth in PbMM with addition of 3 different carbon sources of was compared: 20 mM of glucose, 40 mM of glycerol or 10 mM of sucrose. Finally, the cells were cultivated in 3 different pHs: 5, 7 or 8, buffered with 50 mM of 2-(N-morpholino)ethanesulfonic acid (MES), 3-morpholinopropane-1-sulfonic acid (MOPS) and 3-[[1,3-dihydroxy-2-(hydroxymethyl)propan-2-yl]amino]propane-1-sulfonic acid (TAPS), respectively.

The bacterial cells were harvested in the middle of the exponential phase (Additional file 2: Table S2) and the harvesting procedure was done according to Irla et al. 2015 [16].

For the cultivation of the P. riograndensis transformants harboring plasmid DNA with gfpUV reporter gene under control of the pyruvate kinase promoter (Ppyk) with either native 5′ UTR (pP2pyk-gfpUV) or 5' untranslated regions (5′ UTR) of the gene P.riograndensis_final_150 (pP2pyk_TPP-gfpUV), the cells were routinely grown at 30 °C, shaking at 120 rpm, in medium DSMZ 220 [22] with addition of 5.5 μg mL−1 of chloramphenicol. Escherichia coli strains were routinely cultivated at 37 °C in LB supplied with 15 μg mL−1 of chloramphenicol when needed. To assay the effect of thiamine on gfpUV expression by the 2 P. riograndensis strains, bacterial cells were transferred from DSMZ 220 medium to glucose minimal medium PbMM (see above) with 0, 5, 10, 15, 20 or 25 μM of thiamine for SBR5(pP2pyk_TPP-gfpUV) and 0 or 25 μM of thiamine for SBR5(pP2pyk-gfpUV). After overnight growth, cells in minimal medium were used to inoculate fresh PbMM medium containing its respective thiamine concentration.

RNA isolation and preparation of cDNA libraries for sequencing

In order to isolate total RNA from SBR5 cells, bacterial cell pellets previously harvested and kept at −80 °C were thawed in ice and RNA was extracted individually for each cultivation condition using NucleoSpin RNA isolation kit (Macherey-Nagel, Düren, Germany). Polymerase Chain Reactions (PCRs) utilizing Taq polymerase (New England Biolabs) and 2 pairs of primers amplifying 2 different genome regions was perform to detect the presence of remaining genomic DNA in the samples. Primer characteristics and sequences are listed in Additional file 1: Table S1 and the reactions were carried according to the Taq polymerase manufacture’s recommendations. RNA samples with genomic DNA contamination were treated with the RNase-free DNase set (Qiagen, Hilden, Germany). The concentration of isolated RNA was determined by DropSense™ 16 (Trinean, Ghent, Belgium; software version To verify the quality of RNA samples, we performed capillary gel electrophoresis (Agilent Bioanalyzer 2100 system using the Agilent RNA 6000 Pico kit; Agilent Technologies, Böblingen, Germany). All procedures to obtain high quality RNA were done according to manufacturer’s recommendations. The extracted RNA samples were pooled in equal parts and the pool of total RNA was subsequently used for the preparation of 2 different cDNA libraries.

The cDNA libraries of SBR5 were prepared according to 2 different protocols. One library followed the protocol for the enrichment of 5′-ends of primary transcripts, while the other method allowed the analysis of the whole transcriptome [15, 16]. The libraries were prepared and sequenced according to Irla et al. 2015 [16]. The generated whole transcriptome and 5′-end enriched cDNA libraries were sequenced on a single flow cell of a MiSeq Desktop Sequencer system.

Mapping sequenced reads onto the genome of P. riograndensis SBR5

Before mapping to the reference genome, the reads obtained during sequencing of the whole transcriptome and 5′-end enriched library were trimmed to a minimal length of 20 base pairs with the Trimmotatic ver. 0.33 [23], with three first base pairs cut off at the start and bad quality bases at the end of the reads. The reads of 5′-end enriched library were trimmed in the single end mode, whereas those of whole transcriptome library in paired end mode. Trimmed reads were mapped to the reference genome of P. riograndensis SBR5 (accession number LN831776.1) by using the software for short read alignment Bowtie [24].

Determination of transcription start sites (TSS) based on 5′-end enriched library

To determine and classify the TSS based on mapped 5′-end enriched RNA-seq data, we used the software for visualization of mapped sequences ReadXplorer [25]. This determination was done in 2 steps, automatic TSS determination and manual data set curing. First, the TSS were automatically detected by ReadXplorer Transcription Analysis Parameter Wizard, following 2 different selected sets of criteria described in Table 1. In the generated data, to each TSS detected, several characteristics were reported; including: 70 base pairs sequence upstream the TSS, the assigned gene name and product, the DNA strand to which the assigned gene belongs, the assigned gene start and end position, the distance between the given TSS and its assigned translation start sites (TLS) and its classification regarding a TSS assigned to tRNA, mRNA or a novel transcript. As second step, the data generated through the 2 parameter sets were combined and manually cross-checked to classify the novel transcripts as antisense, intergenic or intragenic, and also to eliminate false positives, as previously described by Irla et al. 2015 [16].

Table 1 Parameter sets selected for transcription analysis of P. riograndensis SBR5

Determination of 5′ UTR length and identification of cis-regulatory elements in 5′ UTRs of P. riograndensis SBR5 genes

A genome-wide analysis was performed in order to identify putative RNA motifs in the genome of SBR5. To this end, we used the Infernal tool [26]. The RNAs were annotated to the genome of SBR5 in conjunction with the Rfam database [27]. Furthermore, based on the difference between the position of the analyzed TSS and its assigned TLS, we could determine the 5′ UTR length of each TSS belonging to an annotated gene. The 5′ UTRs which were longer than 100 base pairs were used as candidates to evaluate whether they contain cis-regulatory elements. In total, 209 5′ UTRs were analyzed by comparison to Rfam database [28]. Because thiamine is involved in the interaction of plants with plant growth promoting rhizobacteria [29], a TPP riboswitch was selected among the detected riboswitches for further analysis; a 313 base pairs sequence of the TPP riboswitch present in the 5′ UTR of the thiC gene was analyzed in the ARNold tool for identification of transcriptional terminators [30] and in the RNAfold tool for determination of its secondary structure [31].

Detection of conserved ribosomal biding site (RBS) and promoter motifs sequences

To identify the conserved promoter motifs, 70 base pairs sequences upstream the TSS assigned to annotated genes were analyzed. All the genes with identified TSS were considered in the analysis of TLS and RBS motifs, for this analysis 50 base pairs upstream of TLS were considered. The Improbizer [32] program was used to find the motifs and the tool WebLogo [33] was used to generate the visualization charts. In both programs, the default settings were applied for the analysis. In the text representations, the conserved motifs are represented in upper or lower case depending on its conservation, as follows: nucleotides in upper case letters represent more than 80% of occurrence among all analyzed sequences, nucleotides in lower case letters represent occurrence of more than 40%, but less than 80% of all cases. If a base occurs less often than 40%, the letter “n” in lower case appears.

Determination of most abundant genes transcribed in P. riograndensis SBR5

In order to determine the most abundant genes transcribed in the applied cultivation conditions in SBR5, the whole transcriptome RNA-seq data set was used. The data was normalized by calculation of Reads Per Kilobase per Million mapped reads (RPKM) [34]. The calculation of abundances was automatically generated by the ReadXplorer software [25] as described in Irla et al. 2015 [16]. When the transcripts of proteins of unknown function were automatically defined as the most abundant, the gene sequences were submitted to BLASTx analysis to identify the family to which the protein in question belongs [35].

Identification of operon structures in P. riograndensis SBR5

The operon structures present in this transcript analysis were automatically detected in the ReadXplorer software [25]. The same approach was previously shown in Irla et al. 2015 [16]. The classical operon has multiple genes transcribed as a single mRNA molecule having a single promoter to drive its expression, but transcription start sites internal to the operon sequence pointed to the presence of suboperons which often respond to different conditions [15, 16]. Based on the whole transcriptome RNA-seq data set, an operon structure was identified if the intergenic space of 2 genes positioned in same orientation linked those genes by a bridge of at least 2 paired mappings. Among the detected operon structures, the operons and suboperons were classified separately: a primary operon was considered when a TSS was assigned to the first gene of the operon; and a suboperon was detected when a TSS was assigned within primary operons. Furthermore, the automatically generated operon set was manually cross-checked with the complete whole transcriptome RNA-seq data. Finally, the difference between the position in the genome of the first nucleotide and the last nucleotide of the suboperons/operons was calculated to determine the approximated suboperons/operons length distribution. This calculation does not take the lengths of 5′ UTRs and 3′ UTRs into account.

Strains, plasmid construction and primers

P. riograndensis SBR5 was used as host for heterologous expression of gfpUV. Information about the plasmids constructed in this work and primer sequences is available in Additional file 1: Table S1. Molecular cloning was performed as described by Sambrook (2001) [36]. Chemically competent cells of E. coli DH5α were prepared for cloning [37]. Genomic DNA of P. riograndensis SBR5 was isolated as described by Eikmanns et al. (1994) [38]. The NucleoSpin® Gel and PCR Clean-up kit (Machery-Nagel, Düren, Germany) was used for PCR clean-up and plasmids were isolated using the GeneJET Plasmid Miniprep Kit (Thermo Fisher Scientific, Waltham, USA). Plasmid pNW33Nkan backbone was cut with restriction enzyme BamHI (Thermo Fisher Scientific, Waltham, USA) and inserts were amplified using Allin HiFi DNA polymerase (HighQu, Kraichtal, Germany) and the overlapping regions joined by Gibson assembly [39]. Taq polymerase (New England Biolabs) was used as mentioned above for colony PCR and primer characteristics and sequences are isted in Additional file 1: Table S1. The correctness of inserted DNA sequences was confirmed by sequencing. The constructed plasmids were named pP2pyk-gfpUV or pP2pyk_TPP-gfpUV and transformed to P. riograndensis SBR5 via magnesium-aminoclay method as described by Brito et al. (2016) [22].

Fluorescence-activated cell scanning analysis

To quantify the fluorescence intensities, SBR5 cells were analyzed by using flow cytometry. Routinely, the SBR5 transformants were grown until reaching the middle of the exponential growth phase and centrifuged for 15 min at 4000 rpm. The pellets were washed 3 times in NaCl 0.9% solution and the OD600nm was adjusted to 0.3. The fluorescence of the cell suspension was measured by using flow cytometer (Beckman Coulter, Brea, US) and the data analyzed in the Beckman Coulter Kaluza Flow Analysis Software. The settings for the emission signal and filters within the flow cytometer for detection of GfpUV were 550 short pass and 525 band pass in FL9 filter. In order to compare the obtained values of median fluorescence intensity (MFI), the results were tested for significance using one-way ANOVA followed by post hoc comparisons using the Tukey’s honest significant difference (HSD) test. The level of significance of the differences observed in each strain between the control (0 μM of thiamine) and test conditions (5, 10, 15, 20 and 25 μM of thiamine) was expressed as one star for *p ≤ 0.05. Nonsignificant differences, when p > 0.05, were not pointed.


Cultivation of P. riograndensis SBR5 under various growth conditions

Apart from a core subset of constitutively expressed genes, most genes are transcribed only under certain conditions. In order to obtain a broad representation of the whole transcriptome, we performed several shaking flasks cultivations of SBR5 under different conditions for subsequent RNA extraction. In standard conditions, such as growth in PbMM medium with pH 7 and growth in LB medium at 30 °C, the biomass (ΔOD) was approximately 1.35 and the growth rate (μ) approximately 0.5 h-1 (Additional file 2: Table S2). Compared to standard conditions, growth of SBR5 under stress conditions was in general slower (Additional file 2: Table S2). Hyperosmotic stress, low or high pH and low temperature (20 °C) affected growth of SBR5 to the largest extent (Additional file 2: Table S2). Under all conditions, exponentially growing cells were harvested for RNA isolation.

RNA-seq experiment of P. riograndensis SBR5

After confirmation of RNA integrity and absence of DNA contamination, the prepared RNA samples were pooled. The total number of reads generated from whole transcriptome and 5′-end enriched libraries were 11.57 million and 1.40 million, respectively (Table 2). Trimming of the reads with a length threshold of 20 base pairs resulted in 5.87 million (51% of the total reads) remaining reads for the whole transcriptome library and 827,376 (59% of total reads) for the 5′-end enriched library (Table 2). The trimmed reads were mapped to the genome of P. riograndensis SBR5, and 1.22 million whole transcriptome library reads and 345,313 reads of the 5′-end enriched library were uniquely aligned to the genome of SBR5 while 122,980 and 31,899 reads were aligned to multiple genome regions, respectively (Table 2).

Table 2 Sequencing and mapping features of cDNA libraries of P. riograndensis SBR5

Identification of transcription start sites (TSS) based on the mapped 5′-end enriched data

In order to detect putative TSS in the mapped 5′-end enriched data; 2 TSS analysis parameter sets were chosen (Table 1). The use of the parameter set 1 led to the automatic detection of 849 TSS and 1951 TSS were detected by using parameter set 2 (Table 1). Subsequently, these results were merged. Figure 1 shows the scheme of the manual review of the automatically detected TSS which led to the identification of 86 TSS belonging to rRNA or tRNA genes. Moreover, 363 elements were considered not to be TSS or to be false positives. TSS were considered false-positives if no clear accumulation of read starts was observed at the particular genomic position and additionally the putative TSS was detected within an uneven gradient of accumulated read starts [16]. The 2351 remaining TSS were classified as either belonging to 5′ UTRs of annotated genes or of novel transcripts. Out of the 6705 genes annotated in the genome of SBR5 [12], 1173 were found to possess TSS. The detected TSS were classified as single when only one TSS was present upstream a gene (1102) or multiple when more than one TSS were present upstream a gene (166). The remaining 1082 TSS were classified as belonging to novel transcripts, divided into the groups of antisense when the transcript was located in the antisense orientation to an annotated gene (170), intergenic when the transcript was located between annotated genes (77) or intragenic when a TSS was located within annotated genes in sense orientation (835) (Fig. 1).

Fig. 1
figure 1

Classification of TSS identified with RNA-seq. Schematic view of the TSS analysis flow: TSS automatic identification by ReadXplorer [25], filtering of false positives and rRNA/tRNA, manual verification and classification of TSS between TSS belonging to 5′ UTR of annotated genes or to novel transcripts

Distribution of 5′ UTR length in P. riograndensis SBR5

The sequences located between TSS and the gene start codons were used for the analysis of 5′ UTR lengths. For this purpose, only the 5′ UTRs assigned to annotated genes were considered. The length of 5′ UTRs in P. riograndensis varied from 0 to 799 base pairs. Only 2 of the genes with annotated TSS were considered leaderless (no 5′ UTR present): P.riograndensis_final_2873 coding for stress-induced protein and P.riograndensis_final_5691 coding for hypothetical protein (Additional file 3: Table S3). Moreover, 10 of the analyzed 5′ UTRs were found to be shorter than 10 base pairs (Additional file 3: Table S3). Figure 2 shows the distribution of the 5′ UTR lengths indicating that the majority of 5′ UTRs are 25 to 50 base pairs long. Among the 1269 analyzed 5′ UTRs, 209 (16.4%) were longer than 100 base pairs (Fig. 2). Those 5′ UTRs were further used in a screen for cis-regulatory RNA elements.

Fig. 2
figure 2

Distribution of 5′ UTR lengths of mRNAs assigned to genes in P. riograndensis SBR5. The 5′ UTR length was the distance between the identified TSS and its assigned TLS. The lengths of the 1269 5′ UTRs of annotated genes were grouped in a crescent interval of 5 base pairs or longer than 500 base pairs

Identification of consensus promoter motif sequences in P. riograndensis SBR5

The 1269 TSS identified as belonging to annotated genes were used in a search for the conserved promoter motifs (Fig. 1). The software Improbizer was applied to predict the motifs in a DNA region 70 base pairs upstream of each of those TSS [32]. Conserved −35 and −10 promoter sequence motifs were found in 1220 (96.1%) and 1217 (95.9%) of the analyzed sequences, respectively (Fig. 3A). Figure 3A shows the −10 and −35 motif sequence logos, which were ttgaca for −35 hexamer motif and TAtaaT for the −10 hexamer motif. The mean spacer lengths between the −35 and −10 motifs and −10 motifs and TSS were 17.6 base pairs and 4.1 base pairs, respectively (Fig. 3A).

Fig. 3
figure 3

Analysis of promoter, ribosome binding site and translation start site motives in P. riograndensis SBR5. The nucleotide distribution in the promoter motifs (a), ribosome binding sites and translation start sites (b) of P. riograndensis SBR5 were determined by using the Improbizer tool [32]. WebLogo tool [33] was used to determine the conservation of the nucleotides which was measured in bits and represented in the plot by the size of the nucleotide

Identification of RBS (ribosome binding site) and TLS (translation start site) consensus sequences in P. riograndensis SBR5

Similarly to the analysis of the promoter motifs, the Improbizer software [32] was used to determine the consensus sequence of RBS and TLS in the sequence 50 base pairs upstream of the translation start codon of genes associated to the 1269 previously identified TSS (Fig. 1). Some genes were characterized as associated to multiple TSS (Fig. 1), therefore the upstream sequence of these genes was only included once in the analysis. Hence, the 1173 remaining sequences were extracted from the genome of SBR5 and submitted to Improbizer [32] and WebLogo [33] for the identification of the conserved motifs of RBS and TLS (Fig. 3B). RBS motifs were identified in 98% (1155) of analyzed sequences. The determined RBS motif aGGaGg of P. riograndensis SBR5 includes 3 conserved guanines in approximately 90% of the analyzed sequences (Fig. 3B). Translational start codons were identified in all the analyzed sequences (Fig. 3B). The TLS found in the analyzed sequences were ATG (924; 79%), GTG (138; 12%) and TTG (111; 9%). The lengths of the spacer sequence between RBS and TLS varies between 5 and 13 base pairs, with an average of 7.8 ± 2.0 base pairs (Fig. 3B).

Identification of cis-regulatory elements in 5′ UTRs of P. riograndensis SBR5 genes

In order to identify putative RNA motifs in the genome sequence of P. riograndensis SBR5, we used the Infernal tool [26] and the Rfam database, which contains hundreds of RNA families [27]. This approach revealed 327 RNA motifs that subsequently were manually cross checked. Matches to tRNAs, ribosomal RNAs and RNA motifs from Eukaryotes or different bacterial groups were not considered. As result, 98 RNA motifs among 31 Rfam families were identified (Additional file 4: Table S4).

In an alternative approach based on the RNA-seq data, we analyzed 209 5′ UTRs longer than 100 base pairs (Fig. 2) for the presence of cis-regulatory elements by comparison to the Rfam database. This analysis revealed the presence of 11 putative cis-regulatory elements grouped in 9 types of riboswitch families (Table 3). Thus, based on the RNA-seq data, the existence of 11 out of 98 putative 5′ UTR RNA motifs upstream of annotated genes was confirmed. A TPP (thiamine pyrophosphate) sensitive riboswitch was predicted to be present in the 5′ UTR of the gene P.riograndensis_final_150 (thiC) encoding phosphomethylpyrimidine synthase, which is putatively involved in thiamine biosynthesis, and in the 5′ UTR belonging to the operon P.riograndensis_final_504–502. Although P.riograndensis_final_503 gene is automatically annotated as a hypothetical protein, BLASTx analysis revealed that it belongs to the thiamine-biding protein superfamily. More vitamin and amino acid related riboswitches were found: a pantothenate related pam riboswitch in the 5′ UTR of putative pantothenate synthesis operon and a riboswitch recognizing S-adenosylmethionine (SAM) in the 5′ UTR of an operon encoding homoserine O-succinyltransferase and cystathionine gamma-lyase proteins. The T-box regulatory elements were found in 5′ UTR of the genes coding for D-3-phosphoglycerate dehydrogenase (serA) and valine tRNA ligase (valS). Furthermore, the protein dependent L20 leader and L21 leader riboswitches, the metabolite dependent ydaO-yuaA riboswitch, the pfl riboswitch and the glycine dependent riboswitch were identified in this work (Table 3).

Table 3 Riboswitches detected in the transcriptome of P. riograndensis SBR5 and their transcriptional organization

A TPP riboswitch influences gfpUV expression in P. riograndensis SBR5

Riboswitches are regulatory elements found in the 5′-UTR of genes and they perform the regulatory control over the gene transcript by directly binding a small ligand molecule. In the riboswitch sequence, an aptamer domain recognizes and binds to that ligand which leads to adopting a new conformation that interfaces with the gene transcriptional (presence of terminator sequence) or translational machinery (sequestration of the RBS by a stem) [40]. The prediction of the secondary structure of the TPP riboswitch in the 5′ UTR of thiC gene, with length of 313 base pairs, showed that it contains no terminator sequence. However, a 5′-GAUAA-3′ sequence and its complementary 5′-UUAUC-3′ is present in many of the predicted stems, including the stems of the aptamer region (Fig. 4A). This indicates the existence of anti-sequestering stems in this molecule, as showed schematically in Fig. 4A. Furthermore, we aimed to detect the influence of the P. riograndensis TPP riboswitch on gene expression in the presence of different concentrations of its ligand thiamine by measuring the GfpUV fluorescence. SBR5 cells were transformed with the plasmid pP2pyk_TPP-gfpUV which carries the constitutive promoter Ppyk with its native 5′ UTR replaced by the 5′ UTR of the P. riograndensis_final_150. The so changed Ppyk promoter drives the expression of the reporter gene gfpUV (Additional file 1: Table S1). As shown before, the 5′ UTR of the P. riograndensis_final_150 gene contains the sequence of a TPP riboswitch (Table 3). As control for this assay, the plasmid pP2pyk-gfpUV, containing Ppyk native 5′ UTR was used to transform SBR5 cells and the resultant strain was also cultivated in glucose PbMM, but supplied with 0 or 25 μM of thiamine. The MFI of the control strain SBR5(pP2pyk-gfpUV) remained the same when the cells were in absence or in presence of 25 μM of thiamine (Fig. 4B). The GfpUV MFI of SBR5(pP2pyk_TPP-gfpUV) was similar to the control strain when no thiamine was added to the growth medium (Fig. 4B). In contrast, there was a significant effect of thiamine on MFI of SBR5(pP2pyk_TPP-gfpUV) at the p < 0.05 level for the tested conditions [F (5, 12) = 17.8, p = 0.00004]: post hoc comparisons using the Tukey’s HSD test indicated that the mean score for the 0 μM thiamine was significantly different than the 5, 10, 15, 20 and 25 μM thiamine conditions. However, the MFI of SBR5(pP2pyk_TPP-gfpUV) in 5, 10, 15, 20 and 25 μM thiamine conditions did not significantly differ from one another (Fig. 4B).

Fig. 4
figure 4

TPP riboswitch influence on the reporter gfpUV gene expression of P. riograndensis SBR5. a. Schematic representation of the TPP riboswitch and TPP aptamer sequence predicted using RNAfold tool [31]; regions of riboswitch scheme in red represents possible anti-sequestering stems present in the riboswitch sequence; regions of aptamer sequence in bold are identical to the TPP riboswitch consensus sequence of B. subtilis b. GfpUV median fluorescence intensity (MFI) in SBR5 under 6 gradually increasing concentrations of thiamine; gfpUV expression was driven either by the pyk promoter with 5′ UTR exchanged by the thiC gene 5′ UTR or pyk promoter carrying native 5′ UTR. Means and standard deviation of biological triplicates were measured by flow cytometry of 20,000 cells. Under one-way between subjects ANOVA followed by post hoc comparisons using the Tukey’s HSD test, the level of significance of the differences observed in each strain between the control (0 μM of thiamine) and test conditions (5, 10, 15, 20 and 25 μM of thiamine) is represented as one star (*p ≤ 0.05). Nonsignificant differences, when p > 0.05, are not pointed

Identification and characterization of novel transcripts

Here, we performed the characterization of P. riograndensis novel transcripts based on the 5′-end enriched data set. Among the 2351 manually verified TSS, 1082 were classified as belonging to novel transcripts. Depending on their position in genes or untranslated regions, these TSS belonged to antisense transcripts (170), transcripts intragenic (835) to annotated genes or their 5′/3′ UTRs, or intergenic (77) transcripts (Fig. 1). Additional file 5: Table S5 shows the intragenic transcripts which were organized according to their position and associated gene. As intergenic novel transcripts could not be assigned to annotated genes, they were manually annotated as unknown transcripts (Additional file 6: Table S6). BLAST analysis of the intergenic novel transcripts resulted in discovery of 34 small proteins and 27 small RNAs. Small RNAs were analyzed in the Rfam database and 3 of them were annotated as Small SRP (P.riograndensis_final_s0002), BsrC sRNA (P.riograndensis_final_s0008) and RNase P (P.riograndensis_final_s0013) (Table 4). All others were assigned an unknown function.

Table 4 Novel transcripts with known function in P. riograndensis SBR5

Gene expression ranked according to transcript abundances

The abundance of transcripts in the analyzed RNA samples was quantified on the basis of the whole transcriptome dataset using RPKM values. Transcripts were detected in 6367 of the coding sequences during the analysis, corresponding to 94% of the total number of genes annotated in the genome of P. riograndensis. Transcript abundance varied over 6 orders of magnitude with RPKM values ranging from 0.11 to 71,849.57 and was categorized arbitrarily as follows. Transcript abundance was considered low for approximately 70% of transcripts (with RPKM values <100), intermediate (RPKM between 100 and 1000) for around 25% of the detected transcripts and high for approximately 5% of the transcripts (RPKM between 1000 and 10,000). Twenty one transcripts showed RPKM values exceeding 10,000 and these were considered as transcripts with very high transcript abundance and are listed in Table 5.

Table 5 Most abundant transcripts of P. riograndensis SBR5 under the chosen cultivation conditions

BLASTx analysis was performed to predict the functions (conserved protein domains) of the 14 genes which were automatically annotated as hypothetical proteins or as proteins with unknown function. However, a function could not be predicted for 5 genes with very highly abundant transcripts (Table 5). Part of the very highly abundant transcripts code for ribosomal proteins (6 genes). Remarkably, 3 genes related to bacterial sporulation had very highly abundant transcripts (Table 5).

Identification of operon structures in P. riograndensis SBR5

Based on the mapped reads generated from whole transcriptome library, we assigned genes either to monocistronic transcripts, primary operons or suboperons. Genes were assigned to suboperons when a TSS was detected internal to the operon sequence. Genes with annotated TSS that were not automatically detected as primary operons were classified as monocistronic transcripts. In total, 919 monocistronic transcripts were detected, and 1776 genes were assigned to 622 operons and 248 suboperons (Fig. 5B). The length distribution of the operons and suboperons was estimated and shown to peak between 1000 and 3000 base pairs for operons, while the majority of the suboperons were shorter than 2000 base pairs (Fig. 5A). In general, the number of operons decreases with the increasing number of genes in those operons and most operon structures (71%) are composed of 2 genes while only 5 operons contained more than 7 genes (Fig. 5B). Two operon structures which are putatively involved in Fe3+ siderophore uptake and transport could be detected: fhuB - P. riograndensis_final_3660 (Fe3+ hydroxamate import system permease and component of an ABC type Fe3+ siderophore transport system, respectively) and P. riograndensis_final_5688 - P. riograndensis_final_5687 which encodes a Fe3+ siderophore ABC transporter permease (Additional file 3: Table S3). However, known operons comprising nitrogen fixation genes [10] were not detected in the present study. Notably, riboswitches were found in the 5′ UTRs of 6 operons P.riograndensis_final_502–504, infC-P.riograndensis_final_1528–1529, metA-P.riograndensis_final_2059, panB-panC-P.riograndensis_final_4379, rplU-P.riograndensis_final_5299–5300 and P.riograndensis_final_6104-gcvPA-gcvPB (Table 3).

Fig. 5
figure 5

Operon analysis in P. riograndensis SBR5. a. Length distribution (in base pairs) of detected operons and suboperons; b. Analysis of feature number in monocistronic transcripts, operons and suboperons in P. riograndensis SBR5


In the present study, we performed for the first time a detailed transcriptome analysis of P. riograndensis SBR5. This work lays a foundation for understanding of gene expression in this bacterium and complements differential gene expression analysis. To enable the comprehensive characterization of ‘static’ bacterial transcriptomes it is necessary to generate a pool of different transcripts in order to obtain the expression of as many genes as possible [16]. This was largely achieved by cultivation of P. riograndensis SBR5 under 15 distinct conditions and pooling RNA samples prior to sequencing, since we found 94% genes expressed under these conditions. However, the absolute number of TSS (present in 5′ UTR of annotated genes together with TSS belonging to novel transcripts) in P. riograndensis SBR5 was comparable to that in B. methanolicus MGA3 [16] and C. glutamicum [15] (equaled 2350, 2167, and 2591, respectively) although the genome of P. riograndensis SBR5 is more than 2 fold larger than those of B. methanolicus MGA3 [41] and C. glutamicum [42]. This observation may also reflect the fact that RNA was pooled from cells cultivated under various growth and stress conditions that were chosen with a similar rationale for the 3 bacteria and may indicate that a similar set of genes is transcribed under the chosen conditions.

In this study, the 5′ UTR length of P. riograndensis SBR5 transcripts was shown to be equal or longer than 10 nucleotides in 99.2% of the cases and it peaked at around 30 base pairs (Fig. 2). A similar 5′ UTR length distribution can be found in Actinoplanes sp., C. glutamicum and B. methanolicus [15, 16, 43]. However, leaderless transcripts are rare in P. riograndensis SBR5 as the present study only revealed 2 transcripts (P.riograndensis_final_2873 and P.riograndensis_final_5691) to be leaderless in this firmicute (Additional file 3: Table S3). In silico analysis performed by Zheng et al. (2011) [44] showed that 207 among 953 analyzed bacterial genomes possess leaderless genes including species of the Firmicutes and Actinobacteria phyla. The scarcity of leaderless transcripts in the transcriptomes of the low-GC Gram-positives B. methanolicus [16] and P. riograndensis contrasts with the large proportion of leaderless transcripts present in the high-GC Gram-positive actinobacteria Actinoplanes sp. (20%) [43] and C. glutamicum (33%) [15].

The analysis of the promoter, RBS and TLS motives in P. riograndensis SBR5 transcriptome revealed that the RBS consensus sequence aGGaGg, TLS frequency (ATG represented 79% of the analyzed sequences and GTG and TTG represented 12% and 9%, respectively) and spacing between RBS and TLS (7.8 base pairs) in P. riograndensis SBR5 corresponds well to the conserved sequences motifs historically found in bacteria [45,46,47]. While the −10 region (TAtaaT), spacing between −10 and −35 boxes (17.6 base pairs) and between −10 box and TSS (4.1 base pairs) are conserved between P. riograndensis SBR5, E. coli, B. subtilis and B. methanolicus [16, 48, 49], the conserved −35 region (ttgaca) in SBR5 was similar only to the −35 box described for other bacilli [16, 50].

Riboswitch-mediated control of expression of a variety of genes in bacteria could have practical implications, such as development of new antibacterial drugs [21], or more generally contribute to improvement of the understanding of bacterial metabolism. Here, a genome-based riboswitch analysis revealed 98 putative RNA motifs, 11 of which were also detected in the sequenced RNAs (Additional file 4: Table S4). In Firmicutes, the SAM riboswitch is part of the S-box group of riboswitches which are involved in regulation of SAM, cysteine and methionine biosynthesis, and sulfur metabolism [51, 52]. This type of riboswitch has been well characterized in bacilli, for example in B. subtilis, which has at least 11 operons and 26 genes under control of S-box RNA [53]. S-box RNA from B. subtilis directly senses the level of SAM and functions as SAM dependent riboswitch [54]. However, the most frequent mechanism of riboswitch regulation of amino acid operons in the Firmicutes is the T-box regulatory system [55, 56]. In B. subtilis and other Firmicutes, the T-box can regulate many genes encoding amino acid biosynthetic enzymes and transporters [57]. The ydaO-yuaA riboswitches occur upstream of these 2 genes in B. subtilis and operate as a genetic “off” switch [58]. Furthermore, recognition of the cyclic di-AMP by ydaO-yuaA was characterized and also shown to exist in B. subtilis [59, 60]. The riboswitches in P. riograndensis SBR5 identified in the present study need to be investigated to some detail to unravel their regulatory function.

In order to analyze the function of 1 exemplary riboswitch found in the transcriptome of P. riograndensis SBR5, we selected the TPP dependent riboswitch present upstream of the thiC gene (Table 3) to control the expression of gfpUV. The gene thiC encodes a phosphomethylpyrimidine synthase involved in TPP biosynthesis [61]. The E. coli thiC riboswitch controls translation initiation and, in the presence of TPP thiC product is not translated [62, 63]. The thiamine analog triazolethiamine showed a concentration dependent reporter gene repression by the TPP riboswitch in the 5′ UTR of the thiamine kinase thiK [62]. In the present study, thiamine was added to the growth medium and already 5 μM thiamine fully reduced gfpUV expression (Fig. 4B). P. riograndensis SBR5 is a thiamine prototroph capable of growing in minimal medium without added thiamine (data not shown). The gfpUV expression without additional thiamine remained high (Fig. 4B) suggesting that the amount of thiamine synthetized by SBR5 did not activate the TPP riboswitch. Riboswitch aptamers remain highly conserved through evolution because each one must preserve selective binding of its target metabolite. Hence, the conserved TPP riboswitch consensus regions of B. subtilis were also present in the TPP riboswitch aptamer sequence targeted in this study (Fig. 4A) [21]. The secondary structure analysis showed that, in contrast to the tenA TPP riboswitch in B. subtilis [21], the SBR5 thiC TPP riboswitch does not possess a transcriptional terminator sequence. This led us to investigate the presence of sequestering/anti-sequestering stems that could participate in the “on/off” state of the P. riograndensis SBR5 thiC TPP riboswitch. The schematic representation of the thiC TPP riboswitch secondary structure shows that the RBS is sequestered within a stem-loop structure predicted to inhibit translation initiation (Fig. 4A). We could detect the sequence 5′-GATAA-3′ and its complementary region 5′-UUAUC-3′ inserted in the TPP aptamer sequence, in the stem-loop containing the thiC RBS and also in the sequence of 1 stem-loop located before RBS containing stem. The locations of these sequences inside the TPP riboswitch structure are decipted in red in the schematic representation of Fig. 4A. Comparable secondary structures and gene expression control have very recently been described for the E. coli thiC TPP riboswitch [63]. P. riograndensis possesses 3 further putative TPP riboswitches, one of which was expressed under the growth conditions of the RNA-seq analysis presented here (Table 3; Additional file 4: Table S4). Although they share conserved sequences and secondary structure predictions (data not shown) it remains to be studied if these putative TPP riboswitches are indeed responsive to TPP and if they operate as transcriptional or translational riboswitches.

Landscape transcriptome analyses are suitable to identify antisense, intragenic or intergenic novel transcripts [15, 16]. Sixteen percent of the 1082 novel transcripts identified in the transcriptome of P. riograndensis SBR5 were shown to be antisense (Fig. 1). The number of reported antisense RNAs varies between bacteria and the biological advantages of such overlapping transcription remains unclear, but antisense RNAs may play important roles in regulation e.g. by transcription interference [64]. Commensurate with this notion, we could identify that transcription of 3 antisense RNAs initiates in the 5′ UTRs of the genes on the complementary strand and, thus, antisense transcription may interfere or attenuate with their transcription (P.riograndensis_final_5580, P.riograndensis_final_6016 and P.riograndensis_final_6182; Additional file 5: Table S5). Moreover, 77 novel transcripts were identified in the intergenic regions between previously known genes, part of the identified intergenic transcripts represent small RNA and small protein genes (Additional file 7: Table S7). Of these, 34 were predicted to encode small proteins, for example, small signal recognition particle (or small SRP; P.riograndensis_final_s0002), which is known to be involved in protein targeting in other bacteria [65]. Twenty seven small RNA genes were found, e.g. the small RNA bsrC (P.riograndensis_final_s0008), which is present also in B. subtilis [66] and RNase P (P.riograndensis_final_s0013), the ubiquitous endonuclease that catalyzes the maturation of the 5′ end of the tRNAs [48]. Overall, the function of the novel intergenic transcripts representing small protein and small RNA genes still has to be elucidated.

In the transcriptome of P. riograndensis SBR5, the abundantly transcribed genes could be grouped by their presumed functions: ribosomal proteins, sporulation related proteins, proteins related to carbon metabolism and others. Many abundantly transcribed genes encode proteins of unknown function (Table 5). Among the highly expressed genes, we have detected 1 gene coding for a subunit of a maltose phosphotransferase system (P.riograndensis_final_1999, Table 5). Maltose is a disaccharide formed from two units of glucose and this organic compound can be identified in root exudates of different plant species [65]. Although maltose was not utilized as carbon source in the present study, the transcription of this carbohydrate phosphotransferase system gene may be due to the fact that we mostly used glucose as carbon source. The gene encoding a glucose specific phosphotransferase system (P.riograndensis_final_1998) had RPKM value of 156.17 (data accession GSE98766) which places it in the group of intermediately abundant transcripts. Thus, the affinity of the highly transcribed maltose transporter to glucose still remains to be studied. A transcriptome analysis of carbon source utilization (β-glucan, starch, cellobiose, maltose, glucose, xylose and arabinose) by Paenibacillus sp. JDR-2 revealed a regulatory connection for the utilization of the polysaccharides β-glucan, starch and xylans, while transcription of genes coding for proteins involved in monosaccharide (e.g. arabinose and glucose) utilization was less apparent [13]. BLASTx analysis revealed 3 sporulation-related genes (P.riograndensis_final_4321, P.riograndensis_final_2316 and P.riograndensis_final_5601) among the most abundantly expressed genes (Table 5). This result might be due to the fact that different stress conditions were applied during cultivations of SBR5, which included 5 min of cold and heat shock but also exposure to salinity, solvent, low temperature and low pH along the bacterial growth. The exposure to stress conditions affected growth rates in comparison to the optimal growth conditions, and might have also induced expression of sporulation related genes (Additional file 3: Table S2). Very recently, sporulation genes spoVT and spoIIIAH were shown to be transcribed by P. riograndensis SBR5 under iron-limiting conditions [67]. Moreover, the related P. polymyxa SC2 expressed sporulation genes (spo0A, spoIIE, spoIIAA, spoIIAB, sigE and sigF) when cultivated under sporulation conditions: on LB agar for 24 h at 37 °C, when most of SC2 step into the progress of sporulation [68].

The transcriptional organization of 1776 genes of P. riograndensis SBR5 in 622 operons including 248 suboperons and 919 monocistronically transcribed genes (Fig. 5B) was comparable to that found in B. methanolicus, in which 1164 genes were assigned to 381 operons and 94 suboperons, and also ~ 900 monocistronic transcripts were detected [16]. Similarly, in B. subitilis 736 regulated operons were found [69] and 1013 genes were organized in 616 operons (including 565 suboperons) for the actinobacterium C. glutamicum [15]. Most operons detected here were composed of only 2 genes and were between 1000 and 3000 base pairs in length (Fig. 5). Accordingly, most suboperons comprised only 1 gene and were smaller than 2000 base pairs (Fig. 5). The length distribution of suboperons/operons and the number of genes constituting these are commensurate with the average length of the genes in P. riograndensis SBR5 genome of 1008 base pairs (data not shown).

Only two differential gene expression analyses on nitrogen fixation [10] and iron metabolism [67] of P. riograndensis SBR5 have been published. The nitrogen fixation genes present in 3 genome clusters were characterized by transcript analysis by quantitative real time-qPCR and shown to be transcribed in the operons nifB1H1D1K1E1N1X1-orf1-hesA-V, nifE2N2X2 and anfHDGK [10]. In the present study, these operons that are generally transcribed under poor nitrogen supply conditions [14, 70, 71] were not found to be expressed under the chosen growth conditions since all growth conditions were characterized by sufficient nitrogen concentrations (in LB medium or minimal media with 16 mM ammonium sulfate). By contrast, a different gene related to nitrogen fixation encoding putative nitrogenase (flavodoxin; P.riograndensis_final_4327) was found to be expressed. In the second study, 150 genes of P. riograndensis SBR5 were shown to be differentially expressed under iron-replete in comparison to iron-limiting conditions [67]. Surprisingly, a high expression level of the Fe3+ siderophore transporter gene fecE was observed suggesting that P. riograndensis SBR5 can uptake Fe3+ siderophore from the environment although it is not able to produce those siderophores itself [67]. Here, we could identify 2 operon structures putatively involved in Fe3+ siderophore uptake and transport: the operon fhuB - P. riograndensis_final_3660 which encodes a putative Fe3+ hydroxamate import system permease and a component of an ABC type Fe3+ siderophore transport system, respectively, and the operon P. riograndensis_final_5688 - P. riograndensis_final_5687 which comprises a gene encoding the Fe3+ siderophore ABC transporter permease (Additional file 3: Table S3).


The examination of the whole transcriptome of P. riograndensis, which was reclassified recently as P. sonchi [72], is a valuable contribution to the understanding of biology of this organism. Moreover, our data validated the uncovering of novel transcripts and the presence of hundreds of operons. Although our study has revealed a functional TPP riboswitch in gene regulation of SBR5, further effort is required to fully elucidate the function of this riboswitch. Finally, the data generated in this study should be valuable for future development of genetic tools for this poorly characterized species as much as for the genus Paenibacillus. As our RNA-seq analysis provides new insight into the P. riograndensis SBR5 transcriptome at the systems level, it will be a valuable basis for further differential RNA-seq analysis exploring agronomical/physiological aspects of this bacterium, e.g. phosphate solubilization.


5′ UTR:

5′ Untranslated region


Basic local alignment search tool


Coding DNA sequences


Honest significant difference


Lysogeny broth


2-(N-morpholino)ethanesulfonic acid


Median fluorescence intensity


3-Morpholinopropane-1-sulfonic acid


Optical density


Paenibacillus minimal medium


Plant growth promoting


Ribosome binding site


Reads per kilobase per million mapped reads




3-[[1,3-Dihydroxy-2-(hydroxymethyl)propan-2-yl]amino]propane-1-sulfonic acid


Trace element


Translation start sites


Thiamine pyrophosphate


Transcription start sites


  1. Govindasamy V, Senthilkumar M, Magheshwaran V, Kumar U, Bose P, Sharma V, et al. Bacillus and Paenibacillus spp.: potential PGPR for sustainable agriculture; In: "Plant Growth and Health Promoting Bacteria" D.K. Maheshwari (ed.), Microbiology Monographs 18, Springer-Verlag Berlin Heidelberg; 2010. doi:10.1007/978-3-642-13612-2_15.

  2. Andreeva I, Morozov I, Pechurkina N, Morozova O, Ryabchikova E, Saranina I, et al. Isolation of bacteria of the genus Paenibacillus from soil and springs of the valley of geysers (Kamchatka). Microbiology. 2010;79:705–13.

    Article  Google Scholar 

  3. Beneduzi A, Moreira F, Costa PB, Vargas LK, Lisboa BB, Favreto R, et al. Diversity and plant growth promoting evaluation abilities of bacteria isolated from sugarcane cultivated in the south of Brazil. Appl Soil Ecol. 2013;63:94–104.

    Article  Google Scholar 

  4. Genersch E, Forsgren E, Pentika J, Ashiralieva A, Rauch S, Kilwinski J, et al. Reclassification of Paenibacillus larvae subsp. pulvifaciens and Paenibacillus larvae subsp. larvae as Paenibacillus larvae without subspecies differentiation. Int J Syst Evol Microbiol. 2006;56:501–11.

  5. Drancourt M, Berger P, Raoult D. Systematic 16S rRNA gene sequencing of atypical clinical isolates identified 27 new bacterial species associated with humans. J Clin Microbiol. 2004;42:2197–202.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Ash C, Priest FG, Collins MD. Molecular identification of rRNA group 3 bacilli (ash, Farrow, Wallbanks and Collins) using a PCR probe test. Proposal for the creation of a new genus Paenibacillus. Antonie Leeuwenhoek. 1993;64:253–60.

    Article  CAS  PubMed  Google Scholar 

  7. Xie J, Du Z, Bai L, Tian C, Zhang Y, Xie J, et al. Comparative genomic analysis of N2-fixing and non-N2-fixing Paenibacillus spp.: organization, evolution and expression of the nitrogen fixation genes. PLoS Genet. 2014;10:1371.

    Article  Google Scholar 

  8. Beneduzi A, Peres D, Costa PB da, Zanettini MHB, Passaglia LMP. Genetic and phenotypic diversity of plant-growth-promoting bacilli isolated from wheat fields in southern Brazil. Res Microbiol 2008;159:244–250.

  9. Beneduzi A, Costa PB, Melo IS, Bodanese-Zanettini MH, Passaglia LMP. Paenibacillus riograndensis sp. nov., a nitrogen- fixing species isolated from the rhizosphere of Triticum aestivum. Int J Syst Evol Microbiol. 2010;60:128–33.

    Article  CAS  PubMed  Google Scholar 

  10. Fernandes GDC, Trarbach LJ, De Campos SB, Beneduzi A, Passaglia LMP. Alternative nitrogenase and pseudogenes: unique features of the Paenibacillus riograndensis nitrogen fixation system. Res Microbiol. 2014;165:571–80.

    Article  CAS  Google Scholar 

  11. Bach E, Dubal G, Carvalho G, De Brito B, Maria L, Passaglia P. Evaluation of biological control and rhizosphere competence of plant growth promoting bacteria, vol. 99. B.V: Elsevier; 2016. p. 141–9.

    Google Scholar 

  12. Brito LF, Bach E, Kalinowski J, Rückert C, Wibberg D, Passaglia LMP, et al. Complete genome sequence of Paenibacillus riograndensis SBR5, a gram-positive diazotrophic rhizobacterium. J Biotechnol. 2015;207:30–1.

    Article  CAS  PubMed  Google Scholar 

  13. Sawhney N, Crooks C, Chow V, Preston JF, John FJS. Genomic and transcriptomic analysis of carbohydrate utilization by Paenibacillus sp. JDR-2: systems for bioprocessing plant polysaccharides. BMC Genomics. 2016;17:131.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Shi H, Wang L, Li X, Liu X, Hao T, He X, et al. Genome-wide transcriptome profiling of nitrogen fixation in Paenibacillus sp. WLY78. BMC Microbiol. 2016;16:25.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Pfeifer-Sancar K, Mentz A, Rückert C, Kalinowski J. Comprehensive analysis of the Corynebacterium glutamicum transcriptome using an improved RNAseq technique. BMC Genomics. 2013;14:888.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Irla M, Neshat A, Brautaset T, Rückert C, Kalinowski J, Wendisch VF. Transcriptome analysis of thermophilic methylotrophic Bacillus methanolicus MGA3 using RNA-sequencing provides detailed insights into its previously uncharted transcriptional landscape. BMC Genomics. 2015;16:73.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Perkins TT, Kingsley RA, Fookes MC, Gardner PP, James KD, Yu L, et al. A strand-specific RNA-seq analysis of the transcriptome of the typhoid bacillus Salmonella typhi. PLoS Genet. 2009;5:e1000569.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Kim JF, Jeong H, Park S, Kim S, Park YK, Choi S, et al. Genome sequence of the polymyxin-producing plant-probiotic rhizobacterium Paenibacillus polymyxa E681. J Bacteriol. 2010;192:6103–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Mingchao M, Wang C, Ding Y, Li L, Shen D, Jiang X, et al. Complete genome sequence of Paenibacillus polymyxa SC2, a strain of plant growth-promoting rhizobacterium with broad-spectrum antimicrobial activity. J Bacteriol. 2011;193:311–2.

    Article  Google Scholar 

  20. Kwak Y, Shin J. Complete genome sequence of Paenibacillus beijingensis 7188T (=DSM 24997T), a novel rhizobacterium from jujube garden soil. J Biotechnol. 2015;206:75–6.

    Article  CAS  PubMed  Google Scholar 

  21. Sudarsan N, Cohen-Chalamish S, Nakamura S, Emilsson GM, Breaker RR. Thiamine pyrophosphate riboswitches are targets for the antimicrobial compound pyrithiamine. Chem Biol. 2005;12:1325–35.

    Article  CAS  PubMed  Google Scholar 

  22. Brito LF, Irla M, Walter T, Wendisch VF. Magnesium aminoclay-based transformation of Paenibacillus riograndensis and Paenibacillus polymyxa and development of tools for gene expression. Appl Microbiol Biotechnol. 2017;101:735–47.

    Article  CAS  PubMed  Google Scholar 

  23. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Hilker R, Stadermann KB, Doppmeier D, Kalinowski J, Stoye J, Straube J, et al. ReadXplorer—visualization and analysis of mapped sequences. Bioinformatics. 2014;30:2247–54.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Nawrocki EP, Eddy SR. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013;29:2933–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Gardner PP, Daub J, Tate JG, Nawrocki EP, Kolbe DL, Lindgreen S, et al. Rfam: updates to the RNA families database. Nucleic Acids Res. 2009;37:136–40.

    Article  Google Scholar 

  28. Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR. Rfam: an RNA family database. Nucleic Acids Res. 2003;31:439–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Palacios OA, Bashan Y, De-Bashan LE. Proven and potential involvement of vitamins in interactions of plants with plant growth-promoting bacteria-an overview. Biol Fertil Soils. 2014;50:415–32.

    Article  CAS  Google Scholar 

  30. Naville M, Ghuillot-Gaudeffroy A, Marchais A, Gautheret D. ARNold: a web tool for the prediction of rho-independent transcription terminators. RNA Biol. 2011;8:11–3.

    Article  CAS  PubMed  Google Scholar 

  31. Lorenz R, Bernhart SH, Höner Zu Siederdissen C, Tafer H, Flamm C, Stadler PF, et al. ViennaRNA Package 2.0. Algorithms Mol Biol. 2011;6:26.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Ao W, Gaudet J, Kent WJ, Muttumu S, Mango SE. Environmentally induced foregut remodeling by PHA-4/FoxA and DAF-12/NHR. Science. 2004;305:1743–6.

    Article  CAS  PubMed  Google Scholar 

  33. Crooks G, Hon G, Chandonia J, Brenner S. WebLogo: a sequence logo generator. Genome Res. 2004;14:1188–90.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5:621–8.

    Article  CAS  PubMed  Google Scholar 

  35. Altschup SF, Gish W, Miller W, Myers E, Lipman D. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.

    Article  Google Scholar 

  36. Sambrook J. Molecular cloning: a laboratory manual. New York: Cold Spring Harbor; 2001.

    Google Scholar 

  37. Hanahan D. Studies on transformation of Escherichia coli with plasmids. J Mol Biol. 1983;166:557–80.

    Article  CAS  PubMed  Google Scholar 

  38. Eikmanns BJ, Thum-Schmitz N, Eggeling L, Ludtke K, Sahm H. Nucleotide sequence, expression and transcriptional analysis of the Corynebacterium glutamicum gltA gene encoding citrate synthase. Microbiology. 1994;140:1817–28.

    Article  CAS  PubMed  Google Scholar 

  39. Gibson DG. Programming biological operating systems: genome design, assembly and activation. Nat Methods. 2014;11:521–6.

    Article  CAS  PubMed  Google Scholar 

  40. Blount KF, Breaker RR. Riboswitches as antibacterial drug targets. Nat Biotechnol. 2006;24:1558–64.

    Article  CAS  PubMed  Google Scholar 

  41. Irla M, Neshat A, Winkler A, Albersmeier A, Heggeset TMB, Brautaset T, et al. Complete genome sequence of Bacillus methanolicus MGA3, a thermotolerant amino acid producing methylotroph. J Biotechnol. 2014;188:110–1.

    Article  CAS  PubMed  Google Scholar 

  42. Kalinowski J, Bathe B, Bartels D, Bischoff N, Bott M, Burkovski A, et al. The complete Corynebacterium glutamicum ATCC 13032 genome sequence and its impact on the production of L-aspartate-derived amino acids and vitamins. J Biotechnol. 2003;104:5–25.

    Article  CAS  PubMed  Google Scholar 

  43. Schwientek P, Neshat A, Kalinowski J, Klein A, Rückert C, Schneiker-Bekel S, et al. Improving the genome annotation of the acarbose producer Actinoplanes sp. SE50/110 by sequencing enriched 5′-ends of primary transcripts. J Biotechnol. 2014;190:85–95.

    Article  CAS  PubMed  Google Scholar 

  44. Zheng X, Hu G, She Z, Zhu H. Leaderless genes in bacteria: clue to the evolution of translation initiation mechanisms in prokaryotes. BMC Genomics. 2011;12:361.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Shine J, Dalgarno L. The 3′-terminal sequence of Escherichia coli 16S ribosomal RNA: complementarity to nonsense triplets and ribosome binding sites. Proc Natl Acad Sci U S A. 1974;71:1342–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Vellanoweth RL, Rabinowitz JC. The influence of ribosome-binding-site elements on translational efficiency in Bacillus subtilis and Escherichia coli in vivo. Mol Microbiol. 1992;6:1105–14.

    Article  CAS  PubMed  Google Scholar 

  47. Villegas A, Kropinski AM. An analysis of initiation codon utilization in the domain bacteria - concerns about the quality of bacterial genome annotation. Microbiology. 2017;154:2559–61.

    Article  Google Scholar 

  48. Hawley DK, Mcclure WR. Compilation and analysis of Escherichia coli promoter DNA sequences. Nucleic Acids Res. 1983;11:2237–55.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Camacho A, Salas M. Effect of mutations in the “extended −10” motif of three Bacillus subtilis Sigma A-RNA polymerase-dependent promoters. J Mol Biol. 1999;286:683–93.

    Article  CAS  PubMed  Google Scholar 

  50. Helmann JD. Compilation and analysis of Bacillus subtilis Sigma A-dependent promoter sequences: evidence for extended contact between RNA polymerase and upstream promoter DNA. Nucleic Acids Res. 1995;23:2351–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Nudler E, Mironov AS. The riboswitch control of bacterial metabolism. Trends Biochem Sci. 2004;29:11–7.

    Article  CAS  PubMed  Google Scholar 

  52. Vitreschak AG, Rodionov DA, Mironov AA, Gelfand MS. Riboswitches: the oldest mechanism for the regulation of gene expression? Trends Genet. 2004;20:44–50.

    Article  CAS  PubMed  Google Scholar 

  53. Grundy FJ, Henkin TM. The S box regulon: a new global transcription termination control system for methionine and cysteine biosynthesis genes in gram-positive bacteria. Mol Microbiol. 1998;30:737–49.

    Article  CAS  PubMed  Google Scholar 

  54. McDaniel BAM, Grundy FJ, Artsimovitch I, Henkin TM. Transcription termination control of the S box system: direct measurement of S-adenosylmethionine by the leader RNA. Proc Natl Acad Sci U S A. 2003;100:3083–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Grundy FJ, Henkin TM. The T box and S box transcription termination control systems. Front Biosci. 2003;8:20–31.

    Article  Google Scholar 

  56. Merino E, Yanofsky C. Transcription attenuation: a highly conserved regulatory strategy used by bacteria. Trends Genet. 2005;21:260–4.

    Article  CAS  PubMed  Google Scholar 

  57. Vitreschak AG, Mironov AA, Lyubetsky VA, Gelfand MS. Comparative genomic analysis of T-box regulatory systems in bacteria. RNA. 2008;14:717–35.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Barrick JE, Corbino KA, Winkler WC, Nahvi A, Mandal M, Collins J, et al. New RNA motifs suggest an expanded scope for riboswitches in bacterial genetic control. Proc Natl Acad Sci U S A. 2004;101:6421–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Sudarsan N, Lee ER, Weinberg Z, Moy RH, Kim JN, Link KH, et al. Riboswitches in eubacteria sense the second messenger cyclic di-GMP. Science. 2008;321:411–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Gao A, Serganov A. Structural insights into recognition of c-di-AMP by the ydaO riboswitch. Nat Chem Biol. 2014;10:787–92.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Begley TP, Downs DM, Ealick SE, McLafferty FW, Van Loon AP, Taylor S, et al. Thiamin biosynthesis in prokaryotes. Arch Microbiol. 1999;171:293–300.

    Article  CAS  PubMed  Google Scholar 

  62. Lünse CE, Scott FJ, Suckling CJ, Mayer G. Novel TPP-riboswitch activators bypass metabolic enzyme dependency. Front Chem. 2014;2:53.

    PubMed  PubMed Central  Google Scholar 

  63. Chauvier A, Turcotte P, Perreault J, Lafontaine DA, Naghdi MR, Dube A. Transcriptional pausing at the translation start site operates as a critical checkpoint for riboswitch regulation. Nat Commun. 2017;8:13892.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Thomason MK, Storz G. Bacterial antisense RNAs: how many are there and what are they doing? Annu Rev Genet. 2011;44:167–88.

    Article  Google Scholar 

  65. Dakora FD, Phillips DA. Root exudates as mediators of mineral acquisition in low-nutrient environments. Plant Soil. 2002;245:35–47.

    Article  CAS  Google Scholar 

  66. Zwieb C, Nues RWV, Rosenblad MALM, Brown JD, Samuelsson T. A nomenclature for all signal recognition particle RNAs. RNA. 2005;48:7–13.

    Article  Google Scholar 

  67. Sperb ER, Tadra-Sfeir MZ, Sperotto RA, Fernandes Gde C, Pedrosa FO, de Souza EM, et al. Iron deficiency resistance mechanisms enlightened by gene expression analysis in Paenibacillus riograndensis SBR5. Res Microbiol. 2016;167:501–9.

    Article  CAS  PubMed  Google Scholar 

  68. Hou X, Yu X, Du B, Liu K, Yao L, Zhang S, et al. A single amino acid mutation in Spo0A results in sporulation deficiency of Paenibacillus polymyxa SC2. Res Microbiol. 2016;167:472–9.

    Article  CAS  PubMed  Google Scholar 

  69. Sierro N, Makita Y, De Hoon M, Nakai K. DBTBS: a database of transcriptional regulation in Bacillus subtilis containing upstream intergenic conservation information. Nucleic Acids Res. 2008;36:93–6.

    Article  Google Scholar 

  70. Poza-Carrión C, Jiménez-Vicente E, Navarro-Rodríguez M, Echavarri-Erasun C, Rubio LM. Kinetics of nif gene expression in a nitrogen-fixing bacterium. J Bacteriol. 2014;196:595–603.

    Article  PubMed  PubMed Central  Google Scholar 

  71. Wealand JAYL, Myers JA, Hirschberg R. Changes in gene expression during nitrogen starvation in Anabaena variabilis ATCC 29413. J Bacteriol. 1989;171:1309–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  72. Sant’Anna FH, Ambrosini A, de Souza R, Fernandes GD, Bach E, Balsanelli E, Baura V, Brito LF, Wendisch VF, Pedrosa FD, Souza EM, Passaglia LMP. Reclassification of Paenibacillus riograndensis as a genomovar of Paenibacillus sonchi: genome-based metrics improve bacterial taxonomic classification. Front. Microbiol. 2017;8:1849.

Download references


We thank Anika Winkler and Tobias Busche for the kind assistance with the preparation and sequencing of the cDNA libraries and Dr. Alexander Sczyrba and Maximilian Wiens for the bioinformatics advice.


LFB is funded by the Science without Borders program (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, Brazil). The funding agency was not involved in the design of the study, collection, analysis, and interpretation of data and in writing the manuscript.

Availability data and materials

The data sets supporting the results of this article are available in the NCBI Gene Expression Omnibus database; under the accession number GSE98766,

Author information

Authors and Affiliations



LFB and MI performed experimental procedure and the complete data analysis of the present study. LFB prepared a draft of the manuscript. VFW and JK coordinated the study and helped finalize the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Volker F. Wendisch.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1: Table S1.

Bacterial strains, plasmids and oligonucleotides used in this study. *Overlap regions in bold; oligonucleotide sequencess in low caps were used in RNA samples to detect genomic DNA contamination. (XLSX 9 kb)

Additional file 2: Table S2.

Delta OD, growth rate and OD of harvested cells of P. riograndensis SBR5 when cultivated in Paenibacillus minimal medium (PbMM) or lysogeny broth (LB) with a variation of growth parameters. Cells cultivated at 30 °C and transferred to LB medium at 4 °C or 50 °C for 5 min for application of treatment of *cold shock or +heat shock, respectively. (XLSX 14 kb)

Additional file 3: Table S3.

List of CDS of P. riograndensis SBR5 with corrected translational start sites. (XLSX 288 kb)

Additional file 4: Table S4.

Putative RNA motifs present in the genome of P. riograndensis SBR5. Highlighted cells are referent to riboswitches detected in the transcriptome analysis. (XLSX 22 kb)

Additional file 5: Table S5.

Novel antisense transcripts of P. riograndensis SBR5. (XLSX 16 kb)

Additional file 6: Table S6.

Novel intragenic transcripts of P. riograndensis SBR5. (XLSX 21 kb)

Additional file 7: Table S7.

Novel intergenic transcripts of P. riograndensis SBR5. (XLSX 12 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Brito, L.F., Irla, M., Kalinowski, J. et al. Detailed transcriptome analysis of the plant growth promoting Paenibacillus riograndensis SBR5 by using RNA-seq technology. BMC Genomics 18, 846 (2017).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: