Deep RNA sequencing of L. monocytogenes reveals overlapping and extensive stationary phase and sigma B-dependent transcriptomes, including multiple highly transcribed noncoding RNAs
- Haley F Oliver†1,
- Renato H Orsi†1,
- Lalit Ponnala2,
- Uri Keich3, 4,
- Wei Wang5,
- Qi Sun2,
- Samuel W Cartinhour6, 7,
- Melanie J Filiatrault6, 7,
- Martin Wiedmann1 and
- Kathryn J Boor1Email author
© Oliver et al; licensee BioMed Central Ltd. 2009
Received: 1 June 2009
Accepted: 30 December 2009
Published: 30 December 2009
Identification of specific genes and gene expression patterns important for bacterial survival, transmission and pathogenesis is critically needed to enable development of more effective pathogen control strategies. The stationary phase stress response transcriptome, including many σB-dependent genes, was defined for the human bacterial pathogen Listeria monocytogenes using RNA sequencing (RNA-Seq) with the Illumina Genome Analyzer. Specifically, bacterial transcriptomes were compared between stationary phase cells of L. monocytogenes 10403S and an otherwise isogenic ΔsigB mutant, which does not express the alternative σ factor σB, a major regulator of genes contributing to stress response, including stresses encountered upon entry into stationary phase.
Overall, 83% of all L. monocytogenes genes were transcribed in stationary phase cells; 42% of currently annotated L. monocytogenes genes showed medium to high transcript levels under these conditions. A total of 96 genes had significantly higher transcript levels in 10403S than in ΔsigB, indicating σB-dependent transcription of these genes. RNA-Seq analyses indicate that a total of 67 noncoding RNA molecules (ncRNAs) are transcribed in stationary phase L. monocytogenes, including 7 previously unrecognized putative ncRNAs. Application of a dynamically trained Hidden Markov Model, in combination with RNA-Seq data, identified 65 putative σB promoters upstream of 82 of the 96 σB-dependent genes and upstream of the one σB-dependent ncRNA. The RNA-Seq data also enabled annotation of putative operons as well as visualization of 5'- and 3'-UTR regions.
The results from these studies provide powerful evidence that RNA-Seq data combined with appropriate bioinformatics tools allow quantitative characterization of prokaryotic transcriptomes, thus providing exciting new strategies for exploring transcriptional regulatory networks in bacteria.
See minireivew http://jbiol.com/content/8/12/107.
The development of powerful new DNA sequencing technologies has yielded new tools with the potential for dramatically revolutionizing scientific approaches to biological questions . These new technologies can be used for a variety of applications, including genome sequencing, identification of DNA-methylation sites, population studies, chromatin precipitation (CHIP-Seq), and transcriptome studies (RNA-Seq). For RNA-Seq, cDNA is generated from an mRNA-enriched total RNA preparation and sequenced using high-throughput technology. Here, we used the Illumina Genome Analyzer to characterize the transcriptome of stationary phase Listeria monocytogenes 10403S and its isogenic ΔsigB mutant, which lacks the general stress response sigma factor, σB.
L. monocytogenes, a Gram-positive foodborne pathogen of the Firmicutes family, is the etiological agent of the disease known as listeriosis. As 20% of listeriosis cases result in death in humans, with an estimated annual human death toll of ~ 500 in the US alone , this disease is a considerable public health concern. As a foodborne pathogen (with 99% of human illnesses caused by a foodborne route of infection ), this bacterium also presents challenging food safety concerns due to its ability to survive and grow under many conditions that are typically applied to control bacterial populations in foods, such as low pH, low temperature and high salt conditions [3–5]. The alternative general stress response sigma factor, σB, is an essential component of a regulatory mechanism that contributes to the ability of L. monocytogenes to respond to and survive exposure to harsh environmental conditions .
Sigma factors are dissociable subunits of prokaryotic RNA polymerase responsible for enzyme recognition of a conserved DNA sequence encoding a transcriptional promoter site. Promoter recognition specificities of bacterial RNA polymerase are determined by the transient association of an appropriate sigma factor with core polymerase in response to conditions affecting the cell . The regulon of a single alternative sigma factor can include hundreds of transcriptional units, thus sigma factors provide an effective mechanism for simultaneously regulating large numbers of genes under appropriate conditions . Critical phenotypic functions regulated by alternative sigma factors range from bacterial sporulation  to stress response systems [6, 9].
Through microarray analyses, the σB regulon in L. monocytogenes has been reported to encompass more than 200 genes, including both virulence and stress response genes, many of them up-regulated upon entry into stationary phase [10–12]. However, interpretation of microarray analyses is dependent on the quality of existing genome annotations, which are rarely experimentally verified. Further, transcripts that do not correspond to annotated features (e.g., noncoding RNA transcripts) cannot be identified. In addition, the utility of microarrays is limited by the genomic variation that exists among bacterial strains (i.e., ideally, a unique microarray should be constructed for each strain to be analyzed) and by technical biases such as cross-hybridization. Hence, microarray data can be difficult to analyze and occasionally, misleading [13, 14]. Although interpretation of RNA-Seq data also relies on the availability of a genome sequence, it is probe- and annotation-independent and therefore, is free of cross-hybridization and low-hybridization biases, hence enabling genome-wide identification of all transcripts, including small noncoding RNAs (ncRNAs). Moreover, because RNA-Seq technology can generate multiple reads corresponding to each transcribed nucleotide on the genome, it is usually possible to identify 5' and 3' transcript ends with high resolution . Therefore, in combination with bioinformatics tools, RNA-Seq data can be used to identify transcriptional promoters and terminators. We used L. monocytogenes as a model system to explore application of RNA-Seq for the dual purposes of genome-wide transcriptome characterization in a bacterial pathogen and comprehensive quantification of target gene expression for the alternative sigma factor, σB.
RNA-Seq provided comprehensive coverage of the L. monocytogenes transcriptome
Summary of RNA-Seq coverage data
10403S replicate 2
ΔsigB replicate 1
ΔsigB replicate 2
Reads that aligned uniquely with no mismatches (U0)
Reads that aligned uniquely with 1 mismatch (U1)
Reads that aligned uniquely with 2 mismatches (U2)
USUM = U0 + U1 + U2
Reads that aligned at more than one location (reads not used; R)
Reads that did not align to the pseudochromosome (NM)
Total number of reads in the sample (Total = USUM + R +NM)
Percentage of unique alignments, i.e. 100*(USUM)/Total
Reads that aligned to the 16S rRNA gene (16S)
Reads that aligned to the 23S rRNA gene (23S)
Reads that aligned to the 16S and 23S rRNA genes (16S + 23S)
Percentage of all reads that aligned to 16S and 23S rRNA genes
UTOTAL = USUM - (16S + 23S)
Normalization factor (f norm = 894,344/UTOTAL)a
To allow for quantitative comparisons among genes and runs, the coverage for each run was normalized for the total number of reads in each run and for gene size. The normalized data are presented as the Gene Expression Index (GEI), which is expressed as the number of reads per 100 bases . Although in silico analyses suggested that the sequencibility (i.e., the portion of the pseudochromosome that could yield unique 32 nt reads) of the 10403S pseudochromosome was 99.6% (Additional file 1: Sequencibility text file), approximately 77.5% of the genome was covered by reads from at least one of the four runs, suggesting that more than 20% of the genome is not transcribed or is transcribed at low levels.
RNA-Seq coverage correlated with qRT-PCR transcript levels indicating that RNA-Seq data are quantitative
Stationary phase L. monocytogenes transcribed at least 83% of annotated genes
Genes with highest GEI
EGD-e locus b
10403S Average GEI c
transfer-messenger RNA (tmRNA)
non-heme iron-binding ferritin
bacterial signal recognition particle RNA
similar to adhesion binding proteins and lipoproteins with multiple specificity for metal cations (ABC transporter)
bacterial RNAse P class B
similar to major cold-shock protein
similar to metal cations ABC transporter, ATP-binding proteins
similar to hypothetical proteins
similar to unknown proteins
similar to B. subtilis SpoVG protein
similar to cold shock protein
similar to non-specific DNA-binding protein HU
similar metal cations ABC transporter (permease protein)
30S ribosomal protein S21
similar to B. subtilis SpoVG protein
similar to B. subtilis YwmG protein
similar to B. subtilis general stress protein
similar to glutamate decarboxylase
highly similar to dihydrolipoamide dehydrogenase, E3 subunit of pyruvate dehydrogenase complex
similar to alpha,alpha-phosphotrehalase
similar to phosphotransferase system beta-glucoside-specific enzyme IIB component
similar to B. subtilis YvyD protein
highly similar to pyruvate dehydrogenase (E1 beta subunit)
similar to PTS system trehalose-specific enzyme IIBC
highly similar to pyruvate dehydrogenase (dihydrolipoamide acetyltransferase E2 subunit)
similar to amino acid antiporter (acid resistance)
similar to aminotransferase
similar to unknown protein
Associations between GEI and role categories
Low average GEI in 10403S
Amino acid biosynthesis
Transport and binding
High average GEI in 10403S
Purines, pyrimidines, nucleosides, and nucleotides
Identification and annotation of noncoding RNAs (ncRNAs)
New L. monocytogenes ncRNAsa identified in this study
Coordinates in 10403S
10403S Average GEI b
ΔsigB Average GEI c
Three putative ncRNAs with high GEI covered either part or all of each of three annotated CDS, suggesting that ncRNAs overlap with these CDS or that some putative CDS actually encode ncRNAs rather than proteins. Specifically, LMRG_01574 (lmo2257), LMRG_02926 (no homolog in EGD-e), and LMRG_1986 (lmo2711) overlapped with lhrA (partial overlap), with the bacterial RNAse P class B ncRNA (full overlap), and with the bacterial signal recognition particle RNA (partial overlap), respectively. In concert with our findings, lmo2257 was previously hypothesized not to be a CDS [19, 21].
RNA-Seq identified 96 annotated CDS and one ncRNA as σB-dependent and provided comprehensive data on transcript levels for genes in the σB regulon
One-sided Fisher's exact tests were used to determine if σB-dependent genes are over-represented within specific TIGR role categories. Genes identified as σB-dependent were over-represented among genes involved in cellular functions (q-value = 0.045). σB-dependent genes in this category include genes involved in pathogenesis (inlA, inlB, inlH), adaptation to atypical conditions (lmo0515, lmo0669, lmo2673, lrtC), detoxification (lmo1433, lmo2230), cell division (lmo1624) and an unknown protein that may be involved in toxin production and resistance (lmo0321).
Summary of genes up-regulated by σB
Avg. fold change (WT/ΔsigB)a
10403S Average GEIb
ΔsigB Average GEIc
σB-dependent genes found by RNA-Seq and not previously identified by microarray analyses of stationary phase cells
similar to phage proteins
similar to succinyldiaminopimelate desuccinylase
similar to beta-glucosidase
similar to glycine betaine/carnitine/choline ABC transporter (ATP-binding protein)
similar to betaine/carnitine/choline ABC transporter (membrane p)
similar to glycine betaine/carnitine/choline ABC transporter (osmoprotectant-binding protein)
similar to conserved hypothetical proteins
similar to transcription regulator GntR family
similar to PTS system, fructose-specific IIABC component
putative ncRNA, sbrE
σ B -dependent genes with Average GEI ≥ 25 in 10403S
similar to B. subtilis YwmG protein
similar to unknown proteins
similar to tagatose-1,6-diphosphate aldolase
similar to arsenate reductase
similar to oxidoreductase
similar to ribose 5-phosphate epimerase
low temperature requirement C protein, also similar to B. subtilis YutG protein
similar to host factor-1 protein
similar to B. subtilis stress protein YdaG
conserved hypothetical protein
similar to ribosomal-protein-alanine N-acetyltransferase
conserved hypothetical protein similar to B. subtilis YhfK protein
similar to B. subtilis yvlC protein
similar to B. subtilis YwnB protein
similar to unknown proteins
conserved hypothetical protein
similar to succinate semialdehyde dehydrogenase
similar to mannose-specific phosphotransferase system (PTS) component IID
similar to mannose-specific phosphotransferase system (PTS) component IIC
similar to DNA translocase
similar to Chain A, Dihydrofolate Reductase
similar to CDP-abequose synthase
similar to zinc-binding dehydrogenase
similar to mannose-specific phosphotransferase system (PTS) component IIA
conserved hypothetical protein
similar to nicotinamidase
An ~ 500 nt σB-dependent ncRNA was identified between lmo2141 and lmo2142 (Figure 4B); this ncRNA was recently designated rli47 . To be consistent with the nomenclature for other σB-dependent ncRNA , we propose that rli47 be named sbrE (s igma B-dependent R NA). Although BLASTX searches (using 6 possible reading frames) and searches against the Pfam database did not yield significant matches, a σB-dependent promoter was identified upstream of the transcript and a Rho-independent terminator was found by TransTermHP (Figure 4B). The sequence for this putative ncRNA was also present in 17 other L. monocytogenes genomes, including EGD-e (GenBank accession no. NC 003210), F2365 (GenBank accession no. NC 002973), and 15 unfinished genome sequences by the Broad Institute http://www.broad.mit.edu/annotation/genome/listeria_group/MultiHome.html as well as in one L. innocua (GenBank accession no. NC 003212) and one L. welshimeri (GenBank accession no. NC 008555) genome. The 514 nt sbrE (rli47) sequence was 96.6% conserved among the 18 L. monocytogenes genomes.
HMM showed that 84% of σB-dependent genes and operons identified by RNA-Seq are preceded by σB promoters and therefore, appear to be directly regulated by σB
RNA-Seq successfully identifies a number of previously identified as well as novel σB-dependent genes
To evaluate the ability of RNA-Seq to identify L. monocytogenes σB-dependent genes, we compared the σB-dependent genes identified here with those identified in two independent microarray studies by our research group. Specifically, we compared our results with microarray data reported by (i) Raengpradub et al. , who identified σB-dependent genes using L. monocytogenes strains and growth conditions identical to those in this study, and by (ii) Ollinger et al. , who identified σB-dependent genes by comparing transcripts from L. monocytogenes 10403S with a PrfA* (G155S) allele , which constitutively expresses the PrfA-regulated virulence genes [24–26], with those from an isogenic ΔsigB mutant grown to stationary phase under the same conditions used here. Further, we compared our results with those from a microarray study using another L. monocytogenes strain (EGD-e) and its isogenic ΔsigB mutant, grown under similar conditions (i.e., growth to early stationary phase ). Among the 96 σB-dependent annotated CDS identified in the present study, 72 were also identified as σB-dependent in previous microarray studies of stationary phase L. monocytogenes 10403S cells [10, 12] (Figure 3). In addition, 64 (66.7%) of the 96 σB-dependent genes identified here were identified as positively regulated by σB in L. monocytogenes strain EGD-e cells grown to early stationary phase (8 h growth in BHI) . Overall, 12 genes identified as σB-dependent in stationary phase cells in both previous microarray studies by our group [10, 12], were not identified as σB-dependent by the RNA-Seq experiments reported here (Figure 3); 9 of these genes showed a σB-dependent promoter based on the HMM analyses in this study and are likely to be directly regulated by σB (see Additional file 8: Comparison of genes found to be σB-dependent by microarray analysis and not by RNA-Seq for further details on these genes).
Finally, a total of 13 annotated CDS identified as σB-dependent by RNA-Seq (including 9 genes that also showed a σB-dependent promoter in our HMM analysis) had not been identified as σB-dependent in either of the previous microarray studies with strain 10403S grown to stationary phase [10, 12] (see Table 3). Among these 13 genes not previously identified as σB-dependent in stationary phase L. monocytogenes 10403S, five had previously been identified as σB-dependent in salt-stressed cells , including the well-characterized virulence genes inlA and inlB, which have also been shown by qRT-PCR and promoter mapping to be directly regulated by σB . In addition, two of these 13 genes had been identified as positively regulated by σB in L. monocytogenes strain EGD-e , even though they had not been identified as σB-dependent in previous microarray studies of strain 10403S [10, 12]. For one of these genes (i.e. lmo0265), the microarray probe (designed based on the genome of L. monocytogenes strain EGD-e) showed a low hybridization index (HI; % match between strain-specific sequence and oligonucleotide probe) to 10403S (< 80%). Interestingly, lmo2003, which encodes a transcription regulator similar to the GntR family, was identified as σB-dependent by RNA-Seq, but had not been previously identified as σB-dependent in either 10403S or EGD-e.
In this study, we used deep RNA sequencing to define and characterize the transcriptomes of L. monocytogenes strain 10403S and an otherwise isogenic ΔsigB mutant, which does not express the general stress-response sigma factor, σB. The data generated using this approach showed that (i) at least 83% of annotated L. monocytogenes genes are transcribed in stationary phase cells; and (ii) stationary phase L. monocytogenes transcribes 67 ncRNAs, including one σB-dependent ncRNA and seven ncRNAs that, to our knowledge, have not previously been identified in L. monocytogenes. Additionally, RNA-Seq data provided for quantitation of transcript levels and approximate identification of transcriptional start sites on a genome scale. Use of a novel, iterative, dynamic HMM, in combination with RNA-Seq data, identified putative σB-dependent promoters and further defined the L. monocytogenes σB regulon.
The majority of annotated L. monocytogenes genes are transcribed in stationary phase cells
While genome sequencing and microarray approaches have provided important insight into the biology of prokaryotic organisms, including a number of human bacterial pathogens, identification of all genes and their transcriptional patterns remains a major challenge in all areas of biology. Our results demonstrate that global probe-independent approaches for transcriptome characterization are valuable tools for analyzing bacterial transcriptomes [16, 28, 29]. A major challenge that currently hinders analysis of transcriptomic data generated by approaches such as RNA-Seq is the ability to differentiate between genes with low levels of transcription and background levels of coverage. Several approaches have been used to define cut-off values between background GEI and GEI indicative of low transcript levels (e.g., [15, 30, 31]). We chose a comparative analysis of L. monocytogenes 10403S transcript levels with those of a mutant strain that does not express a transcription factor (i.e., the alternative sigma factor σB) as a novel approach for robustly defining background RNA-Seq coverage. Our results show that a number of σB-dependent genes were solely σB-dependent (at least under the conditions used here), as supported by the lack of detectable RNA-Seq coverage in the ΔsigB strain, despite considerable RNA-Seq coverage of the same genes in the isogenic parent strain 10403S. This is an important observation as a number of σB-dependent L. monocytogenes genes are also activated by other sigma factors (e.g., σA [32, 33]). Using the average GEI for L. monocytogenes genes that were solely σB-dependent in the ΔsigB strain as a conservative cut-off value for transcribed genes, we found that approximately 83% of L. monocytogenes 10403S annotated CDS were transcribed in stationary phase cells. These transcribed genes include 355 putative operons, which cover a total of 1,107 genes, indicating that a considerable proportion of L. monocytogenes genes appear to be transcribed polycistronically. In comparison, a recent study using a tiling microarray identified 517 polycistronic operons that encompass 1,719 genes in L. monocytogenes EGD-e . Taken together, these data indicate that the majority of annotated L. monocytogenes genes are transcribed. This conclusion is consistent with results from a whole-genome tiled microarray transcriptome study of E. coli MG1655 , which reported transcription of 4052 E. coli MG1655 genes in bacteria grown under different conditions, suggesting that about 98% of the E. coli MG1655 genes are transcribed.
Our results also demonstrate that RNA-Seq coverage levels (generated with the Illumina Genome Analyzer System) correlate well with quantitative RT-PCR-based mRNA transcript level data. Therefore, in combination with results from previous studies (e.g., in yeast [15, 31], human cell lines , human tissue , murine tissue ), our findings indicate that RNA-Seq tools can be broadly applied in biological studies to enable quantitative analysis of transcript levels. We also found a positive correlation between RNA-Seq-based transcript levels and codon bias, consistent with the well-documented observation that genes with high codon bias are often highly expressed [37–39]. Genes in four role categories, including (i) signal transduction, (ii) viral functions, (iii) amino acid biosynthesis, and (iv) transport and binding, were significantly associated with lower transcript levels. These categories include a number of genes that encode proteins predominantly required for growth and survival under specialized environmental conditions (e.g., viral replication genes) or under conditions other than stationary phase (e.g., amino acid biosynthesis may be less important in stationary phase than during exponential growth as sufficient amino acids from dead bacteria are likely to be available for scavenging), and/or proteins that may only be required in small amounts. On the other hand, we found that genes in seven role categories, including (i) cellular processes, (ii) DNA metabolism, (iii) protein fate, (iv) protein synthesis, (v) purines, pyrimidines, nucleosides, and nucleotides, (vi) transcription, and (vii) genes encoding proteins with unknown functions, showed, on average, higher transcript levels in stationary phase L. monocytogenes. These findings suggest that genes in these particular categories are important for bacterial cells transitioning from exponential growth to stationary phase.
Overall, the L. monocytogenes genes with the highest transcript levels were ncRNAs, specifically the transfer-messenger RNA (tmRNA) and 6S RNA, consistent with the observation that tmRNAs are involved with bacterial recovery from a variety of stresses including entry into stationary phase, amino acid starvation, and heat shock . 6S RNA accumulates in cells during stationary phase; cells lacking 6S RNA have reduced fitness relative to wildtype stationary phase cells . In addition to down-regulating some housekeeping genes, 6S RNA has been shown to up-regulate expression of some σS-dependent genes in Gram-negative bacteria . σS is the stationary phase stress response alternative sigma factor in E. coli . Taken together, we hypothesize that 6S RNA plays a critical role in the ability of L. monocytogenes to survive stationary phase associated stress conditions.
Specific protein-encoding genes with very high transcript levels in stationary phase L. monocytogenes include fri, sod, cspB, and cspL, all genes with some previous evidence for contributions to L. monocytogenes stationary phase and stress survival [43–49]. flaA, which encodes a flagellin protein, was also highly transcribed in stationary phase cells at 37°C. Although L. monocytogenes has been reported to show flagellar motility only when grown at ≤ 30°C [50, 51], our results are consistent with the observation that strain 10403S, which was used in this study, has been shown to express flagellin at 37°C . Interestingly, we also found some annotated CDS without known function to be highly transcribed, including lmo1847 and lmo1849, which encode putative ABC transporters based on BLAST and Pfam  searches, respectively, and lmo1468, which encodes an unknown protein.
RNA-Seq identifies ncRNA molecules in L. monocytogenes, including a σB-dependent ncRNA, in 10403S
Using RNA-Seq, we found 67 previously identified or putative ncRNAs that were transcribed in stationary phase L. monocytogenes. Of these, 7 represent ncRNAs that have not been identified previously as transcribed in L. monocytogenes. Sixty of the ncRNAs identified here have previously been reported by Toledo-Arana et al. , Nielsen et al. , Mandin et al.  and Christiansen et al. . Interestingly, 16 L. monocytogenes ncRNAs with similarities to ncRNAs identified in other bacterial organisms are putative riboswitches. We also found that sbrE (rli47), which has no homologies to ncRNA entries in Rfam, appears to be directly regulated by σB, based on the considerably higher transcript levels (186 fold) present in the parent strain as compared to the sigB-null mutant, consistent with results from a recent tiling microarray study . As the RNA isolation procedure used here selected against small RNA molecules (see Materials and Methods for details), it is likely that additional small ncRNAs not detected here (e.g., some small ncRNAs identified by Toledo-Arana et al. ), are also transcribed in stationary phase L. monocytogenes 10403S.
Prior to this study, L. monocytogenes ncRNAs, including potential σB-dependent ncRNAs , had been identified using in silico modeling [22, 53], co-precipitation with the RNA-binding protein Hfq , and, most recently, tiling microarrays . While, among these approaches, tiling microarrays  provided the most comprehensive characterization of L. monocytogenes ncRNAs, deep RNA sequencing also identified a large number of transcribed L. monocytogenes ncRNAs, including ncRNAs with no similarities to previously identified ncRNAs. Our results, taken together with previous studies that have identified numerous novel transcripts with RNA-Seq in bacteria (S. meliloti , B. cenocepacia , V. cholerae ), yeast [15, 31], mouse , Arabidopsis , human cell lines [35, 55], and human tissue , clearly show the power of this technique for characterizing bacterial transcriptomes and ncRNAs.
The L. monocytogenes σB regulon is composed of at least 96 genes, including 82 genes and 1 ncRNA that are preceded by putative σB promoters
As alternative sigma factors, such as σB, are known to play critical roles in gene regulation across bacterial genera , we used L. monocytogenes 10403S and an isogenic ΔsigB null mutant as a model system for exploring the use of RNA-Seq, in combination with in silico analyses, for characterization of transcriptional blueprints associated with bacterial regulatory elements. In our study, RNA-Seq identified 96 annotated CDS and one ncRNA SbrE (Rli47) that are up-regulated by σB. Quantitative RT-PCR experiments also confirmed σB-dependent transcript levels of SbrE (Rli47) (Mujahid et al., unpublished). Among the 96 σB-dependent annotated CDS identified in this study, 74 (77.1%)  and 81 (84.4%)  were also identified as σB-dependent in stationary phase cells in two previous microarray studies using the same strain background. Also, 63 of the 96 σB-dependent genes identified here were reported as positively regulated by σB in another L. monocytogenes strain (EGD-e) grown to early stationary phase . Twelve genes were identified as σB-dependent in both previous microarray studies performed with the same L. monocytogenes strain background and the same conditions used here, but were not identified as σB-dependent by RNA-Seq in this study. This disparity is likely due to the fact that the thresholds and statistical cut-offs used to define σB-dependent genes were very stringent in the present study (e.g., a q-value < 0.05 in all four comparisons).
Overall, in addition to confirming a previously identified σB-dependent ncRNA , RNA-Seq identified 13 genes that had not been defined as σB-dependent in previous microarray studies of stationary phase L. monocytogenes 10403S cells [10, 12], including 5 genes that had been identified as σB-dependent in salt stressed cells, but not in stationary phase cells. One gene not previously identified as σB-dependent was lmo2003, which encodes a transcription regulator similar to the GntR family. The GntR family of regulators has been characterized as global regulators of primary metabolism in a number of bacteria [56–58]. This finding further supports that L. monocytogenes σB appears to be involved in a number of transcriptional regulatory networks . Increasing evidence indicates that regulatory RNAs also contribute to regulatory networks that involve L. monocytogenes σB. For example, in addition to the σB-dependent SbrE ncRNA described here, tiling array analyses also identified additional σB-dependent ncRNAs. While previous in silico studies in L. monocytogenes strain EGD-e  identified four putative σB-dependent ncRNAs (i.e., SbrA, SbrB, SbrC, SbrD), only SbrA was confirmed in vivo as σB-dependent in EGD-e [20, 53]. Even though our RNA-Seq analyses in 10403S identified SbrA transcripts, transcript levels for this ncRNA were not σB-dependent under the conditions used in our study. The fact that SbrA was not found to be σB-dependent in 10403S may be due to differences in strains or growth conditions used (e.g., Nielsen et al.  and Toledo-Arana et al.  used strain EGD-e, while we used strain 10403S). Further studies in different L. monocytogenes strains will thus be needed to understand the full complexity of regulatory networks in this pathogen, including those involving σB and ncRNAs.
The quantitative nature of RNA-Seq allowed us to also identify highly transcribed σB-dependent genes, including lmo2158 (which encodes a protein similar to the B. subtilis YwmG), lmo1602 (which encodes an unknown protein), and lmo0539 (which encodes a tagatose-1,6-diphosphate aldolase). Interestingly, none of these genes encode proteins that appear to contribute to any of the presently recognized σB-dependent phenotypes in L. monocytogenes, such as acid resistance [9, 59], oxidative stress resistance [59, 60], or virulence [27, 33, 61, 62]. As there are no published reports of construction and characterization of null mutations in these highly transcribed σB-dependent genes, our data clearly suggest that σB and the σB regulon make additional important contributions to L. monocytogenes physiology that remain to be characterized.
In conjunction with appropriate bioinformatics tools, such as the iterative, dynamic HMM developed in this study to identify putative σB promoters, RNA-Seq data also allowed mapping of approximate transcriptional start and termination sites. Specifically, putative σB-dependent promoters were identified upstream of (i) 49 monocistronic σB-dependent genes, (ii) 15 σB-dependent operons (covering a total of 40 genes), and (iii) 1 σB-dependent ncRNA. By comparison, in the absence of genome wide transcriptional start site data, a previous study that solely relied on HMM and genome sequence data identified putative σB-dependent promoters upstream of only 40 genes that had been identified as σB-dependent by microarray analyses . Our data reported here show that the majority of σB-dependent genes are directly regulated by σB and illustrate the power of combining RNA-Seq data and bioinformatics approaches for characterizing transcriptional regulatory systems. Specifically, combining transcriptional start site information with an HMM that identifies promoter motifs (e.g., the motif for σB-dependent promoters) provides a powerful approach for identifying genes directly regulated by a given transcription factor. This approach facilitates rapid genome-wide identification of putative transcriptional start sites, which currently represents a critical bottleneck in genome-wide characterization of transcriptional regulation and regulatory networks, as many current strategies for promoter mapping (e.g., primer extension, rapid amplification of cDNA ends (RACE-PCR), RNAse protection assays) are time- and labor-intensive.
Using the human foodborne pathogen L. monocytogenes as a model system, we have shown that RNA-Seq provides a powerful approach to (i) rapidly, comprehensively, and quantitatively characterize prokaryotic genome-wide transcription profiles without hybridization bias, and (ii) characterize putative transcriptional start sites and operon structures. We also show that RNA-Seq transcriptomic evaluation of a bacterial strain bearing a deletion in a transcriptional regulator in comparison with its parent strain can provide rapid, comprehensive insights into the blueprints of prokaryotic transcriptional regulation. Such tools and approaches will revolutionize our ability to characterize genome-wide transcriptional regulatory networks, with wide ranging applications from medicine to ecology, e.g., by providing a means to quickly characterize transcriptional networks contributing to pathogen transmission and virulence as well as environmental growth and gene expression in bacteria used for specific purposes, such as bio-remediation. When applied to both genome and transcriptome sequencing, novel high throughput sequencing approaches can also provide rapid and comprehensive characterization of bacterial genomes, representing an important tool for initial rapid characterization of novel and emerging bacterial pathogens.
Strains and growth conditions
RNA-Seq was performed on the L. monocytogenes parent strain 10403S and a previously described  isogenic mutant (ΔsigB, FSL A1-254) with an internal non-polar deletion of sigB, which encodes the stress response alternative sigma factor σB.
Prior to RNA isolation, bacteria were grown in 5 ml Brain Heart Infusion (BHI) broth (BD Difco, Franklin Lakes, NJ) at 37°C with shaking (230 rpm) for 15 h, followed by transfer of a 1% inoculum to 5 ml pre-warmed BHI. After growth to OD600 ~ 0.4, a 1% inoculum was transferred to a 300 ml nephelo flask (Bellco, Vineland, NJ) containing 50 ml pre-warmed BHI. This culture was incubated at 37°C with shaking until cells reached stationary phase (defined as growth to OD600 = 1.0, followed by incubation for an additional 3 h). Two independent growth replicates and RNA isolations were performed for each strain.
RNA isolation, integrity and quality assessment
RNA isolation was performed as previously described . Briefly, RNAProtect bacterial reagent (Qiagen, Valencia, CA) was added according to the manufacturer's instructions to the cultures grown to stationary phase; treated cells were stored at -80°C (for no longer than 24 h) until RNA isolation was performed. Bacterial cells were treated with lysozyme followed by 6 sonication cycles at 18W on ice for 30 s. Total RNA was isolated and purified using the RNeasy Midi kit (Qiagen) according to the manufacturer's protocol; RNA molecules <200 nt in length are not recovered well with this procedure, according to the manufacturer. RNA was eluted from the column using RNase-free water. Total RNA was incubated with RQ1 DNase (Promega, Madison, WI) in the presence of RNasin (Promega) to remove remaining DNA. Subsequently, RNA was purified using two phenol-chloroform extractions and one chloroform extraction, followed by RNA precipitation and resuspension of the RNA in RNAse free TE (10 mM Tris, 1 mM EDTA; pH 8.0; Ambion, Austin, TX). UV spectrophotometry (Nanodrop, Wilmington, DE) was used to quantify and assess purity of the RNA.
Efficacy of the DNase treatment was assessed by TaqMan qPCR analysis of DNA levels for two housekeeping genes, rpoB  and gap . qPCR was performed using TaqMan One-Step RT-PCR Master Mix Reagent and the ABI Prism 7000 Sequence Detection System (all from Applied Biosystems, Foster City, CA). Each RNA sample was run in duplicate and standard curves for each target gene were included for each assay to allow for absolute quantification of residual DNA. Data were analyzed using the ABI Prism 7000 Sequence Detection System software as previously described  Normalization and log transformation were performed as described by Kazmierczak et al. . All samples showed log copy numbers ≤ 1.5 and Ct values > 35 for both rpoB and gap, indicating negligible levels of DNA contamination. As a final step, RNA integrity was assessed using the 2100 Bioanalzyer (Agilent, Foster City, CA).
Removal of 16S and 23S rRNA from total RNA was performed using MicrobExpress™ Bacterial mRNA Purification Kit (Ambion) according to the manufacturer's protocol with the exception that no more than 5 μg total RNA was treated per enrichment reaction. Each RNA sample was divided into multiple aliquots of ≤ 5 μg RNA and separate enrichment reactions were performed for each sample. Enriched mRNA samples were pooled and run on the 2100 Bioanalzyer (Agilent) to confirm reduction of 16S and 23S rRNA prior to preparation of cDNA fragment libraries.
Preparation of cDNA fragment libraries
Ambion RNA fragmentation reagents were used to generate 60-200 nucleotide RNA fragments with an input of 100 ng of mRNA. Following precipitation of fragmented RNA, first strand cDNA synthesis was performed using random N6 primers and Superscript II Reverse Transcriptase, followed by second strand cDNA synthesis using RNaseH and DNA pol I (Invitrogen, CA). Double-stranded cDNA was purified using Qiaquick PCR spin columns according to the manufacturer's protocol (Qiagen).
RNA-Seq using the Illumina Genome Analyzer
The Illumina Genomic DNA Sample Prep kit (Illumina, Inc., San Diego, CA) was used according to the manufacturer's protocol to process double-stranded cDNA for RNA-Seq, including end repair, A-tailing, adapter ligation, size selection, and pre-amplification. Amplified material was loaded onto independent flow cells; sequencing was carried out by running 36 cycles on the Illumina Genome Analyzer.
The quality of the RNA-Seq reads was analyzed by assessing the relationship between the quality score and error probability; these analyses were performed on Illumina RNA-Seq quality scores that were converted to phred format http://www.phrap.com/phred/. Quality scores are reported in Additional file 9: Distribution of quality scores for all RNA-Seq runs.
RNA-Seq data will be available in the NCBI GEO Short Read Archives: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE15651.
RNA-Seq alignment and coverage
The program nucmer, which is part of the MUMmer package http://mummer.sourceforge.net/, was used to align the 10403S unfinished genome sequences (available at http://www.broad.mit.edu/annotation/genome/listeria_group/MultiHome.html as supercontigs 5.1 to 5.21) against the finished genome sequence of the L. monocytogenes reference strain EGD-e  to create a pseudochromosome for 10403S. Creation of the 10403S pseudochromosome was performed using the order and orientation of the 10403S supercontigs provided by the alignment with EGD-e; the assembled pseudochromosome was 2.87 Mb long. The annotation of the genes in the individual 10403S supercontigs, as provided by the Broad Institute http://www.broad.mit.edu/annotation/genome/listeria_group/MultiHome.html was then mapped to the 10403S pseudochromosome (Additional file 10: Genbank (gbk) file with ncRNAs identified here). The 5S, 16S and 23S rRNA genes as well as the various tRNA genes in 10403S were identified using blastn and the EGD-e annotated rRNA and tRNA genes as a reference (Genbank ID: AL591824).
Based on quantitative analyses of RNA-Seq data, throughout this manuscript, transcript levels of a given gene are reported as the Gene Expression Index (GEI), which is expressed as number of reads per 100 bases. To obtain the GEI, the 10403S pseudochromosome was used to align Illumina RNA-Seq reads. These alignments were performed using the whole genome alignment software Eland (Illumina), which reports unique alignments of the first 32 bases of each read, allowing up to 2 mismatches. Coverage at each base position along the pseudochromosome was calculated by enumerating the number of reads that align to a given base. The coverage for each base from the first to last nt in an annotated CDS was summed then divided by 32 (i.e., the length of each aligned read) to obtain the RNA-Seq coverage for that gene before normalization. The following data were discarded prior to further analyses: (i) reads with more than 2 mismatches, (ii) reads that matched to multiple locations, (iii) reads that did not map to the chromosome, and (iv) reads that mapped to the 16S or 23S genes (Table 1). Reads identified as "matching two locations" did not include those matching rRNA genes as the 10403S pseudochromosome created for this study was designed with only one unique rRNA gene sequence. Reads matching the 16S and 23S genes were removed prior to normalizing the total number of aligned reads across the four samples because of the technical bias introduced by our deliberate partial removal of 16S and 23S transcripts from the samples. Despite removal of 16S and 23S rRNA, in a given run, between 1,860,817 and 3,138,329 reads aligned to the 23S gene and between 434,263 and 760,863 reads aligned to the 16S gene. In a given run, between 101,419 and 242,246 reads matched the 5S rRNA gene and between 7,778 and 62,699 reads matched the various tRNA genes present in the pseudochromosome.
Because of the inherent differences in the total number of reads among the four runs, the total number of reads for each run was normalized to the run with the highest coverage (i.e. ΔsigB replicate 2, Table 1). The ratio of total number of reads for ΔsigB replicate 2 to the total number of reads for 10403S replicate 1, 10403S replicate 2, or ΔsigB replicate 2 was used as a multiplier to normalize the approximate number of reads matching a given gene (Table 1). The GEI was then obtained by dividing the normalized number of reads matching each gene by the gene length. The average GEI was the number of reads that match each nt in a given gene after normalization; this value represented the average of the 2 biological replicates for a given strain and is presented as reads per 100 bases (as opposed to reads per 1 base) to simplify identification of differences. The distribution of the coefficient of variation for each gene between replicates is depicted in Additional file 11: Coefficient of variation among RNA-Seq replicates by strain.
Identification of transcribed annotated CDS
Depending on RNA-Seq coverage, genes were classified into four categories, including (i) not transcribed (average GEI < 0.7), (ii) low transcript levels (average GEI ≥ 0.7 and < 10), (iii) medium transcript levels (average GEI ≥ 10 and < 25), and (iv) high transcript levels (average GEI ≥ 25). While cut-offs between low, medium, and high transcript level categories were somewhat arbitrary, they were chosen to yield a relative distribution of genes into these categories similar to the distribution of yeast genes into low, medium, and high expression categories reported previously by Nagalakshimi et al. .
Annotation of Rho-independent terminators and putative operons
Potential operons were manually annotated based on the continuity of a similar level of RNA-Seq coverage across consecutive genes and the (i) absence of putative Rho-independent terminators between genes, and/or (ii) presence of a putative Rho-independent terminator at the end of a putative operon. Putative Rho-independent terminators in the 10403S pseudochromosome were identified using the program TransTermHP v2.04 .
Discovery and annotation of regions transcribing ncRNAs
To aid in identification of transcribed ncRNAs, ncRNAs previously identified in L. monocytogenes EGD-e [19–22] were mapped onto the 10403S pseudochromosome and were identified as transcribed in 10403S in this study.
New putative ncRNAs (i.e., ncRNAs not previously reported or previously identified by Rfam) were manually identified using the genome browser Artemis . Specifically, regions not matching annotated genes, but showing contiguous coverage by RNA-Seq reads (i.e., regions that contain at least 100 bp completely covered by RNA-Seq reads) were designated putative ncRNAs. Further, RNA-Seq reads that did not cover an entire annotated CDS, but showed partial contiguous coverage within a CDS, were also designated as putative ncRNAs. All ncRNAs, including those reported in previous publications [19, 20, 22, 53], those identified by Rfam, and those with no matches to the Rfam database were annotated into a Genbank (gbk) file that is available as Additional file 10: Genbank (gbk) file with ncRNAs identified here. ncRNAs identified by RNA-Seq, but with no matches to the Rfam database were designated "putative ncRNA" and received designations from rli64 to rli70. The presence of rho-independent transcriptional terminators was used to assign the strand of putative ncRNAs. For two instances where terminators were not observed, the ncRNAs were annotated on both strands.
Differential expression analysis
To identify genes that showed significantly different transcript levels in the parent strain (10403S) and the ΔsigB strain, statistical analyses were performed using the normalized RNA-Seq coverage of each coding gene (as annotated by the Broad Institute). Normalized RNA-Seq coverage (i.e. the number of reads that match an annotated CDS after normalization across runs) was used in lieu of the GEI (in which the normalized RNA-Seq coverage number is divided by the gene length) for statistical analyses. Corresponding analyses were also performed for each region encoding a putative ncRNA transcript identified as described above. A coverage file of normalized RNA-Seq coverage is available in Additional file 12: Coverage file with the normalized RNA-Seq coverage for the 4 RNA-Seq runs.
For each gene, a binomial probability was calculated for the normalized RNA-Seq coverage, using each of the four possible comparisons between the 10403S and ΔsigB transcripts (i.e. 10403S replicate 1 vs ΔsigB replicate 1; 10403S replicate 1 vs ΔsigB replicate 2; 10403S replicate 2 vs ΔsigB replicate 1; 10403S replicate 2 vs ΔsigB replicate 2). The binomial probability was calculated under the hypothesis that genes that are not regulated by σB will show the same normalized number of reads in the two strains (p = 0.5 and q = 0.5). For a gene to be considered up-regulated by σB, the binomial probability of observing as many reads in the ΔsigB strain as those observed for 10403S had to be < 0.05 for each of the four possible combinations. Conversely, for a gene to be considered down-regulated by σB, the binomial probability of observing as many reads as those observed for ΔsigB had to have q-values < 0.05 for each of the four possible combinations. To control for multiple comparisons, a False Discovery Rate (FDR) approach was used. q-values (representing the FDR) were calculated using the program Q-Value  for R. Only genes with q-values < 0.05 and fold change ≥ 2 or ≤ 0.5 among all four possible comparisons between 10403S and ΔsigB were considered significantly up-regulated or down-regulated by σB.
Iterative HMM-based promoter identification
An initial training set containing 17 experimentally validated σB-dependent promoter motifs was used to build a Hidden Markov Model (HMM) of these motifs (Additional file 13: σB-dependent promoters used for HMM search). HMM construction and searches were performed using the program hmmer version 1.8.5. The HMM was constructed from unaligned sequences (using hmmt) and then used to search the 10403S pseudochromosome (using the hmmls tool). The null frequencies of each nucleotide used were those observed in the L. monocytogenes genome (i.e., A/T = 0.31 and G/C = 0.19).
Correlation of RNA-Seq relative coverage (GEI) with TaqMan absolute transcript copy number
Average GEI was correlated with absolute transcript copy numbers quantified by TaqMan qRT-PCR. qRT-PCR-based transcript level data obtained for selected genes in L. monocytogenes grown under the same conditions used here (i.e., stationary phase) were obtained from previous studies and unpublished work (see Additional file 2: RNA-Seq average GEI and TaqMan qRT-PCR absolute copy number); qRT-PCR methods are detailed in Raengpradub et al. . qRT-PCR data from these studies were used to calculate absolute transcript copy numbers (using a standard curve as described by Sue et al. ); values were log transformed.
One-sided Wilcoxon rank sum tests were used to assess whether genes in certain role categories showed lower or higher average GEI in 10403S than genes in other role categories. One-sided Fisher's exact tests were used to assess whether σB-dependent genes were overrepresented in certain TIGR role categories http://cmr.jcvi.org/cgi-bin/CMR/RoleIds.cgi. Linear regression analysis was used to assess correlations between average GEI and qRT-PCR data as well as between codon bias and average GEI in 10403S. The effective number of codons used in a gene (Nc), a measure of the codon bias, was assessed using the program "chips" implemented in the EMBOSS package . All tests were carried out in R (version 2.7.0; http://www.r-project.org/). Correction for multiple testing was performed using the procedure reported by Benjamini & Hochberg , as implemented in the program Q-Value . Significance was set at 5%.
RNA-Seq data will be available in the NCBI GEO Short Read Archives. All RNA-Seq data are provided in an Access database file (Additional file 4: Access database). This database contains information on the annotated CDS and ncRNAs with their 10403S locus name, 10403S start and end coordinates, lengths, strand, EGD-e locus, EGD-e gene name, EGD-e common name, EGD-e role category, codon bias, GEI, average GEI in 10403S and ΔsigB strains, fold change for the four possible comparisons involving the two replicates with 10403S and the ΔsigB strains, q-values of the binomial tests, operon annotation, promoter annotation, list of σB-dependent genes identified in this study, and data from 3 other studies of the σB regulon in L. monocytogenes using microarrays including Ollinger et al. , Hain et al.  , and Raengpradub et al. .
Gene Expression Index
Rapid Amplification of cDNA Ends PCR
False Discovery Rate
Hidden Markov Model
This work was funded by NIH-NIAID (R01 AI052151 to K.J.B.). U.K. was supported by NSF (award no. 0644136). We thank P. Schweitzer and the staff at the Cornell DNA Sequencing and Genotyping Core Facility for sample preparation and sequencing and A. G. Clark and T. B. Sackton for helpful discussion.
- Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009, 10 (1): 57-63. 10.1038/nrg2484.PubMed CentralView ArticlePubMedGoogle Scholar
- Mead PS, Slutsker L, Dietz V, McCaig LF, Bresee JS, Shapiro C, Griffin PM, Tauxe RV: Food-related illness and death in the United States. Emerg Infect Dis. 1999, 5 (5): 607-625. 10.3201/eid0505.990502.PubMed CentralView ArticlePubMedGoogle Scholar
- Begley M, Gahan CG, Hill C: Bile stress response in Listeria monocytogenes LO28: adaptation, cross-protection, and identification of genetic loci involved in bile resistance. Appl Environ Microbiol. 2002, 68 (12): 6005-6012. 10.1128/AEM.68.12.6005-6012.2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Phan-Thanh L, Gormon T: Analysis of heat and cold shock proteins in Listeria by two-dimensional electrophoresis. Electrophoresis. 1995, 16 (3): 444-450. 10.1002/elps.1150160172.View ArticlePubMedGoogle Scholar
- Watkins J, Sleath KP: Isolation and enumeration of Listeria monocytogenes from sewage, sewage sludge and river water. J Appl Bacteriol. 1981, 50 (1): 1-9.View ArticlePubMedGoogle Scholar
- Chaturongakul S, Raengpradub S, Wiedmann M, Boor KJ: Modulation of stress and virulence in Listeria monocytogenes. Trends Microbiol. 2008, 16 (8): 388-396. 10.1016/j.tim.2008.05.006.PubMed CentralView ArticlePubMedGoogle Scholar
- Kazmierczak MJ, Wiedmann M, Boor KJ: Alternative sigma factors and their roles in bacterial virulence. Microbiol Mol Biol Rev. 2005, 69 (4): 527-543. 10.1128/MMBR.69.4.527-543.2005.PubMed CentralView ArticlePubMedGoogle Scholar
- Piggot PJ, Hilbert DW: Sporulation of Bacillus subtilis. Curr Opin Microbiol. 2004, 7 (6): 579-586. 10.1016/j.mib.2004.10.001.View ArticlePubMedGoogle Scholar
- Wiedmann M, Arvik TJ, Hurley RJ, Boor KJ: General stress transcription factor σB and its role in acid tolerance and virulence of Listeria monocytogenes. J Bacteriol. 1998, 180 (14): 3650-3656.PubMed CentralPubMedGoogle Scholar
- Raengpradub S, Wiedmann M, Boor KJ: Comparative analysis of the σB-dependent stress responses in Listeria monocytogenes and Listeria innocua strains exposed to selected stress conditions. Appl Environ Microbiol. 2008, 74 (1): 158-171. 10.1128/AEM.00951-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Hain T, Hossain H, Chatterjee SS, Machata S, Volk U, Wagner S, Brors B, Haas S, Kuenne CT, Billion A, et al: Temporal transcriptomic analysis of the Listeria monocytogenes EGD-e σB regulon. BMC Microbiol. 2008, 8: 20-10.1186/1471-2180-8-20.PubMed CentralView ArticlePubMedGoogle Scholar
- Ollinger J, Bowen B, Wiedmann M, Boor KJ, Bergholtz TM: Listeria monocytogenes σB modulates PrfA-mediated virulence factor expression. Infect Immun. 2009, 77 (5): 2113-2124. 10.1128/IAI.01205-08.PubMed CentralView ArticlePubMedGoogle Scholar
- Asmann YW, Wallace MB, Thompson EA: Transcriptome profiling using next-generation sequencing. Gastroenterology. 2008, 135 (5): 1466-1468. 10.1053/j.gastro.2008.09.042.View ArticlePubMedGoogle Scholar
- Mockler TC, Ecker JR: Applications of DNA tiling arrays for whole-genome analysis. Genomics. 2005, 85 (1): 1-15. 10.1016/j.ygeno.2004.10.005.View ArticlePubMedGoogle Scholar
- Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M: The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008, 320 (5881): 1344-1349. 10.1126/science.1158441.PubMed CentralView ArticlePubMedGoogle Scholar
- Yoder-Himes DR, Chain PSG, Zhu Y, Wurtzel O, Rubin EM, Tiedje JM, Sorek R: Mapping the Burkholderia cenocepacia niche response via high-throughput sequencing. Proc Natl Acad Sci USA. 2009, 106 (10): 3976-3981. 10.1073/pnas.0813403106.PubMed CentralView ArticlePubMedGoogle Scholar
- Schmittgen TD, Lee EJ, Jiang J, Sarkar A, Yang L, Elton TS, Chen C: Real-time PCR quantification of precursor and mature microRNA. Methods. 2008, 44 (1): 31-38. 10.1016/j.ymeth.2007.09.006.PubMed CentralView ArticlePubMedGoogle Scholar
- Glaser P, Frangeul L, Buchrieser C, Rusniok C, Amend A, Baquero F, Berche P, Bloecker H, Brandt P, Chakraborty T, et al: Comparative genomics of Listeria species. Science. 2001, 294 (5543): 849-852.PubMedGoogle Scholar
- Christiansen JK, Nielsen JS, Ebersbach T, Valentin-Hansen P, Sogaard-Andersen L, Kallipolitis BH: Identification of small Hfq-binding RNAs in Listeria monocytogenes. RNA (NY). 2006, 12 (7): 1383-1396. 10.1261/rna.49706.View ArticleGoogle Scholar
- Toledo-Arana A, Dussurget O, Nikitas G, Sesto N, Guet-Revillet H, Balestrino D, Loh E, Gripenland J, Tiensuu T, Vaitkevicius K, et al: The Listeria transcriptional landscape from saprophytism to virulence. Nature. 2009, 459: 950-956. 10.1038/nature08080.View ArticlePubMedGoogle Scholar
- Nielsen JS, Olsen AS, Bonde M, Valentin-Hansen P, Kallipolitis BH: Identification of a sigma B-dependent small noncoding RNA in Listeria monocytogenes. J Bacteriol. 2008, 190 (18): 6264-6270. 10.1128/JB.00740-08.PubMed CentralView ArticlePubMedGoogle Scholar
- Mandin P, Repoila F, Vergassola M, Geissmann T, Cossart P: Identification of new noncoding RNAs in Listeria monocytogenes and prediction of mRNA targets. Nucleic Acids Res. 2007, 35 (3): 962-974. 10.1093/nar/gkl1096.PubMed CentralView ArticlePubMedGoogle Scholar
- Kazmierczak MJ, Mithoe SC, Boor KJ, Wiedmann M: Listeria monocytogenes σB regulates stress response and virulence functions. J Bacteriol. 2003, 185 (19): 5722-5734. 10.1128/JB.185.19.5722-5734.2003.PubMed CentralView ArticlePubMedGoogle Scholar
- Shetron-Rama LM, Mueller K, Bravo JM, Bouwer HG, Way SS, Freitag NE: Isolation of Listeria monocytogenes mutants with high-level in vitro expression of host cytosol-induced gene products. Mol Microbiol. 2003, 48 (6): 1537-1551. 10.1046/j.1365-2958.2003.03534.x.View ArticlePubMedGoogle Scholar
- McGann P, Raengpradub S, Ivanek R, Wiedmann M, Boor KJ: Differential regulation of Listeria monocytogenes internalin and internalin-like genes by σB and PrfA as revealed by subgenomic microarray analyses. Foodborne Pathog Dis. 2008, 5 (4): 417-435. 10.1089/fpd.2008.0085.PubMed CentralView ArticlePubMedGoogle Scholar
- Mueller KJ, Freitag NE: Pleiotropic enhancement of bacterial pathogenesis resulting from the constitutive activation of the Listeria monocytogenes regulatory factor PrfA. Infect Immun. 2005, 73 (4): 1917-1926. 10.1128/IAI.73.4.1917-1926.2005.PubMed CentralView ArticlePubMedGoogle Scholar
- Kim H, Marquis H, Boor KJ: σB contributes to Listeria monocytogenes invasion by controlling expression of inlA and inlB. Microbiology. 2005, 151 (Pt 10): 3215-3222. 10.1099/mic.0.28070-0.View ArticlePubMedGoogle Scholar
- Mao C, Evans C, Jensen RV, Sobral BW: Identification of new genes in Sinorhizobium meliloti using the Genome Sequencer FLX system. BMC Microbiol. 2008, 8: 72-10.1186/1471-2180-8-72.PubMed CentralView ArticlePubMedGoogle Scholar
- Liu JM, Livny J, Lawrence MS, Kimball MD, Waldor MK, Camilli A: Experimental discovery of sRNAs in Vibrio cholerae by direct cloning, 5S/tRNA depletion and parallel sequencing. Nucleic Acids Res. 2009, 37 (6): e46--10.1093/nar/gkp080.PubMed CentralView ArticlePubMedGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Meth. 2008, 5 (7): 621-628. 10.1038/nmeth.1226.View ArticleGoogle Scholar
- Wilhelm BT, Marguerat S, Watt S, Schubert F, Wood V, Goodhead I, Penkett CJ, Rogers J, Bahler J: Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution. Nature. 2008, 453 (7199): 1239-1243. 10.1038/nature07002.View ArticlePubMedGoogle Scholar
- Chan YC, Boor KJ, Wiedmann M: σB-dependent and σB-independent mechanisms contribute to transcription of Listeria monocytogenes cold stress genes during cold shock and cold growth. Appl Environ Microbiol. 2007, 73 (19): 6019-6029. 10.1128/AEM.00714-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Kazmierczak MJ, Wiedmann M, Boor KJ: Contributions of Listeria monocytogenes σB and PrfA to expression of virulence and stress response genes during extra- and intracellular growth. Microbiology. 2006, 152 (6): 1827-1838. 10.1099/mic.0.28758-0.View ArticlePubMedGoogle Scholar
- Tjaden B, Saxena RM, Stolyar S, Haynor DR, Kolker E, Rosenow C: Transcriptome analysis of Escherichia coli using high-density oligonucleotide probe arrays. Nucleic Acids Res. 2002, 30 (17): 3732-3738. 10.1093/nar/gkf505.PubMed CentralView ArticlePubMedGoogle Scholar
- Sultan M, Schulz MH, Richard H, Magen A, Klingenhoff A, Scherf M, Seifert M, Borodina T, Soldatov A, Parkhomchuk D, et al: A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome. Science. 2008, 321 (5891): 956-960. 10.1126/science.1160342.View ArticlePubMedGoogle Scholar
- Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y: RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008, 18 (9): 1509-1517. 10.1101/gr.079558.108.PubMed CentralView ArticlePubMedGoogle Scholar
- Gouy M, Gautier C: Codon usage in bacteria: correlation with gene expressivity. Nucl Acids Res. 1982, 10 (22): 7055-7074. 10.1093/nar/10.22.7055.PubMed CentralView ArticlePubMedGoogle Scholar
- Ikemura T: Codon usage and tRNA content in unicellular and multicellular organisms. Mol Biol Evol. 1985, 2 (1): 13-34.PubMedGoogle Scholar
- Kanaya S, Yamada Y, Kudo Y, Ikemura T: Studies of codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific diversity of codon usage based on multivariate analysis. Gene. 1999, 238 (1): 143-155. 10.1016/S0378-1119(99)00225-5.View ArticlePubMedGoogle Scholar
- Keiler KC: Biology of trans-translation. Ann Rev Microbiol. 2008, 62: 133-151. 10.1146/annurev.micro.62.081307.162948.View ArticleGoogle Scholar
- Trotochaud AE, Wassarman KM: 6S RNA function enhances long-term cell survival. J Bacteriol. 2004, 186 (15): 4978-4985. 10.1128/JB.186.15.4978-4985.2004.PubMed CentralView ArticlePubMedGoogle Scholar
- Loewen PC, Hengge-Aronis R: The role of the sigma factor sigma S (KatF) in bacterial global regulation. Annu Rev Microbiol. 1994, 48: 53-80. 10.1146/annurev.mi.48.100194.000413.View ArticlePubMedGoogle Scholar
- Archambaud C, Nahori MA, Pizarro-Cerda J, Cossart P, Dussurget O: Control of Listeria superoxide dismutase by phosphorylation. J Biol Chem. 2006, 281 (42): 31812-31822. 10.1074/jbc.M606249200.View ArticlePubMedGoogle Scholar
- Chan YC, Raengpradub S, Boor KJ, Wiedmann M: Microarray-based characterization of the Listeria monocytogenes cold regulon in log- and stationary-phase cells. Appl Environ Microbiol. 2007, 73 (20): 6484-6498. 10.1128/AEM.00897-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Graumann PL, Marahiel MA: Cold shock proteins CspB and CspC are major stationary-phase-induced proteins in Bacillus subtilis. Arch Microbiol. 1999, 171 (2): 135-138. 10.1007/s002030050690.View ArticlePubMedGoogle Scholar
- Jin B, Newton SM, Shao Y, Jiang X, Charbit A, Klebba PE: Iron acquisition systems for ferric hydroxamates, haemin and haemoglobin in Listeria monocytogenes. Mol Microbiol. 2006, 59 (4): 1185-1198. 10.1111/j.1365-2958.2005.05015.x.View ArticlePubMedGoogle Scholar
- Olsen KN, Larsen MH, Gahan CGM, Kallipolitis B, Wolf XA, Rea R, Hill C, Ingmer H: The Dps-like protein Fri of Listeria monocytogenes promotes stress tolerance and intracellular multiplication in macrophage-like cells. Microbiology. 2005, 151 (3): 925-933. 10.1099/mic.0.27552-0.View ArticlePubMedGoogle Scholar
- Schmid B, Klumpp J, Raimann E, Loessner MJ, Stephan R, Tasara T: Role of cold shock proteins (Csp) for growth of Listeria monocytogenes under cold and osmotic stress conditions. Appl Environ Microbiol. 2009, 75 (6): 1621-1627. 10.1128/AEM.02154-08.PubMed CentralView ArticlePubMedGoogle Scholar
- Vasconcelos JA, Deneer HG: Expression of superoxide dismutase in Listeria monocytogenes. Appl Environ Microbiol. 1994, 60 (7): 2360-2366.PubMed CentralPubMedGoogle Scholar
- Bigot A, Pagniez H, Botton E, Frehel C, Dubail I, Jacquet C, Charbit A, Raynaud C: Role of FliF and FliI of Listeria monocytogenes in flagellar assembly and pathogenicity. Infect Immun. 2005, 73 (9): 5530-5539. 10.1128/IAI.73.9.5530-5539.2005.PubMed CentralView ArticlePubMedGoogle Scholar
- Way SS, Thompson LJ, Lopes JE, Hajjar AM, Kollmann TR, Freitag NE, Wilson CB: Characterization of flagellin expression and its role in Listeria monocytogenes infection and immunity. Cell Microbiol. 2004, 6 (3): 235-242. 10.1046/j.1462-5822.2004.00360.x.View ArticlePubMedGoogle Scholar
- Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz H-R, Ceric G, Forslund K, Eddy SR, Sonnhammer ELL, et al: The Pfam protein families database. Nucl Acids Res. 2008, 36 (suppl_1): D281-288.PubMed CentralPubMedGoogle Scholar
- Nielsen JS, Olsen AS, Bonde M, Valentin-Hansen P, Kallipolitis BH: Identification of a σB-dependent small noncoding RNA in Listeria monocytogenes. J Bacteriol. 2008, 190 (18): 6264-6270. 10.1128/JB.00740-08.PubMed CentralView ArticlePubMedGoogle Scholar
- Lister R, O'Malley RC, Tonti-Filippini J, Gregory BD, Berry CC, Millar AH, Ecker JR: Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 2008, 133 (3): 523-536. 10.1016/j.cell.2008.03.029.PubMed CentralView ArticlePubMedGoogle Scholar
- Morin R, Bainbridge M, Fejes A, Hirst M, Krzywinski M, Pugh T, McDonald H, Varhol R, Jones S, Marra M: Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing. BioTechniques. 2008, 45 (1): 81-94. 10.2144/000112900.View ArticlePubMedGoogle Scholar
- Chai Y, Kolter R, Losick R: A widely conserved gene cluster required for lactate utilization in Bacillus subtilis and its involvement in biofilm formation. J Bacteriol. 2009, 191 (8): 2423-2430. 10.1128/JB.01464-08.PubMed CentralView ArticlePubMedGoogle Scholar
- Hillerich B, Westpheling J: A new GntR family transcriptional regulator in Streptomyces coelicolor is required for morphogenesis and antibiotic production and controls transcription of an ABC transporter in response to carbon source. J Bacteriol. 2006, 188 (21): 7477-7487. 10.1128/JB.00898-06.PubMed CentralView ArticlePubMedGoogle Scholar
- Ogasawara H, Ishida Y, Yamada K, Yamamoto K, Ishihama A: PdhR (pyruvatedehydrogenase complex regulator) controls the respiratory electron transport system in Escherichia coli. J Bacteriol. 2007, 189 (15): 5534-5541. 10.1128/JB.00229-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Ferreira A, O'Byrne CP, Boor KJ: Role of σB in heat, ethanol, acid, and oxidative stress resistance and during carbon starvation in Listeria monocytogenes. Appl Environ Microbiol. 2001, 67 (10): 4454-4457. 10.1128/AEM.67.10.4454-4457.2001.PubMed CentralView ArticlePubMedGoogle Scholar
- Moorhead SM, Dykes GA: The role of the sigB gene in the general stress response of Listeria monocytogenes varies between a strain of serotype 1/2a and a strain of serotype 4c. Curr Microbiol. 2003, 46 (6): 461-466. 10.1007/s00284-002-3867-6.View ArticlePubMedGoogle Scholar
- Garner MR, Njaa BL, Wiedmann M, Boor KJ: Sigma B contributes to Listeria monocytogenes gastrointestinal infection but not to systemic spread in the guinea pig infection model. Infect Immun. 2006, 74 (2): 876-886. 10.1128/IAI.74.2.876-886.2006.PubMed CentralView ArticlePubMedGoogle Scholar
- Sleator RD, Clifford T, Hill C: Gut osmolarity: A key environmental cue initiating the gastrointestinal phase of Listeria monocytogenes infection?. Med Hypotheses. 2007, 69 (5): 1090-1092. 10.1016/j.mehy.2007.02.028.View ArticlePubMedGoogle Scholar
- Sue D, Boor KJ, Wiedmann M: σB-dependent expression patterns of compatible solute transporter genes opuCA and lmo1421 and the conjugated bile salt hydrolase gene bsh in Listeria monocytogenes. Microbiology. 2003, 149 (Pt 11): 3247-3256. 10.1099/mic.0.26526-0.View ArticlePubMedGoogle Scholar
- Sue D, Fink D, Wiedmann M, Boor KJ: σB-dependent gene induction and expression in Listeria monocytogenes during osmotic and acid stress conditions simulating the intestinal environment. Microbiology. 2004, 150 (Pt 11): 3843-3855. 10.1099/mic.0.27257-0.View ArticlePubMedGoogle Scholar
- Core LJ, Waterfall JJ, Lis JT: Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science. 2008, 322 (5909): 1845-1848. 10.1126/science.1162228.PubMed CentralView ArticlePubMedGoogle Scholar
- Cetin MS, Zhang C, Hutkins RW, Benson AK: Regulation of transcription of compatible solute transporters by the general stress sigma factor, σB, in Listeria monocytogenes. J Bacteriol. 2004, 186 (3): 794-802. 10.1128/JB.186.3.794-802.2004.PubMed CentralView ArticlePubMedGoogle Scholar
- Fraser KR, Sue D, Wiedmann M, Boor K, O'Byrne CP: Role of σB in regulating the compatible solute uptake systems of Listeria monocytogenes: osmotic induction of opuC is σB dependent. Appl Environ Microbiol. 2003, 69 (4): 2015-2022. 10.1128/AEM.69.4.2015-2022.2003.PubMed CentralView ArticlePubMedGoogle Scholar
- Kingsford C, Ayanbule K, Salzberg S: Rapid, accurate, computational discovery of Rho-independent transcription terminators illuminates their relationship to DNA uptake. Genome Biol. 2007, 8 (2): R22-10.1186/gb-2007-8-2-r22.PubMed CentralView ArticlePubMedGoogle Scholar
- Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, Barrell B: Artemis: sequence visualization and annotation. Bioinformatics. 2000, 16 (10): 944-945. 10.1093/bioinformatics/16.10.944.View ArticlePubMedGoogle Scholar
- Storey JD, Tibshirani R: Statistical significance for genomewide studies. Proc Natl Acad Sci USA. 2003, 100 (16): 9440-9445. 10.1073/pnas.1530509100.PubMed CentralView ArticlePubMedGoogle Scholar
- Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14 (6): 1188-1190. 10.1101/gr.849004.PubMed CentralView ArticlePubMedGoogle Scholar
- Rice P, Longden I, Bleasby A: EMBOSS: the european molecular biology open software suite. Trends Genet. 2000, 16 (6): 276-277. 10.1016/S0168-9525(00)02024-2.View ArticlePubMedGoogle Scholar
- Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc B. 1995, 57 (1): 289-300.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.