Multiple-omic data analysis of Klebsiella pneumoniae MGH 78578 reveals its transcriptional architecture and regulatory features
- Joo-Hyun Seo†1,
- Jay Sung-Joong Hong†1, 3,
- Donghyuk Kim1,
- Byung-Kwan Cho1, 4,
- Tzu-Wen Huang1, 2,
- Shih-Feng Tsai2,
- Bernhard O Palsson1 and
- Pep Charusanti1Email author
© Seo et al.; licensee BioMed Central Ltd. 2012
Received: 3 August 2012
Accepted: 21 November 2012
Published: 29 November 2012
The increasing number of infections caused by strains of Klebsiella pneumoniae that are resistant to multiple antibiotics has developed into a major medical problem worldwide. The development of next-generation sequencing technologies now permits rapid sequencing of many K. pneumoniae isolates, but sequence information alone does not provide important structural and operational information for its genome.
Here we take a systems biology approach to annotate the K. pneumoniae MGH 78578 genome at the structural and operational levels. Through the acquisition and simultaneous analysis of multiple sample-matched –omics data sets from two growth conditions, we detected 2677, 1227, and 1066 binding sites for RNA polymerase, RpoD, and RpoS, respectively, 3660 RNA polymerase-guided transcript segments, and 3585 transcription start sites throughout the genome. Moreover, analysis of the transcription start site data identified 83 probable leaderless mRNAs, while analysis of unannotated transcripts suggested the presence of 119 putative open reading frames, 15 small RNAs, and 185 antisense transcripts that are not currently annotated.
These findings highlight the strengths of systems biology approaches to the refinement of sequence-based annotations, and to provide new insight into fundamental genome-level biology for this important human pathogen.
KeywordsKlebsiella pneumoniae Infectious disease Transcriptional architecture Omics data Systems biology
The number of infections caused by pathogenic microorganisms that are resistant to at least one antibiotic has grown at an alarming rate over the past several decades. These multi-drug resistant organisms have reduced the clinical utility of many commonly-used antibiotics, and pan-resistant strains now threaten to render some infectious agents nearly untreatable. The Infectious Diseases Society of America has identified six multi-drug resistant pathogens in particular that pose the gravest threat to human health [1, 2], one of which is the Gram-negative bacterium Klebsiella pneumoniae. K. pneumoniae is a member of the family Enterobacteriaceae, and exhibits close genetic relationship to several genera within this family, especially Escherichia. Despite this similarity, many Klebsiella species, including Klebsiella pneumoniae, possess a thick, extracellular polysaccharide capsule that distinguishes this genus from other enterobacteria. This capsule is thought to be a significant virulence factor that helps to protect the bacterium during infection from phagocytosis [3–5] and antimicrobial peptides . K. pneumoniae causes a wide range of diseases worldwide such as pneumonia, urinary tract infections, and surgical wound infections that primarily afflict immunocompromised patients. There are also highly invasive community-acquired pathotypes characterized by bacteremic liver abscesses or endophthalmitis that are particularly endemic in Asia [7–9], especially in Taiwan [10–13], and reports of their occurrence are now emerging in other parts of the world [14–20].
To combat the threat posed by K. pneumoniae and other drug-resistant pathogens, the genomes from many clinical and laboratory-derived isolates have been sequenced to investigate the genetic basis of infection-relevant phenotypes such as virulence and antibiotic resistance [21, 22]. Several notable discoveries have been made through these sequencing efforts, for example the identification of plasmids that bear New Delhi metallo-β-lactamase 1 (NDM-1), a gene that confers resistance to the last-line carbapenem antibiotics that are used to treat difficult K. pneumoniae infections . Other studies have sought to build upon the abundance of sequence data to delineate fundamental operational features of the genome, for instance the identity and binding site locations of major transcription factors, the environmental signals that stimulate transcription of certain genes, the architecture of operons, sub-operons, and transcription units, the existence of small non-coding RNAs, and other elements [24–27]. Such studies often rely on the acquisition and analysis of genome-wide data sets such as chromatin immunoprecipitation combined with microarray hybridization (ChIP-chip), transcriptome profiling, proteomics, and metabolomics, ultimately resulting in a global map of the transcriptional architecture for the bacterium under defined growth conditions. In turn, this map provides a fundamental link between the genotype and phenotype for the organism.
Using ChIP-chip, tiling array, and deep sequencing technology, we report here a delineation of the transcriptional architecture for a pathogenic, multi-drug resistant strain of Klebsiella pneumoniae during exponential and stationary phase growth. Key findings include the detection of over 1000 binding sites for RpoD and RpoS, nearly 200 RNA transcript segments that have multiple transcription start sites, and over 80 leaderless mRNAs.
Analysis of transcriptional architecture
Experimentally derived annotation of the Klebsiella pneumoniae MGH 78578 genome
RNAP binding sites
RpoD binding sites
RpoS binding sites
RTSs – total
RTSs – exponential phase
RTSs – stationary phase
RTSs – both phases
To establish transcription start sites, we performed a simultaneous analysis of both our raw TSS data and the 3660 RTSs and assigned a TSS if it appeared within 200 bp from the 5′ start point of an RTS. This analysis resulted in a total of 3585 TSS signals, 3322 on the chromosome and 263 on the five plasmids. One-hundred ninety-three RTSs were observed to have two or more TSSs. Based on COG classification, the largest group among these 193 RTSs was a set of 26 that are involved in transcription (Additional file 2). Four of these 26 transcription-related RTSs have three or more TSSs. For example, the RTS that includes KPN_01305, a transcriptional regulator involved in biosynthesis and transport of aromatic amino acids, has four TSSs. The existence of multiple TSSs suggests that this and other similar genes are transcribed under multiple, specific conditions rather than under conditions of general growth. When the TSS data was extended to include both the RTS and expression profiling data, we observed 83 leaderless mRNAs, defined here as transcripts with a 5′ UTR length of less than or equal to 5 bp. This number is nearly double what has been reported for other bacteria such as Salmonella enterica serovar Typhimurium strain SL1344 (23 transcripts) , Helicobacter pylori (34 transcripts) [26, 29], and Geobacter sulfurreducens (52 transcripts) .
Among non-coding RNAs, the current annotation for the K. pneumoniae MGH 78578 genome contains 86 tRNAs and 25 rRNAs and an unknown number of small RNAs (sRNA). The presence of sRNAs are much more difficult to predict because genome annotation algorithms are based predominantly on protein coding regions. The possible existence and location of sRNAs are therefore frequently extracted from whole-genome RNA expression data sets that probe not just open reading frames, but the intergenic regions where sRNAs are located as well [30, 31].
Since our tiled-array data provide this whole-genome information, we examined whether we could detect sRNAs in K. pneumoniae by analyzing unannotated transcripts in our data set using the Rfam database . Out of 447 unannotated transcripts, 15 of them matched an sRNA already reported in Rfam (Additional file 3). Among this list, we could detect high-level transcription of SraD (genomic position: 3318095~3318169, + strand), SroB (genomic position: 510363~510437, + strand) and CsrC (genomic position: 4572620~4572846, + strand) during stationary phase growth only, an observation that is consistent with data from other Enterobacteria [33–36]. Nine of the sRNAs, RyhB, SraL, SroB, SraC/RyeA, MicF, SraD, GcvB, SraH and IsrN, are reported to possess the ability to bind Hfq, a bacterial RNA binding protein . Six sRNAs, SroB, SraC/RyeA, MicF, SraD, GcvB, and SraH, are reported to regulate protein translation through antisense binding to target mRNAs. Interestingly, we could detect the transcription of both RyeB and SrcC/RyeA sRNA even though their genomic locations overlap on opposite strands (genomic position of RyeB: 2580980~2581070, – strand; genomic position of SrcC/RyeA: 2580976~2581120, + strand). However, we could detect transcription of SrcC/RyeA only during exponential phase and RyeB only during stationary phase. This observation suggests that the two sRNAs might act in a coordinated manner to regulate different aspects of growth rather than in a concerted, simultaneous manner. We cannot rule out the possibility, however, that SrcC/RyeA and RyeB might be expressed at low levels during stationary and exponential phase, respectively, that are below the detection limits of the measurement and data analysis systems employed here.
Putative open reading frames
Several RTSs in our data sets did not correspond to any ORFs in the current sequence-based annotation for K. pneumoniae MGH 78578. For each of these unannotated transcripts, we investigated whether they might show homology to any known gene products by first searching for start and stop codons in the RTS that yielded the longest DNA sequence and had the same reading frame. This DNA sequence was then translated into a peptide sequence and a BLASTp search performed against the RefSeq database with a cutoff E-value of 0.001. This analysis resulted in 119 putative ORFs, 40 of which reside on one of the five plasmids harbored by this strain (Additional file 4). Most of these putative ORFs are currently annotated as hypothetical proteins, but several show high homology (< E-110, Additional file 4) to annotated genes from other strains or species of Klebsiella or to members from other genera. In turn, this close match raises the probability that the underlying putative ORF does indeed encode a known gene product. For example, the sequence for ‘RTS_NC_009648_322945_323714_+_exp’ is homologous to HAD hydrolase from Klebsiella sp. MS 92–3, but is not annotated as such in K. pneumoniae MGH 78578. Other examples include a set of seven RTSs encoded by the pKPN5 plasmid that show homology to resolvase proteins from E. coli MS 107–1 (Additional file 4). These and other examples (Additional file 4) highlight the significant strength that comes from the use of sample-matched multi –omic data sets to experimentally refine genome annotations based primarily on computational predictions.
Through further analysis of unannotated transcripts, we identified 185 probable antisense transcripts in the K. pneumoniae transcriptome using a cutoff of 90% overlap to the corresponding gene (Additional file 5). This number decreases to 146 probable antisense transcripts when a cutoff of 100% overlap is used. We used the classification scheme of Yin et al.  to group the antisense transcripts into three categories: 5′ overlapping, 3′ overlapping, and ‘completely covered’. This categorization indicates which parts of the two sequences overlap . We further divided ‘completely covered’ into two sub-categories: ‘antisense RTS completely covered by sense RTS’ and ‘sense RTS completely covered by antisense RTS’. The current annotation of the K. pneumoniae MGH 78578 genome lists 5305 genes, approximately 800 more than E. coli or Bacillus subtilis; therefore, 3.5% of the genes in K. pneumoniae have antisense transcription using an overlap of 90%, which is a similar value to what has been reported for E. coli. Additional studies in E. coli as well as data from other bacteria [40, 41] suggest, however, that the proportion of antisense transcripts is closer to ~10-20% of the number of genes in a bacterium. Consequently, the smaller number reported here for K. pneumoniae might reflect an incomplete list of antisense transcripts due to low detection sensitivity.
The expression of antisense transcripts for certain genes often varied with the growth phase. For example, we detected antisense transcription from 8 tRNA genes during stationary phase but only one of these genes, the tRNA for serine (KPN_02431), had antisense transcription during exponential phase. Similarly, there was antisense transcription for fructose-1,6-bisphosphatase (KPN_04626), an amino acid exporter (KPN_02015), the transcriptional regulator gene lysR (KPN_02148), and the transcriptional regulator gene argR (KPN_03645) during stationary phase but not exponential phase. LysR is negatively autoregulated and coordinately activates transcription of lysA (KPN_03252), which encodes the enzyme catalyzing the last step in lysine biosynthesis [42, 43]. ArgR complexed with L-arginine represses the transcription of several genes involved in the biosynthesis and transport of arginine and histidine, and activates genes for arginine catabolism [44, 45]. ArgR represses the expression of ABC transporters for putrescine, lysine, and ornithine as well . Since the inhibition of this large set of genes leads to the reduced uptake of these nutrients, the regulation of ArgR expression by antisense transcription is one possible way to adjust metabolism during stationary phase.
We detected antisense transcripts for the marR and marB genes within the marRAB operon during exponential phase growth. We focused attention on this operon since MarA is known to play a role in pathogenesis: the protein activates genes that mitigate the effects of exposure to environmental stresses such as antibiotics and oxidants [47, 48]. MarR is the transcriptional repressor of the marRAB operon, but the function of MarB is unknown. We could not detect any antisense transcription for marA, suggesting that the regulation of this transcriptional activator occurs through MarR and possibly MarB rather than antisense control of marA itself. Curiously, we detected antisense transcripts for soxS, which is a dual transcriptional activator that helps to protect the cell against oxidative stress , despite the absence of antibiotics or other factors known to promote this phenomenon. Many factors contribute to the ability of K. pneumoniae to resist many antibiotics, but these observations suggest that transcriptional regulators and their antisense transcripts might play a role in this process.
Transcription network among sigma factors
When phase-specific expression levels of the five sigma factors were compared, rpoD, rpoN, and rpoH had higher expression levels in exponential phase than in stationary phase (Additional file 6), whereas rpoS and rpoE had higher expression levels in stationary phase. The expression level of rpoD, rpoN, and rpoH in stationary phase decreased to 48%, 47%, and 73%, respectively, when compared to those in exponential phase. In contrast, the expression level of rpoS and rpoE in stationary phase increased to 430% and 497%, respectively. The dramatic increase of expression level of rpoE is in accordance with a previous report , and implies that RpoE plays a pivotal role in cell survival during prolonged stationary phase.
RpoD and RpoS DNA binding motifs
Transcription of putative virulence genes
The K. pneumoniae MGH 78578 genome contains a number of genes that encode putative virulence factors such as capsular polysaccharides (CPS), siderophore biosynthesis and transport, LPS biosynthesis and transport, and fimbriae . In K. pneumoniae, the expression levels of these genes during stationary phase growth decreased to less than half their exponential phase values (Additional file 6).
The expression of genes associated with siderophore biosynthesis during stationary phase dropped to ~35% of their corresponding exponential phase level (Additional file 6). Reinforcing the expression data, we detected 20 RTSs during exponential phase that contained one or more genes related to siderophore biosynthesis, but we could detect only 10 such RTSs during stationary phase (Additional file 7). When the absolute signal intensity of siderophore-associated genes is taken into account (average log2(signal) = 5.92) and compared to the baseline signal (average log2(signal) = ~6), siderophore-associated genes have nearly no expression during stationary phase.
The expression data for genes involved in CPS biosynthesis followed a similar trend as the expression data for siderophore biosynthesis, but the RTS data differed between the two. During stationary phase, the expression levels of CPS-associated genes fell to 10 - 40% of their values during exponential phase. On the other hand, more RTSs containing CPS-associated genes were detected during stationary (26) than exponential (24) phase. These data imply that post-transcriptional regulation might play a greater role in modulating the transcript abundance of CPS-associated genes than siderophore-associated genes during stationary phase.
In contrast to siderophore- and CPS-associated genes, genes associated with both fimbriae and LPS have similar expression levels in both exponential and stationary phases. The average log2(signal) for the two were ~6.5 and ~8, respectively (Additional file 6). Interestingly, however, several fimbriae-associated genes such as KPN_00843 (ompX, outer membrane protein X) and the gene cluster KPN_03275 ~ KPN_03279 (putative fimbriae-related genes, putative fimbria usher protein and putative pili assembly chaperone) showed much higher expression levels during both growth phases (log2(signal) ≥ 12) than other loci that are also associated with fimbriae. Genbank currently does not associate the KPN_03275 ~ KPN_03279 cluster with a specific type of fimbriae, but they are ~99% homologous at the nucleotide level to the mrk JFDCB cluster from K. pneumoniae NTUH-2044 that encodes type 3 fimbriae. This observation implies that type 3 fimbriae constitute the major class of fimbria expressed by K. pneumoniae MGH 78578. In contrast to fimbriae, little variation was detected among the full set of genes associated with LPS biosynthesis, implying that LPS is continually synthesized regardless of growth phase.
Genome sequences are most commonly annotated using bioinformatics-based algorithms, but these algorithms can misannotate genes or introduce other errors . Moreover, a genome sequence by itself provides scant information concerning its functional operation, for example which genes are activated or repressed under specific growth conditions and how their expression is regulated. Against this backdrop, experiment-based techniques such as gene expression and other -omics data provide a foundation with which to verify or correct computation-based annotations at a genome-wide level. We report here such data for Klebsiella pneumoniae MGH 78578 through the integrated analysis of gene expression, ChIP-chip of RNAP, RpoD and RpoS, and TSS data during exponential and stationary phase growth. The integrated analysis of these different data sets ensures that findings from one particular data set are reinforced by another, thereby minimizing potential false-positive and -negative findings that might emerge when these data sets are analyzed in isolation [58, 59].
Small RNAs are increasingly recognized as important, ubiquitous elements that regulate mRNA half-life, protein translation, and other processes, thereby providing an additional layer of regulatory control of multiple target genes . We detected 15 sRNAs in K. pneumoniae through comparison of intergenic transcripts in our data sets to known sRNAs in the Rfam database. Nine of these fifteen have been reported to act through Hfq, and the expression levels for six of them changed at least two-fold in a Δhfq knockout mutant of K. pneumoniae. The expression level for a seventh, RyhB, changed by less than two-fold in the mutant, while an additional two Hfq-binding sRNAs, SraL and IsrN, are newly detected in our data set. All fifteen sRNAs detected here have also been detected in both E. coli and S. Typhimurium [27, 62], suggesting the existence of a common regulatory network involving these sRNAs that is shared among multiple Enterobacteria. K. pneumoniae likely contains many more sRNAs than the 15 detected here, however, since greater numbers of sRNAs have been reported for both E. coli and S. Typhimurium [27, 62].
As with sRNAs, we detected a much smaller number of TSSs in K. pneumoniae than has been reported for E. coli (3585 in K. pneumoniae versus 4133 in E. coli) and transcription units (3660 in K. pneumoniae versus approximately 4661 in E. coli) even though the current GenBank annotations list approximately 800 more genes for K. pneumoniae than for E. coli. These differences likely stem from the greater number of growth conditions that were investigated in the E. coli study, an observation that highlights the plasticity of the transcriptional architecture within these two bacteria as they respond to different environments. S. Typhimurium by comparison has been reported to contain much fewer TSSs, approximately 1900 in number , but this disparity likely arises from the different data analysis procedures that were employed in the S. Typhimurium study versus those for K. pneumoniae and E. coli.
In contrast to sRNA and TSS data, we detected a greater number of leaderless RNAs for K. pneumoniae than have been identified to date in other bacteria. Although this observation suggests that these transcripts might have a functional impact or evolutionary significance that is unique to K. pneumoniae, we anticipate that deep sequencing of the transcriptomes from additional microorganisms under multiple growth conditions will eventually yield a similar amount of leaderless RNAs.
Beyond delineating the transcriptional architecture of K. pneumoniae, the data presented here highlight the significant impact that the growth phase can have on the expression of virulence genes and, by extension, on drug target selection. For instance, one possible antibiotic development strategy is to interfere with non-essential microbial processes such as capsule, siderophore, and fimbriae biosynthesis and quorum sensing that result in weakened virulence but do not kill the pathogen outright. Although there is some debate [63, 64], such strategies are attractive in large part because resistance is expected to emerge at a much slower rate, thereby prolonging the clinical utility of drugs designed with this mechanism of action. Since these processes are non-essential, however, the underlying enzymes might not be present under all conditions. Without a target to inhibit, antibiotics developed against these enzymes would therefore be expected to have little effect. Our data indicate that attempts to inhibit K. pneumoniae enzymes involved in siderophore biosynthesis might fall under this scenario since the transcriptomic and RTS signals from genes involved in this process are much lower during stationary phase than during exponential phase. These findings emphasize the need for in-depth studies to validate targets before the start of an antibiotic discovery program, in particular to establish whether a potential target enzyme is ultimately produced under infection-relevant conditions.
In conclusion, we report here the operational annotation of the Klebsiella pneumoniae MGH 78578 genome during exponential and stationary phase growth in glucose M9. We identified numerous RTSs, unannotated transcripts (i.e., transcription from intergenic regions), different types of regulatory RNAs, and putative ORFs. Additional experimental data to confirm the existence of sRNAs, antisense transcripts, and putative ORFs would yield further insight into important mechanisms underlying transcriptional regulation of this important human pathogen.
Bacterial strain, medium and growth condition
Glucose M9 minimal media was used as the primary culture medium. Glucose (2 g/L) M9 minimal media is composed of 2 mL/L of 1 M MgSO4, 100 μL/L of 1 M CaCl2, 12.8 g/L Na2HPO4·7H2O, 3 g/L KH2PO4, 0.5 g/L NaCl, 1 g/L NH4Cl and 1 ml trace element solution (100X) containing 1 g EDTA, 29 mg ZnSO4·7H2O, 198 mg MnCl2·4H2O, 254 mg CoCl2·6H2O, 13.4 mg CuCl2, and 147 mg CaCl2. Seed cultures of K. pneumoniae were made by inoculating frozen stocks made with 20% glycerol into 3 mL of glucose M9 minimal media and incubating at 37°C. After overnight growth, 5 mL of the seed culture was inoculated into 50 mL fresh glucose M9 minimal media and further cultured at 37°C until it reached an appropriate optical density at 600 nm (OD).
Gene expression profile analysis
Three milliliters of cell culture media in mid-exponential (OD=0.6) phase or stationary (OD=1.3) phase were mixed with 6 mL RNAprotect Bacteria Reagent (Qiagen, Valencia, CA). Samples were immediately vortexed for 5 seconds and incubated for an additional 5 minutes at room temperature. Samples were then centrifuged at 5000 g for 10 minutes and the supernatant discarded. Total RNA samples were then isolated using a RNeasy Plus Mini kit (Qiagen, Valencia, CA) according to the manufacturer’s instructions. Extracted RNA samples were quantified using a NanoDrop 1000 spectrophotometer and the quality of the isolated RNA was checked by visualization on agarose gels and by measuring the sample’s A260/A280 ratio (>1.8). Ten μg of total RNA was used to make cDNA with amino-allyl dUTP by reverse transcription. The amino-allyl labeled cDNA samples were then coupled with Cy3 monoreactive dyes (Amersham/GE Healthcare, Pittsburgh, PA). Cy3-labeled cDNAs were digested with DNase I (Epicentre/Illumina, Madison, WI) to generate 50~300 bp fragments. High-density oligonucleotide tiling arrays custom manufactured by Roche NimbleGen that consisted of 379,528 50-mer probes spaced 30 bp apart across the whole K. pneumoniae genome were used. Hybridization, wash and scan were performed according to the manufacturer’s instructions. Probe level data were normalized with the RMA (robust multi-array analysis) algorithm in NimbleScan 2.4 without background correction
A previously reported ChIP-chip protocol [65, 66] was adopted here for K. pneumoniae. Genome-wide RNAP (Additional file 8) and RpoD (Additional file 9) binding sites were identified using cultures grown to mid-log phase in triplicate. Corresponding RpoS binding site identification was carried out using cultures grown to stationary phase, also in triplicate (Additional file 10). Six μL each of RNAP, RpoD, and RpoS antibody (all from Neoclone, Madison, WI) were used for each experiment. As a control (mock-IP), 2 μg of normal mouse IgG antibody (Upstate/Millipore/Merck, Billerica, MA) was used. Real-time quantitative PCR was performed with previously known binding sites to test the enrichment of the immunoprecipitated (IP) DNA library . qPCR and amplification of DNA was carried out according to the method of Cho et al. . Samples confirmed to be enriched for IP DNA were next hybridized to the microarray, washed, and scanned according to the manufacturer’s directions (Roche NimbleGen).
Total RNA was extracted from two biological replicates for each growth condition using the same method used to acquire gene expression profiles. Terminator 5′-Phosphate Dependent Exonuclease (Epicentre/Illumina, Madison, WI) was used to enrich 5′ tri-phosphorylated mRNAs from the total RNA including 5′ mono-phosphorylated ribosomal RNA (rRNA) and any degraded mRNA at 30°C for 1 hr following the manufacturer’s instructions. The reaction was terminated by adding 1 μL of 100 mM EDTA (pH 8.0). 5′ tri-phosphorylated RNAs were precipitated by standard ethanol precipitation with 40 μg of glycogen. RNA was precipitated at −80°C for 20 min and pelleted, washed with 70% ethanol, dried in a Speed-Vac for 7 minutes without heat, and resuspended in 20 μL nuclease free water. The tri-phosphorylated RNA was then treated with RNA 5′-polyphosphatase (Epicentre/Illumina, Madison, WI) at 37°C for 30 minutes to generate 5′-end mono-phosphorylated RNA for ligation to adaptors. After the 5′-polyphosphatase treatment, RNA was extracted using phenol-chloroform and ethanol precipitation.
To ligate 5′ small RNA adaptor (5′-GUUCAGAGUUCUACAGUCCGACGAUC-3′) to the 5′-end of the mono-phosphorylated RNA, the enriched RNA samples were incubated with 100 μM of the adaptor and 2.5 U of T4 RNA ligase (New England BioLabs, Ipswich, MA). cDNAs were synthesized using the adaptor-ligated mRNAs as template using a modified small RNA reverse transcriptase (RT) primer from Illumina (5′-CAAGCAGAAGACGGCATACGANNNNNNNNN-3′) and Superscript II Reverse Transcriptase (Life Technologies, Carlsbad, CA). The RNA was mixed with 25 μM modified small RNA RT primer and incubated at 70°C for 10 min and then at 25°C for 10 min. Reverse transcription was carried out at 25°C for 10 min, 37°C for 60 min, and 42°C for 60 min, followed by incubation at 70°C for 10 min. After the reaction, RNA was hydrolyzed by adding 20 μL of 1 N NaOH and incubation at 65°C for 30 min. The reaction mixture was neutralized by adding 20 μL of 1 N HCl. The cDNA samples were amplified using a mixture of 1 μL of the cDNA, 10 μL of Phusion HF buffer (New England BioLabs, Ipswich, MA), 1 μL of dNTPs (10 mM), 1 μL SYBR green (Qiagen, Valencia, CA), 0.5 μL of HotStart Phusion DNA polymerase (New England BioLabs, Ipswich, MA), and 5 pmole of small RNA PCR primer mix. The amplification primers used were 5′-AATGATACGGCGACCACCGACAGGTTCAGAGTTCTACAGTCCGA-3′ and 5′-CAAGCAGAAGACGGCATACGA-3′. Amplification was monitored by a LightCycler (Bio-Rad) and stopped at the beginning of the saturation point. Amplified DNA was run on a 6% TBE gel (Life Technologies, Carlsbad, CA) by electrophoresis and DNA ranging from 100 to 300 bp were selected. Gel slices were dissolved in two volumes of EB buffer (Qiagen, Valencia, CA) and 1/10 volume of 3 M sodium acetate (pH 5.2). The amplified DNA was ethanol-precipitated and resuspended in 15 μL DNAse-free water. The final samples were then quantified using a NanoDrop 1000 spectrophotometer. The amplified cDNA libraries were sequenced on an Illumina Genome Analyzer. Sequence cDNA libraries for K. pneumoniae were aligned onto the reference genome sequence for this organism (Genbank accession number: CP000647.1 to CP000652.1), using Mosaik (http://code.google.com/p/mosaik-aligner) with the following arguments: hash size = 10, mismatach = 0, and alignment candidate threshold = 30 bp. The two biological replicates were processed separately, and only sequence reads present in both replicates and aligned to unique genomic location were considered for further study. The genome coordinates of the 5′-end of these uniquely aligned reads were defined as potential TSS. GenBank lists 5185 ORFs for this organism.
Prediction of putative open reading frames (pORFs)
As a first step, transcripts from intergenic regions (i.e., unannotated transcripts) were collected. For each unannotated transcript, we searched for start and stop codons that formed the longest transcript and had the same reading frame. This sequence was defined as a putative ORF (pORF) and translated into a protein sequence. Theoretically translated protein sequences were searched against the RefSeq database using BLASTp. Best hits with E-value less than or equal to 0.001 were listed as pORFs.
Prediction of putative small RNAs (sRNAs)
Each unannotated transcript was searched against the Rfam database (http://rfam.sanger.ac.uk/). sRNA search results from Rfam gave homologous sRNA class with E-value, which are listed in Additional file 3. If the unannotated transcript did not match an entry in Rfam, it was assumed not to be an sRNA.
Prediction of antisense transcripts
Unannotated transcripts were analyzed to determine whether they overlapped with any RTSs from our data set or annotated genes. These unannotated transcripts were listed as antisense transcripts if the overlap was over 90% (Additional file 5).
RpoD and RpoS binding motif analysis
To identify RpoD-specific promoter sequence motifs, we took 50 bp sequences immediately upstream of TSS signals that were located within RpoD ChIP-chip binding regions and analyzed them using the MEME motif search algorithm. The procedure used to determine RpoS-specific promoter sequence motifs was identical except that we analyzed 60 bp genomic sequences rather than 50.
Raw chIP-chip and transcriptomic data files have been deposited into the GEO database with accession numbers GSE35926 and GSE35927.
This study was funded from a grant from the National Research Institute of Taiwan through PH-099-SP-10.
- Rice LB: Federal funding for the study of antimicrobial resistance in nosocomial pathogens: no ESKAPE. J Infect Dis. 2008, 197 (8): 1079-1081. 10.1086/533452.View ArticlePubMed
- Boucher HW, Talbot GH, Bradley JS, Edwards JE, Gilbert D, Rice LB, Scheld M, Spellberg B, Bartlett J: Bad bugs, no drugs: no ESKAPE! An update from the Infectious Diseases Society of America. Clin Infect Dis. 2009, 48 (1): 1-12. 10.1086/595011.View ArticlePubMed
- Domenico P, Salo RJ, Cross AS, Cunha BA: Polysaccharide capsule-mediated resistance to opsonophagocytosis in Klebsiella pneumoniae. Infect Immun. 1994, 62 (10): 4495-4499.PubMed CentralPubMed
- Evrard B, Balestrino D, Dosgilbert A, Bouya-Gachancard JL, Charbonnel N, Forestier C, Tridon A: Roles of capsule and lipopolysaccharide O antigen in interactions of human monocyte-derived dendritic cells and Klebsiella pneumoniae. Infect Immun. 2010, 78 (1): 210-219. 10.1128/IAI.00864-09.PubMed CentralView ArticlePubMed
- Lawlor MS, Hsu J, Rick PD, Miller VL: Identification of Klebsiella pneumoniae virulence determinants using an intranasal infection model. Mol Microbiol. 2005, 58 (4): 1054-1073. 10.1111/j.1365-2958.2005.04918.x.View ArticlePubMed
- Campos MA, Vargas MA, Regueiro V, Llompart CM, Alberti S, Bengoechea JA: Capsule polysaccharide mediates bacterial resistance to antimicrobial peptides. Infect Immun. 2004, 72 (12): 7107-7114. 10.1128/IAI.72.12.7107-7114.2004.PubMed CentralView ArticlePubMed
- Kohayagawa Y, Nakao K, Ushita M, Niino N, Koshizaki M, Yamamori Y, Tokuyasu Y, Fukushima H: Pyogenic liver abscess caused by Klebsiella pneumoniae genetic serotype K1 in Japan. J Infect Chemother. 2009, 15 (4): 248-251. 10.1007/s10156-009-0695-7.View ArticlePubMed
- Siu LK, Fung CP, Chang FY, Lee N, Yeh KM, Koh TH, Ip M: Molecular typing and virulence analysis of serotype K1 Klebsiella pneumoniae strains isolated from liver abscess patients and stool samples from noninfectious subjects in Hong Kong, Singapore, and Taiwan. J Clin Microbiol. 2011, 49 (11): 3761-3765. 10.1128/JCM.00977-11.PubMed CentralView ArticlePubMed
- Chung DR, Lee SS, Lee HR, Kim HB, Choi HJ, Eom JS, Kim JS, Choi YH, Lee JS, Chung MH, et al: Emerging invasive liver abscess caused by K1 serotype Klebsiella pneumoniae in Korea. J Infect. 2007, 54 (6): 578-583. 10.1016/j.jinf.2006.11.008.View ArticlePubMed
- Fang CT, Lai SY, Yi WC, Hsueh PR, Liu KL, Chang SC: Klebsiella pneumoniae genotype K1: an emerging pathogen that causes septic ocular or central nervous system complications from pyogenic liver abscess. Clin Infect Dis. 2007, 45 (3): 284-293. 10.1086/519262.View ArticlePubMed
- Tsai FC, Huang YT, Chang LY, Wang JT: Pyogenic liver abscess as endemic disease, Taiwan. Emerg Infect Dis. 2008, 14 (10): 1592-1600. 10.3201/eid1410.071254.PubMed CentralView ArticlePubMed
- Fung CP, Chang FY, Lee SC, Hu BS, Kuo BI, Liu CY, Ho M, Siu LK: A global emerging disease of Klebsiella pneumoniae liver abscess: is serotype K1 an important factor for complicated endophthalmitis?. Gut. 2002, 50 (3): 420-424. 10.1136/gut.50.3.420.PubMed CentralView ArticlePubMed
- Wang JH, Liu YC, Lee SS, Yen MY, Chen YS, Wann SR, Lin HH: Primary liver abscess due to Klebsiella pneumoniae in Taiwan. Clin Infect Dis. 1998, 26 (6): 1434-1438. 10.1086/516369.View ArticlePubMed
- Pomakova DK, Hsiao CB, Beanan JM, Olson R, MacDonald U, Keynan Y, Russo TA: Clinical and phenotypic differences between classic and hypervirulent Klebsiella pneumonia: an emerging and under-recognized pathogenic variant. Eur J Clin Microbiol Infect Dis. 2012, 31 (6): 981-989. 10.1007/s10096-011-1396-6.View ArticlePubMed
- Dehghani AR, Masjedi A, Fazel F, Ghanbari H, Akhlaghi M, Karbasi N: Endogenous Klebsiella endophthalmitis associated with liver abscess: first case report from iran. Case Report Ophthalmol. 2011, 2 (1): 10-14. 10.1159/000323449.View Article
- Abate G, Koh TH, Gardner M, Siu LK: Clinical and bacteriological characteristics of Klebsiella pneumoniae causing liver abscess with less frequently observed multi-locus sequences type, ST163, from Singapore and Missouri, US. J Microbiol Immunol Infect. 2012, 45 (1): 31-36. 10.1016/j.jmii.2011.09.002.View ArticlePubMed
- Vila A, Cassata A, Pagella H, Amadio C, Yeh KM, Chang FY, Siu LK: Appearance of Klebsiella pneumoniae liver abscess syndrome in Argentina: case report and review of molecular mechanisms of pathogenesis. Open Microbiol J. 2011, 5: 107-113. 10.2174/1874285801105010107.PubMed CentralView ArticlePubMed
- Sobirk SK, Struve C, Jacobsson SG: Primary Klebsiella pneumoniae liver abscess with metastatic spread to lung and eye, a North-European case report of an emerging syndrome. Open Microbiol J. 2010, 4: 5-7. 10.2174/1874285801004010005.PubMed CentralView ArticlePubMed
- Pope JV, Teich DL, Clardy P, McGillicuddy DC: Klebsiella pneumoniae liver abscess: an emerging problem in North America. J Emerg Med. 2011, 41 (5): e103-e105. 10.1016/j.jemermed.2008.04.041.View ArticlePubMed
- Fierer J, Walls L, Chu P: Recurring Klebsiella pneumoniae pyogenic liver abscesses in a resident of San Diego, California, due to a K1 strain carrying the virulence plasmid. J Clin Microbiol. 2011, 49 (12): 4371-4373. 10.1128/JCM.05658-11.PubMed CentralView ArticlePubMed
- Wu KM, Li LH, Yan JJ, Tsao N, Liao TL, Tsai HC, Fung CP, Chen HJ, Liu YM, Wang JT, et al: Genome sequencing and comparative analysis of Klebsiella pneumoniae NTUH-K2044, a strain causing liver abscess and meningitis. J Bacteriol. 2009, 191 (14): 4492-4501. 10.1128/JB.00315-09.PubMed CentralView ArticlePubMed
- Liu P, Li P, Jiang X, Bi D, Xie Y, Tai C, Deng Z, Rajakumar K, Ou HY: Complete genome sequence of Klebsiella pneumoniae subsp. pneumoniae HS11286, a multidrug-resistant strain isolated from human sputum. J Bacteriol. 2012, 194 (7): 1841-1842. 10.1128/JB.00043-12.PubMed CentralView ArticlePubMed
- Kumarasamy KK, Toleman MA, Walsh TR, Bagaria J, Butt F, Balakrishnan R, Chaudhary U, Doumith M, Giske CG, Irfan S, et al: Emergence of a new antibiotic resistance mechanism in India, Pakistan, and the UK: a molecular, biological, and epidemiological study. Lancet Infect Dis. 2010, 10 (9): 597-602. 10.1016/S1473-3099(10)70143-2.PubMed CentralView ArticlePubMed
- Cho BK, Zengler K, Qiu Y, Park YS, Knight EM, Barrett CL, Gao Y, Palsson BO: The transcription unit architecture of the Escherichia coli genome. Nat Biotechnol. 2009, 27 (11): 1043-1049. 10.1038/nbt.1582.View ArticlePubMed
- Qiu Y, Cho BK, Park YS, Lovley D, Palsson BO, Zengler K: Structural and operational complexity of the Geobacter sulfurreducens genome. Genome Res. 2010, 20 (9): 1304-1311. 10.1101/gr.107540.110.PubMed CentralView ArticlePubMed
- Sharma CM, Hoffmann S, Darfeuille F, Reignier J, Findeiss S, Sittka A, Chabas S, Reiche K, Hackermuller J, Reinhardt R, et al: The primary transcriptome of the major human pathogen Helicobacter pylori. Nature. 2010, 464 (7286): 250-255. 10.1038/nature08756.View ArticlePubMed
- Kroger C, Dillon SC, Cameron AD, Papenfort K, Sivasankaran SK, Hokamp K, Chao Y, Sittka A, Hebrard M, Handler K, et al: The transcriptional landscape and small RNAs of Salmonella enterica serovar Typhimurium. Proc Natl Acad Sci U S A. 2012, 109 (20): E1277-E1286. 10.1073/pnas.1201061109.PubMed CentralView ArticlePubMed
- Deutscher J, Francke C, Postma PW: How phosphotransferase system-related protein phosphorylation regulates carbohydrate metabolism in bacteria. Microbiol Mol Biol Rev. 2006, 70 (4): 939-1031. 10.1128/MMBR.00024-06.PubMed CentralView ArticlePubMed
- Laursen BS, Sorensen HP, Mortensen KK, Sperling-Petersen HU: Initiation of protein synthesis in bacteria. Microbiol Mol Biol Rev. 2005, 69 (1): 101-123. 10.1128/MMBR.69.1.101-123.2005.PubMed CentralView ArticlePubMed
- Hershberg R, Altuvia S, Margalit H: A survey of small RNA-encoding genes in Escherichia coli. Nucleic Acids Res. 2003, 31 (7): 1813-1820. 10.1093/nar/gkg297.PubMed CentralView ArticlePubMed
- Zhang Y, Zhang Z, Ling L, Shi B, Chen R: Conservation analysis of small RNA genes in Escherichia coli. Bioinformatics. 2004, 20 (5): 599-603. 10.1093/bioinformatics/btg457.View ArticlePubMed
- Gardner PP, Daub J, Tate J, Moore BL, Osuch IH, Griffiths-Jones S, Finn RD, Nawrocki EP, Kolbe DL, Eddy SR, et al: Rfam: Wikipedia, clans and the “decimal” release. Nucleic Acids Res. 2011, 39 (Database issue): D141-D145.PubMed CentralView ArticlePubMed
- Weilbacher T, Suzuki K, Dubey AK, Wang X, Gudapaty S, Morozov I, Baker CS, Georgellis D, Babitzke P, Romeo T: A novel sRNA component of the carbon storage regulatory system of Escherichia coli. Mol Microbiol. 2003, 48 (3): 657-670. 10.1046/j.1365-2958.2003.03459.x.View ArticlePubMed
- Gudapaty S, Suzuki K, Wang X, Babitzke P, Romeo T: Regulatory interactions of Csr components: the RNA binding protein CsrA activates csrB transcription in Escherichia coli. J Bacteriol. 2001, 183 (20): 6017-6027. 10.1128/JB.183.20.6017-6027.2001.PubMed CentralView ArticlePubMed
- Rasmussen AA, Eriksen M, Gilany K, Udesen C, Franch T, Petersen C, Valentin-Hansen P: Regulation of ompA mRNA stability: the role of a small regulatory RNA in growth phase-dependent control. Mol Microbiol. 2005, 58 (5): 1421-1429. 10.1111/j.1365-2958.2005.04911.x.View ArticlePubMed
- Rasmussen AA, Johansen J, Nielsen JS, Overgaard M, Kallipolitis B, Valentin-Hansen P: A conserved small RNA promotes silencing of the outer membrane protein YbfM. Mol Microbiol. 2009, 72 (3): 566-577. 10.1111/j.1365-2958.2009.06688.x.View ArticlePubMed
- Vogel J, Luisi BF: Hfq and its constellation of RNA. Nat Rev Microbiol. 2011, 9 (8): 578-589. 10.1038/nrmicro2615.PubMed CentralView ArticlePubMed
- Yin Y, Zhao Y, Wang J, Liu C, Chen S, Chen R, Zhao H: AntiCODE: a natural sense-antisense transcripts database. BMC Bioinforma. 2007, 8: 319-10.1186/1471-2105-8-319.View Article
- Dornenburg JE, Devita AM, Palumbo MJ, Wade JT: Widespread antisense transcription in Escherichia coli. MBio. 2010, 1 (1): e00024-10-10.1128/mBio.00024-10.PubMed CentralView ArticlePubMed
- Sharma A, Nitharwal RG, Singh B, Dar A, Dasgupta S, Dhar SK: Helicobacter pylori single-stranded DNA binding protein–functional characterization and modulation of H. pylori DnaB helicase activity. FEBS J. 2009, 276 (2): 519-531. 10.1111/j.1742-4658.2008.06799.x.View ArticlePubMed
- Guell M, van Noort V, Yus E, Chen WH, Leigh-Bell J, Michalodimitrakis K, Yamada T, Arumugam M, Doerks T, Kuhner S, et al: Transcriptome complexity in a genome-reduced bacterium. Science. 2009, 326 (5957): 1268-1271. 10.1126/science.1176951.View ArticlePubMed
- Stragier P, Richaud F, Borne F, Patte JC: Regulation of diaminopimelate decarboxylase synthesis in Escherichia coli. I. Identification of a lysR gene encoding an activator of the lysA gene. J Mol Biol. 1983, 168 (2): 307-320. 10.1016/S0022-2836(83)80020-5.View ArticlePubMed
- Stragier P, Danos O, Patte JC: Regulation of diaminopimelate decarboxylase synthesis in Escherichia coli. II. Nucleotide sequence of the lysA gene and its regulatory region. J Mol Biol. 1983, 168 (2): 321-331. 10.1016/S0022-2836(83)80021-7.View ArticlePubMed
- Caldara M, Charlier D, Cunin R: The arginine regulon of Escherichia coli: whole-system transcriptome analysis discovers new genes and provides an integrated view of arginine regulation. Microbiology. 2006, 152 (Pt 11): 3343-3354.View ArticlePubMed
- Kiupakis AK, Reitzer L: ArgR-independent induction and ArgR-dependent superinduction of the astCADBE operon in Escherichia coli. J Bacteriol. 2002, 184 (11): 2940-2950. 10.1128/JB.184.11.2940-2950.2002.PubMed CentralView ArticlePubMed
- Cho BK, Federowicz S, Park YS, Zengler K, Palsson BO: Deciphering the transcriptional regulatory logic of amino acid metabolism. Nat Chem Biol. 2012, 8: 65-71.View Article
- Alekshun MN, Levy SB: Regulation of chromosomally mediated multiple antibiotic resistance: the mar regulon. Antimicrob Agents Chemother. 1997, 41 (10): 2067-2075.PubMed CentralPubMed
- Ariza RR, Cohen SP, Bachhawat N, Levy SB, Demple B: Repressor mutations in the marRAB operon that activate oxidative stress genes and multiple antibiotic resistance in Escherichia coli. J Bacteriol. 1994, 176 (1): 143-148.PubMed CentralPubMed
- Demple B: Redox signaling and gene control in the Escherichia coli soxRS oxidative stress regulon–a review. Gene. 1996, 179 (1): 53-57. 10.1016/S0378-1119(96)00329-0.View ArticlePubMed
- Helmann JD, Chamberlin MJ: Structure and function of bacterial sigma factors. Annu Rev Biochem. 1988, 57: 839-872. 10.1146/annurev.bi.57.070188.004203.View ArticlePubMed
- Hengge-Aronis R: Survival of hunger and stress: the role of rpoS in early stationary phase gene regulation in E. coli. Cell. 1993, 72 (2): 165-168. 10.1016/0092-8674(93)90655-A.View ArticlePubMed
- Keseler IM, Collado-Vides J, Santos-Zavaleta A, Peralta-Gil M, Gama-Castro S, Muniz-Rascado L, Bonavides-Martinez C, Paley S, Krummenacker M, Altman T, et al: EcoCyc: a comprehensive database of Escherichia coli biology. Nucleic Acids Res. 2011, 39 (Database issue): D583-D590.PubMed CentralView ArticlePubMed
- Janaszak A, Nadratowska-Wesolowska B, Konopa G, Taylor A: The P1 promoter of the Escherichia coli rpoH gene is utilized by sigma 70 -RNAP or sigma s -RNAP depending on growth phase. FEMS Microbiol Lett. 2009, 291 (1): 65-72. 10.1111/j.1574-6968.2008.01436.x.View ArticlePubMed
- Nitta T, Nagamitsu H, Murata M, Izu H, Yamada M: Function of the sigma(E) regulon in dead-cell lysis in stationary-phase Escherichia coli. J Bacteriol. 2000, 182 (18): 5231-5237. 10.1128/JB.182.18.5231-5237.2000.PubMed CentralView ArticlePubMed
- Typas A, Becker G, Hengge R: The molecular basis of selective promoter activation by the sigmaS subunit of RNA polymerase. Mol Microbiol. 2007, 63 (5): 1296-1306. 10.1111/j.1365-2958.2007.05601.x.View ArticlePubMed
- Podschun R, Ullmann U: Klebsiella spp. as nosocomial pathogens: epidemiology, taxonomy, typing methods, and pathogenicity factors. Clin Microbiol Rev. 1998, 11 (4): 589-603.PubMed CentralPubMed
- Salzberg SL: Genome re-annotation: a wiki solution?. Genome Biol. 2007, 8 (1): 102-PubMed CentralPubMed
- Mooney RA, Davis SE, Peters JM, Rowland JL, Ansari AZ, Landick R: Regulator trafficking on bacterial transcription units in vivo. Mol Cell. 2009, 33 (1): 97-108. 10.1016/j.molcel.2008.12.021.PubMed CentralView ArticlePubMed
- Datta D, Zhao H: Effect of false positive and false negative rates on inference of binding target conservation across different conditions and species from ChIP-chip data. BMC Bioinforma. 2009, 10: 23-10.1186/1471-2105-10-23.View Article
- Gottesman S, Storz G: Bacterial small RNA regulators: versatile roles and rapidly evolving variations. Cold Spring Harb Perspect Biol. 2011, 3 (12): a003798-10.1101/cshperspect.a003798.PubMed CentralView ArticlePubMed
- Chiang MK, Lu MC, Liu LC, Lin CT, Lai YC: Impact of Hfq on global gene expression and virulence in Klebsiella pneumoniae. PLoS One. 2011, 6 (7): e22248-10.1371/journal.pone.0022248.PubMed CentralView ArticlePubMed
- Argaman L, Hershberg R, Vogel J, Bejerano G, Wagner EG, Margalit H, Altuvia S: Novel small RNA-encoding genes in the intergenic regions of Escherichia coli. Curr Biol. 2001, 11 (12): 941-950. 10.1016/S0960-9822(01)00270-6.View ArticlePubMed
- Defoirdt T, Boon N, Bossier P: Can bacteria evolve resistance to quorum sensing disruption?. PLoS Pathog. 2010, 6 (7): e1000989-10.1371/journal.ppat.1000989.PubMed CentralView ArticlePubMed
- Maeda T, Garcia-Contreras R, Pu M, Sheng L, Garcia LR, Tomas M, Wood TK: Quorum quenching quandary: resistance to antivirulence compounds. ISME J. 2012, 6 (3): 493-501. 10.1038/ismej.2011.122.PubMed CentralView ArticlePubMed
- Cho BK, Knight EM, Barrett CL, Palsson BO: Genome-wide analysis of Fis binding in Escherichia coli indicates a causative role for A-/AT-tracts. Genome Res. 2008, 18 (6): 900-910. 10.1101/gr.070276.107.PubMed CentralView ArticlePubMed
- Cho BK, Knight EM, Palsson BO: Genomewide identification of protein binding locations using chromatin immunoprecipitation coupled with microarray. Methods Mol Biol. 2008, 439: 131-145. 10.1007/978-1-59745-188-8_9.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.