Skip to main content
  • Research article
  • Open access
  • Published:

Serotype-conversion in Shigella flexneri:identification of a novel bacteriophage, Sf101, from a serotype 7a strain



Shigella flexneri is the major cause of bacillary dysentery in the developing countries. The lipopolysaccharide (LPS) O-antigen of S. flexneri plays an important role in its pathogenesis and also divides S. flexneri into 19 serotypes. All the serotypes with an exception for serotype 6 share a common O-antigen backbone comprising of N-acetylglucosamine and three rhamnose residues. Different serotypes result from modification of the basic backbone conferred by phage-encoded glucosyltransferase and/or acetyltransferase genes, or plasmid-encoded phosphoethanolamine transferase. Recently, a new site for O-acetylation at positions 3 and 4 of RhaIII, in serotypes 1a, 1b, 2a, 5a and Y was shown to be mediated by the oacB gene. Additionally, this gene was shown to be carried by a transposon-like structure inserted upstream of the adrA region on the chromosome.


In this study, a novel bacteriophage Sf101, encoding the oacB gene was isolated and characterised from a serotype 7a strain. The complete sequence of its 38,742 bp genome encoding 66 open reading frames (orfs) was determined. Comparative analysis revealed that phage Sf101 has a mosaic genome, and most of its proteins were >90% identical to the proteins from 12 previously characterised lambdoid phages. In addition, the organisation of Sf101 genes was found to be highly similar to bacteriophage Sf6. Analysis of the Sf101 OacB identified two amino acid substitutions in the protein; however, results obtained by NMR spectroscopy confirmed that Sf101-OacB was functional. Inspection of the chromosomal integration site of Sf101 phage revealed that this phage integrates in the sbcB locus, thus unveiling a new site for integration of serotype-converting phages of S. flexneri, and determining an alternative location of oacB gene in the chromosome. Furthermore, this study identified oacB gene in several serotype 7a isolates from various regions providing evidence of O-acetyl modification in serotype 7a.


This is the first report on the isolation of bacteriophage Sf101 which contains the S. flexneri O-antigen modification gene oacB. Sf101 has a highly mosaic genome and was found to integrate in the sbcB locus. These findings contribute an advance in our current knowledge of serotype converting phages of S. flexneri.


The burden of bacterial dysentery due to shigellosis continues to be a major concern affecting more than 165 million people annually. S. flexneri is the primary cause of endemic shigellosis prevalent in the developing countries, and is the most frequently isolated species world-wide [1].

In S. flexneri, the O-antigen structure of the LPS, is one of the key virulence determinants required for its pathogenesis and protection against the host defence [2]. The O-antigen of S. flexneri, with the exception of serotype 6, consists of a polysaccharide backbone comprised of the repeating tetrasaccharide units (N-acetylglucosamine and three rhamnose residues), and modifications to the basic O-antigen backbone by the addition of glucosyl, O-acetyl, or phosphoethanolamine groups to one or more sugars, give rise to various different serotypes of S. flexneri [36]. To date at least 19 serotypes of S. flexneri have been identified [7].

The genes responsible for glucosylation of S. flexneri O-antigen, gtrA, gtrB and gtr(type), are arranged in a single operon known as the gtr cluster [813]. Whereas, the acetylation is mediated by a single gene ‘oac’ encoding O-acetyltransferase [14]. While the glucosylation is known to occur on any of the sugar residues, oac-encoded O-acetylation occurs at position 2 of RhaI residue. Additionally, the glucosyl or the O-acetyl modification genes are carried by bacteriophages, which mediate serotype-conversion by integrating into the host chromosome [3]. Six such serotype converting phages have been identified: SfI, SfII, SfIV, SfV, and SfX, which encode the gtr gene cluster, and Sf6 which encodes the oac gene [10, 1518]. In contrast to the phage-encoded glucosylation and O-acetylation, the phosphoethanolamine modification at position 3 of RhaII and/or RhaIII, is encoded by a plasmid-borne opt gene [6].

Recently, new sites for the O-acetylation of S. flexneri O-antigen have been identified. These are O-acetylation at position 3 (major) and 4 (minor) of RhaIII (3/4-O-acetylation) in serotypes 1a, 1b, 2a, 5a, Y, 6 and 6a, and at position 6 of GlcNAc in serotype 2a, 3a, and Y [1922]. Moreover, the degree of 3/4-O-acetylation has also been shown to vary between the ranges 30-70% (at position 3) and 15-30% (at position 4), within the strains of one serotype [21]. Another recent study revealed that the 3/4-O-acetylation modification in S. flexneri is mediated by the oacB gene, which was shown to be carried by a transposon-like structure located upstream of the adrA gene on the chromosome [23]. In this study, we report the isolation and characterisation of a novel bacteriophage Sf101, from a wild type serotype 7a strain, and show for the first time that oacB gene was located on the intact genome of the prophage, Sf101. The complete genome sequence of the Sf101 phage was determined. Comparative genomics used for highlighting important genetic similarities with other lambdoid phages indicated that the Sf101 phage has a mosaic genome and Sf6-like genome architecture. Analysis of a specific chromosomal integration site for phage Sf101 revealed that the phage integrates into the sbcB locus, thus identifying a new site for the integration of the serotype converting phages of S. flexneri, and a new location of oacB in S. flexneri chromosome. Additionally, this study also identified oacB gene in several serotype 7a isolates from various geographical regions, suggesting that the 3/4-O-acetylation modification is not uncommon in serotype 7a of S. flexneri.


Bacterial strains, bacteriophage and media

The induction of bacteriophage Sf101 from serotype 7a S. flexneri strain SFL1683 was performed using UV irradiation protocol described by Adam et al. [8]. Bacteriophage stocks were prepared by picking a single plaque, propagating on serotype Y strain (SFL124) [24], and precipitating phage using polyethylene glycol, as described in Sambrook et al. [25]. S. flexneri strain SFL1691 (serotype 7a) lacking the 3/4-O-acetylation on the RhaIII was used as host for Sf101-OacB functional analysis. E. coli JM109 was used for cloning experiments.

Bacteria were grown in Luria–Bertani (LB) broth or LB agar supplemented with Chloramphenicol (25 μg/ml) or Erythromycin (250 μg/ml) when appropriate. NZCYM broth was employed for routine propagation of phage.

DNA methods

Sf101 phage DNA was isolated from purified phage stock by treatment with Proteinase K as described by Sambrook et al. [24]. Bacterial genomic DNA was isolated using GE Healthcare genomic DNA isolation kit (GE Healthcare), according to the manufacturer’s instructions. Plasmid isolation was performed using the Axyprep Plasmid Miniprep kit (Axygen Biosciences). Primers used in this study were synthesized by Sigma-Aldrich and are listed in Additional file 1: Table S1. PCR amplification was performed using the PfuUltra II Fusion HS DNA Polymerase (Stratagene) according to the manufacturer’s directions. Purification of the PCR products was achieved by using the Wizard SV Gel and PCR Clean Up System (Promega). DNA sequencing was performed using Big Dye Terminator v3.1 Cycle Sequencing kit as recommended by the manufacturer and were run on an AB 3730 capillary sequencer at Biomedical Resources Facility, John Curtin School of Medical Research, Australian National University.

Sf101 oacB was amplified using Sf101 phage DNA as template, using primers Sf101-OacB-Fwd and Rev. The purified product was then cloned into vector pBC SK + (Stratagene) to generate plasmid pNV2073. Erythromycin gene, PCR amplified from plasmid pTRKH2 using Em primer pair was then cloned into pNV2073 to generate plasmid pNV2074. The recombinant plasmids were transformed by electroporation and maintained in JM109 cells. pNV2074 was also introduced into S. flexneri strain SFL1691 to produce SFL2516.

Electron microscopy

The purified phage was absorbed on carbon-coated copper grids, negatively stained with 2% phosphotungstic acid (pH 7.0) and visualized with a Hitachi H7000 transmission electron microscope.

Host range determination

The host range of phage Sf101 was determined as described previously [17]. Briefly, different dilutions of phage stock were made in SM buffer; 5-10 μl of each dilution was then spotted on required bacterial lawn on an agar plate. Lytic activity (clear zone encompassing the phage drop) was examined following overnight incubation at 37°C.

Sequencing of phage genomic DNA

The purified Sf101 phage DNA was sequenced using the Ion Torrent PGM 314 chip (Life Technologies) at the Australian Genome Resource Facility, University of Queensland. The reads generated were assembled into contigs using CLC Genomics Workbench (Ver 5.5.1, CLC Bio). To close the gaps between the contigs and re-sequence the regions of low-quality, the desired segments were PCR amplified using phage DNA as template and the purified PCR products were sequenced as described above.

Analysis of sequence

Open reading frames were determined using CLC Main Workbench (Ver 5.5.1, CLC Bio) and NCBI ORF finder program. Genes within orfs were predicted based on homologies to the known genes found by BlastN and BlastP searches and the presence of Shine-Dalgarno ribosome binding sites. The Rho-independent terminators were identified using ARLOND terminator finding program [26] and tRNAscan-SE search server was used to scan for tRNA [27].

The protein level alignments were performed using ClustalW [28] and BioEdit Sequence Alignment Editor [29]. The accession numbers for the phages used for comparative genomics were: CUS-3 (CP000711), Sf6 (NC_005344), HK544 (NC_019767), mEP043 c-1 (NC_019706), SfX (NC_017328), mEp213 (NC_019720), mEp234 (NC_019715), P22 (NC_002371), epsilon34 (NC_011976), Ime10 (NC_019501), HK446 (NC_019714), Phage 2851 (FM180578).

Preparation of O-polysaccharide and NMR spectroscopy

Lipopolysaccharide of strains SFL1683, SFL1691 and SFL2516 were isolated by the phenol water extraction of dried bacterial cells [30]. Lipid-free polysaccharide (PS) was prepared by treatment of the LPS with dilute acetic acid followed by purification using gel permeation chromatography [31].

1H NMR spectra of the LPS in D2O solution were recorded at 80°C on a Bruker Avance II 500 MHz spectrometer. 1D and 2D NMR experiments [32] of the PS were recorded in D2O solution on Bruker Avance 500 MHz and Bruker Avance III 700 MHz spectrometers, equipped with 5 mm TCI Z-Gradient CryoProbes, at 24°C for strain SFL1683 and at 20°C for strain SFL2516. Data processing was performed using vendor-supplied software. Chemical shifts are reported in ppm using internal sodium 3-trimethylsilyl-(2,2,3,3-2H4)-propanoate (TSP, δH 0.00) or external 1,4-dioxane in D2O (δC 67.40) as references.

Availability of supporting data

The nucleotide sequence of Sf101 phage reported in this article has been deposited in the GenBank database as accession number KJ832078.

Results and discussion

Isolation of Sf101

A novel bacteriophage named Sf101 was induced and isolated from serotype 7a, strain SFL1683. Phage Sf101 formed clear, round plaques when plated onto the indicator strain SFL124 (serotype Y). In order to confirm that SFL1683 is the host strain of Sf101 phage, Southern hybridization of HindIII digested genomic DNA of the host, using DIG-labelled Sf101 as a probe was performed. Results confirmed the presence of phage genome integrated into the host chromosome (data not shown).

Morphology of Sf101

Electron microscopic examination of Sf101 phage preparation revealed that it has an isometric head (ca. 50 nm) and a short tail (ca. 16 nm) (Figure 1). According to the morphological classification of Bradley, these characteristic features are typical of phage in the C group morphology of the family Podoviridae and order Caudovirale [33]. Among the other serotype converting phages of Shigella, appearance of Sf101 phage resembles phage Sf6 and SfX [34, 35].

Figure 1
figure 1

Electron micrograph of phage Sf101 negatively stained with 2% phosphotungstic acid. Scale bar, 50 nm.

Host range

Twelve serotypes (1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 7a, X, and Y) of S. flexneri were tested for sensitivity against Sf101 phage. Together, Sf101 was able to infect 4 serotypes: 1a, 1b, 3b and Y. Interestingly, we observed some differences in the phage ability to form plaques on different serotypes. Sf101 plating efficiency was similar on serotypes Y and 1a; however, it was 105 fold lower when plated on serotypes 3b and 1b. In a number of short-tailed phages, adsorption is mediated by an initial reversible binding of the phage to bacterial LPS, followed by an irreversible attachment of the phage to an unknown secondary receptor, the presence of which has been found to be necessary for infection. Recently, OmpA and OmpC were identified as the secondary receptors for bacteriophage Sf6, and were shown to dramatically affect the rate and efficiency of bacteriophage Sf6 infections [36]. Bacteriophage Sf101 being similar to Sf6 might also require presence of secondary receptors for infection, and the difference in the plating efficiencies observed might be due to mutation in the secondary receptors of these strains.

Genome properties and organisation

The complete genome of Sf101 phage was sequenced using 314 chips on an Ion torrent PGM, producing an average read length of approx. 125 bp. A total of approx. 21,069 reads were generated which were then de-novo assembled into 6 major contigs using CLC Genomics Workbench (Ver 5.5.1, CLC Bio). The accuracy of the contigs was verified by mapping the reads back to the contigs. The regions of ambiguity and the gaps between the contigs were then addressed by targeted Sanger sequencing to obtain a single contiguous sequence.

The complete genome of Sf101 phage consists of 38,742 bp, with the overall G + C content of 47.4%. A total of 66 protein coding genes with a plausible Shine-Dalgarno sequence were predicted from the genome sequence. Among which 41 orfs are transcribed from the sense strand and 25 are on the antisense strand. The majority of the orfs initiate translation with an ATG, whereas only 6 start with GTG. The sizes of the gene products varied from 24 a.a. to 911 a.a.

The translated orf products were also compared with known protein sequences using BlastP (Additional file 2: Table S2). Based on the similarities, 43 out of 66 orfs were assigned putative functions, while the other 23 orfs exhibited similarity to uncharacterised proteins. The overall genetic organisation of Sf101 phage (Figure 2a), consisting of "DNA packaging and structure - serotype conversion - regulation – recombination - replication – nin and the lysis region", suggested that this phage is a member of lambdoid family. Additionally, the layout of the genes was similar to that of previously sequenced serotype converting Shigella phages Sf6 and SfX.

Figure 2
figure 2

Genome map and integration site of phage Sf101. (a) The Sf101 genome is shown with a scale in bp. orfs are represented by arrows oriented in the direction of transcription. Putative functional modules are indicated above the arrows. Black knobs below the scale depict the Rho-independent terminators and the attachment site of the phage (attP) is indicated with a blue vertical arrow. (b) Schematic representation of phage Sf101 integrative recombination where attP on the phage genome recombines with the attB within sbcB gene of S. flexneri chromosome to form the two junction’s recombination sites attL and attR. Arrows P1 (sbcB-dwn-att-Fwd) and P2 (Orf22-up-Rev) on the map indicate the primers used in the PCR.

The Sf101 genes were generally tightly spaced occupying 94.5% of the genome. However, there were several large (~200 to 600 bp) apparently non-coding regions within the genome (e.g. between orfs 16-17, 17-18, 19-20, 23-24 and 37-38). Although no tRNA genes were identified in these or other locations by using a tRNA scanning program, several putative Rho-independent transcription terminators were identified and are shown in Figure 2a.

Relationship to other phages

Initial whole genome Blast of phage Sf101 against the NCBI database showed that bacteriophage Sf101 is related to several lambdoid phages originating from various hosts, like S. flexneri, E. coli and Salmonella. To better understand the relatedness, comparison of proteins encoded by Sf101 phage to those of the 12 most related phages was carried out. Figure 3 shows that most of the proteins encoded by the phage Sf101 were >90% identical with proteins from previously characterised lambdoid phages. Moreover, the genome of phage Sf101 is highly mosaic with the left half of the phage most homologous to E. coli phage CUS-3 and the right half most homologous to phage Sf6 and HK544.

Figure 3
figure 3

Comparison of Sf101 with related phages. Proteins encoded by Sf101 phage were compared with those of 12 other lambdoid phages from E. coli, S. flexneri and Salmonella. The coloured arrows represent the level of amino acid sequence identity between Sf101 protein and its counterpart in other phages. The red and green colours indicate >90% and >80% identity, respectively. The purple arrows on Sf101 map represent proteins with no phage-borne homologue.

The module containing DNA packaging and structural proteins is the largest module of the Sf101 phage, and is predicted to contain 15 orfs (orf 1 to 15). While the packaging proteins of Sf101 were identical to Sf6, most of the proteins responsible for phage head assembly shared >90% identity with their counterparts in CUS-3 (Figure 3). One of the exceptions was ORF9 in the head completion module. orf9 with two other genes of the module encode for proteins required for stabilizing the phage head after DNA gets loaded in it. Although, ORF9 shared only 50% identity with its equivalent in CUS-3, it was found to be >90% identical to its homologue in phage P22. The major difference between CUS-3 and Sf101 head morphogenesis proteins, was in the DNA injection module. Sf101 ORF11, 12 and 13, forming this module showed only 85%, 34% and 32% identity to their equivalents in CUS-3. In general, the injection proteins are known to be more variable than the rest of the virion assembly proteins because these proteins are released by the virion during DNA injection and function in the host during or after the injection [37].

orf15 of Sf101 encodes the tail spike protein that adsorbs to the cell receptors (O-antigen) of the host. Analysis of the tail spike protein of Sf101 phage revealed conservation in the amino terminal portion than in the carboxyl terminal region. The amino terminal residues were homologous to the cognate amino acid residues of CUS-3, Sf6, SfX, and epsilon34 (90, 90, 90, and 85% identity, respectively). However, very limited homology was observed between the carboxyl terminal residues. The reason for this could be that the amino terminal domain of the P22 like tail spikes are known to bind to virion head and is therefore more conserved. The carboxy terminal domain, on the other hand, contains the O-antigen binding site and so is specific for each phage [37, 38]. The integrase of phage Sf101 (encoded by orf18) shared 99% identity with the integrases of HK544, mEp234 and phage 2851. Moreover, further analysis of Sf101 genome sequence has revealed a 13 bp sequence (5’CGTAATCGTGAAA 3’) located upstream of the int gene, that is, identical to the attP of phage 2851 [39]. While the integration sites and attP sequence of HK544 and mEp234 are not known, phage 2851 has been shown to integrate in the sbcB locus [39]. The extremely high similarity of the Sf101 phage attP and Int protein to those of phage 2851 suggested that the integration site of Sf101 phage must be at the sbcB locus, identical to phage 2851. We amplified and sequenced one of the Sf101 prophage-bacterial genomic junctions that spans between Sf101 orf22 and sbcB gene (Figure 2b). A 3 kb sequence was obtained in which 13 bp attL sequence was identified; thus confirming the integration of Sf101 phage into the 5’ end of sbcB gene (encoding 3’-5’ exonuclease). This is unlike all the other known serotype-converting phages of S. flexneri which are known to integrate either into the agrW or thrW tRNA gene [10, 1518].

The early and regulatory regions located in the right half of the genome shared homology to CUS-3, Sf6, HK544, mEp043c-1, mEp213, mEp234 and HK446. However, the proteins of regulation and the replication module were most similar (>90% identical) to their cognates in HK446 and mEp043c-1, respectively. The nin region of the Sf101 phage (ORF46-54) was identical to that of phage HK544, with ninX being the only exception showing only 43% identity. Moreover, nin E, F, G, H, and I of Sf101 phage were also conserved in Sf6, mEp213, mEp234 and HK446 phages.

The antitermination Q and a set of predicted lysis genes lie downstream of the nin region. The lysis module of phage Sf101 is typical of lambdoid phages, consisting of holin, anti-holin, lysin, Rz and Rz1 proteins (encoded by orfs 57-61). Additionally, orf61 of Sf101 phage lies inside orf60, in a second reading frame, as in other lamdoid phages [38]. Homologues of holin, anti-holin, lysin, and Rz were seen in all the phages shown in Figure 3, except for phage P22 and phage 2851. In contrast, equivalents of Rz1 of phage Sf101 were only present in CUS-3, Sf6, P22 and epsilon34. To the right of the lytic cassette, orfs 62, 64, 65 and 66 encoded proteins which shared >90% identity with their homologues in SfX and Sf6, whereas orf63 encoded Rha family protein was identical to its cognate in phage CUS-3.

Novel genes of Sf101 phage

Sf101 phage proteins encoded by orfs 16, 17, 41 and 56, were identified as proteins with no known phage-borne homologues. Whilst ORF17 and 56 shared similarity with uncharacterised proteins of E. coli, ORF41 had no homologue in E. coli or Shigella and showed limited homology with proteins from Saprospira grandis and Bipolaris maydis. BlastP results of ORF16 revealed that it is an acyltransferase family protein, sharing 99% identity with the protein encoded by gene SF0315 (named oacB) of S. flexneri 2a str 301. OacB has recently been shown to confer the 3/4-O-acetylation modification of RhaIII of S. flexneri O-antigen [23].

Pairwise alignments of Sf101 orf16 (Sf101-oacB) with the oacB at the DNA and protein levels were carried out. The analysis identified three base substitutions (G486 to A, C692 to T and G796 to A) in the Sf101-oacB. While the first mutation had no effect at the protein level, the other two mutations resulted in two amino acid substitutions in Sf101-OacB: A231 to V and V266 to I (Additional file 3: Figure S1). Since, A, I and V are all neutral amino acids with similar structure, it was expected that these mutations will impose minimum effect on the function of OacB. This was verified by performing NMR spectroscopy experiments.

Confirming the function of Sf101-OacB

In order to confirm the function of Sf101-OacB, we first analysed the LPS of the Sf101 host (SFL1683) by 1H NMR spectroscopy. Results revealed NMR resonances at δH 2.21 and 2.17 indicating the presence of O-acetyl groups [40, 41]. oacB gene from Sf101 phage was then cloned into a plasmid (pNV2074) and introduced into a serotype 7a strain (SFL1691), which was PCR negative for oacB gene, to create a recombinant strain SFL2516. The 1H NMR spectrum acquired on the LPS of strain SFL2516 revealed resonances, inter alia, at δH 2.21, 2.17 and 2.15; however, NMR signals were absent in the spectral region for O-acetyl groups for the SFL1691 LPS.

LPS of strains SFL1683 and SFL2516 were then treated by dilute aqueous acetic acid and purified by size exclusion chromatography to obtain the polysaccharide materials (PS). The 1H NMR spectra of these PS showed resonances, inter alia, at δH 2.21 (0.8 H), 2.16 (0.5 H), 2.11 (0.9 H) and 2.07 (2.1 H) for the SFL1683 strain (Figure 4a) and at δH 2.22 (0.8 H), 2.17 (0.7 H), 2.13 (0.8 H) and 2.08 (2.2 H) for the SFL2516 strain, indicating partial O-acetyl substitution. The O-acetyl substitution position(s) were then determined by performing the 1H,1H-TOCSY experiments employing an array of mixing times of increasing lengths, which have proven to be particularly powerful at unraveling the spin-systems of the sugar residues and subsequently the O-acetyl substitution position(s) [42]. Using mixing times in the range 10–120 ms, the 1H,1H-TOCSY spectra of the PS from strain SFL2516 revealed, inter alia, spin systems originating from δH 1.29 (H6 of RhaIII3Ac) to 3.79 (H5), 3.53 (H4), 5.09 (H3) and 4.26 (H2), from δH 1.16 (H6 of RhaIII4Ac) to 3.86 (H5), 4.80 (H4), 4.10 (H3) and 4.22 (H2), as well as from δH 1.26 (H6 of RhaIII) to 3.68 (H5), 3.34 (H4), 3.87 (H3) and 4.14 (H2) (Figure 4b). The 1H,1H-TOCSY spectra of the PS from strain SFL1683 were closely similar, and taken together these results were fully consistent with the NMR data reported by Wang et al. [23] for partial O-acetyl substitution at positions O3 and O4 of RhaIII in O-antigens from other S. flexneri serotypes. Thus, O-acetylation was present to about 1/4 at O3 and to about 1/5 at O4 in the two strains investigated herein.

Figure 4
figure 4

NMR. Selected regions of (a) 1H NMR spectrum of the PS from strain SFL1683 showing resonances from O- and N-acetyl groups and (b) 1H,1H-TOCSY (τmix 120 ms) NMR spectrum of the PS from strain SFL2516 showing spin systems originating from the H6 protons in the methyl groups of rhamnosyl residues. Annotations are given with respect to the O-acetylation pattern in the O-antigen.

Identification of conserved residues and motifs of OacB

BlastP results of Sf101-OacB also revealed that this protein shared some homology with acyltransferases from other species. All the homologues were grouped under the superfamily COG1835 (acyl_transf_3). Multiple alignment of the Sf101-OacB with 20 of its homologues from various species was carried out for identification of the conserved residues and motifs. Results of alignment (Additional file 4: Figure S2) showed several conserved motifs: DGxRGxLAxxVxxHH, FFxITG(YorF)LFxxK, WxLxYEWxFY and YSXYLxHG (where x is any amino acid), present at similar positions of the proteins. The first 3 motifs were found in the amino terminal part of the protein while the fourth one was in the carboxy terminus. Several other conserved amino acids were also identified in close proximity to these motifs. The high levels of conservation of various amino acid motifs suggests that they may be involved in specific conserved functions.

Distribution of oacB in S. flexneri7a strains

The 3/4-O-acetyl modification observed in this study was in a serotype 7a strain. However, serotype 7a is known to lack any O-acetyl modification to date [21]. Thus, several serotype 7a isolates from different regions (4 from Bangladesh, 9 from Egypt, 1 from Sweden, 31 from UK, and 14 from Vietnam) were screened to identify a possible 3/4-O-acetylation modification. As the antiserum specific for 3/4-O-acetyl modification was not available, a PCR screening of 59 serotype 7a strains with oacB specific primers was performed. As shown in Table 1 the expected PCR product was amplified from 7 out of 59 strains (1 isolate each from Sweden, UK, and Egypt, and 4 from Bangladesh). The sequencing of the PCR product from the positive strains revealed that the oacB gene sequence in 2 strains (1 from Egypt and 1 from Sweden) was identical to oacB of Sf101 phage (Additional file 3: Figure S1). However, the oacB gene in the other 5 strains was almost identical to oacB from S. flexneri 2a str 301 except a base substitution at position 1146, which resulted in I382 to M mutation in the OacB protein of all 5 strains. However, as I and M are neutral amino acids, this mutation is expected to have no effect on the function of OacB. These results suggest that the 3/4-O-acetyl modification in serotype 7a is not uncommon. Moreover, in order to determine if oacB gene in these 7a strains was carried by Sf101 phage, PCRs targeting three different regions of Sf101 phage (orf15-orf16, orf30-orf39, and orf60-orf66) were performed on the above 7 oacB positive strains. Results revealed that 1 out of the 7 tested strains carried complete Sf101 phage suggesting this phage may have contributed in the dissemination of the oacB gene in 7a strains (Table 1).

Table 1 PCR screening of oacB in serotype 7a isolates from various regions

Integration site of Sf101 phage suggests that the oacB gene in Sf101 lysogens is located in the sbcB locus. However, the reported site for oacB gene in serotype 1a, 1b, 2a, 5a and Y is upstream of the adrA gene (which encodes diguanylate cyclase AdrA) [23]. Thus, the location of oacB gene in the 7 oacB positive serotype 7a strains was determined by performing PCR using primer specific for oacB gene and adrA. Results indicate that the location of oacB in the Sf101 lysogen was the sbcB locus, while in the other 6 strains oacB was located upstream of the adrA gene. Change in the chromosomal position of oacB from sbcB to adrA locus is likely due to disruption of the prophage by IS elements and mobilization of oacB due to the combination of 2 or more IS elements flanking oacB. This hypothesis is supported by our analysis of the region flanking sbcB gene in 10 previously sequenced S. flexneri genomes obtained from the NCBI database. Results revealed that the region upstream of sbcB gene is extremely variable containing insertion elements in all, and Sf101 like Int in 8 out of 10 strains. Taken together, our analysis indicate that a possible presence and disruption of Sf101 phage by IS elements could have resulted in the mobilization of oacB in some and loss in the other strains. Additionally, Wang et al. have also reported that the oacB gene located upstream of adrA in serotypes 1a, 1b, 2a, 5a & Y was carried by a transposon-like structure [23].


This study reports the isolation and characterisation of a novel bacteriophage Sf101 from S. flexneri serotype 7a strain. Complete genome sequence of Sf101 phage was determined and found to contain functional oacB gene making it a serotype converting phage of S. flexneri. The comparative genomic analysis of Sf101 to 12 other lambdoid phages revealed that bacteriophage Sf101 has a highly mosaic genome and Sf6-like genome organisation. Sf101 was found to integrate in the sbcB locus representing a new genomic location of oacB gene in Sf101 lysogen. Additionally, this study for the first time identified oacB gene in several serotype 7a isolates from different regions, providing evidence of 3/4-O-acetylation modification in serotype 7a of S. flexneri. These findings will further our understanding on the serotype converting phages of Shigella which will be useful in comprehending the role these bacteriophages play in the survival of S. flexneri in the environment and human host.


  1. Kotloff KL, Winickoff JP, Ivanoff B, Clemens JD, Swerdlow DL, Sansonetti PJ, Adak GK, Levine MM: Global burden of Shigella infections: implications for vaccine development and implementation of control strategies. Bull World Health Organ. 1999, 77 (8): 651-666.

    CAS  PubMed Central  PubMed  Google Scholar 

  2. Morona R, Daniels C, Van Den Bosch L: Genetic modulation of Shigella flexneri 2a lipopolysaccharide O antigen modal chain length reveals that it has been optimized for virulence. Microbiology. 2003, 149 (Pt 4): 925-939.

    Article  CAS  PubMed  Google Scholar 

  3. Allison GE, Verma NK: Serotype-converting bacteriophages and O-antigen modification in Shigella flexneri. Trends Microbiol. 2000, 8 (1): 17-23. 10.1016/S0966-842X(99)01646-7.

    Article  CAS  PubMed  Google Scholar 

  4. Foster RA, Carlin NI, Majcher M, Tabor H, Ng LK, Widmalm G: Structural elucidation of the O-antigen of the Shigella flexneri provisional serotype 88-893: structural and serological similarities with S. flexneri provisional serotype Y394 (1c). Carbohydr Res. 2011, 346 (6): 872-876. 10.1016/j.carres.2011.02.013.

    Article  CAS  PubMed  Google Scholar 

  5. Kenne L, Lindberg B, Petersson K: Basic structure of the oligosaccharide repeating-unit of the Shigellaflexneri O-antigens. Carbohydr Res. 1977, 56 (2): 363-370. 10.1016/S0008-6215(00)83357-1.

    Article  CAS  PubMed  Google Scholar 

  6. Sun Q, Knirel YA, Lan R, Wang J, Senchenkova SN, Jin D, Shashkov AS, Xia S, Perepelov AV, Chen Q, Wang Y, Wang H, Xu J: A novel plasmid-encoded serotype conversion mechanism through addition of phosphoethanolamine to the O-antigen of Shigella flexneri. PLoS One. 2012, 7 (9): e46095-10.1371/journal.pone.0046095.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  7. Sun Q, Lan R, Wang J, Xia S, Wang Y, Wang Y, Jin D, Yu B, Knirel YA, Xu J: Identification and characterization of a novel Shigella flexneri serotype Yv in China. PLoS One. 2013, 8 (7): e70238-10.1371/journal.pone.0070238.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  8. Adams MM, Allison GE, Verma NK: Type IV O antigen modification genes in the genome of Shigella flexneri NCTC 8296. Microbiology. 2001, 147 (Pt 4): 851-860.

    Article  CAS  PubMed  Google Scholar 

  9. Huan PT, Whittle BL, Bastin DA, Lindberg AA, Verma NK: Shigella flexneri type-specific antigen V: cloning, sequencing and characterization of the glucosyl transferase gene of temperate bacteriophage SfV. Gene. 1997, 195 (2): 207-216. 10.1016/S0378-1119(97)00144-3.

    Article  CAS  PubMed  Google Scholar 

  10. Mavris M, Manning PA, Morona R: Mechanism of bacteriophage SfII-mediated serotype conversion in Shigella flexneri. Mol Microbiol. 1997, 26 (5): 939-950. 10.1046/j.1365-2958.1997.6301997.x.

    Article  CAS  PubMed  Google Scholar 

  11. Pradip Adhikari GA, Whittle B, Verma NK: Serotype 1a O-antigen modification: molecular characterization of the genes involved and their novel organization in the Shigella flexneri chromosome. J Bacteriol. 1999, 181 (15): 4711-4718.

    PubMed Central  Google Scholar 

  12. Stagg RM, Tang SS, Carlin NI, Talukder KA, Cam PD, Verma NK: A novel glucosyltransferase involved in O-antigen modification of Shigella flexneri serotype 1c. J Bacteriol. 2009, 191 (21): 6612-6617. 10.1128/JB.00628-09.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  13. Verma NK, Verma DJ, Huan PT, Lindberg AA: Cloning and sequencing of the glucosyl transferase-encoding gene from converting bacteriophage X (SFX) of Shigella flexneri. Gene. 1993, 129 (1): 99-101. 10.1016/0378-1119(93)90702-5.

    Article  CAS  PubMed  Google Scholar 

  14. Verma NK, Brandt JM, Verma DJ, Lindberg AA: Molecular characterization of the O-acetyl transferase gene of converting bacteriophage SF6 that adds group antigen 6 to Shigella flexneri. Mol Microbiol. 1991, 5 (1): 71-75. 10.1111/j.1365-2958.1991.tb01827.x.

    Article  CAS  PubMed  Google Scholar 

  15. Allison GE, Angeles D, Tran-Dinh N, Verma NK: Complete genomic sequence of SfV, a serotype-converting temperate bacteriophage of Shigella flexneri. J Bacteriol. 2002, 184 (7): 1974-1987. 10.1128/JB.184.7.1974-1987.2002.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  16. Casjens S, Winn-Stapley DA, Gilcrease EB, Morona R, Kuhlewein C, Chua JE, Manning PA, Inwood W, Clark AJ: The chromosome of Shigella flexneri bacteriophage Sf6: complete nucleotide sequence, genetic mosaicism, and DNA packaging. J Mol Biol. 2004, 339 (2): 379-394. 10.1016/j.jmb.2004.03.068.

    Article  CAS  PubMed  Google Scholar 

  17. Jakhetia R, Talukder KA, Verma NK: Isolation, characterization and comparative genomics of bacteriophage SfIV: a novel serotype converting phage from Shigella flexneri. BMC Genomics. 2013, 14: 677-10.1186/1471-2164-14-677.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  18. Sun Q, Lan R, Wang Y, Wang J, Wang Y, Li P, Du P, Xu J: Isolation and genomic characterization of SfI, a serotype-converting bacteriophage of Shigella flexneri. BMC Microbiol. 2013, 13 (1): 39-10.1186/1471-2180-13-39.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  19. Kubler-Kielb J, Vinogradov E, Chu C, Schneerson R: O-Acetylation in the O-specific polysaccharide isolated from Shigella flexneri serotype 2a. Carbohydr Res. 2007, 342 (3–4): 643-647.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  20. Perepelov AV, L'Vov VL, Liu B, Senchenkova SN, Shekht ME, Shashkov AS, Feng L, Aparin PG, Wang L, Knirel YA: A similarity in the O-acetylation pattern of the O-antigens of Shigellaflexneri types 1a, 1b, and 2a. Carbohydr Res. 2009, 344 (5): 687-692. 10.1016/j.carres.2009.01.004.

    Article  CAS  PubMed  Google Scholar 

  21. Perepelov AV, Shekht ME, Liu B, Shevelev SD, Ledov VA, Senchenkova SN, L'Vov VL, Shashkov AS, Feng L, Aparin PG, Wang L, Knirel YA: Shigella flexneri O-antigens revisited: final elucidation of the O-acetylation profiles and a survey of the O-antigen structure diversity. FEMS Immunol Med Microbiol. 2012, 66 (2): 201-210. 10.1111/j.1574-695X.2012.01000.x.

    Article  CAS  PubMed  Google Scholar 

  22. Perepelov AV, Shevelev SD, Liu B, Senchenkova SN, Shashkov AS, Feng L, Knirel YA, Wang L: Structures of the O-antigens of Escherichia coli O13, O129, and O135 related to the O-antigens of Shigella flexneri. Carbohydr Res. 2010, 345 (11): 1594-1599. 10.1016/j.carres.2010.04.023.

    Article  CAS  PubMed  Google Scholar 

  23. Wang J, Knirel YA, Lan R, Senchenkova SN, Luo X, Perepelov AV, Wang Y, Shashkov AS, Xu J, Sun Q: Identification of an O-Acyltransferase Gene (oacB) that mediates 3- and 4-O-Acetylation of Rhamnose III in Shigella flexneri O Antigens. J Bacteriol. 2014, 196 (8): 1525-1531. 10.1128/JB.01393-13.

    Article  PubMed Central  PubMed  Google Scholar 

  24. Lindberg AA, Karnell A, Stocker BA, Katakura S, Sweiha H, Reinholt FP: Development of an auxotrophic oral live Shigella flexneri vaccine. Vaccine. 1988, 6 (2): 146-150. 10.1016/S0264-410X(88)80018-5.

    Article  CAS  PubMed  Google Scholar 

  25. Sambrook J, Fritsch EF, Maniatis T: Molecular Cloning: a Laboratory Manual. 1989, Cold Spring Harbor, NY: Cold Spring Harbor Laboratory, 2

    Google Scholar 

  26. Gautheret D, Lambert A: Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles. J Mol Biol. 2001, 313 (5): 1003-1011. 10.1006/jmbi.2001.5102.

    Article  CAS  PubMed  Google Scholar 

  27. Schattner P, Brooks AN, Lowe TM: The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005, 33 (Web Server issue): W686-W689.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  28. Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-4680. 10.1093/nar/22.22.4673.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  29. Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41: 95-98.

    CAS  Google Scholar 

  30. Westphal OJK: Bacterial lipopolysaccharides: extraction with phenol-water and further applications of the procedure. Methods Carbohydr Chem. 1965, 5: 83-90.

    CAS  Google Scholar 

  31. Chassagne P, Fontana C, Guerreiro C, Gauthier C, Phalipon A, Widmalm G, Mulard LA: Structural Studies of the O-Acetyl-Containing O-Antigen from a Shigella flexneri serotype 6 strain and synthesis of oligosaccharide fragments thereof. Eur J Org Chem. 2013, 19: 4085-4106.

    Article  Google Scholar 

  32. Widmalm G: General NMR spectroscopy of carbohydrates and conformational analysis in solution. Comprehensive glycoscience. Edited by: Kamerling JP. 2007, Oxford: Elsevier, 101-132. vol. 2

    Chapter  Google Scholar 

  33. Bradley DE: Ultrastructure of bacteriophage and bacteriocins. Bacteriol Rev. 1967, 31 (4): 230-314.

    CAS  PubMed Central  PubMed  Google Scholar 

  34. Guan S, Bastin DA, Verma NK: Functional analysis of the O antigen glucosylation gene cluster of Shigella flexneri bacteriophage SfX. Microbiology. 1999, 145 (Pt 5): 1263-1273.

    Article  CAS  PubMed  Google Scholar 

  35. Lindberg AA, Wollin R, Gemski P, Wohlhieter JA: Interaction between bacteriophage Sf6 and Shigella flexneri. J Virol. 1978, 27 (1): 38-44.

    CAS  PubMed Central  PubMed  Google Scholar 

  36. Parent KN, Erb ML, Cardone G, Nguyen K, Gilcrease EB, Porcek NB, Pogliano J, Baker TS, Casjens SR: OmpA and OmpC are critical host factors for bacteriophage Sf6 entry in Shigella. Mol Microbiol. 2014, 92 (1): 47-60. 10.1111/mmi.12536.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  37. Casjens SR, Thuman-Commike PA: Evolution of mosaically related tailed bacteriophage genomes seen through the lens of phage P22 virion assembly. Virology. 2011, 411 (2): 393-415. 10.1016/j.virol.2010.12.046.

    Article  CAS  PubMed  Google Scholar 

  38. Berget PB, Poteete AR: Structure and functions of the bacteriophage P22 tail protein. J Virol. 1980, 34 (1): 234-243.

    CAS  PubMed Central  PubMed  Google Scholar 

  39. Strauch E, Hammerl JA, Konietzny A, Schneiker-Bekel S, Arnold W, Goesmann A, Puhler A, Beutin L: Bacteriophage 2851 is a prototype phage for dissemination of the Shiga toxin variant gene 2c in Escherichia coli O157:H7. Infect Immun. 2008, 76 (12): 5466-5477. 10.1128/IAI.00875-08.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  40. Jansson PE, Kenne L, Schweda E: Nuclear magnetic resonance and conformational studies on monoacetylated methyl D-Gluco- and D-Galactopyranosides. J Chem Soc Perk T. 1987, 1 (2): 377-383.

    Article  Google Scholar 

  41. Rönnols J, Pendrill R, Fontana C, Hamark C, Angles d'Ortoli T, Enström O, Ståhle J, Zaccheus MV, Säwén E, Hahn LE, Iqbal S, Widmalm G: Complete 1H and 13C NMR chemical shift assignments of mono- to tetrasaccharides as basis for NMR chemical shift predictions of oligosaccharides using the computer program CASPER. Carbohydr Res. 2013, 380: 156-166.

    Article  PubMed  Google Scholar 

  42. Fontana C, Ramström K, Weintraub A, Widmalm G: Structural studies of the O-antigen polysaccharide from Escherichia coli O115 and biosynthetic aspects thereof. Glycobiology. 2013, 23 (3): 354-362. 10.1093/glycob/cws161.

    Article  CAS  PubMed  Google Scholar 

Download references


We thank Phung Cam, A. El-Gendy, Claire Jenkins and Kaisar Talukder for providing S. flexneri serotype 7a strains. We are grateful to Cathy Gillespie of the Microscopy and Cytometry Resource Facility at the Australian National University, for help with the electron microscope.

This work was supported by grants from the Swedish Research Council and The Knut and Alice Wallenberg Foundation to GW.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Naresh K Verma.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

RJ contributed to the experimental design of the study, carried out all experiments, analysis and drafted the manuscript. AM participated in the induction of the phage. JS and GW conducted and analysed the NMR experiments. NKV conceived and directed the study, participated in its experimental design, analysis of the results. NKV and GW critically revised the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1: Table S1: List of primes used in this study. (DOCX 17 KB)

Additional file 2: Table S2: Analysis of predicted orfs and proteins of Sf101. (DOCX 26 KB)


Additional file 3: Figure S1: Multiple alignment of the amino acid sequence of OacB from Sf101, S. flexneri 2a str 301, and 7 serotype 7a strains performed using ClustalW. Amino acid substitutions in Sf101 OacB when compared with OacB of S. flexneri 2a str 301 are boxed in green. Residues highlighted in red are identical to OacB from Sf101, while the ones in green share identity with OacB of S. flexneri 2a str 301. Residues in grey are point mutations in the protein sequences of serotype 7a strains. (DOCX 21 KB)


Additional file 4: Figure S2: Alignment of Sf101 OacB with homologues acyltansferases. The OacB protein from Sf101 phage was aligned with its homologues in several other species using ClustalW. Regions highlighted in pink are conserved amino acids. Conserved motifs are shown by green lines above the alignment. (DOCX 42 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit

The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jakhetia, R., Marri, A., Ståhle, J. et al. Serotype-conversion in Shigella flexneri:identification of a novel bacteriophage, Sf101, from a serotype 7a strain. BMC Genomics 15, 742 (2014).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: