Skip to main content
  • Research article
  • Open access
  • Published:

Identification and characterization of repetitive extragenic palindromes (REP)-associated tyrosine transposases: implications for REP evolution and dynamics in bacterial genomes

Abstract

Background

Bacterial repetitive extragenic palindromes (REPs) compose a distinct group of genomic repeats. They usually occur in high abundance (>100 copies/genome) and are often arranged in composite repetitive structures - bacterial interspersed mosaic elements (BIMEs). In BIMEs, regularly spaced REPs are present in alternating orientations. BIMEs and REPs have been shown to serve as binding sites for several proteins and suggested to play role in chromosome organization and transcription termination. Their origins are, at present, unknown.

Results

In this report, we describe a novel class of putative transposases related to IS200/IS605 transposase family and we demonstrate that they are obligately associated with bacterial REPs. Open reading frames coding for these REP-associated tyrosine transposases (RAYTs) are always flanked by two REPs in inverted orientation and thus constitute a unit reminiscent of typical transposable elements. Besides conserved residues involved in catalysis of DNA cleavage, RAYTs carry characteristic structural motifs that are absent in typical IS200/IS605 transposases. DNA sequences flanking rayt genes are in one third of examined cases arranged in modular BIMEs. RAYTs and their flanking REPs apparently coevolve with each other. The rayt genes themselves are subject to rapid evolution, substantially exceeding the substitution rate of neighboring genes. Strong correlation was found between the presence of a particular rayt in a genome and the abundance of its cognate REPs.

Conclusions

In light of our findings, we propose that RAYTs are responsible for establishment of REPs and BIMEs in bacterial genomes, as well as for their exceptional dynamics and species-specifity. Conversely, we suggest that BIMEs are in fact a special type of nonautonomous transposable elements, mobilizable by RAYTs.

Background

Transposable elements (TEs), or transposons, are a large group of mobile genetic elements with ability to actively transfer themselves into new locations in their host´s DNA. This process, called transposition, is catalyzed by transposases, coded for by TEs themselves. Insertion sequences (ISs) present the simplest examples of TEs.

The IS200/IS605 family of transposable elements was first described in genus Salmonella [1] and further in many other bacterial and archaeal genomes [2]. Contrary to the majority of TEs that transpose using transposases whose active site is composed of a triad of acidic residues (DDE transposases), known members of the IS200/IS605 family lack terminal inverted repeats and do not generate larger target site duplications upon transposition [3]. Crystal structures of two IS200/IS605 transposases have been solved (PDB IDs: 2a6o and 2f4f) [4, 5]. Their fold is remarkably similar to proteins involved in rolling circle (RC) replication - conjugative plasmid relaxases and viral Rep proteins [4, 5]. This similarity is further supported by shared mechanism of DNA cleavage: transesterification reaction takes place between DNA strand and conserved tyrosine residue, resulting in covalent protein-DNA intermediate. A histidine-hydrophobic-histidine motif and a divalent metal (magnesium) cation are another mandatory components of properly assembled active site, aiding the nucleophilic attack of catalytic tyrosine [6, 7]. Next trait common for both IS200/IS605 transposases and RC enzymes is that the cleavage of DNA depends on the recognition of stem-loop structures, present at either the origin of RC replication or IS termini [6, 7]. IS200/IS605 transposases are the smallest transposases known, with average length below 150 amino acids. To encompass size limitation, they work as a homodimer with two hybrid active sites, each composed of tyrosine from first unit and the histidine-hydrophobic-histidine motif from second unit [4, 5].

As determination of eukaryotic genomic sequences progressed in the last two decades, it has become obvious that their genetic information is littered with highly repetitive, "junk" DNA. More detailed analyses of these repetitive elements revealed that many of them are actually special cases of TEs. They generally retain conserved terminal sequences (for example inverted repeats) of their corresponding full-length transposons, which are important for transposition initiation, but lack completely or partially the transposase gene. Therefore, transposase encoded by "parental" full-length transposons needs to be supplied in trans. These repetitive elements are thus called nonautonomous TEs. Three groups of nonautonomous TEs account for substantial fractions of eukaryotic genomes. The first group is represented by short interspersed nuclear elements (Alu-like), derived from non-LTR-retrotransposons [8]. Helitrons, the second type of nonautonomous TEs, are thought to be mobilized by Y-2 type transposases, that are homologous to RC replication relaxases [9]. The last type, miniature inverted repeat transposable elements (MITEs), is present in both eukaryotes and prokaryotes. Most studied MITEs are related to two homologous insertion sequence families, IS630 (prokaryotic) and Tc-Mariner (eukaryotic) [10], both employing DDE catalytic mechanism. IS630-derived MITEs in prokaryotic genomes include Correia elements in Neisseria species [11] and RUP elements in Streptococcus pneumoniae [12]. Besides these, MITEs related to other IS families have been identified in prokaryotes [2].

Repetitive extragenic palindromic sequences (REPs) were originally identified in enteric bacteria [13] and later in several other bacterial taxa [14–16] as a class of abundant repeats with characteristic architecture. REP elements contain imperfect palindrome in their sequence. The majority of REPs are arranged in repeats of higher order, bacterial interspersed mosaic elements (BIMEs) [17]. In BIME-1, two oppositely orientated REPs are located close to each other. The inter-REP sequence interacts with integration host factor (IHF) [18]. BIME-2 and atypical BIMEs are composed of several tandemly repeated BIME-1-like units [19] and have been shown to strongly bind DNA gyrase [20]. REPs themselves interact with DNA polymerase I [21] and facilitate Rho-dependent transcription termination [22].

Our present results describe an intimate relationship between REP and BIME elements and one apparently monophyletic group of IS200/IS605 transposases. Because of striking similarities to known nonautonomous TEs, we propose that BIMEs are in fact nonautonomous TEs and that IS200/IS605 transposases are responsible for their mobilization.

Results

Case study - genus Stenotrophomonas

We have studied mechanisms of high-level tetracycline resistance in bacteria from agricultural soil treated with manure from tetracycline-fed animals. Among tetracycline-resistant isolates, identified as Stenotrophomonas maltophilia, Variovorax paradoxus and Chryseobacterium balustinum, horizontal gene transfer from S. maltophilia to other two species was detected. The transferred nucleotide sequence was 90% identical to a histidine kinase/response regulator/sodium-symporter family gene, present in both sequenced S. maltophilia strains. We investigated the region surrounding this gene in sequenced stenotrophomonads for the presence of genes known to be involved in horizontal transfer of genetic information. A putative transposase of the IS200/IS605 family was found one gene away from histidine kinase in S. maltophilia R551-3. Analysis of sequences flanking the transposase gene revealed inverted repeats containing an imperfect palindrome. More sequences identical to these inverted repeats were observed scattered in several instances between neighboring genes (Figure 1A).

Figure 1
figure 1

Stenotrophomonas RAYTs. (A) Schematic representation of a segment of S. maltophilia R551-3 genome containing putative IS200/IS605 family transposase gene (orange arrow), histidine kinase/response regulator/sodium-symporter family gene (blue arrow) and several short palindromic repeats (red arrows). (B) General structure of Stenotrophomonas rayt genes flanked by REPs. The whole REPs and their orientation are denoted with red arrows. Details in REP structure (bottom) are marked with arrows: blue - GT(A/G)G head, green - palindrome-forming sequence, pink - noncomplementary middle part of palindrome.

We performed a BLAST search that revealed five apparent homologs of this transposase in genomes of sequenced stenotrophomonads. Their genes were all found to be delimited by inverted repeats of the same architecture (Figure 1B). The 5-GT(A/G)G "head" is immediately followed by perfectly complementary, GC-rich palindrome, interrupted by 2-4 bases in its middle (Table 1, bottom). Due to the presence of multiple copies of these repeated sequences in the proximity of the transposase gene (see above), we scanned whole Stenotrophomonas genomes for additional copies of repeats flanking each particular transposase homolog. The number of hits ranged from 37 up to 427 perfect copies of given repeat per genome (Table 1, bottom). Because of their palindromic nature and abundance, features they share with published REP sequences, they will be called REPs and their cognate transposases will be called REP-associated tyrosine transposases (RAYTs).

Table 1 Summary information on identified RAYTs and REPs

We noticed that some of the REPs identified were arranged in clusters. Ten clusters composed of REPs were then analysed in detail (Figure 2). The core (basic module) of each of these compound structures consists at least of two inverted REPs, separated by two intervening segments. Several of these basic modules are connected to each other in a head-to-tail fashion. The inter-REP segments do not show any homology with each other and vary substantially in length, suggesting that these clusters arose repeatedly and independently. Because of their exceptional structural similarities with published BIMEs, they will be called BIMEs.

Figure 2
figure 2

Schematic representation of S. maltophilia R551-3 and S. maltophilia K279a BIMEs. Host strain is indicated, followed by BIME coordinates. Each unique inter-REP sequence is assigned a different letter. Basic modules are bracketed, their numbers are denoted. REPs and their orientation are marked with arrows: red - Smal4 REP, green - Smal3 REP, blue - S_sp2 REP. Asterisks indicate modified REPs. Large orange arrow denotes gene coding for Smal4 RAYT.

Stenotrophomonas BIMEs show several interesting aspects. Some of them are hybrid and contain REPs from two different RAYTs. Moreover, slightly modified REPs occur in BIMEs, differing only in a few nucleotide positions. Still, in all cases, the palindromic features of REPs are preserved, suggesting selection for complementary mutations. Intriguingly, one rayt gene (Smal4) is directly associated with a BIME, its downstream REP being one of the BIME-constituting REPs.

Since all six rayt genes are flanked by two inverted REPs, this type of organization is likely to be subject to evolutionary preservation. To estimate evolutionary relationship between these elements, phylogenetic trees were constructed from RAYT amino acid sequences and REP nucleotide sequences, respectively. Both phylograms display the same topology (Figure 3), suggesting that RAYTs coevolve with their cognate REPs and that their typical organization is ancestral.

Figure 3
figure 3

Coevolution of RAYTs and REPs. Unrooted phylograms, constructed from (A) Stenotrophomonas RAYT amino acid sequences and (B) Stenotrophomonas REP nucleotide sequences.

RAYTs in other bacteria

We wondered if similar RAYTs, REPs and BIMEs also occur together in other bacterial taxa. Using Smal1 RAYT sequence as query, exhaustive BLAST search was performed to identify RAYT homologs in other prokaryotic organisms. Retrieved homologs, all of which contained the "Pfam01797: Transposase_17" domain (peculiar to IS200/IS605 transposases), were tested for the presence of palindrome-containing inverted repeats flanking their genes. Subsequently, the number of these putative REPs in host genomes was determined. Only RAYTs associated with abundant REPs were further analysed. Detected RAYTs are listed in Table 1. RAYT homologs suiting our criteria were only found in gammaproteobacteria.

All detected REPs consist of GT(A/G)G head and GC-rich imperfect palindrome with potential to form stem-loop structures in single-stranded state (Table 1). Importantly, in all cases when REP sequences were determined in bacterial species taken into our analysis prior to this work, REPs identified by our approach are in agreement with these sequences. This concerns Escherichia coli [19], Salmonella sp. [23], Pseudomonas putida Pput2 [16] and Stenotrophomonas maltophilia Smal4 [24] REPs. For example, E. coli RAYT-coding gene (yafM) is delimited by two different REPs (Table 1). These are in fact Y and Z2 palindromic units, constituents of modular BIMEs (BIME-2 and atypical BIMEs) [25]. E. coli rayt itself is flanked by BIME-2 on both sides. Similar direct association with BIME was observed in total for one third of detected RAYTs (Table 1) in various species.

Further, we examined distribution of identified REPs in host genomes. Analysis revealed that most REPs are arranged in clusters (Additional file 1). In some cases (pseudomonads, Thioalkalivibrio sp.), the most predominant type of clusters is a doublet of REPs in inverted orientation. These REP doublets, together with embedded inter-REP sequences, might themselves represent compound repeated elements, analogous to E. coli BIME-1. This is supported by structure of recently described Pseudomonas fluorescens repeats [26]. The R0 family consists of 612 repeats (89 bp in length) that have two inverted elements at their termini, identical with Pflu1 REPs.

In contrast to the doublet arrangement, Xanthomonas campestris REPs are in great majority found in large clusters, consisting of regularly spaced REPs in alternating orientations (Additional file 1), typical features of BIMEs. In remaining cases, solitary REPs are found along with doublets and BIMEs.

Preliminary analysis confirmed that the great majority of all identified REPs are extragenic (data not shown) and thus further fulfill the definition of REP elements.

Evolution of RAYTs and REPs

Since REPs share several common structural features, they are likely to represent a group of related elements. We wondered if the same is true for RAYTs. Because RAYTs were detected due to similarity of their protein sequences (see above), they are thought to be structurally related. To specify this relationship, an alignment of selected RAYTs together with reference set of "typical" IS200/IS605 transposases was constructed (Figure 4). The alignment reveals that all catalytically confirmed residues - histidine-hydrophobic-histidine motif and nucleophilic tyrosine - are conserved in both groups. It is thus reasonable to conclude that RAYTs are capable of cleaving DNA with formation of DNA-RAYT covalent intermediate. On the contrary, several motifs and conserved residues are peculiar only to RAYTs. This is in particular true for 100% conserved threonine near N-terminus and the NP(L/V)(R/K)xG motif that is located close to C-terminus adjacently to nucleophilic tyrosine.

Figure 4
figure 4

Multiple sequence alignment of selected RAYTs and reference set of IS 200 /IS 605 family transposases. Conserved residues are highlighted: blue - conserved in RAYTs, yellow - conserved in reference IS200/IS605 transposases, green - conserved in both groups. Substitutions for residues with similar chemical properties are permitted: acidic - D, E, basic - H, K, R, aromatic - F, Y, W, branched-chain hydrophobic- I, L, V. Conserved residues that constitute catalytic center are denoted with red asterisk. Reference set of IS200/IS605 transposases, along with their symbols, was taken from [5].

The presence of these unique structural features could signify that RAYTs are monophyletic group of proteins. The question therefore arises as to whether the entire RAYT clade has been evolving with their corresponding REPs, as seen in Stenotrophomonas (Figure 3). Due to rather high divergence of REPs, it is not possible to construct their accurate phylogram. However, REPs show group-specific features that correlate well with phylogenetic grouping of their cognate RAYTs. For example, enterobacterial RAYTs are clearly monophyletic (Additional file 2) and accordingly, their REPs are rather long, substantially dimorphic and their palindrome is interrupted twice (Table 1). Furthermore, uniquely for REPs of monophyletic Pseudomonas and Xanthomonas RAYTs (Additional file 2), 5´-GA-3´ dinucleotide is inserted between their GT(A/G)G head and palindrome-forming part (Table 1). Together, these observations support long-term coevolution of RAYTs and and their cognate REPs.

Next, we examined chromosomal localization of rayt genes. Among RAYTs listed in Table 1, three couples of orthologous rayt genes (Pput1 and Pput2, Pput3 and Pput4, Smal3 and S_sp2), located in the same genomic context in different host species or strains, were identified (Figure 5). These orthologs have, due to the shared synteny, unambiguously evolved from a common ancestor and allow us to trace back changes they have gone through following divergence event. Although orthologous rayt genes do not change their genomic position, their flanking REPs differ in up to three point mutations (Table 1) and still retain palindromicity and inverted repeat arrangement. Evidently, strong selective pressure works for preservation of these REP traits, underlining their functional importance. It is extremely improbable that repeated changes in REP sequences flanking these orthologs result merely from random fixation of successive random mutations.

Figure 5
figure 5

Sequence identity between orthologous RAYTs and proteins coded for by neighboring orthologous genes.

Comparison of sequence identity between orthologous rayt genes revealed an interesting phenomenon. In all three cases, the degree of identity of the RAYT amino acid sequences was significantly less than that of the flanking genes (Figure 5), suggesting that RAYTs evolve more faster than protein products of common genes. Possible explanation for this accelerated evolution is included in the Discussion section.

Relationship of RAYTs, REPs and BIMEs

We have shown so far that RAYTs and REPs are evolutionarily and physically connected. Since REPs are known to be species (or strain)-specific and the same applies to RAYTs (Table 1), it is possible that the presence of a particular RAYT itself in one bacterium might be responsible for proliferation of corresponding REPs.

Where genome sequences suitable for comparison were available, strains differing by the presence or absence of a particular rayt gene were tested for prevalence of REPs in their genomes. In most cases, a strong correlation between rayt presence and total number of its cognate REPs was found (Table 2), rayt-bearing strains containing on average ten times more REPs in their genomes than strains devoid of rayt genes. These results indeed suggest that presence of a given RAYT is the direct cause of REP sequences proliferation over host chromosome.

Table 2 Correlation between REP numbers and presence or absence of their cognate RAYTs in different bacterial strains

In search of support for this hypothesis, we found that in three marine gammaproteobacteria and one betaproteobacterium (all possessing clear RAYT homologs), the distribution of inverted palindromic repeats flanking their rayt genes is not genome-wide (as in other REP cases). Instead, REPs are accumulated proximally to particular rayt gene (Additional file 3). The REP-containing regions span at most two hundreds of kilobases. In the case of the marine gammaproteobacteria, the physical association between rayt genes and REPs is very pronounced. Thauera sp. (a betaproteobacterium) is of special interest because it has obviously acquired its RAYT by horizontal transfer from gammaproteobacteria. This RAYT displays highest sequence similarity to Pseudomonas RAYTs (56% identical residues), has no counterpart in other betaproteobacteria and its REP sequences are also Pseudomonas-like (Table 1, Additional file 3). High numbers of REPs are present in the Thauera genome. More than a third are located proximally to rayt gene. This suggests that, following acquisition of the rayt gene, new REP copies have been preferentially produced in its vicinity.

Physical association with rayt genes was already shown for BIMEs (Table 1). Upon closer examination, we detected four cases where 3´ end of rayt gene, together with sequence between rayt stop codon and downstream REP, is integrated into BIME, becoming a part of BIME´s inter-REP segment (Additional file 4). This unexpected observation proves that the mechanism responsible for establishment of BIMEs is also directed to rayt genes.

Discussion

We have characterized a novel class of transposases, closely related to IS200/IS605 family. What makes these transposases (RAYTs) unique is the obligate delimitation of their genes by two inverted palindromic sequences (REPs), which are at the same time highly overrepresented in host genomes. We have shown that this type of organization (REP-rayt-REP, Figure 1B) has been preserved during evolution and that both RAYTs and REPs undergo long-term coevolution. Characteristic structural elements in both RAYT and REP sequences suggest that all detected RAYTs and REPs are descendants of a common ancestor. We propose that their origin dates to the period after branching of the gammaproteobacteria, since no homologs have been found in other major bacterial lineages.

The structure of a rayt gene flanked by two oppositely orientated REPs is strikingly reminiscent of the organization of a typical bacterial insertion sequence. The position of REPs as terminal sequences for RAYT-encoding genes is supported by the fact that they are in many cases located very close or even immediately downstream of the rayt gene stop codon (Additional file 4), excluding additional terminal sequences. There are other known transposase genes associated with REPs, however, all of them are contained in bona fide ISs, complete with their own terminal sequences [27–34]. These ISs use REPs as targets for their transposition.

We have not found typical signs of IS-like mobility for RAYTs, i.e. presence of their multiple copies in host genomes and changes of chromosomal location. This might indicate that RAYTs have lost the ability to transpose their own genes. Still, there are at least two reasons to assume that RAYTs recognize REPs and cleave DNA strand in their proximity. By mere analogy, transposases always bind and cleave sequences that flank their genes during the course of transposition. This precise positioning of REPs by rayt genes is conserved. Moreover, related IS200/IS605 transposases recognize stem-loop structures [4, 5] that can readily arise from imperfect palindromes like those contained in REP sequences.

One of the most interesting outcomes of this study is the previously unrecognized wide distribution of BIME elements. BIMEs were detected in most of RAYT- and REP-carrying species (Figure 2 and data not shown). Apparently, there is a common mechanism of BIMEs formation. The mechanism is targeted to rayt genes, one third of which are directly associated with BIMEs (Table 1). Furthermore, 3´ termini of rayt genes were found captured between REPs in four rayt-adjacent BIMEs (Additional file 4). BIMEs are known to exhibit extensive interstrain differences in length and distribution [24, 27] that seem unlikely to result solely from processes such as homologous recombination or DNA polymerase strand-slippage. We hypothesize that the putative RAYT-catalyzed reaction, as described further, may pose the driving force behind BIME establishment and dynamics.

In the simplest case, the information contained in a REP sequence would be sufficient for its recognition and cleavage by RAYT. Because of high level of conservation of the 5´ head sequences (Table 1), we hypothesize that they might serve as determinants of position of cleavage site. Presumed REP-targeted RAYT activity would then result, for example, in reversible formation of a free hydroxyl group and covalent attachment of 5´ terminus of REP sequence to RAYT protein (Figure 6A). Host genomes typically harbor hundreds of REPs and all of them present potential substrates for RAYTs. The RAYT activity can thus account for various imaginable DNA rearrangements. There are two important aspects of presumed RAYT catalysis. Firstly, the transiently present free hydroxyl group can serve as a primer for initiation of DNA replication. Secondly, in trans ligation (reverse RAYT-catalyzed reaction) might occur relatively frequently between two RAYTs that act on different REPs. Because the assembly of catalytic site in related IS200/IS605 transposases is achieved by dimerization (due to their limited size), RAYTs probably form dimers as well. The physical proximity of two subunits enhances the frequency of in trans ligations.

Figure 6
figure 6

Model of RAYT action. (A) REP-specific DNA cleavage and ligation - scheme of hypothetical reaction (B) Model of RAYT-dependent BIME proliferation. REPs are represented with red arrows at corresponding DNA strands. REP-complementary sequences at opposite strands are represented with striped arrows. Two inter-REP segments are denoted in green and blue, respectively. RAYTs that form one dimer are denoted in the same color. Asterisks denote free OH groups. For details about the model, see text.

We suggest that REP-dependent RAYT activity is responsible for some of the unusual observations regarding REPs. For example, high number of REPs in host genome is conditioned by presence of their cognate RAYT (Table 2). Further, the substitution rate for rayt genes was shown to greatly exceed the rate of substitutions in surrounding host genes (Figure 5). If RAYTs cleave in adjacency of their flanking REPs, resulting OH groups may prime DNA replication into rayt gene, leading to partial or complete replacement of one or both strands. When several rounds of such replication are performed during each cell cycle, excessive mutations accumulate. Although this is a rather complicated theory, the alternatives, like strong positive selection for mutated RAYTs, are equally uneasy to substantiate.

Another process we propose is RAYT-dependent is the preferential formation of new REPs in vicinity of a rayt gene (Additional file 3), following its horizontal transfer into the host. In this case, acquired RAYT obviously causes new REPs´ production, possibly through multiplication of existing REPs flanking its gene.

A possible model of BIME formation is depicted in Figure 6B. Starting with one basic module of BIMEs (two directly repeated REPs and one REP between them in inverted orientation - Figure 2), RAYT dimer cleaves at both top-strand REPs. Another RAYT dimer works on bottom strand, due to presence of single REP, only one unit of the dimer is attached to REP after cleavage. Upon in trans ligation within the frame of "yellow" dimer, circularized basic module and bottom strand hold together by their complementary parts. The circle is primed by the free OH group resulting from RAYT cleavage of bottom strand. At this point, rolling circle replication of basic module begins. The main replicative DNA polymerase (Pol III holoenzyme in E. coli) might accomplish the process on its own, since it was shown to possess intrinsic moderate strand-displacement activity [35]. The amplified basic module (BIME) is cut off from the rolling circle after the second unit of "blue" RAYT dimer cleaves newly synthetized REP. Then, second in trans ligation within the frame of "blue" dimer integrates BIME into the bottom strand. Following replication of chromosome and separation of daughter cells, one of them contains a modular BIME in its genome.

Taken together, we have gathered considerable amount of in silico evidence to propose significant role of transposases in generation of bacterial intergenic repeats. If our assumptions are true, then REPs and BIMEs represent a novel class of nonautonomous TEs. To confirm this, additional experiments are needed to simulate interaction between RAYTs and REPs in vivo and in vitro.

Conclusions

Our findings offer an alternative approach for rapid identification of REPs in gammaproteobacterial genome sequences. Putative RAYT homologs can easily be found by a simple BLAST of conserved C-terminal part of any of known RAYTs against particular genome sequence. Invertedly positioned REPs can then be located flanking the rayt gene. Known REPs proved to be a useful tool for typing of intraspecific isolates, with high discriminatory power due to extensive REP dynamics [16, 36]. REP typing is, compared to other methods, very fast and inexpensive, since it only requires one PCR, run from REP-complementary primers against chromosomal DNA template.

Upon determination of REP sequences, BIMEs can readily be identified in host genomes. Since BIMEs exhibit exceptional length polymorphism, they have been utilized as reliable markers for strain determination. As in previous case, the procedure is advantageous because of its quickness and simplicity [24, 27].

Methods

Bacterial genome sequences were downloaded from NCBI web site [37].

Direct and inverted repeats in rayt-flanking sequences were looked up with OligoRep [38].

REPs position and total number determinations and graphical plots were performed using pDRAW32 [39].

Multiple protein sequence alignments were constructed using MCOFFEE [40]. Phylogenetic trees were constructed using Drawtree or Drawgram applications from MOBYLE package [41]. Protein sequence trees were constructed from template CLUSTALX tree files (.ph) generated during MCOFFEE alignment. REP sequence trees were constructed from CLUSTALX guide tree files (.dnd) after being aligned with CLUSTALW [41].

Abbreviations

BIME:

 Bacterial Insterspersed Mosaic Element

IS:

 Insertion Sequence

MITE:

 Miniature Inverted repeat Transposable Element

RAYT:

 REP-Associated tYrosine Transposase

RC:

 Rolling Circle

REP:

 Repetitive Extragenic Palindrome

TE:

 Transposable Element

References

  1. Lam S, Roth JR: IS200: a Salmonella-specific insertion sequence. Cell. 1983, 34 (3): 951-960. 10.1016/0092-8674(83)90552-4.

    Article  CAS  PubMed  Google Scholar 

  2. Filee J, Siguier P, Chandler M: Insertion sequence diversity in archaea. Microbiol Mol Biol Rev. 2007, 71 (1): 121-157. 10.1128/MMBR.00031-06.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  3. IS Finder. [http://www-is.biotoul.fr/]

  4. Ronning DR, Guynet C, Ton-Hoang B, Perez ZN, Ghirlando R, Chandler M, Dyda F: Active site sharing and subterminal hairpin recognition in a new class of DNA transposases. Mol Cell. 2005, 20 (1): 143-154. 10.1016/j.molcel.2005.07.026.

    Article  CAS  PubMed  Google Scholar 

  5. Lee HH, Yoon JY, Kim HS, Kang JY, Kim KH, Kim DJ, Ha JY, Mikami B, Yoon HJ, Suh SW: Crystal structure of a metal ion-bound IS200 transposase. J Biol Chem. 2006, 281 (7): 4261-4266. 10.1074/jbc.M511567200.

    Article  CAS  PubMed  Google Scholar 

  6. Barabas O, Ronning DR, Guynet C, Hickman AB, Ton-Hoang B, Chandler M, Dyda F: Mechanism of IS200/IS605 family DNA transposases: activation and transposon-directed target site selection. Cell. 2008, 132 (2): 208-220. 10.1016/j.cell.2007.12.029.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  7. Monzingo AF, Ozburn A, Xia S, Meyer RJ, Robertus JD: The structure of the minimal relaxase domain of MobA at 2.1 A resolution. J Mol Biol. 2007, 366 (1): 165-178. 10.1016/j.jmb.2006.11.031.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  8. Prak ET, Kazazian HH: Mobile elements and the human genome. Nat Rev Genet. 2000, 1 (2): 134-144. 10.1038/35038572.

    Article  CAS  PubMed  Google Scholar 

  9. Kapitonov VV, Jurka J: Rolling-circle transposons in eukaryotes. Proc Natl Acad Sci USA. 2001, 98 (15): 8714-8719. 10.1073/pnas.151269298.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  10. Siguier P, Filee J, Chandler M: Insertion sequences in prokaryotic genomes. Curr Opin Microbiol. 2006, 9 (5): 526-531. 10.1016/j.mib.2006.08.005.

    Article  CAS  PubMed  Google Scholar 

  11. Buisine N, Tang CM, Chalmers R: Transposon-like Correia elements: structure, distribution and genetic exchange between pathogenic Neisseria sp. FEBS Lett. 2002, 522 (1-3): 52-58. 10.1016/S0014-5793(02)02882-X.

    Article  CAS  PubMed  Google Scholar 

  12. Oggioni MR, Claverys JP: Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae. Microbiology. 1999, 145 (Pt 10): 2647-2653.

    Article  CAS  PubMed  Google Scholar 

  13. Higgins CF, Ames GF, Barnes WM, Clement JM, Hofnung M: A novel intercistronic regulatory element of prokaryotic operons. Nature. 1982, 298 (5876): 760-762. 10.1038/298760a0.

    Article  CAS  PubMed  Google Scholar 

  14. Tobes R, Ramos JL: REP code: defining bacterial identity in extragenic space. Environ Microbiol. 2005, 7 (2): 225-228. 10.1111/j.1462-2920.2004.00704.x.

    Article  CAS  PubMed  Google Scholar 

  15. Tobes R, Pareja E: Repetitive extragenic palindromic sequences in the Pseudomonas syringae pv. tomato DC3000 genome: extragenic signals for genome reannotation. Res Microbiol. 2005, 156 (3): 424-433. 10.1016/j.resmic.2004.10.014.

    Article  CAS  PubMed  Google Scholar 

  16. Aranda-Olmedo I, Tobes R, Manzanera M, Ramos JL, Marques S: Species-specific repetitive extragenic palindromic (REP) sequences in Pseudomonas putida. Nucleic Acids Res. 2002, 30 (8): 1826-1833. 10.1093/nar/30.8.1826.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  17. Gilson E, Saurin W, Perrin D, Bachellier S, Hofnung M: Palindromic units are part of a new bacterial interspersed mosaic element (BIME). Nucleic Acids Res. 1991, 19 (7): 1375-1383. 10.1093/nar/19.7.1375.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  18. Boccard F, Prentki P: Specific interaction of IHF with RIBs, a class of bacterial repetitive DNA elements located at the 3' end of transcription units. Embo J. 1993, 12 (13): 5019-5027.

    CAS  PubMed Central  PubMed  Google Scholar 

  19. Bachellier S, Clement JM, Hofnung M: Short palindromic repetitive DNA elements in enterobacteria: a survey. Res Microbiol. 1999, 150 (9-10): 627-639. 10.1016/S0923-2508(99)00128-X.

    Article  CAS  PubMed  Google Scholar 

  20. Espeli O, Boccard F: In vivo cleavage of Escherichia coli BIME-2 repeats by DNA gyrase: genetic characterization of the target and identification of the cut site. Mol Microbiol. 1997, 26 (4): 767-777. 10.1046/j.1365-2958.1997.6121983.x.

    Article  CAS  PubMed  Google Scholar 

  21. Gilson E, Perrin D, Hofnung M: DNA polymerase I and a protein complex bind specifically to E. coli palindromic unit highly repetitive DNA: implications for bacterial chromosome organization. Nucleic Acids Res. 1990, 18 (13): 3941-3952. 10.1093/nar/18.13.3941.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  22. Espeli O, Moulin L, Boccard F: Transcription attenuation associated with bacterial repetitive extragenic BIME elements. J Mol Biol. 2001, 314 (3): 375-386. 10.1006/jmbi.2001.5150.

    Article  CAS  PubMed  Google Scholar 

  23. Gilson E, Perrin D, Saurin W, Hofnung M: Species specificity of bacterial palindromic units. J Mol Evol. 1987, 25 (4): 371-373. 10.1007/BF02603122.

    Article  CAS  PubMed  Google Scholar 

  24. Roscetto E, Rocco F, Carlomagno MS, Casalino M, Colonna B, Zarrilli R, Di Nocera PP: PCR-based rapid genotyping of Stenotrophomonas maltophilia isolates. BMC Microbiol. 2008, 8: 202-10.1186/1471-2180-8-202.

    Article  PubMed Central  PubMed  Google Scholar 

  25. Distribution of repeated palindromes on E. coli K-12 chromosome. [http://www.pasteur.fr/recherche/unites/pmtg/repet/distrib.html]

  26. Silby MW, Cerdeno-Tarraga AM, Vernikos GS, Giddens SR, Jackson RW, Preston GM, Zhang XX, Moon CD, Gehrig SM, Godfrey SA: Genomic and genetic analyses of diversity and plant interactions of Pseudomonas fluorescens. Genome Biol. 2009, 10 (5): R51-10.1186/gb-2009-10-5-r51.

    Article  PubMed Central  PubMed  Google Scholar 

  27. Bachellier S, Clement JM, Hofnung M, Gilson E: Bacterial interspersed mosaic elements (BIMEs) are a major source of sequence polymorphism in Escherichia coli intergenic regions including specific associations with a new insertion sequence. Genetics. 1997, 145 (3): 551-562.

    CAS  PubMed Central  PubMed  Google Scholar 

  28. Tobes R, Pareja E: Bacterial repetitive extragenic palindromic sequences are DNA targets for Insertion Sequence elements. BMC Genomics. 2006, 7: 62-10.1186/1471-2164-7-62.

    Article  PubMed Central  PubMed  Google Scholar 

  29. Ramos-Gonzalez MI, Campos MJ, Ramos JL, Espinosa-Urgel M: Characterization of the Pseudomonas putida mobile genetic element ISPpu10: an occupant of repetitive extragenic palindromic sequences. J Bacteriol. 2006, 188 (1): 37-44. 10.1128/JB.188.1.37-44.2006.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  30. Clement JM, Wilde C, Bachellier S, Lambert P, Hofnung M: IS1397 is active for transposition into the chromosome of Escherichia coli K-12 and inserts specifically into palindromic units of bacterial interspersed mosaic elements. J Bacteriol. 1999, 181 (22): 6929-6936.

    CAS  PubMed Central  PubMed  Google Scholar 

  31. Choi S, Ohta S, Ohtsubo E: A novel IS element, IS621, of the IS110/IS492 family transposes to a specific site in repetitive extragenic palindromic sequences in Escherichia coli. J Bacteriol. 2003, 185 (16): 4891-4900. 10.1128/JB.185.16.4891-4900.2003.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  32. Wilde C, Bachellier S, Hofnung M, Carniel E, Clement JM: Palindromic unit-independent transposition of IS1397 in Yersinia pestis. J Bacteriol. 2002, 184 (17): 4739-4746. 10.1128/JB.184.17.4739-4746.2002.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  33. Wilde C, Escartin F, Kokeguchi S, Latour-Lambert P, Lectard A, Clement JM: Transposases are responsible for the target specificity of IS1397 and ISKpn1 for two different types of palindromic units (PUs). Nucleic Acids Res. 2003, 31 (15): 4345-4353. 10.1093/nar/gkg494.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  34. Wilde C, Bachellier S, Hofnung M, Clement JM: Transposition of IS1397 in the family Enterobacteriaceae and first characterization of ISKpn1, a new insertion sequence associated with Klebsiella pneumoniae palindromic units. J Bacteriol. 2001, 183 (15): 4395-4404. 10.1128/JB.183.15.4395-4404.2001.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  35. Canceill D, Viguera E, Ehrlich SD: Replication slippage of different DNA polymerases is inversely related to their strand displacement efficiency. J Biol Chem. 1999, 274 (39): 27481-27490. 10.1074/jbc.274.39.27481.

    Article  CAS  PubMed  Google Scholar 

  36. Versalovic J, Koeuth T, Lupski JR: Distribution of repetitive DNA sequences in eubacteria and application to fingerprinting of bacterial genomes. Nucleic Acids Res. 1991, 19 (24): 6823-6831. 10.1093/nar/19.24.6823.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  37. National Center for Biotechnology Information. [http://www.ncbi.nlm.nih.gov/]

  38. OligoRep system. [http://wwwmgs.bionet.nsc.ru/mgs/programs/oligorep/]

  39. pDRAW32 DNA analysis software. [http://www.acaclone.com/]

  40. Moretti S, Armougom F, Wallace IM, Higgins DG, Jongeneel CV, Notredame C: The M-Coffee web server: a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Nucleic Acids Res. 2007, W645-648. 10.1093/nar/gkm333. 35 Web Server

  41. Mobyle, a portal for bioinformatics analyses. [http://mobyle.pasteur.fr/cgi-bin/portal.py]

Download references

Acknowledgements

This work was supported by Research Center LC06066 from the Ministry of Education, Youth and Sports of the Czech Republic.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jaroslav Nunvar.

Additional information

Authors' contributions

JN carried out the analyses and wrote the manuscript. TH identified horizontal transfer in tetracycline-resistant strains. IL supervised the work and critically read the manuscript. All authors read and approved of the final manuscript.

Electronic supplementary material

12864_2009_2638_MOESM1_ESM.PDF

Additional File 1: Distribution of REP sequences in genomes of selected bacteria. REP coordinates, orientation and number of mismatches with respect to REP sequences in Table 1 are indicated. If dimorphic REPs appertain to a given RAYT (for example in Enterobacter sakazakii), they are denoted as REPA (upper line in Table 1) and REPB (lower line in Table 1). (PDF 743 KB)

12864_2009_2638_MOESM2_ESM.PDF

Additional File 2: Phylogram of all RAYT proteins listed in Table1with reference IS 200 /IS 605 transposases as outgroup. Putative root of phylogram is denoted with a star. (PDF 77 KB)

Additional file 3: Examples of colocalization of REP sequences with rayt genes.(PDF 153 KB)

12864_2009_2638_MOESM4_ESM.PDF

Additional File 4: Incorporation of rayt gene 3´ terminus into BIME. REPs are highlighted in red, their GT(A/G)G head is in bold and blue. rayt gene is denoted in italics, bold and underlined. Inter-REP segments are highlighted in blue and yellow, or gray, or green, respectively. (PDF 14 KB)

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Nunvar, J., Huckova, T. & Licha, I. Identification and characterization of repetitive extragenic palindromes (REP)-associated tyrosine transposases: implications for REP evolution and dynamics in bacterial genomes. BMC Genomics 11, 44 (2010). https://doi.org/10.1186/1471-2164-11-44

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1471-2164-11-44

Keywords