Complete genome determination and analysis of Acholeplasma oculi strain 19L, highlighting the loss of basic genetic features in the Acholeplasmataceae
BMC Genomics volume 15, Article number: 931 (2014)
Acholeplasma oculi belongs to the Acholeplasmataceae family, comprising the genera Acholeplasma and ‘Candidatus Phytoplasma’. Acholeplasmas are ubiquitous saprophytic bacteria. Several isolates are derived from plants or animals, whereas phytoplasmas are characterised as intracellular parasitic pathogens of plant phloem and depend on insect vectors for their spread. The complete genome sequences for eight strains of this family have been resolved so far, all of which were determined depending on clone-based sequencing.
The A. oculi strain 19L chromosome was sequenced using two independent approaches. The first approach comprised sequencing by synthesis (Illumina) in combination with Sanger sequencing, while single molecule real time sequencing (PacBio) was used in the second. The genome was determined to be 1,587,120 bp in size. Sequencing by synthesis resulted in six large genome fragments, while the single molecule real time sequencing approach yielded one circular chromosome sequence. High-quality sequences were obtained by both strategies differing in six positions, which are interpreted as reliable variations present in the culture population. Our genome analysis revealed 1,471 protein-coding genes and highlighted the absence of the F1FO-type Na+ ATPase system and GroEL/ES chaperone. Comparison of the four available Acholeplasma sequences revealed a core-genome encoding 703 proteins and a pan-genome of 2,867 proteins.
The application of two state-of-the-art sequencing technologies highlights the potential of single molecule real time sequencing for complete genome determination. Comparative genome analyses revealed that the process of losing particular basic genetic features during genome reduction occurs in both genera, as indicated for several phytoplasma strains and at least A. oculi. The loss of the F1FO-type Na+ ATPase system may separate Acholeplasmataceae from other Mollicutes, while the loss of those genes encoding the chaperone GroEL/ES is not a rare exception in this bacterial class.
Acholeplasma species comprise bacteria of the family Acholeplasmataceae in the class Mollicutes, characterised by the lack of sterol requirement for growth and thereby separated from Mycoplasmataceae and Spiroplasmataceae. The majority of Acholeplasma spp. are described as saprophytes and commensals. An evident assignment as pathogens is hampered by the fact that several Acholeplasma spp. are distributed ubiquitously. Moreover, no primary pathogen is described for this genus. However, the isolation of strains from diseased animals, and classification as putative animal pathogens, applies to species such as A. axanthum and A. oculi[2, 3]. This assignment of the type strain A. oculi 19L (syn. A. oculusi) was the result of its isolation from goat eyes infected with keratoconjunctivitis and re-infection experiments . However, the assignment of A. oculi to this disease is rare in contrast to several Mycoplasma spp. .
Besides Acholeplasma, the Acholeplasmataceae family also includes the provisory taxon ‘Candidatus Phytoplasma’. Phytoplasmas are associated with several hundred plant diseases – and thus significant economic losses . After insect vector-mediated transmission, phytoplasmas colonise as intracellular obligate parasites the sieve cells of a plant, often resulting in abnormal growth and reduced vitality. No general evidence for pathogenesis by acholeplasmas in colonised insects and plants has been provided to date. However, a recently published study on the A. laidlawii strain PG-8 supports its phytopathogenicity, which can be increased after nanotransformation in ultramicroform cells and might be correlated to extracellular vesicle formation under experimental conditions . Further studies are needed in this respect, but the results may indicate a mechanism shared by both genera. In contrast, experimentally proven effector proteins or membrane proteins involved in phytoplasma-host interaction were not identified in the acholeplasmas . These genetic elements of phytoplasmas might have originated from horizontal gene transfers. Massive gene loss, in combination with duplication events and genome instability, separates the phytoplasmas from the acholeplasmas. The complete genome sequences of eight strains of this family have been published, comprising the acholeplasmas A. laidlawii strain PG-8A , A. brassicae strain O502 and A. palmae strain J233  and the phytoplasmas ‘Ca. P. australiense’ strain rp-A  and NZSb11 , ‘Ca. P. asteris’ strain OY-M  and AY-WB  and ‘Ca. P. mali’ strain AT . In the past, all five phytoplasma strains and A. laidlawii were sequenced by applying the whole genome shotgun approach and using plasmid or fosmid libraries as templates for dye-terminator sequencing (Sanger sequencing). In determining the chromosome sequences of A. brassicae and A. palmae, a combination of Sanger sequencing and next generation sequencing methods (pyrosequencing, 454 Life Sciences/Roche) was applied for the first time to this bacterial family .
Both taxa show characteristic gene losses. In comparison to acholeplasmas, phytoplasmas lack the F1FO ATPase synthetase complex, the cell division protein FtsZ, a wider variety of ABC transporters, the Rnf complex and the membrane protein SecG of the Sec-dependent secretion system. Moreover, acholeplasmas possess a rich repertoire of enzymes involved in carbohydrate metabolism, fatty acids, isoprenoids and partial amino acid metabolism . Because these findings were inferred from the analyses of three acholeplasma and five phytoplasma genome sequences, it remains unclear as to what extent these differences between the two genera can be truly generalised or if the other acholeplasmas might share some of these features of their genetic repertoire with the phytoplasmas. Therefore, we determined the complete genome of A. oculi strain 19L by applying two different strategies based on sequencing by synthesis (SBS, Illumina) and, in a second approach, single molecule real time (SMRT, PacBio) sequencing. The subsequent analyses highlight the efficiency of current sequencing technologies and provide remarkable insights into the evolution of Acholeplasmataceae.
Comparison of assemblies derived from SBS and SMRT sequencing
The SBS and SMRT approaches enabled the efficient reconstruction of the complete genome sequence in independent experiments. SBS sequencing provided 1,095,574 paired-end quality passed reads with an average length of 101 nt (total read length of 110,652,974 nt). De novo assembly of the SBS-derived reads alone led to the incorporation of 964,613 reads (88%) into six large contigs (513260 bp, 244477 bp, 109253 bp, 547590 bp, 106516 bp and 52461 bp in size), showing a 64-fold read coverage on average (Table 1, Figure 1) and reaching a total contig length of 1,573,557 bp. In turn, the mapping of SBS reads on the finished genome sequence revealed no uncovered regions. Gap regions derived from the SBS de novo assembly cover repetitive sequences of high similarity. In detail, two gaps (4,748 bp and 4,898 bp in size) include the rRNA operon regions (99% sequence identity), two gaps (1,661 bp each) include two transposases (100% sequence identity), one gap (186 bp) borders Acholeplasma phage L2 (>92% sequence identity) and the smallest gap (176 bp) is located close to a heavy metal translocating P-type ATPase (92% sequence identity). Gaps derived from the assembly of SBS reads were closed by primer-walking (Sanger sequencing), resulting in a complete circular chromosomal sequence.
The SMRT sequencing approach provided 42,300 SMRT reads with a mean read length of 6,747 nt (total read length of 285,414,973 nt). A total of 38,875 reads enabled the gapless reconstruction of the circular chromosome at a size of 1,587,116 bp. A consensus concordance of 99.9991% and 144.1-fold average sequence coverage were reached. Around 8% of the SMRT-derived reads were rejected due to insufficient quality or incomplete read processing during the assembly.
Apart from the 14 kb size difference in the total contig length obtained by the two methods, it is remarkable that only 28 positions differ, concerning 31 bases in total (Table 2). Amplifying these positions, and re-sequencing by Sanger, led to the unambiguous assignment of 22 positions. In addition, the results indicated the presence of six polymorphic positions. Thus, both approaches enabled the efficient reconstruction of the complete chromosome sequence. The final sequence is in accordance with the results obtained through the Sanger sequencing of PCR products covering the ambiguous regions. Consequently, only polymorphisms occurring at high frequency in one PCR template were detected as double peaks in the chromatograms. We conclude that the majority of differences indicate that polymorphisms were already present in the original cell population. Furthermore, one of the substitutions results in a non-synonymous exchange (Aocu_13520/ rpsE) within the 30S ribosomal protein S5, while three substitutions result in synonymous base exchanges (Aocu_08380, Aocu_10040/ mutS, Aocu_14610) and two positions are located in intragenic regions. A high incidence of differences (14 nucleotides, Table 2) was identified within a putative phage-associated region (position 972,742-977,480), which may undergo rapid degradation and comprises an integrase (Aocu_08840) and fragments of a restriction-modification system (truncated restriction endonuclease, Aocu_08860; truncated restriction endonuclease, Aocu_08870 and hsdM, Aocu_08880).
In summary, Sanger sequencing confirmed the SBS-derived sequences for 22 out of 28 differences. The deviating SMRT data at these positions may indicate errors or rare sequence variations within the final chromosome sequence of 1,587,120 bp. Polymorphic sites at six sequence positions of the chromosome are supported by SMRT assembly, deviating SBS reads and Sanger sequencing.
Benchmarks of the genome of A. oculi and its comparison to other Acholeplasmataceae
The finished consensus sequence of A. oculi strain 19L consists of 1,587,120 bp encoding two rRNA operons, 34 tRNA genes and 1,471 predicted protein coding genes (Table 3). A. laidlawii strain PG-8A is the closest known relative of A. oculi strain 19L, which is supported by the construction of the phylogenetic tree (Figure 2). This close relationship is also reflected by the prediction of 1,068 shared proteins (77%) compared to 866 (60%) and 973 (57%) proteins shared with A. brassicae strain O502 and A. palmae strain J233 (Figure 3). The predicted core of the four Acholeplasma spp. consists of 703 proteins and the calculated pan-genome comprises 2,867 proteins in total (Figure 4). The highest number of unique proteins (570) possesses A. brassicae, which also exhibits the largest genome in this family (1,877,792 bp, 1,704 protein coding genes, Table 3).
Only a basic set of proteins is shared between A. oculi and the five complete phytoplasma genomes. The 293 to 310 predicted shared proteins (27% to 59%) are consistent with previously calculated numbers for other acholeplasmas [7, 8] (Figure 3). The second highest number of unique proteins (440) is predicted for A. palmae (Figure 4), which is the closest known relative of the phytoplasmas  (Figure 2) and is also supported by the highest number of predicted shared proteins with the five completely sequenced phytoplasmas (Additional file 1). In second position among the acholeplasmas, A. oculi shares many of its proteins with the phytoplasmas, supported by its phylogenetic assignment and the received PanOCT results. This analysis is also supported by A. oculi and A. laidlawii, which share the highest number of proteins amongst the acholeplasmas (Figure 3). The phytoplasmas’ genome reduction process is reflected by the low number of 294 proteins assigned to the shared core within the pan-genome (2,077 proteins in total; Figure 5). Phytoplasma genomes are characterised by extensive gene losses, transposon-mediated gene duplication  and horizontal gene-integration events . Comparing the pan-genomes of acholeplasmas and phytoplasmas, Venn analysis highlights basic differences in the overall gene content. Complete Acholeplasma and ‘Ca. Phytoplasma’ genomes collectively encode 402 and 14 predicted unique proteins, respectively (Figure 6, Additional file 2). The 14 unique genes, which are common to the genus ‘Ca. Phytoplasma’, encode nine hypothetical proteins and five proteins with known functions. Two of the hypothetical proteins contain a sequence-variable mosaic (SVM) motif  and comprise SAP05 (AYWB_032), which is described as a putative effector protein  inducing the formation of smooth young rosette leaves that lack serrations along the leaf margin , and SAP30 (AYWB_402), which is similar to SAP11 containing an eukaryotic nuclear localisation signal [19, 20]. This group of unique genes also includes two phytoplasma proteins involved in a suggested alternative pathway in the carbohydrate metabolism of phytoplasmas [7, 13, 21]. The malate/Na + symporter (MleP) provides a carbon source which undergoes oxidative decarboxylation by malate dehydrogenase (SfcA), thereby providing pyruvate processed by the dehydrogenase multienzyme complex and providing acetyl coenzyme A. The phosphotransacetylation of acetyl-CoA performed by the PduL-like protein provides acetyl-phosphate, which is processed via acetate kinase (AckA) and results in the formation of ATP and acetate. A. oculi does not encode MleP, SfcA and the phosphate acetyltransferase (Pta), which is common in mycoplasmas, though it is suggested that PduL fulfills this function in Acholeplasmataceae[7, 21] including A. oculi. However, the alternative energy-yielding pathway of phytoplasmas utilising malate is clearly not encoded in the analysed acholeplasma genomes.
Furthermore, the PanOCT analysis predicted that phytoplasmas encode unique AAA+ ATPase, thymidylate kinase and a DNA-dependent RNA polymerase sigma 70 factor RpoD (IPR013325, IPR014284, IPR007627, Additional file 2). RpoD exhibits only insignificant BlastP hits to acholeplasmas’ sigma factors (minimal e-value 9e-08, score 47), and no ortholog was predicted via PanOCT. The existence of a phytoplasma-specific sigma factor points towards some peculiarities in their regulatory system. The other two deduced proteins showed similarities in BlastP analysis to some acholeplasma proteins, albeit they differed in small domain structures. For instance, the AAA+ ATPase of phytoplasmas gave a hit to the ATP-dependent zinc metalloprotease FtsH, which also contains the AAA+ domain structure, and the thymidylate kinases of acholeplasmas showed an additional conserved site (predicted by the PROSITE database search, http://prosite.expasy.org/) – contrary to the thymidylate kinases of phytoplasmas. The overall high number of 402 unique proteins for the four acholeplasmas is interpreted with respect to the diverse environments colonised by acholeplasmas.
Particularities of A. oculistrain 19L
A high percentage (55% – 148) of the 271 proteins predicted to be ‘unique’ for A. oculi in the Acholeplasmataceae are annotated as hypothetical proteins (Figure 1). This set of unique proteins contains phage-related proteins similar to the Acholeplasma phage L2 [GenBank: L13696.1] , including the proteins L2_7, L2_9, L2_11 and L2_12, and which are organised in clusters at three different chromosome regions (118,599-126,732 Aocu_01170-01290; 650,878-659,350 Aocu_05790-5880; 1,062,230-1,068,906 Aocu_09710-9810). In addition, two recombinases (Aocu_08890, Aocu_13990), two integrases (Aocu_05790, Aocu_08840) and one resolvase (Aocu_14000) belong to the list of unique proteins. Besides phage-assigned elements, six transposases were identified, including a mutator type (Aocu_01380), IS3/IS911 family protein (Aocu_01570), IS200-like (Aocu_03680), IS204/IS1001/IS1096/IS1165 (Aocu_04230/50) and other transposon-related elements (insertion element subunit, Aocu_03690). Furthermore, a putative complex transposon is associated with a region carrying a high number of unique proteins (134,519-157,461, Aocu_01380-01570). Beside other proteins, it encodes two oligopeptide ABC transporter components (periplasmatic component oppA and ATP-binding protein oppD) and six glycosyl hydrolase family proteins and a periplasmatic binding protein/LacI transcriptional regulator. A second candidate for a complex transposon (423,889-490,302 Aocu_03680-4250) encodes a UDP-galactopyranose mutase (Glf, Aocu_04190) in addition to another Glf (Aocu_04670) outside the proposed complex transposon. These genes set A. oculi apart from other acholeplasmas. The UDP-galactopyranose mutase is involved in the conversion of UDP-galactopyranose (UDP-GALP) into UDP-galactofuranose (UDP-GALF) (IPR004379), which is the precursor to D-galactofuranose and is often found in the lipopolysaccharide O antigens of several Gram-negative bacteria .
A. oculi is also separated from the other three acholeplasmas by the presence of a putative manganese efflux pump (MntP, Aocu_03470), thereby enabling the exportation of manganese ions, which are toxic in higher amounts. The functional relevance of MntP for manganese homoeostasis has been demonstrated for E. coli. The direct comparison of the A. oculi and the E. coli MntP protein shows 31% identical and 56% similar residues. In addition, A. oculi encodes a cadmium resistance transporter (CadD, Aocu_08600) and one amidohydrolase (AmhX, Aocu_08940). In Bacillus subtilis, AmhX enables the cleavage of the amide bond between non-active conjugated amino acids and may mobilise indole-3-acetic acid (IAA) from inactive storage forms in plants besides several other functions  (IPR017439 ). A. oculi was also detected on plant surfaces . Therefore, one may speculate whether A. oculi can stimulate the growth of colonised plant tissue. Hints for such a manipulation of the IAA metabolism of plants have also been obtained for A. palmae and A. brassicae encoding a putative auxin efflux carrier protein , though no experimental studies are available.
A. oculi is separated from the other Acholeplasma spp. by encoding several additional transcriptional regulators such as ubiC (Aocu_00680), gntR (Aocu_00690), Cro/C1 family proteins (Aocu_01770, Aocu_05750, Aocu_08910 and Aocu_13020) and TetR family proteins (Aocu_14450) not assigned to other Cro/C1-type or TetR family proteins in this family. In total, A. oculi encodes 13 Cro/C1 family proteins, nine of which are shared, and four TetR family proteins, one of which is shared by the other acholeplasmas.
Furthermore, A. oculi is separated from other Acholeplasmataceae by encoding the GDP-D-glycero-α-D-manno-heptose biosynthesis pathway providing D-glycero-D-manno-heptose (HddA, GmhA, HddC, GmhB; Aocu_04590-620). This is a precursor of the inner core lipopolysaccharide . These proteins are similar to those found in the pathway that was reconstructed for the Gram-positive bacteria Aneurinibacillus thermoaerophilus strain DSM 10155 (member of Bacillus/Clostridium group) . For acholeplasmas, there is only one report by Mayberry et al.  that A. modicum contains heptose among the glycolipids.
Moreover, A. oculi encodes two additional proteins, thus playing a role in the biosynthesis of the amino acid methionine. MetW (Aocu_08790) synthesises methionine from homoserine (IPR010743 ), which provides an additional pathway to produce methionine needed in the initiation of translation. The diaminopimelate epimerase (DapF, Aocu_08990) belongs to the aspartate pathway (IPR001653), from which the four amino acids lysine, threonine, methionine and isoleucine can be synthesised.
All species of the Acholeplasmataceae encode a protein core for the Sec-dependent secretion system (Ffh, FtsY, SecA, SecE, SecY and YidC), whereas the four analysed Acholeplasma spp. additionally encode the membrane protein SecG. The chaperone SecB, which is only encoded in A. laidlawii and A. oculi, binds the precursor protein and directs it to the SecA protein. The function of SecB can also be fulfilled by the proteins DnaK and DnaJ , which are encoded in all genome sequences of the family, or by GroEL and GroES . A. oculi lacks the common chaperone GroEL/ES (Figure 7), consistent with conclusions drawn from the draft sequences of phytoplasma strains  and the analyses of other species in the Mollicutes that these genes are not essential . The complete genome sequences of the Acholeplasmataceae encode the trigger factor (TF), dnaK, dnaJ, grpE and hrcA. Other heat shock proteins, such as Hsp20, were not identified in A. palmae and ‘Ca. P. mali’. Hsp33 is only identified in the acholeplasmas.
Beside GroEL/ES, A. oculi lacks the complete gene set encoding the F1FO-type Na+ ATPase, which was identified in A. laidlawii, A. brassicae and A. palmae (Figures 8 and 9). Therefore, A. oculi, A. laidlawii and A. palmae encode one V-type Na+ ATPase. A. palmae differs in gene content by encoding no atpC subunit for this ATPase. In addition, all genes encoding the V1VO H+ ATPase are present in all four acholeplasma strains. Summing up, each acholeplasma species possesses at least one full operon which encodes at least either one H+ or one Na+ ATPase system. The NtpG subunit, namely the rotated central stalk next to NtpD and NtpC , is missing in all species. In contrast to the acholeplasmas, the F- and V-type ATPases were not identified in phytoplasmas.
F1FO ATPases and V1VO ATPases are membrane complexes which function either as H+- or Na+-translocators  (Figure 9). The F1FO ATPase consists of two units – the integral membrane protein FO (atpBEF) acting as a proton channel and the peripheral catalytic stalk F1 (atpHAGDC). The V1VO ATPase is built by the integral membrane protein VO (ntpIK) and the peripheral catalytic stalk V1 (ntpECFABD) . The difference between both transporters is that the V-type ATPase only works in one direction by hydrolysing ATP to produce either a proton or a sodium motive force, while additionally the F-type ATPase is able to act in the other direction by allowing the regulation of the cellular ion pool using the proton motive force, which leads to ATP generation .
Following the sequence-based prediction of Dzioba et al. , the classification of the ion translocating profile can be inferred from the alignment of the protein sequences of the subunits AtpE (F-type ATPase) and NtpK (V-type ATPase). Certain conserved binding motifs are represented by the amino acids at specific positions, in order to specify an H+- or a Na+-translocation (Figure 9). As a result, one F1FO-type Na+ ATPase is suggested to be encoded by all acholeplasmas except for A. oculi, and one V-type Na+ ATPase is predicted for all acholeplasmas except for A. brassicae. It remains unclear as to whether the V-type Na+ ATPase of A. palmae is working despite the lack of an atpC subunit, although this species additionally encodes the F1FO-type Na+ ATPase. Moreover, protein sequence alignment leads to the conclusion that all acholeplasmas encode one V-type H+ ATPase. Deductively, all acholeplasmas encode at least one Na+ and one H+ translocator. This finding stands in accordance with the evidence that Acholeplasma laidlawii strain B possesses a (Na+-Mg2+)-ATPase which is capable of actively extruding sodium ions against the concentration gradient . This previously described, but not genetically characterised, cation pump was linked to the characteristically low intracellular sodium level of these bacteria.
Ultimately, the loss of the F1FO-type Na+ ATPase in Acholeplasmataceae, as is the case for A. oculi, may probably be compensated by the V-type Na+ ATPase. The loss of this genetic module in phytoplasmas remains unclear, but it might be interpreted in respect to the adaptation of phytoplasmas in the intracellular environment with constant osmotic conditions. The comparison of both V-type ATPase operons encoded by A. oculi highlights low sequence identities of the involved proteins (24% to 52%) and differences in protein lengths (Figure 9). This leads to the suggestion that the operons did not derive from a duplication event.
Besides F1FO ATPase, the loss of groEL/ES is remarkable. Native protein folding is conducted by molecular chaperones such as GroEL/ES (Hsp60), DnaK (Hsp70), DnaJ, GrpE, SecB and other heat-shock proteins (Hsp) . GroEL complexes (800 kDa) consisting of two stacked heptameric rings exhibit ATPase activity . The smaller GroES (10 kDa), together with ATP, binds to GroEL and forms the GroEL/GroES complex. DnaK prevents off-pathway reactions or stabilises certain folding intermediates. DnaJ and GrpE act as co-helpers for DnaK . GroEL/ES is probably replaced by the trigger factor (TF) and DnaK, which has already been shown by Kerner et al.  for E. coli or by the HrcA protein, which is commonly found as a part of the heat-shock regulation of bacteria . TF/DnaK and HrcA are encoded in all analysed species of the Acholeplasmataceae (Figure 7). Several Mollicutes are known to have lost groEL and groES, such as Mesoplasma florum, Mycoplasma hyopneumoniae, Ureaplasma parvum serovar 3, Ureaplasma urealyticum, Mycoplasma mobile and some further Mycoplasma spp. . It is likely that there are even more Mollicutes lacking these proteins. Saccardo et al.  suggested, based on draft sequences, that there are four phytoplasma strains of the 16SrIII group that probably lack GroEL/ES. The possibility that this genetic feature can be lost within the Mollicutes is supported by experiments with transposon mutagenesis, showing that GroEL is not or only weakly regulated during heat shock for M. genitalium or M. pneumonia, thereby leading to the suggestion that this chaperone is not essential for Mycoplasma spp. in general and may represent an evolutionary remnant. Evolutionary loss could apparently be possible due to either the fact that GroEL is immunogenic, and therefore it would be advantageous to get rid of it by avoiding an immune response in mammals  – a benefit for A. oculi when infecting mammals – or alternatively Mollicutes possess small genomes which encode few proteins; consequently, they own fewer substrate proteins, which have to be correctly folded by GroEL.
This study demonstrated the efficiency of the SMRT approach in the complete de novo determination of bacterial genomes. A. oculi encodes, like other Acholeplasma spp., rich genetic content in comparison to phytoplasmas. The relatively small core genome of phytoplasmas should be interpreted with respect to their intracellular parasitism and their corresponding poor metabolic repertoire. In contrast, acholeplasmas depend on a richer genetic repertoire due to their widespread distribution and colonisation of diverse micro-habitats. However, for the first time, the deduced protein content of A. oculi highlights that the loss of basic genetic elements, including the chaperone GroEL/ES and the F1FO-type Na+ ATPase system, took place in both genera of the Acholeplasmataceae. One could therefore speculate that the common V-type H+ ATPase system in acholeplasmas may regulate the cellular proton pool, and the V-type Na+ ATPase system may compensate for the lack of the F1FO-type Na+ ATPase. The loss of GroEL/ES is interpreted as being not extraordinary for Mollicutes and seems to have occurred several times within this class.
A. oculi strain 19L isolate was kindly provided by Jerry K. Davis (Purdue University School of Veterinary Medicine, West Lafayette, Ind., USA) from the strain collection of the International Organization for Mycoplasmology (IOM). Cells were cultivated in ATCC® Medium 1039 (http://www.atcc.org) supplemented with 0.2% polymyxin B (Roth, Karlsruhe, Germany) and 0.2% penicillin G (Merck, Darmstadt, Germany) at 28°C for about 14 days and collected by centrifugation (20 min, 10,000 rpm, 4°C). The DNA isolation of A. oculi strain 19L for SBS was performed with the DNeasy Blood & Tissue Kit (Qiagen, Hildesheim, Germany) and according to the manufacturer’s instruction. DNA isolation needed for preparing the PacBio 10-kb library high molecular weight genomic DNA was performed according to Moore et al. .
Sequencing and assembly of SBS data
DNA-Seq libraries were prepared from fragmented DNA (COVARIS S2, Woburn, Massachusetts, USA) according to recommendations made by the supplier (TruSeq DNA sample preparation v2 guide, Illumina, San Diego, CA, USA). Libraries were quantified by fluorometry, immobilised and processed onto a flow cell with a cBot followed by sequencing by synthesis by applying TruSeq v3 chemistry on a HiSeq2500 (all components by Illumina).
The de novo assembly of the reads was performed in CLC Genomics Workbench 7.0 (http://www.clcbio.com). The assembly data was exported as a BAM file, indexed using SAMtools  and imported in Gap5 . Gaps were closed by PCR and primer-walking by applying dye-terminator sequencing performed on an ABI 310 capillary sequencer (Life technologies, Carlsbad, CA, USA).
Sequencing and assembly of SMRT data
A 10-kb library was prepared and processed as recommended by Pacific Biosciences (http://www.smrtcommunity.com/SampleNet/Sample-Prep). Library construction and subsequent sequencing were performed using the SMRTbell Template Preparation Reagent Kit 1.0, DNA/Polymerase binding kit P4-C2, MagBead Kit and DNA Sequencing Kit 2.0 (all components supplied by Pacific Biosciences, Menlo Park CA, USA.). The genome was sequenced using PacBio RS II technology (P4-C2 chemistry). Data collected on the PacBio RS II instrument were processed and filtered (SMRT analysis software, version 2.1). All experiments were conducted according to the manufacturers’ instructions on a single SMRT cell. Obtained data were analysed on the SMRT Portal V2.1.1 (http://www.pacb.com/devnet/) by applying the integrated Celera® Assembler. SMRT sequencing and SBS were performed by the Max Planck-Genome-centre Cologne, Germany (http://mpgc.mpipz.mpg.de/home/).
Identification of sequencing differences comparing both sequencing methods
Rare sequencing differences were identified via BlastN (low complexity filter off, word size 7)  by applying the SMRT-derived genome sequence as a reference. In addition, SBS data were mapped onto the SMRT sequence in CLC Genomics Workbench 7.0, and Primer-BLAST  was used for designing oligonucleotide pairs, thus enabling the PCR amplification of conflict regions (Table 2). Sequences of PCR products were determined by applying dye-terminator sequencing.
Annotation of the genome sequence
The oriC region was determined through the cumulative GC-skew calculation of the chromosome sequence in Artemis  and the determination of the DnaA-boxes . The adjusted genome sequence was automatically annotated in RAST  and annotation was manually curated in Artemis by incorporating additional analyses obtained from the InterProScan database , RNAmmer  and tRNAscan-SE . The annotated genome sequence of A. oculi strain 19L, including information on polymorphisms, was deposited [EMBL:LK028559]). Read data for SBS [EMBL:ERX463488] and SMRT [EMBL:ERX470328] were submitted to the European Nucleotide Archive (http://www.ebi.ac.uk/ena/).
Prediction of orthologous proteins within the Acholeplasmataceaefamily
Orthologous proteins of Acholeplasmataceae were calculated by PanOCT  by applying the default parameters and protein data of A. laidlawii strain PG-8A [Genbank:CP000896.1] , A. brassicae strain O502 [GenBank:FO681348.1] , A. palmae strain J233 [GenBank:FO681347] , ‘Ca. P. australiense’ strain rp-A [GenBank:AM422018.1]  and NZSb11 [GenBank:CP002548.1] , ‘Ca. P. asteris’ strain OY-M [GenBank:AP006628.2]  and AY-WB [GenBank:CP000061.1]  and ‘Ca. P. mali’ strain AT [GenBank:CU469464.1] . The results obtained by the software were also used for the prediction of the pan-, dispensable- and core-genome  of each genera and the family .
Phylogenetic analysis of the Acholeplasmataceae
Alignment and the phylogenetic tree were calculated for 64 16S rRNA genes of the Acholeplasmataceae by using the maximum likelihood algorithm implemented in CLC Genomics Workbench 7.0 and by applying the parameter ‘very accurate’ for the alignment with a gap open cost of 10.0 and a gap extension cost of 1.0. The phylogenetic tree was constructed by using a 1,000 bootstrapped maximum likelihood algorithm, whereby the ‘UPGMA’ construction method and the ‘Jukes Cantor’ nucleotide substitution model were chosen. Mycoplasma genitalium strain G37 was added, in order to root the tree.
Freundt EA, Whitcomb RF, Barile MF, Razin S, Tully JG: Proposal for Elevation of the Family Acholeplasmataceae to Ordinal Rank: Acholeplasmatales. Int J Syst Bacteriol. 1984, 34 (3): 346-349. 10.1099/00207713-34-3-346.
Al-Aubaidi JM, Dardiri AH, Muscoplatt CC, McCauley EH: Identification and characterization of Acholeplasma oculusi spec. nov. from the eyes of goats with keratoconjunctivitis. Cornell Vet. 1973, 63 (1): 117-129.
Kisary J, El-Ebeedy AA, Stipkovits L: Mycoplasma infection of geese. II. Studies on pathogenicity of mycoplasmas in goslings and goose and chicken embryos. Avian Pathol. 1976, 5 (1): 15-20. 10.1080/03079457608418165.
DaMassa AJ, Wakenell PS, Brooks DL: Mycoplasmas of Goats and Sheep. J Vet Diagn Invest. 1992, 4 (1): 101-113. 10.1177/104063879200400126.
Strauss E: Microbiology. Phytoplasma research begins to bloom. Science. 2009, 325 (5939): 388-390. 10.1126/science.325_388.
Chernov VM, Chernova OA, Mouzykantov AA, Baranova NB, Gorshkov OV, Trushin MV, Nesterova TN, Ponomareva AA: Extracellular membrane vesicles and phytopathogenicity of Acholeplasma laidlawii PG8. Scientific World Journal. 2012, 2012: 315474-
Kube M, Siewert C, Migdoll AM, Duduk B, Holz S, Rabus R, Seemuller E, Mitrovic J, Muller I, Buttner C, Reinhardt R: Analysis of the Complete Genomes of Acholeplasma brassicae, A. palmae and A. laidlawii and Their Comparison to the Obligate Parasites from ‘Candidatus Phytoplasma’. J Mol Microbiol Biotechnol. 2014, 24 (1): 19-36. 10.1159/000354322.
Lazarev VN, Levitskii SA, Basovskii YI, Chukin MM, Akopian TA, Vereshchagin VV, Kostrjukova ES, Kovaleva GY, Kazanov MD, Malko DB, Vitreschak AG, Sernova NV, Gelfand MS, Demina IA, Serebryakova MV, Galyamina MA, Vtyurin NN, Rogov SI, Alexeev DG, Ladygina VG, Govorun VM: Complete genome and proteome of Acholeplasma laidlawii. J Bacteriol. 2011, 193 (18): 4943-4953. 10.1128/JB.05059-11.
Tran-Nguyen LT, Kube M, Schneider B, Reinhardt R, Gibb KS: Comparative genome analysis of “Candidatus Phytoplasma australiense” (subgroup tuf-Australia I; rp-A) and “Ca. Phytoplasma asteris” Strains OY-M and AY-WB. J Bacteriol. 2008, 190 (11): 3979-3991. 10.1128/JB.01301-07.
Andersen MT, Liefting LW, Havukkala I, Beever RE: Comparison of the complete genome sequence of two closely related isolates of ‘Candidatus Phytoplasma australiense’ reveals genome plasticity. BMC Genomics. 2013, 14: 529-10.1186/1471-2164-14-529.
Oshima K, Kakizawa S, Nishigawa H, Jung HY, Wei W, Suzuki S, Arashida R, Nakata D, Miyata S, Ugaki M, Namba S: Reductive evolution suggested from the complete genome sequence of a plant-pathogenic phytoplasma. Nat Genet. 2004, 36 (1): 27-29. 10.1038/ng1277.
Bai X, Zhang J, Ewing A, Miller SA, Jancso Radek A, Shevchenko DV, Tsukerman K, Walunas T, Lapidus A, Campbell JW, Hogenhout SA: Living with genome instability: the adaptation of phytoplasmas to diverse environments of their insect and plant hosts. J Bacteriol. 2006, 188 (10): 3682-3696. 10.1128/JB.188.10.3682-3696.2006.
Kube M, Schneider B, Kuhl H, Dandekar T, Heitmann K, Migdoll AM, Reinhardt R, Seemuller E: The linear chromosome of the plant-pathogenic mycoplasma ‘Candidatus Phytoplasma mali’. BMC Genomics. 2008, 9: 306-10.1186/1471-2164-9-306.
Volokhov DV, Neverov AA, George J, Kong H, Liu SX, Anderson C, Davidson MK, Chizhikov V: Genetic analysis of housekeeping genes of members of the genus Acholeplasma: phylogeny and complementary molecular markers to the 16S rRNA gene. Mol Phylogenet Evol. 2007, 44 (2): 699-710. 10.1016/j.ympev.2006.12.001.
Wei W, Davis RE, Jomantiene R, Zhao Y: Ancient, recurrent phage attacks and recombination shaped dynamic sequence-variable mosaics at the root of phytoplasma genome evolution. Proc Natl Acad Sci U S A. 2008, 105 (33): 11827-11832. 10.1073/pnas.0805237105.
Jomantiene R, Zhao Y, Davis RE: Sequence-variable mosaics: composites of recurrent transposition characterizing the genomes of phylogenetically diverse phytoplasmas. DNA Cell Biol. 2007, 26 (8): 557-564. 10.1089/dna.2007.0610.
Bai X, Correa VR, Toruno TY, el Ammar D, Kamoun S, Hogenhout SA: AY-WB phytoplasma secretes a protein that targets plant cell nuclei. Mol Plant Microbe Interact. 2009, 22 (1): 18-30. 10.1094/MPMI-22-1-0018.
MacLean AM, Hogenhout SA: Phytoplasmas and Spiroplasmas: The Phytopathogenic Mollicutes of the Phloem. Phloem: Molecular Cell Biology, Systemic Communication, Biotic Interaction. Edited by: Thompson GA, van Bel AJE. 2013, Iowa: Wiley-Blackwell, 305-
Bai X, Ammar ED, Hogenhout SA: A secreted effector protein of AY-WB phytoplasma accumulates in nuclei and alters gene expression of host plant cells, and is detected in various issues of the leafhopper Macrosteles quadrilineatus. Bull Insectol. 2007, 60 (2): 217-218.
Siewert C, Luge T, Duduk B, Seemuller E, Buttner C, Sauer S, Kube M: Analysis of Expressed Genes of the Bacterium ‘Candidatus Phytoplasma mali’ Highlights Key Features of Virulence and Metabolism. PLoS One. 2014, 9 (4): e94391-10.1371/journal.pone.0094391.
Kube M, Mitrovic J, Duduk B, Rabus R, Seemuller E: Current view on phytoplasma genomes and encoded metabolism. Scientific World Journal. 2012, 2012: 185942-
Maniloff J, Kampo GJ, Dascher CC: Sequence analysis of a unique temperature phage: mycoplasma virus L2. Gene. 1994, 141 (1): 1-8. 10.1016/0378-1119(94)90120-1.
Sanders DA, Staines AG, McMahon SA, McNeil MR, Whitfield C, Naismith JH: UDP-galactopyranose mutase has a novel structure and mechanism. Nat Struct Biol. 2001, 8 (10): 858-863. 10.1038/nsb1001-858.
Waters LS, Sandoval M, Storz G: The Escherichia coli MntR miniregulon includes genes encoding a small protein and an efflux pump required for manganese homeostasis. J Bacteriol. 2011, 193 (21): 5887-5897. 10.1128/JB.05872-11.
Kempf B, Bremer E: A novel amidohydrolase gene from Bacillus subtilis cloning: DNA-sequence analysis and map position of amhX. FEMS Microbiol Lett. 1996, 141 (2–3): 129-137.
Bartel B, Fink GR: ILR1, an amidohydrolase that releases active indole-3-acetic acid from conjugates. Science. 1995, 268 (5218): 1745-1748. 10.1126/science.7792599.
Brown DR, Bradbury JM, Johansson K-E: Family I. Acholeplasmataceae. Bergey’s Manual of Systematic Bacteriology: The Bacteroidetes, Spirochaetes, Tenericutes (Mollicutes), Acidobacteria, Fibrobacteres, Fusobacteria, Dictyoglomi, Gemmatimonadetes, Lentisphaerae, Verrucomicrobia, Chlamydiae, and Planctomycetes, Volume 4. Edited by: Goodfellow M, Kämpfer P, Busse H-J, Trujillo ME, Suzuki KI, Ludwig W, Whitman WB. 2010, New York: Springer, 688-689. 2
Kneidinger B, Marolda C, Graninger M, Zamyatina A, McArthur F, Kosma P, Valvano MA, Messner P: Biosynthesis pathway of ADP-L-glycero-beta-D-manno-heptose in Escherichia coli. J Bacteriol. 2002, 184 (2): 363-369. 10.1128/JB.184.2.363-369.2002.
Mayberry WR, Smith PF, Langworthy TA: Heptose-containing pentaglycosyl diglyceride among the lipids of Acholeplasma modicum. J Bacteriol. 1974, 118 (3): 898-904.
Alaminos M, Ramos JL: The methionine biosynthetic pathway from homoserine in Pseudomonas putida involves the metW, metX, metZ, metH and metE gene products. Arch Microbiol. 2001, 176 (1–2): 151-154.
Wild J, Altman E, Yura T, Gross CA: DnaK and DnaJ heat shock proteins participate in protein export in Escherichia coli. Genes Dev. 1992, 6 (7): 1165-1172. 10.1101/gad.6.7.1165.
Bochkareva ES, Solovieva ME, Girshovich AS: Targeting of GroEL to SecA on the cytoplasmic membrane of Escherichia coli. Proc Natl Acad Sci U S A. 1998, 95 (2): 478-483. 10.1073/pnas.95.2.478.
Saccardo F, Martini M, Palmano S, Ermacora P, Scortichini M, Loi N, Firrao G: Genome drafts of four phytoplasma strains of the ribosomal group 16SrIII. Microbiology. 2012, 158 (Pt 11): 2805-2814.
Mitrovic J, Siewert C, Duduk B, Hecht J, Molling K, Broecker F, Beyerlein P, Buttner C, Bertaccini A, Kube M: Generation and analysis of draft sequences of ‘stolbur’ phytoplasma from multiple displacement amplification templates. J Mol Microbiol Biotechnol. 2014, 24 (1): 1-11. 10.1159/000353904.
Murata T, Yamato I, Kakinuma Y, Shirouzu M, Walker JE, Yokoyama S, Iwata S: Ion binding and selectivity of the rotor ring of the Na + -transporting V-ATPase. Proc Natl Acad Sci U S A. 2008, 105 (25): 8607-8612. 10.1073/pnas.0800992105.
Dzioba J, Hase CC, Gosink K, Galperin MY, Dibrov P: Experimental verification of a sequence-based prediction: F1F0-type ATPase of Vibrio cholerae transports protons, not Na+ ions. J Bacteriol. 2003, 185 (2): 674-678. 10.1128/JB.185.2.674-678.2003.
Mulkidjanian AY, Galperin MY, Makarova KS, Wolf YI, Koonin EV: Evolutionary primacy of sodium bioenergetics. Biol Direct. 2008, 3: 13-10.1186/1745-6150-3-13.
Mulkidjanian AY, Makarova KS, Galperin MY, Koonin EV: Inventing the dynamo machine: the evolution of the F-type and V-type ATPases. Nat Rev Microbiol. 2007, 5 (11): 892-899. 10.1038/nrmicro1767.
Deckers-Hebestreit G, Altendorf K: The F0F1-type ATP synthases of bacteria: structure and function of the F0 complex. Annu Rev Microbiol. 1996, 50: 791-824. 10.1146/annurev.micro.50.1.791.
Mahajan S, Lewis RN, George R, Sykes BD, McElhaney RN: Characterization of sodium transport in Acholeplasma laidlawii B cells and in lipid vesicles containing purified A. laidlawii (Na+-Mg2+)-ATPase by using nuclear magnetic resonance spectroscopy and 22Na tracer techniques. J Bacteriol. 1988, 170 (12): 5739-5746.
Wild J, Rossmeissl P, Walter WA, Gross CA: Involvement of the DnaK-DnaJ-GrpE chaperone team in protein secretion in Escherichia coli. J Bacteriol. 1996, 178 (12): 3608-3613.
Braig K, Otwinowski Z, Hegde R, Boisvert DC, Joachimiak A, Horwich AL, Sigler PB: The crystal structure of the bacterial chaperonin GroEL at 2.8 A. Nature. 1994, 371 (6498): 578-586. 10.1038/371578a0.
Kerner MJ, Naylor DJ, Ishihama Y, Maier T, Chang HC, Stines AP, Georgopoulos C, Frishman D, Hayer-Hartl M, Mann M, Hartl FU: Proteome-wide analysis of chaperonin-dependent protein folding in Escherichia coli. Cell. 2005, 122 (2): 209-220. 10.1016/j.cell.2005.05.028.
Clark GW, Tillier ER: Loss and gain of GroEL in the Mollicutes. Biochem Cell Biol. 2010, 88 (2): 185-194. 10.1139/O09-157.
Hutchison CA, Peterson SN, Gill SR, Cline RT, White O, Fraser CM, Smith HO, Venter JC: Global transposon mutagenesis and a minimal Mycoplasma genome. Science. 1999, 286 (5447): 2165-2169. 10.1126/science.286.5447.2165.
Moore AM, Patel S, Forsberg KJ, Wang B, Bentley G, Razia Y, Qin X, Tarr PI, Dantas G: Pediatric fecal microbiota harbor diverse and novel antibiotic resistance genes. PLoS One. 2013, 8 (11): e78822-10.1371/journal.pone.0078822.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Genome Project Data Processing S: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25 (16): 2078-2079. 10.1093/bioinformatics/btp352.
Bonfield JK, Whitwham A: Gap5–editing the billion fragment sequence assembly. Bioinformatics. 2010, 26 (14): 1699-1703. 10.1093/bioinformatics/btq268.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410. 10.1016/S0022-2836(05)80360-2.
Ye J, Coulouris G, Zaretskaya I, Cutcutache I, Rozen S, Madden TL: Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinformatics. 2012, 13: 134-10.1186/1471-2105-13-134.
Carver T, Berriman M, Tivey A, Patel C, Bohme U, Barrell BG, Parkhill J, Rajandream MA: Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. Bioinformatics. 2008, 24 (23): 2672-2676. 10.1093/bioinformatics/btn529.
Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O: The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008, 9: 75-10.1186/1471-2164-9-75.
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R: InterProScan: protein domains identifier. Nucleic Acids Res. 2005, 33 (Web Server issue): W116-W120.
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW: RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007, 35 (9): 3100-3108. 10.1093/nar/gkm160.
Lowe TM, Eddy SR: tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997, 25 (5): 955-964. 10.1093/nar/25.5.0955.
Fouts DE, Brinkac L, Beck E, Inman J, Sutton G: PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species. Nucleic Acids Res. 2012, 40 (22): e172-10.1093/nar/gks757.
Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R: The microbial pan-genome. Curr Opin Genet Dev. 2005, 15 (6): 589-594. 10.1016/j.gde.2005.09.006.
Kahlke T, Goesmann A, Hjerde E, Willassen NP, Haugen P: Unique core genomes of the bacterial family vibrionaceae: insights into niche adaptation and speciation. BMC Genomics. 2012, 13: 179-10.1186/1471-2164-13-179.
The German Research Foundation (DFG), through projects KU 2679/2-1 and BU 890/21-1, the German Ministry for Education and Research (BMBF, RNomics no. 01GS0805), the Ministry of Education and Science (Republic of Serbia, Grant No. TR31043) and the German Academic Exchange Service (DAAD/BMBF), through project 56266384, supported this work.
The authors declare that they have no competing interests.
Conceived and designed experiments: CS RR MK. Performed experiments: CS BH RR MK. Analysed data: CS BH RR WRH BD MK. Contributed reagents/materials/analysis tools: CS RR CB MK. Wrote the paper: CS WRH BH BD CB MK. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Number of shared proteins for each species of the Acholeplasmataceae predicted by PanOCT. Acholeplasmas are highlighted in orange and phytoplasmas in green. Predicted orthologs in phytoplasmas and acholeplasmas are highlighted in blue. The highest number of shared proteins between acholeplasmas and phytoplasmas is underlined. Abbreviations: Acholeplasma oculi, Aocu; A. laidlawii, Alai; A. palmae, Apal; A. brassicae, Abra; ‘Candidatus Phytoplasma mali’ strain AT, Pmal; ‘Ca. P. australiense’ strain rp-A, Paus; ‘Ca. P. australiense’ strain NZSb11, SLY; ‘Ca. P. asteris’ strain OY-M, OY-M; ‘Ca. P. asteris’ strain AY-WB, AY-WB. (DOCX 112 KB)
Additional file 2: Listed orthologous proteins of the Acholeplasmataceae and ‘unique’ proteins of Acholeplasma and ‘ Candidatus Phytoplasma’. The pan analysis led to a core of 243 proteins for the family Acholeplasmataceae, to 402 ‘unique’ proteins for the genus Acholeplasma and to 14 ‘unique’ proteins for the genus ‘Ca. Phytoplasma’, respectively. Results were predicted by PanOCT analysis. The five completely sequenced phytoplasmas are highlighted in green, and the four completely sequenced acholeplasmas are highlighted in orange. (XLSX 102 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Siewert, C., Hess, W.R., Duduk, B. et al. Complete genome determination and analysis of Acholeplasma oculi strain 19L, highlighting the loss of basic genetic features in the Acholeplasmataceae. BMC Genomics 15, 931 (2014). https://doi.org/10.1186/1471-2164-15-931