Research article | Open | Published:
Genome sequence of the pattern forming Paenibacillus vortex bacterium reveals potential for thriving in complex environments
BMC Genomicsvolume 11, Article number: 710 (2010)
The pattern-forming bacterium Paenibacillus vortex is notable for its advanced social behavior, which is reflected in development of colonies with highly intricate architectures. Prior to this study, only two other Paenibacillus species (Paenibacillus sp. JDR-2 and Paenibacillus larvae) have been sequenced. However, no genomic data is available on the Paenibacillus species with pattern-forming and complex social motility. Here we report the de novo genome sequence of this Gram-positive, soil-dwelling, sporulating bacterium.
The complete P. vortex genome was sequenced by a hybrid approach using 454 Life Sciences and Illumina, achieving a total of 289× coverage, with 99.8% sequence identity between the two methods. The sequencing results were validated using a custom designed Agilent microarray expression chip which represented the coding and the non-coding regions. Analysis of the P. vortex genome revealed 6,437 open reading frames (ORFs) and 73 non-coding RNA genes. Comparative genomic analysis with 500 complete bacterial genomes revealed exceptionally high number of two-component system (TCS) genes, transcription factors (TFs), transport and defense related genes. Additionally, we have identified genes involved in the production of antimicrobial compounds and extracellular degrading enzymes.
These findings suggest that P. vortex has advanced faculties to perceive and react to a wide range of signaling molecules and environmental conditions, which could be associated with its ability to reconfigure and replicate complex colony architectures. Additionally, P. vortex is likely to serve as a rich source of genes important for agricultural, medical and industrial applications and it has the potential to advance the study of social microbiology within Gram-positive bacteria.
Paenibacillus vortex strain V453  is a bacterial species discovered in the early 90's . It is a social microorganism that forms colonies with remarkably complex and dynamic architectures (Figure 1) [2–4]. The genus Paenibacillus, including P. vortex, was originally considered a part of the genus Bacillus but was later reclassified as a separate genus in 1993 . These facultative anaerobic, spore-forming bacteria are found in a variety of heterogeneous environments, such as soil, rhizosphere, insect larvae, and clinical samples [6–9].
To face the challenges posed by these environments, Paenibacillus spp. produce a wealth of enzymes and proteases as well as a great variety of antimicrobial substances that affect a wide range of microorganisms [10–12]. The possession of these advanced defensive and offensive strategies render the Paenibacillus spp. bacteria as a rich source of useful genes for agricultural, medical, industrial applications. Despite this potential, genome sequencing of Paenibacillus spp. to date is limited and is currently available only for two species Paenibacillus larvae and Paenibacillus sp. JDR-2.
A successful behavioral strategy utilized by some Paenibacillus spp. is to cooperatively form and develop large and intricately organized colonies of 109-1012 cells. Being part of a large cooperative, the bacteria can better compete for food resources and be protected against antibacterial assaults [3, 13]. Two of the most fascinating pattern-forming Paenibacillus spp. bacteria, are P. vortex[3, 14] and P. dendritiformis[3, 15]. Under laboratory growth conditions, these bacteria can develop, like other social bacteria, colonies that behave much like a multi-cellular organism, with cell differentiation and task distribution [16–19] (see also Additional file 1 section I).
P. vortex possesses advanced social motility employing cell-cell attractive and repulsive chemotactic signaling and physical links (Additional file 1 section I). When grown on soft surfaces, the collective motility is reflected by the formation of foraging swarms  that act as arms sent out in search for food (Additional file 1 section I and Additional files 2, 3, 4, 5, 6). These swarms have an aversion to crossing each other's trail and collectively change direction when food is sensed. The swarms can even split and reunite when detecting scattered patches of nutrients .
When grown on hard surfaces, P. vortex generates special aggregates of dense bacteria that are pushed forward by repulsive chemotactic signals sent from the cells at the back (see Additional file 1 section I). These rotating aggregates (termed vortices), are similar to the rotating bacteria groups generated by Paenibacillus alvei and Bacillus circulans, pave the way for the colony to expand. The vortices serve as building blocks of colonies with special modular organization (Figure 1 and Additional file 1 section I).
Accomplishing such intricate cooperative ventures requires sophisticated cell-cell communication [3, 19, 22–24]. Communicating with each other, bacteria exchange information regarding population size, a myriad of individual environmental measurements at different locations, their internal states and their phenotypic and epigenetic adjustments . The bacteria collectively sense the environment and execute distributed information processing to glean and assess relevant information [3, 19, 25]. Next, the bacteria respond accordingly, by reshaping the colony while redistributing tasks and cell differentiations, and turning on defense and offense mechanisms [3, 16–19, 25, 26], thus achieving better adaptability to heterogeneous environments . Such collective, decentralized, adaptive decision making is a form of swarm intelligence, a term originally derived from cybernetics but applicable to some aspects of colonial organisms including ants, birds, humans and bacteria [27–29]. In terms of collective social behaviour, P. vortex has been studied extensively at the level of mathematical modeling [3, 30–32] and now requires a sequenced genome to connect this approach with the underlying genetics.
Comparative genomic analysis revealed that bacteria successful in heterogeneous and competitive environments often contain extensive signal transduction and regulatory networks [33–35]. It is likely that advanced social behavior  and elevated collective adaptability  are underpinned by a highly developed signal transduction system consisting of modular domains forming a network of sensors, transducers and responders [34, 36, 37].
In this report we present the de novo genome sequence of the P. vortex, which was obtained by utilizing a hybrid deep-sequencing approach using 454 and Illumina techniques [38, 39]. We further performed detailed comparative genomic analysis with a dataset of 500 complete bacterial genomes to discover P. vortex unique properties. The results revealed that P. vortex has one of the highest number of signal transduction genes among all the Gram-positive bacteria in the dataset. Only two other Gram-positive bacteria strains, the Paenibacillus sp. JDR-2 and the Geobacillus sp. Y412MC10, have more TCS genes. These two species and P. vortex have equal normalized combined score of TCS, TFs, transport and defense related genes (see material and methods), which is significantly higher than the combined score of all other bacteria in the data set.
The analysis also unveiled genes required for competition over resources (e.g. iron, amino acids and sugars), for producing offensive compounds (antibiotics and lytic enzymes) and for defense (resistance to antibiotics and other toxins). These genes can support traits needed for thriving in the heterogeneous and highly competitive environments.
Sequencing of the P. vortex genome
De novo assembly of the P. vortex genome was obtained using the two leading deep-sequencing technologies: Roche 454 Genome Sequencer (GS 20)  and Illumina Genome Analyzer (GA) . Using the Roche 454 and the Illumina GA technologies, 19× coverage of single reads and 270× total average coverage of single and paired-end mapped reads was produced respectively (Table 1). The reads from each technology were first assembled separately and then joined into a hybrid assembly to improve scaffold size and quality (Additional file 1 section II). The hybrid assembly (Additional file 1 Figure S6) contains 56 scaffolds totaled 6,385,925 bp with N50 scaffold size of 213,399 bp and largest scaffold of 699,613 bp. Notably, the contigs from the two technologies could be joined easily as no miss-assemblies were detected between the two sets of contigs. The first version of the Whole Genome Shotgun project described in this paper has been deposited at [GenBank: ADHJ00000000].
Assembly accuracy and completeness
To estimate the accuracy and the completeness of the hybrid assembly we performed detailed comparison between the 454 and the Illumina contigs. The results show that the 454 contigs covered 99.93% of the hybrid assembly with an average distance between contigs comprising the hybrid scaffolds of -5 bp and total 890 bp missing from the hybrid assembly (Additional file 1 Figure S6 B, C, D). The Illumina contigs covered 99.81% of the hybrid assembly with average distance between contigs of -10 bp and missing total 4,500 bp (Additional file 1 Table S3). The overall sequence identity between the two technologies was 99.8%. These results and the fact that there were no miss-assemblies demonstrate that although the P. vortex assembly is in several contigs, it provides complete genome coverage and with an extremely high accuracy (Additional file 1 section II).
To obtain a putative order of the P. vortex scaffolds, we used Geobacillus. sp. Y412MC10 genome [Refseq: NC_013406] as a reference and ordered the P. vortex scaffolds accordingly. Our preliminary genomic comparison identified Geobacillus sp. Y412MC10 as the closest bacteria with a complete genome to P. vortex. The identification was based on phylogenetic analysis of 16 S rDNA placing the Geobacillus sp. Y412MC10 within the P. vortex clade (Figure 2A) and further supported by genomic clustering of Cluster of Orthologous Groups (COG) profiles [42, 43] (Figure 2B). BLASTn comparison results of P. vortex genome vs. Geobacillus sp. Y412MC10, revealed that 2/3 of the P. vortex genome could be matched to Geobacillus sp. Y412MC10 with an average sequence identity of 86.69% over a mean alignment length of 783 bp (Additional file 1 Figure S7).
General genome statistics
The P. vortex genome is composed of a circular chromosome (6,385,925 bp) with an average G+C content of 48.7% (Figure 3). A total of 6,437 open reading frames (ORFs) were identified covering 86% of the P. vortex genome (Table 2). Among the predicted ORFs, 4,475 (70%) were assigned with a putative function, whereas 1,962 (30%) were identified as hypothetical proteins. We identified 73 non-coding RNA genes and 54 tRNA genes predicted to incorporate 18 amino acids into polypeptides. The tentative location of the origin of replication (ORI) was identified based on its proximity to dnaA gene, known to serve as a transcription initiator protein .
We have identified several types of repetitive sequences: 184 global repeats (sequence that is present in at least two copies in two different locations), 32 local inverted repeats and 231 tandem repeats within the P. vortex genome (Figure 3) (for methods see Additional file 1 section VII). Such sequences were suggested to play an important functional role in genome plasticity , by means of homologous recombination (HR), horizontal transfer or transposition in the genome [46–48]. HR has relevant roles in DNA repair, chromosome segregation and generation of genetic variation. Crossover events might produce genome rearrangements, such as deletions, leading to the loss of all genetic information in that region or duplications which could increase the amount of genetic information . Additionally, repeats located within regulatory regions might constitute an on/off switch of gene expression at the transcriptional level . Similarly, repeats located within coding regions can induce a premature ending of translation when a mutation changes the number of repeats . However, detailed mechanisms and functions of most repeats are still unknown.
Repetitive sequences are the major reason for the difficulty we encountered in finishing the genome assembly into a complete sequence. Analysis of the scaffold ends (100 bp of each end) revealed that 78% of them have repetitive sequences that are on average 37 bp long and could be mapped on average onto 5 different scaffold ends.
We note that some regions in the P. vortex genome have an extremely high coverage (see areas marked in blue, second circle, Figure 3). Although, the assembly algorithms tend to collapse the highly identical repetitive sequences into one copy, high coverage in that specific area might serve as a signature for identifying regions present in several copy numbers in the genome . For example, the ribosomal unit (16 S, 23 S and 5S) has approximately 5 times higher coverage than the average, suggesting that this unit appears approximately 5 times in the P. vortex genome. Interestingly, the Geobacillus sp. Y412MC10 has 8 copies of the ribosomal unit.
Functional validation by custom microarray
We used specially designed Agilent custom microarray submitted to EMBL-EBI [ArrayExpress: E-MEXP-3019] to validate the annotation. The microarray (Additional file 1 section IV) includes 105,000 oligos of 60 bp long, which corresponds to all the predicted ORFs and the intergenic regions.
Hybridization of the genomic DNA validated 91,324 probes (88%) of the total designed probes and no missed regions were found (see Additional file 1 section IV for more details). Hybridization of the pooled RNA from different growth conditions confirmed 4,701 (73%) of the predicted ORFs. The remaining 1,736 (27%) ORFs were not detectable under the tested conditions. Out of those, 1,064 ORFs have an assigned putative function and 672 are hypothetical. Hybridization of predicted 73 non-coding RNAs located within the intergenic regions, confirmed 43 (58%).
We performed detailed comparative analysis between the P. vortex genome and a set of 500 complete bacterial genomes of 2-10 Mbp (Additional file 7). Bacterial genomes available with draft sequence were not included in the analysis. Specifically, we focused on a reduced set of 261 genomes with genome size of 4-8 Mbp (closer to the P. vortex genome size) and a subset of 50 soil bacteria genomes within this group (Additional file 8). The comparison was done with regard to four gene systems which are related to complex bacterial lifestyle and adaptability to fluctuating environments: two-component systems, transcription factors, defense mechanisms and transport systems.
Two-component system (TCS)
Using Pfam motifs  we identified a total of 210 TCS related genes in the P. vortex genome; 103 response regulators (RRs), 97 histidine-kinases (HKs) and 10 hybrid kinases. The number of TCS genes was linear with genome size in agreement with . Among the 500 bacterial genomes, P. vortex was at the upper 1% of the population (Figure 4A, Additional file 1 Figure S12), along with two Gram-positive bacteria strains Paenibacillus sp. JDR-2 (7.08 Mbp) and Geobacillus sp. Y412MC10 (7.12 Mbp) and two Gram-negative bacteria strains, the predator myxobacterium M. xanthus (9.13 Mbp) and the cyanobacterium N. punctiforme PCC 73102 (9.05 Mbp). Our results show that similarly to the absolute gene numbers, the relative gene numbers of the tested categories in P. vortex genome is also significantly higher compared to the rest of the 500 genomes (Additional file 1 Figure S11).
Structural classification of the P. vortex RRs according to previously proposed scheme  revealed relatively high number of 37 OmpR family and 30 AraC family DNA-binding response regulators. Class organization of the P. vortex TCS proteins as described in  revealed 150 HK-RR paired, 32 orphaned (isolated) and 21 in complex gene clusters (for more details see Additional file 1 section VII). Neighborhood analysis of the TCS surrounding genes revealed that 101 (30%) are transport related genes, 46 (12.6%) have regulatory functions (mainly consist of transcription factors), and 35 (9.6%) belong to the energy metabolism category (mainly employing biosynthesis and degradation of polysaccharides).
Transcription Factors (TFs)
Using the method described in , we identified a total of 411 TFs in P. vortex genome, which placed it at the upper 5% of the 500 bacteria set (Figure 4C). This number is considerably higher than the average 158 ± 111 TFs among the 500 bacterial genomes and higher than the average 208 ± 92 TFs among the subset of 261 genomes with size 4-8 Mbp sizes. Among the subset of 50 soil bacteria genomes, only two strains, Paenibacillus sp. JDR-2 (7.08 Mbp) and Delftia acidovorans SPH-1 (6.76 Mbp) have a higher number of TFs genes. We note that an overall linear dependence between the TFs and the genome size was found (Figure 4C).
P. vortex encodes an extensive set of 700 transport related genes. Among the 500 bacterial genomes, P. vortex was at the upper 1% of the population (Figure 4B), along with additional five strains Rhizobium leguminobarum bv. viciae (7.75 Mbp), Geobacillus sp. Y412MC10 (7.12 Mbp), Paenibacillus sp. JDR-2 (7.08 Mbp), Sinorhizobium meliloti 1021 (6.7 Mbp) and Sinorhizobium medicae WSM419 (6.8 Mbp). About a third, 258 (35%) of the genes are involved in carbohydrate transport, 42 genes encode components of iron transporters, 23 genes encode components of amino acid transporters and 39 genes encode components of oligo/dipeptide transporters. The latter could be used as nutrient sources, as well as signal molecules regulating bacterial development, virulence, and conjugal plasmid transfer .
The P. vortex genome contains 138 genes related to resistance against inhibitory substances such as antibiotics, copper, aluminium, arsenic and toxic anions (Figure 4D). The proximity of TCS genes to ABC transporters is known to form specific and efficient detoxification units . Out of the 138 genes, 90 are transporter-encoding genes. Non-transport related genes include antibiotic resistance encoding genes such as penicillin binding proteins, beta-lactames, chloramphenicol posphotransferases/acetyltransferases, vanZ and vanW glycopeptide antibiotics resistance genes. Apart from Streptomyces griseus NBRC 13350 (8.54 Mbp), P. vortex harbors the highest number of defense related genes among the 500 analyzed genomes. Additionally, P. vortex has the highest number of these genes compared to the subset of 261 genomes with a 4-8 Mbp genome size (the average for this subset is 60 ± 20).
The combined score
When compiling the four indices into a combined score, P. vortex and two other Gram-positive bacteria strains, the Paenibacillus sp. JDR-2 and the Geobacillus sp. Y412MC10 stand out among the 500 genomes in the dataset (Figure 4E). These two species and P. vortex have equal normalized combined score (Figure 4F), which is significantly higher than the combined score of all other bacteria in the dataset.
Motility and Chemotaxis
Upon growth on semi-solid surfaces P. vortex exhibits at least one form of swarming motility, a flagellum-driven social form of surface locomotion [57–60]. In Figure 5A, we show that propagating P. vortex swarms can collectively change direction towards organic matter added to an agar plate, and can even split and reunite when detecting scattered patches of food (see Additional file 1 section I for more details). We previously showed, using flagellar staining and light microcopy that swarming P. vortex was peritrichously flagellated (2 to 8 flagella per μm of cell length, 25 to 30 nm wide and > 5 μm long) . These results are in agreement with the dimensions measured by scanning electron microscopy (Figure 5B, C, D). Flagellar motility genes were indentified within the P. vortex genome. These genes are located within five different loci, two of which contain the majority of the genes and are 8.4 kb and 27.1 kb long (Figure 6 and Additional file 1 Figure S13).
Social motility could also be powered by the extension and retraction of type IV pili [61, 62]. P. vortex genome contains several pili-related genes such as pilZ, pilT, flp pilus assembly protein and prepilin type IV. However, we could not identify all the genes known to be involved in biogenesis and motility of type IV pili [63–65]. Furthermore, the fastest known rate of type IV pili related movement does not exceed 50 μm/min [66, 67], whereas, P. vortex has an average movement rate of 300 μm/min (data not shown).
Previous studies suggest that the vortices are formed by attractive interaction between swarming cells which can be mediated via attractive chemotactic signaling and/or physical links . The P. vortex genome contains several chemotaxis related genes, including the cheA, cheB, cheC, cheD, cheW and cheY. Many of the chemotaxis genes are located within the large motility loci (Figure 6 and Additional file 1 Figure S14). Additional 16 MCP (methyl-accepting chemotaxis) genes were found in other locations along the genome.
Sporulation and competence
Formation of spores and uptake of foreign DNA represent an important aspect of bacterial survival strategies. P. vortex genome encodes an extensive set of 153 genes responsible for sporulation including cell division, engulfment, cortex and coat synthesis, maturation and germination (Additional file 9). The identified sporulation genes included one of the conserved PFAM domains [53, 68], TIGR domains , COG categories  or KEGG pathways  associated with sporulation (Additional file 10).
Although, 9 competence-related genes such as comEA, comer and comEC were identified, they represent only a small portion of the complete competence pathway [71–73]. Additionally, we did not identify homologous genes that belong to the Rap system, which plays an important role in the cell decision-making between sporulation and competence [74, 75]. It is therefore possible that the common pathway described for sporulation and competence in other Gram-positive bacteria  is different in P. vortex.
Clusters of Multifunctional Enzymes-Secondary Metabolites
Non-ribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs) are large multi-domain proteins that catalyze the biosynthesis of small molecules with potent biological activity. These molecules, which are mainly produced by bacteria and fungi, often serve as "chemical weapons" against neighboring organisms . Due to their antifungal and antibacterial activities these compounds are also used for medical purposes in the pharmaceutical industry. The PKS genes for a certain polyketide are usually organized in one operon in bacteria and in gene clusters in eukaryotes. We identified 13 PKS, 9 NRPS and 14 PKS genes in the P. vortex genome, which were arranged in the following clusters: (i) a 43 kb PKS gene cluster, which is comprised of 13 PKSs involved in polyketide synthesis (Figure 7A); (ii) cluster of 15 kb encoding 9 NRPS, which might be involved in siderophore production (Figure 7B) similar to the bacillibactin siderophore produced by Bacillus amylolyquefaciens and Bacillus subtilis, and (iii) a hybrid 14 PKS/NRPS gene cluster of 23 kb involved in the production of bacitracin-like antibiotics (Figure 7C).
Antagonistic effects of bacteria directed against competing organisms could also result from the enzymatic activity of extracellular degrading enzymes. Seven chitinase and four 1,3-beta glucanase encoding genes were identified in the P. vortex genome. These enzymes are involved in degradation of polysaccharide components of the fungal cell wall . Direct tests showed that the P. vortex can significantly inhibit the growth of Verticillium dahlia, a fungal plant pathogen causing vascular wilt diseases in a broad range of host plants. This plant pathogen is distributed in soil worldwide and is of major threat to agriculture crop production, especially in temperate areas of the world . P. vortex, which was inoculated six days after V. dahliae inoculation, to allow the establishment of healthy fungal colonies, was able to significantly inhibit the growth of V. dahliae (Figure 8A, B). In the first six days, the diameter of V. dahliae colonies was identical for all treatment and control plates (~1.5 cm). During 15 days post inoculation of the P. vortex, V. dahliae colonies grew only ~1.2 cm in diameter, compared to control colonies which grew 2.5 times faster (~2.9 cm in diameter) (Figure 8C).
Whole-genome shotgun pyrosequencing has proved remarkably useful for the large-scale sequencing of bacterial genomes [81–83]. High-quality de novo assemblies can be obtained with relatively few errors and gaps when the sequence read coverage redundancy is 15-fold or greater. Closing all the gaps in each genome sequence is time-consuming and costly; therefore, in the near future there will be an excess of draft bacterial sequences versus closed genomes in public databases.
This study presents a de novo assembly of the P. vortex genome utilizing a hybrid deep-sequencing strategy using a Roche 454 Genome Sequencer (GS 20) and an Illumina Genome Analyzer. The use of the two next-generation leading technologies and the combination of the results into a hybrid assembly overcame the drawbacks of each technology and resulted in longer scaffolds. We demonstrated that the sequence identity between the two methods was 99.88%, reflecting the low error rate of both sequences. The genome sequence, the predicted transcripts and the non-coding RNAs were further validated by hybridization to custom microarray.
Notably, even when using several algorithms and an extremely high coverage, the data could not be assembled into a single sequence. Analysis of the ends of contigs revealed that the unassembled contigs have small repetitive sequences at their ends. The existence of high number of repetitive sequences is a generic obstacle that tempers the ability of the assembly algorithms to generate a single version of the complete genome, and more so when working with short reads. It has been shown that sequence repeats have a functional role that can contribute to genomic plasticity which allows rapid adaptation to environmental changes .
P. vortex was originally isolated from colonies of B. subtilis, soil bacteria commonly found in the rhizosphere [84, 85]. The Rhizosphere is characterized by large environmental fluctuations, which act as a selecting force determining the diversity of the microbial community [86–89]. The features identified in the genome of P. vortex suggest that these bacteria can lead a successful lifestyle in the highly competitive environment of the rhizosphere as well as serve as an efficient plant beneficial rhizobacteria (PBR). PBR competitively colonize plant roots and can simultaneously act as biofertilizers and as antagonists (biopesticides) of recognized root pathogens .
Comparative genomics and comparative network biology are emerging as key tools in understanding of how bacteria respond cooperatively to challenging complex environments. In particular, it was previously suggested that bacteria successful in heterogeneous and competitive environments often contain extensive signal transduction and regulatory networks [25, 34, 91]. These observations, and the fact that signal transduction networks afford intracellular information processing , led to the notion that the number and fraction of signal transduction genes can be used as a measure of the "Bacteria IQ" [34, 91]. Detailed comparative genomic analysis revealed that the P. vortex's genome and the genome of the Gram-negative, social and predatory bacterium M. xanthus have exceptionally high number of TCS genes, supporting the notion that they are required for advanced social behavior.
The P. vortex species is marked by its complex spatial organization of the colony, with the bacteria forming different patterns to better cope with the environment [3, 4, 14, 93]. Pattern-formation and self-organization in microbial systems is an intriguing phenomenon that might also provide insights into the evolutionary development of the concerted action of cells in higher organisms . Therefore, sequencing of the P. vortex genome paves the way to understanding of regulatory processes involved in cell-cell communication and colonial patterning and more generally, to understanding of cooperative bacterial response to changing environmental conditions. Such information should facilitate increased exploitation of Paenibacillus spp. in industrial, agricultural and medical fields, as well as help us comprehend the evolutionary development of multicellular organisms.
The P. vortex genome was sequenced using a hybrid deep-sequencing approach resulting in an estimated genome size of 6.3 Mb. A total of 6,437 ORFs were identified and 73% of them confirmed using specially designed Agilent custom microarray chip. The results of the two sequencing methods were compared resulting in 99.88% sequence identity, reflecting low error rate of both sequences. The use of the two next-generation leading technologies and the combination of the results into a hybrid assembly overcame the drawbacks of each technology and resulted in longer scaffolds.
Comparative genomics analysis with 500 complete bacterial genomes revealed that P. vortex has one of the highest number of TCS genes among all the Gram-positive bacteria in the dataset. High numbers of TCS genes were also found in the genome of the social predator M. xanthus, supporting the notion that they are required for advanced social behavior. M. xanthus serves as an important Gram-negative bacterial model for the study of multicellularity in prokaryotes . Similarly, P. vortex may have the potential to provide significant insights on cell-cell interactions, pattern formation and social behavior in Gram-positive bacteria. Additionally, P. vortex encodes an extensive set of TFs, transport and defense related genes. These findings suggest that P. vortex has a highly developed signal transduction system and that these genes can support traits needed for thriving in heterogeneous, fluctuating and highly competitive environments.
The genome sequence of P. vortex provides the basis for understanding of social organization and pattern formation within Gram-positive bacteria. P. vortex is the first sequenced Paenibacillus species reported to show these properties and this work supports the development of genetic approaches to the study of prokaryotic multicellularity and multi-agent decision making (swarm intelligence). Furthermore, this organism is likely to become a valuable resource for exploitation within biotechnology.
P. vortex DNA was prepared at two separate times for the 454 and Illumina sequencing runs following the standard Roche and Illumina protocols respectively. P. vortex was grown in Luria-Bertani (LB) medium, at 37°C with shaking (200 rpm) over night. DNA was extracted from 2 ml cell culture (109/ml), using Qiagen, DNeasy Blood and Tissue Kit, according to the manufacture's protocol with the following modifications; cells were incubated with Lysosyme for 45 minutes prior extraction. Elution from Qiagen column was performed with 200 μl buffer AE (10 mM Tris-HCl, 0.5 mM EDTA pH 9.0).
We used a hybrid sequencing approach that incorporates 454 pyrosequencing with Illumina Genome Analyzer. Sequencing by both methods was performed in compliance with manufacturer's instructions Roche and Illumina accordingly.
The 454 reads were assembled using Newbler Assembler  version number 1.0.53. To obtain optimized results for the assembly of Illumina short reads we tested several algorithms (Additional file 1 Table S2), but eventually selected Velvet . Velvet's algorithm handled single and paired-end reads and produced contigs with highest sequence identity of 99.88% to those produced by the 454. Algorithms used to assemble short reads are Velvet 0.7.28, Edena 2.1.1 and Euler-SR 1.0. Velvet algorithm was used with parameter hash length of 31, insert length of 250 and minimum contig length 50. Edena algorithm was used with a minimum overlap parameter of 23. The final step included the assembly of the Newbler (454) and Velvet (Illumina) contigs using Minimus 2.0.5 .
The DNA sequence was run through JCVI's prokaryotic annotation pipeline (JCVI Annotation Service), which includes gene finding by Glimmer, Blast-extend-repraze (BER) searches, HMM searches, TMHMM searches, SignalP predictions, and automatic annotations from AutoAnnotate. Additionally, the DNA sequence was annotated using NCBI Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP) and the combined annotation was submitted to [GenBank: ADHJ00000000].
Phylogenetic analysis of 16S
The construction of the phylogenetic tree of 22 taxa was based on 16 S rRNA sequences downloaded in fasta format from DNA Data Bank of Japan (DDBJ) ftp://ftp.ddbj.nig.ac.jp/ddbj_database/16S/. The alignment of the chosen sequences was performed using ClustaX  and the construction of the phylegenetic tree using Neighbor-Joining algorithm . The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) was also calculated . The evolutionary distances were computed using the Maximum Composite Likelihood method  utilizing Mega 4 software .
Identification of Two-Component System and Transcription Factor genes
The approach used to identify putative TCS and TF genes utilized HMM (Hidden Markov Model) profiles found in Pfam database of protein families http://pfam.sanger.ac.uk/. TCS genes were identified similarly to that previously described by  and  and TF genes were identified as described by . The compiled list of Pfam domains that was used to identify TCS and TFs is presented in Additional file 11 and 12 respectfully. Additional methods description is included in Additional file 1 section VII.
Identification of Transporters and Defense related genes
To identify putative transport and defense related genes we utilized Cluster of Orthologous Groups (COG) profiles [42, 43]. The compiled list of COG profiles that were used to identify transport and defense related genes is presented in Additional file 13 and 14 respectfully.
The combined score was calculated as an average of the standard deviation (stdev) of two-component system, transcription factor, transport and defense genes for the dataset of 500 bacterial genomes. The combined score was calculated both as normalized and non-normalized to genome size.
Experiment procedure of P. vortex effect on Verticillium dahlia
V. dahliae was grown on trypsin soy agar plates (TSA), at 28°C. A 10 day old stock plate was used to initiate the experiments as follows: A startup slice of 0.5 mm diameter was cut from the colony edge and placed on a fresh TSA plate. The fungal slice was positioned 1 cm away from the center of a 9 mm Petri dish. Plates were incubated untill V. dahliae colonies reached 1.5 cm diameter (6 days incubation). At this time-point an overnight P. vortex culture, grown in LB, 28°C, with shaking (200 rpm), was inoculated in a 6 cm long line, horizontal to V. dahliae. P. vortex was positioned 2.5 cm away from the V. dahliae colony center. V. dahliae colonies without the inoculation of P. vortex served as control. All tests were carried out in triplicate.
Submission to the international collection deposits
Isolate P. vortex sp. nov. V453 was deposited at the Bacillus Genetic Stock Center (BGSC), Columbus, OH, USA, as strain 31A2T and at the Belgium Coordinated Collection of Microorganisms (BCCM/LMG) as strain LMG 25955.
Open Reading Frames
Cluster of Orthologous Groups
Non-Ribosomal Peptide Synthetase
Plant Beneficial Rhizobacteria
Hidden Markov Model
Sirota-Madi A, Brainis I, Ingham C, Helman Y, Gutnick DL, Ben-Jacob E: Paenibacillus vortex sp. nov.: proposal for a new pattern-forming species with advanced collective motility and complex colony organization. IJSEM.
Ben-Jacob E, Shochet O, Tenenbaum A, Avidan O: Evolution of complexity during growth of bacterial colonies. NATO Advanced Research Workshop; Santa Fe, USA. Edited by: Cladis PE, Palffy-Muhorey P. 1995, Addison-Wesley Publishing Company, 619-633.
Ben-Jacob E: Bacterial self-organization: co-enhancement of complexification and adaptability in a dynamic environment. Phil Trans R Soc Lond A. 2003, 361: 1283-1312. 10.1098/rsta.2003.1199.
Ben-Jacob E, Cohen I, Gutnick DL: Cooperative organization of bacterial colonies: from genotype to morphotype. Annu Rev Microbiol. 1998, 52: 779-806. 10.1146/annurev.micro.52.1.779.
Ash C, Priest FG, Collins MD: Molecular identification of rRNA group 3 bacilli (Ash, Farrow, Wallbanks and Collins) using a PCR probe test. Proposal for the creation of a new genus Paenibacillus. Antonie Van Leeuwenhoek. 1993, 64: 253-260. 10.1007/BF00873085.
Lal S, Tabacchioni S: Ecology and biotechnological potential of Paenibacillus polymyxa: a minireview. Indian J Microbiol. 2009, 49: 2-10. 10.1007/s12088-009-0008-y.
McSpadden Gardener BB: Ecology of Bacillus and Paenibacillus spp. in Agricultural Systems. Phytopathology. 2004, 94: 1252-1258. 10.1094/PHYTO.2004.94.11.1252.
Montes MJ, Mercade E, Bozal N, Guinea J: Paenibacillus antarcticus sp. nov., a novel psychrotolerant organism from the Antarctic environment. Int J Syst Evol Microbiol. 2004, 54: 1521-1526. 10.1099/ijs.0.63078-0.
Ouyang J, Pei Z, Lutwick L, Dalal S, Yang L, Cassai N, Sandhu K, Hanna B, Wieczorek RL, Bluth M, Pincus MR: Case report: Paenibacillus thiaminolyticus: a new cause of human infection, inducing bacteremia in a patient on hemodialysis. Ann Clin Lab Sci. 2008, 38: 393-400.
Konishi J, Maruhashi K: 2-(2'-Hydroxyphenyl)benzene sulfinate desulfinase from the thermophilic desulfurizing bacterium Paenibacillus sp. strain A11-2: purification and characterization. Appl Microbiol Biotechnol. 2003, 62: 356-361. 10.1007/s00253-003-1331-6.
Raza W, Yang W, Shen QR: Paenibacillus polymyxa: Antibiotics, Hydrolytic Enzymes and Hazard Assessment. J Plant Pathol. 2008, 90: 419-430.
Watanapokasin RY, Boonyakamol A, Sukseree S, Krajarng A, Sophonnithiprasert T, Kanso S, Imai T: Hydrogen production and anaerobic decolorization of wastewater containing Reactive Blue 4 by a bacterial consortium of Salmonella subterranea and Paenibacillus polymyxa. Biodegradation. 2009, 20: 411-418. 10.1007/s10532-008-9232-0.
Shapiro JA: The significances of bacterial colony patterns. Bioessays. 1995, 17: 597-607. 10.1002/bies.950170706.
Ingham CJ, Ben-Jacob E: Swarming and complex pattern formation in Paenibacillus vortex studied by imaging and tracking cells. BMC Microbiol. 2008, 8: 36-10.1186/1471-2180-8-36.
Ben-Jacob E, Schochet O, Tenenbaum A, Cohen I, Czirok A, Vicsek T: Generic modelling of cooperative growth patterns in bacterial colonies. Nature. 1994, 368: 46-49. 10.1038/368046a0.
Aguilar C, Vlamakis H, Losick R, Kolter R: Thinking about Bacillus subtilis as a multicellular organism. Curr Opin Microbiol. 2007, 10: 638-643. 10.1016/j.mib.2007.09.006.
Dunny GM, Brickman TJ, Dworkin M: Multicellular behavior in bacteria: communication, cooperation, competition and cheating. Bioessays. 2008, 30: 296-298. 10.1002/bies.20740.
Shapiro JA, Dworkin M: Bacteria as multicellular organisms. 1997, Oxford University Press, USA, 1
Ben-Jacob E, Becker I, Shapira Y, Levine H: Bacterial linguistic communication and social intelligence. Trends Microbiol. 2004, 12: 366-372. 10.1016/j.tim.2004.06.006.
Cohen I, Ron I, Ben-Jacob E: From branching to nebula patterning during colonial development of the Paenibacillus alvei bacteria. Physica A. 2000, 286: 321-336. 10.1016/S0378-4371(00)00335-6.
Komoto A, Hanaki K, Maenosono S, Wakano JY, Yamaguchi Y, Yamamoto K: Growth dynamics of Bacillus circulans colony. J Theor Biol. 2003, 225: 91-97. 10.1016/S0022-5193(03)00224-8.
Bassler BL, Losick R: Bacterially speaking. Cell. 2006, 125: 237-246. 10.1016/j.cell.2006.04.001.
Bischofs IB, Hug JA, Liu AW, Wolf DM, Arkin AP: Complexity in bacterial cell-cell communication: quorum signal integration and subpopulation signaling in the Bacillus subtilis phosphorelay. Proc Natl Acad Sci USA. 2009, 106: 6459-6464. 10.1073/pnas.0810878106.
Kolter R, Greenberg EP: Microbial sciences: the superficial life of microbes. Nature. 2006, 441: 300-302. 10.1038/441300a.
Dwyer DJ, Kohanski MA, Collins JJ: Networking opportunities for bacteria. Cell. 2008, 135: 1153-1156. 10.1016/j.cell.2008.12.016.
Wolf DM, Fontaine-Bodin L, Bischofs I, Price G, Keasling J, Arkin AP: Memory in microbes: quantifying history-dependent behavior in a bacterium. PLoS One. 2008, 3: e1700-10.1371/journal.pone.0001700.
Ben Jacob E: The cybernetic genome. Physica A. 1998, 249: 407-414. 10.1016/S0378-4371(97)00500-1.
Bonabeau E, Dorigo M, Theraulaz G: Swarm intelligence: from natural to artificial systems. 1999, New York: Oxford University Press
Taylor RG, Welch RD: Chemotaxis as an emergent property of a swarm. J Bacteriol. 2008, 190: 6811-6816. 10.1128/JB.00662-08.
Ben-Jacob E, Cohen I, Czirók A, Vicsek T, Gutnick DL: Chemomodulation of cellular movement, collective formation of vortices by swarming bacteria, and colonial development. Physica A. 1997, 238: 181-197. 10.1016/S0378-4371(96)00457-8.
Ben-Jacob E, Cohen I, Levine H: Cooperative self-organization of microorganisms. Adv Phys. 2000, 49: 395-554. 10.1080/000187300405228.
Czirok A, Ben-Jacob E, Cohen II, Vicsek T: Formation of complex bacterial colonies via self-generated vortices. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 1996, 54: 1791-1801.
Alon U: An Introduction to Systems Biology: Design Principles of Biological circuits. 2006, London, UK: CRC Press
Galperin MY, Gomelsky M: Bacterial Signal Transduction Modules: from Genomics to Biology. ASM News. 2005, 71: 326-333.
Whitworth DE, Cock PJ: Two-component systems of the myxobacteria: structure, diversity and evolutionary relationships. Microbiology. 2008, 154: 360-372. 10.1099/mic.0.2007/013672-0.
Hellingwerf KJ: Bacterial observations: a rudimentary form of intelligence?. Trends Microbiol. 2005, 13: 152-158. 10.1016/j.tim.2005.02.001.
Mascher T, Helmann JD, Unden G: Stimulus perception in bacterial signal-transducing histidine kinases. Microbiol Mol Biol Rev. 2006, 70: 910-938. 10.1128/MMBR.00020-06.
MacLean D, Jones JD, Studholme DJ: Application of 'next-generation' sequencing technologies to microbial genetics. Nat Rev Microbiol. 2009, 7: 287-296.
Mardis ER: The impact of next-generation sequencing technology on genetics. Trends Genet. 2008, 24: 133-141.
Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, et al: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437: 376-380.
Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, Hall KP, Evers DJ, Barnes CL, Bignell HR, et al: Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008, 456: 53-59. 10.1038/nature07517.
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, et al: The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003, 4: 41-10.1186/1471-2105-4-41.
Tatusov RL, Koonin EV, Lipman DJ: A genomic perspective on protein families. Science. 1997, 278: 631-637. 10.1126/science.278.5338.631.
Moriya S, Kato K, Yoshikawa H, Ogasawara N: Isolation of a dnaA mutant of Bacillus subtilis defective in initiation of replication: amount of DnaA protein determines cells' initiation potential. Embo J. 1990, 9: 2905-2910.
Aras RA, Kang J, Tschumi AI, Harasaki Y, Blaser MJ: Extensive repetitive DNA facilitates prokaryotic genome plasticity. Proc Natl Acad Sci USA. 2003, 100: 13579-13584. 10.1073/pnas.1735481100.
Bennett PM: Genome plasticity: insertion sequence elements, transposons and integrons, and DNA rearrangement. Methods Mol Biol. 2004, 266: 71-113.
Ochman H, Lawrence JG, Groisman EA: Lateral gene transfer and the nature of bacterial innovation. Nature. 2000, 405: 299-304. 10.1038/35012500.
Rocha EP, Blanchard A: Genomic repeats, genome plasticity and the dynamics of Mycoplasma evolution. Nucleic Acids Res. 2002, 30: 2031-2042. 10.1093/nar/30.9.2031.
Romero D, Martinez-Salazar J, Ortiz E, Rodriguez C, Valencia-Morales E: Repeated sequences in bacterial chromosomes and plasmids: a glimpse from sequenced genomes. Res Microbiol. 1999, 150: 735-743. 10.1016/S0923-2508(99)00119-9.
van Ham SM, van Alphen L, Mooi FR, van Putten JP: Phase variation of H. influenzae fimbriae: transcriptional control of two divergent genes through a variable combined promoter region. Cell. 1993, 73: 1187-1196. 10.1016/0092-8674(93)90647-9.
Henderson IR, Owen P, Nataro JP: Molecular switches--the ON and OFF of bacterial phase variation. Mol Microbiol. 1999, 33: 919-932. 10.1046/j.1365-2958.1999.01555.x.
Medvedev P, Stanciu M, Brudno M: Computational methods for discovering structural variation with next-generation sequencing. Nat Methods. 2009, 6: S13-20. 10.1038/nmeth.1374.
Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz HR, Ceric G, Forslund K, Eddy SR, Sonnhammer EL, Bateman A: The Pfam protein families database. Nucleic Acids Res. 2008, 36: D281-288. 10.1093/nar/gkm960.
Galperin MY: Structural classification of bacterial response regulators: diversity of output domains and domain combinations. J Bacteriol. 2006, 188: 4169-4182. 10.1128/JB.01887-05.
Wilson D, Charoensawan V, Kummerfeld SK, Teichmann SA: DBD--taxonomically broad transcription factor predictions: new content and functionality. Nucleic Acids Res. 2008, 36: D88-92. 10.1093/nar/gkm964.
Camilli A, Bassler BL: Bacterial small-molecule signaling pathways. Science. 2006, 311: 1113-1116. 10.1126/science.1121357.
Fraser GM, Hughes C: Swarming motility. Curr Opin Microbiol. 1999, 2: 630-635. 10.1016/S1369-5274(99)00033-8.
Ghelardi E, Celandroni F, Salvetti S, Beecher DJ, Gominet M, Lereclus D, Wong AC, Senesi S: Requirement of flhA for swarming differentiation, flagellin export, and secretion of virulence-associated proteins in Bacillus thuringiensis. J Bacteriol. 2002, 184: 6424-6433. 10.1128/JB.184.23.6424-6433.2002.
Macfarlane S, Hopkins MJ, Macfarlane GT: Toxin synthesis and mucin breakdown are related to swarming phenomenon in Clostridium septicum. Infect Immun. 2001, 69: 1120-1126. 10.1128/IAI.69.2.1120-1126.2001.
Senesi S, Celandroni F, Salvetti S, Beecher DJ, Wong AC, Ghelardi E: Swarming motility in Bacillus cereus and characterization of a fliY mutant impaired in swarm cell differentiation. Microbiology. 2002, 148: 1785-1794.
Li Y, Sun H, Ma X, Lu A, Lux R, Zusman D, Shi W: Extracellular polysaccharides mediate pilus retraction during social motility of Myxococcus xanthus. Proc Natl Acad Sci USA. 2003, 100: 5443-5448. 10.1073/pnas.0836639100.
Sun H, Zusman DR, Shi W: Type IV pilus of Myxococcus xanthus is a motility apparatus controlled by the frz chemosensory system. Curr Biol. 2000, 10: 1143-1146. 10.1016/S0960-9822(00)00705-3.
Mattick JS: Type IV pili and twitching motility. Annu Rev Microbiol. 2002, 56: 289-314. 10.1146/annurev.micro.56.012302.160938.
Proft T, Baker EN: Pili in Gram-negative and Gram-positive bacteria-structure, assembly and their role in disease. Cell Mol Life Sci. 2009, 66: 613-635. 10.1007/s00018-008-8477-4.
Varga JJ, Nguyen V, O'Brien DK, Rodgers K, Walker RA, Melville SB: Type IV pili-dependent gliding motility in the Gram-positive pathogen Clostridium perfringens and other Clostridia. Mol Microbiol. 2006, 62: 680-694. 10.1111/j.1365-2958.2006.05414.x.
Harshey RM: Bacterial motility on a surface: many ways to a common goal. Annu Rev Microbiol. 2003, 57: 249-273. 10.1146/annurev.micro.57.030502.091014.
Skerker JM, Berg HC: Direct observation of extension and retraction of type IV pili. Proc Natl Acad Sci USA. 2001, 98: 6901-6904. 10.1073/pnas.121171698.
Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, et al: The Pfam protein families database. Nucleic Acids Res. 2004, 32: D138-141. 10.1093/nar/gkh121.
Haft DH, Selengut JD, White O: The TIGRFAMs database of protein families. Nucleic Acids Res. 2003, 31: 371-373. 10.1093/nar/gkg128.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006, 34: D354-357. 10.1093/nar/gkj102.
Kovacs AT, Smits WK, Mironczuk AM, Kuipers OP: Ubiquitous late competence genes in Bacillus species indicate the presence of functional DNA uptake machineries. Environ Microbiol. 2009, 11: 1911-1922. 10.1111/j.1462-2920.2009.01937.x.
Spizizen J: Transformation of Biochemically Deficient Strains of Bacillus Subtilis by Deoxyribonucleate. Proc Natl Acad Sci USA. 1958, 44: 1072-1078. 10.1073/pnas.44.10.1072.
van Sinderen D, Venema G: comK acts as an autoregulatory control switch in the signal transduction route to competence in Bacillus subtilis. J Bacteriol. 1994, 176: 5762-5770.
Piggot PJ, Losick R: Bacillus subtilis and its Closest Relatives. Genes to Cells. Edited by: Sonenshein L, Losick R, Hoch JA. 2002, Washington DC: American Society for Microbiology, 483-517.
Sonenshein AL: Control of sporulation initiation in Bacillus subtilis. Curr Opin Microbiol. 2000, 3: 561-566. 10.1016/S1369-5274(00)00141-7.
Schultz D, Wolynes PG, Ben-Jacob E, Onuchic JN: Deciding fate in adverse times: sporulation and competence in Bacillus subtilis. Proc Natl Acad Sci USA. 2009, 106: 21027-21034. 10.1073/pnas.0912185106.
Lautru S, Challis GL: Substrate recognition by nonribosomal peptide synthetase multi-enzymes. Microbiology. 2004, 150: 1629-1636. 10.1099/mic.0.26837-0.
Chen XH, Koumoutsi A, Scholz R, Borriss R: More than anticipated-production of antibiotics and other secondary metabolites by Bacillus amyloliquefaciens FZB42. J Mol Microbiol Biotechnol. 2009, 16: 14-24. 10.1159/000142891.
El-Katatny MH, Gudelj M, Robra KH, Elnaghy MA, Gubitz GM: Characterization of a chitinase and an endo-beta-1,3-glucanase from Trichoderma harzianum Rifai T24 involved in control of the phytopathogen Sclerotium rolfsii. Appl Microbiol Biotechnol. 2001, 56: 137-143. 10.1007/s002530100646.
Bhat RG, Subbarao KV: Host Range Specificity in Verticillium dahliae. Phytopathology. 1999, 89: 1218-1225. 10.1094/PHYTO.1922.214.171.1248.
Almeida NF, Yan S, Lindeberg M, Studholme DJ, Schneider DJ, Condon B, Liu H, Viana CJ, Warren A, Evans C, et al: A draft genome sequence of Pseudomonas syringae pv. tomato T1 reveals a type III effector repertoire significantly divergent from that of Pseudomonas syringae pv. tomato DC3000. Mol Plant Microbe Interact. 2009, 22: 52-62. 10.1094/MPMI-22-1-0052.
Aury JM, Cruaud C, Barbe V, Rogier O, Mangenot S, Samson G, Poulain J, Anthouard V, Scarpelli C, Artiguenave F, Wincker P: High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies. BMC Genomics. 2008, 9: 603-10.1186/1471-2164-9-603.
Snyder LA, Loman N, Pallen MJ, Penn CW: Next-generation sequencing--the promise and perils of charting the great microbial unknown. Microb Ecol. 2009, 57: 1-3. 10.1007/s00248-008-9465-9.
Pandey A, Palni LM: Bacillus species: the dominant bacteria of the rhizosphere of established tea bushes. Microbiol Res. 1997, 152: 359-365.
Juhnke ME, Mathre DE, Sands DC: Identification and Characterization of Rhizosphere-Competent Bacteria of Wheat. Appl Environ Microbiol. 1987, 53: 2793-2799.
Hinsinger P: Structure and function of the rhizosphere: mechanisms at the soil-root interface. Ol Corps Gras, Lipides. 1998, 5: 340-341.
Hinsinger P, Bengough AG, Vetterlein D, Young IM: Rhizosphere: biophysics, biogeochemistry and ecological relevance. Plant and Soil. 2009, 321: 117-152. 10.1007/s11104-008-9885-9.
Hinsinger P, Plassard C, Jaillard B: Rhizosphere: A new frontier for soil biogeochemistry. J Geochem Explor. 2006, 88: 210-213. 10.1016/j.gexplo.2005.08.041.
Jones DL, Hinsinger P: The rhizosphere: complex by design. Plant and Soil. 2008, 312: 1-6. 10.1007/s11104-008-9774-2.
Bloemberg GV, Lugtenberg BJ: Molecular basis of plant growth promotion and biocontrol by rhizobacteria. Curr Opin Plant Biol. 2001, 4: 343-350. 10.1016/S1369-5266(00)00183-7.
Galperin MY: A census of membrane-bound and intracellular signal transduction proteins in bacteria: bacterial IQ, extroverts and introverts. BMC Microbiol. 2005, 5: 35-10.1186/1471-2180-5-35.
Velicer GJ, Yu YT: Evolution of novel cooperative swarming in the bacterium Myxococcus xanthus. Nature. 2003, 425: 75-78. 10.1038/nature01908.
Ben-Jacob E: From snowflake formation to growth of bacterial colonies II: Cooperative formation of complex colonial patterns. Contem Phys. 1997, 38: 205-241. 10.1080/001075197182405.
Kaiser D: Building a multicellular organism. Annu Rev Genet. 2001, 35: 103-123. 10.1146/annurev.genet.35.102401.090145.
Zerbino DR, Birney E: Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008, 18: 821-829. 10.1101/gr.074492.107.
Sommer DD, Delcher AL, Salzberg SL, Pop M: Minimus: a fast, lightweight genome assembler. BMC Bioinformatics. 2007, 8: 64-10.1186/1471-2105-8-64.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25: 4876-4882. 10.1093/nar/25.24.4876.
Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4: 406-425.
Felsenstein J: Confidence-Limits on Phylogenies - an Approach Using the Bootstrap. Evolution. 1985, 39: 783-791. 10.2307/2408678.
Tamura K, Nei M, Kumar S: Prospects for inferring very large phylogenies by using the neighbor-joining method. Proc Natl Acad Sci USA. 2004, 101: 11030-11035. 10.1073/pnas.0404206101.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.
Lavin JL, Kiil K, Resano O, Ussery DW, Oguiza JA: Comparative genomic analysis of two-component regulatory proteins in Pseudomonas syringae. BMC Genomics. 2007, 8: 397-10.1186/1471-2164-8-397.
Cock PJ, Whitworth DE: Evolution of prokaryotic two-component system signaling pathways: gene fusions and fissions. Mol Biol Evol. 2007, 24: 2355-2357. 10.1093/molbev/msm170.
We are thankful to the contribution of the late Vladimir Alexandrovich Drachev during early stage of this research effort. We thank JCVI for providing the JCVI Annotation Service. We thank Relly Foler from DYN-GS. This research has been supported by the Tauber Family Foundation and the Maguy-Glass Chair in Physics of Complex systems at Tel Aviv University and by the National Science Foundation Grants PHY-0216576 and 0225630 at UCSD.
The authors declare that they have no competing interests.
ASM, TO, YH, CI, DLG, DL and EBJ were involved in study design. ASM, TO, YH, CI, IB, DR, EH, RCM, DL, EBJ performed the experiments. IB, RCM and SBZ contributed reagents, materials and analysis tools. ASM, TO, YH, CI, LB, DLE, VG, VK, RCM, DL, EBJ were involved in data analysis. ASM, TO, YH, CI, EBJ wrote the paper. All authors read and approved the final manuscript.
Alexandra Sirota-Madi, Tsviya Olender, Yael Helman contributed equally to this work.