Comparative genomics of the family Vibrionaceae reveals the wide distribution of genes encoding virulence-associated proteins
© Lilburn et al; licensee BioMed Central Ltd. 2010
Received: 19 February 2010
Accepted: 10 June 2010
Published: 10 June 2010
Species of the family Vibrionaceae are ubiquitous in marine environments. Several of these species are important pathogens of humans and marine species. Evidence indicates that genetic exchange plays an important role in the emergence of new pathogenic strains within this family. Data from the sequenced genomes of strains in this family could show how the genes encoded by all these strains, known as the pangenome, are distributed. Information about the core, accessory and panproteome of this family can show how, for example, genes encoding virulence-associated proteins are distributed and help us understand how virulence emerges.
We deduced the complete set of orthologs for eleven strains from this family. The core proteome consists of 1,882 orthologous groups, which is 28% of the 6,629 orthologous groups in this family. There were 4,411 accessory orthologous groups (i.e., proteins that occurred in from 2 to 10 proteomes) and 5,584 unique proteins (encoded once on only one of the eleven genomes). Proteins that have been associated with virulence in V. cholerae were widely distributed across the eleven genomes, but the majority was found only on the genomes of the two V. cholerae strains examined.
The proteomes are reflective of the differing evolutionary trajectories followed by different strains to similar phenotypes. The composition of the proteomes supports the notion that genetic exchange among species of the Vibrionaceae is widespread and that this exchange aids these species in adapting to their environments.
Genomic comparisons among multiple strains of the same species have revealed that the overlap in gene content of any two strains is not complete, that is, the genomic resources of a species are not represented by the genome sequence of a single strain. In 2005 Tettelin et al. showed that the number of unique genes in seven genome sequences from Streptococcus agalacticae, which was termed the pangenome, far exceeded the number of genes found in any one strain and that this pangenomic repertoire would increase by at least 30 genes for every new genome of this species that was sequenced. Furthermore, it appeared that this increase would go on indefinitely . The "infinite genome" phenomenon was not universal, for example, sequencing more than four genomes of Bacillus anthracis does not add any more genes to its pangenome. A terminology for classification of the genes found on sets of related genomes has been developed. As mentioned above the set of genes from all the genomes is called the pangenome (and the set of all encoded proteins is called the panproteome). The subset of these genes that is found on all the genomes is called the core genome and the set of genes that is found on more than one but not all genomes is called the accessory (or distributed) genome.
One of the original motivations for characterizing the pangenome was to understand what constitutes a bacterial species, but the value of such studies extends to understanding adaptation and evolution in the prokaryotes. Several studies have looked at pathogenic species and their non-pathogenic relatives in an effort to discover, for example, which genes might be unique to the pathogen and therefore drive pathogenesis (for example, see [2–6]. Few studies have looked at non-pathogenic relatives that lie outside the genus of the pathogen. Here we define a panproteome for the Vibrionaceae, a family containing several well-known human pathogens and species that play important roles in the marine ecosystem as nutrient cyclers, as partners in symbioses and as pathogens of fish and shellfish .
The most recent edition of Bergey's Manual of Systematic Bacteriology , divides the family Vibrionaceae into three genera encompassing 51 species. Since the appearance of that volume, several species of Vibrio have been moved into a new genus, Aliivibrio, and in 2006 Thompson and Swings estimated that the family included over 80 species . This family provides a unique framework for examining the emergence of pathogenesis and the causes of virulence because of the combination of taxa it contains. Genome sequences from this family represent three species of human pathogen each with a different modality of infection and clinical manifestation: Vibrio cholerae, V. vulnificus and V. parahaemolyticus. Two of the genomes are from strains of V. cholerae that are both agents of pandemic cholera (strains O395 and N16961). The biotype represented by N16961, El Tor, is a so-called seventh pandemic strain. The El Tor biotype recently supplanted the sixth pandemic strains (represented by the classical biotype O395 strain) as the primary cause of pandemic cholera. Two more genomes represent V. vulnificus (strains CMCP6 and YJ016). This organism causes septicemia and the infections are rapidly fatal in persons having high levels of iron in their serum. A fifth genome represents V. parahaemolyticus, which causes gastroenteritis. Infection is usually associated with the ingestion of raw or undercooked seafood. Three more genomes represent strains pathogenic to marine organisms. V. harveyi has been identified as a pathogen of coral and shrimp , V. splendidus is a pathogen of oysters, mussels and scallops  and Aliivibrio salmonicida is the causative agent of Hitra disease in salmonid species . Marine pathogens from the Vibrionaceae can have a serious impact on aquaculture operations . The remaining three genomes are all non-pathogenic strains. Two genomes represent A. fischeri (strains ES114 and MJ11), a species that, like other Vibrionaceae, forms commensal relationships with fish and squid . ES114 was isolated from squid, where it colonizes a specialized light organ, while MJ11 was isolated from a fish . Finally, the eleventh genome represents a species that is not known to be either pathogenic or host-associated - Photobacterium profundum strain SS9. This species is notable for its ability to thrive at high pressure .
Although notable for the characteristics mentioned in the previous paragraph, many members of the Vibrionaceae can be isolated from more than one niche; often they are free-living as well as associated with one or more hosts. For example, V. cholerae is free-living, but can form biofilms on the exoskeletons of marine invertebrates such as shrimp, or colonize the trophozoites of amoeba [7, 16], or, of course, it can survive and grow in the human intestine. Moving between these niches requires physiological adaptability and it is clear that this adaptability involves the sharing of genetic material among strains. For example, although V. cholerae is known as a disease-causing bacterium, only about 0.6% of the V. cholerae strains that can be detected in the environment are capable of producing the cholera toxin . Similarly, a survey by Rahman et al. indicated that only a relatively small fraction of V. cholerae strains in the environment have all the genes needed for the pandemic strain phenotype, but relatively large numbers of strains have some of the genes that drive this phenotype . For example, 79% of the environmental strains and all the clinical strains examined in their study carried the hyl A gene, which encodes hemolysin, an accessory virulence factor. One of the most common carbon sources in the marine environment, chitin, also induces the uptake of extracellular DNA by V. cholerae, adding support to the notion that genetic exchange is part of the adaptive strategy of the Vibrionaceae [19, 20] and Udden et al. have demonstrated the transfer of the CTX phage genome from a non-toxigenic environmental strain to an O1 El Tor strain in the presence of chitin . Gene flow within the species V. cholerae was recently examined by Chun et al. who found that the pangenome contains 2,432 orthologs shared by all 23 V. cholerae strains examined and 6,953 non-redundant genes in total . They identified 12 lineages and propose that horizontal gene transfer largely drives diversification. A recent report indicating that a more virulent form of cholera is emerging and is caused by a recombinant O1 El Tor strain  underscores the need to explore the nature of the genetic pool available to the Vibrionaceae beyond a single species or genus. This knowledge will help us to better understand how and perhaps when, such new threats can arise. Here, we define and explore the panproteome of the family Vibrionaceae.
Within the context of the family-level panproteome, we examine how virulence-associated proteins from V. cholerae N16961 are partitioned among members of the family. We find that many of the proteins are widely distributed across the family, leading us to hypothesize that members of this family represent a resource for the genetic diversity of V. cholerae and the other pathogens. We also find genes that are unique to V. cholerae and examine their evolutionary history. Using these approaches we can better understand the evolutionary forces driving the emergence of pathogenic strains. On a practical level, in order to control or eliminate pathogenic strains, the genomic repertoire available to a species must be defined so that any measures devised towards these ends are effective against all the strains in a species .
Results and Discussion
The core, accessory, and panproteome of the Vibrionaceae
Coding sequences, orthologous proteins, and their frequency in the 11 Vibrionaceae genomes used in this study
No. genes in genome
No. orthologous groups in strain
% CDS found in core genome
% CDS found in orthologous group
Orthologous groups unique to:
Vibrio cholerae El Tor N16961
V. cholerae O395
V. parahaemolyticus RIMD 2210633
V. vulnificus CMCP6
V. vulnificus YJ016
V. harveyi ATCC® BAA-1116™
V. splendidus LGP32
Aliivibrio fischeri MJ11
A. fischeri ES114
A. salmonicida LFI1238
Photobacterium profundum SS9
Functional classification of orthologous groups
Of the accessory orthologs, less than 24% could be classified successfully. This is to be expected since members of this set of proteins are taxon-specific and therefore more likely to be little known. Relative to the core proteome, the accessory proteome was enriched for only one COG class, T (Signal transduction mechanisms). Signal transduction enables each strain to sense and adapt to its environment. One class of such proteins, the methyl accepting chemotaxis proteins (MCPs), is known to be over-represented in the proteomes of the Vibrionaceae. The Microbial Signaling Database  contains information on 777 prokaryotes and the median number of MCPs per strain is three; if we consider only the 485 prokaryotes that have at least one MCP, the median number per strain is 12; in the Vibrionaceae the number of MCPs per genome ranges from 17 to 52.
We did not attempt to place the single copy CDS into COG groups, and only one of the 336 orthologous groups that were strain-specific could be placed in a COG class. When the annotations for proteins unique to V. cholerae N16961, which is the best studied of the strains in the Vibrionaceae, were retrieved from NCBI, only about 5% of the unique CDS had been placed in a COG class. The functionalities of the 5,584 strain-specific CDS from the other strains are likely to be equally obscure.
Distribution of orthologous groups across the eleven genomes
It is obvious from Figure 3 that the distribution pattern of the orthologs does not always follow the path of vertical descent - clearly horizontal gene transfer has occurred. Several surveys of the genetic make-up of environmental and clinical strains of V. cholerae have shown that most of the genetic variability is found on genomic islands (see, for example, [22, 32]). Figure 3 highlights the genomic islands, and the relative paucity of orthologs for proteins found encoded on these islands underscores their unusual nature; proteins encoded on the more "stable" regions of the large chromosome are more widely distributed. Genomic islands VSP-I and VSP-II are the Vibrio seventh pandemic regions and are found in the serotype O1 El Tor strains like N16961, but not on the sixth pandemic O1 strains like O395, although the insertion sites for the islands do exist on this strain. The absence of these elements is evidence that the proteins encoded on these elements are not required for the epidemic phenotype. Interestingly, all four of these genomic islands can excise from the genome and form circular intermediates [33, 34]. Furthermore, the insertion site for these genomic islands can be found on other species of the Vibrionaceae . The VSP-II insertion site, for example, brackets a much larger genomic island found in V. vulnificus YJ016 (where it is known as VVI-I, see Additional file 4 Figure S3) . The VPI-1 insertion site, which brackets genes relevant to TCP expression, encloses completely different sets of genes in V. vulnificus strains YJ016 and CMCP6, in V. parahaemolyticus strain RIMD2210633, in A. fischeri E114 and in P. profundum SS9 .
While in the large chromosome the genomic islands appear to contain most of the genomic diversity, the small chromosome is more uniformly mixed, as reflected by the mosaic of colors. On this chromosome we observe that the pattern expected from vertical descent, an ortholog gradient from V. cholerae N16961 through to A. salmonicida, is more obvious, although the superintegron stands out as a region of intense diversification.
Virulence-associated orthologous groups
Aside from the recognized genomic islands marked in Figure 4, we can recognize at least three other regions on the large chromosome, marked A, B, and C, that encode virulence proteins that are unique to V. cholerae. Regions A and B are associated with cell-surface polysaccharides. Region A (VC0245-VC0250) encodes gene products required for the synthesis of the O-type antigens, which give the two strains of V. cholerae their O1 serotype. Loss of the ability to synthesize an O-antigen, for example, by disruption of VC0249 (rfb L), has been shown to severely impair V. cholerae's ability to establish an infection in mice . A search for homologs of the six region A proteins in the UniRef50 database , which clusters proteins on the basis of a sequence identity of 50% or greater, revealed that for all six proteins the other members of their respective clusters were all from other serotype O1 strains of V. cholerae and had a sequence identity of < 99%. A more extensive search in the IMG database  showed that the best match homologs with less than 50% identity were from outside the family Vibrionaceae.
Region B (VC0926-VC0933) includes proteins involved in biofilm formation. Loci VC0926 and VC0927 are part of the vps I cluster (Vibrio polysaccharide synthesis) . Loci VC0928 through VC0933 (also known as rmb A through F ) are interposed between vps I and a second vps cluster, vps II. Biofilm formation is a regulated response to environmental conditions and the rmb genes are all expressed when the vps I and vps II clusters are up-regulated . Deletion of any of them suppresses the formation of the biofilms associated with vps gene expression . Expression of the Vibrio polysaccharide (VPS) genes has been linked to increased survival of toxigenic strains challenged with chlorine (and is thus an important trait for survival in municipal water supplies) , changes in osmotic pressure and pH and oxidative stress [41, 45, 46]. Furthermore, in vivo experiments have linked the biofilm formation associated with the VPS protein expression to virulence [47, 48]. All of the most similar orthologs to VC0926 and VC0927 are found in various serotypes of V. cholerae; outside this species, the genus Burkholderia has homologs with over 60% identity, but there are no orthologs in any other Vibrionaceae. Orthologs of VC0928 appear to be restricted to V. cholerae. Orthologs of VC0930, a putative hemolysin do not occur in any other Vibrionaceae species, although some strains of V. cholerae have more than one ortholog. VC0931, VC0932 and VC0933 are confined to Vibrio species, but low-identity orthologs of VC0931 and VC0932 are seen in many genera. Like the O-antigen, the proteins encoded by vps I and by the rbm genes appears to be unique to V. cholerae, but not only in strains with the virulent, toxigenic phenotype. Thus, these proteins count among the accessory virulence proteins and play a dual role, as they also help V. cholerae survive outside the host. Interestingly, the genes encoded on the vps II cluster are found in other members of the Vibrionaceae, including A. fischeri, V. harveyi, V. parahaemolyticus, and V. vulnificus. In the latter species, three of the vsp II genes (orthologs of VC0934, VC0936, and VC0937) are duplicated, a lineage specific expansion that we speculate was mediated via horizontal gene transfer .
Region C (VC1394-VC1406) encloses one of three clusters of genes on the V. cholerae chromosomes that are paradigmatically involved in chemotaxis. The clusters are designated clusters I through III  and region C includes the cluster I chemotaxis genes. Clusters II and III are found on chromosomes 1 and 2 respectively (Figure 4). Only the proteins encoded in cluster II (VC2059-VC2065) can be shown to be involved in chemotactic signaling in vitro [50, 51]. Although the histidine kinase (CheA-1) and response regulators (CheY-1 and CheY-2) encoded in region C are homologous to the cognate proteins in cluster II, they cannot complement cluster II deletion mutants. CheY-1 and CheY-2 are each missing amino acid residues involved in the interaction between CheY and FliM [51, 52]. FliM is part of the flagellar assembly and mediates a change in rotational direction of the flagellum, which is important to chemotaxis. Phylogenetic analysis of the Che cluster I proteins shows that, within the Vibrionaceae, they are exclusively found in V. cholerae strains (Additional file 2 Figure S1). In addition, with the exception of CheA, they are more similar to the Che proteins encoded in cluster III than to those encoded on cluster II. Gene neighborhood analysis using the IMG resource  shows that the cluster I homologs always group together on the large chromosome. Interestingly, the four MCP homologs flanking the Che homologs in region C are, like the CheY-1 and -2 homologs, unusual. While 38 of the 45 the MCPs found in V. cholerae are members of the 40H group, as classified by Alexander and Zhulin [29, 53], only one of the four MCPs in region C is a member of this class. The two MCPs (the VC1394 and VC1403 orthologs) immediately flanking region C each have unusual domain architectures. VC1394 orthologs are classified as Unaligned Membrane-bound (UM) MCPs, indicating that they are membrane bound but lack an identifiable sensory domain. UM MCPs are found in many strains, but their function is not known. VC1403 homologs are from group 44H and have two MCP signal domains; there is only one 44H MCP in each V. cholerae genome. The expression of the MCP encoded by VC1403 appears to be enhanced under anaerobic conditions  similar to those found in the intestine. The last protein in region C, VC1406, is a 24H MCP. This group has no obvious methyl binding sites, sites that are important for signal transduction. One possibility is that this type of MCP modulates signal amplification within MCP arrays. The unusual nature of the MCPs and Che proteins found in region C hints at functions other than chemosensing-driven motility. Their potential involvement in virulence is indicated by at least two studies. In the first, transcription of genes in this region was shown to be increased under conditions designed to induce expression of virulence genes  and in the second VC1397 and VC1399 were shown to be expressed in the course of human infections .
Region C is part of a 168 kb cluster identified using M-GCAT  (Additional file 3 Figure S2). Alignment of this cluster using the MUSCLE algorithm  bundled with M-GCAT showed that there was 99.5% identity among the sequences from all the complete V. cholerae genomes currently available. Region C is also conserved in the draft V. cholerae genomes available through IMG. Recently, Chun et al. identified the region spanning loci VC1393 to VC1405 as a genomic island  and it largely overlaps region C of Figure 4. Their analysis of the distribution of this region is consistent with ours. Like other putative genomic islands, it does not partition among strains of V. cholerae strictly according to pathogenicity. Nonetheless, it is invariably present in seventh pandemic strains and is in fact only absent from a single strain that has a CTXphi region, that is, V51, a clinical isolate from the United States. This supports the idea that the region C orthologs have a role in epidemic virulence.
Functions of virulence-associated orthologs
The two V. cholerae strains share 319 orthologous groups that are not found in the other strains. Only 62 of these orthologs are classified as virulence proteins and most of them are found in the regions marked in Figure 4. The fact that so few putative virulence genes are unique to these two strains testifies to the wide distribution of virulence-associated genes among the Vibrionaceae. Just over one third of the shared virulence-associated orthologs were placed in a COG functional group. COG classes T and N are over-represented in this set. Of course, the orthologs encoded in region C are members of these COG classes and exemplify some of the signaling and regulatory processes that are doubtless required for successful colonization of the human host. The high proportion of unclassified orthologous groups in the set of virulence-associated orthologs unique to V. cholerae represents a pool of unrecognized functionalities that may equip the species to survive and flourish in the environments it faces. For example, there are four hypothetical proteins, loci VCA0881 through VCA0884 in strain N16961, that appear to form an operon . When a search of the National Microbial Pathogens Data Resource (NMPDR) was done, two of these proteins were found to be annotated as non-hemolytic enterotoxin lytic components. These proteins are found in other strains of V. cholerae, but not in any other species of Vibrionaceae. The most similar proteins outside this species are the non-hemolytic enterotoxins encoded by some Bacillus species, and are known to contribute to the pathogenic effects of these species.
It is surprising that these two strains of V. cholerae do not share more virulence proteins. One possible explanation is that their virulence, while employing the same CTXphi and toxin co-regulated pilus genes, are built on different underlying adaptations to human host-associated selective pressures. Both are O1 serotype strains, but O395 is a Classical biotype strain (also known as a sixth pandemic strain) while N16961 is an El Tor biotype strain (also known as a seventh pandemic strain). The two strains appear to have arisen independently. A recent analysis by Feng et al.  suggests that the mutation rate in Vibrio species is much higher than originally thought and that the El Tor biotype acquired its capability to become a pandemic strain independently of O395. The adaptations that arose in N16961 on its evolutionary path to virulence have allowed it to displace O395 as the causative agent of cholera around the world. A similar situation has been inferred in Escherichia coli, pathogenic strains of which are thought to have arisen more than once . Thus the complete set of genes involved in the cholera virulence phenotype may be quite different in the two strains, with only a subset of the potential virulence factors used in each strain.
Looking more widely at the Vibrionaceae we find there are 943 orthologous groups that contain only proteins from human pathogens (V. cholerae, V. vulnificus and V. parahaemolyticus) and 1,907 orthologous groups that contain only proteins from pathogenic strains (the human pathogens plus V. harveyi, V. splendidus and A. salmonicida). It would be misleading to assess these shared sets of genes in terms of virulence potential as these species have different modalities of infection and virulence. However, rearranging our core and accessory set of orthologs in the context of, for example, the genome of V. vulnificus YJ016 immediately reveals sets of orthologs that are unique to the species.
Twelve genomic islands have been defined for V. vulnificus YJ016 and they are shown in Additional file 4 Figure S3 [35, 36, 61]. Orthologs to the proteins encoded on these genomic islands exist on other strains (See Additional file 4 Figure S3), but, as discussed above, frequently the insertion site containing a genomic island in V. vulnificus YJ016 will enclose a different set of genes in another species. Regions VVI-I through VVI-IX meet the canonical requirements for genomic islands as defined in the literature  and six of them were found to be unique to strain YJ016 . The regions VVI-X through VVI-XII are found in strains YJ016 and CMCP6, but not in other species. These regions do not have all the features normally associated with genomic islands, but they do seem to show a presence/absence pattern across strains of V. vulnificus that indicates they are mobile. The presence of region VVI-XII was found to correlate with strains of V. vulnificus that were, or had the genetic potential to be, clinical isolates . Scanning Additional file 4 Figure S3, we can see other regions containing strings of orthologs unique to V. vulnificus. The orthologs encoded on loci VV1999 to VV2015 (YJ016 loci numbering) and marked as region A in Additional file 4 Figure S3 are a good example. The orthologs in this region span 15.6 kb. They include VV2003 to VV2012. Experiments examining the effect of changes in environmental concentrations of iron on transcription showed that these loci were induced under iron-limiting conditions and that they form an operon [63, 64]. Miyamoto et al. also showed that the Fur (ferric uptake regulator) protein regulates the expression of these genes. Iron availability is known to play a major role in the virulence of V. vulnificus , but Alice et al. were not able to link these genes to virulence. Other experiments designed to elucidate the role of the AphB regulatory protein in V. vulnificus showed that AphB regulates the expression of the genes in region A, along with many others. AphB mutants are less virulent than wild-type strains , and show a reduced ability to adhere to host cells. Interestingly, VV2003 to VV 2012 encode protein domains associated with the formation of pili thought to be involved in adherence. With the exception of VV2007 (Flp pilus assembly protein TadA), the encoded proteins do not have over 30% identity with other proteins containing similar domains and may carry out novel adherence functions associated with iron-poor environments such as those usually found in human serum. We feel these orthologs merit further investigation, as they are unique to V. vulnificus, and may contribute to functions that make this species one of the deadliest food-borne pathogens known.
The panproteome of the Vibrionaceae consists of 12,213 unique groups. 1,882 (15%) of these groups form the core proteome and 4,411 groups (36%) form the accessory proteome. These numbers are consistent with those reported for analyses of related groups at the species [22, 67] and family levels  and are similar to a genus-level study from a more distantly related taxon . This rather high level of sequence conservation is reflected in the conservation of so many virulence-related proteins from V. cholerae across the Vibrionaceae. It has been observed that the size (and potential size) of the panproteome, which is directly related to sequence diversity, appears to correlate with the life-style of the bacterium rather than with its taxonomic classification . In the species V. cholerae, Keymer et al.  have shown that the non-homogenous marine environment drives the formation of diverse populations of V. cholerae strains, each influenced by the environment in which it lives. Similarly, V. splendidus has been shown to form distinct populations within different ecological niches in the marine environment . Hence, although apparent niche-driven generation of diversity may oppose it, many of the genes associated with the virulence of V. cholerae are widely distributed within the Vibrionaceae. This conservation may not be a response to the need to survive in the human intestine, which many strain from this family never see, but a sign that the virulence-associated proteins are fitness factors required by V. cholerae and other Vibrionaceae to ensure their physiological flexibility. The marine environment allows niche specialization and the coexistence of a variety of genotypes, but it is also dynamic, and the barriers to genetic exchange among the genotypes are occasionally removed. V. cholerae can exchange genetic material via phage-mediated mechanisms, conjugation and chitin-induced natural competence . The latter mechanism does not require special sequence elements or dedicated enzymology to achieve integration of donor DNA, and much of the necessary machinery is conserved not only with in the species V. cholerae, but across the Vibrionaceae (see Additional file 1 Table S1). We observed above that some genomic islands appear to move across the species boundary and we believe that movement across the genus boundary also occurs. High rates of genetic exchange can help species survive environmental challenges, as has been observed for Streptococcus pneumoniae  and this principle may operate for the Vibrionaceae, but presumably the high rates of recombination, which, in principle, should lead to genomic homogenization, are balanced by forces that ensure the species within the Vibrionaceae remain distinct . Interestingly, Stine et al. recently showed that concurrent outbreaks of cholera in Bangladesh were caused by different genotypes of epidemic V. cholerae . This supports the idea that the gene pool sustained by the Vibrionaceae aids in the emergence of multiple epidemic strains when conditions are favorable. Other species within the Vibrionaceae have their own unique sets of proteins, which can contribute to the pathogenic phenotype. Our approach facilitates the identification of such sets, highlighting proteins that, like the signal transduction systems discussed above, may play key roles in virulence.
Genomes and annotation
Eleven genomes from the family Vibrionaceae including V. cholerae strains O1 biovar El Tor str. N16961 [GenBank: NC_002505, NC_002506]  and O1 biovar classical str. O395 [GenBank: NC_009456, NC_009457] , V. parahaemolyticus RIMD 2210633 [GenBank: NC_004603, NC_004605] , V. vulnificus CMCP6 [GenBank: NC_004459, NC_004460] , V. vulnificus YJ016 [GenBank: NC_005139, NC_005140, NC_005128] , V. harveyi ATCC® BAA-1116™ [GenBank: NC_009777, NC_009783, NC_009784], V. splendidus LGP32 [GenBank: NC_011744, NC_011753] , Aliivibrio (Vibrio) fischeri ES114 [GenBank: NC_006840, NC_006841, NC_006842] , A. fischeri MJ11 [GenBank: NC_0111866, NC_0111865, NC_0111864] , A. salmonicida LFI1238 [GenBank: NC_011312, NC_011313, NC_011314, NC_011311, NC_011315, NC_011316] , and Photobacterium profundum SS9 [GenBank: NC_006370, NC_006371, NC_005871]  were downloaded from the J. Craig Venter Institute's Comprehensive Microbial Resource .
Additional annotation for individual strains was retrieved from the National Microbial Pathogen Data Resource (NMPDR)  The UCSC Archaeal Genome Browser  the Integrated Microbial Genomes system  and from UniProt .
Identification of orthologous groups
The procedure is described in Gu et al. . Open reading frames from the genome sequences were analyzed using OrthoMCL  to detect and group the orthologous proteins in the 11 strains of Vibrionaceae. OrthoMCL is a good choice for ortholog detection as it has reasonably low false positive and false negative detection rates and can detect orthologs across a group of genomes . From the results we identified three sets of proteins: (i) those that were encoded by all 11 strains, and which we call the core proteome of the Vibrionaceae, (ii) those encoded on two or more, but less than eleven of, the genomes. We refer to these as the accessory proteome, and (iii) those that were encoded on only one genome, which we refer to as the strain-unique proteome. Together these three sets compose the panproteome of the Vibrionaceae. A hierarchical functional classification of the proteins that fell into OrthoMCL groups was performed by searching against the Clusters of Orthologous Groups (COG) database .
We scored the presence or absence of all orthologs in each of the eleven genomes and used the accessory proteome to calculate the phylogenetic relationships among the 11 strains using the program dollop from the PHYLIP package . Huson and Steel have demonstrated the superiority of Dollo parsimony over distance-based methods for the deduction of phylogenetic trees based on gene content . The tree was drawn using Dendroscope . The presence and absence of each ortholog in each genome was visualized using the R statistical package and the gplots library. Orthologs were ordered in the plots according to their occurrence in the genome of interest.
Phylogenetic analysis of the CheA, CheB, CheY and CheZ orthologs of the V. cholerae proteins was done as follows. First, the sequences of the orthologs were collected in fasta format from the UniProt database. Alignments were carried out using the L-INS-I method of the MAFFT software, version 6.713  and evaluated using pfaat . Maximum likelihood trees were inferred using the Treefinder software . The Whelan and Goldman substitution model  was used.
This work is supported in part by NIH grant 1R21AI067543 to T.G. Lilburn and Y. Wang, NIH grants SC1GM081068 and SC1AI080579 to Y. Wang, and the PSC-CUNY Research Award PSCREG-39-497 and CUNY Summer Research Award to J. Gu. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences, National Institute of Allergy and Infectious Diseases or the National Institutes of Health. These agencies had no role in the collection, analysis and interpretation of the data or in the writing of or decision to submit this manuscript.
- Tettelin H, Masignani V, Cieslewicz MJ, Donati C, Medini D, Ward NL, Angiuoli SV, Crabtree J, Jones AL, Durkin AS: Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial "pan-genome". Proc Natl Acad Sci USA. 2005, 102 (39): 13950-13955. 10.1073/pnas.0506758102.PubMed CentralPubMedView ArticleGoogle Scholar
- Han CS, Xie G, Challacombe JF, Altherr MR, Bhotika SS, Brown N, Bruce D, Campbell CS, Campbell ML, Chen J: Pathogenomic sequence analysis of Bacillus cereus and Bacillus thuringiensis isolates closely related to Bacillus anthracis. J Bacteriol. 2006, 188 (9): 3382-3390. 10.1128/JB.188.9.3382-3390.2006.PubMed CentralPubMedView ArticleGoogle Scholar
- Rasko DA, Rosovitz MJ, Myers GS, Mongodin EF, Fricke WF, Gajer P, Crabtree J, Sperandio V, Ravel J: The pan-genome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates. J Bacteriol. 2008, 190 (20): 6881-6893. 10.1128/JB.00619-08.PubMed CentralPubMedView ArticleGoogle Scholar
- Sim SH, Yu Y, Lin CH, Karuturi RK, Wuthiekanun V, Tuanyok A, Chua HH, Ong C, Paramalingam SS, Tan G: The core and accessory genomes of Burkholderia pseudomallei: implications for human melioidosis. PLoS Pathog. 2008, 4 (10): e1000178-10.1371/journal.ppat.1000178.PubMed CentralPubMedView ArticleGoogle Scholar
- Schoen C, Blom J, Claus H, Schramm-Glück A, Brandt P, Müller T, Goesmann A, Joseph B, Konietzny S, Kurzai O: Whole-genome comparison of disease and carriage strains provides insights into virulence evolution in Neisseria meningitidis. Proc Natl Acad Sci USA. 2008, 105 (9): 3473-3478. 10.1073/pnas.0800151105.PubMed CentralPubMedView ArticleGoogle Scholar
- Janvilisri T, Scaria J, Thompson AD, Nicholson A, Limbago BM, Arroyo LG, Songer JG, Gröhn YT, Chang YF: Microarray identification of Clostridium difficile core components and divergent regions associated with host origin. J Bacteriol. 2009, 191 (12): 3881-3891. 10.1128/JB.00222-09.PubMed CentralPubMedView ArticleGoogle Scholar
- Thompson FL, Iida T, Swings J: Biodiversity of vibrios. Microbiol Mol Biol Rev. 2004, 68 (3): 403-431. 10.1128/MMBR.68.3.403-431.2004.PubMed CentralPubMedView ArticleGoogle Scholar
- Farmer JJ, Janda JM, Brenner FW, Cameron DN, Birkhead KM: Genus I. Vibrio Pacini 1854, 411 AL . Bergey's Manual of Systematic Bacteriology. Edited by: Brenner DJ, Krieg NR, Staley JT. 2005, New York, NY: Springer, 2: 494-546.Google Scholar
- Thompson FL, Swings JG: Taxonomy of the Vibrios. The biology of vibrios. Edited by: Thompson FL, Austin B, Swings JG. 2006, Washington, DC: ASM Press, 29-43.View ArticleGoogle Scholar
- Austin B, Zhang XH: Vibrio harveyi: a significant pathogen of marine vertebrates and invertebrates. Lett Appl Microbiol. 2006, 43 (2): 119-124. 10.1111/j.1472-765X.2006.01989.x.PubMedView ArticleGoogle Scholar
- LeRoux F, Austin B: Vibrio splendidus. The Biology of Vibrios. Edited by: Thompson FL, Austin B, Swings J. 2006, Washington, DC: ASM Press, 285-296.View ArticleGoogle Scholar
- Hjerde E, Lorentzen MS, Holden MT, Seeger K, Paulsen S, Bason NC, Churcher C, Harris D, Norbertczak H, Quail MA: The genome sequence of the fish pathogen Aliivibrio salmonicida strain LFI1238 shows extensive evidence of gene decay. BMC Genomics. 2008, 9 (1): 616-10.1186/1471-2164-9-616.PubMed CentralPubMedView ArticleGoogle Scholar
- Ruby EG: Lessons from a cooperative, bacterial-animal association: the Vibrio fischeri-Euprymna scolopes light organ symbiosis. Annu Rev Microbiol. 1996, 50: 591-624. 10.1146/annurev.micro.50.1.591.PubMedView ArticleGoogle Scholar
- Ruby EG, Nealson KH: Symbiotic association of Photobacterium fischeri with the marine luminous fish Monocentris japonica; a model of symbiosis based on bacterial studies. The Biological bulletin. 1976, 151 (3): 574-586. 10.2307/1540507.PubMedView ArticleGoogle Scholar
- Vezzi A, Campanaro S, D'Angelo M, Simonato F, Vitulo N, Lauro FM, Cestaro A, Malacrida G, Simionati B, Cannata N: Life at depth: Photobacterium profundum genome sequence and expression analysis. Science. 2005, 307 (5714): 1459-1461. 10.1126/science.1103341.PubMedView ArticleGoogle Scholar
- Abd H, Saeed A, Weintraub A, Nair GB, Sandström G: Vibrio cholerae O1 strains are facultative intracellular bacteria, able to survive and multiply symbiotically inside the aquatic free-living amoeba Acanthamoeba castellanii. FEMS Microbiol Ecol. 2007, 60 (1): 33-39. 10.1111/j.1574-6941.2006.00254.x.PubMedView ArticleGoogle Scholar
- Faruque SM, Chowdhury N, Kamruzzaman M, Dziejman M, Rahman MH, Sack DA, Nair GB, Mekalanos JJ: Genetic diversity and virulence potential of environmental Vibrio cholerae population in a cholera-endemic area. Proc Natl Acad Sci USA. 2004, 101 (7): 2123-2128. 10.1073/pnas.0308485100.PubMed CentralPubMedView ArticleGoogle Scholar
- Rahman M, Biswas K, Hossain MA, Sack RB, Mekalanos J, Faruque SM: Distribution of genes for virulence and ecological fitness among diverse Vibrio cholerae population in a cholera endemic area: Tracking the evolution of pathogenic strains. DNA Cell Biol. 2008, 27 (7): 347-355. 10.1089/dna.2008.0737.PubMed CentralPubMedView ArticleGoogle Scholar
- Meibom KL, Blokesch M, Dolganov NA, Wu CY, Schoolnik GK: Chitin induces natural competence in Vibrio cholerae. Science. 2005, 310 (5755): 1824-1827. 10.1126/science.1120096.PubMedView ArticleGoogle Scholar
- Pruzzo C, Vezzulli L, Colwell RR: Global impact of Vibrio cholerae interactions with chitin. Environ Microbiol. 2008, 10 (6): 1400-1410. 10.1111/j.1462-2920.2007.01559.x.PubMedView ArticleGoogle Scholar
- Udden SM, Zahid MS, Biswas K, Ahmad QS, Cravioto A, Nair GB, Mekalanos JJ, Faruque SM: Acquisition of classical CTX prophage from Vibrio cholerae O141 by El Tor strains aided by lytic phages and chitin-induced competence. Proc Natl Acad Sci USA. 2008, 105 (33): 11951-11956. 10.1073/pnas.0805560105.PubMed CentralPubMedView ArticleGoogle Scholar
- Chun J, Grim C, Hasan N, Lee J, Choi S, Haley B, Taviani E, Jeon Y, Kim D, Lee J: Comparative genomics reveals mechanism for short-term and long-term clonal transitions in pandemic Vibrio cholerae. Proc Natl Acad Sci USA. 2009, 106 (36): 15442-15447. 10.1073/pnas.0907787106.PubMed CentralPubMedView ArticleGoogle Scholar
- Siddique AK, Nair GB, Alam M, Sack DA, Huq A, Nizam A, Longini IM, Qadri F, Faruque SM, Colwell RR: El Tor cholera with severe disease: a new threat to Asia and beyond. Epidemiology and infection. 2009, 1-6.Google Scholar
- Medini D, Serruto D, Parkhill J, Relman DA, Donati C, Moxon R, Falkow S, Rappuoli R: Microbiology in the post-genomic era. Nat Rev Microbiol. 2008, 6 (6): 419-430.PubMedGoogle Scholar
- Gu J, Neary J, Cai H, Moshfeghian A, Rodriguez SA, Lilburn TG, Wang Y: Genomic and systems evolution in Vibrionaceae species. BMC Genomics. 2009, 10 (Suppl 1): S11-10.1186/1471-2164-10-S1-S11.PubMed CentralPubMedView ArticleGoogle Scholar
- Lefébure T, Stanhope MJ: Evolution of the core and pan-genome of Streptococcus: positive selection, recombination, and genome composition. Genome Biol. 2007, 8 (5): R71-10.1186/gb-2007-8-5-r71.PubMed CentralPubMedView ArticleGoogle Scholar
- Uchiyama I: Multiple genome alignment for identifying the core structure among moderately related microbial genomes. BMC Genomics. 2008, 9 (1): 515-10.1186/1471-2164-9-515.PubMed CentralPubMedView ArticleGoogle Scholar
- Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN: The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003, 4 (1): 41-10.1186/1471-2105-4-41.PubMed CentralPubMedView ArticleGoogle Scholar
- Ulrich LE, Zhulin IB: MiST: a microbial signal transduction database. Nucl Acids Res. 2007, 35 (Suppl 1): D386-390. 10.1093/nar/gkl932.PubMed CentralPubMedView ArticleGoogle Scholar
- Felsenstein J: PHYLIP (Phylogeny Inference Package) Ver. 3.6. 2005, Department of Genome Sciences, University of Washington, Seattle, WA: Distributed by the authorGoogle Scholar
- Urbanczyk H, Ast JC, Kaeding AJ, Oliver JD, Dunlap PV: Phylogenetic analysis of the incidence of lux gene horizontal transfer in Vibrionaceae. J Bacteriol. 2008, 190 (10): 3494-3504. 10.1128/JB.00101-08.PubMed CentralPubMedView ArticleGoogle Scholar
- Miller MC, Keymer DP, Avelar A, Boehm AB, Schoolnik GK: Detection and transformation of genome segments that differ within a coastal population of Vibrio cholerae strains. Appl Environ Microbiol. 2007, 73 (11): 3695-3704. 10.1128/AEM.02735-06.PubMed CentralPubMedView ArticleGoogle Scholar
- Murphy RA, Boyd EF: Three pathogenicity islands of Vibrio cholerae can excise from the chromosome and form circular intermediates. J Bacteriol. 2008, 190 (2): 636-647. 10.1128/JB.00562-07.PubMed CentralPubMedView ArticleGoogle Scholar
- Rajanna C, Wang J, Zhang D, Xu Z, Ali A, Hou YM, Karaolis DK: The vibrio pathogenicity island of epidemic Vibrio cholerae forms precise extrachromosomal circular excision products. J Bacteriol. 2003, 185 (23): 6893-6901. 10.1128/JB.185.23.6893-6901.2003.PubMed CentralPubMedView ArticleGoogle Scholar
- Quirke AM, Reen FJ, Claesson MJ, Boyd EF: Genomic island identification in Vibrio vulnificus reveals significant genome plasticity in this human pathogen. Bioinformatics. 2006, 22 (8): 905-910. 10.1093/bioinformatics/btl015.PubMedView ArticleGoogle Scholar
- O'Shea YA, Finnan S, Reen FJ, Morrissey JP, O'Gara F, Boyd EF: The Vibrio seventh pandemic island-II is a 26.9 kb genomic island present in Vibrio cholerae El Tor and O139 serogroup isolates that shows homology to a 43.4 kb genomic island in V. vulnificus. Microbiology. 2004, 150 (Pt 12): 4053-4063. 10.1099/mic.0.27172-0.PubMedView ArticleGoogle Scholar
- Gu J, Wang Y, Lilburn T: A comparative genomics, network-based approach to understanding virulence in Vibrio cholerae. J Bacteriol. 2009, 191 (20): 6262-6272. 10.1128/JB.00475-09.PubMed CentralPubMedView ArticleGoogle Scholar
- Ali A, Mahmud ZH, Morris JG, Sozhamannan S, Johnson JA: Sequence analysis of TnphoA insertion sites in Vibrio cholerae mutants defective in rugose polysaccharide production. Infect Immun. 2000, 68 (12): 6857-6864. 10.1128/IAI.68.12.6857-6864.2000.PubMed CentralPubMedView ArticleGoogle Scholar
- Suzek BE, Huang H, McGarvey P, Mazumder R, Wu CH: UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics. 2007, 23 (10): 1282-1288. 10.1093/bioinformatics/btm098.PubMedView ArticleGoogle Scholar
- Markowitz VM, Korzeniewski F, Palaniappan K, Szeto E, Werner G, Padki A, Zhao X, Dubchak I, Hugenholtz P, Anderson I: The integrated microbial genomes (IMG) system. Nucl Acids Res. 2006, D344-348. 10.1093/nar/gkj024. 34 Database
- Yildiz FH, Schoolnik GK: Vibrio cholerae O1 El Tor: identification of a gene cluster required for the rugose colony type, exopolysaccharide production, chlorine resistance, and biofilm formation. Proc Natl Acad Sci USA. 1999, 96 (7): 4028-4033. 10.1073/pnas.96.7.4028.PubMed CentralPubMedView ArticleGoogle Scholar
- Fong JCN, Yildiz FH: The rbm BCDEF gene cluster modulates development of rugose colony morphology and biofilm formation in Vibrio cholerae. J Bacteriol. 2007, 189 (6): 2319-2330. 10.1128/JB.01569-06.PubMed CentralPubMedView ArticleGoogle Scholar
- Yildiz FH, Liu XS, Heydorn A, Schoolnik GK: Molecular analysis of rugosity in a Vibrio cholerae O1 El Tor phase variant. Mol Microbiol. 2004, 53 (2): 497-515. 10.1111/j.1365-2958.2004.04154.x.PubMedView ArticleGoogle Scholar
- Rice EW, Johnson CJ, Clark RM, Fox KR, Reasoner DJ, Dunnigan ME, Panigrahi P, Johnson JA, Morris JG: Chlorine and survival of "rugose" Vibrio cholerae. Lancet. 1992, 340 (8821): 740-10.1016/0140-6736(92)92289-R.PubMedView ArticleGoogle Scholar
- Wai S, Mizunoe Y, Takade A, Kawabata S, Yoshida S: Vibrio cholerae O1 strain TSI-4 produces the exopolysaccharide materials that determine colony morphology, stress resistance, and biofilm formation. Appl Environ Microbiol. 1998, 64 (10): 3648-3655.PubMed CentralPubMedGoogle Scholar
- Watnick PI, Lauriano CM, Klose KE, Croal L, Kolter R: The absence of a flagellum leads to altered colony morphology, biofilm development and virulence in Vibrio cholerae O139. Mol Microbiol. 2001, 39 (2): 223-235. 10.1046/j.1365-2958.2001.02195.x.PubMed CentralPubMedView ArticleGoogle Scholar
- Rashid MH, Rajanna C, Zhang D, Pasquale V, Magder LS, Ali A, Dumontet S, Karaolis DKR: Role of exopolysaccharide, the rugose phenotype and VpsR in the pathogenesis of epidemic Vibrio cholerae. FEMS Microbiol Lett. 2004, 230 (1): 105-113. 10.1016/S0378-1097(03)00879-6.PubMedView ArticleGoogle Scholar
- Zhu J, Mekalanos JJ: Quorum sensing-dependent biofilms enhance colonization in Vibrio cholerae. Dev Cell. 2003, 5 (4): 647-656. 10.1016/S1534-5807(03)00295-8.PubMedView ArticleGoogle Scholar
- Boin MA, Austin MJ, Häse CC: Chemotaxis in Vibrio cholerae. FEMS Microbiol Lett. 2004, 239 (1): 1-8. 10.1016/j.femsle.2004.08.039.PubMedView ArticleGoogle Scholar
- Gosink KK, Kobayashi R, Kawagishi I, Hase CC: Analyses of the roles of the three che A homologs in chemotaxis of Vibrio cholerae. J Bacteriol. 2002, 184 (6): 1767-1771. 10.1128/JB.184.6.1767-1771.2002.PubMed CentralPubMedView ArticleGoogle Scholar
- Hyakutake A, Homma M, Austin MJ, Boin MA, Hase CC, Kawagishi I: Only one of the five CheY homologs in Vibrio cholerae directly switches flagellar rotation. J Bacteriol. 2005, 187 (24): 8403-8410. 10.1128/JB.187.24.8403-8410.2005.PubMed CentralPubMedView ArticleGoogle Scholar
- Dasgupta J, Dattagupta JK: Structural determinants of V. cholerae CheYs that discriminate them in FliM binding: comparative modeling and MD simulation studies. J Biomol Struct Dyn. 2008, 25 (5): 495-503.PubMedView ArticleGoogle Scholar
- Alexander RP, Zhulin IB: Evolutionary genomics reveals conserved structural determinants of signaling and adaptation in microbial chemoreceptors. Proc Natl Acad Sci USA. 2007, 104 (8): 2885-2890. 10.1073/pnas.0609359104.PubMed CentralPubMedView ArticleGoogle Scholar
- Kan B, Habibi H, Schmid M, Liang W, Wang R, Wang D, Jungblut PR: Proteome comparison of Vibrio cholerae cultured in aerobic and anaerobic conditions. Proteomics. 2004, 4 (10): 3061-3067. 10.1002/pmic.200400944.PubMedView ArticleGoogle Scholar
- Beyhan S, Tischler AD, Camilli A, Yildiz FH: Differences in gene expression between the classical and El Tor biotypes of Vibrio cholerae O1. Infect Immun. 2006, 74 (6): 3633-3642. 10.1128/IAI.01750-05.PubMed CentralPubMedView ArticleGoogle Scholar
- Hang L, John M, Asaduzzaman M, Bridges EA, Vanderspurt C, Kirn TJ, Taylor RK, Hillman JD, Progulske-Fox A, Handfield M: Use of in vivo-induced antigen technology (IVIAT) to identify genes uniquely expressed during human infection with Vibrio cholerae. Proc Natl Acad Sci USA. 2003, 100 (14): 8508-8513. 10.1073/pnas.1431769100.PubMed CentralPubMedView ArticleGoogle Scholar
- Treangen TJ, Messeguer X: M-GCAT: interactively and efficiently constructing large-scale multiple genome comparison frameworks in closely related species. BMC Bioinformatics. 2006, 7: 433-10.1186/1471-2105-7-433.PubMed CentralPubMedView ArticleGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucl Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.PubMed CentralPubMedView ArticleGoogle Scholar
- Price M, Huang KH, Alm E, Arkin A: A novel method for accurate operon predictions in all sequenced prokaryotes. Nucl Acids Res. 2005, 33 (3): 880-892. 10.1093/nar/gki232.PubMed CentralPubMedView ArticleGoogle Scholar
- Feng L, Reeves PR, Lan R, Ren Y, Gao C, Zhou Z, Ren Y, Cheng J, Wang W, Wang J: A recalibrated molecular clock and independent origins for the cholera pandemic clones. PLoS ONE. 2008, 3 (12): e4053-10.1371/journal.pone.0004053.PubMed CentralPubMedView ArticleGoogle Scholar
- Cohen ALV, Oliver JD, DePaola A, Feil EJ, Boyd EF: Emergence of a virulent clade of Vibrio vulnificus and correlation with the presence of a 33-kilobase genomic island. Appl Environ Microbiol. 2007, 73 (17): 5553-5565. 10.1128/AEM.00635-07.PubMed CentralPubMedView ArticleGoogle Scholar
- Dobrindt U, Hochhut B, Hentschel U, Hacker J: Genomic islands in pathogenic and environmental microorganisms. Nat Rev Microbiol. 2004, 2 (5): 414-424. 10.1038/nrmicro884.PubMedView ArticleGoogle Scholar
- Alice AF, Naka H, Crosa JH: Global gene expression as a function of the iron status of the bacterial cell: influence of differentially expressed genes in the virulence of the human pathogen Vibrio vulnificus. Infect Immun. 2008, 76 (9): 4019-4037. 10.1128/IAI.00208-08.PubMed CentralPubMedView ArticleGoogle Scholar
- Miyamoto K, Kosakai K, Ikebayashi S, Tsuchiya T, Yamamoto S, Tsujibo H: Proteomic analysis of Vibrio vulnificus M2799 grown under iron-repleted and iron-depleted conditions. Microb Pathog. 2009, 46 (3): 171-177. 10.1016/j.micpath.2008.12.004.PubMedView ArticleGoogle Scholar
- Gulig PA, Bourdage KL, Starks AM: Molecular pathogenesis of Vibrio vulnificus. J Microbiol. 2005, 43: 118-131.PubMedGoogle Scholar
- Jeong HG, Choi SH: Evidence that AphB, essential for the virulence of Vibrio vulnificus, is a global regulator. J Bacteriol. 2008, 190 (10): 3768-3773. 10.1128/JB.00058-08.PubMed CentralPubMedView ArticleGoogle Scholar
- Keymer DP, Miller MC, Schoolnik GK, Boehm AB: Genomic and phenotypic diversity of coastal Vibrio cholerae strains is linked to environmental factors. Appl Environ Microbiol. 2007, 73 (11): 3705-3714. 10.1128/AEM.02736-06.PubMed CentralPubMedView ArticleGoogle Scholar
- Tettelin H, Riley D, Cattuto C, Medini D: Comparative genomics: the bacterial pan-genome. Curr Opin Microbiol. 2008, 11 (5): 472-477. 10.1016/j.mib.2008.09.006.PubMedView ArticleGoogle Scholar
- Hunt DE, David LA, Gevers D, Preheim SP, Alm E, Polz MF: Resource partitioning and sympatric differentiation among closely related bacterioplankton. Science. 2008, 320 (5879): 1081-1085. 10.1126/science.1157890.PubMedView ArticleGoogle Scholar
- Hanage WP, Fraser C, Tang J, Connor TR, Corander J: Hyper-recombination, diversity, and antibiotic resistance in pneumococcus. Science. 2009, 324 (5933): 1454-1457. 10.1126/science.1171908.PubMedView ArticleGoogle Scholar
- Fraser C, Alm EJ, Polz MF, Spratt BG, Hanage WP: The bacterial species challenge: making sense of genetic and ecological diversity. Science. 2009, 323 (5915): 741-746. 10.1126/science.1159388.PubMedView ArticleGoogle Scholar
- Stine OC, Alam M, Tang L, Nair GB, Siddique AK, Faruque SM, Huq A, Colwell R, Sack RB, Morris JG: Seasonal cholera from multiple small outbreaks, rural Bangladesh. Emerging Infect Dis. 2008, 14 (5): 831-833. 10.3201/eid1405.071116.PubMed CentralPubMedView ArticleGoogle Scholar
- Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML, Dodson RJ, Haft DH, Hickey EK, Peterson JD, Umayam L: DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae. Nature. 2000, 406: 477-483. 10.1038/35020000.PubMedView ArticleGoogle Scholar
- Makino K, Oshima K, Kurokawa K, Yokoyama K, Uda T, Tagomori K, Iijima Y, Najima M, Nakano M, Yamashita A: Genome sequence of Vibrio parahaemolyticus: a pathogenic mechanism distinct from that of V. cholerae. Lancet. 2003, 361 (9359): 743-749. 10.1016/S0140-6736(03)12659-1.PubMedView ArticleGoogle Scholar
- Kim YR, Lee SE, Kim CM, Kim SY, Shin EK, Shin DH, Chung SS, Choy HE, Progulske-Fox A, Hillman JD: Characterization and pathogenic significance of Vibrio vulnificus antigens preferentially expressed in septicemic patients. Infect Immun. 2003, 71 (10): 5461-5471. 10.1128/IAI.71.10.5461-5471.2003.PubMed CentralPubMedView ArticleGoogle Scholar
- Chen CY, Wu KM, Chang YC, Chang CH, Tsai HC, Liao TL, Liu YM, Chen HJ, Shen AB, Li JC: Comparative genome analysis of Vibrio vulnificus, a marine pathogen. Genome Res. 2003, 13 (12): 2577-2587. 10.1101/gr.1295503.PubMed CentralPubMedView ArticleGoogle Scholar
- Le Roux F, Zouine M, Chakroun N, Binesse J, Saulnier D, Bouchier C, Zidane N, Ma L, Rusniok C, Lajus A: Genome sequence of Vibrio splendidus: an abundant planctonic marine species with a large genotypic diversity. Environ Microbiol. 2009, 11 (8): 1959-1970. 10.1111/j.1462-2920.2009.01918.x.PubMedView ArticleGoogle Scholar
- Ruby EG, Urbanowski M, Campbell J, Dunn A, Faini M, Gunsalus R, Lostroh P, Lupp C, McCann J, Millikan D: Complete genome sequence of Vibrio fischeri: a symbiotic bacterium with pathogenic congeners. Proc Natl Acad Sci USA. 2005, 102 (8): 3004-3009. 10.1073/pnas.0409900102.PubMed CentralPubMedView ArticleGoogle Scholar
- Mandel MJ, Wollenberg MS, Stabb EV, Visick KL, Ruby EG: A single regulatory gene is sufficient to alter bacterial host range. Nature. 2009, 458 (7235): 215-218. 10.1038/nature07660.PubMed CentralPubMedView ArticleGoogle Scholar
- Peterson JD, Umayam LA, Dickinson T, Hickey EK, White O: The Comprehensive Microbial Resource. Nucl Acids Res. 2001, 29 (1): 123-125. 10.1093/nar/29.1.123.PubMed CentralPubMedView ArticleGoogle Scholar
- McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes S, Hwang K, Kubal M: The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation. Nucl Acids Res. 2007, 35: D347-353. 10.1093/nar/gkl947.PubMed CentralPubMedView ArticleGoogle Scholar
- Schneider KL, Pollard KS, Baertsch R, Pohl A, Lowe TM: The UCSC Archaeal Genome Browser. Nucl Acids Res. 2006, D407-410. 10.1093/nar/gkj134. 34 Database
- Jain E, Bairoch A, Duvaud S, Phan I, Redaschi N, Suzek BE, Martin MJ, McGarvey P, Gasteiger E: Infrastructure for the life sciences: design and implementation of the UniProt website. BMC Bioinformatics. 2009, 10: 136-10.1186/1471-2105-10-136.PubMed CentralPubMedView ArticleGoogle Scholar
- Li L, Stoeckert CJ, Roos DS: OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13 (9): 2178-2189. 10.1101/gr.1224503.PubMed CentralPubMedView ArticleGoogle Scholar
- Chen F, Mackey AJ, Vermunt JK, Roos DS: Assessing performance of orthology detection strategies applied to eukaryotic genomes. PLoS ONE. 2007, 2 (4): e383-10.1371/journal.pone.0000383.PubMed CentralPubMedView ArticleGoogle Scholar
- Tatusov RL, Galperin MY, Natale DA, Koonin EV: The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucl Acids Res. 2000, 28 (1): 33-36. 10.1093/nar/28.1.33.PubMed CentralPubMedView ArticleGoogle Scholar
- Huson DH, Steel M: Phylogenetic trees based on gene content. Bioinformatics. 2004, 20 (13): 2044-2049. 10.1093/bioinformatics/bth198.PubMedView ArticleGoogle Scholar
- Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, Rupp R: Dendroscope: An interactive viewer for large phylogenetic trees. BMC Bioinformatics. 2007, 8: 460-10.1186/1471-2105-8-460.PubMed CentralPubMedView ArticleGoogle Scholar
- Katoh K, Toh H: Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinform. 2008, 9 (4): 286-298. 10.1093/bib/bbn013.PubMedView ArticleGoogle Scholar
- Caffrey D, Dana P, Mathur V, Ocano M, Hong E-J, Wang Y, Somaroo S, Caffrey B, Potluri S, Huang E: PFAAT version 2.0: A tool for editing, annotating, and analyzing multiple sequence alignments. BMC Bioinformatics. 2007, 8 (1): 381-10.1186/1471-2105-8-381.PubMed CentralPubMedView ArticleGoogle Scholar
- Jobb G: TREEFINDER version of October 2008. 2008, Munich, Gemany: Distributed by the author at, [http://www.treefinder.de]Google Scholar
- Whelan S, Goldman N: A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001, 18 (5): 691-699.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.