- Research article
- Open Access
Genome analysis of Salmonella enterica subsp. diarizonae isolates from invasive human infections reveals enrichment of virulence-related functions in lineage ST1256
BMC Genomicsvolume 20, Article number: 99 (2019)
Salmonella enterica subsp. diarizonae (IIIb) is frequently isolated from the environment, cold-blooded reptiles, sheep and humans; however only a few studies describe the isolation of this subspecies from invasive human infections. The factors contributing to this unusual behavior are currently unknown.
We report here the genome features of two diarizonae strains, SBO13 and SBO27, isolated from endocervical tissue collected post-abortion and from cerebrospinal fluid of a newborn child, respectively, in the city of Santa Cruz, Bolivia. Although isolated six years apart, SBO27 in 2008 and SBO13 in 2014, both strains belong to the same sequence type 1256 (ST1256) and show a high degree of genome conservation sharing more than 99% of their genes, including the conservation of a ~ 10 kb plasmid. A prominent feature of the two genomes is the presence of 24 genomic islands (GIs), in addition to 10 complete Salmonella pathogenicity islands (SPI) and fragments of SPI-7, a pathogenicity island first reported in the human-adapted serovar Typhi. Some of the GIs identified in SBO13 and SBO27 harbor genes putatively encoding auto-transporters involved in adhesion, lipopolysaccharide modifying enzymes, putative toxins, pili-related proteins, efflux pumps, and several putative membrane cation transport related-genes, among others. These two Bolivian isolates also share genes encoding the type-III secretion system effector proteins SseK2, SseK3 and SlrP with other diarizonae sequence types (ST) mainly-associated with infections in humans. The sseK2, sseK3 and slrP genes were either absent or showing frameshift mutations in a significant proportion of genomes from environmental diarizonae isolates.
The comparative genomic study of two diarizonae strains isolated in Bolivia from human patients uncovered the presence of many genes putatively related to virulence. The statistically-significant acquisition of a unique combination of these functions by diarizonae strains isolated from humans may have impacted the ability of these isolates to successfully infect the human host.
The genus Salmonella comprises Gram-negative pathogenic bacteria that infect a large variety of hosts, including humans, livestock, reptiles, birds and insects . The genus includes two species, S. enterica and S. bongori, with S. enterica further subdivided in six subspecies: enterica (I), salamae (II), arizonae (IIIa), diarizonae (IIIb), houtenae (IV), and indica (VI) .
S. enterica subsp. enterica is responsible for most infections in warm-blooded (homeotherm) animals and comprises more than 1580 serovars that are differentiated by distinct somatic (O-antigen of the lipopolysaccharide, LPS) and flagellar antigenic formulae . Some of these serovars are host-adapted while others cause diseases in a broad range of hosts . Host-adapted serovars are frequently associated with systemic diseases like typhoid fever. In contrast, non-typhoidal Salmonella (NTS) often cause gastroenteritis in immunocompetent hosts. NTS serovars have also been linked to bacteraemia in immunocompromised and even in immunocompetent individuals . The most common NTS isolated from human patients include Enteritidis, Typhimurium, Infantis and Newport [5, 6].
Although more than 99% of the S. enterica strains isolated from human patients belong to the subsp. enterica, it is not unusual, especially in children, to find reports describing isolation of strains belonging to other subspecies . Many of these cases involve Salmonella infections linked to contact with cold-blooded animals, like reptiles kept as pets [8, 9]. Some studies have shown the presence of all Salmonella subspecies as part of the reptile gut microbiota, which may represent a risk of transmission to more susceptible (warm-blooded) hosts .
Pet snakes are reported to be an important reservoir of S. enterica subsp. diarizonae (hereafter diarizonae) . This subspecies is also often isolated from sheep [11,12,13,14] and is increasingly associated with infections in humans. Although in some of these cases the source of the infection could be traced to reptile pets and consumption of sheep meat [15,16,17]; it remains unknown whether human-to-human transmissions occur.
In this study, we report the genome analysis of two diarizonae strains with antigenic formula 48:i:z and sequence type ST1256, which were isolated from human patients in the city of Santa Cruz, Bolivia. Although these two isolates were isolated 6 years apart (2008 and 2014), our analysis shows a high degree of genome synteny between them. Remarkably, these two Bolivian isolates share a defined set of putative virulence genes with diarizonae of other sequence types (ST) mainly-associated with human infections. These factors include some effector proteins translocated by type III secretion systems, a genomic island (GI-6) and a fragment of the Salmonella pathogenicity island 7 (SPI-7).
Results and discussion
Molecular typing of SBO13 and SBO27 strains
Pet snakes are common reservoirs of diarizonae and, consequently, potential transmission sources. However, invasive infections in humans are rarely reported [10, 15, 17, 18]. Accordingly, we were interested in analyzing the genome of two diarizonae strains isolated from two human samples in the city of Santa Cruz, Bolivia. These strains were cultured from cerebrospinal fluid of a 36 h-old newborn that died (strain SBO27, isolated in 2008) and, endocervical tissue collected from a 23-year old woman who miscarried after ten weeks of pregnancy (strain SBO13, isolated in 2014). There was no record of the immune status of the infected pregnant woman whereas no immune or weight defects were observed in the newborn. This newborn was however delivered by a mother having urinary infections during pregnancy.
In the initial phenotypic analyses, isolates SBO13 and SBO27 were identified as diarizonae, as they are positive for malonate utilization but negative for dulcitol fermentation. Plasmid profile analyses showed the presence of one plasmid of ~ 10 kb in both strains. Antimicrobial susceptibility tests indicated that SBO13 and SBO27 are susceptible to most common beta-lactams, aminoglycosides, trimethoprim-sulfametoxazol and quinonoles.
Both SBO13 and SBO27 strains were assigned to serovar IIIb 48:i:z by serological and molecular methods, according to the White-Kauffmann-Le Minor scheme . Multi-locus sequence typing (MLST) analysis revealed that both isolates belong to ST1256, according to the Achtman scheme . Serovar designation and ST were confirmed using the genome sequence and the ST prediction tool available in the EnteroBase database (https://enterobase.warwick.ac.uk/).
Genome characteristics of the diarizonae isolates SBO13 and SBO27
The draft genomes of the diarizonae isolates SBO13 and SBO27 have 5,031,187 and 5,035,353 bp, respectively, with a G + C average content of 51.4%. The genome of SBO13 has a coding capacity for 4649 predicted proteins, 8 rRNAs, 80 tRNAs, and 165 ncRNAs (GenBank accession #PVNR00000000). The coding capacity of the SBO27 genome accounts for 4635 predicted proteins, 8 rRNAs, 80 tRNAs and 165 ncRNAs (GenBank accession #PVNQ00000000). SBO13 and SBO27 harbor one plasmid of 9377 and 9246 bp, respectively, with a notorious synteny and similar gene content (Additional file 1: Table S1). No antibiotic resistance-related genes were identified in these small plasmids, which encode several hypothetical proteins of unknown function, two putative regulatory proteins and an integrase (Additional file 1: Table S1).
Despite the sensitivity of SBO13 and SBO27 to many antimicrobials, a search in the ‘Comprehensive antibiotic resistance database’ (CARD) (https://card.mcmaster.ca/), revealed the presence of a reduced number of genes linked to antimicrobial resistance in both isolates. These consisted mainly of genes encoding components of efflux pumps of the major facilitator superfamily (MFS). Mutant alleles in glpT and uphT, which confer fosfomycin resistance based on a target modification mechanism, were also detected (Additional file 2: Table S2).
The genomes of SBO13 and SBO27 were initially annotated with Prokka  for phylogenomic analyses (see Methods) and subsequently by NCBI following our submission. To functionally analyse the SBO13 and SBO27 genomes, we used the subsystem annotation available at RAST server (http://rast.nmpdr.org/) [21, 22], which allows classification by biological processes. In SBO13, a total of 2771 protein-encoding genes were allocated to 571 annotated subsystems, defined as biological processes or structural complexes supported by a set of functional roles . Most of these genes were predicted to be involved in carbohydrate (697 genes) and amino acid (439 genes) metabolism (Additional file 3: Figure S1). Interestingly, a considerable number of genes, 237, were predicted to be associated with transport-related processes and another 130 genes predicted to encode protein secretion systems, some of which have been identified as important traits involved in bacterial virulence and pathogenesis . Similar results regarding the distribution of genes in the different subsystems categories were obtained for the SBO27 genome (not shown).
Phylogenetic analysis of diarizonae SBO13 and SBO27 genomes
To infer the phylogenetic affinities of the SBO13 and SBO27 genomes, we estimated a maximum-likelihood phylogeny from the consensus pan-genome (presence-absence) matrix of 18,446 consensus clusters computed from 109 genomes representative of the distinct Salmonella species and subspecies (Fig. 1). From the 551 diarizonae genomes available in EnteroBase in an accessible status (Additional file 4: Table S3), we included in the analysis 70 draft (Additional file 5: Table S4) and 6 complete genomes (Additional file 6: Table S5). These were selected based on two main criteria. First, the different source and geographic location of the isolate; and, secondly, a sample size roughly proportional to the number of isolates with a particular ST (see Methods).
The pan-genome phylogeny resolved the S. bongori and S. enterica species clades and the six S. enterica subspecies with maximal support (Fig. 1). The Bolivian isolates SBO13 and SBO27 formed a perfectly supported tree with other ST1256 isolates. The remaining diarizonae genomes also grouped mainly by their respective ST (Fig. 1). However, there was no apparent clustering of the diarizonae genomes by their isolation source, implying that no major differences in genome content exists among diarizonae isolates from human, animal, plant or environmental sources (Fig. 1). This is reflected by the observation that most major diarizonae clades contain strains from different sources (Additional file 4: Table S3). On the other hand, the maximum-likelihood core-genome phylogeny computed from 1058 top-scoring markers selected by the GET_PHYLOMARKERS pipeline  out of the 1650 consensus clusters (see Methods) computed by GET_HOMOLOGUES , resolved the same major groups (species and subspecies) within the Salmonella genus; although, as expected, with much lower resolution within the S. enterica subspecies (Additional file 7: Figure S2).
As mentioned above, the MLST analyses showed that SBO13 and SBO27 belong to the ST1256 (Additional file 5: Table S4). This is the fourth most abundant ST for diarizonae found in EnteroBase as of September 2018 (https://enterobase.warwick.ac.uk/). Interestingly, diarizonae strains with ST1256 are recovered mainly from human infections (37 of 40 isolates) (Additional file 4: Table S3). To date, isolates of ST1256 have been only reported in Europe and North America (Additional file 4: Table S3), which make SBO13 and SBO27 the first cases of diarizonae ST1256 isolated from South America. Similar to ST1256, other abundant diarizonae STs also display association with human infections. That is the case of ST233 (37 of 52 isolates); ST430 (21 of 31 isolates); and, ST432 (15 of 51 isolates, some of them with no source information). There are, however, other abundant STs, like ST63, isolated mainly in North America, for which human isolates are rare (1 of 28 isolates) (Additional file 4: Table S3). These differences also depend on important parameters such as the geographic location in which the strains were obtained. For example, a high percentage of the diarizonae STs strongly associated with human infections (ST233, ST430 and ST432); however, most strains were isolated in the United Kingdom (Additional file 4: Table S3). Therefore, this apparent association between STs and human sources may in part reflect a sampling bias. Overall, diarizonae strains appear to have a broad ecological valence, which indicates that much remains to be learned about their environmental distribution, molecular basis of host colonization and tissue tropism.
Genomic islands (GIs) of diarizonae SBO13 and SBO27 strains
To further characterize the genomes of diarizonae isolates SBO13 and SBO27, an all-against-all BLAST comparison was performed using nine complete genomes from distinct Salmonella species, subspecies, and serovars. These genomes included: six diarizonae strains with complete genome (Additional file 6: Table S5); S. bongori (strain NCTC_12,419); serovar Typhi (strain CT18); and serovar Typhimurium (strain SL1344) (Fig. 2a). A total of 24 genomic islands (GIs), designated as GI-1 to GI-24 were predicted in SBO13 and SBO27 using SIGIHMM and IslandPath-DIMOB (Fig. 2a, Additional file 8 : Table S6). Two islands, GI-17 and GI-20, were found in SBO13 and SBO27 and mostly absent in the rest of the genomes analysed (Fig. 2a). Detailed inspection of GI-17 and GI-20 revealed the presence of predicted phage-related genes along with a high proportion of genes encoding hypothetical proteins, suggesting that they are likely remnants of prophages acquired by these two strains. In addition, SBO13 and SBO27 contain an additional 11.8 kb terminal extension at GI-18 that is absent in the rest of strains analysed, making it the largest GI found in diarizonae (115 kb). This GI-18 region harbors genes encoding for predicted proteins related to virulence such as fimbrial proteins, an efflux system and a putative toxin (Additional file 8: Table S6). Interestingly, the analysis of two other large islands, GI-1 and GI-19, also revealed the presence of genes putatively involved in virulence. For example, genes encoding predicted proteins with homology to toxin-coregulated pilus biosynthesis-like proteins E (TcpE) (SBO13_30280) and TcpQ (SBO13_30220) , were found in GI-1; while several genes predicted to be related to iron transport such as the ferrous iron permease, efeU, the cytoplasmic ferritin, ftn, or the Fur regulated iron transporter, sitA, were identified in GI-19 (Additional file 8: Table S6). Accordingly, it is known that iron increases the pathogenic potential of Salmonella and other enteric pathogens during the infection of the intestinal epithelium .
On the other hand, some of the genes present in GI-6 and shared by SBO13 and SBO27 define a cluster of genes that is flanked by two transposase-encoding genes and is mostly predicted to be related to membrane transport (Fig. 2b, Additional file 8: Table S6). Most genes of this cluster have homologs in diarizonae strain HZS154 (isolated from stool of a human patient), including an auto-transporter bearing an outer membrane beta-barrel domain (SBO13_24810, SBO13_24900), a disulfide bond formation protein B, DsbB (SBO13_24840), an ABC-transporter (SBO13_24870), an iron permease (SBO13_24860), an outer membrane porin (SBO13_24880) and an O-antigen LPS length determinant protein (SBO13_24890), among others (Fig. 2b, Additional file 8: Table S6).
Pathogenicity islands (SPIs) in diarizonae SBO13 and SBO27
Although there are few human clinical reports from S. enterica isolates not belonging to subsp. enterica [7, 18], diarizonae has been found sporadically responsible for maxillary sinusitis, bacteremia, cervical lymphadentitis and gastroenteritis in adults and children [15, 17, 28, 29]. Our whole genome comparison revealed that diarizonae SBO13 and SBO27 harbor 10 known Salmonella pathogenicity islands (SPI-1, SPI-2, SPI-3, SPI-4, SPI-5, SPI-9, SPI-11, SPI-13, SPI-18, and SPI-21) and a significant portion of SPI-7 (Fig. 2a). SPI-21, which encodes a type VI secretion system as well as the bacterial protein toxins pyocins, is shared only by diarizonae and arizonae and has been proposed to enable these two subspecies to compete with other bacteria . Of interest, all diarizonae genomes lack SPI-6 and SPI-12, which have been shown to be important for competition and virulence in subsp. enterica [31, 32].
The most intensively studied pathogenicity islands in Salmonella are SPI-1, SPI-2, SPI-3, SPI-4, and SPI-5, which play important role in adhesion, invasion and the intracellular lifestyle of the pathogen . To assess putative differences in these five islands among diarizonae strains, we analysed their gene content by BLAST in SBO13 and SBO27 and, in a representative group of 15 diarizonae strains collected from human, animal, plant and environmental sources (Additional file 6: Table S5). Among these are included those six diarizonae isolates with known complete genome used in previous analyses. High conservation in gene content was observed for these pathogenicity islands (SPI-1 to SPI-5) in all diarizonae isolates, irrespective of their origin (Fig. 3).
SPI-7 in serovar Typhi comprises a DNA region of 134 kb, which contains approximately 150 predicted genes . In addition, SPI-7 has a mosaic structure, comprising regions thought to be involved in island mobility and other regions implicated in virulence: the viaB- operon, related to production of Vi antigen; a prophage region encoding the SopE type-III effector; a type-IV secretion system (pil-locus); and, the rep-cluster, with homology to a plasmid replication regions . SPI-7 was first discovered in Typhi but it has been also found in serovar Paratyphi C and some strains of serovar Dublin [36, 37]. Noteworthy, although a recent study showed SPI-7 to be present in the genome of diarizonae strain 1579 and absent in strain 639, no further details about the gene content of this particular SPI-7 fragment were provided .
To gain further insights into the organization of the diarizonae genomic region containing SPI-7 genes, a sequence homology analysis using SPI-7 genes from Typhi strain CT-18 was run with the same set of diarizonae genomes used in the analysis of SPI-1 to SPI-5 islands: 15 representative diarizonae strains from diverse sources along with SBO13 and SBO27 (Additional file 6: Table S5). The initial TBLASTN analysis using as queries SPI-7 proteins from Typhi strain CT-18 detected 66 of a total of 128 genes of the SPI-7 island in SBO13 and SBO27 with a coverage of ≥85%. Moreover, the SPI-7 genes found showed a high synteny compared to Typhi SPI-7 (Additional file 9: Figure S3, see below). Since a high proportion (22/66) of those TBLASTN-hits showed low-medium identity at the protein level (~ 40–50%), we decided to assess the TBLASTN analysis using an identity cut-off > 40%.
The SPI-7 gene distribution divided the 17 diarizonae strains analyzed in three main clades, A, B and C (Fig. 4). Clade A, which bears a significant portion of the sopE phage but lacking other virulence-related regions, included strains outside the subsp. diarizonae and a few diarizonae strains from diverse sources (Fig. 4). Clade B, composed of diarizonae strains from diverse sources, missed a large portion of SPI-7 while clade C, in which SBO13 and SBO27 were present, bear a significance portion of SPI-7 genes predicted to be related to virulence (pil-locus and tra-region) (Fig. 4). As expected, the Vi capsule polysaccharide-biosynthetic operon (viaB) was absent in all the diarizonae strains analysed (Fig. 4).
In the case of the Bolivian isolates SBO13 and SBO27, we found homology with several regions of serovar Typhi SPI-7: the rep cluster (homology to plasmid replication region), the pil locus, genes from the tra region involved in DNA transfer, and most genes from the STY4663-cutA3 region, including an ortholog of the Typhi integrase int-4680 (Fig. 4 and Additional file 9: Figure S3). The only region implicated in virulence found in this diarizonae SPI-7 fragment was therefore the type IVB pilus (pil) locus (Fig. 4). The type-IVB pilus system has been characterized in Typhi and Dublin as an important element in adhesion to and/or invasion of human epithelial cells [36, 38]. This type IVB pilus is also involved in increasing the inflammatory response in monocytes . Hence, the presence of the pil locus in diarizonae SBO13 and SBO27 could have contributed to their ability to infect humans. For some pil gene products, the level of amino acid identity of the SBO13 and SBO27 pili proteins was ≤50% when compared with the orthologous Typhi pili proteins. This observation opens the possibility of different acquisition events of the locus, at least for some sub-regions (Additional file 9: Figure S3). Interestingly, despite showing high sequence identity between diarizonae and Typhi, the STY4663-cutA3 cluster appears upstream of parB-ssb region in diarizonae instead of the 3′ terminal region as in Typhi (Additional file 9: Figure S3). It is also notorious the lack of genes coding for the Vi capsule polysaccharide (viaB operon) and, as mentioned above, the absence of most of the SopE-phage. This scenario is reminiscent of that seen in other pathogens harboring genomic islands that share synteny with SPI-7 such as Pseudomonas fluorescens, Ralstonia metallidurans or Yersinia enterocolitica . In all these cases, the SPI-7 viaB operon is either absent or replaced by other genes.
Bacteriophages and phage remnants are found in most of Salmonella genomes and play an important role in both inter-species and inter-strain variability [40, 41]. Four regions were identified in the chromosome of diarizonae SBO13 and SBO27 using the PHASTER tool (Fig. 2a, Additional file 10 : Table S7). PHASTER predicted the presence in these four regions of ‘incomplete’ set of genes, therefore likely corresponding to phage remnants. Two of these phage remnants -numbers 3, 4 in SBO13; and, 2, 3 in SBO27- have the same length and G + C content in the two strains, suggesting that they correspond to the same genetic element. The identified phage remnants displayed a mosaic of genes from related bacteriophages such as SEN34, Gifsy1, P4, and prophage elements from other bacterial genera. Among the genes mapping to these phage remnants, we found one encoding the virulence factor PagK. This protein and its homologues (PagJ and PagK2) belong to a set of virulence factors proposed to be translocated into the host cytoplasm of macrophages via bacterial outer membrane vesicles . It has been reported that Salmonella strains lacking these three homologues are attenuated for virulence in murine infections  (Additional file 10: Table S7). The precise role of these phage remnants, which encode a high number of hypothetical proteins, in the biology of SBO13 and SBO27 is yet to be explored.
Virulence factors in diarizonae SBO13 and SBO27
Effector proteins translocated into eukaryotic host cells by specialized type-III secretion systems (T3SS) contribute largely to Salmonella virulence [43,44,45]. The combined action of these effector proteins allows Salmonella to alter host cell functions enabling intracellular survival and replication within eukaryotic cells and transmission [44, 45]. Likewise, many studies have demonstrated a prominent role of diverse adhesins in promoting the interaction of Salmonella with host cells .
To further investigate virulence-related traits in diarizonae SBO13 and SBO27, we searched by TBLASTN the presence of type-III effector proteins and adhesin operons in the Virulence Factors DataBase (VFDB, http://www.mgc.ac.cn/VFs/) (Additional file 11: Table S8) using the set 15 diarizonae genomes previously described (Additional file 6: Table S5). This comparative analysis revealed that 12 out of 41 type III effectors were encoded by all the diarizonae genomes: SteC, SseJ, SseG, SseF, SptP, SopE, SopE2, SopB, SipA, SipB, SipD and AvrA (Fig. 5). Interestingly, 11 out of the 17 diarizonae strains examined, including SBO13 and SBO27, harbor genes encoding the glycosyltransferases SseK2 and SseK3, which impair proper immune response to infection via TNFα-stimulated NF-κB signaling . Noteworthy, 8 out of the 9 diarizonae strains associated with human infections were positive for sseK2 and sseK3 (Fig. 5). Moreover, an extra type III effector, SlrP, was found in some of the strains associated with human and animal infections (Fig. 5). SlrP has E3 ubiquitin ligase activity and has been shown to inhibit the release of IL-1β . A detailed inspection of the SlrP amino acid sequence across the diarizonae strains analyzed revealed that SlrP was present in some isolates as a truncated protein due to a frame-shift mutation (Fig. 5). As mentioned above, SrlP appears as the full-length version in the Bolivian strains SBO13 and SBO27 (Fig. 5). SlrP interacts with mammalian thioredoxin-1 leading to a decrease in thioredoxin activity and host cell death . Hence, the incorporation of SlrP together with the glycosyltransferases SseK2 and SseK3 might have facilitated infection in humans by the SBO13 and SBO27 isolates. Noteworthy, no selective acquisition or loss of fimbrial operons was observed for the diarizonae isolates examined. The only exception was the stdA and stdC fimbrial genes, which in the Bolivian isolates SBO13 and SBO27 encode proteins with a low coverage (lower than 50%) in comparison to other diarizonae strains (Fig. 5).
Presence of SBO13 and SBO27 virulence determinants in diarizonae draft genome assemblies available at EnteroBase
The genome analysis of diarizonae SBO13 and SBO27 revealed a defined set of distinctive features linked to virulence such as the SPI-7 pil-locus, the transport-related cluster of GI-6, and genes the T3SS effectors SlrP, SseK2 and SseK3. To get additional insights about the distribution of this set of genes in other diarizonae strains, the presence of these markers was assessed by TBLASTN in the 376 diarizonae draft genome assemblies available at EnteroBase (https://enterobase.warwick.ac.uk/) (Additional file 12: Table S9 and Additional file 13: Table S10). These 376 genomes comprised those ST with a minimum of 5 isolates. The SPI-7-related pil-locus was found in 44.8% of the diarizonae genomes whereas the GI-6 cluster was found only in 29.6% of them (Additional file 14: Table S11). Therefore, these genome regions are present in less than half of the diarizonae genomes analyzed. The genes encoding the T3SS effectors SlrP, SseK2 and SseK3 were found in 80.5, 46.9 and 45.9% of the 376 diarizonae genomes, respectively (query coverage ≥85%; Additional file 14: Table S11). To test the possibility that these virulence factors -SlrP, SseK2, SseK3, GI-6 and SPI-7- could be involved in facilitating human infection, we analyzed their distribution in diarizonae STs statistically associated with humans (Fisher exact test p-value < 0.05) (Additional file 13: Table S10). Among these virulence-related functions, SlrP, GI-6 and SPI-7 were not only over-represented in STs statistically associated with the human host, but also in isolates collected from human infections regardless of whether their respective STs were associated or not to human host (Additional file 14: Table S11). Even more, in five STs: 1845, 3996, 4086, 876 and 1256 (SBO13 and SBO27), recovered mainly-from human infections (Fisher exact test p-value < 0.05) (Additional file: 13: Table S10), these virulence determinants were presents at least in the ≥75% of the isolates (Additional file 13: Table S10). Altogether, these data suggest that this repertoire of virulence factors could contribute in subspecies diarizonae to increase the potential to cause infection in humans.
This study provides insights into the genome structure of two diarizonae strains, SBO13 and SBO27, which were isolated in Bolivia from invasive infections in humans. Pan-genome phylogeny clustered these two Bolivian isolates with diarizonae strains of same genotype, ST1256, with no obvious differences with other diarizonae ST comprising mainly-human, animal or environmental isolates. However, the subsequent study of virulence-related functions encoded by the SBO13 and SBO27 genomes pointed to a unique gene repertoire that may facilitate infection of the human host. The statistical analyses revealed that, as it occurs in SBO13 and SBO27 strains, a higher percentage of diarizonae human isolates harbor genes encoding the type III effectors SseK2, SseK3 and SlrP; the SPI-7 pil-locus, and predicted membrane transport-related genes mapping in GI-6.
Isolation and characterization of S. enterica subsp. diarizonae strains SBO13 and SBO27
The strains were isolated from two human infection cases and sent to the National Centre for Tropical Diseases (CENETROP, Santa Cruz, Bolivia). Strain SBO27 was obtained from cerebrospinal fluid from a 36 h-old neonate who died in 2008, and strain SBO13 was isolated from endocervical tissue collected post-abortion at ten-weeks of gestation from a 23-year old pregnant female in 2014. The isolates were identified as S. enterica by standard biochemical tests. The antigenic formula was evaluated by serology and by PCR, and plasmid profiles were visualized as previously described . Antimicrobial susceptibility tests were performed by the Kirby-Bauer disk diffusion method and the breakpoints were considered according to the Clinical Laboratory Standards Institute .
For strains SBO13 and SBO27 strains, the initial multi-locus sequence typing (MLST) was performed by PCR and sequencing as described elsewhere , and confirmed after genome analysis using a web tool available at https://cge.cbs.dtu.dk/services/MLST/.
Genome sequencing and annotation
Purified genomic DNA was commercially sequenced at the Genomics Core of Arizona State University on an Illumina MiSeq instrument (~ 550 bp PE libraries; 2 × 300 sequencing cycles). Genome assembly and annotation were performed as previously described . Briefly, the reads were processed with trimmomatic  to remove adaptors and trim poor-quality bases before de-novo assembly using spades 3.11.0 . Scaffolds were ordered using multiple complete diarizonae genomes as references with the aid of MeDuSa v1.6 . The resulting assemblies were subjected to a final polishing step by remapping reads with BWA (Li and Durbin, 2009) and using Pilon v1.22  to correct the sequences and gap filling/closure. In-house annotation was performed with Prokka  and a local S. enterica-specific database of selected reference sequences (Prokka annotation data are public and available at https://figshare.com/s/38a6ebb54e78e5885b31). Sequences released to the public domain were annotated with the NCBI prokaryotic genome automatic pipeline during the submission process. S. enterica subsp. diarizonae genome assemblies downloaded from EnteroBase as FASTA-formatted contigs were likewise annotated with Prokka.
Comparative genome analyses
The genomes were aligned with MAUVE aligner version 2.4.0 using the progressive algorithm with default settings . Genome comparison was undertaken using BLAST Ring Image Generator (BRIG) . Salmonella pathogenicity islands (SPIs), T3SS effectors and GI-6 transport related gene cluster were analyzed using TBLASTN. Putative island prediction was conducted using IslandViewer4 . The genomic context of SPI-7 genes was drawn and analysed using EASYfig software . Prophages sequences were predicted using the PHASTER server, a new version of PHAST . Antibiotic resistance genes were searched using the Comprehensive Antibiotic Resistance Database (https://ardb.cbcb.umd.edu/blast/)  and Resfinder . The annotation of clusters of orthologous groups (COGs) was generated using eggNOGmapper . For Cluster analysis and visualization, we used the Python visualization library, Seaborn . Clustering analysis was performed using the complete linkage method.
Core- and pan-genome phylogenies were estimated under the maximum-likelihood (ML) optimality criterion using the GET_HOMOLOGUES  and GET_PHYLOMARKERS  software suites. Briefly, get_homologues.pl was used in combination with compare_clusters.pl to compute a consensus core-genome, resulting from the clustering of the all-against-all BLASTP results (with query coverage ≥90%; -C 90) with the BDBH, COGtriangles and OMCL algorithms implemented in GET_HOMOLOGUES, as detailed elsewhere . A consensus pan-genome was computed in a similar manner from the COGtriangles and OMCL clusters. The consensus core-genome clusters were then fed into the GET_PHYLOMARKERS pipeline to select alignments with optimal phylogenetic attributes, namely those without significant evidence for recombination, producing tree topologies and branch-lengths not significantly deviating from the expected distribution of these parameters under the multispecies coalescent, and displaying average branch support values > 0.6 (see  for the details). The alignments passing these filters were concatenated and a ML phylogeny estimated with IQ-TREE 1.6.1  using the best fitting model and selecting the phylogeny with the highest likelihood score from those found among independent searches. The ML pan-genome phylogeny was estimated from the pan-genome (presence-absence) matrix computed by compare_clusters.pl from the GET_HOMOLOGUES suite with the aid of the estimate_pangenome_phylogenies.sh script from the GET_PHYLOMARKERS package, which calls IQ-TREE to fit the binary model to the data, and performing ten independent searches under the best-fit model. Phylogenetic trees were displayed and edited with FigTree v1.4.3.
Comparative analyses in EnteroBase
The EnteroBase database (http://enterobase.warwick.ac.uk/species/index/senterica) was examined to determine the sequence type (STs) assignment for the strains (besides SBO13 and SBO27) included in this study and to analyze draft genomes of other diarizonae strains.
As of September 19, 2018; 757 diarizonae entries were found in EnteroBase. From these, 551 draft genome assemblies were available and downloaded as FASTA-formatted contigs. For the phylogenetic analysis, a sample of 76 diarizonae genomes were selected (Additional file 4: Table S3 and Additional file 5: Table S4) based on two main criteria. First, different source and geographic location of the isolation, and secondly, a representative sample roughly poportional to the number of isolates with a particular ST. We only considered those diarizonae genomes with STs represented in EnteroBase by more than 5 isolates. These STs were further differentiated in three classes, having 5–10, 11–20 and > 21 isolates. We further selected 2, 3 and 5 representative strains for each ST falling in each of these three classes, which conformed the 76 diarizonae genomes indicated above (Additional file 4: Table S3 and Additional file 5: Table S4). A subsample of 15 diarizonae strains was selected from the pan-genomic phylogenetic tree (Fig. 1 and Additional file 6: Table S5) and used for gene content comparative analyses (Figs. 3, 4 and 5).
For statistical association analyses, only STs with ≥5 isolates were considered from the sample of 376 genomes. These draft assemblies (FASTA contigs) were formatted with makeblastdb. The local genome databases were searched for the presence of the serovar Typhimurium strain SL1344 genes slrP, sseK2 and sseK3 using TBLASTN. Homologous proteins encoded by genes mapping in SPI-7 and GI-6 from strain SBO13 were searched with TBLASTN. Only hits with ≥90% query coverage and ≥ 70% of identity were considered as bonafide homologues (except for SPI-7, where ≥40% identity was used as cut-off for pil-locus genes). Fisher exact test was used to identify STs statistically associated with human host (p-value < 0.05).
Cluster of orthologous group
Multi-locus sequence typing
Salmonella pathogenicity island
Type-III secretion system
Winfield MD, Groisman EA. Role of nonhost environments in the lifestyles of Salmonella and Escherichia coli. Appl Environ Microbiol. 2003;69(7):3687–94.
Issenhuth-Jeanjean S, Roggentin P, Mikoleit M, Guibourdenche M, de Pinna E, Nair S, Fields PI, Weill FX. Supplement 2008-2010 (no. 48) to the white-Kauffmann-Le minor scheme. Res Microbiol. 2014;165(7):526–30.
Baumler A, Fang FC. Host specificity of bacterial pathogens. Cold Spring Harb Perspect Med. 2013;3(12):a010041.
Keestra-Gounder AM, Tsolis RM, Baumler AJ. Now you see me, now you don't: the interaction of Salmonella with innate immune receptors. Nat Rev Microbiol. 2015;13(4):206–16.
Hendriksen RS, Vieira AR, Karlsmose S, Lo Fo Wong DM, Jensen AB, Wegener HC, Aarestrup FM: Global monitoring of Salmonella serovar distribution from the World Health Organization global foodborne infections network country data Bank: results of quality assured laboratories from 2001 to 2007. Foodborne Pathog Dis 2011, 8(8):887–900.
European Food Safety Authority. The European Union summary report on trends and sources of zoonoses, zoonotic agents and food-borne outbreaks in 2016. EFSA J. 2017;15(12):5077. https://doi.org/10.2903/j.efsa.2017.5077.
Lamas A, Miranda JM, Regal P, Vazquez B, Franco CM, Cepeda A. A comprehensive review of non-enterica subspecies of Salmonella enterica. Microbiol Res. 2018;206:60–73.
Hoelzer K, Moreno Switt AI, Wiedmann M. Animal contact as a source of human non-typhoidal salmonellosis. Vet Res. 2011;42:34.
Whiley H, Gardner MG, Ross K. A review of Salmonella and Squamates (lizards, snakes and Amphisbians): implications for public health. Pathogens. 2017;6(3).
Schroter M, Roggentin P, Hofmann J, Speicher A, Laufs R, Mack D. Pet snakes as a reservoir for Salmonella enterica subsp. diarizonae (serogroup IIIb): a prospective study. Appl Environ Microbiol. 2004;70(1):613–5.
Davies RH, Evans SJ, Preece BE, Chappell S, Kidd S, Jones YE. Increase in Salmonella enterica subspecies diarizonae serovar 61:k:1,5,(7) in sheep. Vet Rec. 2001;149(18):555–7.
Alvseike O, Skjerve E. Prevalence of a Salmonella subspecies diarizonae in Norwegian sheep herds. Prev Vet Med. 2002;52(3–4):277–85.
Alvseike O, Vardund T, Lindstedt B, Heir E, Eriksson E, Kapperud G. Molecular epidemiology and population genetics of Salmonella subspecies diarizonae in sheep in Norway and Sweden. Epidemiol Infect. 2004;132(2):253–61.
Stokar-Regenscheit N, Overesch G, Giezendanner R, Roos S, Gurtner C. Salmonella enterica subsp. diarizonae serotype 61:k:1,5,(7) associated with chronic proliferative rhinitis and high nasal colonization rates in a flock of Texel sheep in Switzerland. Prev Vet Med. 2017;145:78–82.
Hervas JA, Rosell A, Hervas D, Rubio R, Duenas J, Mena A. Reptile pets-associated Salmonella enterica subspecies diarizonae gastroenteritis in a neonate. Pediatr Infect Dis J. 2012;31(10):1102–3.
Soren K, Lindblad M, Jernberg C, Eriksson E, Melin L, Wahlstrom H, Lundh M. Changes in the risk management of Salmonella enterica subspecies diarizonae serovar 61: (k):1, 5, (7) in Swedish sheep herds and sheep meat due to the results of a prevalence study 2012. Acta Vet Scand. 2015;57:6.
Horvath L, Kraft M, Fostiropoulos K, Falkowski A, Tarr PE. Salmonella enterica subspecies diarizonae maxillary sinusitis in a Snake handler: first report. Open Forum Infect Dis. 2016;3(2):ofw066.
Gerlach RG, Walter S, McClelland M, Schmidt C, Steglich M, Prager R, Bender JK, Fuchs S, Schoerner C, Rabsch W, et al. Comparative whole genome analysis of three consecutive Salmonella diarizonae isolates. Int J Med Microbiol. 2017;307(8):542–51.
Achtman M, Wain J, Weill FX, Nair S, Zhou Z, Sangal V, Krauland MG, Hale JL, Harbottle H, Uesbeck A, et al. Multilocus sequence typing as a replacement for serotyping in Salmonella enterica. PLoS Pathog. 2012;8(6):e1002776.
Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9.
Aziz RK, Bartels D, Best AA, De Jongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M. The RAST server: rapid annotations using subsystems technology. BMC Genomics. 2008;9(1):75.
Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M. The SEED and the rapid annotation of microbial genomes using subsystems technology (RAST). Nucleic Acids Res. 2013;42(D1):D206–14.
Green ER, Mecsas J. Bacterial secretion systems–an overview. Microbiology spectrum. 2016;4(1). https://doi.org/10.1128/microbiolspec.VMBF-0012-2015.
Vinuesa P, Ochoa L, Contreras-Moreira B. GET_PHYLOMARKERS, a software package to select optimal orthologous clusters for phylogenomics and inferring pan-genome phylogenies, used for a critical geno-taxonomic revision of the genus Stenotrophomonas. Front Microbiol. 2018.
Contreras-Moreira B, Vinuesa P. GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis. Appl Environ Microbiol. 2013;79(24):7696–701.
Kaufman MR, Shaw CE, Jones ID, Taylor RK. Biogenesis and regulation of the Vibrio cholerae toxin-coregulated pilus: analogies to other virulence factor secretory systems. Gene. 1993;126(1):43–9.
Kortman GA, Boleij A, Swinkels DW, Tjalsma H. Iron availability increases the pathogenic potential of Salmonella typhimurium and other enteric pathogens at the intestinal epithelial interface. PLoS One. 2012;7(1):e29968.
Chong Y, Kwon OH, Lee SY, Chung KS, Shimada T. Salmonella enterica subspecies diarizonae bacteremia in an infant with enteritis--a case report. Yonsei Med J. 1991;32(3):275–8.
Fermaglich LJ, Routes JM, Lye PS, Kehl SC, Havens PL. Salmonella cervical lymphadenitis in an immunocompetent child exposed to a Snake at an educational exhibit: a case report. Infect Dis Clin Pract. 2012;20(4):289–90.
Desai PT, Porwollik S, Long F, Cheng P, Wollam A, Clifton SW, Weinstock GM, McClelland M. Evolutionary genomics of Salmonella enterica subspecies. MBio. 2013;4(2):e00579–12.
Sana TG, Flaugnatti N, Lugo KA, Lam LH, Jacobson A, Baylot V, Durand E, Journet L, Cascales E, Monack DM. Salmonella typhimurium utilizes a T6SS-mediated antibacterial weapon to establish in the host gut. Proc Natl Acad Sci U S A. 2016;113(34):E5044–51.
Tomljenovic-Berube AM, Henriksbo B, Porwollik S, Cooper CA, Tuinema BR, McClelland M, Coombes BK. Mapping and regulation of genes within Salmonella pathogenicity island 12 that contribute to in vivo fitness of Salmonella enterica Serovar typhimurium. Infect Immun. 2013;81(7):2394–404.
Hensel M. Evolution of pathogenicity islands of Salmonella enterica. Int J Med Microbiol. 2004;294(2–3):95–102.
Pickard D, Wain J, Baker S, Line A, Chohan S, Fookes M, Barron A, Gaora PÓ, Chabalgoity JA, Thanky N. Composition, acquisition, and distribution of the vi exopolysaccharide-encoding Salmonella enterica pathogenicity island SPI-7. J Bacteriol. 2003;185(17):5055–65.
Seth-Smith HM. SPI-7: Salmonella’s vi-encoding pathogenicity island. J Infec Dev Ctries. 2008;2(04):267–71.
Morris C, Tam CK, Wallis TS, Jones PW, Hackett J. Salmonella enterica serovar Dublin strains which are vi antigen-positive use type IVB pili for bacterial self-association and human intestinal cell entry. Microb Pathog. 2003;35(6):279–84.
Liu W-Q, Liu G-R, Li J-Q, Xu G-M, Qi D, He X-Y, Deng J, Zhang F-M, Johnston RN, Liu S-L. Diverse genome structures of Salmonella paratyphi C. BMC Genomics. 2007;8(1):290.
Zhang X-L, Tsui IS, Yip CM, Fung AW, Wong DK-H, Dai X, Yang Y, Hackett J, Morris C. Salmonella enterica serovar Typhi uses type IVB pili to enter human intestinal epithelial cells. Infect Immun. 2000;68(6):3067–73.
Wang F, Zhang X-L, Zhou Y, Ye L, Qi Z, Wu J: Type IVB piliated Salmonella typhi enhance IL-6 and NF-κB production in human monocytic THP-1 cells through activation of protein kinase C. Immunobiology 2005, 210(5):283–293.
Ehrbar K, Hardt WD. Bacteriophage-encoded type III effectors in Salmonella enterica subspecies 1 serovar typhimurium. Infect Genet Evol. 2005;5(1):1–9.
Hiley L, Fang NX, Micalizzi GR, Bates J. Distribution of Gifsy-3 and of variants of ST64B and Gifsy-1 prophages amongst Salmonella enterica Serovar typhimurium isolates: evidence that combinations of prophages promote clonality. PLoS One. 2014;9(1):e86203.
Yoon H, Ansong C, Adkins JN, Heffron F. Discovery of Salmonella virulence factors translocated via outer membrane vesicles to murine macrophages. Infect Immun. 2011;79(6):2182–92.
Jennings E, Thurston TL, Holden DW. Salmonella SPI-2 type III secretion system effectors: molecular mechanisms and physiological consequences. Cell Host Microbe. 2017;22(2):217–31.
Galan JE, Lara-Tejero M, Marlovits TC, Wagner S. Bacterial type III secretion systems: specialized nanomachines for protein delivery into target cells. Annu Rev Microbiol. 2014;68:415–38.
Van Der Heijden J, Finlay BB. Type III effector-mediated processes in Salmonella infection. Future Microbiol. 2012;7(6):685–703.
Wagner C, Hensel M. Adhesive mechanisms of Salmonella enterica. Adv Exp Med Biol. 2011;715:17–34.
Bernal-Bayard J, Ramos-Morales F. Salmonella type III secretion effector SlrP is an E3 ubiquitin ligase for mammalian thioredoxin. J Biol Chem. 2009;284(40):27587–95.
Iriarte A, Giner-Lamia J, Silva C, Betancor L, Astocondor L, Cestero JJ, Ochoa T, Garcia C, Puente JL, Chabalgoity JA, et al. Draft genome sequence of Salmonella enterica subsp. enterica Serovar Infantis strain SPE101, isolated from a chronic human infection. Genome Announc. 2017;5(29).
CLSI: Performance Standards for Antimicrobial Susceptibility Testing. 26th Edition: Clinical and Laboratory Standards Institute (CLSI), Wayne, PA.; 2016. ISBN 1-56238-923-8, ISSN 1558-6502.
Vinuesa P, Puente JL, Calva E, Zaidi MB, Silva C. Complete genome sequence of Salmonella enterica Serovar typhimurium strain SO3 (sequence type 302) isolated from a baby with meningitis in Mexico. Genome Announc. 2016;4(2).
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19(5):455–77.
Bosi E, Donati B, Galardini M, Brunetti S, Sagot MF, Lio P, Crescenzi P, Fani R, Fondi M. MeDuSa: a multi-draft based scaffolder. Bioinformatics. 2015;31(15):2443–51.
Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9(11):e112963.
Darling AC, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14(7):1394–403.
Alikhan N-F, Petty NK, Zakour NLB, Beatson SA. BLAST ring image generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011;12(1):402.
Bertelli C, Laird MR, Williams KP, Group SFURC, Lau BY, Hoad G, Winsor GL, Brinkman FS. IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets. Nucleic Acids Res. 2017;45(W1):W30–5.
Sullivan MJ, Petty NK, Beatson SA. Easyfig: a genome comparison visualizer. Bioinformatics. 2011;27(7):1009–10.
Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, Wishart DS. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016;44(W1):W16–21.
Jia B, Raphenya AR, Alcock B, Waglechner N, Guo P, Tsang KK, Lago BA, Dave BM, Pereira S, Sharma AN, et al. CARD 2017: expansion and model-centric curation of the comprehensive antibiotic resistance database. Nucleic Acids Res. 2017;45(D1):D566–73.
Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O, Aarestrup FM, Larsen MV. Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother. 2012;67(11):2640–4.
Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Juhl Jensen L, von Mering C, Bork P. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol Biol Evol. 2017;34(8):2115–22.
Waskom M, Botvinnik O, O'Kane D, Hobson P, Lukauskas S, Gemperline DC, Augspurger T, Halchenko Y, Cole JB, Warmenhoven J et al. mwaskom/seaborn: v0.8.1 (September 2017) (Version v0.8.1). Zenodo. doi:https://doi.org/10.5281/zenodo.883859
Vinuesa P, Contreras-Moreira B. Robust identification of orthologues and paralogues for microbial pan-genomics using GET_HOMOLOGUES: a case study of pIncA/C plasmids. Methods Mol Biol. 2015;1231:203–32.
Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32(1):268–74.
We are grateful to the members of the Salmoiber CYTED Network “Control de la salmonelosis en Iberoamérica” (215RT-0493): Fernando C. Soncini, Eleonora García-Vescovi (Universidad de Rosario-CONICET, Argentina); Griselda Flores, José Pedraza (Universidad Autónoma Gabriel René Moreno, Santa Cruz, Bolivia); Lucia Yim (Universidad de la República, Montevideo, Uruguay); Coralith García, Lizeth Astocondor, Theresa Ochoa, Noemí Hinostroza (Instituto de Medicina Tropical Alexander von Humboldt-Universidad Peruana Cayetano Heredia, Lima, Perú); and, M. Graciela Pucciarelli (Universidad Autónoma de Madrid, Centro de Biología Molecular Severo Ochoa –CBMSO-, Madrid, Spain). Alfredo Hernández-Alvarez and Victor del Moral from the Centro de Ciencias Genómicas, UNAM, are gratefully acknowledged for help with server system administration. We acknowledge support of the publication fee by the CSIC Open Access Publication Support Initiative through its Unit of Information Resources for Research (URICI).
This work was supported by grants 215RT-0493 from Programa Iberoamericano de Ciencia y Tecnología para el Desarrollo (CYTED), BIO2016–77639-P (AEI/FEDER, UE) and PCIN-2016-082, from Spanish Ministry of Economy and Competitiveness and European Regional Development Funds (FEDER), awarded to FGdP; and, grants DGAPA IN213516, CONACyT 239659 and FC-2015-2/950, awarded to JLP. PV gratefully acknowledges funding grants DGAPA-PAPIIT IN211814 and IN206318 and CONACyT FC-2015-2/879. Other supporting grants were from Comisión Sectorial de Investigación Científica (CSIC), Universidad de la República (Proyecto Grupos I + D 2014), awarded to LB. CS was supported by CONACyT FC-2015-2/879. The funding bodies had no role in in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Availability of data and materials
Draft genome projects have been deposited at DDBJ/ENA/GenBank under the accession numbers PVNR00000000 (BioProject PRJNA436348; BioSample SAMN08623074) and PVNQ00000000 (BioProject PRJNA436348; BioSample SAMN08623075) for S. enterica subsp. diarizonae strains SBO13 and SBO27, respectively, under BioProject PRJNA436348. The versions described in this paper are the first versions, PVNR01000000 and PVNQ01000000.
Ethics approval and consent to participate
Human endocervical tissue and cerebrospinal fluid in which S. enterica subsp. diarizonae strains were isolated, were sent from Hospital ‘Materno-Infantil’, Santa Cruz, Bolivia, for microbiological analysis to the Centro Nacional de Enfermedades Tropicales (CENETROP) in the same city of Santa Cruz. These samples were accompanied by the mandatory medical orders for diagnosis. Patients signed a written document giving consent to the Hospital ‘Materno-Infantil’ for medical procedures required for treatment, including tissue sample collection and their use for analysis and research.
Consent for publication
The authors declare they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This manuscript is dedicated to the memory of Griselda Flores, member of the SalmoIber CYTED Network, who unexpectedly passed away last October 10th, 2018, in the city of Santa Cruz, Bolivia
Table S1. Genes present in the plasmids of S. enterica subsp. diarizonae strains SBO13 and SBO27. (XLSK 11 kb)
Table S2. Genes related to antibiotic resistance found in S. enterica subsp. diarizonae strains SBO13 and SBO27. (XLXS 18 kb)
Figure S1. Subsystem category distribution of the S. diarizonae strain SBO13 genome. The genome of diarizonae SBO13 was annotated using the Rapid Annotation System technology (RAST) server and classified in 27 categories and 571 subsystems. Total proteins annotated with a putative function covered a 56% of the subsystems (green bar). The pie chart represents the percentage distribution of the subsystems categories. (PDF 377 kb)
Table S3. S. enterica subsp. diarizonae genomes available in Enterobase in a accesible status (XLSX 162 kb)
Table S4. S. enterica subsp. diarizonae draft genomes used in the phylogenomic analyses (XLSX 30 kb)
Table S5. S. enterica subsp. diarizonae genomes of strains isolated from diverse sources that were further analysed in this study. (XLSX 12 kb)
Figure S2. Maximum likelihood core-genome phylogeny (species tree) for 76 Salmonella spp. strains. The core-genome was inferred from the 303 top-scoring markers selected of the 1650 consensus clusters computed by GET_HOMOLOGUES. The bar represents the expected number of substitutions per site under the best-fitting GTR + F + ASC + R3 model. Internal nodes are colored by the combined approximate Bayesian support / ultra-fast bootstrap support values, respectively, as indicated on the legend. The inset shows the distribution of Robinson-Foulds gene tree distances to the species-tree. (PDF 1150 kb)
Table S6. Genomic islands (GI) found in S. enterica subsp. diarizonae SBO13 (XLSX 43 kb)
Figure S3. Synteny analysis of the SPI-7 island between Typhi CT18 and S. diarizonae SBO13 and SBO27. (A) Comparison of the entire island; (B) detail of the rep, pil and tra loci. The vertical grey bars represent BLAST identity of homologous regions (minimal identity for matches 40%). The gradient of the grey colour bars represents BLAST identity (%). (PDF 434 kb)
Table S7. Phophages carried by the S. enterica subsp. diarizonae SBO13 and SBO27 (XLSX 10 kb)
Table S8. Virulence factors proteins encoded by S. enterica subsp. diarizonae genomes and representative genomes from other Salmonella species and subspecies (XLSX 19 kb)
Table S9. Presence/absence of slrP, sseK2, sseK3, GI6 and SPI-7 in S. enterica subsp. diarizonae draft genomes available at EnteroBase. (XLSX 39 kb)
Table S10. S. enterica subsp. diarizonae isolates statistically associated with human infections (% versus total). (XLSX 13 kb)
Table S11. Virulence determinants distribution along S. enterica subsp. diarizonae ST associated and non-associated with human host (% vs total). (XLSX 10 kb)