- Open Access
A database for the taxonomic and phylogenetic identification of the genus Bradyrhizobium using multilocus sequence analysis
© Azevedo et al.; licensee BioMed Central Ltd. 2015
- Published: 26 May 2015
Biological nitrogen fixation, with an emphasis on the legume-rhizobia symbiosis, is a key process for agriculture and the environment, allowing the replacement of nitrogen fertilizers, reducing water pollution by nitrate as well as emission of greenhouse gases. Soils contain numerous strains belonging to the bacterial genus Bradyrhizobium, which establish symbioses with a variety of legumes. However, due to the high conservation of Bradyrhizobium 16S rRNA genes - considered as the backbone of the taxonomy of prokaryotes - few species have been delineated. The multilocus sequence analysis (MLSA) methodology, which includes analysis of housekeeping genes, has been shown to be promising and powerful for defining bacterial species, and, in this study, it was applied to Bradyrhizobium, species, increasing our understanding of the diversity of nitrogen-fixing bacteria.
Classification of bacteria of agronomic importance is relevant to biodiversity, as well as to biotechnological manipulation to improve agricultural productivity. We propose the construction of an online database that will provide information and tools using MLSA to improve phylogenetic and taxonomic characterization of Bradyrhizobium, allowing the comparison of genomic sequences with those of type and representative strains of each species.
A database for the taxonomic and phylogenetic identification of the Bradyrhizobium, genus, using MLSA, will facilitate the use of biological data available through an intuitive web interface. Sequences stored in the on-line database can be compared with multiple sequences of other strains with simplicity and agility through multiple alignment algorithms and computational routines integrated into the database. The proposed database and software tools are available at http://mlsa.cnpso.embrapa.br, and can be used, free of charge, by researchers worldwide to classify Bradyrhizobium, strains; the database and software can be applied to replicate the experiments presented in this study as well as to generate new experiments. The next step will be expansion of the database to include other rhizobial species.
- Bradyrhizobium database
- Taxonomic of Prokaryotes
- Phylogeny of Prokaryotes
- Multilocus Sequence Analysis
- 16S rRNA Gene
- Pattern Recognition
Taxonomy of prokaryotes is gaining increasing attention duo to both the valoration of biodiversity and the recognition of the economic value of many microorganisms. Phylogenetic studies are also key for determining the exact taxonomic position of organisms, as well as to determine their evolutionary history, indicating their relations with other groups and their places in families and kingdoms.
Bacterial phylogeny is based mainly on sequence data of biological macro-molecules; highly conserved molecules help to compare distantly related organisms, whereas molecules that change rapidly help to elucidate small and recent changes . The 16S rRNA gene is broadly elected as the backbone of prokaryote taxonomy and phylogeny  and repositories of both 16S rRNA genes and other biological data are increasing every day, generating large datasets ; efficient organization of this information is critical to scientific progress.
The term "rhizobia" applies to soil-borne bacteria that are capable of fixing atmospheric nitrogen N2 in symbioses with, and for the benefit of, plants, the vast majority of which are legumes. Yearly, billions of dollars are saved worldwide thanks to the action of rhizobia, in crops that otherwise would require application of nitrogen fertilizers to achieve optimal yields. However, despite their importance to the agriculture and to the environment, studies on phylogeny and taxonomy of rhizobia are relatively scarce, including in some countries where genetic diversity is high, such as Brazil . The genus Bradyrhizobium, used in this study, is currently composed of 19 species recognized by the International Committee of Taxonomy; it has been suggested to be the ancestor of all rhizobia, having originated in the tropics e.g. [5–8]. The genus includes important strains, such as those known to contribute superior rates of N2 fixation to grain crops such as soybean (Glycine max (L.) Merr.) . However, one main limitation in taxonomy and phylogeny studies of Bradyrhizobium is that its 16S rRNA gene is highly conserved, making it difficult to capture the diversity observed in other phenotypic and genotypic analyses and to define and delineate species [4, 10–13]. Therefore, one interesting approach has been to use the multilocus sequencing analysis (MLSA) methodology, including the analysis of housekeeping genes which is conserved but with a higher rate of evolution, to more precisely detect diversity within the genus Bradyrhizobium [8, 12, 14].
Some technologies have been developed in order to improve the identification process of biological entities, such as PseudoMLSA Database  and EZTaxon . The former has a model similar to that proposed in our study, including the possibility of performing similarity searches using Blast , phylogenetic inference by CLUSTAL Omega  and PHYLIP  for Pseudomonas species. With EZTaxon  it is possible to identify all types of prokaryotes, using an information database along with 16S rRNA gene sequences. By contrast, our study provides a new database with the combination of different software tools for multiple sequence alignments and techniques for automatic pre-processing and post-processing the genomic sequences that are necessary for carrying out the MLSA, and, hence, identify biological entities.
The database for taxonomic identification and phylogenetics of the genus Bradyrhizobium through MLSA described in our study represents a repository for genomic sequences of Bradyrhizobium species. The main objective is to be an online database, open sourced with helpful information and tools in order to elucidate the taxonomy and phylogenetic analysis of these organisms. The current version of the database represents a selection of genes assigned to the genus Bradyrhizobium that are commonly used and are validated, and were updated through June 2014. The web interface developed for this system enables users to perform analyses of similarity of their datasets, as well as to make queries and downloads in the stored genomic sequences.
The need for a more informative database of species of rhizobia with useful genes for applying the MLSA methodology results from the fact that currently generated sequences for identification and rating of these organisms are scattered across various databases, and gathering this information is a time-consuming process. We started the procedure with the genus Bradyrhizobium - i.e. the most difficult in terms of rhizobial taxonomy - due to its highly conserved 16S rRNA gene sequence [9–14] and due to interest in its evolution since it is considered as the ancestor of all rhizobia [5–9]. In due course, the database will be expanded to include other rhizobial species.
Current Taxonomic Analysis
Taxonomic consensus is best achieved when different types of data and information (phenotypic, genotypic, phylogenetic) are combined. This integrated model of information is called polyphasic taxonomy, and a bacterial species is defined as a group of genomically alike strains that share a high degree of similarity in several independent features . The phenotypic data are obtained through studies involving gene expression, protein analysis and function, chemotaxonomic markers, and other characteristics that correspond to the final expression of genes [21–23]. For genotyping studies, the information is obtained from both DNA and RNA. Various methodologies can be cited for this purpose, including G+C mol% of DNA; DNA-DNA hybridization (DDH); restriction-fragment-length polymorphism (RFLP); pulsed-field gel electrophoresis (PFGE); gene sequencing; and PCR-fingerprinting . The DDH method is based on physico-chemical properties of the DNA and has been required for the definition of most prokaryote species. However, DDH has several limitations, including low reproducibility among laboratories, high labour demand, cost and time consumption due to the need for hybridization of a large number of strains [23, 25]. Furthermore, there is no database that allows the comparison of results from different studies .
Comparisons of the ribosomal 16S rRNA gene represent the basis of modern taxonomic analysis; important databases comprise 16S rRNA genes, such as the ribosomal database project at https://rdp.cme.msu.edu. However, a limitation is the high degree of nucleotide-sequence conservation in this gene across genera-including Bradyrhizobium-makiiig it difficult to distinguish closely related species [24, 27–32]. Consequently, it is important to develop new techniques that can complement the results obtained from 16S rRNA gene-sequence data, as well as replace DDH for taxonomic purposes. It is also important to establish databases that facilitate analyses of new strains.
Multilocus Sequence Analysis (MLSA)
Identifying organisms as prokaryotic and the delineation of species are the main foci of the taxonomy of microorganisms . Thus, although the levels of identity-obtained in the analysis of the sequences of the 16S rRNA gene and of DDH are still considered as molecular criteria for classification of species, it is expected that additional taxonomic information can be obtained from complete genome sequences , and MLSA has been increasingly suggested as a replacement for DDH [9, 35, 36].
MLSA represents a strategic alternative to avoid the effects of genetic recombination and horizontal transfer occurring in a specific single gene [33, 35]. In addition, it can clarify the distinction between highly related species, or species where the analysis of the 16S rRNA genes shows low resolution, since the chosen housekeeping genes-comprising genes involved in cellular metabolism, i.e. those essential for the survival of the microorganism -present faster evolutionary rates than do the ribosomal genes, but with a level of conservation sufficient to reveal evolutionary information [21, 24, 25, 27, 36]. The choice of housekeeping genes should follow certain criteria, including: i) presence in the genome in a single copy; ii) being distributed in the genome with a minimal distance between the genes of 100 kb; iii) containing sufficient nucleotide length to allow its sequencing; iv) containing sufficient information for its analysis [13, 25, 27, 36–38].
The MLSA methodology has been increasingly used to improve bacterial taxonomy, providing a tool suitable for defining species and revealing their taxonomic relationships. Several studies have shown that MLSA may provide high resolution, allowing the discrimination of isolates at the species level [14, 25, 36, 38–41], which would not be possible by analysis exclusively by 16S rRNA-gene sequencing [12, 33, 35]. The distinction at the species level is achieved by MLSA analysis through algorithms for estimating evolutionary distance between strains. In the particular case of rhizobia, housekeeping genes used in recent years as phylogenetic markers for the species classification include atpD, recA, glnA, glnB, dnaK, thrC and git A . However, taking into account the large number of microorganisms that remain to be identified and classified, and the improvement of microbiology data generation, there is need for the development of new databases and software tools for their analysis [33, 35].
The computational infrastructure used to provide the set of services described in this work is hosted at the National Soybean Research Center of the Brazilian Agricultural Research Corporation (Embrapa Soja). All applications and tools required for the operation of the database were configured for the platform Linux Ubuntu Server 4.13 with Apache 2.4.7, the MySQL database-management system, and the phpAdmin 4.2.2 data-modelling tool.
The relational model of the proposed database follows the scheme proposed by the BioSQL project , considering that it is a standard solution for storing sequences of molecular modelling, and it has compatibility with other bioinformatics projects such as BioPerl, BioPython, BioJava and BioRuby. The database was developed by considering the same data structure used in GenBank . Therefore, it is expected that the database-updating process will not be a time-consuming task, and its usability can be improved in the future. BioSQL allows customization of its schema through extension modules, such as the PhyloDB, which allows the storage of taxonomy and phylogenetic trees. Besides MySQL, relational databases such as PostgreSQL, HSQLDB, Apache Derby and Oracle also support this bioinformatics tool. The adopted BioSQL schema is available as additional file 1.
GenBank files are used to provide the required information and keep it updated in the database. Sequences, resources and notes are included in the database from BioPython scripts and the SeqIO module . Multiple alignments were adopted by means of the algorithms CLUSTAL Omega  and MUSCLE . The verification of the homology between nucleotides of the bacterial genes was also integrated as a software tool into the web interface of the proposed database. This process is very important for identifying regions aligned among various species and plays a key role in the application of the MLSA methodology, in order that only after aligning and trimming of all the analysed sequences of equal size, it is possible to perform the phylogenetic and taxonomic inferences of the analysed species. The multiple sequence alignment is performed by means of web services developed by the European Bioinformatics Institute (EMBL-EBI), available for CLUSTAL Omega [http://www.ebi.ac.uk/Tools/webservices/services/msa/clustalo_soap] and for MUSCLE [http://www.ebi.ac.uk/Tools/webservices/services/msa/muscle_soap].
Finally, scripts in PHP and Java Script were developed in order to parameterize and to perform the post processing of the bioinformatics tools available in the database. These scripts are important to make the cropping areas of common genes aligned, allowing individual analyses of these genes and concatenating the loci for the application of the MLSA methodology.
The database presented in this work consists of 286 genomic sequences, distributed in six specific housekeeping genes, namely: atpD, dnaK, glnll, recA, gyrB and rpoB. Nineteen species of the Bradyrhizobium genus were considered: B. betae, B. canariense, B. cytsi, B. daqingense, B. denitrificans, B. diazoefficiens, B. elkanii, B. huanghuaihaiense, B. icense, B. iriomotense, B. japonicum, B. jicamae, B. lablabi, B. liaoningense, B. oligorophicum, B. pachyrhizi, B. paxllaeri, B. rifense and B. yuanmingense.
GenBank accession numbers of the sequences used in this work.
B. betae LMG 21987 T
B. canariense LMG 22265 T
B. cylisiCTAW 11 T
B. dagingense CCBAU 15774 T
B. deniirificans 8443
B. diazoefficiens USDA 110 T
B. elkanii USDA 76 T
B. huanghuaihaiense CCBAU 23303 T
B. iriomoiense EK 05 T
B. japonicum USDA 6 T
B. jicamae PAC 68 T
B. lablabi CCBAU 23086 T
B. liaoningense LMG 18230 T
B. pachyrhizi PAC 48 T
B. rifense CTAW 71 T
B. yuanmingense LMG 21827 T
B. icense LMTR 13
B. oligoirophicum LMG 10732
B. paxllaeri LMTR 21
Rhodopseudomonas palusiris CGA009
Rhizobium pisi strain DSM 30132
Observing Figure 1, we see that the user must provide data from one to six genes in the analysis. The next step consists of loading of the sequences stored in the database according to the sequences of the genes inserted by the user. Thus, the multiple alignment is performed by considering the input and the database sequences through the EBI-EML web service from which the user can choose to use the CLUSTAL Omega or MUSCLE algorithms. After performing the multiple alignment, a script will select and cut off the aligned regions of all sequences related to each specific gene. This task will produce sequences of equal sizes. After the alignment of all sequences for each one of the three genes, a new script will perform a concatenation of the gene sequences, thus producing a new sequence. At the end of this process, a new multiple alignment is performed with the concatenated sequences, and the results are processed by a script in order to produce the following outputs:
Text with the results of the multiple gene alignments;
Parameters for phylogenetic tree generating;
which will assist in the classification of the organism.
The similarity matrix (score) produces an objective result, from which it is possible to verify the proximity between sequenced species (input) and all species available in the Bradyrhizobium database containing the three selected genes by the user.
In our study, validation was performed by using 16 strains, 14 of which represent type strains of the genus Bradyrhizobium: B. betae LMG 21987 T , B. canariense LMG 22265 T , B. cytsi CTAW11 T , B. diazoefficiens SEMI A 5060, B. diazoefficiens SEMIA 5080, B. diazoefficiens SEMIA 6059, B. diazoefficiens USDA 110 T , B. elkanii USDA 76 T , B. iriomotense EK05 T , B. japonicum USDA 6 T , B. japonicum SEMIA 5079, B. jicamae PAC 68 T , B. lablabi CCBAU 23086 T , B. lianingense LMG 18230T. A sequence representing an outgroup was included in the database: Rhodopseudomonas palustris CGA009. The last adopted sequence belongs to R. pisi DSM 30132T, included as a negative control, i.e. a strain belonging to the genus Rhizobium rather than Bradyrhizobium. All genome sequences were collected from GenBank .
Subset of genes used to test the proposed database by the MLSA methodology.
Quantity of Strains
Quantity of Strains for Genes Used
Algorithm for the Multiple Sequence Alignment
atpD, dnaK, glnll
dnaK, recA, gyrB
atpD, dnaK, glnll, recA
atpD, dnaK, glnll
dnaK, recA, gyrB
atpD, dnaK, glnll, recA
Parameters for the execution of multiple sequence alignment algorithm.
Dealing input sequences
Mbed-like clustering guide-tree
Mbed-like clustering iteration
Number of combined iterations
Max guide tree iterations
Max hmm iterations
The table available as additional file 2 shows an identity matrix created where the values represent the similarity values between the sequences of the species of the database and the species used for the input test, Bradyrhizobium betae LMG 21987 t . AS expected, the similarity rate of 100% was found between the input test and the species B. betae LMG 21987 T . The similarity matrix also allows confirms the current taxonomy of the Bradyrhizobium genus (5), with B. betae LMG 21987 T showing higher similarity with B. diazoefficiens strains SEMIA 5060, SEMIA 5080, SEMIA 6059 and with the type strain B. diazoefficiens USDA 110 T , of 96.29%, 96.13%, 96.06% and 96.06, respectively. None of the three strains was found to be the same species as the input test because they are all below the cut-off of 98.70%.
Summary of the results.
Cut Off Used
atpD dnaK glnll recA
atpD dnaK glnll recA
dnaK recA gyrB
atpD dnaK glnll
dnaK recA gyrB
atpD dnaK glnll
Using algorithm CLUSTAL Omega the subset of genes atpD+dnaK+glnll shows values of 96.16% for accuracy, 100.00% for precision, 65.83% for recall and 73.64% for f-score, while considering the subset of genes dnaK+recA+gyrB, the values were of 98.33%, 100.00%, 85.78% and 88.89%, for subset with 4 genes atpD+dnaK+glnll, the values were of 97.26%, 100.00%, 75.39% and 81.39% for accuracy, precision, recall and f-score, respectively.
Using MUSCLE algorithm for analyse the same subset of genes atpD+dnaK+glnll shows values of 95.72% for accuracy, 92.00% for precision, 66.94% for recall and 71.26% for f-score, while considering the subset of genes dnaK+recA+gyrB, the values were of 97.92%, 100.00%, 82.44% and 86.98%, and for subset with 4 genes atpD+dnaK+glnll, the values were of 97.15%, 100.00%, 77.50% and 82.59% for accuracy, precision, recall and f-score, respectively.
Using the CLUSTAL Omega algorithm and the dnaK+recA+gyrB genes, the strain B. diazofficiens SEMIA 5080 was correctly identified as B. diazoefficiens; the classification indicated similarities of 99.92% with strain SEMIA 5060, of 99.52% with SEMIA 6059 and of 99.20% with the type strain B. diazoefficiens USDA 110 T . This result indicates the correctness of the method for the classification of these SEMIA strains, which are different but fit into the same B. diazoefficiens species. The genes atpD+dnak+glnll analysed with the same algorithm showed similarities of 99.84% with B. diazoefficiens SEMIA 5060, 99.59% with B. diazoefficiens USDA 110 T and 85.28% for B. diazoefficiens 6059.
In an additional test, considering the sequences related to B. japonicum strain SEMIA 5079 as input, we found that genes dnaK+atpD+glnll analysed with the CLUSTAL algorithm Omega resulted in the correct identification of the species and that the strain showed similarity with other strains, of 99.69% with B. japonicum USDA 6 T and of 98.84% with SEMIA 511. When analysed with the MUSCLE algorithm, the results were of 99.69% with B. japonicum UADA 6 T , of 99.30% with SEMIA 512 and of 98.83% with SEMIA 511.
Another result demonstrating increased precision from the selection of certain genes was observed in the analysis of the species B. liaoningense LMG 18230 T . When atpD+dnaK+glnll+recA genes were chosen, the algorithm CLUSTAL Omega presented a similarity of 97.60% between the type strain with the strain SEMIA 5025, while Muscle algorithm shows a 97.50% of similarity, whereas the analysis of atpD+dnaK+glnll genes resulted in a similarity of 97.17% using the Omega CLUSTAL and of 97.20% using the MUSCLE algorithm.
When the test set was used with genomic sequences of the species Rhizobium pisi, the classification resulted in values ranging from 30.00% to 82.15%, considering all the combinations involving alignment algorithms and subsets of genes. The results indicate the correct classification of Rhizobium pisi as not belonging to a species of Bradyrhizobium as described in additional file 3.
This work was developed in order to provide a database for the taxonomic and phylogenetic identification of the genus Bradyrhizobium by using the multilocus sequence analysis (MLSA) methodology. More specifically, the following tools and database functionality were developed:
a database based on a relational model using BioSQL to store data and to maintain the interoperability between bioinformatics projects such as BioPerl, BioPython and BioJava;
a database with validated information of Bradyrhizobium species through a friendly web interface for users;
computational tools suitable for the automatic data mining, analysis and classification of genomic sequences;
computational scripts for the automatic updating of the database with sequences used in the identification and classification process;
The experimental results indicate that the proposed database and the computational tools correctly distinguished species of the same genus and with high similarity rates, reinforcing the efficiency of the MLSA methodology. The Results also show that for the efficient use of the MLSA database it is important to know the combinations of genes that will be used in the taxonomic analysis, as well as the similarity rates that could be used for each genus. Therefore, it is necessary to perform previous tests in order to achieve the best results. The proposed database provides useful information for research in taxonomy and molecular phylogeny of the genus Bradyrhizobium, taking into account the possibility of gathering into a single database information that is commonly needed for studies of these microorganisms and is fragmented in various sources and formats. The current database contains 286 entries of gene sequences of the Bradyrhizobium genus. However, further studies are planned to include sequences of other rhizobial genera: Rhizobium, Sinorhizobium, Azorhizobium, Mesorhizobium and Neorhizobium. There is also the possibility of increasing the number of genes to be analysed. Finally, it is important to integrate the current results with other software packages that allow the visualization of the results directly into a web page, creating an association that will make it even more simple and practical to interpret phylogenetic implications from the proposed database.
This work was supported by CNPq and Fundação Araucária. We thank to Dr. Renan A. Ribeiro and Jakeline Delamuta for helping in providing sequences and discussion.
The authors declare that funding for publication of the article was sponsored by UTFPR - Federal University of Technology - Paraná and CNPq grant # 562008/2010-1.
This article has been published as part of BMC Genomics Volume 16 Supplement 5, 2015: Proceedings of the 10th International Conference of the Brazilian Association for Bioinformatics and Computational Biology (X-Meeting 2014). The full contents of the supplement are available online at http://www.biomedcentral.com/bmcgenomics/supplements/16/S5.
- EnTao W, Mínez-Romero E, Triplett E, et al: Phylogeny of root-and stem-nodule bacteria associated with legumes. Prokaryotic nitrogen fixation: a model system for the analysis of a biological process. 2000, 177-186.Google Scholar
- Lapage SP, Sneath PHA, Lessel EF, Skerman VBD, Seeliger HPR, Clark WA: International Code of Nomenclature of Bacteria: Bacteriological Code, 1990 Revision. 1992, ASM Press, Washington (DC)Google Scholar
- Gehlen MAC: Mapeamento de genes nif publicados no ncbi usando conceitos de mineração de dados e inteligência artificial. 2012Google Scholar
- Hungria M, Vienna P, Delamuta JRM: Bradyrhizobium, the ancestor of all rhizobia: phylogeny of housekeeping and nitrogen-fixation genes. Biological nitrogen fixation.Google Scholar
- Norris DO: Acid production byrhizobium a unifying concept. Plant and Soil. 1965, 22 (2): 143-166.View ArticleGoogle Scholar
- Lloret L, Martínez-Romero E: Evolucion y filogenia de rhizobium. Rev Latinoam Microbiol. 2005, 47 (1-2): 43-60.PubMedGoogle Scholar
- Doyle JJ: Phylogenetic perspectives on the origins of nodulation. Molecular Plant-Microbe Interactions. 2011, 24 (11): 1289-1295.View ArticlePubMedGoogle Scholar
- Parker MA: The spread of bradyrhizobium lineages across host legume clades: from abarema to zygia. Microbial ecology. 2014, 69 (3): 630-640.View ArticlePubMedGoogle Scholar
- Delamuta JR, Ribeiro RA, Menna P, Bangel EV, Hungria M: Multilocus sequence analysis (MLSA) of Bradyrhizobium strains: revealing high diversity of tropical diazotrophic symbiotic bacteria. Brazilian Journal of Microbiology. 2012, 43 (2): 698-710.PubMed CentralView ArticlePubMedGoogle Scholar
- Germano MG, Menna P, Mostasso FL, Hungria M: RFLP analysis of the rRNA operon of a Brazilian collection of bradyrhizobial strains from 33 legume species. Int J Syst Evol Microbiol. 2006, 56 (Pt 1): 217-229.View ArticlePubMedGoogle Scholar
- Menna P, Hungria M, Barcellos FG, Bangel EV, Hess PN, Martínez-Romero E: Molecular phylogeny based on the 16s rRNA gene of elite rhizobial strains used in Brazilian commercial inoculants. Systematic and Applied Microbiology. 2006, 29 (4): 315-332.View ArticlePubMedGoogle Scholar
- Menna P, Barcellos FG, Hungria M: Phylogeny and taxonomy of a diverse collection of bradyrhizobium strains based on multilocus sequence analysis of the 16s rRNA gene, ITS region and glnII, recA, atpD and dnaK genes. Int J Syst Evol Microbiol. 2009, 59 (Pt 12): 2934-2950.View ArticlePubMedGoogle Scholar
- Menna P, Pereira AA, Bangel EV, Hungria M: Rep-PCR of tropical rhizobia for strain fingerprinting, biodiversity appraisal and as a taxonomic and phylogenetic tool. Symbiosis. 2009, 48 (1-3): 120-130.View ArticleGoogle Scholar
- Delamuta JR, Ribeiro RA, Ormeno-Orrillo E, Melo IS, Martínez-Romero E, Hungria M: Polyphasic evidence supporting the reclassification of Bradyrhizobium japonicum group ia strains as Bradyrhizobium diazoefficiens sp. nov. Int J Syst Evol Microbiol. 2013, 69 (Pt 9): 3342-3351.View ArticleGoogle Scholar
- Bennasar A, Mulet M, Lalucat J, Garcia-Valdes E: PseudoMLSA: a database for multigenic sequence analysis of Pseudomonas species. BMC Microbiology. 2010, 10: 118-PubMed CentralView ArticlePubMedGoogle Scholar
- Chun J, Lee J, Jung Y, Kim M, Kim S, Kim BK, Lim YW: Eztaxon: a web-based tool for the identification of prokaryotes based on 16s ribosomal rna gene sequences. Int J Syst Evol Microbiol. 2007, 57 (Pt 10): 2259-2261.View ArticlePubMedGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410.View ArticlePubMedGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-4680.PubMed CentralView ArticlePubMedGoogle Scholar
- Felsenstein J: Phylip - phylogeny inference package (version 3.2). 1989, 5 (): 164-166.Google Scholar
- Rosselló-Mora R, Amann R: The species concept for prokaryotes. FEMS microbiology reviews. 2001, 25 (1): 39-67.View ArticlePubMedGoogle Scholar
- Rivas R, García-Fraile P, Velázquez E, et al: Taxonomy of bacteria nodulating legumes. Microbiology Insights. 2009, 2: 51-69.Google Scholar
- Gillis M, Van Van T, Bardin R, Goor M, Hebbar P, Willems A, et al: Polyphasic taxonomy in the genus burkholderia leading to an emended description of the genus and proposition of burkholderia vietnamiensis sp. nov. for n2-fixing isolates from rice in Vietnam. International Journal of Systematic Bacteriology. 1995, 45 (2): 274-289.View ArticleGoogle Scholar
- Vandamme P, Pot B, Gillis M, De Vos P, Kersters K, Swings J: Polyphasic taxonomy, a consensus approach to bacterial systematics. Microbiological Reviews. 1996, 60 (2): 407-438.PubMed CentralPubMedGoogle Scholar
- Stackebrandt E, Frederiksen W, Garrity GM, Grimont PA, Kampfer P, Maiden MC, et al: Report of the ad hoc committee for the re-evaluation of the species definition in bacteriology. International Journal of Systematic and Evolutionary Microbiology. 2002, 52 (Pt 3): 1043-1047.PubMedGoogle Scholar
- Gevers D, Cohan FM, Lawrence JG, Spratt BG, Coenye T, Feil EJ, et al: Re-evaluating prokaryotic species. Nature Reviews Microbiology. 2005, 3 (9): 733-739.View ArticlePubMedGoogle Scholar
- Ramos PL, Moreira-Filho CA, Van Trappen S, Swings J, Vos P, Barbosa HR, et al: An MLSA-based online scheme for the rapid identification of Stenotrophomonas isolates. Memórias do Instituto Oswaldo Cruz. 2011, 106 (4): 394-399.View ArticlePubMedGoogle Scholar
- Martens M, Delaere M, Coopman R, De Vos P, Gillis M, Willems A: Multilocus sequence analysis of ensifer and related taxa. International Journal of Systematic and Evolutionary Microbiology. 2007, 57 (Pt 3): 489-503.View ArticlePubMedGoogle Scholar
- Olsen GJ, Woese CR: Ribosomal RNA: a key to phylogeny. The FASEB journal. 1993, 7 (1): 113-123.PubMedGoogle Scholar
- Barrera LL, Trujillo ME, Goodfellow M, García FJ, Hernandez-Lucas I, Davila G, et al: Biodiversity of bradyrhizobia nodulating Lupinus spp. Int J Syst Bacteriol. 1997, 47 (4): 1086-1091.View ArticlePubMedGoogle Scholar
- Martínez-Romero E, CaballeroMellado J: Rhizobium phylogenies and bacterial genetic diversity. Critical Reviews in Plant Sciences. 1996, 15 (2): 113-140.View ArticleGoogle Scholar
- Coenye T, Vandamme P, Govan JR, LiPuma JJ: Taxonomy and identification of the Burkholderia cepacia complex. Journal of Clinical Microbiology. 2001, 39 (10): 3427-3436.PubMed CentralView ArticlePubMedGoogle Scholar
- Coenye T, Vandamme P: Extracting phylogenetic information from whole-genome sequencing projects: the lactic acid bacteria as a test case. Microbiology. 2003, 149 (Pt 12): 3507-3517.View ArticlePubMedGoogle Scholar
- Ribeiro RA, Rogel MA, López-López A, Ormeño-Orrillo E, Barcellos FG, Martínez J, et al: Reclassification of Rhizobium tropici type A strains as Rhizobium leucaenae sp. nov. Int J Syst Evol Microbiol. 2012, 62 (Pt 5): 1179-1184.View ArticlePubMedGoogle Scholar
- Coenye T, Gevers D, Van de Peer Y, Vandamme P, Swings J: Towards a prokaryotic genomic taxonomy. FEMS microbiology reviews. 2005, 29 (2): 147-167.View ArticlePubMedGoogle Scholar
- Dall'Agnol RF, Delamuta JRM, A RR: Diversidade e filogenia de estirpes de rhizobium pela metodologia de mlsa. Embrapa Soja-Artigo em Anais de Congresso (ALICE) (2012). A responsabilidade socioambiental da pesquisa agrícola: anais. Viçosa: SBCS. 2012, 4-Trab. 1212Google Scholar
- Ribeiro RA, Barcellos FG, Thompson FL, Hungria M: Multilocus sequence analysis of brazilian rhizobium microsymbionts of common bean (phaseolus vulgaris I.) reveals unexpected taxonomic diversity. Research in Microbiology. 2009, 160 (4): 297-306.View ArticlePubMedGoogle Scholar
- Thompson FL, Gevers D, Thompson CC, Dawyndt P, Naser S, Hoste B, et al: Phylogeny and molecular identification of vibrios on the basis of multilocus sequence analysis. Appl Environ Microbiol. 2005, 71 (9): 5107-5115.PubMed CentralView ArticlePubMedGoogle Scholar
- Zeigler DR: Gene sequences useful for predicting relatedness of whole genomes in bacteria. Int J Syst Evol Microbiol. 2003, 53 (6): 1893-1900.View ArticlePubMedGoogle Scholar
- Dall'agnol RF, Ribeiro RA, Ormeño-orrillo E, Rogel MA, Delamuta JRM, Andrade DS, et al: Rhizobium freirei sp. nov., a symbiont of Phaseolus vulgaris veryeffective in fixing nitrogen. International Journal of Systematic and Evolutionary Microbiology. 2013, 63 (): 4167-4173.View ArticlePubMedGoogle Scholar
- Dallágnol RF, Ribeiro RA, Ormeño-orrillo E, Rogel MA, Delamuta JRM, Andrade DS, et al: Rhizobium paranaensesp. nov., an effective N2-fixing symbiont of common bean (Phaseolus vulgaris L.) with broad geographical distribution in Brazil. Int J Syst Evol Microbiol. 2014, 64 (Pt 9): 3222-3229.View ArticleGoogle Scholar
- Ribeiro RA, Ormeno-Orrillo E, DaM'Agnol RF, Graham PH, Martinez-Romero E, Hungria M: Novel Rhizobium lineages isolated from root nodules of the common bean (Phaseolus vulgaris I.) in Andean and Mesoamerican areas. Research in Microbiology. 2013, 164 (7): 740-748.View ArticlePubMedGoogle Scholar
- BioSQL Project Main Page. 2014, [http://www.biosql.org/wiki/Main_Page]
- Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: Genbank Nucleic acids research. 2003, 31 (1): 23-View ArticlePubMedGoogle Scholar
- Knight J: Seqio: Ac package for reading and writing sequences. 1996, Distributed by the author. Freely available at http://bioweb.pasteur.fr/docs/seqio/seqio.htmlGoogle Scholar
- Edgar RC: Muscle: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797.PubMed CentralView ArticlePubMedGoogle Scholar
- Chahboune R, Carro L, Peix A, Barrijal S, Velázquez E, Bedmar EJ: Bradyrhizobium cytisi sp. nov. isolated from effective nodules of Cytisus villosus in Morocco. International Journal of Systematic and Evolutionary Microbiology. 2011, 61 (Pt 12): 2922-2927.View ArticlePubMedGoogle Scholar
- Chahboune R, Carro L, Peix A, Ramírez-Bahena MH, Barrijal S, Velázquez E, Bedmar EJ: Bradyrhizobium rifense sp. nov. isolated from effective nodules of Cytisus villosus grown in the Moroccan Rif. Systematic and Applied Microbiology. 2012, 35 (5): 302-305.View ArticlePubMedGoogle Scholar
- Chang YL, Wang JY, Wang ET, Liu HC, Sui XH, Chen WX: Bradyrhizobium lablabi sp. nov., isolated from effective nodules of Lablab purpureus and Arachis hypogaea. Int J Syst Evol Microbiol. 2011, 61 (Pt 10): 2496-2502.View ArticlePubMedGoogle Scholar
- Zhang YM, Li Y, Chen WF, Wang ET, Sui XH, Li QQ, et al: Bradyrhizobium huanghuaihaiense sp. nov., an effective symbiotic bacterium isolated from soybean (Glycine max L.) nodules. Int J Syst Evol Microbiol. 2012, 62 (Pt 8): 1951-1957.View ArticlePubMedGoogle Scholar
- Stackebrandt E, Goebel BM: Taxonomic note: a place for dna-dna reassociation and 16s rrna sequence analysis in the present species definition in bacteriology. International Journal of Systematic Bacteriology. 1994, 44 (4): 846-849.View ArticleGoogle Scholar
- Stackebrandt E, Ebers J: Taxonomic parameters revisited: tarnished gold standards. Microbiology Today. 2006, 33 (4): 152-155.Google Scholar
- Konstantinidis KT, Ramette A, Tiedje JM: Toward a more robust assessment of intraspecies diversity, using fewer genetic markers. Applied and Environmental Microbiology. 2006, 72 (11): 7286-7293.PubMed CentralView ArticlePubMedGoogle Scholar
- Gamermann D, Montagud A, Conejero JA, Urchueguía JF, de Córdoba PF: New approach for phylogenetic tree recovery based on genome-scale metabolic networks. Journal of Computational Biology. 2014, 21 (7): 508-519.PubMed CentralView ArticlePubMedGoogle Scholar
- Tamura K, Stecher G, Peterson D, Filipski A, Kumar S: Mega6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013, 30 (12): 2725-2729.PubMed CentralView ArticlePubMedGoogle Scholar
- Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.PubMedGoogle Scholar
- Felsenstein J: Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985, 39 (4): 783-791.View ArticleGoogle Scholar
- Tamura K, Nei M: Estimation of the number of nucleotide substitutions in the control region of mitochondrial dna in humans and chimpanzees. Molecular biology and evolution. 1993, 10 (3): 512-526.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.