- Open Access
Genome-wide Mycobacterium tuberculosis variation (GMTV) database: a new tool for integrating sequence variations and epidemiology
- Ekaterina N Chernyaeva1Email author,
- Marina V Shulgina2,
- Mikhail S Rotkevich1,
- Pavel V Dobrynin1,
- Serguei A Simonov1,
- Egor A Shitikov3,
- Dmitry S Ischenko3, 4,
- Irina Y Karpova3,
- Elena S Kostryukova3,
- Elena N Ilina3,
- Vadim M Govorun3,
- Vyacheslav Y Zhuravlev2,
- Olga A Manicheva2,
- Peter K Yablonsky2,
- Yulia D Isaeva5,
- Elena Y Nosova5,
- Igor V Mokrousov6,
- Anna A Vyazovaya6,
- Olga V Narvskaya6,
- Alla L Lapidus1, 7 and
- Stephen J O’Brien1Email author
© Chernyaeva et al.; licensee BioMed Central Ltd. 2014
- Received: 26 September 2013
- Accepted: 15 April 2014
- Published: 25 April 2014
Tuberculosis (TB) poses a worldwide threat due to advancing multidrug-resistant strains and deadly co-infections with Human immunodeficiency virus. Today large amounts of Mycobacterium tuberculosis whole genome sequencing data are being assessed broadly and yet there exists no comprehensive online resource that connects M. tuberculosis genome variants with geographic origin, with drug resistance or with clinical outcome.
Here we describe a broadly inclusive unifying Genome-wide Mycobacterium tuberculosis Variation (GMTV) database, (http://mtb.dobzhanskycenter.org) that catalogues genome variations of M. tuberculosis strains collected across Russia. GMTV contains a broad spectrum of data derived from different sources and related to M. tuberculosis molecular biology, epidemiology, TB clinical outcome, year and place of isolation, drug resistance profiles and displays the variants across the genome using a dedicated genome browser. GMTV database, which includes 1084 genomes and over 69,000 SNP or Indel variants, can be queried about M. tuberculosis genome variation and putative associations with drug resistance, geographical origin, and clinical stages and outcomes.
Implementation of GMTV tracks the pattern of changes of M. tuberculosis strains in different geographical areas, facilitates disease gene discoveries associated with drug resistance or different clinical sequelae, and automates comparative genomic analyses among M. tuberculosis strains.
- Mycobacterium tuberculosis
- Genome variations
- Genetic diversity
- Whole genome sequencing
Tuberculosis (TB) remains an ongoing threat to worldwide public health, which in 2011 caused some 8.7 new cases and killed 1.4 million people, including 430,000 co-infected with Human immunodeficiency virus (HIV) . The incidence of multidrug-resistant strains is rising in spite of increasing financial resources being released to stem the epidemic. With globalization, improvement of the health care and epidemic control systems in one country may not guarantee prevention of this airborne disease in others. Russia reported 180,000 TB cases and 20,000 TB deaths in 2011 and shows the highest incidence of new multidrug-resistant strains developed largely due to noncompliant drug regimens [1, 2].
Molecular genetic studies of Mycobacterium tuberculosis strains using various genotyping technologies offer an approach to monitor strain dispersal and evolutionary adaptations, important to stem bacterial and disease spread. Genetic markers that track TB transmission include IS6110, polymorphic GC-reach repetitive sequences, direct repeat regions and mycobacterial interspersed repetitive units [3–7]. Recently, it was shown that bacterial whole genome sequencing (WGS) provides greater discriminative power [8–11]. WGS of multiple isolates may address a broad range of topics – from questions on the transmission of clinical strains to how M. tuberculosis evolves over long and short time scales. Rapid analysis of WGS data allows to detect bacterial genetic variants based on single nucleotide polymorphisms (SNPs) and insertion/deletions (Indels), including mutations associated with drug resistance or genetic lineage.
Increasingly large quantities of genome sequence data are becoming available from different types of M. tuberculosis studies [12–16]. Numerous M. tuberculosis WGS studies have been used for phylogenetic analyses and to identify genetic factors involved in TB drug resistance [15, 17, 18]. Several databases were developed to systematize and compare genomic data. TubercuList (http://tuberculist.epfl.ch) database provides gene-based information of M. tuberculosis H37Rv genome . Tuberculosis Database (TBDB http://www.tbdb.org) is an integrated database providing access to TB genomic sequence data and resources from Mycobacterium species and M. tuberculosis strains, relevant to the discovery and development of TB drugs, vaccines and biomarkers. Currently TBDB contains information about 21 mycobacterial species whole genome sequences, nine of which belong to M. tuberculosis complex . Mycobacterial Genome Divergence Database (MGDD), allows to find genetic differences between two strains or species of M. tuberculosis complex . A web-based comprehensive information system Pathosystems Resource Integration Center (PATRIC, http://patricbrc.org) provides comparative analysis for genomes of different bacterial pathogens, one of which is M. tuberculosis. To date PATRIC contains 201 mycobacterial isolates whole genome sequences, 68 of which are M. tuberculosis genomes.
Although these TB information databases have been established, there is no comprehensive online resource that brings together detailed information on M. tuberculosis genome variations associated with phylogeographic distribution, drug resistance and clinical outcome of TB. Here we describe and release a broadly inclusive unifying database – Genome-wide Mycobacterium tuberculosis Variation (GMTV) – that catalogues genome variations of Russian M. tuberculosis strains combined with available clinical data. GMTV helps to discover genomic variants of M. tuberculosis strains from different geographical areas and lists genetic markers associated with drug resistance and different clinical TB signs. GMTV allows association analysis between molecular variation and clinical consequences as well as facilitating epidemiological surveillance of TB and HIV/TB co-infection. Our hope is that the database will allow to find efficacious strategies to control TB infection and spread.
GMTV database contains a broad spectrum of data derived from different sources and relates to M. tuberculosis molecular biology, epidemiology, TB clinical outcome, year and place of isolation, and drug resistance profiles. Access to GMTV database is distributed through web application with Python backend that connected to our MySQL database. Every record in the database is identified by the unique sample ID. Each sample ID corresponds to the set of SNPs and Indels and other information (e.g. medical, geographical and drug resistance data). The database includes information from following databases: NCBI, KEGG metabolic pathways [23, 24] and TubercuList , the web interface provides access to the corresponding websites through hyperlinks. The GMTV genome browser is an essential tool for genome variations visualization that could be used as an analytical tool to compare nucleotide variations. The core of our MySQL database is manually curated and contain M. tuberculosis genome sequences assessed at Theodosius Dobzhansky Center for Genome Bioinformatics (St. Petersburg), Research Institute of Physical-Chemical Medicine (Moscow) and publicly available data of sequenced M. tuberculosis strains obtained in Russia.
Mycobacterium tuberculosis H37Rv reference genome (NC_000962.3) was used for SNP and Indel calling. For reference assisted assembly sequence reads were aligned on reference genome (H37Rv) using bowtie2 program  with standard parameters (bowtie2 -x H37Rv - p30 -U raw_reads.fq -S aligned_reads.sam). For SNP calling and VCF file processing a combination of samtools and vcftools was used [26, 27].
A web genome browser, GMTVB, based on JBrowse platform [28, 29] is an essential component of GMTV. GMTVB allows one to compare distinct regions or genes among M. tuberculosis strains, including an option to select a particular reference sequence, e.g. H37Rv. GMTVB implements an AJAX paradigm, which increases reaction time. Distinctive regions, genes, genomic features are displayed in tracks. Some tracks are permanent for the reference (e.g. genes, CDS, repeats, etc.). The interface allows one to add, delete and substitute tracks as well as to change reference sequences. GMTV clinical and genetic data combined with graphical tools incorporated in GMTVB makes the database an effective instrument for TB analysis (available at http://mtb.dobzhanskycenter.org).
M. tuberculosis isolates and genome sequence
Whole genome sequences from 1084 M. tuberculosis isolates with various medical datasets from different regions of the Russian Federation comprise the present database. The database contains information on 73 isolates sequenced by our research group and 1011 publicly available genome sequences.
Whole genome sequence reads of other 1011 Russian M. tuberculosis isolates obtained in Samara region (Russia) were downloaded from the European Nucleotide Archive (http://www.ebi.ac.uk/ena/) submitted under accession no. ERP000192 [31, 32]. The region of M. tuberculosis strains isolation and available drug susceptibility tests results provided by the authors were deposited to GMTV database.
Sample: Sample ID, HIV status of the patient, patients’ gender, ear of strain isolation, spoligotype family name based on SpolDB4 , genetic clades based on SNP analysis , and geographical region.
Genes: Gene ID on NCBI database, gene name, locus tag, coordinates of gene start and end.
Variations (SNPs and Indels): SNP/Indel coordinates, Nucleic acid variations, Amino acid substitutions, Effect of the nucleic acid substitution (synonymous or nonsynonymous), various VCF file statistics, The SNP/Indels sections allows to download VCF file and easily get appropriate annotation of genome variations results. SNP, Indel or SNP/Indel options could be selected.
Genome variants (SNPs and Indels) in GMTV database filtered by Q30
Nonsynonymous mutations in CDS
Synonymous mutations in CDS
Variations in STOP-codons in CDS
Frameshift mutations in CDS
The GMTV web interface contains detailed information about genomic variations revealed from WGS data and provides for convenient analysis of genomic variation to facilitate searches of disease associations. Currently GMTV database allows:
Review SNPs and Indels of M. tuberculosis isolates filtered by quality score and sequence coverage selected by the user;
Identify functions and related metabolic pathways of genes where genome variations were identified using the links to KEGG and TubercuList databases;
Compare SNPs between several isolates selected by drug resistance, clinical outcome, geographical distribution, genetic lineage and other characteristics. It is possible to select all, common or unique genome variations, synonymous and nonsynonymous mutations are highlighted;
Annotate genome variations using integrated online-tool, download a table with annotated genome variations in CSV (Comma Separated Values) format for further research, and visualize results with the genome browser.
Download VCF files with nucleotide genome variations, FQ files with reference-assisted assemblies of MTB genomes and FASTA files with sequences of selected genes.
The “Download page” allows one to select some characteristics of M. tuberculosis isolates or ID of the interested isolates and to generate a table with information about genotype, geographic region and drug resistance. Each genome could be downloaded as a VCF file representing genome variations, or as an FQ file representing reference-assisted assembly of the genome. It is also possible to download FASTA files with a specific gene or genes, for this purpose the user have to select a special point “Gene” at the “Select genome region” section on the left bar region of the page and list one or several genes separated with ‘;’. This function is useful for comparative studies, for example, it allows analyzing selected protein-coding genes without intergenic regions.
The “Comparison samples” page is developed to compare single nucleotide genome variations. It is possible to browse variations in the whole genome sequence or gene-by-gene in selected M. tuberculosis isolates. For comparative analysis the user selects the sample ID in the left bar (the number is not limited) or to select interested features (medical, geographical, genotype or drug resistance). User may set Q score and coverage for SNPs. Generated table displays nucleotide variations and their position. It is possible to download the whole table or to browse mutations at the genome browser.
The combination of the database with a genome browser makes the GMTV an effective tool for interactive analysis and research. The browser illustrates a scalable map feature of the genome in tracks representing a site’s position, strand, value and supplement notes. Permanent tracks are preloaded into the browser (e.g. original DNA sequence, genes, repeats, etc. as well as defined SNPs, Indels) and listed on the left side of the screen. Tracks may be hidden or shown and there is an option to form tracks ad hoc based on SQL request. Such ad hoc tracks are created temporarily to provide opportunities to analyze the combination of data in visual form. Entire tracks can be downloaded in FASTA format. Scalable views of the genome elements inside the GMTVB let a user look at genome picture both with bird’s eye and as a detailed representation on the DNA level. GMTVB allows one to compare genomic features or to download selected feature for further analysis.
GMTV database is designed to assist in identification of genetic variants associated with drug resistance, clinical outcome or geographic distribution of the pathogen. It allows comparing nucleotide variations based on WGS data in different groups of M. tuberculosis isolates. Bacterial isolates could be divided into categories based on their geographic origin, drug resistance pattern, genetic clade or medical data. GMTV database functions allow using for phylogeographic, epidemiological and evolutionary studies.
GMTV is the first M. tuberculosis database to integrate clinical, epidemiological and microbiological description with genome variations based on whole genome sequencing data, a part of the large epidemiological database established at St. Petersburg Research Institute of Phthisiopulmonology. The development of a M. tuberculosis genome variations database will allow empirical exploring of influences of SNP and Indels around clinical outcomes. GMTV will facilitate the epidemiological surveillance of TB and HIV/TB co-infection and will help to develop effective strategies to control these infections in the population.
GMTV allows association analysis between molecular variation and clinical consequences as well as facilitates epidemiological surveillance of TB and HIV/TB co-infection. Our hope is to inform efficacious strategies for TB control.
The web server can be accessed at http://mtb.dobzhanskycenter.org.
Research was supported in part by the Russian Ministry of Education and Science, Mega-grant no. 11.G34.31.0068 and grant no. 16.522.11.2003.
- World Health Organization: Global Tuberculosis Report 2012. 2012, France: WHO, Available: http://who.int/tb/publications/global_report/gtbr12_main.pdf. Accessed 15 April 2014Google Scholar
- Phillips L: Infectious disease: TB’s revenge. Nature. 2013, 493 (7430): 14-16. 10.1038/493014a. doi:10.1038/493014aPubMedView ArticleGoogle Scholar
- Ross BC, Raios K, Jackson K, Dwyer B: Molecular cloning of a highly repeated DNA element from Mycobacterium tuberculosis and its use as an epidemiological tool. J Clin Microbiol. 1992, 30 (4): 942-946.PubMed CentralPubMedGoogle Scholar
- van Embden JD, Cave MD, Crawford JT, Dale JW, Eisenach KD, Gicquel B, Hermans P, Martin C, McAdam R, Shinnick TM, Small PM: Strain identification of Mycobacterium tuberculosis by DNA fingerprinting: recommendations for a standardized methodology. J Clin Microbiol. 1993, 31 (2): 406-409.PubMed CentralPubMedGoogle Scholar
- Kamerbeek J, Schouls L, Kolk A, van Agterveld M, van Soolingen D, Kuijper S, Bunschoten A, Molhuizen H, Shaw R, Goyal M, van Embden J: Simultaneous detection and strain differentiation of Mycobacterium tuberculosis for diagnosis and epidemiology. J Clin Microbiol. 1997, 35 (4): 907-914.PubMed CentralPubMedGoogle Scholar
- Frothingham R, Meeker-O’Connell WA: Genetic diversity in the Mycobacterium tuberculosis complex based on variable numbers of tandem DNA repeats. Microbiology. 1998, 144 (Pt 5): 1189-1196.PubMedView ArticleGoogle Scholar
- Mazars E, Lesjean S, Banuls AL, Gilbert M, Vincent V, Gicquel B, Tibayrenc M, Locht C, Supply P: High-resolution minisatellite-based typing as a portable approach to global analysis of Mycobacterium tuberculosis molecular epidemiology. Proc Natl Acad Sci U S A. 2001, 98 (4): 1901-1906. 10.1073/pnas.98.4.1901.PubMed CentralPubMedView ArticleGoogle Scholar
- Gardy JL, Johnston JC, Ho Sui SJ, Cook VJ, Shah L, Brodkin E, Rempel S, Moore R, Zhao Y, Holt R, Varhol R, Birol I, Lem M, Sharma MK, Elwood K, Jones SJ, Brinkman FS, Brunham RC, Tang P: Whole-genome sequencing and social-network analysis of a tuberculosis outbreak. N Engl J Med. 2011, 364 (8): 730-739. 10.1056/NEJMoa1003176.PubMedView ArticleGoogle Scholar
- Walker TM, Monk P, Grace Smith E, Peto TE: Contact investigations for outbreaks of Mycobacterium tuberculosis: advances through whole genome sequencing. Clin Microbiol Infect. 2013, doi:10.1111/1469-0691.12183. Epub ahead of printGoogle Scholar
- Roetzer A, Diel R, Kohl TA, Rückert C, Nübel U, Blom J, Wirth T, Jaenicke S, Schuback S, Rüsch-Gerdes S, Supply P, Kalinowski J, Niemann S: Whole genome sequencing versus traditional genotyping for investigation of a Mycobacterium tuberculosis outbreak: a longitudinal molecular epidemiological study. PLoS Med. 2013, 10 (2): e1001387-10.1371/journal.pmed.1001387. doi:10.1371/journal.pmed.1001387. Epub 2013 Feb 12PubMed CentralPubMedView ArticleGoogle Scholar
- Das S, Roychowdhury T, Kumar P, Kumar A, Kalra P, Singh J, Singh S, Prasad HK, Bhattacharya A: Genetic heterogeneity revealed by sequence analysis of Mycobacterium tuberculosis isolates from extra-pulmonary tuberculosis patients. BMC Genomics. 2013, 14: 404-10.1186/1471-2164-14-404. doi:10.1186/1471-2164-14-404PubMed CentralPubMedView ArticleGoogle Scholar
- Sreevatsan S, Pan X, Stockbauer KE, Connell ND, Kreiswirth BN, Whittam TS, Musser JM: Restricted structural gene polymorphism in the Mycobacterium tuberculosis complex indicates evolutionarily recent global dissemination. Proc Natl Acad Sci U S A. 1997, 94 (18): 9869-9874. 10.1073/pnas.94.18.9869.PubMed CentralPubMedView ArticleGoogle Scholar
- Filliol I, Motiwala AS, Cavatore M, Qi W, Hazbón MH, Bobadilla del Valle M, Fyfe J, García-García L, Rastogi N, Sola C, Zozio T, Guerrero MI, León CI, Crabtree J, Angiuoli S, Eisenach KD, Durmaz R, Joloba ML, Rendón A, Sifuentes-Osornio J, Ponce de León A, Cave MD, Fleischmann R, Whittam TS, Alland D: Global phylogeny of Mycobacterium tuberculosis based on single nucleotide polymorphism (SNP) analysis: insights into tuberculosis evolution, phylogenetic accuracy of other DNA fingerprinting systems, and recommendations for a minimal standard SNP set. J Bacteriol. 2006, 188 (2): 759-772. 10.1128/JB.188.2.759-772.2006. [Erratum in: J Bacteriol 2006, 188(8):3162–3163]PubMed CentralPubMedView ArticleGoogle Scholar
- Gagneux S, DeRiemer K, Van T, Kato-Maeda M, De Jong BC, Narayanan S, Nicol M, Niemann S, Kremer K, Gutierrez MC, Hilty M, Hopewell PC, Small PM: Variable host-pathogen compatibility in Mycobacterium tuberculosis. Proc Natl Acad Sci U S A. 2006, 103 (8): 2869-2873. 10.1073/pnas.0511240103. Epub 2006 Feb 13PubMed CentralPubMedView ArticleGoogle Scholar
- Comas I, Chakravartti J, Small PM, Galagan J, Niemann S, Kremer K, Ernst JD, Gagneux S: Human T cell epitopes of Mycobacterium tuberculosis are evolutionarily hyperconserved. Nat Genet. 2010, 42 (6): 498-503. 10.1038/ng.590. doi:10.1038/ng.590. Epub 2010 May 23PubMed CentralPubMedView ArticleGoogle Scholar
- Homolka S, Projahn M, Feuerriegel S, Ubben T, Diel R, Nübel U, Niemann S: High resolution discrimination of clinical Mycobacterium tuberculosis complex strains based on single nucleotide polymorphisms. PLoS One. 2012, 7 (7): e39855-10.1371/journal.pone.0039855. doi:10.1371/journal.pone.0039855. Epub 2012 Jul 2PubMed CentralPubMedView ArticleGoogle Scholar
- Ioerger TR, Koo S, No EG, Chen X, Larsen MH, Jacobs WR, Pillay M, Sturm AW, Sacchettini JC: Genome analysis of multi- and extensively-drug-resistant tuberculosis from KwaZulu-Natal, South Africa. PLoS One. 2009, 4 (11): e7778-10.1371/journal.pone.0007778. doi:10.1371/journal.pone.0007778PubMed CentralPubMedView ArticleGoogle Scholar
- Ilina EN, Shitikov EA, Ikryannikova LN, Alekseev DG, Kamashev DE, Malakhova MV, Parfenova TV, Afanas’ev MV, Ischenko DS, Bazaleev NA, Smirnova TG, Larionova EE, Chernousova LN, Beletsky AV, Mardanov AV, Ravin NV, Skryabin KG, Govorun VM: Comparative genomic analysis of Mycobacterium tuberculosis drug resistant strains from Russia. PLoS One. 2013, 8 (2): e56577-10.1371/journal.pone.0056577. doi:10.1371/journal.pone.0056577. Epub 2013 Feb 20PubMed CentralPubMedView ArticleGoogle Scholar
- Lew JM, Kapopoulou A, Jones LM, Cole ST: TubercuList–10 years after. Tuberculosis (Edinb). 2011, 91 (1): 1-7. 10.1016/j.tube.2010.09.008. doi:10.1016/j.tube.2010.09.008. Epub 2010 Oct 25. PubMed PMID: 20980199View ArticleGoogle Scholar
- Reddy TB, Riley R, Wymore F, Montgomery P, DeCaprio D, Engels R, Gellesch M, Hubble J, Jen D, Jin H, Koehrsen M, Larson L, Mao M, Nitzberg M, Sisk P, Stolte C, Weiner B, White J, Zachariah ZK, Sherlock G, Galagan JE, Ball CA, Schoolnik GK: TB database: an integrated platform for tuberculosis research. Nucleic Acids Res. 2009, 37 (Database issue): D499-D508. doi:10.1093/nar/gkn652. Epub 2008 Oct3. PubMed PMID: 18835847; PubMed Central PMCID: PMC2686437PubMed CentralPubMedView ArticleGoogle Scholar
- Vishnoi A, Srivastava A, Roy R, Bhattacharya A: MGDD: Mycobacterium tuberculosis genome divergence database. BMC Genomics. 2008, 9: 373-10.1186/1471-2164-9-373. doi:10.1186/1471-2164-9-373PubMed CentralPubMedView ArticleGoogle Scholar
- Gillespie JJ, Wattam AR, Cammer SA, Gabbard JL, Shukla MP, Dalay O, Driscoll T, Hix D, Mane SP, Mao C, Nordberg EK, Scott M, Schulman JR, Snyder EE, Sullivan DE, Wang C, Warren A, Williams KP, Xue T, Yoo HS, Zhang C, Zhang Y, Will R, Kenyon RW, Sobral BW: PATRIC: the comprehensive bacterial bioinformatics resource with a focus on human pathogenic species. Infect Immun. 2011, 79 (11): 4286-4298. 10.1128/IAI.00207-11. doi:10.1128/IAI.00207-11. Epub 2011 Sep 6PubMed CentralPubMedView ArticleGoogle Scholar
- Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M: KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012, 40 (Database issue): D109-D114. doi:10.1093/nar/gkr988. Epub 2011 Nov 10. PubMed PMID: 22080510; PubMed Central PMCID: PMC3245020PubMed CentralPubMedView ArticleGoogle Scholar
- Kanehisa M, Goto S: KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28 (1): 27-30. 10.1093/nar/28.1.27. PubMed PMID: 10592173; PubMed Central PMCID: PMC102409PubMed CentralPubMedView ArticleGoogle Scholar
- Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012, 9 (4): 357-359. 10.1038/nmeth.1923. doi:10.1038/nmeth.1923PubMed CentralPubMedView ArticleGoogle Scholar
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup: The sequence alignment/map (SAM) format and SAMtools. Bioinformatics. 2009, 25: 2078-2079. 10.1093/bioinformatics/btp352.PubMed CentralPubMedView ArticleGoogle Scholar
- Danecek P, Auton A, Abecasis G, Albers C, Banks E, DePristo M, Handsaker R, Lunter G, Marth G, Sherry S, McVean G, Durbin R, 1000 Genomes Project Analysis Group: The variant call format and VCFtools. Bioinformatics. 2011, 27 (15): 2156-2158. 10.1093/bioinformatics/btr330. doi:10.1093/bioinformatics/btr330. Epub 2011 Jun 7PubMed CentralPubMedView ArticleGoogle Scholar
- Skinner ME, Uzilov AV, Stein LD, Mungall CJ, Holmes IH: JBrowse: a next-generation genome browser. Genome Res. 2009, 19 (9): 1630-1638. 10.1101/gr.094607.109. doi:10.1101/gr.094607.109. Epub 2009 Jul 1PubMed CentralPubMedView ArticleGoogle Scholar
- Westesson O, Skinner M, Holmes I: Visualizing next-generation sequencing data with JBrowse. Brief Bioinform. 2013, 14 (2): 172-177. 10.1093/bib/bbr078. doi:10.1093/bib/bbr078. Epub 2012 Mar 12PubMed CentralPubMedView ArticleGoogle Scholar
- NCBI Sequence Read Archive. [http://www.ncbi.nlm.nih.gov/Traces/sra/]
- Casali N, Nikolayevskyy V, Balabanova Y, Ignatyeva O, Kontsevaya I, Harris SR, Bentley SD, Parkhill J, Nejentsev S, Hoffner SE, Horstmann RD, Brown T, Drobniewski F: Microevolution of extensively drug-resistant tuberculosis in Russia. Genome Res. 2012, 22 (4): 735-745. 10.1101/gr.128678.111. doi:10.1101/gr.128678.111. Epub 2012 Jan 31PubMed CentralPubMedView ArticleGoogle Scholar
- Casali N, Nikolayevskyy V, Balabanova Y, Harris SR, Ignatyeva O, Kontsevaya I, Corander J, Bryant J, Parkhill J, Nejentsev S, Horstmann RD, Brown T, Drobniewski F: Evolution and transmission of drug-resistant tuberculosis in a Russian population. Nat Genet. 2014, 46 (3): 279-286. 10.1038/ng.2878. doi:10.1038/ng.2878. Epub 2014 JanPubMed CentralPubMedView ArticleGoogle Scholar
- Brudey K, Driscoll JR, Rigouts L, Prodinger WM, Gori A, Al-Hajoj SA, Allix C, Aristimuño L, Arora J, Baumanis V, Binder L, Cafrune P, Cataldi A, Cheong S, Diel R, Ellermeier C, Evans JT, Fauville-Dufaux M, Ferdinand S, Garcia de Viedma D, Garzelli C, Gazzola L, Gomes HM, Guttierez MC, Hawkey PM, van Helden PD, Kadival GV, Kreiswirth BN, Kremer K, Kubin M, et al: Mycobacterium tuberculosis complex genetic diversity: mining the fourth international spoligotyping database (SpolDB4) for classification, population genetics and epidemiology. BMC Microbiol. 2006, 6: 23-10.1186/1471-2180-6-23.PubMed CentralPubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.