Skip to main content

Table 2 List of M. bovis BCG strains for which high per-bp coverage complete genomes are available

From: Reference genome and comparative genome analysis for the WHO reference strain for Mycobacterium bovis BCG Danish, the present tuberculosis vaccine

Strain Method type DU2 DU2 resolved Accession numbers Reference Year of publication
BCG Pasteur 1173P2 Sanger sequencing (ABI 3700) of a pUC19, two pMAQ1b, a M13 and a cloned shotgun library, earlier analysis of BAC clones had already identified the DU1 and DU2 [13] IV yes PRJEA18059, AM408590 [11] 2007
BCG Tokyo 172 SOLiD (Agencourt Bioscience Corporation) sequencing of pAGEN vector library (about 4 kb and 800 bp inserts) and a fosmid insert library (about 40 kb) I yes (2x) PRJDA31211, AP010918 [9] 2009
BCG Moreau RDJ Sanger (ABI 3730) sequencing of 2 pBluescript libraries, gap closure via PCR I no PRJEA70285, AM412059 [43] 2011
BCG Tice ATCC 35743 Illumina genome sequencing, multiplex PCR and primer walking IV no PRJNA63839, CP003494 [7] 2011
BCG Mexico 1931 Roche 454 pyrosequencing, Sanger sequencing of fos-end sequences of 250 fosmids, sequencing of three fosmids and 110 PCR end reads IV yes PRJNA45811, CP002095 [44] 2011
BCG Korea 1168P Roche 454 (GS-FLX) and Illumina (HiSeq) sequencing, gap closure via PCR and primer walking IV yes (2x) PRJNA170028, CP003900 [45] 2013
BCG Russia 368 Roche 454 (GS Junior) sequencing on a shotgun and a 3 kb paired-end library, gap closure via PCR and Sanger sequencing I yes (2x) PRJNA256163, CP009243 [46, 47] 2014
BCG 3281 (clinical isolate) Roche 454 (GS-FLX) and Illumina (Hiseq2500) sequencing, gap closure via PCR III yes PRJNA251957, CP008744 [27] 2015
BCG Russia BCG-1 Roche 454 (GS-FLX) sequencing of a shotgun library, Ion Torrent PGM sequencing of a mate-pair library I yes (2x) PRJNA306822, CP013741 [48] 2016
BCG Danish 1331 (07/270) PacBio (RSII) (long read) sequencing (235x coverage) and Illumina (MiSeq) (short read) sequencing III yes PRJNA494982, CP039850 this study 2019
  1. For each strain, we indicated the used method to create the assembled genome, the type of DU2 present in the strain, whether the DU2 was resolved in the genome assembly, the BioProject and genome assembly accession number, the reference to the study in which genome assembly method was published and the year of publication. In the ‘method’ description, we have put labor/capital-intensive aspects in bold, illustrating that our approach using solely massive parallel sequencing, is the only one that provides both high per-bp accuracy (allowing for SNP calling) and complete resolution of the assembly across large repeat regions