Fig. 3 | BMC Genomics

Fig. 3

From: Comparative genomic analysis of the human genome and six bat genomes using unsupervised machine learning: Mb-level CpG and TFBS islands

DegePenta-BLSOMs for 3 bats and humans and the distribution of DegePenta frequencies on their chromosomes. A DegePenta-BLSOM for 1-Mb sequences with a 500-kb sliding step for 3 bats and humans and its U-matrix. For the DegeDi- and DegeTri-BLSOMs, the 1-Mb window was slid in 100-kb steps for comparison with the results of 100-kb fragmentation. However, a 100-kb step is narrower than needed merely to reduce the effects of the cutting positions of the 1-Mb window, and it consumes considerable computation time in high-dimensional analyses. Therefore, the step size was changed to 500 kb. B CG-containing DegePenta-BLSOM for 3 bats and humans and its U-matrix. C CG-containing DegePenta-BLSOM for humans and its U-matrix. In the BLSOM, nodes containing sequences from more than one chromosome are indicated in black, and those containing sequences from a single chromosome are indicated in chromosome-specific colors; for the meanings of the colors, see Supplemental Table S2. D CG-containing DegePenta-BLSOM for 3 bats and its U-matrix. E The occurrence frequency (%) of the top ten DegePentas on three human chromosomes is displayed as described in Fig. 1E. The top ten oligonucleotides differed among the three chromosomes, but each oligonucleotide is shown with the same symbol, listed in the rightmost panel, on all three chromosomes. The (Max-Ave)/Ave ratio was used to select the top ten, but the occurrence frequency (%) itself is displayed here. F The occurrence frequency (%) of the top ten DegePentas on three bat scaffolds is displayed. The same symbols are used regardless of the scaffold.
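
The windowing and top-ten selection described in the caption can be illustrated with a minimal Python sketch. It is not the authors' pipeline and does not perform the BLSOM itself. It assumes that "degenerate" means a pentanucleotide is merged with its reverse complement, and that the (Max-Ave)/Ave ratio is taken over the windows of one chromosome or scaffold; all function names (`canonical`, `windows`, `dege_freq`, `top_by_max_ave`) are hypothetical.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer):
    """Merge a k-mer with its reverse complement.

    Assumption: this is what 'degenerate' (Dege) means in the caption.
    """
    rc = kmer.translate(COMPLEMENT)[::-1]
    return min(kmer, rc)


def windows(seq, size, step):
    """Yield (start, subsequence) for every full-length sliding window."""
    for start in range(0, len(seq) - size + 1, step):
        yield start, seq[start:start + size]


def dege_freq(window, k=5):
    """Occurrence frequency (%) of each merged k-mer in one window."""
    counts = Counter()
    for i in range(len(window) - k + 1):
        kmer = window[i:i + k]
        # Skip k-mers containing N or other ambiguous bases.
        if set(kmer) <= set("ACGT"):
            counts[canonical(kmer)] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {m: 100.0 * c / total for m, c in counts.items()}


def top_by_max_ave(profiles, n=10):
    """Rank k-mers by (Max - Ave) / Ave across a list of window profiles."""
    kmers = set().union(*profiles)
    scores = {}
    for m in kmers:
        vals = [p.get(m, 0.0) for p in profiles]
        ave = sum(vals) / len(vals)
        if ave > 0:
            scores[m] = (max(vals) - ave) / ave
    # Sort by descending score, breaking ties alphabetically.
    return sorted(scores, key=lambda m: (-scores[m], m))[:n]
```

With a chromosome sequence held in a string `seq` (a placeholder), the settings in panel A would correspond to `profiles = [dege_freq(w) for _, w in windows(seq, 1_000_000, 500_000)]`, followed by `top_by_max_ave(profiles)`. Restricting the keys to pentanucleotides that contain `"CG"` would approximate the CG-containing variant in panels B to D, again as an assumption about how that subset is defined.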
