Skip to main content
Figure 1 | BMC Genomics

Figure 1

From: Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

Figure 1

The percentage of all possible n -mers (coverage) that appear in H. sapien, M. musculus, D. melanogaster, C. elegans, A. thaliana, S. cerevisiae, E. coli k12 , theoretical and pseudo-human genomes. Theo-human is the maximum coverage a human-length genome could achieve if every n-mer in its genome was unique. The pseudo-human (pseudo-hs) genome is a random genome generated with the same length and dinucleotide frequencies of the human genome. The space coverage of each genome listed above is plotted against the length of the oligomer analyzed, ranging from 1 to 20.

Back to article page