Skip to main content
Figure 3 | BMC Genomics

Figure 3

From: Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

Figure 3

Annotation of protein ORFs with hits to public database. A) Proportion of EST clones from the Glycine Max Gene Index (GMGI) matched by 454 reads at 95% and 100% sequence identity (using BLAST with e < 1E-6). The total number of sequences matching at 95% or higher identity is 37% of total EST clones. Note that few sequences match at 100% identity due to the error rate of the 454 pyrosequencing used for this study.B) Coding fragments discovered within the short reads (with e values to the GenBank protein (nr) database < 1E-6), and their closest protein-level sequence hit by taxonomy of the source organism of the database sequence.

Back to article page