Skip to main content

Table 4 Percentages of contigs showing sequence similarity (e-value < 10-5) with sequences stored in GenBank (nr, est databases and nr database restricted to the Insecta) and proteins of Caenorhabditis elegans, Drosophila melanogaster and Mus musculus (April 2007)

From: Collembase: a repository for springtail genomics and soil quality assessment

Database

BLAST

Significant hits for the total dataset

Significant hits excl. 140 clusters*

nr

blastx

42

41

nr

blastn

9

7

est

tblastx

40

39

nr ā€“ Insecta**

blastx

36

35

C. elegans

blastx

25

24

D. melanogaster

blastx

32

30

M. musculus

blastx

31

29

  1. * In total 140 clusters showed high similarity to yeast and human DNA sequences stored in GenBank and were therefore regarded as contamination.
  2. ** Blast analysis performed August 2007