Skip to main content

Table 3 Summary of homology searches of contigs against TAIR10 gene dataset using BLAST

From: Removal of redundant contigs from de novo RNA-Seq assemblies via homology search improves accurate detection of differentially expressed genes

 

Raw Contigs

Longest Contigs

Clustered Contigs

Annotated Contigs

Plant DB Contigs

PlantClust50 DB Contigs

Hit contigs

58,376 (93.64)*

54,175 (93.39)

55,857 (94.03)

23,873 (100)

40,179 (94.61)

24,784 (92.59)

Unique hit contigs

10,119 (16.23)

10,676 (18.40)

10,571 (17.79)

23,873 (100)

11,382 (26.80)

12,398 (46.32)

Multiple hit contigs

48,257 (77.41)

43,499 (74.99)

45,286 (76.23)

0 (0)

28,796 (67.81)

12,386 (46.28)

No hit

3963 (6.36)

3832 (6.61)

3548 (5.97)

0 (0)

2289 (0.05)

1982 (0.07)

Total

62,339 (100)

58,007 (100)

59,405 (100)

23,873 (100)

42,467 (100)

26,766 (100)

  1. * Values in parentheses are percentages of all contigs