
Table 2 Summary of gene dataset and reference contigs derived from de novo assembly of RNA-Seq reads in A. thaliana

From: Removal of redundant contigs from de novo RNA-Seq assemblies via homology search improves accurate detection of differentially expressed genes

 

| Statistic | Gene dataset | Raw Contigs | Longest Contigs | Clustered Contigs | Annotated Contigs | Plant DB Contigs | PlantClust50 DB Contigs |
|---|---|---|---|---|---|---|---|
| Number | 35,385 | 62,339 | 58,007 | 59,405 | 23,873 | 42,467 | 26,766 |
| Min. length (base) | 22 | 121 | 121 | 121 | 121 | 121 | 121 |
| Median length (base) | 1383 | 285 | 275 | 281 | 599 | 317 | 351 |
| N50 (base) | 1814 | 739 | 699 | 711 | 1042 | 826 | 987 |
| Mean length (base) | 1535 | 475 | 457 | 464 | 757 | 520 | 589 |
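
The table does not say how its summary statistics were computed. As a minimal sketch, assuming the conventional definition of N50 (the contig length at which the cumulative length of contigs, sorted longest first, reaches at least half of the total assembled bases), the rows above could be reproduced from a list of contig lengths as follows. The function names and the toy lengths are hypothetical and are not taken from the paper.

```python
from statistics import mean, median


def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Conventional definition (an assumption; the source does not state it):
    sort lengths from longest to shortest and return the length at which
    the running total first reaches half of the summed length.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("lengths must be non-empty")


def summarize(lengths):
    """Compute the same five statistics that appear as rows in the table."""
    return {
        "number": len(lengths),
        "min": min(lengths),
        "median": median(lengths),
        "n50": n50(lengths),
        "mean": mean(lengths),
    }


# Hypothetical toy contig lengths, for illustration only.
toy_lengths = [121, 285, 475, 739, 1200]
print(summarize(toy_lengths))
```

Because N50 is weighted by length, it is always at least as large as the median, which is consistent with every column in the table.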