Table 1 Summary of T. salsuginea ESTs generated from 454 sequencing and assembled contigs

From: Transcriptome sequencing and microarray design for functional genomics in the extremophile Arabidopsis relative Thellungiella salsuginea (Eutrema salsugineum)

| Library | Non-normalized | Normalized | Combined¹ |
| --- | --- | --- | --- |
| Number of reads | 811,683 | 400,631 | 1,212,314 |
| Number of contaminated reads² | 4,496 | 2,506 | 7,002 |
| Average read length (bp) | 566 | 257 | 464 |
| Total reads (Mbp) | 459.54 | 103.13 | 562.67 |
| Number of assembled reads | 712,262 | 376,509 | 1,060,666 |
| Number of contigs³ | 33,870 | 28,928 | 46,220 |
| Number of contigs with only 454 reads⁴ | 21,662 | 16,949 | 33,147 |
| Average coverage | 11.99 | 6.74 | 12.63 |
| N50 (bp) | 665 | 632 | 646 |
| Average contig length (bp) | 621 | 502 | 567 |
| Number of contigs with ORF⁵ | 33,625 | 28,416 | 45,583 |
| Number of Arabidopsis peptides found⁶ | 23,787 | 23,220 | 24,457 |

  1. Combined 454 reads derived from the normalized and non-normalized libraries.
  2. Reads whose best hits (E-value < 1e-10) were to non-Streptophyta sequences (NCBI nr and nt, as of February 12, 2010).
  3. Includes publicly available T. salsuginea ESTs (as of April 19, 2010).
  4. Number of contigs minus the number of contigs assembled with at least one publicly available EST.
  5. Predicted open reading frames (ORFs), including partial ORFs (Min et al. [83]).
  6. Any hit with E-value < 0.001 in a BLASTX search against the TAIR9 database (27,379 peptides).
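
Several rows of the table are related by simple arithmetic, and a short script can make those relations explicit. The sketch below is only an illustration and is not part of the original analysis: the dictionary layout and the function names are hypothetical, and it assumes that "Total reads (Mbp)" is the summed length of all reads in a library. It derives the number of contigs containing at least one public EST from footnote 4, recomputes the average read length, and expresses the Arabidopsis hits as a fraction of the 27,379 TAIR9 peptides from footnote 6.

```python
# Values copied from Table 1; the structure and names here are illustrative only.
TAIR9_PEPTIDES = 27_379  # size of the TAIR9 peptide set (footnote 6)

table = {
    "non_normalized": {"reads": 811_683, "total_mbp": 459.54,
                       "contigs": 33_870, "contigs_454_only": 21_662,
                       "arabidopsis_hits": 23_787},
    "normalized": {"reads": 400_631, "total_mbp": 103.13,
                   "contigs": 28_928, "contigs_454_only": 16_949,
                   "arabidopsis_hits": 23_220},
    "combined": {"reads": 1_212_314, "total_mbp": 562.67,
                 "contigs": 46_220, "contigs_454_only": 33_147,
                 "arabidopsis_hits": 24_457},
}


def contigs_with_public_est(row):
    """Footnote 4 rearranged: all contigs minus contigs built only from 454 reads."""
    return row["contigs"] - row["contigs_454_only"]


def mean_read_length(row):
    """Average read length in bp, assuming total_mbp is the summed read length."""
    return row["total_mbp"] * 1e6 / row["reads"]


def tair9_fraction(row):
    """Share of TAIR9 peptides with at least one BLASTX hit."""
    return row["arabidopsis_hits"] / TAIR9_PEPTIDES


if __name__ == "__main__":
    for name, row in table.items():
        print(f"{name}: "
              f"contigs with public EST = {contigs_with_public_est(row)}, "
              f"mean read length = {mean_read_length(row):.0f} bp, "
              f"TAIR9 fraction = {tair9_fraction(row):.1%}")
```

Under these assumptions, the recomputed average read lengths round to the values reported in the table, which suggests the "Total reads (Mbp)" row is consistent with the read counts.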