Skip to main content

Table 1 Summary of sequence statistics for each assembly.

From: Smed454 dataset: unravelling the transcriptome of Schmidtea mediterranea

SET

Contigs

Singletons

TOTAL SEQs

GC%

LENGTHs

[min/median/max/avg]

90

52,885

137,213

190, 098

35.130

20

354

6812

355.78

95

52,501

137,077

189,578

---

---

---

---

---

98

52,321

137,353

189,674

35.127

20

354

6812

355.82

90e

53,867

138,766

192,633

35.108

20

358

7918

364.81

  1. Set names are related to the corresponding homology level cutoff value (90e stands for 90% similarity including the set of NCBI Unigene ESTs). Contigs are the result of at least two sequencing reads, and singletons of only one read. GC content is the average value for all sequences. Sequence lengths are shown as minimum, median, maximum and average values in nucleotides for each set.