Smed454 dataset: unravelling the transcriptome of Schmidtea mediterranea

BMC Genomics

Table 1 Summary of sequence statistics for each assembly.

SET	Contigs	Singletons	TOTAL SEQs	GC%	LENGTHs [min/median/max/avg]
90	52,885	137,213	190, 098	35.130	20	354	6812	355.78
95	52,501	137,077	189,578	---	---	---	---	---
98	52,321	137,353	189,674	35.127	20	354	6812	355.82
90e	53,867	138,766	192,633	35.108	20	358	7918	364.81

Set names are related to the corresponding homology level cutoff value (90e stands for 90% similarity including the set of NCBI Unigene ESTs). Contigs are the result of at least two sequencing reads, and singletons of only one read. GC content is the average value for all sequences. Sequence lengths are shown as minimum, median, maximum and average values in nucleotides for each set.

ISSN: 1471-2164