Table 1 Summary of assembly and annotation of nucleotide sequence data from Chironex fleckeri tentacle tissue

From: Transcriptome and venom proteome of the box jellyfish Chironex fleckeri

Assembly  
Raw reads (paired-end) 43,150,858
After clipping and QC 23,370,860
Contigs 34,438
Average length ± SD (bp) 1,056 ± 1,359
Length min - max (bp) 100 - 26,403
% GC content 38.88
Raw reads mapped to contigs 13,052,970 (56%)
ORFs  
Transcripts with significant BLAST hit (10e-5) 13,736 (40%)
Containing an open reading frame 20,548 (60%)
With homologues in:  
Nematostella vectensis 12,143 (35%)
GenBank nr proteins (Cnidaria) 13,035 (38%)
Hydra magnipapillata 11,681 (34%)
SwissProt 11,123 (32%)
UniProt venom and toxins database 455 (1%)
Matching CEGMA core eukaryotic proteins  
% Full length (>90% cover) 77.02
% Partial (<90% cover) 80.65
Interproscan  
Returning Pfam terms 10,653 (31%)
Returning GO terms 7,208 (21%)
Total GO terms 17,203
Biological Process 5,060
Cellular Component 2,745
Molecular Function 9,398
Signal sequence and transmembrane domains  
Predicted proteins with signal sequences 930 (3%)
Predicted proteins with ≥ 2 transmembrane domains 1,332 (4%)
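
The table does not state its denominators, so the percentages can be hard to interpret. The sketch below recomputes them from the counts above under two assumptions that are not given in the source: that contig-level percentages are relative to the 34,438 contigs, and that the mapped-read percentage is relative to the 23,370,860 reads remaining after clipping and QC. The variable and function names are illustrative only.

```python
# Counts are copied from Table 1. The denominators are assumptions,
# chosen because they reproduce the rounded percentages in the table.
CONTIGS = 34_438
READS_AFTER_QC = 23_370_860


def pct(count: int, total: int) -> int:
    """Return count as a percentage of total, rounded to the nearest integer."""
    return round(100 * count / total)


contig_counts = {
    "significant BLAST hit": 13_736,
    "open reading frame": 20_548,
    "Nematostella vectensis": 12_143,
    "GenBank nr proteins (Cnidaria)": 13_035,
    "Hydra magnipapillata": 11_681,
    "SwissProt": 11_123,
    "UniProt venom and toxins": 455,
    "Pfam terms": 10_653,
    "GO terms": 7_208,
    "signal sequences": 930,
    ">= 2 transmembrane domains": 1_332,
}

# Percentage of contigs in each category (assumed denominator: all contigs).
contig_pct = {label: pct(n, CONTIGS) for label, n in contig_counts.items()}

# Percentage of reads mapped (assumed denominator: reads after clipping and QC).
mapped_pct = pct(13_052_970, READS_AFTER_QC)

# The three GO categories should sum to the reported total.
go_total = 5_060 + 2_745 + 9_398

for label, value in contig_pct.items():
    print(f"{label}: {value}%")
print(f"reads mapped to contigs: {mapped_pct}%")
print(f"total GO terms: {go_total}")
```

Under these assumptions every rounded percentage agrees with the table, and the three GO categories sum to the reported total of 17,203. Using the raw read count as the denominator for mapped reads would give roughly 30% rather than 56%, which is why the post-QC count is assumed here.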