
Table 5 Open reading frame and transcript prediction analysis of sugarcane transcriptome sequence data

From: A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing

| ORF prediction | PacBio transcript isoforms | De novo transcript contigs |
| --- | --- | --- |
| ORF containing transcripts | 100,639 | 491,544 |
| Retained transcripts (a) | 96,114 | 355,453 |
| Min length | 300 | 300 |
| Max length | 8,142 | 9,501 |
| N50 | 1,158 | 738 |

| Evigene prediction | PacBio transcript isoforms | De novo transcript contigs | Unigenes | SoGI |
| --- | --- | --- | --- | --- |
| Total transcripts | 51,025 | 83,041 | 13,205 | 41,042 |
| Main transcripts | 25,012 | 56,766 | 13,205 | 32,013 |
| Alternate transcripts | 26,013 | 26,275 | 0 | 9,029 |
| Ave length 1 K proteins (b) | 1,348 | 298 | 298 | 287 |

(a) Transcripts with Pfam and Viridiplantae hits. (b) Average length (aa) of the largest 1,000 proteins.