Skip to main content

Table 1 Summary of correction of PacBio transcript isoform data using Illumina short-reads

From: A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing

Analysis

 

PacBio non-corrected

LoRDEC Trinity normalized reads

Total transcripts

 

107,604

107,598

Evigene prediction

Okay transcripts

18,190

51,025

Main transcripts

14,124

25,012

Alternate transcripts

4,066

26,013

GC%

61.8

51.4

CEGMA alignment (%)

 

96.4

97.98

BUSCO notation (%)

 

87.13

90.27

ORFs detected

Minimum 300 bp

243,637

252,491

ORF N50 (bp)

570

888

Protein counts covered ≥90%

 

9.727

12,611

Transcripts mapped to sorghum genome (%)

 

66.43

69.44