Skip to main content

Table 1 Summary of correction of PacBio transcript isoform data using Illumina short-reads

From: A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing

Analysis   PacBio non-corrected LoRDEC Trinity normalized reads
Total transcripts   107,604 107,598
Evigene prediction Okay transcripts 18,190 51,025
Main transcripts 14,124 25,012
Alternate transcripts 4,066 26,013
GC% 61.8 51.4
CEGMA alignment (%)   96.4 97.98
BUSCO notation (%)   87.13 90.27
ORFs detected Minimum 300 bp 243,637 252,491
ORF N50 (bp) 570 888
Protein counts covered ≥90%   9.727 12,611
Transcripts mapped to sorghum genome (%)   66.43 69.44