Skip to main content

Advertisement

Table 1 Transcript annotation and analysis of full-length transcript reconstruction

From: Characterization of a second secologanin synthase isoform producing both secologanin and secoxyloganin allows enhanced de novo assembly of a Catharanthus roseus transcriptome

Assembly ccOrcae mpgrCra NIPGR PMS454 PMSIllu CD97 CD97 best clustersc CDF97
Number of transcripts 31,450 86,726 59,220 26,804 155,305 543,979 31,055 58,338
Total annotated transcriptsa 16,727 55,073 24,142 15,868 22,716 249,413 25,692 49,128
% Annotated transcriptsa 53.19 63.50 40.77 59.20 14.63 45.85 82.73 84.21
  Cumulative number of proteinsb
% Length coverage         
100 4,539 4,115 3,540 1,546 3,168 7,155 5,006 5,734
90 5,686 5,547 5,124 2,127 4,125 9,283 6,198 7,110
80 6,370 6,626 6,377 2,624 4,783 10,903 6,913 7,926
70 6,855 7,641 7,469 3,193 5,381 12,410 7,466 8,572
60 7,298 8,703 8,372 3,827 5,940 14,048 7,958 9,135
50 7,724 9,670 9,119 4,575 6,574 16,085 8,350 9,640
40 8,102 10,528 9,686 5,457 7,177 18,610 8,671 10,033
30 8,479 11,282 10,141 6,497 7,793 21,831 8,908 10,489
20 8,767 11,744 10,393 7,367 8,303 24,619 9,021 10,520
10 8,816 11,830 10,438 7,588 8,430 25,229 9,044 11,137
  1. a,Blastx analysis vs UniprotKB/Swiss-Prot. A transcript was considered to be annotated if it matches a protein in the database at a e-value threshold of 1e-20
  2. b,Reports the cumulative number of proteins in UniprotKB/Swiss-Prot matched by at least one transcript in the corresponding assembly at a given % coverage
  3. c,Clusters in CD97 having contigs from at least ten different single assemblies