
Table 1 Transcript annotation and analysis of full-length transcript reconstruction

From: Characterization of a second secologanin synthase isoform producing both secologanin and secoxyloganin allows enhanced de novo assembly of a Catharanthus roseus transcriptome

| Assembly | ccOrcae | mpgrCra | NIPGR | PMS454 | PMSIllu | CD97 | CD97 best clusters (c) | CDF97 |
|---|---|---|---|---|---|---|---|---|
| Number of transcripts | 31,450 | 86,726 | 59,220 | 26,804 | 155,305 | 543,979 | 31,055 | 58,338 |
| Total annotated transcripts (a) | 16,727 | 55,073 | 24,142 | 15,868 | 22,716 | 249,413 | 25,692 | 49,128 |
| % Annotated transcripts (a) | 53.19 | 63.50 | 40.77 | 59.20 | 14.63 | 45.85 | 82.73 | 84.21 |
| Cumulative number of proteins (b), by % length coverage | | | | | | | | |
| 100 | 4,539 | 4,115 | 3,540 | 1,546 | 3,168 | 7,155 | 5,006 | 5,734 |
| 90 | 5,686 | 5,547 | 5,124 | 2,127 | 4,125 | 9,283 | 6,198 | 7,110 |
| 80 | 6,370 | 6,626 | 6,377 | 2,624 | 4,783 | 10,903 | 6,913 | 7,926 |
| 70 | 6,855 | 7,641 | 7,469 | 3,193 | 5,381 | 12,410 | 7,466 | 8,572 |
| 60 | 7,298 | 8,703 | 8,372 | 3,827 | 5,940 | 14,048 | 7,958 | 9,135 |
| 50 | 7,724 | 9,670 | 9,119 | 4,575 | 6,574 | 16,085 | 8,350 | 9,640 |
| 40 | 8,102 | 10,528 | 9,686 | 5,457 | 7,177 | 18,610 | 8,671 | 10,033 |
| 30 | 8,479 | 11,282 | 10,141 | 6,497 | 7,793 | 21,831 | 8,908 | 10,489 |
| 20 | 8,767 | 11,744 | 10,393 | 7,367 | 8,303 | 24,619 | 9,021 | 10,520 |
| 10 | 8,816 | 11,830 | 10,438 | 7,588 | 8,430 | 25,229 | 9,044 | 11,137 |

a. BLASTX analysis against UniProtKB/Swiss-Prot. A transcript was considered annotated if it matched a protein in the database at an e-value threshold of 1e-20.

b. Cumulative number of UniProtKB/Swiss-Prot proteins matched by at least one transcript in the corresponding assembly at or above the given % length coverage.

c. Clusters in CD97 containing contigs from at least ten different single assemblies.
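
The two derived quantities in the table can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The hit tuple layout `(transcript_id, protein_id, evalue, pct_coverage)`, the function names, the inclusive `<=` comparison against the 1e-20 threshold, and the reading of "% length coverage" as the share of the protein's length covered by the best-matching transcript are all assumptions made for illustration.

```python
# Coverage thresholds used as row labels in the table (percent).
COVERAGE_BINS = (100, 90, 80, 70, 60, 50, 40, 30, 20, 10)


def percent_annotated(total_transcripts, annotated_transcripts):
    """Share of transcripts with an annotation, in percent, rounded to 2 places."""
    return round(100.0 * annotated_transcripts / total_transcripts, 2)


def cumulative_protein_counts(hits, evalue_cutoff=1e-20, bins=COVERAGE_BINS):
    """Count proteins matched at or above each coverage threshold.

    `hits` is an iterable of (transcript_id, protein_id, evalue, pct_coverage)
    tuples; this layout is hypothetical, not a fixed BLAST output format.
    """
    best_coverage = {}
    for _transcript_id, protein_id, evalue, pct_coverage in hits:
        # Assumption: a hit counts if its e-value is at or below the cutoff.
        if evalue <= evalue_cutoff:
            previous = best_coverage.get(protein_id, 0.0)
            best_coverage[protein_id] = max(previous, pct_coverage)
    # A protein covered at 95% is counted in the 90, 80, ... rows as well,
    # which is why each column of the table is non-decreasing downward.
    return {
        threshold: sum(1 for cov in best_coverage.values() if cov >= threshold)
        for threshold in bins
    }


if __name__ == "__main__":
    # Values taken from the ccOrcae column of the table.
    print(percent_annotated(31450, 16727))

    # Toy hits, invented purely to exercise the function.
    toy_hits = [
        ("t1", "P1", 1e-50, 100.0),
        ("t2", "P1", 1e-30, 45.0),
        ("t3", "P2", 1e-25, 72.5),
        ("t4", "P3", 1e-5, 95.0),  # fails the e-value cutoff
    ]
    print(cumulative_protein_counts(toy_hits))
```

Keeping only the best coverage per protein means each protein is counted once per row, regardless of how many transcripts match it, which is consistent with "matched by at least one transcript" in footnote b.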