Table 1 Exon and splice completeness of transcript assemblies

From: The effects of sequencing depth on the assembly of coding and noncoding transcripts in the human genome

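| Read type | Feature | Read count^a | Number | Coding transcripts (%)^b, complete (100%) | Coding transcripts (%)^b, missing (0%) | Noncoding transcripts (%)^b, complete (100%) | Noncoding transcripts (%)^b, missing (0%) |
|---|---|---|---|---|---|---|---|
| Simulated short read | Exon | 10 m | 20 | 39.94 (0.1) | 56.7 (0.09) | 21.01 (0.06) | 77.68 (0.07) |
| Simulated short read | Exon | 20 m | 10 | 47.04 (0.07) | 50.0 (0.08) | 25.14 (0.15) | 73.21 (0.15) |
| Simulated short read | Exon | 50 m | 4 | 54.9 (0.11) | 42.43 (0.12) | 31.22 (0.1) | 66.62 (0.1) |
| Simulated short read | Exon | 100 m | 2 | 59.85 (0.03) | 37.63 (0.01) | 35.98 (0.02) | 61.37 (0.05) |
| Simulated short read | Exon | 200 m | 1 | 63.86 (0.0) | 33.8 (0.0) | 40.82 (0.0) | 55.98 (0.0) |
| Simulated short read | Splice | 10 m | 20 | 39.94 (0.1) | 56.73 (0.09) | 21.01 (0.06) | 77.79 (0.08) |
| Simulated short read | Splice | 20 m | 10 | 47.04 (0.07) | 50.03 (0.08) | 25.14 (0.16) | 73.37 (0.16) |
| Simulated short read | Splice | 50 m | 4 | 54.9 (0.11) | 42.47 (0.12) | 31.22 (0.1) | 66.86 (0.1) |
| Simulated short read | Splice | 100 m | 2 | 59.85 (0.03) | 37.65 (0.01) | 35.98 (0.02) | 61.65 (0.07) |
| Simulated short read | Splice | 200 m | 1 | 63.85 (0.0) | 33.83 (0.0) | 40.82 (0.0) | 56.37 (0.0) |
| Long read | Exon | 25% | 4 | 61.15 (0.15) | 34.66 (0.17) | 39.27 (0.13) | 53.62 (0.18) |
| Long read | Exon | 50% | 2 | 76.28 (0.06) | 20.0 (0.05) | 58.78 (0.02) | 33.27 (0.05) |
| Long read | Exon | 75% | 1 | 87.54 (0.0) | 10.05 (0.0) | 77.33 (0.0) | 17.01 (0.0) |
| Long read | Splice | 25% | 4 | 61.0 (0.14) | 34.68 (0.17) | 39.35 (0.15) | 53.79 (0.16) |
| Long read | Splice | 50% | 2 | 76.12 (0.04) | 20.02 (0.05) | 58.87 (0.04) | 33.37 (0.08) |
| Long read | Splice | 75% | 1 | 87.35 (0.0) | 10.06 (0.0) | 77.34 (0.0) | 17.08 (0.0) |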

Splice or exon completeness was estimated as the percentage of the reference transcript's splices or exons that are correctly assembled in the new assembly. Transcripts with partial splice or exon assembly are not shown in the table. For multiple samples, the average percentages are presented, with the standard deviations in parentheses.

^a For simulated short reads, read counts are in millions of paired reads, whereas for long-read samples, read counts are given as a percentage of the total alignments.

^b Completeness is given as the average of the percentages of reference transcripts. The values in parentheses are the standard deviations computed from multiple samples.
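
The completeness measure described in the table notes can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the representation of exons as `(start, end)` tuples, the treatment of "correctly assembled" as an exact coordinate match, and the use of the population standard deviation (so that a single sample yields 0.0) are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def exon_completeness(reference_exons, assembled_exons):
    """Percentage of a reference transcript's exons found in the assembly.

    Assumption: exons are (start, end) tuples and an exon counts as
    correctly assembled only if the same tuple is in the assembled set.
    """
    if not reference_exons:
        raise ValueError("reference transcript has no exons")
    recovered = sum(1 for exon in reference_exons if exon in assembled_exons)
    return 100.0 * recovered / len(reference_exons)


def summarize_sample(completeness_values):
    """Return (% of transcripts fully complete, % of transcripts fully missing).

    Transcripts with partial completeness count toward neither figure,
    mirroring the note that partial assemblies are not shown in the table.
    """
    n = len(completeness_values)
    if n == 0:
        raise ValueError("no transcripts to summarize")
    complete = 100.0 * sum(1 for v in completeness_values if v == 100.0) / n
    missing = 100.0 * sum(1 for v in completeness_values if v == 0.0) / n
    return complete, missing


def summarize_across_samples(samples):
    """Average the per-sample percentages and report their spread.

    Assumption: population standard deviation (pstdev), which is 0.0
    when there is only one sample.
    """
    per_sample = [summarize_sample(values) for values in samples]
    completes = [c for c, _ in per_sample]
    missings = [m for _, m in per_sample]
    return {
        "complete": (mean(completes), pstdev(completes)),
        "missing": (mean(missings), pstdev(missings)),
    }
```

Each entry of `samples` is a list of per-transcript completeness percentages for one sample; the returned pairs correspond to the "mean (standard deviation)" layout used in the table cells.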