Table 2 Statistics for the gene libraries shown in Table 1

From: EuroPineDB: a high-coverage web database for maritime pine transcriptome

| Gene library | Raw | Curated | Mean length a | Singletons | Contigs | UniGenes (% annotated) | Discarded nt (%) by QV | Discarded nt (%) by vector | Discarded nt (%) by artefacts b |
|---|---|---|---|---|---|---|---|---|---|
| Pp-454 | 913 786 | 844 737 | 227 | 471 | 54 960 | 55 431 (59.5%) | 52.5% | NA | 3.03% |
| LG0BCA | 8766 | 8766 | 608 | 3834 | 1363 | 5197 (68.2%) | NA | NA | 0.24% |
| GEMINI | 13 057 | 7916 | 458 | 3066 | 1124 | 4190 (49.9%) | 9.4% | 10.4% | 2.9% |
| SSH Xylem | 992 | 790 | 474 | 385 | 142 | 527 (49.5%) | 5.35% | 31.8% | 2.5% |
| UPM | 2806 | 1115 | 465 | 258 | 157 | 415 (31.8%) | 3.2% | 15.9% | 21.04% |
| ARG | 218 | 148 | 394 | 127 | 7 | 134 (47.8%) | 22.5% | 5.1% | 5.3% |
| SSH Lac-Pine | 351 | 231 | 350 | 210 | 8 | 218 (34.4%) | 18.5% | 4.7% | 2.64% |
| SSH Mic | 294 | 194 | 314 | 149 | 13 | 162 (38.3%) | 15.3% | 13.4% | 5.75% |
| CK16 | 358 | 282 | 575 | 221 | 24 | 245 (65.3%) | NA | 0.05% | 6.6% |
| SSH Embryos | 96 | 57 | 437 | 34 | 6 | 40 (57.5%) | 1.7% | 20.6% | 8.8% |
| Pin | 863 | 617 | 532 | 335 | 86 | 421 (68.9%) | 10.2% | 9% | 2.9% |
| EMBL v. 102 | 13 206 | 12 673 | 502 | 3704 | 1963 | 5667 (NA) | NA | 0.1% | 0.58% |
| TOTAL | 954 793 | 880 295 | | | | | | | |
| P. pinaster | 951 641 | 877 523 | 597 | 684 | 54 648 | 55 332 (59.5%) | | | |
| P. sylvestris | 2770 | 2466 | 730 | 476 | 203 | 679 (65.9%) | | | |
| P. pinea | 382 | 306 | 574 | 239 | 27 | 266 (63.2%) | | | |

QV, quality value. NA, not applicable.
a Mean lengths are calculated from the reads of each gene library, except in the last three rows (one per species), where they are calculated from contigs.
b Artefacts include poly-A, poly-T, adaptors, contaminant sequences, and chimeric inserts.
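
The table does not state how the UniGenes column is derived, but in every library row the count equals singletons plus contigs. The Python sketch below is a minimal consistency check under that assumption, with the counts copied from the table. The names `ROWS`, `unigenes_consistent`, and `curated_fraction` are hypothetical and are not part of EuroPineDB.

```python
# Counts copied from Table 2:
# (library, raw reads, curated reads, singletons, contigs, unigenes)
ROWS = [
    ("Pp-454", 913786, 844737, 471, 54960, 55431),
    ("LG0BCA", 8766, 8766, 3834, 1363, 5197),
    ("GEMINI", 13057, 7916, 3066, 1124, 4190),
    ("SSH Xylem", 992, 790, 385, 142, 527),
    ("UPM", 2806, 1115, 258, 157, 415),
    ("ARG", 218, 148, 127, 7, 134),
    ("SSH Lac-Pine", 351, 231, 210, 8, 218),
    ("SSH Mic", 294, 194, 149, 13, 162),
    ("CK16", 358, 282, 221, 24, 245),
    ("SSH Embryos", 96, 57, 34, 6, 40),
    ("Pin", 863, 617, 335, 86, 421),
    ("EMBL v. 102", 13206, 12673, 3704, 1963, 5667),
]


def unigenes_consistent(row):
    """Assumed relation: UniGenes = singletons + contigs."""
    _, _, _, singletons, contigs, unigenes = row
    return singletons + contigs == unigenes


def curated_fraction(row):
    """Share of raw reads that remain after curation."""
    _, raw, curated, *_ = row
    return curated / raw


for row in ROWS:
    print(
        f"{row[0]}: singletons + contigs matches UniGenes = "
        f"{unigenes_consistent(row)}, curated = {curated_fraction(row):.1%}"
    )
```

The curated fraction is only a convenience for comparing libraries. It counts reads, so it is not expected to match the discarded-nucleotide percentages, which are computed per nucleotide.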