Table 1 Summary of the pre-processing pipeline in the molluscan transcriptomic libraries

From: Comparative transcriptomics enlarges the toolkit of known developmental genes in mollusks

| Organism | No. of reads^a before pre-processing | No. of reads^a after pre-processing | No. of reads^a excluded |
|---|---|---|---|
| Gymnomenia pellucida (Neomeniomorpha) | 53,751,440 | 50,292,634 (93.57 %) | 3,458,806 (6.43 %) |
| Wirenia argentea (Neomeniomorpha) | 50,456,889 | 41,678,466 (82.60 %) | 8,778,423 (17.40 %) |
| Scutopus ventrolineatus (Chaetodermomorpha) | 43,492,046 | 40,596,155 (93.34 %) | 2,895,891 (6.66 %) |
| Acanthochitona crinita (Polyplacophora) | 35,737,364 | 33,695,610 (94.29 %) | 2,041,754 (5.71 %) |
| Idiosepius notoides^b (Cephalopoda) | 588,878 | 588,878 (100 %) | - |
| Idiosepius notoides (Cephalopoda) | 38,267,214 | 35,131,600 (91.81 %) | 3,135,614 (8.19 %) |
| Lottia cf. kogamogai^b (Gastropoda) | 402,814 | 402,814 (100 %) | - |
| Nucula tumidula (Bivalvia) | 40,797,848 | 38,849,372 (95.22 %) | 1,948,476 (5.42 % of reads not applicable here: 4.78 %) |
| Antalis entalis (Scaphopoda) | 24,194,021 | 22,881,795 (94.58 %) | 1,312,226 (5.42 %) |
^a Read pairs for Illumina libraries.
^b Note that the 454 datasets were only trimmed and converted to fasta and fasta.qual files. Quality and length filtering was performed by the program MIRA4 during the assembly step.
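
The percentages in the table follow directly from the read counts before and after pre-processing. The minimal Python sketch below reproduces that arithmetic for three of the Illumina libraries. It is only an illustration, not part of the authors' pipeline, and the names `READ_COUNTS` and `summarize` are hypothetical.

```python
# Read counts copied from Table 1: (before, after) pre-processing.
# READ_COUNTS and summarize are illustrative names, not from the paper.
READ_COUNTS = {
    "Gymnomenia pellucida": (53_751_440, 50_292_634),
    "Wirenia argentea": (50_456_889, 41_678_466),
    "Antalis entalis": (24_194_021, 22_881_795),
}


def summarize(before, after):
    """Return (excluded reads, % retained, % excluded), rounded to 2 decimals."""
    excluded = before - after
    pct_retained = round(100 * after / before, 2)
    pct_excluded = round(100 * excluded / before, 2)
    return excluded, pct_retained, pct_excluded


for organism, (before, after) in READ_COUNTS.items():
    excluded, pct_retained, pct_excluded = summarize(before, after)
    print(
        f"{organism}: {after:,} retained ({pct_retained:.2f} %), "
        f"{excluded:,} excluded ({pct_excluded:.2f} %)"
    )
```

The 454 libraries are left out of the sketch because, according to the table footnote, they were not filtered at this stage, so the counts before and after are identical.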