Skip to main content

Table 2 Numbers of parasite-derived contigs in publicly available genome and transcriptome assemblies

From: Parasite infection of public databases: a data mining approach to identify apicomplexan contaminations in animal genome and transcriptome assemblies

Host species

WGS/TSA ID

Assembly type

# parasite-derived contigs

# sequences in dataset 1

# sequences in dataset 2

Helicoverpa assulta

GBTA01

transcriptome

8347

370

208

Colinus virginianus

AWGU01

genome

4013

793

244

Colinus virginianus a

AWGT01

genome

3098

-

-

Ornithorhynchus anatinus c

AAPN01

genome

1397 (119)

540

178

Helicoverpa armigera

GBDM01

transcriptome

1137

160

102

Teleopsis dalmanni

GBBP01

transcriptome

919

339

171

Capra hircus

GAOJ01

transcriptome

405

107

63

Annulipalpia sp.

GATX01

transcriptome

226

81

57

Gorilla gorilla gorilla c

CABD02 (CABD03)

genome

148 (3)

33

15

Camelus dromedarius

GADZ01

transcriptome

148

35

25

Anolis carolinensis

GBBS01

transcriptome

120

54

33

Anolis carolinensis a

GAFN01

transcriptome

119

-

-

Dendroctonus frontalis b

GAFI01

transcriptome

114

-

-

Dastarcus helophoroides

GBCX01

transcriptome

104

29

21

Odocoileus virginianus

AEGY01

genome

98

34

11

Odocoileus virginianus a

AEGZ01

genome

98

-

-

Motis davidii

ALWT01

genome

66

9

-

Anolis carolinensis a

GAFD01

transcriptome

62

-

-

Orchesella cincta

GAMM01

transcriptome

61

30

27

Ixodes ricinus b

GADI01

transcriptome

56

-

-

Corydalinae sp.

GADH01

transcriptome

41

18

-

Pseudomasaris vespoides

GAXQ01

transcriptome

39

18

17

Camelus dromedarius a

GADZ01

transcriptome

24

-

-

Ixodes scapularis

ABJB01

genome

26

7

-

Homo sapiens

AADC01

genome

24

6

-

Polyxenus lagurus

GBKF01

transcriptome

21

12

-

Dendroctonus ponderosae

GAFW01

transcriptome

15

6

-

Amblyomma americanum

GAGD01

transcriptome

10

4

-

Carduelis chloris

GBCG01

transcriptome

8

-

-

Capra hircus

GAOE01

transcriptome

8

-

-

Ixodes ricinus

GANP01

transcriptome

7

5

-

Camelus bactrianus

GAEY01

transcriptome

7

2

 

Dendroctonus ponderosae a

GAFX01

transcriptome

6

-

-

Chrysochloris asiatica

AMDV01

genome

5

2

-

Cuculus canorus

JNOX01

genome

5

2

-

Bos mutus

AGSK01

transcriptome

5

1

-

Nevrorthus apatelios

GACU01

transcriptome

4

3

-

Fulmarus glacialis

JJRN01

genome

4

2

-

Forficula auricula

GAAX01

transcriptome

4

3

-

Serinus canaria

CAVT01

genome

3

2

-

Capra hircus

GAFC01

transcriptome

3

2

-

Balaenoptera bonaerensis

BAUQ01

genome

2

1

-

Blattela germanica

GBID01

transcriptome

2

-

-

Folsomia candida

GAMN01

transcriptome

2

-

-

Carabus granulatus

GACW01

transcriptome

1

-

-

Capra hircus

GAOG01

transcriptome

1

-

-

Nemurella pictetii

GAAV01

transcriptome

1

-

-

Anolis carolinensis

GADN01

transcriptome

1

-

-

Phaedon cochleariae

GAPU01

transcriptome

1

-

-

Gluvia dorsalis

GDAP01

transcriptome

1

-

-

Rhipicephalus microplus

ADMZ02

genome

1

-

-

  1. aAssembly was not used in phylogenetic analyses because it is based on the same raw data as another assembly
  2. bAssembly was not used in phylogenetic analyses because it contains sequences from multiple parasite species
  3. cData based on a superseded assembly version; the number of parasite-derived contigs in the current version is given in parentheses