Skip to main content

Table 2 Human pathogens/parasites and big data. The available number of nucleotide and protein sequence records at NCBI Entrez Databases (as of January 2021) are indicated for select groups of pathogens/parasites. It should be noted that not all the species that are part of the taxonomic groups listed here maybe pathogens/parasites of human

From: A systematic bioinformatics approach for large-scale identification and characterization of host-pathogen shared sequences

Pathogen/Parasite

Sequence Data (# Records)

Nucleotides

Proteins

Viruses

3,554,899

7,360,073

Bacteria

68,010,589

773,862,087

Archaea

914,730

6,836,105

Fungi

13,642,359

25,043,777

Plasmodium

567,181

662,147

Amoebozoa

802,451

317,693

Trichomonas

245,291

121,145

Trypanosoma

425,666

408,483

Platyhelminthes (Flatworm)

3,326,558

787,530

Nematodes

4,321,847

1,776,489

Acanthocephalans

8863

2669

Hirudinea (Leeches)

255,634

52,331