Skip to main content

Table 2 Human pathogens/parasites and big data. The available number of nucleotide and protein sequence records at NCBI Entrez Databases (as of January 2021) are indicated for select groups of pathogens/parasites. It should be noted that not all the species that are part of the taxonomic groups listed here maybe pathogens/parasites of human

From: A systematic bioinformatics approach for large-scale identification and characterization of host-pathogen shared sequences

Pathogen/Parasite Sequence Data (# Records)
Nucleotides Proteins
Viruses 3,554,899 7,360,073
Bacteria 68,010,589 773,862,087
Archaea 914,730 6,836,105
Fungi 13,642,359 25,043,777
Plasmodium 567,181 662,147
Amoebozoa 802,451 317,693
Trichomonas 245,291 121,145
Trypanosoma 425,666 408,483
Platyhelminthes (Flatworm) 3,326,558 787,530
Nematodes 4,321,847 1,776,489
Acanthocephalans 8863 2669
Hirudinea (Leeches) 255,634 52,331