Table 1 Largest families, associated Pfam keywords and family specificity

From: Trends in genome dynamics among major orders of insects revealed through variations in protein families

| # proteins | # insects | Pfam ID | Pfam name | # TP (a) | # FP (b) | # gain (c) | Spec. |
|---|---|---|---|---|---|---|---|
| 4952 | 18 | PF13465 | Zinc-finger double domain | 3150 | 1343 | 459 | 0.701 |
| 3857 | 18 | PF00069 | Protein kinase domain | 2954 | 809 | 94 | 0.785 |
| 2912 | 18 | PF00089 | Trypsin | 2895 | 0 | 17 | 1.000 |
| 2646 | 18 | PF00400 | WD domain, G-beta repeat | 2415 | 37 | 194 | 0.985 |
| 2240 | 18 | PF07679 | Immunoglobulin I-set domain | 1391 | 632 | 217 | 0.688 |
| 1940 | 18 | PF13855 | Leucine rich repeat | 1467 | 323 | 150 | 0.820 |
| 1860 | 18 | PF12796 | Ankyrin repeats (3 copies) | 1580 | 165 | 115 | 0.905 |
| 1749 | 18 | PF00067 | Cytochrome P450 | 1694 | 0 | 55 | 1.000 |
| 1721 | 18 | PF00076 | RNA recognition motif. (RRM, RBD, RNP) | 1520 | 125 | 76 | 0.924 |
| 1667 | 18 | PF00379 | Insect cuticle protein | 1599 | 1 | 67 | 0.999 |
| 1647 | 18 | PF00046 | Homeobox domain | 1400 | 147 | 100 | 0.905 |
| 1630 | 17 | PF02949 | 7tm Odorant receptor | 1507 | 0 | 123 | 1.000 |
| 1559 | 18 | PF00071 | Ras family | 1082 | 445 | 32 | 0.709 |
| 1529 | 18 | PF00001 | 7 tm receptor (rhodopsin family) | 1417 | 43 | 69 | 0.971 |
| 1160 | 18 | PF00651 | BTB/POZ domain | 1076 | 46 | 38 | 0.959 |
| 1100 | 18 | PF00083 | Sugar (and other) transporter | 973 | 105 | 22 | 0.903 |
(a) TP: true positives; (b) FP: false positives; (c) gain: unannotated proteins; Spec.: specificity.
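
The table does not spell out how Spec. is computed. The printed values are, however, consistent with Spec. = TP / (TP + FP), and each family size is consistent with # proteins = TP + FP + gain. Both relations are inferred here from the numbers, not taken from the source, and the ratio TP / (TP + FP) is more commonly called precision. A minimal Python sketch that checks this assumption against the rows of Table 1 (the function names `spec` and `check_row` are illustrative):

```python
# Rows copied from Table 1:
# (Pfam ID, # proteins, TP, FP, gain, Spec. as printed)
ROWS = [
    ("PF13465", 4952, 3150, 1343, 459, 0.701),
    ("PF00069", 3857, 2954, 809, 94, 0.785),
    ("PF00089", 2912, 2895, 0, 17, 1.000),
    ("PF00400", 2646, 2415, 37, 194, 0.985),
    ("PF07679", 2240, 1391, 632, 217, 0.688),
    ("PF13855", 1940, 1467, 323, 150, 0.820),
    ("PF12796", 1860, 1580, 165, 115, 0.905),
    ("PF00067", 1749, 1694, 0, 55, 1.000),
    ("PF00076", 1721, 1520, 125, 76, 0.924),
    ("PF00379", 1667, 1599, 1, 67, 0.999),
    ("PF00046", 1647, 1400, 147, 100, 0.905),
    ("PF02949", 1630, 1507, 0, 123, 1.000),
    ("PF00071", 1559, 1082, 445, 32, 0.709),
    ("PF00001", 1529, 1417, 43, 69, 0.971),
    ("PF00651", 1160, 1076, 46, 38, 0.959),
    ("PF00083", 1100, 973, 105, 22, 0.903),
]


def spec(tp, fp):
    """Assumed definition (inferred, not stated in the source): TP / (TP + FP)."""
    return tp / (tp + fp)


def check_row(n_proteins, tp, fp, gain, printed_spec):
    """Return (total_ok, spec_ok) for one table row under the assumed relations."""
    total_ok = (tp + fp + gain) == n_proteins
    spec_ok = round(spec(tp, fp), 3) == printed_spec
    return total_ok, spec_ok


if __name__ == "__main__":
    for pfam_id, n, tp, fp, gain, printed in ROWS:
        total_ok, spec_ok = check_row(n, tp, fp, gain, printed)
        print(f"{pfam_id}: computed={spec(tp, fp):.3f} printed={printed:.3f} "
              f"total_ok={total_ok} spec_ok={spec_ok}")
```

Under this reading, families with no false positives (Trypsin, Cytochrome P450, 7tm Odorant receptor) get a value of exactly 1.000 regardless of how many unannotated proteins they gain, since the gain column does not enter the ratio.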