Distribution of transcripts in the germline SAGE library and the germline-specific/enriched dataset identified. (A) The distribution of tag counts in the germline SAGE library obeys a power law. More genes were identified as being expressed at low transcript levels, whereas significantly fewer genes are expressed at modest to high transcript levels. However, considerably higher numbers of tags are identified by the small subset of genes expressed at relatively high levels. Only 52 genes were identified to have a tag counts > 254, whereas they account for ~33% the total tags identified in the germline library. (B) The portion of the germline library identified to be germline-specific/enriched increases with increasing tag counts. Tag count ≥ 9 was chosen as a cut off to increase the confidence of the germline-specific/enriched gene set. With tag count ≥ 9, 1063 out of 1407 (~75%) of the germline library genes were identified to be germline-specific or germline-enriched.