Theoretical frequency distribution of WGP sequence tags. This distribution is based on 8.2 genome equivalents of template DNA and 54% of heterozygous tags, and is a combination of two Poisson distributions for respectively the heterozygous and homozygous tags. It gives a good fit to the lower half of the observed distribution (Figure 8) and accommodates the relatively high fraction of two-copy tags in the WGP-dataset. However, other distributions with slightly higher g.e. and heterozygosity values will fit as well. The theoretical distribution assumes that all sequence tags are derived from single loci, and that no losses or errors have occurred with WGP sequencing. The relatively thick tail in the observed distribution (Figure 8) indicates that some of the tag sequences are likely to have come from duplicated loci.