Table 1. Repetitive sequences in the soybean genome, quantified using the difference between the contigs produced by an assembly algorithm with conservative parameters and the predictions of the Lander-Waterman model for sampling a completely non-repetitive genome

From: Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

| Number of reads in contig | Predicted by model | Observed number of contigs | Repetitive reads (Observed - predicted) |
|---|---|---|---|
| 2 | 41,126 | 42,221 | 2,189 |
| 3 | 2,511 | 9,742 | 21,693 |
| 4 | 153 | 3,498 | 13,379 |
| 5 | 9 | 1,646 | 8,183 |
| 6 | 1 | 937 | 5,619 |
| 7 | 0 | 634 | 4,438 |
| > 7 | 0 | 4,213 | 238,389 |
| Total | | | 293,890 |
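
The last column of Table 1 can be cross-checked with a short calculation. The sketch below assumes that "Repetitive reads" for a row equals (observed contigs minus predicted contigs) multiplied by the number of reads per contig; this is a reading of the column header, not a formula stated in the table. The function and variable names are illustrative. The "> 7" row is left out of the per-row check because its contigs do not share a single read count.

```python
# Rows of Table 1 with a fixed number of reads per contig:
# (reads per contig, contigs predicted by model, contigs observed,
#  repetitive reads as reported in the table)
ROWS = [
    (2, 41126, 42221, 2189),
    (3, 2511, 9742, 21693),
    (4, 153, 3498, 13379),
    (5, 9, 1646, 8183),
    (6, 1, 937, 5619),
    (7, 0, 634, 4438),
]
REPORTED_OVER_7 = 238389   # "> 7" row, taken as reported
REPORTED_TOTAL = 293890    # "total" line of the table


def excess_reads(reads_per_contig, predicted, observed):
    """Reads in contigs beyond the model's expectation.

    Assumption: excess contigs times reads per contig.
    """
    return (observed - predicted) * reads_per_contig


def recompute(rows=ROWS):
    """Return (reads per contig, recomputed value, reported value) per row."""
    return [(n, excess_reads(n, p, o), rep) for n, p, o, rep in rows]


def reported_sum(rows=ROWS, over_7=REPORTED_OVER_7):
    """Sum the reported repetitive-read counts, including the "> 7" row."""
    return sum(rep for _, _, _, rep in rows) + over_7


if __name__ == "__main__":
    for n, calc, rep in recompute():
        print(f"{n} reads/contig: recomputed {calc}, reported {rep}, diff {calc - rep}")
    print("sum of reported column:", reported_sum(), "table total:", REPORTED_TOTAL)
```

Under this assumption the recomputed values agree with the reported ones to within a few reads per row, which would be consistent with the model predictions having been rounded to whole contigs for display. The reported column sums exactly to the stated total of 293,890.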