Skip to main content

Table 1 Assembly statistics for the CamBac1 (GCF_000767855.1) and CamFer1 (GCF_000311805.1) and after improvement (CamBac2 and CamFer2, respectively) with reference-guided assembly with Ragout [16] using Progressive Cactus [17] alignments to CamDro3 then filling in gaps with GapFiller [18]

From: Nucleotide diversity of functionally different groups of immune response genes in Old World camels based on newly annotated and reference-guided assemblies

Assembly

 

CamBac1

CamBac2

CamFer1

CamFer2

Total size

1,992,663,268

2,039,590,309

2,009,194,609

2,086,258,888

Gap length

13,666,687

57,965,943

23,778,176

99,159,843

Scaffolds

 Number

35,455

33,593

13,334

9158

 Longest

46,538,883

122,729,119

15,735,958

123,639,755

 N90a

1,821,536

24,994,512

341,469

25,431,863

 L90b

255

29

1167

30

 N50a

8,812,066

68,446,253

2,005,940

69,671,486

 L50b

68

11

274

11

Contigsc

 Number

67,435

56,044

68,872

66,352

 Longest

1,143,031

2,938,098

853,441

1,096,594

 N90

29,656

43,365

16,267

16,886

 L90

15,603

10,214

25,475

23,951

 N50

139,019

219,031

90,263

97,198

 L50

3963

2415

5814

5272

 Single-copy BUSCOsd

3827

3835

3796

3816

 Duplicated BUSCOs

22

18

48

32

 Fragmented BUSCOs

164

157

175

168

 Missing BUSCOs

91

94

85

88

  1. aN90/N50 are the scaffold or contig lengths such that the sum of the lengths of all scaffolds or contigs of this size or larger is equal to 90/50% of the total assembly length
  2. bL90/L50 are the smallest number of scaffolds or contigs that make up at least 90/50% of the total assembly length
  3. cUsing minimum gap length of 10 bp
  4. dBUSCOs: Benchmarking Universal Single-Copy Orthologs [19] are mammalian BUSCOs from OrthoDB v. 9.1 genes [20]