Table 2.

Comparison of Draft Sequence Assemblies Across D4S394-D4S403 Region

Assembly NCBI1 NCBI2[i] UCSC1 UCSC2 CELERA HYBRID
Version9/2/0016/4/019/1/015/4/01publicNA
Length[ii](bp)4,220,0597,982,7906,597,8595,725,6833,359,2243,510,128
Contigs910348137
Gaps in ctgs3734664203251970
Gaps (bp)37,30046,600590,800631,400472,2940
Framework[iii]comparison
 Duplications[iv] 33130NA
 Deletions[v] 581010623
 Rearrangements[vi] 12131151NA
 Misassemblies/Mb[vii] 4.743.013.333.142.08NA
NR[viii] coverage
 NR fragments396144274383351617682322
 NR (bp)1,446,4411,597,8041,588,7011,292,726619,743834,492
 Coverage[ix] 0.540.590.590.480.230.31
Annotation
 Repetitive sequence (bp)2,395,0193,438,9412,689,3021,989,8672,418,6851,386,794
 PRS447 (bp)0025,70725,70715,6790

[i] Modified from original version—see Methods.

[ii] Length: total length of all contigs including gaps within contigs.

[iii] Framework: a set of 107 sequences accurately ordered across the region.

[iv] Duplications: observations of the additional appearance of a marker relative to the framework set.

[v] Deletions: observations of the absence of marker or a series of contiguous markers relative to the framework set.

[vi] Rearrangements: observations of marker orders differing from the framework set that are not the result of duplications or deletions.

[vii] Miassemblies/Mb: total number of duplications, deletions and rearrangements per Mb.

[viii] NR: a nonredundant set of genomic sequence data from the region.

[ix] Coverage: the proportion of sequence from the nonredundant genomic sequence data set (NR) present in assembly.