Table 5.

Misassemblies

Genome H. influenzae S. cerevisiae D. melanogaster Human 21 Human 22
Full Coverage (∼10-fold)
 Deletions251021326
 Mean length (bp)4404701660360430
 Insertions172
 Mean length (bp)350990400
 Hanging ends34
 Mean length (bp)190800
 Other31
 Misassemblies(note[ii])(note[iii])
Half Coverage (∼5-fold)
 Deletions341162641
 Mean length (bp)29037901600220340
 Insertions21223
 Mean length (bp)38067090390
 Hanging ends12421114
 Mean length (bp)784504601330826
 Other545
 misassemblies(note[iv])(note[v])(note[vi])

[i] See Figure 6A,B for an illustration of the various types of common misassemblies. Other misassemblies are as follows:

[ii] Two contigs of lengths 1 kb and 138 kb were exchanged in a supercontig. Two correct contigs of lengths 313 kb and 169 kb, which were separated in the genome by 60 bp, were glued along a 440-bp segment that appeared at the left end of the left contig, and also at the right end of the right contig, yielding a chimeric contig in the final assembly (which we call a slipped join). There was a standard misassembly (Fig. 6B), occurring in a 4.8-Mb supercontig.

[iii] At the end of a supercontig, there was a contig of length 10 kb, which aligned at a distant location in the genome relative to the rest of the supercontig.

[iv] A supercontig of 467 kb had a standard major misassembly. A supercontig of 1.4 Mb had two stray contigs of 5 kb and 2 kb. A contig had a deletion of 29 kb. Contigs of 27 kb and 37 kb were misassembled.

[v] A 3.4-Mb supercontig had a standard major misassembly. Contigs of 9 kb and 24 kb had slipped joins. A 12-kb contig at the end of a 2.6-Mb supercontig was misassembled.

[vi] A 4.5-Mb supercontig had a standard major misassembly. A 14-kb contig had a slipped join. A 303-kb supercontig was slightly misassembled near one of its ends. A supercontig had a spurious contig of length 5 kb. A 9-kb supercontig consisted of two incorrectly linked contigs.