ARACHNE: A Whole-Genome Shotgun Assembler

Table 4.

Base Pair Accuracy

Genome H. influenzae S. cerevisiae D. melanogaster Human 21 Human 22
Full Coverage (∼10-fold)
 Average quality 45.3 43.6 43.4 42.8 41.3
 Fraction of bases assigned quality 40 99.6% 99.6% 99.6% 99.6% 99.5%
 Average quality of bases reported to   have quality score 40 (all correct) 49.5 51.5 51.3 47.6
Half Coverage (∼5-fold)
 Average quality 32.3 32.6 33 32.3 32.1
 Fraction of bases assigned quality 40 95.1% 95.0% 95.2% 95.2% 95.1%
 Average quality of bases reported to   have quality score 40 44.0 42.6 48.2 44.2 43.6
  • The average quality is defined as: −10 log10 [(# of mismatches, insertions, and deletions)/(# of bases in contigs)]. Thus, bases of quality 30 are 99.9% accurate, bases of quality 40 are 99.99% accurate, and so on. ARACHNE assigns a maximum quality score of 40. The table reports the average quality for the assembly (obtained by comparison with the known genome sequence), the percentage of bases assigned the maximal score, and the actual quality of the bases assigned the maximal score.

This Article

  1. Genome Res. 12: 177-189

Preprint Server