Improved assembly of noisy long reads by k-mer validation

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 2.
Figure 2.

Sensitivity of read overlap detection with and without k-mer validation. Simulated PacBio reads from E. coli (250 pairs of 10-kb sequences with 2-kb overlaps) were subjected to standard MHAP (dashed line) or MHAP with masking of low-frequency k-mers (solid line) for overlap detection. The reference list of valid k-mers came from Illumina reads.

This Article

  1. Genome Res. 26: 1710-1720

Preprint Server