Figure 1.

Illustration of the read splitting step. Read Rj is split into substrings S1, S2, S3 such that all k-mers of each Si have the same minimizer, and extra linking characters (in red) are added to each Si. The overlap between two such consecutive extended Sis is of exactly k characters.

1198f01