Percentage of data remaining as the cutoff threshold N is varied, for different values of the word size k.