...minimizers and finding taxon-specific k-mers. However, we contend that these strategies are inadequate, especially when
reference sets are taxonomically imbalanced, as are most microbial libraries. In this paper, we explore approaches for selecting a fixed-size subset of k-mers present in an ultra-large data...