Cliffy index sizes and compression ratios on the SILVA SSU NR99 data set
| Digestion | Document array profiles (Ahmed et al. 2023a), GB | Cliffy, GB | Reduction ratio | Mean no. of pairs | Variance of no. of pairs |
|---|---|---|---|---|---|
| No digestion | 2265 | 9.014 | 251× | 7.095 | 4.633 |
| DNA minimizer | 1955 | 7.858 | 249× | 7.211 | 4.732 |
| Minimizer | 1172 | 4.266 | 275× | 5.137 | 2.664 |
[i] Cliffy index sizes and compression ratios on the SILVA SSU NR99 data set (510,508 rRNA sequences, d = 9118 genera). For each digestion method, the table gives the original document array profile size, the Cliffy-compressed size, the reduction ratio, and the mean and variance of the number of pairs. The expected mean number of pairs based on the harmonic series ($H_{9118} + 1 = 10.695$) is discussed in Methods. All sizes are in gigabytes.
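The arithmetic behind the table can be checked with a short sketch. It recomputes the reduction ratios from the two size columns and evaluates the harmonic-series value quoted in the caption. The names `harmonic`, `reduction_ratio`, and `SIZES_GB` are illustrative and not part of Cliffy; the sketch only reproduces the reported numbers and does not implement the derivation given in Methods.

```python
import math

# Sizes in gigabytes, copied from the table:
# (document array profiles, Cliffy) per digestion method.
SIZES_GB = {
    "No digestion": (2265.0, 9.014),
    "DNA minimizer": (1955.0, 7.858),
    "Minimizer": (1172.0, 4.266),
}


def harmonic(n: int) -> float:
    """Return the n-th harmonic number H_n = 1 + 1/2 + ... + 1/n."""
    return math.fsum(1.0 / k for k in range(1, n + 1))


def reduction_ratio(original_gb: float, compressed_gb: float) -> float:
    """Return how many times smaller the compressed index is."""
    return original_gb / compressed_gb


# Expected mean number of pairs quoted in the caption: H_d + 1 with d = 9118.
expected_pairs = harmonic(9118) + 1

# Reduction ratios rounded to whole numbers, as in the table.
ratios = {
    method: round(reduction_ratio(original, compressed))
    for method, (original, compressed) in SIZES_GB.items()
}

if __name__ == "__main__":
    print(f"H_9118 + 1 = {expected_pairs:.3f}")
    for method, ratio in ratios.items():
        print(f"{method}: {ratio}x")
```

Rounded to the nearest integer, the computed ratios match the reduction ratio column, and the harmonic-series value agrees with the caption to three decimal places. All three observed means (7.095, 7.211, 5.137) lie below that expected value.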