Table 4.

Contingency Table of Calculated and True Partitions

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Total
 1636115643
 2123712241
 3122911232
 412031205
 5176176
 69999
 7118486
 87777
 917116676
1042226
112424
1212223
132020
1410111215
15121114
1688
1744
18314
1933
2022
21112
2222
2322
24(*)1065143454345
  Total66927425420718310085826732282612822029

[i] Rows correspond to calculated clusters; columns correspond to true clusters. We observe a high proportion of pure calculated clusters. Only two clusters (cluster 14 and cluster 21) have a purity below 70%. Cluster 24 is marked (*). It contains singletons, i.e., clone fingerprints that have not been assigned to any of the clusters. False assignment to singletons happened in 45 cases (2.2%).