Table 2.

Sequence Representation within Various Size Clusters for TOGA 3.0

Cluster size (sequences) Number of clusters Total number of sequences Clusters with both orthologs and paralogs
 310,94332,829
 4725029,000
 5510325,515
 6295917,754167
 7183212,824193
 811839464191
 97827038196
105415410142
113884268117
12321385295
132593367102
14210294079
15150225075
16127203265
17118200668
1883149459
1956106445
2050100043
2149102944
223781436
233375930
242867225
253587534
262052020
271745917
281644816
291029010
301030010
3192799
321238412
3341324
3451705
3551755
362722
371371
402802
411411
431431
Total32,652116,4131921

[i] Clusters containing multiple sequences from a single species are considered to contain both orthologs and paralogs.