Large-Scale Clustering of cDNA-Fingerprinting Data

Table 1.

Distribution of Gene Copies for Simulation

Cluster size
1 2–5 6–10 11–100 >100 total
No. of cluster 500 144 21 16 17 698
No. of clones 500 388 156 381 4884 6309
Percent of sample 7.93 6.15 2.47 6.03 77.41 100
  • A total of 665 out of 698 genes are assigned a copy rate <11 (including 500 singletons) corresponding to a total number of 1044 clones; 17 genes get a copy rate >100 by a random number between 100 and 500. The biggest of our simulated cluster has a copy rate of 491 clones.

This Article

  1. Genome Res. 9: 1093-1105

Preprint Server