Gene Duplication and the Structure of Eukaryotic Genomes

Table 1.

Number and Distribution of Genes and Families for Three Eukaryotic Genomes

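| Genome/Cutoff     | No. of genes | No. of families | Genes in families (paranome): total no. | Mean | Median | Maximum | Skewness |
|-------------------|--------------|-----------------|-----------------------------------------|------|--------|---------|----------|
| C. elegans/10^-20 | 18890        | 1457            | 10256                                   | 7.0  | 3      | 1054    | 30.0     |
| C. elegans/10^-30 | 18890        | 1522            | 9060                                    | 6.0  | 2      | 382     | 13.6     |
| C. elegans/10^-40 | 18890        | 1505            | 7960                                    | 5.3  | 2      | 218     | 11.1     |
| C. elegans/10^-50 | 18890        | 1520            | 7077                                    | 4.7  | 2      | 156     | 10.3     |
| C. elegans/10^-60 | 18890        | 1472            | 6257                                    | 4.3  | 2      | 109     | 7.9      |
| Drosophila/10^-50 | 12860        | 824             | 2967                                    | 3.6  | 2      | 35      | 4.1      |
| Yeast/10^-50      | 5786         | 503             | 1440                                    | 2.9  | 2      | 44      | 8.2      |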
  • (Cutoff) Parameter in the BLAST algorithm that sets the strictness of the search; the smaller the value, the stricter the search (fewer matches). (No. of families) Total number of families (>1 gene per family) in the data set. (Genes in families) Total number of genes in all families. Mean, median, maximum, and skewness are computed over family sizes (number of genes per family).
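
The summary statistics described in the footnote can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `family_size_summary` and the toy list of family sizes are hypothetical, and the skewness estimator (third central moment divided by the 1.5 power of the population variance) is an assumption, since the table does not say which estimator was used.

```python
from statistics import mean, median


def family_size_summary(sizes):
    """Summarise a list of gene-family sizes (each family has >1 gene)."""
    n = len(sizes)
    m = mean(sizes)
    # Population central moments. ASSUMPTION: the table may have used a
    # different (e.g. bias-corrected) skewness estimator.
    m2 = sum((s - m) ** 2 for s in sizes) / n
    m3 = sum((s - m) ** 3 for s in sizes) / n
    skew = m3 / m2 ** 1.5 if m2 > 0 else 0.0
    return {
        "families": n,                   # No. of families
        "genes_in_families": sum(sizes), # Genes in families, total no.
        "mean": m,
        "median": median(sizes),
        "maximum": max(sizes),
        "skewness": skew,
    }


# Hypothetical toy data, NOT taken from the paper: six small families
# and one large one, giving a right-skewed size distribution.
toy_sizes = [2, 2, 2, 3, 3, 4, 12]
summary = family_size_summary(toy_sizes)

# Consistency check on values reported in Table 1: the mean family size
# should equal (genes in families) / (no. of families), to one decimal.
reported = [
    (1457, 10256, 7.0),  # C. elegans/10^-20
    (824, 2967, 3.6),    # Drosophila/10^-50
    (503, 1440, 2.9),    # Yeast/10^-50
]
means_consistent = all(
    round(genes / families, 1) == mean_size
    for families, genes, mean_size in reported
)
```

A median of 2 or 3 alongside a maximum in the hundreds, as in the C. elegans rows, is what produces the large positive skewness: most families are pairs, while a few are very large.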

This Article

  1. Genome Res. 11: 373-381
