Table 1.

Number and Distribution of Genes and Families for Three Eukaryotic Genomes

Genome/Cutoff No. of genes No. of families Genes in families (paranome)
total no. mean median maximum skewness
C. elegans/10−20 188901457102567.03105430.0
C. elegans/10−30 18890152290606.0238213.6
C. elegans/10−40 18890150579605.3221811.1
C. elegans/10−50 18890152070774.7215610.3
C. elegans/10−60 18890147262574.321097.9
Drosophila/10−50 1286082429673.62354.1
Yeast/10−50 578650314402.92448.2

[i] (Cutoff) parameter in blast algorithm that corresponds to strictness of search; the smaller the value, the stricter the search (less matches). (No. of families) the total number of families (>1 gene per family) in the data set. (Genes in families) total number of genes in all families. Mean, median, maximum, and skewness are computed for family sizes (number of genes).