Table 1.

Summary of Sequence Sets Used in This Study

| Variable | SingleGene | FinishGene | DraftGene | GoldenPath |
|---|---|---|---|---|
| No. of sequences | 175 | 194 | 1038 | 156,500 |
| No. of complete genes (partial) | 175 | 206 | 116 (256) | |
| Mean sequence length (kbp) | 7 | 96 | 14 | 17 |
| No. of genes/Mbp (estimated) | 144 | 17 | 14 | (10) |
| No. of exons/complete gene (partial) | 5.0 | 7.0 | 5.7 (3.0) | |
| Mean C + G% | 49.6 | 45.1 | 45.2 | 39.9 |
| No. of aa/complete protein (partial) | 324 | 404 | 321 (170) | |

[i] Datasets are described in Methods. Some genes in the DraftGene set are represented by multiple partial genes in different draft contigs; data for these partial genes are listed in parentheses. The gene density for the GoldenPath set is an estimate that assumes 30,000 human genes in a 3000-Mbp genome.
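
The estimated GoldenPath gene density follows directly from the two assumptions in the footnote. A minimal sketch of that arithmetic is given below; the function name `genes_per_mbp` is hypothetical and used only for illustration.

```python
def genes_per_mbp(n_genes: int, genome_mbp: float) -> float:
    """Return gene density in genes per Mbp (illustrative helper, not from the paper)."""
    if genome_mbp <= 0:
        raise ValueError("genome_mbp must be positive")
    return n_genes / genome_mbp


# Assumptions taken from the table footnote: 30,000 genes in a 3000-Mbp genome.
golden_path_density = genes_per_mbp(30_000, 3000)
print(golden_path_density)  # 10.0
```

This gives the parenthesized value of 10 genes/Mbp reported for the GoldenPath set.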