Table 2.

Characteristics of the Benchmark Data Sets Used in This Study

Variable SAG (BLASTX similarity) RG
Strong Moderate
No. of sequences172620
Mean sequence length (kb)16417429
No. of genes649329
Mean gene length (bp)4496458910,486
No. of exons385477191
Mean exon length (bp)197181201
No. of exons/gene6.025.136.59
No. of genes with one or two exons18243
Mean intron length (bp)6598861640
No. of gene/Mb22.9220.4749.64
Mean C + G %40.0139.5651.82
Exon density (%)2.731.896.54

[i] Exon density provides the percentage of nucleotides that occur in coding regions.