Characteristics of the Benchmark Data Sets Used in This Study
| Variable | SAG (BLASTX similarity) | RG | |||
| Strong | Moderate | ||||
| No. of sequences | 17 | 26 | 20 | ||
| Mean sequence length (kb) | 164 | 174 | 29 | ||
| No. of genes | 64 | 93 | 29 | ||
| Mean gene length (bp) | 4496 | 4589 | 10,486 | ||
| No. of exons | 385 | 477 | 191 | ||
| Mean exon length (bp) | 197 | 181 | 201 | ||
| No. of exons/gene | 6.02 | 5.13 | 6.59 | ||
| No. of genes with one or two exons | 18 | 24 | 3 | ||
| Mean intron length (bp) | 659 | 886 | 1640 | ||
| No. of gene/Mb | 22.92 | 20.47 | 49.64 | ||
| Mean C + G % | 40.01 | 39.56 | 51.82 | ||
| Exon density (%) | 2.73 | 1.89 | 6.54 | ||
[i] Exon density provides the percentage of nucleotides that occur in coding regions.