Table 2.

Comparison of the gerbera UniGene collection with other sequence collections from whole genomes, partial genomes or large EST projects


Challenge

Matches to database (%)

Unique to database
Arabidopsis thaliana proteome 4799 (59.3) 11
Oryza sativa proteome 1252 (15.5) 0
Populus trichocarpa genome 798 (9.9) 2
Medicago truncatula BAC collection 2132 (26.3) 13
Pooled Asterid UniGene sequences 4652 (57.4) 373
Pooled Eurosid sequences 3355 (41.4) 25
Pooled Caryophyllid sequence 1612 (20.0) 1
Pooled Monocot sequences 1819 (22.5) 12
GenBank sequence database 4867 (60.1) 46
Unassigned Gerbera sequences

1656

[i] Gerbera UniGene sequences were mapped to the query databases using a BLAST algorithm, and the results were filtered arbitrarily at 1e-10. The number of gerbera sequences that can be mapped to the query collection is shown along with this value, expressed as a percentage of all gerbera sequences. Also shown is the number of sequences that are unique to the query database and have no homolog elsewhere within the experiment.