Table 1.

Summary of initial data and filtered orthologs sets.


(A) Initial data sets
Ensembla
UCSC genome browserb RefSeqc
Species
Version
Genes
Introns
Version
Genes
Introns
humand v19.34a 33,633 284,125 HGv16/NCBI34 21,744 206,814
mousee v19.30 30,665 218,163 MGSCv4/NCBI32 17,988 139,258
ratf v19.3a 28,545 192,459 RGSCv3.1 4877 43,393
chickeng v22.1.1 28,491 252,226 CGSCv2 1496 12,632

(B) Filtered orthologs

Sets
Genes
Introns

Total human 6043 48,939 (out of 51,876)
mouse 5680 45,543 (out of 47,193)
rat 1847 13,929 (out of 14,245)
Orthologs human/mouse 5550 44,119
human/rat 1737 13,259
mouse/rat 1416 9655
Triads
human/mouse/rat
1283
8895

[i] (A) Initial data sets: the initial pool of genes/introns from which we filtered all the data sets for this work (aBirney et al. 2004; bKarolchik et al. 2003; cPruitt et al. 2003; dLander et al. 2001; eWaterston et al. 2002; fRat Genome Sequencing Project Consortium 2004; gInternational Chicken Genome Sequencing Consortium 2004).

[ii] (B) Filtered orthologs: the number of RefSeq orthologous genes and introns derived from these data sets.