Table 1.

Reduction of total sequence search space for regulatory region identification by restriction to phylogenetically conserved sequences between orthologous gene pairs.

Gene Genomic sequences Total sequence length (bp) (a) Sum of conserved regions (bp) (b) % Sequence to be searched for shared TF-binding sites.(b/a)
Human (bp) Mouse (bp)
ADA 3674129807665481666825
APEX 225272196344490598213
XRCC1 3778537349751341126015
ERCC2 5433632595869311425716
CD4 3951243508830201882823
PAX6 37862540000077862518743224
ATM 1624291164612688905882821
MYO7A 106974758251827995024727

[i] Examples of genes in which sequence search space was reduced 70% to 85%.