Table 1.
Reduction of total sequence search space for regulatory region identification by restriction to phylogenetically conserved sequences between orthologous gene pairs.
| Gene | Human genomic sequence (bp) | Mouse genomic sequence (bp) | Total sequence length (bp) (a) | Sum of conserved regions (bp) (b) | Sequence to be searched for shared TF-binding sites (%) (b/a) |
|---|---|---|---|---|---|
| ADA | 36741 | 29807 | 66548 | 16668 | 25 |
| APEX | 22527 | 21963 | 44490 | 5982 | 13 |
| XRCC1 | 37785 | 37349 | 75134 | 11260 | 15 |
| ERCC2 | 54336 | 32595 | 86931 | 14257 | 16 |
| CD4 | 39512 | 43508 | 83020 | 18828 | 23 |
| PAX6 | 378625 | 400000 | 778625 | 187432 | 24 |
| ATM | 162429 | 116461 | 278890 | 58828 | 21 |
| MYO7A | 106974 | 75825 | 182799 | 50247 | 27 |
Examples of genes for which the sequence search space was reduced by approximately 70% to 85%.
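The b/a column can be recomputed directly from the human, mouse, and conserved-region lengths in Table 1. The following is a minimal sketch of that arithmetic, not the authors' pipeline; the names `TABLE_1` and `percent_to_search` are illustrative and do not come from the source.

```python
# Lengths in bp copied from Table 1: gene -> (human, mouse, conserved).
TABLE_1 = {
    "ADA": (36741, 29807, 16668),
    "APEX": (22527, 21963, 5982),
    "XRCC1": (37785, 37349, 11260),
    "ERCC2": (54336, 32595, 14257),
    "CD4": (39512, 43508, 18828),
    "PAX6": (378625, 400000, 187432),
    "ATM": (162429, 116461, 58828),
    "MYO7A": (106974, 75825, 50247),
}


def percent_to_search(human_bp: int, mouse_bp: int, conserved_bp: int) -> float:
    """Return 100 * b / a, where a = human + mouse and b = conserved length."""
    total_bp = human_bp + mouse_bp  # column (a)
    return 100.0 * conserved_bp / total_bp  # column (b/a) as a percentage


if __name__ == "__main__":
    for gene, (human_bp, mouse_bp, conserved_bp) in TABLE_1.items():
        pct = percent_to_search(human_bp, mouse_bp, conserved_bp)
        # The reduction in search space is the complement of the searched share.
        print(f"{gene}: search {pct:.0f}% of sequence, reduction {100 - pct:.0f}%")
```

Computing the total from the two genomic lengths, rather than reading column (a), keeps the check independent of any transcription error in the total.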











