Sequence Comparison of Human and Mouse Genes Reveals a Homologous Block Structure in the Promoter Regions

  1. Yutaka Suzuki1,3,
  2. Riu Yamashita1,
  3. Matsuyuki Shirota1,
  4. Yuta Sakakibara1,2,
  5. Joe Chiba2,
  6. Junko Mizushima-Sugano1,
  7. Kenta Nakai1, and
  8. Sumio Sugano1
  1. 1 Human Genome Center, The Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, 108-8639, Japan
  2. 2 Department of Biological Science and Technology, Science University of Tokyo, Noda-shi, Chiba, 278-8510, Japan

Abstract

Comparative sequence analysis was carried out for the regions adjacent to experimentally validated transcriptional start sites (TSSs), using 3324 pairs of human and mouse genes. We aligned the upstream putative promoter sequences over the 1-kb proximal regions and found that the sequence conservation could not be further extended at, on average, 510 bp upstream positions of the TSSs. This discontinuous manner of the sequence conservation revealed a “block” structure in about one-third of the putative promoter regions. Consistently, we also observed that G+C content and CpG frequency were significantly different inside and outside the blocks. Within the blocks, the sequence identity was uniformly 65% regardless of their length. About 90% of the previously characterized transcription factor binding sites were located within those blocks. In 46% of the blocks, the 5′ ends were bounded by interspersed repetitive elements, some of which may have nucleated the genomic rearrangements. The length of the blocks was shortest in the promoters of genes encoding transcription factors and of genes whose expression patterns are brain specific, which suggests that the evolutional diversifications in the transcriptional modulations should be the most marked in these populations of genes.

Footnotes

  • [Supplemental material is available online at www.genome.org. The sequence data from this study have been submitted to DDBJ under accession nos. BP192706-BP383670.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2435604.

  • 3 Corresponding author. E-MAIL ysuzuki{at}ims.u-tokyo.ac.jp; FAX +81 3 5449 5416.

    • Accepted June 23, 2004.
    • Received February 10, 2004.
| Table of Contents

Preprint Server