Transcription induces strand specific mutations at the 5' end of human genes

  1. Paz Polak1 and
  2. Peter F Arndt
  1. Max Planck Institute for Molecular Genetics

Abstract

A regional analysis of nucleotide substitution rates along human genes and their flanking regions allows us to quantify the effect of mutational mechanisms associated with transcription in germline cells. Our analysis reveals three distinct patterns of substitution rates. First, a sharp decline in CpG methylation deamination rate, which is observed in the vicinity of the 5'end of genes. Second, a strand asymmetry in complementary substitution rates, which extends from the 5'end to 1 kbp downstream to the 3'end, associated with transcription coupled repair. Finally, a localized strand asymmetry, an excess of C->T over G->A substitution in the non-template strand confined to the first 1-2 kbp downstream of the 5'end of genes. We hypothesize that higher exposure of the non-template strand near the 5'end of genes leads to a higher cytosine deamination rate. Up to now only the somatic hyper mutation (SHM) pathway has been known to mediate localized and strand specific mutagenic processes associated with transcription in mammalia. The mutational patterns in SHM are induced by cytosine deaminase that just targets single stranded DNA. This DNA conformation is induced by R-loops, which preferentially occur at the 5'ends of genes. We predict that R-loops are extensively formed in the beginning of transcribed regions in germline cells.

Footnotes

    • Received January 29, 2008.
    • Accepted April 16, 2008.
ACCEPTED MANUSCRIPT

This Article

  1. Genome Res. gr.076570.108 Copyright © 2008, Cold Spring Harbor Laboratory Press

Article Category

Share

Preprint Server