TY - JOUR A1 - Baek, Daehyun A1 - Davis, Colleen A1 - Ewing, Brent A1 - Gordon, David A1 - Green, Phil T1 - Characterization and predictive discovery of evolutionarily conserved mammalian alternative promoters Y1 - 2007/02/01 JF - Genome Research JO - Genome Research SP - 145 EP - 155 DO - 10.1101/gr.5872707 VL - 17 IS - 2 UR - http://genome.cshlp.org/content/17/2/145.abstract N2 - Recent studies suggest that surprisingly many mammalian genes have alternative promoters (APs); however, their biological roles, and the characteristics that distinguish them from single promoters (SPs), remain poorly understood. We constructed a large data set of evolutionarily conserved promoters, and used it to identify sequence features, functional associations, and expression patterns that differ by promoter type. The four promoter categories CpG-rich APs, CpG-poor APs, CpG-rich SPs, and CpG-poor SPs each show characteristic strengths and patterns of sequence conservation, frequencies of putative transcription-related motifs, and tissue and developmental stage expression preferences. APs display substantially higher sequence conservation than SPs and CpG-poor promoters than CpG-rich promoters. Among CpG-poor promoters, APs and SPs show sharply contrasting developmental stage preferences and TATA box frequencies. We developed a discriminator to computationally predict promoter type, verified its accuracy through experimental tests that incorporate a novel method for deconvolving mixed sequence traces, and used it to find several new APs. The discriminator predicts that almost half of all mammalian genes have evolutionarily conserved APs. This high frequency of APs, together with the strong purifying selection maintaining them, implies a crucial role in expanding the expression diversity of the mammalian genome. ER -