
An expression library of genomic fragments to quantify the ability of ORF-encoded sequence features to regulate gene expression. (A) A schematic view of the following initial question: Does the ORF determine gene expression? (B) Scheme of the gDNA library preparation and expression measurements. (C) Measured expression distributions for all inserts (blue) and only those lacking a premature termination codon (PTC; gray). (D) Measured versus predicted expression levels in the gDNA library. Expression is predicted from the sequence of each gDNA insert using a 10-fold cross-validated linear model (R² is calculated across all test data from all cross-validations). (E) Expression predicted from the sequence of each native yeast ORF using the same features as for the gDNA library. (F) Including ORF-encoded features in a model of expression increases the ability of promoter-YFP data to predict steady-state mRNA levels. Error bars, SD from 10-fold cross-validation.
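The evaluation scheme described for panel D, a 10-fold cross-validated linear model with a single R² computed over the pooled held-out predictions, can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `pooled_cv_r2`, the synthetic feature matrix, and the choice of ordinary least squares are assumptions, and R² is taken here to mean the coefficient of determination (the caption does not say whether a squared Pearson correlation was used instead).

```python
import numpy as np


def pooled_cv_r2(X, y, n_folds=10, seed=0):
    """Fit a linear model in k-fold CV and score all held-out predictions at once.

    Hypothetical helper: each sample is predicted exactly once, by a model
    that never saw it, and R^2 is computed over the pooled predictions.
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    folds = np.array_split(rng.permutation(n), n_folds)
    y_pred = np.empty(n, dtype=float)

    for test_idx in folds:
        train_mask = np.ones(n, dtype=bool)
        train_mask[test_idx] = False
        # Prepend a column of ones so the model has an intercept.
        X_train = np.column_stack([np.ones(train_mask.sum()), X[train_mask]])
        X_test = np.column_stack([np.ones(len(test_idx)), X[test_idx]])
        coef, *_ = np.linalg.lstsq(X_train, y[train_mask], rcond=None)
        y_pred[test_idx] = X_test @ coef

    # Coefficient of determination over all test data from all folds.
    ss_res = np.sum((y - y_pred) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot, y_pred


# Synthetic stand-in for sequence features and measured expression (assumed data).
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 5))
true_weights = np.array([1.0, -0.5, 0.3, 0.0, 0.8])
y = X @ true_weights + rng.normal(scale=0.5, size=500)

r2, y_pred = pooled_cv_r2(X, y)
print(f"pooled 10-fold CV R^2 = {r2:.3f}")
```

Pooling the held-out predictions before scoring yields one R² for the whole data set, whereas averaging per-fold R² values would give a slightly different number and a per-fold spread (the kind of quantity that SD error bars across folds summarize).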











