
OR repertoire statistics. (A) Cumulative distribution of sequence length. The discontinuity near a length of 220 amino acids reflects the size of the PCR product obtained with standard primers and is expected to disappear once the genome sequence is finished. (B) Distribution of the number of frame disruptions (frameshifts, in-frame stop codons, disrupting interspersed repeats, or partial coding regions flanked by non-OR genomic sequence). (C) Distribution of G + C content for the OR coding regions (CDS), their genomic environments (env), and the genome at large (genome), subdivided into the three isochore groups L, H1–2, and H3 by using cutoffs of 43% and 50% G + C. The 100% value refers to the 702 ORs for which an environment statistic could be calculated.
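The G + C binning in panel C can be illustrated with a minimal sketch. It assumes that G + C content is computed over unambiguous bases only and that a value falling exactly on a cutoff is assigned to the higher group; neither convention is stated in the caption, and the function names gc_content and isochore_group are hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return percent G+C among unambiguous bases (A, C, G, T).

    Assumption: ambiguous symbols such as N are ignored.
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * gc / total


def isochore_group(gc_percent: float) -> str:
    """Assign one of the three groups using the 43% and 50% cutoffs.

    Assumption: a value exactly at a cutoff goes to the higher group.
    """
    if gc_percent < 43.0:
        return "L"
    if gc_percent < 50.0:
        return "H1-2"
    return "H3"


if __name__ == "__main__":
    # Toy sequence: 9 G/C bases out of 20, i.e. 45% G+C.
    example = "GCGCGCGCG" + "ATATATATATA"
    pct = gc_content(example)
    print(pct, isochore_group(pct))
```

The same two functions could be applied separately to a coding region and to its flanking genomic sequence to obtain the CDS and env categories; the size of the flanking window is not given here and would have to be chosen by the user.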











