Table 1.

Primate Single-Nucleotide Variation versus Sequence Class

# loci Alignment length (bp) Aligned bases (bp) Matches (bp) Mismatches (bp) Transitions (s) Transversions (v) s/v Identity (%) Kimura Distance (%) R[ii] ×10−9
Human–Chimpanzee
 Overall514968069485370847989475476136914178472.0798.87 ± 0.001.14 ± 0.00[iii] 1.034 ± 0.004
 Overall-CG514968069476428347232494103415414256201.6699.14 ± 0.000.87 ± 0.000.788 ± 0.004
 Exon246905168957685434142961182.5199.40 ± 0.030.60 ± 0.030.548 ± 0.027
 Unique noncoding51274958427200232692593274308913185172.0898.99 ± 0.011.02 ± 0.010.924 ± 0.006
 Repetitive51220133620982972070750265478393181542.1698.73 ± 0.011.28 ± 0.011.162 ± 0.007
  Alu514462124193794128826497457719202.3898.45 ± 0.021.57 ± 0.021.425 ± 0.018
  Alu-CG514462123990483950134035156724681.5798.99 ± 0.021.02 ± 0.020.926 ± 0.015
  L1518370357677747582139561632232391.9598.75 ± 0.011.26 ± 0.011.143 ± 0.012
Human–Baboon
 Overall42498496544565074204745251762167380843821.9894.35 ± 0.015.90 ± 0.01[iii] 1.181 ± 0.002
 Overall-CG42498496543511984140103211095752371358581.8195.15 ± 0.015.03 ± 0.011.007 ± 0.002
 Exon24485784809846627147110424292.4396.94 ± 0.083.13 ± 0.080.627 ± 0.016
 Unique noncoding42314825530227152862848159867106002538651.9794.71 ± 0.015.51 ± 0.011.102 ± 0.003
 Repetitive421973097155529514567559854066258322822.0593.66 ± 0.026.66 ± 0.021.332 ± 0.004
  Alu42404917292641267869247721721975532.2891.54 ± 0.059.07 ± 0.061.814 ± 0.012
  Alu-CG42404917275514256962185526506120461.8593.27 ± 0.057.10 ± 0.051.419 ± 0.011
  L1427785755388635072373162620465111611.8394.13 ± 0.036.14 ± 0.041.228 ± 0.007
Human–Lemur
 Overall96231394231393410618207847053350251.3480.60 ± 0.0622.73 ± 0.08[iii] 2.066 ± 0.008
 Overall-CG96231394060113323217369031863418271.3181.85 ± 0.0621.01 ± 0.081.910 ± 0.007
 Unique93703373135122559775753533526240091.4081.65 ± 0.0721.31 ± 0.091.938 ± 0.009
 Repetitive9244728103787803332345412909105451.2277.40 ± 0.1327.25 ± 0.192.477 ± 0.017

[i] Orthologous sequences were globally aligned with ALIGN (Methods). A suboptimal alignment was defined as any alignment which exceeded two standard deviations of the mean genetic distance (window size 2 kb, slide 100 bp). These regions were not included in the analysis. The mean and standard deviation of alignment lengths are 106, 107 ± 41,659, 95,171 ± 38,751, and 47,015 ± 34,144 bp for human–chimpanzee, human–baboon, and human–lemur comparisons. Exon sequence was restricted only to well-annotated human genes (NCBI RefSeq database). Repetitive sequences were detected using RepeatMasker (version 3.0). Unique noncoding regions excluded both exonic and repetitive regions. For human–baboon and human–chimpanzee comparisons, Alu and L1 were calculated separately. Relatively few L1 and Alu repeats were orthologous between human and lemur genomic alignments and were therefore not partitioned. Due to the enrichment of CpG dinucleotides with Alu repeats, we considered substitutions without CpG dinucleotides (Alu-CG) as well as the overall content minus CpG dinucleotides in each species (Overall-CpG).

[ii] Substitution rate calculations assume divergence times of the human lineage from chimpanzee, baboon and lemur of 5.5, 25 and 55 Mya (Goodman 1999).

[iii] If suboptimal alignments were included in the analysis, the overall genetic distance increases to 1.14 ± 0.00%, 6.05 ± 0.01% and 25.69 ± 0.07%, respectively (Methods).