Linear regression models for human-chimpanzee divergence and human diversity

| Predictor | Human-chimp divergence: R² | Human-chimp divergence: slope | Human-chimp divergence: τ | Human diversity: R² | Human diversity: slope | Human diversity: τ |
|---|---|---|---|---|---|---|
| Recombination rate (cM/Mb) | | 0.311* | 0.291* | | 0.239* | 0.328* |
| Poly(R/Y) | | −0.380* | −0.031 | | −0.151* | 0.020 |
| Simple repeat content | | 0.255* | 0.357* | | 0.143* | 0.221* |
| The top three predictors | 0.380 | | | 0.241 | | |
| CpG content | | 2.021* | 0.046** | | 1.105* | 0.125* |
| CpG islands | | −0.504* | −0.076*** | | −0.224*** | 0.042 |
| Distance to centromere | | −0.103* | 0.046** | | −0.121* | 0.032 |
| Distance to telomeres | | −0.145* | −0.339* | | −0.159* | −0.283* |
| GC content | | −2.664* | 0.009 | | −1.474* | 0.094* |
| Gene content | | −0.107* | −0.187* | | N.S. | −0.066*** |
| Poly(A/T) | | −1.465* | −0.047** | | −0.875* | −0.115* |
| Poly(CA) | | −0.079** | 0.227* | | N.S. | 0.196* |
| Poly(G/C) | | −0.202* | −0.062*** | | −0.253* | 0.024 |
| SINE count | | N.S. | 0.013 | | 0.087*** | 0.051** |
| Full model | 0.526 | | | 0.324 | | |

The model was selected by a stepwise procedure. The divergence and diversity data come from nonrepetitive intergenic regions. Most of the variability is explained by the first three predictors. The R² estimates are given for the model with three predictors and for the full model, that is, all 12 predictors for divergence and all 11 predictors for diversity that each contributed significantly to explaining the variance. The slope estimates are for the full model and are standardized for ease of comparison. Also listed are the pairwise rank-correlation coefficients based on Kendall's τ.

(N.S.) Not significant in the multiple regression model.

\* Significant at the <0.1% level.

\*\* Significant at the 5% level.

\*\*\* Significant at the 1% level.
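
To make the quantities in the table concrete, the following minimal Python sketch shows one way standardized slopes, R², and Kendall's τ can be computed. It is an illustration under stated assumptions, not the authors' analysis: the function names (`standardized_fit`, `kendall_tau_a`) and the toy data are hypothetical, the stepwise selection and significance tests are omitted, and τ is computed as the tau-a variant (ties count as neither concordant nor discordant), which may differ from the variant used for the table.

```python
from itertools import combinations
from statistics import mean, pstdev


def zscore(values):
    """Center to mean 0 and scale to (population) standard deviation 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def standardized_fit(predictors, response):
    """Return (standardized slopes, R^2) of an ordinary least-squares fit.

    After z-scoring, the normal equations reduce to R * beta = r, where R is
    the predictor correlation matrix and r holds each predictor's correlation
    with the response. R^2 is then the dot product of beta and r.
    """
    zs = [zscore(p) for p in predictors]
    zy = zscore(response)
    n = len(response)

    def corr(u, v):
        return sum(p * q for p, q in zip(u, v)) / n

    corr_matrix = [[corr(u, v) for v in zs] for u in zs]
    corr_with_y = [corr(u, zy) for u in zs]
    betas = solve(corr_matrix, corr_with_y)
    r_squared = sum(b * c for b, c in zip(betas, corr_with_y))
    return betas, r_squared


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    score = 0
    pairs = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        prod = (x1 - x2) * (y1 - y2)
        score += (prod > 0) - (prod < 0)
        pairs += 1
    return score / pairs


# Toy data (made up for illustration; not taken from the table).
x1 = [1, 2, 3, 4, 5, 6]
x2 = [2, 1, 4, 3, 6, 5]
y = [2 * a - b for a, b in zip(x1, x2)]  # exact linear combination of x1, x2

betas, r_squared = standardized_fit([x1, x2], y)
tau = kendall_tau_a(x1, x2)
```

Because every variable is z-scored before fitting, the slopes are unit-free and can be compared across predictors measured on different scales, which is the purpose of the standardization mentioned in the caption. Kendall's τ, by contrast, depends only on the rank order of each pair of observations.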











