Estimation of allele-specific fitness effects across human protein-coding sequences and implications for disease

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 3.
Figure 3.

Performance in predicting disease-associated nonsynonymous variants. Performance is quantified using the area under the receiver operating characteristic curve (AUC) statistic. Results for LASSIE are compared with those for Eigen (Ionita-Laza et al. 2016), PolyPhen-2 (Adzhubei et al. 2010), CADD (Kircher et al. 2014), SIFT (Ng and Henikoff 2003), and phyloP (Pollard et al. 2010). (A) Performance for pathogenic variants from ClinVar (Landrum et al. 2014). (B) Performance for cancer-driver mutations from Chang et al. (2016). (C) Distributions of estimated |s| for variants in BRCA1 predicted to be “functional” (FUNC; i.e., nondisruptive), “intermediate” (INT), or “nonfunctional” (NONFUNC; i.e., disruptive) by saturation genome editing (Findlay et al. 2018). Colored dots indicate those variants also having expert-reviewed status in ClinVar (CLINREVSTAT = reviewed by expert panel). (D) Performance for rare (MAF < 1%) GWAS hits. (E) Performance for common (MAF > 5%) GWAS hits, showing that all methods have limited power.

This Article

  1. Genome Res. 29: 1310-1321

Preprint Server