Analyzing the large and complex SFARI autism cohort data using the Genotypes and Phenotypes in Families (GPF) platform

Table 1.

Gene profiles provide freely accessible summary statistics of the possibly protected data managed by GPF and additional relevant information organized by gene

Present in autism gene sets SPARK SSC
Autism (21,206) Controls (9290) Autism (2814) Controls (2562)
Gene Iossifov PNAS 2015 Sanders Neuron 2015 Yuen Scherer Nature 2017 Turner Eichler ajhg 2019 Satterstrom Buxbaum Cell 2020 De novo LGD De novo missense MPC > 2 Transmitted LGD De novo LGD De novo missense MPC > 2 Transmitted LGD De novo LGD De novo missense MPC > 2 Transmitted LGD De novo LGD De novo missense MPC > 2 Transmitted LGD
SLC6A1 1 (0.05) 1 (0.05) 3 (0.14) 1 (0.11)
WAC 1 (0.05) 1 (0.05) 2 (0.71)
CHD2 7 (0.33) 5 (0.24) 1 (0.05) 3 (1.07) 1 (0.36)
PTEN 5 (0.24) 6 (0.28) 10 (0.47) 1 (0.11) 6 (0.65) 1 (0.36) 3 (1.07) 5 (1.78) 2 (0.78)
SPAST 1 (0.36)
DYRK1A 6 (0.28) 2 (0.09) 43 (2.03) 34 (3.66) 3 (1.07) 1 (0.36) 1 (0.39)
SCN2A 16 (0.75) 9 (0.42) 1 (0.05) 2 (0.22) 2 (0.71) 3 (1.07) 1 (0.36) 13 (5.07)
CHD8 13 (0.61) 3 (0.14) 5 (0.24) 1 (0.11) 9 (3.2) 1 (0.36)
ARID1B 3 (0.14) 1 (0.05) 44 (2.07) 15 (1.61) 2 (0.71) 21 (7.46) 15 (5.84)
ADNP 12 (0.57) 5 (0.24) 1 (0.11) 2 (0.71) 2 (0.71) 1 (0.39)
F0XP1 5 (0.24) 2 (0.09) 13 (0.61) 3 (0.32) 1 (0.36) 6 (2.13) 1 (0.39)
POGZ 6 (0.28) 1 (0.05) 1 (0.36)
SHANK2 3 (0.14) 5 (0.24) 1 (0.11)
PHF2 11 (0.52) 9 (0.97) 2 (0.71) 1 (0.36)
KMT2C 5 (0.24) 3 (0.14) 107 (5.05) 29 (3.12) 1 (0.36) 1 (0.36) 14 (4.98) 15 (5.84)
MYT1L 3 (0.14) 1 (0.05) 1 (0.36)
DSCAM 1 (0.05) 5 (0.24) 1 (0.11) 2 (0.71) 2 (0.71) 1 (0.39)
ASH1L 6 (0.28) 21 (0.99) 5 (0.54) 2 (0.71) 4 (1.42) 3 (1.17)
MBD5 1 (0.05) 2 (0.09) 1 (0.36) 1 (0.39)
SETD5 3 (0.14) 7 (0.33) 2 (0.71)
Sequencing de novo Related gene sets Intolerance scores
Autism (21,795) Congenital heart disease (1213) Developmental disorder (3664) Epilepsy (518) Intellectual disability (966) Schizophrenia (1022) Unaffected (7305)
De novo LGD De novo missense MPC > 2 De novo LGD De novo missense MPC > 2 De novo LGD De novo missense MPC > 2 De novo LGD De novo missense MPC > 2 De novo LGD De novo missense MPC > 2 n(%) De novo LGD De novo missense MPC > 2 De novo LGD De novo missense MPC > 2 CHD 8 target genes Chromatin modifiers Essential genes FMRP Darnell RVIS rank pLI rank
2 (0.09) 4 (0.18) 2 (0.55) 6 (1.64) 1 (1.04) 1 (1.04) 1 (0.14) 4848 1000
4 (0.18) 3 (0.82) 1 (1.93) 2 (2.07) 1781 686
13 (0.6) 7 (0.32) 6 (1.64) 2 (0.55) 1 (3.86) 2 (2.07) 1 (1.04) 392.5 101.5
8 (0.37) 11 (0.51) 1 (0.27) 1 (0.27) 1 (0.14) 6106.5 2246
2 (0.09) 1 (0.27) 8087 1357
11 (0.51) 2 (0.09) 14 (3.82) 4 (1.09) 1 (1.04) 1 (1.04) 4274 895
21 (0.96) 17 (0.78) 4 (1.09) 7 (1.91) 3 (5.79) 6 (6.21) 2 (2.07) 1 (1) 292 132
30 (1.38) 7 (0.32) 3 (0.82) 1 (1.04) 1 (1) 1 (0.14) 193 31.5
10 (0.46) 1 (0.05) 1 (0.82) 30 (8.19) 6 (7.35) 131 585
20 (0.92) 19 (5.19) 1 (1.04) 684 1089
7 (0.32) 2 (0.09) 8 (2.18) 2 (0.55) 1 (1.04) 1 (1.04) 4666 806
9 (0.41) 1 (0.05) 1 (0.82) 6 (1.64) 7 (7.25) 1 (1) 565 292
6 (0.28) 649
2 (0.09) 1306.5 1598
9 (0.41) 3 (0.14) 1 (0.82) 3 (0.82) 1 (1.04) 31.5
4 (0.18) 3 (0.14) 2 (0.55) 2 (0.55) 1 (1.04) 586 557
5 (0.23) 70 78.5
10 (0.46) 1 (0.82) 2 (0.55) 368 31.5
2 (0.09) 3 (0.82) 249 740
4 (0.18) 14 (3.82) 2 (2.07) 2098.5 259
  • The statistics comprise the number of variants of various types in the available data sets. Gene profiles organize data as a virtual table with a row for each human gene and columns organized into sections. The table shows a small part of the complete gene profiles in GPF-SFARI, containing the statistics for 20 of the genes most strongly implicated in autism. In GPF-SFARI's gene profiles, the first section of columns shows if each of the genes has been listed as a top candidate in five high-profile autism genetic analysis papers. The first 12 of the shown genes have been listed in all five papers. The second, third, and fourth column sections contain variant counts from the SPARK, SSC, and sequencing de novo collections, respectively. The sections for SPARK and SSC show the counts of de novo LGDs, de novo missense, and transmitted LGDs separately for the children affected with autism and for their unaffected siblings (“controls”). The sequencing de novo section shows the counts for de novo LGD and missense variants in individuals with autism, epilepsy, schizophrenia, developmental disorder, intellectual disability, congenital heart disease, and controls. In each column, the number of variants of the particular type in each gene is shown, as well as the rate in terms of the number of variants per 1000 individuals. The fifth section shows if the gene belongs to five genes strongly implicated in the etiology of autism: syndromic autism genes, CHD8 target genes, genes encoding chromatin modifiers, essential genes, and genes encoding FMRP target RNAs. The last section shows two popular gene scores (RVIS and pLI) related to the gene's degree of intolerance to damaging mutations computed through analysis of damaging variants in large human populations. We note that the table is a stylized representation of the gene profiles page on GPF-SFARI, designed to reduce white space.

This Article

  1. Genome Res. 35: 2352-2362

Preprint Server