Chencheng Xu; Suying Bao; Ye Wang; Wenxing Li; Hao Chen; Yufeng Shen; Tao Jiang; Chaolin Zhang

Figure 1.

The architecture of DeltaSplice. DeltaSplice comprises a feature-extraction module and multiple prediction modules. The feature-extraction module is built from residual-connected convolutional neural networks (CNNs), which convert a one-hot-encoded input sequence into a feature representation. Each prediction module takes a feature representation as input and, through fully connected layers and a softmax output layer, generates predictions of splice-site usage (SSU) or splice-site probabilities. The single-sequence mode employs two prediction modules to predict the SSU $\text{[math]}$ and the splice-site probabilities $\text{[math]}$ for each site in the input gene sequence s, based on the corresponding feature representation v_s. In the dual-sequence mode, the feature-extraction module calculates the feature representations $\text{[math]}$ and $\text{[math]}$ separately for the target gene sequence s_t and the reference gene sequence s_r. The predicted SSU $\text{[math]}$ for every site in the target gene sequence is computed by a prediction module from the input $\text{[math]}$. Here $\text{[math]}$ is the feature representation of the reference SSU u_r. RNA-seq data from adult brain tissues of humans and seven other mammalian species, as summarized in Supplemental Table S1, were used to estimate SSU values for model training.
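
As a rough illustration of the two endpoints of this pipeline, the sketch below shows how a nucleotide sequence might be one-hot encoded before entering the feature-extraction module, and how a softmax layer turns per-site scores into probabilities. It is not the DeltaSplice implementation: the channel order `ACGT`, the all-zero encoding of ambiguous bases, the helper names `one_hot_encode` and `softmax`, and the three example scores are all assumptions made for illustration.

```python
import math

# Assumed channel order; the caption does not specify one.
BASES = "ACGT"


def one_hot_encode(seq):
    """Return one 4-element indicator vector per nucleotide.

    Bases outside ACGT (for example N) are encoded as all zeros,
    which is an assumption of this sketch.
    """
    return [[1.0 if base == b else 0.0 for b in BASES] for base in seq.upper()]


def softmax(scores):
    """Numerically stable softmax over the output scores of a single site."""
    peak = max(scores)  # subtract the maximum to avoid overflow in exp
    exps = [math.exp(x - peak) for x in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical inputs: a five-base sequence and three raw scores for one site.
encoded = one_hot_encode("ACGTN")
probs = softmax([2.0, 0.5, -1.0])
```

In a full model, the residual CNN layers would sit between these two steps, mapping the encoded sequence to the per-site feature representation that the prediction modules consume.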

Reference-informed prediction of alternative splicing and splicing-altering mutations from sequences
