Table 1.

Quality assessment of CapSTARR-seq data set

Panel 1Panel 2
InputOutput - replicate 1Output - replicate 2InputOutput - replicate 1Output - replicate 2
# Targeted regions22,31448,507
# Tested regions (%)22,176 (99.38%)48,488 (99.96%)
Mean region size (bp)638.92735.36
Median region size (bp)448500
Primary alignment18,218,28238,617,34844,520,08645,352,13048,320,28854,647,238
Properly paired (%)18,000,464 (98.8%)36,958,904 (95.71%)42,253,962 (94.91%)45,078,556 (99.40%)47,427,054 (98.15%)53,626,310 (98.13%)
Active enhancers1137114262026484
Overlapping enhancers9145698
Pearson's r2 value0.720.93

[i] The targeted regions represent the number of regions in the capture panel design, whereas the tested elements represent the regions actually captured by the input library. This count is following concatenation of adjacent or overlapping candidate regions. Primary alignment values represent sequencing read counts, with the properly paired value representing the number of reads properly aligning with the genome. Active enhancer regions are designated as regions with a STARRPeaker Q-value ≤ 0.05. The overlap value indicates how many regions were designated active enhancers across both technical replicates of the panel. The Pearson's r2 value (also depicted in Fig. 1D,E) demonstrates the correlation between replicates.