Quality assessment of CapSTARR-seq data set
| Panel 1 | Panel 2 | |||||
|---|---|---|---|---|---|---|
| Input | Output - replicate 1 | Output - replicate 2 | Input | Output - replicate 1 | Output - replicate 2 | |
| # Targeted regions | 22,314 | – | – | 48,507 | – | – |
| # Tested regions (%) | 22,176 (99.38%) | – | – | 48,488 (99.96%) | – | – |
| Mean region size (bp) | 638.92 | – | – | 735.36 | – | – |
| Median region size (bp) | 448 | – | – | 500 | – | – |
| Primary alignment | 18,218,282 | 38,617,348 | 44,520,086 | 45,352,130 | 48,320,288 | 54,647,238 |
| Properly paired (%) | 18,000,464 (98.8%) | 36,958,904 (95.71%) | 42,253,962 (94.91%) | 45,078,556 (99.40%) | 47,427,054 (98.15%) | 53,626,310 (98.13%) |
| Active enhancers | – | 1137 | 1142 | – | 6202 | 6484 |
| Overlapping enhancers | – | 914 | – | 5698 | ||
| Pearson's r2 value | – | 0.72 | – | 0.93 | ||
[i] The targeted regions represent the number of regions in the capture panel design, whereas the tested elements represent the regions actually captured by the input library. This count is following concatenation of adjacent or overlapping candidate regions. Primary alignment values represent sequencing read counts, with the properly paired value representing the number of reads properly aligning with the genome. Active enhancer regions are designated as regions with a STARRPeaker Q-value ≤ 0.05. The overlap value indicates how many regions were designated active enhancers across both technical replicates of the panel. The Pearson's r2 value (also depicted in Fig. 1D,E) demonstrates the correlation between replicates.