Markup | Genome Research

Table 1.

Quality assessment of CapSTARR-seq data set

	Panel 1			Panel 2
	Input	Output - replicate 1	Output - replicate 2	Input	Output - replicate 1	Output - replicate 2
# Targeted regions	22,314	–	–	48,507	–	–
# Tested regions (%)	22,176 (99.38%)	–	–	48,488 (99.96%)	–	–
Mean region size (bp)	638.92	–	–	735.36	–	–
Median region size (bp)	448	–	–	500	–	–
Primary alignment	18,218,282	38,617,348	44,520,086	45,352,130	48,320,288	54,647,238
Properly paired (%)	18,000,464 (98.8%)	36,958,904 (95.71%)	42,253,962 (94.91%)	45,078,556 (99.40%)	47,427,054 (98.15%)	53,626,310 (98.13%)
Active enhancers	–	1137	1142	–	6202	6484
Overlapping enhancers	–	914		–	5698
Pearson's r² value	–	0.72		–	0.93

[i] The targeted regions represent the number of regions in the capture panel design, whereas the tested elements represent the regions actually captured by the input library. This count is following concatenation of adjacent or overlapping candidate regions. Primary alignment values represent sequencing read counts, with the properly paired value representing the number of reads properly aligning with the genome. Active enhancer regions are designated as regions with a STARRPeaker Q-value ≤ 0.05. The overlap value indicates how many regions were designated active enhancers across both technical replicates of the panel. The Pearson's r² value (also depicted in Fig. 1D,E) demonstrates the correlation between replicates.