Table 1.

Summary of sequencing data

623tbl1

[i] The number of Illumina fragments represents the sequences that passed Illumina's quality filters. WGS long-read counts represent the number of sequences for which accurate quality and clipping coordinates were available. The median Illumina size is the median mapping distance for pairs that aligned concordantly to the reference genome, whereas the WGS long-read figure reflects the median length of the sequences after quality and vector clipping. Median Illumina read length is the median length of the sequence on each end of each matepair. Observed coverage was computed empirically as the mean number of concordant pairs that spanned each base in the genome. The fraction aligned represents the proportion of reads that aligned somewhere in the genome, requiring that each end of Illumina reads aligned.

[ii] NA, Not applicable.