
Decomposition of cenX into HORs. The 12-monomer HOR for cenX is represented as M1… M12 = AB…KL. The monomer set includes these 12 frequent monomers as well as hybrid monomers M (a hybrid of monomers J and H) and N (a hybrid of monomers K and J) identified in Dvorkina et al. (2020). Each occurrence of this HOR that starts from the monomer Mi is labeled as ci (shown in red). Each occurrence of a partial HOR that includes monomers from i to j is labeled as pi,j. We use the notation cm (pm) to denote m consecutive occurrences of a canonical (partial) HOR. The most frequent partial monomers p3-7, p7-3, and p5-2 in cenX are colored in blue, green, and brown, respectively. The HOR decomposition of cenX has a length 72 and includes 1486 complete HORs that form 34 HOR runs. Only 257 of 18,089 (1.4%) monomer blocks in cenX are not covered by complete HORs. The “LINE” entry shows the position of the LINE element. To ensure that all monomers are shown in the forward strand, we decompose the reverse complement of cenX and take reverse-complements of all monomers in cenX (Supplemental Note 4).











