TY - JOUR A1 - Zhang, Pinglu A1 - Wei, Yanming A1 - Tian, Qinzhong A1 - Zou, Quan A1 - Wang, Yansu T1 - Fast sequence alignment for centromeres with RaMA Y1 - 2025/05/01 JF - Genome Research JO - Genome Research SP - 1209 EP - 1218 DO - 10.1101/gr.279763.124 VL - 35 IS - 5 UR - http://genome.cshlp.org/content/35/5/1209.abstract N2 - The release of the first draft of the human pangenome has revolutionized genomic research by enabling access to complex regions like centromeres, composed of extra-long tandem repeats (ETRs). However, a significant gap remains as current methodologies are inadequate for producing sequence alignments that effectively capture genetic events within ETRs, highlighting a pressing need for improved alignment tools. Inspired by UniAligner, we developed a rare match aligner (RaMA), using rare matches as anchors and two-piece affine gap cost to generate complete pairwise alignment that better captures genetic evolution. RaMA also employs parallel computing and the wavefront algorithm to accelerate anchor discovery and sequence alignment, achieving up to 13.66 times faster processing using only 11% of UniAligner's memory. Downstream analysis of simulated data and the CHM13 and CHM1 higher-order repeat (HOR) arrays demonstrates that RaMA achieves more accurate alignments, effectively capturing true HOR structures. RaMA also introduces two methods for defining reliable alignment regions, further refining and enhancing the accuracy of centromeric alignment statistics. ER -