Abstract
One major challenge in disease detection using methylation biomarkers is accurately identifying low-frequency, disease-specific methylation haplotypes. Conventional methods detect these haplotypes using unbiased long capture probes, requiring high sequencing depth, with most reads being uninformative. Here, we present Methylation Anchor Probe for Low-signal Enrichment (MAPLE), a novel technology that uses rationally designed, ultrasensitive short capture probes to selectively enrich low-frequency methylation haplotypes, significantly reducing sequencing costs while enhancing detection sensitivity. We experimentally validate MAPLE on reference samples and develop the Enrichment Factor Model (EFM) to quantify the enrichment efficiency of individual probes. To our knowledge, this is the first model to enable correction of observed disease-specific haplotype fractions to their original levels, essential for estimating the limit of detection (LOD) in assay performance evaluation. Finally, two panels are designed using MAPLE and the conventional method respectively to target 13,264 colorectal cancer (CRC)-related methylation haplotypes and are validated through a blood plasma cohort of 162 samples including CRC patients and controls. In 24 stage I and II cancer samples, MAPLE maps 22.7% of unique reads to 5045 targeted haplotypes, compared to 0.8% of reads mapping to 190 haplotypes with the conventional method. MAPLE also enriches tumor-specific fragmentomic characteristics, including fragment size and DNA cleavage profiles. Aided by a LightGBM classifier, the MAPLE panel accurately identifies patients with 95% specificity and 80% sensitivity. Even with just 0.05% of total sequencing reads, MAPLE exhibits only a modest performance decline, in contrast to the conventional method’s significant decline, highlighting MAPLE’s superior efficiency and robustness.