DNA conformational flexibility descriptors improve transcription factor binding prediction across diverse transcription factor families

    • 1Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India;
    • 2Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh 522302, India
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 5
Current Issue:

Abstract

Precise transcription factor (TF) binding to DNA governs gene regulation, yet nucleotide sequence alone often fails to fully capture binding specificity. Although static DNA shape is a recognized determinant of indirect readout, the role of intrinsic conformational flexibility remains underexplored across TF families. Here, we demonstrate that integrating sequence-derived DNA flexibility descriptors into predictive models improves both prediction and mechanistic interpretability of TF–DNA affinity. Across large-scale in vitro data sets encompassing HT-SELEX and protein-binding microarrays for mammalian and Drosophila TFs, flexibility-augmented models consistently outperform sequence-only baselines and complement DNA shape models. Cross-platform analyses further indicate that flexibility features capture structural information that is robust to platform-specific biases. Using a position-resolved interpretation framework, we uncover family-specific “flexibility footprints,” including recurrent hotspots in core motifs and flanks that align with DNA structural deformations from TF–DNA complex structures. Extending to ENCODE ChIP-seq and DNase-seq data, flexibility augmentation improves classification of functional TF binding sites across diverse TFs and cellular contexts. Collectively, these results underscore the insufficiency of sequence-only models and highlight the utility of the flexibility descriptors as an interpretable component of the TF recognition code.

Loading
Loading
Back to top