FocalSV enables target region–based structural variant assembly and refinement using single-molecule long-read sequencing data
- 1Department of Biomedical Engineering, Vanderbilt University, Nashville, Tennessee 37235, USA;
- 2Department of Computer Science, Vanderbilt University, Nashville, Tennessee 37235, USA
-
↵3 These authors contributed equally to this work.
Abstract
Structural variants (SVs) play a critical role in shaping the diversity of the human genome, and their detection holds significant potential for advancing precision medicine. Despite notable progress in single-molecule long-read sequencing technologies, accurately identifying SV breakpoints and resolving their sequence remains a major challenge. Current alignment-based tools often struggle with precise breakpoint detection and sequence characterization, whereas whole-genome assembly-based methods are computationally demanding and less practical for targeted analyses. Neither approach is ideally suited for scenarios where regions of interest are predefined and require precise SV characterization. To address this gap, we introduce FocalSV, a targeted SV detection framework that integrates both assembly- and alignment-based signals. By combining the precision of local assemblies with the efficiency of region-specific analysis, FocalSV enables more accurate SV detection. FocalSV supports user-defined target regions and can automatically identify and expand regions with potential structural variants to enable more comprehensive detection. FocalSV is evaluated on 10 germline data sets and two paired normal-tumor cancer data sets, demonstrating superior performance in both precision and efficiency.
Footnotes
-
[Supplemental material is available for this article.]
-
Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.280282.124.
-
Freely available online through the Genome Research Open Access option.
- Received November 26, 2024.
- Accepted August 7, 2025.
This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.











