Genealogy-based trait association with LOCATER boosts power at loci with allelic heterogeneity
- Xinxin Wang1,2,3,10,11,
- Ryan Christ1,2,
- Erica Young4,5,
- Chul Joo Kang4,
- Indraniel Das5,12,
- Edward A. Belter Jr.5,13,
- Markku Laakso6,
- Louis J.M. Aslett7,
- David Steinsaltz8,
- Nathan O. Stitziel4,9 and
- Ira M. Hall1,2
- 1Department of Genetics, Yale University, New Haven, Connecticut 06520, USA;
- 2Center for Genomic Health, Yale University, New Haven, Connecticut 06520, USA;
- 3Roy and Diana Vagelos Division of Biology and Biomedical Sciences, Washington University School of Medicine, Saint Louis, Missouri 63110, USA;
- 4Center for Cardiovascular Research, Division of Cardiology, Department of Medicine, Washington University School of Medicine, Saint Louis, Missouri 63110, USA;
- 5McDonnell Genome Institute, Washington University School of Medicine, Saint Louis, Missouri 63108, USA;
- 6Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio FI-70211, Finland;
- 7Department of Mathematical Sciences, Durham University, Durham DH1 3LE, United Kingdom;
- 8Department of Statistics, University of Oxford, Oxford OX1 4BH, United Kingdom;
- 9Department of Genetics, Washington University School of Medicine, Saint Louis, Missouri 63110, USA
Abstract
A key methodological challenge for genome-wide association studies is how to leverage haplotype diversity and allelic heterogeneity to improve trait association power, especially in noncoding regions where it is difficult to predict variant impacts and define functional units for variant aggregation. Genealogy-based association methods have the potential to bridge this gap by testing combinations of common and rare haplotypes based purely on their ancestral relationships. In parallel work, we have developed an efficient local ancestry inference engine and a novel statistical method (LOCATER) for combining signals present on different branches of a locus-specific haplotype tree. Here, we develop a genome-wide LOCATER analysis pipeline and apply it to a genome sequencing study of 6795 Finnish individuals with 101 cardiometabolic traits and 18.9 million autosomal variants. We identify 351 significant trait associations at 47 distinct genomic loci and find that LOCATER boosts the single marker test (SMT) association signal at five loci by combining independent signals from distinct alleles. LOCATER successfully recovers known quantitative trait loci not found by SMT, including LIPG, recovers known allelic heterogeneity at the APOE/C1/C4/C2 gene cluster, and suggests one novel association. We find that confounders have a more pronounced effect on genealogy-based methods than SMT, and we propose a new randomization approach and a general method for genomic control to eliminate their effects. This study demonstrates that genealogy-based methods such as LOCATER excel when multiple causal variants are present and suggests that their application to larger and more diverse cohorts will be fruitful.
Footnotes
-
[Supplemental material is available for this article.]
-
Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.280372.124.
-
Freely available online through the Genome Research Open Access option.
- Received December 30, 2024.
- Accepted March 12, 2026.
This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.











