Published November 5, 2025 | Version v1
Journal article

Specificity, length and luck drive gene rankings in association studies

Description

Standard genome-wide association studies (GWAS) and rare variant burden tests are essential tools for identifying trait-relevant genes. Although these methods are conceptually similar, by analysing association studies of 209 quantitative traits in the UK Biobank, we show that they systematically prioritize different genes. This raises the question of how genes should ideally be prioritized. We propose two prioritization criteria: (1) trait importance — how much a gene quantitatively affects a trait; and (2) trait specificity — the importance of a gene for the trait under study relative to its importance across all traits. We find that GWAS prioritize genes near trait-specific variants, whereas burden tests prioritize trait-specific genes. Because non-coding variants can be context specific, GWAS can prioritize highly pleiotropic genes, whereas burden tests generally cannot. Both study designs are also affected by distinct trait-irrelevant factors, complicating their interpretation. Our results illustrate that burden tests and GWAS reveal different aspects of trait biology and suggest ways to improve their interpretation and usage.

Data availability

All data for reproducing the figures are provided on GitHub (https://github.com/jeffspence/specificity_length_luck).

The scripts for reproducing the figures and simulations are provided on GitHub (https://github.com/jeffspence/specificity_length_luck).

Additional details

Identifiers

DOI
10.1038/s41586-025-09703-7
Other
oai:uchicago.tind.io:16551

Funding

National Institutes of Health
R01HG011432
National Institutes of Health
R01HG008140
National Institutes of Health
R01HG014005
National Institutes of Health
U01HG012069
National Institutes of Health
R01GM115889
National Institutes of Health
R35GM157134

UChicago Information

Division(s)
Biological Sciences Division
Department(s)
Human Genetics