Published April 1, 2026 | Version v1
Journal article

The geometry of G × E: How scaling and endogenous treatment effects shape interaction direction

  • 1. University of California Los Angeles
  • 2. University of Chicago
  • 3. Carnegie Mellon University

Description

Gene-environment interaction (G × E) studies hold promise for identifying genetic loci mediating the effects of environmental risk on disease. However, interpretation of G × E effects is often confounded by two fundamental issues: the dependence of interaction estimates on outcome scale and the presence of endogenous treatment effects, in which genetic liability influences environmental exposure. These factors can induce apparent G × E signals—even when genetic and environmental contributions are purely additive on an unobserved scale. In this work, we demonstrate that any monotone convex transformation of an outcome induces sign-consistent G × E effects: the sign of the interaction term aligns with the sign of the corresponding main genetic effect. Convex transformations are a broad class of functions that include many commonly used data transformations, such as exponential and logarithmic functions, the square root, and other power transformations. We further show that endogenous treatment effects, modeled as threshold-based interventions, generate G × E effects with a similar directional signature. Exploiting this property, we propose a simple diagnostic: sign consistency across G × E estimates can signal when interactions are driven by outcome scaling or exposure endogeneity. We validate our framework in the UK Biobank using transcriptome-wide interaction studies (TxEWAS) across multiple trait–environment pairs, observing widespread sign consistency in some settings—suggesting confounding by scaling or treatment bias. Our results provide both a theoretical foundation and a practical tool for interpreting G × E findings, enabling researchers to assess whether the observed G × E signal may depend substantially on outcome scaling or be influenced by exposure endogeneity.

Data availability

The UK Biobank data underlying the results presented in this study were accessed under application 33127 and cannot be further distributed in accordance with UK Biobank policies. Researchers may obtain access to these data by submitting an application directly to the UK Biobank: https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access. All data and code necessary to reproduce the figures and results presented in this study are publicly available in a GitHub repository at https://github.com/michalsad/gxe_sign.

Additional details

Identifiers

DOI
10.1371/journal.pgen.1012073
Other
oai:uchicago.tind.io:16870

Funding

National Institutes of Health
R01MH130581
National Institutes of Health
U01MH126798
National Institutes of Health
R01MH122688
National Institutes of Health
R01HG006399
National Institutes of Health
R01HG011345
National Institutes of Health
R01GM142112
National Institutes of Health
L30HG013856
National Institutes of Health
R35GM150822
National Institutes of Health
K25HL157603

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Medicine