Published April 1, 2026
| Version v1
Journal article
The geometry of G × E: How scaling and endogenous treatment effects shape interaction direction
- 1. University of California Los Angeles
- 2. University of Chicago
- 3. Carnegie Mellon University
Description
Gene-environment interaction (G × E) studies hold promise for identifying genetic loci mediating the effects of environmental risk on disease. However, interpretation of G × E effects is often confounded by two fundamental issues: the dependence of interaction estimates on outcome scale and the presence of endogenous treatment effects, in which genetic liability influences environmental exposure. These factors can induce apparent G × E signals—even when genetic and environmental contributions are purely additive on an unobserved scale. In this work, we demonstrate that any monotone convex transformation of an outcome induces sign-consistent G × E effects: the sign of the interaction term aligns with the sign of the corresponding main genetic effect. Convex transformations are a broad class of functions that include many commonly used data transformations, such as exponential and logarithmic functions, the square root, and other power transformations. We further show that endogenous treatment effects, modeled as threshold-based interventions, generate G × E effects with a similar directional signature. Exploiting this property, we propose a simple diagnostic: sign consistency across G × E estimates can signal when interactions are driven by outcome scaling or exposure endogeneity. We validate our framework in the UK Biobank using transcriptome-wide interaction studies (TxEWAS) across multiple trait–environment pairs, observing widespread sign consistency in some settings—suggesting confounding by scaling or treatment bias. Our results provide both a theoretical foundation and a practical tool for interpreting G × E findings, enabling researchers to assess whether the observed G × E signal may depend substantially on outcome scaling or be influenced by exposure endogeneity.
Data availability
The UK Biobank data underlying the results presented in this study were accessed under application 33127 and cannot be further distributed in accordance with UK Biobank policies. Researchers may obtain access to these data by submitting an application directly to the UK Biobank: https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access. All data and code necessary to reproduce the figures and results presented in this study are publicly available in a GitHub repository at https://github.com/michalsad/gxe_sign.Additional details
Identifiers
- DOI
- 10.1371/journal.pgen.1012073
- Other
- oai:uchicago.tind.io:16870
Funding
- National Institutes of Health
- R01MH130581
- National Institutes of Health
- U01MH126798
- National Institutes of Health
- R01MH122688
- National Institutes of Health
- R01HG006399
- National Institutes of Health
- R01HG011345
- National Institutes of Health
- R01GM142112
- National Institutes of Health
- L30HG013856
- National Institutes of Health
- R35GM150822
- National Institutes of Health
- K25HL157603