Data-driven equation discovery reveals nonlinear reinforcement learning in humans

LaFollette, Kyle J.; Yuval, Janni; Schurr, Roey; Melnikoff, David; Goldenberg, Amit

doi:10.6082/zj99h-vkj08

Published July 31, 2025 | Version v1

Journal article Open

Data-driven equation discovery reveals nonlinear reinforcement learning in humans

1. University of Chicago
2. Massachusetts Institute of Technology
3. Harvard University
4. Stanford University

Contributors

Editor:

McClelland, James¹

1. Stanford University

Computational models of reinforcement learning (RL) have significantly contributed to our understanding of human behavior and decision-making. Traditional RL models, however, often adopt a linear approach to updating reward expectations, potentially oversimplifying the nuanced relationship between human behavior and rewards. To address these challenges and explore models of RL, we utilized a method of model discovery using equation discovery algorithms. This method, currently used mainly in physics and biology, attempts to capture data by proposing a differential equation from an array of suggested linear and nonlinear functions. Using this method, we were able to identify a model of RL which we termed the Quadratic Q-Weighted model. The model suggests that reward prediction errors obey nonlinear dynamics and exhibit negativity biases, resulting in an underweighting of reward when expectations are low, and an overweighting of the absence of reward when expectations are high. We tested the generalizability of our model by comparing it to classical models used in nine published studies. Our model surpassed traditional models in predictive accuracy across eight out of these nine published datasets, demonstrating not only its generalizability but also its potential to offer insights into the complexities of human learning. This work showcases the integration of a behavioral task with advanced computational methodologies as a potent strategy for uncovering the intricate patterns of human cognition, marking a significant step forward in the development of computational models that are both interpretable and broadly applicable.

Data availability

All simulation and empirical data are available on the Open Science Framework here: https://osf.io/aeujf/?view_only=88b2b75499f54a3895502fc353f4d244/. All analysis scripts and modeling code are available on GitHub here: https://github.com/GoldenbergLab/analysis-rl-sindy-kyle.

Files

lafollette-et-al-data-driven-equation-discovery-reveals-nonlinear-reinforcement-learning-in-humans.pdf

Files (3.3 MB)

Name	Size	Download all
lafollette-et-al-data-driven-equation-discovery-reveals-nonlinear-reinforcement-learning-in-humans.pdf Article md5:37cb355ab08a2547cc92ee5354ef38d9	1.4 MB	Preview Download
pnas.2413441122.sapp.pdf Supporting information md5:627d642fd906b81e2a05150ddc1ae87e	1.9 MB	Preview Download

Additional details

DOI: 10.1073/pnas.2413441122
Other: oai:uchicago.tind.io:15972

Division(s): Booth School of Business
Department(s): Econometrics and Statistics

	All versions	This version
Views	7	7
Downloads	13	13
Data volume	21.1 MB	21.1 MB

Data-driven equation discovery reveals nonlinear reinforcement learning in humans

Contributors

Editor:

Data availability

Files

lafollette-et-al-data-driven-equation-discovery-reveals-nonlinear-reinforcement-learning-in-humans.pdf

Files (3.3 MB)

Additional details

Identifiers

UChicago Information

Data-driven equation discovery reveals nonlinear reinforcement learning in humans

Creators

Contributors

Editor:

Description

Data availability

Files

lafollette-et-al-data-driven-equation-discovery-reveals-nonlinear-reinforcement-learning-in-humans.pdf

Files (3.3 MB)

Additional details

Identifiers

UChicago Information