From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models

Published: July 24, 2025 | arXiv ID: 2507.18229v1

By: Zeqiang Zhang, Ruxin Chen

Potential Business Impact:

Teaches computers to make fair economic choices.

The application of Reinforcement Learning (RL) to economic modeling reveals a fundamental conflict between the assumptions of equilibrium theory and the emergent behavior of learning agents. While canonical economic models assume atomistic agents act as "takers" of aggregate market conditions, a naive single-agent RL simulation incentivizes the agent to become a "manipulator" of its environment. This paper first demonstrates this discrepancy within a search-and-matching model with concave production, showing that a standard RL agent learns a non-equilibrium, monopsonistic policy. Additionally, we identify a parametric bias arising from the mismatch between economic discounting and RL's treatment of intertemporal costs. To address both issues, we propose a calibrated Mean-Field Reinforcement Learning framework that embeds a representative agent in a fixed macroeconomic field and adjusts the cost function to reflect economic opportunity costs. Our iterative algorithm converges to a self-consistent fixed point where the agent's policy aligns with the competitive equilibrium. This approach provides a tractable and theoretically sound methodology for modeling learning agents in economic systems within the broader domain of computational social science.
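The taker-versus-manipulator distinction and the self-consistent fixed point can be illustrated with a small sketch. It is not the paper's model or algorithm: it uses a static vacancy-posting toy, and it swaps the RL agent for a plain grid-search best response so that the structure of the iteration stays visible. Every functional form, parameter value, and function name below (`fill_rate`, `profit`, `taker_best_response`, `manipulator_choice`, `mean_field_fixed_point`) is an assumption introduced for illustration.

```python
# Toy illustration only: a static vacancy-posting problem, NOT the paper's model.
# All functional forms and parameter values are assumptions for this sketch.

A, ALPHA = 1.0, 0.5   # concave production: A * n**ALPHA
W, C = 0.2, 0.1       # wage per hire, cost per posted vacancy
ETA, U = 0.5, 1.0     # matching elasticity, mass of job seekers

# Candidate vacancy levels searched by both agents (0.001 to 5.000).
GRID = [i / 1000 for i in range(1, 5001)]


def fill_rate(theta):
    """Probability that a vacancy is filled at market tightness theta."""
    return theta ** (-ETA)


def profit(v, theta):
    """Firm profit from posting v vacancies when tightness equals theta."""
    n = fill_rate(theta) * v  # hires
    return A * n ** ALPHA - W * n - C * v


def taker_best_response(theta):
    """Best vacancy level when theta is treated as a fixed aggregate field."""
    return max(GRID, key=lambda v: profit(v, theta))


def manipulator_choice():
    """Best vacancy level for an agent that internalizes theta = v / U."""
    return max(GRID, key=lambda v: profit(v, v / U))


def mean_field_fixed_point(theta=1.0, damping=0.5, tol=1e-6, max_iter=500):
    """Alternate between best-responding to the field and updating the field."""
    for _ in range(max_iter):
        v = taker_best_response(theta)
        # Damped update toward the tightness implied by the agent's own policy.
        theta_new = (1 - damping) * theta + damping * (v / U)
        if abs(theta_new - theta) < tol:
            return theta_new, v
        theta = theta_new
    raise RuntimeError("field iteration did not converge")


theta_star, v_taker = mean_field_fixed_point()
v_manip = manipulator_choice()
print("price-taker vacancies:", v_taker, "at tightness", round(theta_star, 3))
print("manipulator vacancies:", v_manip)
```

Under these assumed parameters the manipulator posts fewer vacancies than the price-taker, because it accounts for the way its own postings lower the fill rate. The damped loop stops once the field the agent responds to matches the field its policy implies, which is the self-consistency condition the abstract describes.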

Learning Closed-Loop Parametric Nash Equilibria of Multi-Agent Collaborative Field Coverage

Multiagent Systems

Teaches robots to cover areas much faster.

14 Mar 2025 0

89%

Reinforcement Learning and Consumption-Savings Behavior

General Economics

Explains why people spend less after losing jobs.

23 Oct 2025 0

89%

Reinforcement Learning in Queue-Reactive Models: Application to Optimal Execution

Trading & Market Microstructure

Teaches computers to trade stocks smartly.

19 Nov 2025 2

Country of Origin

🇯🇵 🇩🇪 Japan, Germany

Page Count

14 pages
