Score: 0

Exploratory Mean-Variance with Jumps: An Equilibrium Approach

Published: December 10, 2025 | arXiv ID: 2512.09224v1

By: Yuling Max Chen, Bin Li, David Saunders

Potential Business Impact:

Helps investors make more money in the stock market.

Business Areas:
A/B Testing Data and Analytics

Revisiting the continuous-time Mean-Variance (MV) Portfolio Optimization problem, we model the market dynamics with a jump-diffusion process and apply Reinforcement Learning (RL) techniques to facilitate informed exploration within the control space. We recognize the time-inconsistency of the MV problem and adopt the time-inconsistent control (TIC) approach to analytically solve for an exploratory equilibrium investment policy, which is a Gaussian distribution centered on the equilibrium control of the classical MV problem. Our approach accounts for time-inconsistent preferences and actions, and our equilibrium policy is the best option an investor can take at any given time during the investment period. Moreover, we leverage the martingale properties of the equilibrium policy, design a RL model, and propose an Actor-Critic RL algorithm. All of our RL model parameters converge to the corresponding true values in a simulation study. Our numerical study on 24 years of real market data shows that the proposed RL model is profitable in 13 out of 14 tests, demonstrating its practical applicability in real world investment.

Country of Origin
🇨🇦 Canada

Page Count
27 pages

Category
Quantitative Finance:
Portfolio Management