Exploratory Mean-Variance with Jumps: An Equilibrium Approach
By: Yuling Max Chen, Bin Li, David Saunders
Potential Business Impact:
Helps investors make more money in the stock market.
Revisiting the continuous-time Mean-Variance (MV) Portfolio Optimization problem, we model the market dynamics with a jump-diffusion process and apply Reinforcement Learning (RL) techniques to facilitate informed exploration within the control space. We recognize the time-inconsistency of the MV problem and adopt the time-inconsistent control (TIC) approach to analytically solve for an exploratory equilibrium investment policy, which is a Gaussian distribution centered on the equilibrium control of the classical MV problem. Our approach accounts for time-inconsistent preferences and actions, and our equilibrium policy is the best option an investor can take at any given time during the investment period. Moreover, we leverage the martingale properties of the equilibrium policy, design a RL model, and propose an Actor-Critic RL algorithm. All of our RL model parameters converge to the corresponding true values in a simulation study. Our numerical study on 24 years of real market data shows that the proposed RL model is profitable in 13 out of 14 tests, demonstrating its practical applicability in real world investment.
Similar Papers
The Exploratory Multi-Asset Mean-Variance Portfolio Selection using Reinforcement Learning
Mathematical Finance
Helps computers pick the best stocks to buy.
Exploratory Mean-Variance Portfolio Optimization with Regime-Switching Market Dynamics
Portfolio Management
Helps invest money better when markets change.
Mean--Variance Portfolio Selection by Continuous-Time Reinforcement Learning: Algorithms, Regret Analysis, and Empirical Study
Portfolio Management
Helps investors pick winning stocks automatically.