Attention-Enhanced Reinforcement Learning for Dynamic Portfolio Optimization
By: Pei Xue, Yuanchun Ye
Potential Business Impact:
Helps money managers pick winning stocks better.
We develop a deep reinforcement learning framework for dynamic portfolio optimization that combines a Dirichlet policy with cross-sectional attention mechanisms. The Dirichlet formulation ensures that portfolio weights are always feasible, handles tradability constraints naturally, and provides a stable way to explore the allocation space. The model integrates per-asset temporal encoders with a global attention layer, allowing it to capture sector relationships, factor spillovers, and other cross asset dependencies. The reward function includes transaction costs and portfolio variance penalties, linking the learning objective to traditional mean variance trade offs. The results show that attention based Dirichlet policies outperform equal-weight and standard reinforcement learning benchmarks in terms of terminal wealth and Sharpe ratio, while maintaining realistic turnover and drawdown levels. Overall, the study shows that combining principled action design with attention-based representations improves both the stability and interpretability of reinforcement learning for portfolio management.
Similar Papers
A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market
Portfolio Management
Makes money grow faster by smart investing.
Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization
Portfolio Management
Teaches computers to invest money safely.
Interpretable Learning Dynamics in Unsupervised Reinforcement Learning
Machine Learning (CS)
Helps robots learn faster by watching what's interesting.