MORSE: Multi-Objective Reinforcement Learning via Strategy Evolution for Supply Chain Optimization

Published: September 8, 2025 | arXiv ID: 2509.06490v1

By: Niki Kotecha, Ehecatl Antonio del Rio Chanona

Potential Business Impact:

Helps companies make better, faster supply chain choices.

Business Areas:
Autonomous Vehicles Transportation

In supply chain management, decision-making often involves balancing multiple conflicting objectives, such as cost reduction, service level improvement, and environmental sustainability. Traditional multi-objective optimization methods, such as linear programming and evolutionary algorithms, struggle to adapt in real time to the dynamic nature of supply chains. In this paper, we propose an approach that combines Reinforcement Learning (RL) and Multi-Objective Evolutionary Algorithms (MOEAs) to address dynamic multi-objective optimization under uncertainty. Our method uses MOEAs to search the parameter space of policy neural networks, generating a Pareto front of policies. This gives decision-makers a diverse population of policies that they can switch between as the system's objectives change, providing flexibility and adaptability in real-time decision-making. We also incorporate Conditional Value-at-Risk (CVaR) to support risk-sensitive decision-making, enhancing resilience in uncertain environments. We demonstrate the effectiveness of our approach through case studies that showcase its ability to respond to supply chain dynamics, and we show that it outperforms state-of-the-art methods in an inventory management case study. The proposed strategy not only improves decision-making efficiency but also offers a more robust framework for managing uncertainty and optimizing performance in supply chains.
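
The two building blocks named in the abstract, a Pareto front over candidate policies and a CVaR risk measure, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`cvar`, `dominates`, `pareto_front`), the tail fraction `alpha`, and the convention that every objective is maximized are assumptions made for illustration. Here CVaR is taken as the mean of the worst `alpha` fraction of sampled episode returns, and each policy is assumed to have already been scored on a tuple of objectives.

```python
import math


def cvar(returns, alpha=0.1):
    """Mean of the worst alpha-fraction of sampled returns (higher is better).

    Assumption: 'returns' are episode returns for one policy, and the
    risk of interest lies in the lower tail.
    """
    if not returns:
        raise ValueError("returns must be non-empty")
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    ordered = sorted(returns)
    # Always keep at least one sample in the tail.
    k = max(1, math.ceil(alpha * len(ordered)))
    return sum(ordered[:k]) / k


def dominates(a, b):
    """True if score vector a is no worse than b everywhere and better somewhere."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(scores):
    """Indices of the non-dominated score vectors (all objectives maximized)."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


# Hypothetical scores for four policies on two objectives,
# e.g. (negative cost, service level); the values are made up.
policy_scores = [(1, 5), (2, 4), (1, 4), (3, 1)]
front = pareto_front(policy_scores)

# Hypothetical episode returns for one policy; worst half is averaged.
tail_risk = cvar([4, -2, 6, 0], alpha=0.5)
```

In a setup like the one described, a decision-maker would pick among the policies indexed by `front` as priorities shift, and a CVaR value such as `tail_risk` could serve as one of the objectives or as a screening criterion. How the paper actually combines the two is not specified in the abstract.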

Country of Origin
🇬🇧 United Kingdom

Page Count
33 pages

Category
Computer Science:
Artificial Intelligence