MORSE: Multi-Objective Reinforcement Learning via Strategy Evolution for Supply Chain Optimization
By: Niki Kotecha, Ehecatl Antonio del Rio Chanona
Potential Business Impact:
Helps companies make better, faster supply chain choices.
In supply chain management, decision-making often involves balancing multiple conflicting objectives, such as cost reduction, service level improvement, and environmental sustainability. Traditional multi-objective optimization methods, such as linear programming and evolutionary algorithms, struggle to adapt in real-time to the dynamic nature of supply chains. In this paper, we propose an approach that combines Reinforcement Learning (RL) and Multi-Objective Evolutionary Algorithms (MOEAs) to address these challenges for dynamic multi-objective optimization under uncertainty. Our method leverages MOEAs to search the parameter space of policy neural networks, generating a Pareto front of policies. This provides decision-makers with a diverse population of policies that can be dynamically switched based on the current system objectives, ensuring flexibility and adaptability in real-time decision-making. We also introduce Conditional Value-at-Risk (CVaR) to incorporate risk-sensitive decision-making, enhancing resilience in uncertain environments. We demonstrate the effectiveness of our approach through case studies, showcasing its ability to respond to supply chain dynamics and outperforming state-of-the-art methods in an inventory management case study. The proposed strategy not only improves decision-making efficiency but also offers a more robust framework for managing uncertainty and optimizing performance in supply chains.
Similar Papers
Evolutionary Reinforcement Learning for Interpretable Decision-Making in Supply Chain Management
Neural and Evolutionary Computing
Makes smart factory decisions easy to understand.
Iterative Multi-Agent Reinforcement Learning: A Novel Approach Toward Real-World Multi-Echelon Inventory Optimization
Machine Learning (CS)
Makes stores keep just enough stuff, not too much.
Enhancing Decision Space Diversity in Multi-Objective Evolutionary Optimization for the Diet Problem
Neural and Evolutionary Computing
Finds best food mixes with many healthy choices.