MARL Warehouse Robots
By: Price Allman , Lian Thang , Dre Simmons and more
Potential Business Impact:
Robots learn to work together to move packages.
We present a comparative study of multi-agent reinforcement learning (MARL) algorithms for cooperative warehouse robotics. We evaluate QMIX and IPPO on the Robotic Warehouse (RWARE) environment and a custom Unity 3D simulation. Our experiments reveal that QMIX's value decomposition significantly outperforms independent learning approaches (achieving 3.25 mean return vs. 0.38 for advanced IPPO), but requires extensive hyperparameter tuning -- particularly extended epsilon annealing (5M+ steps) for sparse reward discovery. We demonstrate successful deployment in Unity ML-Agents, achieving consistent package delivery after 1M training steps. While MARL shows promise for small-scale deployments (2-4 robots), significant scaling challenges remain. Code and analyses: https://pallman14.github.io/MARL-QMIX-Warehouse-Robots/
Similar Papers
Multi-Agent Reinforcement Learning and Real-Time Decision-Making in Robotic Soccer for Virtual Environments
Robotics
Teaches robot soccer teams to play better together.
LLM Collaboration With Multi-Agent Reinforcement Learning
Artificial Intelligence
Helps AI agents work together to write and code.
From Pixels to Cooperation Multi Agent Reinforcement Learning based on Multimodal World Models
Multiagent Systems
Teaches robots to work together using sight and sound.