Learning to Lead Themselves: Agentic AI in MAS using MARL
By: Ansh Kamthan
Potential Business Impact:
Drones work together to deliver packages faster.
As autonomous systems move from prototypes to real deployments, the ability of multiple agents to make decentralized, cooperative decisions becomes a core requirement. This paper examines how agentic artificial intelligence, agents that act independently, adaptively and proactively can improve task allocation and coordination in multi-agent systems, with primary emphasis on drone delivery and secondary relevance to warehouse automation. We formulate the problem in a cooperative multi-agent reinforcement learning setting and implement a lightweight multi-agent Proximal Policy Optimization, called IPPO, approach in PyTorch under a centralized-training, decentralized-execution paradigm. Experiments are conducted in PettingZoo environment, where multiple homogeneous drones or agents must self-organize to cover distinct targets without explicit communication.
Similar Papers
Multi-agent Robust and Optimal Policy Learning for Data Harvesting
Systems and Control
Drones collect sensor data faster and smarter.
Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration
Artificial Intelligence
Fixes power grids faster after blackouts.
Energy Efficient Task Offloading in UAV-Enabled MEC Using a Fully Decentralized Deep Reinforcement Learning Approach
Multiagent Systems
Drones fly smarter by talking to neighbors.