Score: 0

AOAD-MAT: Transformer-based multi-agent deep reinforcement learning model considering agents' order of action decisions

Published: October 15, 2025 | arXiv ID: 2510.13343v1

By: Shota Takayama, Katsuhide Fujita

Potential Business Impact:

Helps robot teams decide who acts next.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Multi-agent reinforcement learning focuses on training the behaviors of multiple learning agents that coexist in a shared environment. Recently, MARL models, such as the Multi-Agent Transformer (MAT) and ACtion dEpendent deep Q-learning (ACE), have significantly improved performance by leveraging sequential decision-making processes. Although these models can enhance performance, they do not explicitly consider the importance of the order in which agents make decisions. In this paper, we propose an Agent Order of Action Decisions-MAT (AOAD-MAT), a novel MAT model that considers the order in which agents make decisions. The proposed model explicitly incorporates the sequence of action decisions into the learning process, allowing the model to learn and predict the optimal order of agent actions. The AOAD-MAT model leverages a Transformer-based actor-critic architecture that dynamically adjusts the sequence of agent actions. To achieve this, we introduce a novel MARL architecture that cooperates with a subtask focused on predicting the next agent to act, integrated into a Proximal Policy Optimization based loss function to synergistically maximize the advantage of the sequential decision-making. The proposed method was validated through extensive experiments on the StarCraft Multi-Agent Challenge and Multi-Agent MuJoCo benchmarks. The experimental results show that the proposed AOAD-MAT model outperforms existing MAT and other baseline models, demonstrating the effectiveness of adjusting the AOAD order in MARL.

Sequential Multi-Agent Dynamic Algorithm Configuration

Machine Learning (CS)

Makes computer programs learn better by fixing settings.

27 Oct 2025 2

87%

MOMA-AC: A preference-driven actor-critic framework for continuous multi-objective multi-agent reinforcement learning

Machine Learning (CS)

Teaches robots to work together for many goals.

22 Nov 2025 0

86%

Multi-Agent Reinforcement Learning for Deadlock Handling among Autonomous Mobile Robots

Multiagent Systems

Robots avoid getting stuck in warehouses.

10 Nov 2025 0

View PDF Login to Bookmark

Page Count

19 pages

AOAD-MAT: Transformer-based multi-agent deep reinforcement learning model considering agents' order of action decisions

Helps robot teams decide who acts next.

Technical Abstract

Sequential Multi-Agent Dynamic Algorithm Configuration

MOMA-AC: A preference-driven actor-critic framework for continuous multi-objective multi-agent reinforcement learning

Multi-Agent Reinforcement Learning for Deadlock Handling among Autonomous Mobile Robots