Score: 0

cMALC-D: Contextual Multi-Agent LLM-Guided Curriculum Learning with Diversity-Based Context Blending

Published: August 28, 2025 | arXiv ID: 2508.20818v1

By: Anirudh Satheesh, Keenan Powell, Hua Wei

Potential Business Impact:

Teaches robots to handle new situations better.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Many multi-agent reinforcement learning (MARL) algorithms are trained in fixed simulation environments, making them brittle when deployed in real-world scenarios with more complex and uncertain conditions. Contextual MARL (cMARL) addresses this by parameterizing environments with context variables and training a context-agnostic policy that performs well across all environment configurations. Existing cMARL methods attempt to use curriculum learning to help train and evaluate context-agnostic policies, but they often rely on unreliable proxy signals, such as value estimates or generalized advantage estimates that are noisy and unstable in multi-agent settings due to inter-agent dynamics and partial observability. To address these issues, we propose Contextual Multi-Agent LLM-Guided Curriculum Learning with Diversity-Based Context Blending (cMALC-D), a framework that uses Large Language Models (LLMs) to generate semantically meaningful curricula and provide a more robust evaluation signal. To prevent mode collapse and encourage exploration, we introduce a novel diversity-based context blending mechanism that creates new training scenarios by combining features from prior contexts. Experiments in traffic signal control domains demonstrate that cMALC-D significantly improves both generalization and sample efficiency compared to existing curriculum learning baselines. We provide code at https://github.com/DaRL-LibSignal/cMALC-D.

Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning

Machine Learning (CS)

Helps AI teams learn tasks faster and better.

30 Oct 2025 3

89%

Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation

Artificial Intelligence

Helps AI teams work together better in games.

1 Jun 2025 0

88%

Multi-agent In-context Coordination via Decentralized Memory Retrieval

Multiagent Systems

Helps robot teams learn new jobs faster together.

13 Nov 2025 2

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

16 pages

cMALC-D: Contextual Multi-Agent LLM-Guided Curriculum Learning with Diversity-Based Context Blending

Teaches robots to handle new situations better.

Technical Abstract

Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning

Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation

Multi-agent In-context Coordination via Decentralized Memory Retrieval