Score: 1

CRAFT: Coaching Reinforcement Learning Autonomously using Foundation Models for Multi-Robot Coordination Tasks

Published: September 17, 2025 | arXiv ID: 2509.14380v1

By: Seoyeon Choi , Kanghyun Ryu , Jonghoon Ock and more

BigTech Affiliations: University of California, Berkeley

Potential Business Impact:

Teaches robots to work together on hard jobs.

Business Areas:

Robotics Hardware, Science and Engineering, Software

Multi-Agent Reinforcement Learning (MARL) provides a powerful framework for learning coordination in multi-agent systems. However, applying MARL to robotics still remains challenging due to high-dimensional continuous joint action spaces, complex reward design, and non-stationary transitions inherent to decentralized settings. On the other hand, humans learn complex coordination through staged curricula, where long-horizon behaviors are progressively built upon simpler skills. Motivated by this, we propose CRAFT: Coaching Reinforcement learning Autonomously using Foundation models for multi-robot coordination Tasks, a framework that leverages the reasoning capabilities of foundation models to act as a "coach" for multi-robot coordination. CRAFT automatically decomposes long-horizon coordination tasks into sequences of subtasks using the planning capability of Large Language Models (LLMs). In what follows, CRAFT trains each subtask using reward functions generated by LLM, and refines them through a Vision Language Model (VLM)-guided reward-refinement loop. We evaluate CRAFT on multi-quadruped navigation and bimanual manipulation tasks, demonstrating its capability to learn complex coordination behaviors. In addition, we validate the multi-quadruped navigation policy in real hardware experiments.

LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation

Robotics

Robots learn tasks faster with AI help.

2 Jun 2025 0

89%

Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy

CV and Pattern Recognition

Helps robots work together on hard jobs.

13 Mar 2025 0

89%

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

Machine Learning (CS)

Teaches AI to control traffic better using smart lessons.

24 Nov 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

9 pages

CRAFT: Coaching Reinforcement Learning Autonomously using Foundation Models for Multi-Robot Coordination Tasks

Teaches robots to work together on hard jobs.

Technical Abstract

LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation

Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization