Integrating Diffusion-based Multi-task Learning with Online Reinforcement Learning for Robust Quadruped Robot Control

Published: July 8, 2025 | arXiv ID: 2507.05674v2

By: Xinyao Qin, Xiaoteng Ma, Yang Qi and more

Potential Business Impact:

Robots walk and change tasks by voice command.

Business Areas:

Robotics Hardware, Science and Engineering, Software

Recent research has highlighted the powerful capabilities of imitation learning in robotics. Leveraging generative models, particularly diffusion models, these approaches offer notable advantages such as strong multi-task generalization, effective language conditioning, and high sample efficiency. While they have been applied successfully to manipulation tasks, their use in legged locomotion remains relatively underexplored, mainly because compounding errors degrade stability and because task transitions are difficult to learn from limited data. Online reinforcement learning (RL) has demonstrated promising results in legged robot control in recent years, providing valuable insights for addressing these challenges. In this work, we propose DMLoco, a diffusion-based framework for quadruped robots that integrates multi-task pretraining with online PPO finetuning to enable language-conditioned control and robust task transitions. Our approach first pretrains the policy on a diverse multi-task dataset using diffusion models, enabling language-guided execution of various skills. It then finetunes the policy in simulation to ensure robustness and stable task transitions during real-world deployment. By using Denoising Diffusion Implicit Models (DDIM) for efficient sampling and TensorRT for optimized deployment, our policy runs onboard at 50 Hz, offering a scalable and efficient solution for adaptive, language-guided locomotion on resource-constrained robotic platforms.
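The abstract credits DDIM with making sampling cheap enough for onboard control; at 50 Hz the policy has at most 1/50 s = 20 ms per action, so the number of denoising steps matters. The following is a minimal sketch of deterministic DDIM sampling (eta = 0) over a subsampled timestep schedule, not the authors' implementation. The linear beta schedule, the step counts, and the names `make_alpha_bars`, `ddim_sample`, `make_oracle_eps_model`, and `eps_model` are all assumptions made for illustration; the abstract does not specify the network, the schedule, or the number of steps.

```python
import math
import random


def make_alpha_bars(num_train_steps=100, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for an assumed linear beta schedule."""
    alpha_bars, prod = [], 1.0
    for t in range(num_train_steps):
        beta = beta_start + (beta_end - beta_start) * t / (num_train_steps - 1)
        prod *= 1.0 - beta
        alpha_bars.append(prod)
    return alpha_bars


def ddim_sample(eps_model, obs, action_dim, alpha_bars, num_sample_steps=5, seed=0):
    """Deterministic DDIM sampling (eta = 0) with few denoising steps.

    eps_model(x_t, t, obs) is a hypothetical noise predictor that returns a
    list of floats with the same length as x_t.
    """
    rng = random.Random(seed)
    num_train_steps = len(alpha_bars)
    # Visit only an evenly spaced subset of the training timesteps, high to low.
    stride = max(1, num_train_steps // num_sample_steps)
    timesteps = list(range(num_train_steps - 1, -1, -stride))[:num_sample_steps]

    # Start from pure Gaussian noise in action space.
    x = [rng.gauss(0.0, 1.0) for _ in range(action_dim)]
    for i, t in enumerate(timesteps):
        a_t = alpha_bars[t]
        # After the last step we target the clean sample, i.e. alpha_bar = 1.
        a_prev = alpha_bars[timesteps[i + 1]] if i + 1 < len(timesteps) else 1.0
        eps = eps_model(x, t, obs)
        # Predict the clean action implied by the current noise estimate.
        x0 = [(xi - math.sqrt(1.0 - a_t) * ei) / math.sqrt(a_t)
              for xi, ei in zip(x, eps)]
        # Jump directly to the previous (less noisy) timestep; no fresh noise.
        x = [math.sqrt(a_prev) * x0i + math.sqrt(1.0 - a_prev) * ei
             for x0i, ei in zip(x0, eps)]
    return x


def make_oracle_eps_model(target, alpha_bars):
    """Toy stand-in for a trained network: returns the exact noise for a known target."""
    def eps_model(x_t, t, obs):
        a_t = alpha_bars[t]
        return [(xi - math.sqrt(a_t) * ti) / math.sqrt(1.0 - a_t)
                for xi, ti in zip(x_t, target)]
    return eps_model
```

With the oracle noise predictor, `ddim_sample` should recover the target action up to floating-point error in five steps, which is a quick sanity check on the update rule. A real policy would replace the oracle with a learned network conditioned on the observation and language command.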

Multi-Loco: Unifying Multi-Embodiment Legged Locomotion via Reinforcement Learning Augmented Diffusion

Robotics

Robots learn to walk on any legs.

13 Jun 2025 0

91%

Flexible Locomotion Learning with Diffusion Model Predictive Control

Robotics

Robots learn to walk and change how they move.

5 Oct 2025 0

89%

MLM: Learning Multi-task Loco-Manipulation Whole-Body Control for Quadruped Robot with Arm

Robotics

Robot dog with arm learns many jobs.

14 Aug 2025 0

Repos / Data Links

github.com

Page Count

8 pages
