MLM: Learning Multi-task Loco-Manipulation Whole-Body Control for Quadruped Robot with Arm
By: Xin Liu , Bida Ma , Chenkun Qi and more
Potential Business Impact:
Robot dog with arm learns many jobs.
Whole-body loco-manipulation for quadruped robots with arm remains a challenging problem, particularly in achieving multi-task control. To address this, we propose MLM, a reinforcement learning framework driven by both real-world and simulation data. It enables a six-DoF robotic arm--equipped quadruped robot to perform whole-body loco-manipulation for multiple tasks autonomously or under human teleoperation. To address the problem of balancing multiple tasks during the learning of loco-manipulation, we introduce a trajectory library with an adaptive, curriculum-based sampling mechanism. This approach allows the policy to efficiently leverage real-world collected trajectories for learning multi-task loco-manipulation. To address deployment scenarios with only historical observations and to enhance the performance of policy execution across tasks with different spatial ranges, we propose a Trajectory-Velocity Prediction policy network. It predicts unobservable future trajectories and velocities. By leveraging extensive simulation data and curriculum-based rewards, our controller achieves whole-body behaviors in simulation and zero-shot transfer to real-world deployment. Ablation studies in simulation verify the necessity and effectiveness of our approach, while real-world experiments on the Go2 robot with an Airbot robotic arm demonstrate the policy's good performance in multi-task execution.
Similar Papers
Kinematics-Aware Multi-Policy Reinforcement Learning for Force-Capable Humanoid Loco-Manipulation
Robotics
Robots learn to lift heavy things and move.
DemoHLM: From One Demonstration to Generalizable Humanoid Loco-Manipulation
Robotics
Robots learn to move and grab from one example.
WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control
Robotics
Robots can now reach and grab things anywhere.