Score: 0

OXE-AugE: A Large-Scale Robot Augmentation of OXE for Scaling Cross-Embodiment Policy Learning

Published: December 15, 2025 | arXiv ID: 2512.13100v1

By: Guanhua Ji , Harsha Polavaram , Lawrence Yunliang Chen and more

Large and diverse datasets are needed for training generalist robot policies that have potential to control a variety of robot embodiments -- robot arm and gripper combinations -- across diverse tasks and environments. As re-collecting demonstrations and retraining for each new hardware platform are prohibitively costly, we show that existing robot data can be augmented for transfer and generalization. The Open X-Embodiment (OXE) dataset, which aggregates demonstrations from over 60 robot datasets, has been widely used as the foundation for training generalist policies. However, it is highly imbalanced: the top four robot types account for over 85\% of its real data, which risks overfitting to robot-scene combinations. We present AugE-Toolkit, a scalable robot augmentation pipeline, and OXE-AugE, a high-quality open-source dataset that augments OXE with 9 different robot embodiments. OXE-AugE provides over 4.4 million trajectories, more than triple the size of the original OXE. We conduct a systematic study of how scaling robot augmentation impacts cross-embodiment learning. Results suggest that augmenting datasets with diverse arms and grippers improves policy performance not only on the augmented robots, but also on unseen robots and even the original robots under distribution shifts. In physical experiments, we demonstrate that state-of-the-art generalist policies such as OpenVLA and $π_0$ benefit from fine-tuning on OXE-AugE, improving success rates by 24-45% on previously unseen robot-gripper combinations across four real-world manipulation tasks. Project website: https://OXE-AugE.github.io/.

Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

Robotics

Robots learn better by seeing more varied examples.

8 Aug 2025 0

87%

AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons

Robotics

Teaches robots to learn from watching people.

5 Mar 2025 1

87%

HumanoidExo: Scalable Whole-Body Humanoid Manipulation via Wearable Exoskeleton

Robotics

Teaches robots to move like humans faster.

3 Oct 2025 0

View PDF Login to Bookmark

OXE-AugE: A Large-Scale Robot Augmentation of OXE for Scaling Cross-Embodiment Policy Learning

Technical Abstract

Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons

HumanoidExo: Scalable Whole-Body Humanoid Manipulation via Wearable Exoskeleton