Robot Policy Evaluation for Sim-to-Real Transfer: A Benchmarking Perspective
By: Xuning Yang , Clemens Eppner , Jonathan Tremblay and more
Potential Business Impact:
Helps robots learn in games, then work in real life.
Current vision-based robotics simulation benchmarks have significantly advanced robotic manipulation research. However, robotics is fundamentally a real-world problem, and evaluation for real-world applications has lagged behind in evaluating generalist policies. In this paper, we discuss challenges and desiderata in designing benchmarks for generalist robotic manipulation policies for the goal of sim-to-real policy transfer. We propose 1) utilizing high visual-fidelity simulation for improved sim-to-real transfer, 2) evaluating policies by systematically increasing task complexity and scenario perturbation to assess robustness, and 3) quantifying performance alignment between real-world performance and its simulation counterparts.
Similar Papers
RobotArena $\infty$: Scalable Robot Benchmarking via Real-to-Sim Translation
Robotics
Tests robots better using videos and online help.
Sim-to-Real Transfer in Deep Reinforcement Learning for Bipedal Locomotion
Robotics
Robots learn to walk outside the computer.
Synthetic vs. Real Training Data for Visual Navigation
Robotics
Robots learn to navigate better using computer games.