Bridging Simulation and Reality: Cross-Domain Transfer with Semantic 2D Gaussian Splatting
By: Jian Tang , Pu Pang , Haowen Sun and more
Potential Business Impact:
Robots learn tasks from simulations and do them in real life.
Cross-domain transfer in robotic manipulation remains a longstanding challenge due to the significant domain gap between simulated and real-world environments. Existing methods such as domain randomization, adaptation, and sim-real calibration often require extensive tuning or fail to generalize to unseen scenarios. To address this issue, we observe that if domain-invariant features are utilized during policy training in simulation, and the same features can be extracted and provided as the input to policy during real-world deployment, the domain gap can be effectively bridged, leading to significantly improved policy generalization. Accordingly, we propose Semantic 2D Gaussian Splatting (S2GS), a novel representation method that extracts object-centric, domain-invariant spatial features. S2GS constructs multi-view 2D semantic fields and projects them into a unified 3D space via feature-level Gaussian splatting. A semantic filtering mechanism removes irrelevant background content, ensuring clean and consistent inputs for policy learning. To evaluate the effectiveness of S2GS, we adopt Diffusion Policy as the downstream learning algorithm and conduct experiments in the ManiSkill simulation environment, followed by real-world deployment. Results demonstrate that S2GS significantly improves sim-to-real transferability, maintaining high and stable task performance in real-world scenarios.
Similar Papers
High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting
Robotics
Makes robots learn real tasks from computer simulations.
Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation
Robotics
Robots learn better from fake 3D scenes.
CoRe-GS: Coarse-to-Refined Gaussian Splatting with Semantic Object Focus
CV and Pattern Recognition
Drones build 3D maps of important things faster.