CoinRobot: Generalized End-to-end Robotic Learning for Physical Intelligence
By: Yu Zhao , Huxian Liu , Xiang Chen and more
Potential Business Impact:
Robots learn new jobs faster on different machines.
Physical intelligence holds immense promise for advancing embodied intelligence, enabling robots to acquire complex behaviors from demonstrations. However, achieving generalization and transfer across diverse robotic platforms and environments requires careful design of model architectures, training strategies, and data diversity. Meanwhile existing systems often struggle with scalability, adaptability to heterogeneous hardware, and objective evaluation in real-world settings. We present a generalized end-to-end robotic learning framework designed to bridge this gap. Our framework introduces a unified architecture that supports cross-platform adaptability, enabling seamless deployment across industrial-grade robots, collaborative arms, and novel embodiments without task-specific modifications. By integrating multi-task learning with streamlined network designs, it achieves more robust performance than conventional approaches, while maintaining compatibility with varying sensor configurations and action spaces. We validate our framework through extensive experiments on seven manipulation tasks. Notably, Diffusion-based models trained in our framework demonstrated superior performance and generalizability compared to the LeRobot framework, achieving performance improvements across diverse robotic platforms and environmental conditions.
Similar Papers
Autonomous Embodied Agents: When Robotics Meets Deep Learning Reasoning
Robotics
Robots learn to do tasks in new places.
PhysicalAgent: Towards General Cognitive Robotics with Foundation World Models
Robotics
Robots learn to do tasks by watching videos.
Bayesian Inverse Physics for Neuro-Symbolic Robot Learning
Robotics
Robots learn to think and adapt in new places.