Embodied Image Compression
By: Chunyi Li , Rui Qing , Jianbo Zhang and more
Potential Business Impact:
Helps robots understand the world with less data.
Image Compression for Machines (ICM) has emerged as a pivotal research direction in the field of visual data compression. However, with the rapid evolution of machine intelligence, the target of compression has shifted from task-specific virtual models to Embodied agents operating in real-world environments. To address the communication constraints of Embodied AI in multi-agent systems and ensure real-time task execution, this paper introduces, for the first time, the scientific problem of Embodied Image Compression. We establish a standardized benchmark, EmbodiedComp, to facilitate systematic evaluation under ultra-low bitrate conditions in a closed-loop setting. Through extensive empirical studies in both simulated and real-world settings, we demonstrate that existing Vision-Language-Action models (VLAs) fail to reliably perform even simple manipulation tasks when compressed below the Embodied bitrate threshold. We anticipate that EmbodiedComp will catalyze the development of domain-specific compression tailored for Embodied agents , thereby accelerating the Embodied AI deployment in the Real-world.
Similar Papers
Static and Plugged: Make Embodied Evaluation Simple
CV and Pattern Recognition
Tests robots in pictures, not real life.
EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence
CV and Pattern Recognition
Robots learn to do tasks in the real world.
A Comprehensive Survey on World Models for Embodied AI
CV and Pattern Recognition
Helps robots learn to predict and act.