Semantic Communications with World Models
By: Peiwen Jiang , Jiajia Guo , Chao-Kai Wen and more
Potential Business Impact:
Saves internet by predicting video instead of sending it.
Semantic communication is a promising technique for emerging wireless applications, which reduces transmission overhead by transmitting only task-relevant features instead of raw data. However, existing methods struggle under extremely low bandwidth and varying channel conditions, where corrupted or missing semantics lead to severe reconstruction errors. To resolve this difficulty, we propose a world foundation model (WFM)-aided semantic video transmission framework that leverages the predictive capability of WFMs to generate future frames based on the current frame and textual guidance. This design allows transmissions to be omitted when predictions remain reliable, thereby saving bandwidth. Through WFM's prediction, the key semantics are preserved, yet minor prediction errors tend to amplify over time. To mitigate issue, a lightweight depth-based feedback module is introduced to determine whether transmission of the current frame is needed. Apart from transmitting the entire frame, a segmentation-assisted partial transmission method is proposed to repair degraded frames, which can further balance performance and bandwidth cost. Furthermore, an active transmission strategy is developed for mobile scenarios by exploiting camera trajectory information and proactively scheduling transmissions before channel quality deteriorates. Simulation results show that the proposed framework significantly reduces transmission overhead while maintaining task performances across varying scenarios and channel conditions.
Similar Papers
Diffusion-Aided Bandwidth-Efficient Semantic Communication with Adaptive Requests
Information Theory
Sends pictures using less data, fixing errors.
Distributionally Robust Wireless Semantic Communication with Large AI Models
Networking and Internet Architecture
Makes wireless messages understandable even with errors.
Diffusion-Aided Bandwidth-Efficient Semantic Communication with Adaptive Requests
Information Theory
Sends pictures using fewer words, fixing errors.