Score: 1

IC-World: In-Context Generation for Shared World Modeling

Published: December 1, 2025 | arXiv ID: 2512.02793v1

By: Fan Wu , Jiacheng Wei , Ruibo Li and more

Potential Business Impact:

Creates consistent 3D worlds from many pictures.

Business Areas:
Virtual World Community and Lifestyle, Media and Entertainment, Software

Video-based world models have recently garnered increasing attention for their ability to synthesize diverse and dynamic visual environments. In this paper, we focus on shared world modeling, where a model generates multiple videos from a set of input images, each representing the same underlying world in different camera poses. We propose IC-World, a novel generation framework, enabling parallel generation for all input images via activating the inherent in-context generation capability of large video models. We further finetune IC-World via reinforcement learning, Group Relative Policy Optimization, together with two proposed novel reward models to enforce scene-level geometry consistency and object-level motion consistency among the set of generated videos. Extensive experiments demonstrate that IC-World substantially outperforms state-of-the-art methods in both geometry and motion consistency. To the best of our knowledge, this is the first work to systematically explore the shared world modeling problem with video-based world models.

Country of Origin
πŸ‡ΈπŸ‡¬ Singapore

Repos / Data Links

Page Count
10 pages

Category
Computer Science:
CV and Pattern Recognition