PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes
By: Luca Collorone , Mert Kiray , Indro Spinelli and more
Realistic visual simulations are omnipresent, yet their creation requires computing time, rendering, and expert animation knowledge. Open-vocabulary visual effects generation from text inputs emerges as a promising solution that can unlock immense creative potential. However, current pipelines lack both physical realism and effective language interfaces, requiring slow offline optimization. In contrast, PhysTalk takes a 3D Gaussian Splatting (3DGS) scene as input and translates arbitrary user prompts into real time, physics based, interactive 4D animations. A large language model (LLM) generates executable code that directly modifies 3DGS parameters through lightweight proxies and particle dynamics. Notably, PhysTalk is the first framework to couple 3DGS directly with a physics simulator without relying on time consuming mesh extraction. While remaining open vocabulary, this design enables interactive 3D Gaussian animation via collision aware, physics based manipulation of arbitrary, multi material objects. Finally, PhysTalk is train-free and computationally lightweight: this makes 4D animation broadly accessible and shifts these workflows from a "render and wait" paradigm toward an interactive dialogue with a modern, physics-informed pipeline.
Similar Papers
SplatTalk: 3D VQA with Gaussian Splatting
CV and Pattern Recognition
Lets computers understand 3D worlds from pictures.
GS-Verse: Mesh-based Gaussian Splatting for Physics-aware Interaction in Virtual Reality
Graphics
Makes virtual objects bend and stretch realistically.
PGSTalker: Real-Time Audio-Driven Talking Head Generation via 3D Gaussian Splatting with Pixel-Aware Density Control
Sound
Makes computer faces talk realistically with sound.