Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion
By: Shaowei Liu , David Yifan Yao , Saurabh Gupta and more
Potential Business Impact:
Aligns videos from different cameras perfectly.
Today, people can easily record memorable moments, ranging from concerts, sports events, lectures, family gatherings, and birthday parties with multiple consumer cameras. However, synchronizing these cross-camera streams remains challenging. Existing methods assume controlled settings, specific targets, manual correction, or costly hardware. We present VisualSync, an optimization framework based on multi-view dynamics that aligns unposed, unsynchronized videos at millisecond accuracy. Our key insight is that any moving 3D point, when co-visible in two cameras, obeys epipolar constraints once properly synchronized. To exploit this, VisualSync leverages off-the-shelf 3D reconstruction, feature matching, and dense tracking to extract tracklets, relative poses, and cross-view correspondences. It then jointly minimizes the epipolar error to estimate each camera's time offset. Experiments on four diverse, challenging datasets show that VisualSync outperforms baseline methods, achieving an median synchronization error below 50 ms.
Similar Papers
RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems
CV and Pattern Recognition
Syncs many cameras perfectly, even different kinds.
SyncTrack4D: Cross-Video Motion Alignment and Video Synchronization for Multi-Video 4D Gaussian Splatting
CV and Pattern Recognition
Makes shaky videos look like smooth 3D movies.
Generative Video Motion Editing with 3D Point Tracks
CV and Pattern Recognition
Edits videos by changing how things move.