MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution
By: Linfeng He, Meiqin Liu, Qi Tang and more
Potential Business Impact:
Makes blurry videos sharp and clear.
Video super-resolution (VSR) faces critical challenges in effectively modeling non-local dependencies across misaligned frames while preserving computational efficiency. Existing VSR methods typically rely on optical flow strategies or Transformer architectures, which struggle with large motion displacements and long video sequences. To address this, we propose MambaVSR, the first state-space model framework for VSR that incorporates an innovative content-aware scanning mechanism. Unlike the rigid 1D sequential processing in conventional vision Mamba methods, MambaVSR enables dynamic spatiotemporal interactions through Shared Compass Construction (SCC) and Content-Aware Sequentialization (CAS). Specifically, the SCC module constructs intra-frame semantic connectivity graphs via efficient sparse attention and generates adaptive spatial scanning sequences through spectral clustering. Building upon SCC, the CAS module aligns and aggregates non-local similar content across multiple frames by interleaving temporal features along the learned spatial order. To bridge global dependencies with local details, the Global-Local State Space Block (GLSSB) integrates window self-attention operations with SSM-based feature propagation, enabling high-frequency detail recovery under global dependency guidance. Extensive experiments validate MambaVSR's superiority, outperforming the Transformer-based method by 0.58 dB PSNR on the REDS dataset with 55% fewer parameters.
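The scanning idea in the abstract (derive a spatial order from a content graph, then interleave frames along that order) can be illustrated with a rough NumPy sketch. This is not the authors' implementation. It assumes a dense cosine-similarity k-nearest-neighbour graph in place of the paper's sparse attention, and a one-dimensional spectral ordering (sorting tokens by the Fiedler vector of the graph Laplacian) in place of full spectral clustering. The function names spectral_scan_order and interleave_frames, the parameter k, and the choice of the middle frame as the reference are all hypothetical.

```python
import numpy as np


def spectral_scan_order(feats, k=4):
    """Return a content-driven permutation of token indices.

    feats: (N, C) array of per-token features from one frame.
    Assumption: a dense cosine-similarity kNN graph stands in for the
    sparse-attention graph described in the abstract.
    """
    n = feats.shape[0]
    normed = feats / (np.linalg.norm(feats, axis=1, keepdims=True) + 1e-8)
    sim = normed @ normed.T
    np.fill_diagonal(sim, -np.inf)  # exclude self-matches from the top-k

    # Keep only the k most similar neighbours of each token.
    nbrs = np.argsort(-sim, axis=1)[:, :k]
    rows = np.repeat(np.arange(n), k)
    cols = nbrs.ravel()
    adj = np.zeros((n, n))
    adj[rows, cols] = np.clip(sim[rows, cols], 0.0, None) + 1e-6
    adj = np.maximum(adj, adj.T)  # symmetrise the graph

    # Unnormalised graph Laplacian L = D - A.
    lap = np.diag(adj.sum(axis=1)) - adj
    _, eigvecs = np.linalg.eigh(lap)  # eigenvalues in ascending order

    # Assumption: sorting by the second eigenvector (Fiedler vector)
    # replaces spectral clustering as a simple 1D spectral ordering.
    return np.argsort(eigvecs[:, 1], kind="stable")


def interleave_frames(frames, order):
    """Flatten (T, N, C) features into one (N*T, C) sequence.

    Tokens are visited in the shared spatial order; at each spatial
    position the T frames are placed next to each other.
    """
    t, n, c = frames.shape
    return frames[:, order, :].transpose(1, 0, 2).reshape(n * t, c)


# Toy input: 3 frames, a 4x4 grid of tokens, 8 channels per token.
rng = np.random.default_rng(0)
frames = rng.standard_normal((3, 16, 8))

# Hypothetical choice: derive the shared order from the middle frame.
order = spectral_scan_order(frames[1], k=4)
seq = interleave_frames(frames, order)  # shape (48, 8)
```

In this sketch, seq is the 1D sequence a state-space layer would consume: tokens that are close in the content graph tend to end up close in the scan, and each token sits beside its counterparts from the other frames. The sign of an eigenvector is arbitrary, so the order may come out reversed between runs or platforms.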
Similar Papers
Trajectory-aware Shifted State Space Models for Online Video Super-Resolution
CV and Pattern Recognition
Makes blurry videos sharp using past frames.
Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution
CV and Pattern Recognition
Makes blurry videos clear without weird glitches.
GPSMamba: A Global Phase and Spectral Prompt-guided Mamba for Infrared Image Super-Resolution
CV and Pattern Recognition
Makes blurry night-vision pictures clear.