SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation
By: Hu Cui , Wenqiang Hua , Renjing Huang and more
Potential Business Impact:
Helps computers understand human body movements better.
Recently, the Mamba architecture based on State Space Models (SSMs) has gained attention in 3D human pose estimation due to its linear complexity and strong global modeling capability. However, existing SSM-based methods typically apply manually designed scan operations to flatten detected 2D pose sequences into purely temporal sequences, either locally or globally. This approach disrupts the inherent spatial structure of human poses and entangles spatial and temporal features, making it difficult to capture complex pose dependencies. To address these limitations, we propose the Skeleton Structure-Aware Stride SSM (SAS-SSM), which first employs a structure-aware spatiotemporal convolution to dynamically capture essential local interactions between joints, and then applies a stride-based scan strategy to construct multi-scale global structural representations. This enables flexible modeling of both local and global pose information while maintaining linear computational complexity. Built upon SAS-SSM, our model SasMamba achieves competitive 3D pose estimation performance with significantly fewer parameters compared to existing hybrid models. The source code is available at https://hucui2022.github.io/sasmamba_proj/.
Similar Papers
First-order State Space Model for Lightweight Image Super-resolution
CV and Pattern Recognition
Makes pictures clearer with smarter computer math.
DefMamba: Deformable Visual State Space Model
CV and Pattern Recognition
Finds important parts of pictures better.
MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors
Human-Computer Interaction
Helps computers understand body movements better.