Score: 0

SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation

Published: November 12, 2025 | arXiv ID: 2511.08872v1

By: Hu Cui , Wenqiang Hua , Renjing Huang and more

Potential Business Impact:

Helps computers understand human body movements better.

Business Areas:
Motion Capture Media and Entertainment, Video

Recently, the Mamba architecture based on State Space Models (SSMs) has gained attention in 3D human pose estimation due to its linear complexity and strong global modeling capability. However, existing SSM-based methods typically apply manually designed scan operations to flatten detected 2D pose sequences into purely temporal sequences, either locally or globally. This approach disrupts the inherent spatial structure of human poses and entangles spatial and temporal features, making it difficult to capture complex pose dependencies. To address these limitations, we propose the Skeleton Structure-Aware Stride SSM (SAS-SSM), which first employs a structure-aware spatiotemporal convolution to dynamically capture essential local interactions between joints, and then applies a stride-based scan strategy to construct multi-scale global structural representations. This enables flexible modeling of both local and global pose information while maintaining linear computational complexity. Built upon SAS-SSM, our model SasMamba achieves competitive 3D pose estimation performance with significantly fewer parameters compared to existing hybrid models. The source code is available at https://hucui2022.github.io/sasmamba_proj/.

Country of Origin
🇯🇵 Japan

Page Count
15 pages

Category
Computer Science:
CV and Pattern Recognition