Score: 1

Parts-Mamba: Augmenting Joint Context with Part-Level Scanning for Occluded Human Skeleton

Published: November 21, 2025 | arXiv ID: 2511.16860v1

By: Tianyi Shen , Huijuan Xu , Nilesh Ahuja and more

Potential Business Impact:

Helps computers understand actions even with missing body parts.

Business Areas:
Motion Capture Media and Entertainment, Video

Skeleton action recognition involves recognizing human action from human skeletons. The use of graph convolutional networks (GCNs) has driven major advances in this recognition task. In real-world scenarios, the captured skeletons are not always perfect or complete because of occlusions of parts of the human body or poor communication quality, leading to missing parts in skeletons or videos with missing frames. In the presence of such non-idealities, existing GCN models perform poorly due to missing local context. To address this limitation, we propose Parts-Mamba, a hybrid GCN-Mamba model designed to enhance the ability to capture and maintain contextual information from distant joints. The proposed Parts-Mamba model effectively captures part-specific information through its parts-specific scanning feature and preserves non-neighboring joint context via a parts-body fusion module. Our proposed model is evaluated on the NTU RGB+D 60 and NTU RGB+D 120 datasets under different occlusion settings, achieving up to 12.9% improvement in accuracy.

Page Count
5 pages

Category
Computer Science:
CV and Pattern Recognition