Boosting Skeleton-based Zero-Shot Action Recognition with Training-Free Test-Time Adaptation

Published: December 12, 2025 | arXiv ID: 2512.11458v1

By: Jingmin Zhu, Anqi Zhu, Hossein Rahmani and more

Potential Business Impact:

Helps computers recognize new human movements.

Business Areas:

Motion Capture, Media and Entertainment, Video

We introduce Skeleton-Cache, the first training-free test-time adaptation framework for skeleton-based zero-shot action recognition (SZAR), aimed at improving model generalization to unseen actions during inference. Skeleton-Cache reformulates inference as a lightweight retrieval process over a non-parametric cache that stores structured skeleton representations, combining both global and fine-grained local descriptors. To guide the fusion of descriptor-wise predictions, we leverage the semantic reasoning capabilities of large language models (LLMs) to assign class-specific importance weights. By integrating these structured descriptors with LLM-guided semantic priors, Skeleton-Cache dynamically adapts to unseen actions without any additional training or access to training data. Extensive experiments on NTU RGB+D 60/120 and PKU-MMD II demonstrate that Skeleton-Cache consistently boosts the performance of various SZAR backbones under both zero-shot and generalized zero-shot settings. The code is publicly available at https://github.com/Alchemist0754/Skeleton-Cache.
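The retrieval-and-fusion idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that); it only assumes what the abstract states: a non-parametric cache of global and local descriptors, and class-specific importance weights that guide the fusion of descriptor-wise predictions. The names `DescriptorCache` and `fuse`, the use of cosine similarity, the additive combination with the backbone logits, the `alpha` parameter, and the hand-written weights (standing in for LLM-assigned ones) are all assumptions made for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class DescriptorCache:
    """Hypothetical non-parametric cache: one list of (feature, label) per descriptor."""

    def __init__(self, num_classes, descriptor_names):
        self.num_classes = num_classes
        self.entries = {name: [] for name in descriptor_names}

    def add(self, descriptors, label):
        # Store every descriptor of one test sample under its (pseudo-)label.
        for name, feature in descriptors.items():
            self.entries[name].append((feature, label))

    def descriptor_scores(self, name, feature):
        # Per-class score for one descriptor: summed non-negative similarity
        # to the cached entries of that class (an assumed scoring rule).
        scores = [0.0] * self.num_classes
        for cached_feature, label in self.entries[name]:
            scores[label] += max(cosine(feature, cached_feature), 0.0)
        return scores


def fuse(cache, descriptors, class_weights, base_logits, alpha=1.0):
    """Add weighted descriptor-wise cache scores to the backbone's zero-shot logits.

    class_weights[c][name] is the importance of descriptor `name` for class c;
    in this sketch the values are supplied by hand rather than by an LLM.
    """
    fused = list(base_logits)
    for name, feature in descriptors.items():
        scores = cache.descriptor_scores(name, feature)
        for c in range(cache.num_classes):
            fused[c] += alpha * class_weights[c][name] * scores[c]
    return fused


# Toy usage: two classes, one global and one local ("arm") descriptor.
cache = DescriptorCache(num_classes=2, descriptor_names=["global", "arm"])
cache.add({"global": [1.0, 0.0], "arm": [1.0, 0.0]}, label=0)
cache.add({"global": [0.0, 1.0], "arm": [0.0, 1.0]}, label=1)

# Both classes are assumed to depend mostly on arm motion.
class_weights = [{"global": 0.2, "arm": 0.8}, {"global": 0.2, "arm": 0.8}]

# The query's global descriptor resembles class 0, its arm descriptor class 1.
query = {"global": [1.0, 0.0], "arm": [0.0, 1.0]}
fused = fuse(cache, query, class_weights, base_logits=[0.0, 0.0])
predicted = fused.index(max(fused))
```

In this toy case the backbone logits are uninformative and the two descriptors disagree, so the class-specific weights decide the outcome: the arm descriptor dominates and the query is assigned to class 1. No parameters are trained at any point; adaptation happens only through what is stored in the cache.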

SkeletonAgent: An Agentic Interaction Framework for Skeleton-based Action Recognition

CV and Pattern Recognition

Helps computers understand human movements better.

27 Nov 2025 1

88%

SUGAR: Learning Skeleton Representation with Visual-Motion Knowledge for Action Recognition

CV and Pattern Recognition

Teaches computers to understand human movements.

13 Nov 2025 0

88%

Action Hints: Semantic Typicality and Context Uniqueness for Generalizable Skeleton-based Video Anomaly Detection

CV and Pattern Recognition

Finds strange actions in videos without prior examples.

14 Sep 2025 1

Country of Origin

🇦🇺 Australia

Repos / Data Links

github.com

Page Count

23 pages