Variational Contrastive Learning for Skeleton-based Action Recognition
By: Dang Dinh Nguyen, Decky Aspandi Latif, Titus Zaharia
Potential Business Impact:
Teaches computers to understand human movements better.
In recent years, self-supervised representation learning for skeleton-based action recognition has advanced with the development of contrastive learning methods. However, most contrastive paradigms are inherently discriminative and often struggle to capture the variability and uncertainty intrinsic to human motion. To address this issue, we propose a variational contrastive learning framework that integrates probabilistic latent modeling with contrastive self-supervised learning. This formulation enables the learning of structured and semantically meaningful representations that generalize across different datasets and supervision levels. Extensive experiments on three widely used skeleton-based action recognition benchmarks show that our proposed method consistently outperforms existing approaches, particularly in low-label regimes. Moreover, qualitative analyses show that, compared with other methods, the features learned by our method are more relevant to the motion and sample characteristics and focus more on important skeleton joints.
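The abstract does not spell out the training objective, so the following is only an illustrative sketch of how probabilistic latent modeling and a contrastive loss might be combined, not the authors' formulation. It assumes a diagonal Gaussian posterior per skeleton view, a KL penalty toward a standard normal prior, and an InfoNCE loss on sampled latents. All function names, the `beta` weight, and the `temperature` value are hypothetical choices made for this example.

```python
import math
import random

def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, I) (reparameterization trick)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for a single sample."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var))

def cosine(a, b):
    """Cosine similarity between two vectors given as lists of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb + 1e-12)

def info_nce(queries, keys, temperature=0.1):
    """Mean InfoNCE loss: keys[i] is the positive for queries[i], all other keys are negatives."""
    total = 0.0
    for i, q in enumerate(queries):
        logits = [cosine(q, k) / temperature for k in keys]
        top = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denom = top + math.log(sum(math.exp(l - top) for l in logits))
        total += log_denom - logits[i]
    return total / len(queries)

def variational_contrastive_loss(view_a, view_b, beta=0.01, temperature=0.1, seed=0):
    """Combine InfoNCE on sampled latents with a KL regularizer.

    view_a, view_b: lists of (mu, log_var) pairs, one per sample, standing in for
    the outputs of a (hypothetical) encoder applied to two augmented views.
    Returns (total, contrastive_term, kl_term).
    """
    rng = random.Random(seed)
    z_a = [reparameterize(mu, lv, rng) for mu, lv in view_a]
    z_b = [reparameterize(mu, lv, rng) for mu, lv in view_b]
    contrast = info_nce(z_a, z_b, temperature)
    pairs = list(view_a) + list(view_b)
    kl = sum(kl_to_standard_normal(mu, lv) for mu, lv in pairs) / len(pairs)
    return contrast + beta * kl, contrast, kl
```

In this sketch the KL term keeps each latent distribution from collapsing to a point, which is one plausible way a model could represent uncertainty in motion, while the InfoNCE term pulls the two views of the same sequence together. How the actual method balances or structures these terms would need to be checked against the paper itself.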
Similar Papers
Skeleton-Snippet Contrastive Learning with Multiscale Feature Fusion for Action Localization
CV and Pattern Recognition
Finds exact start and end of human actions.
MS-CLR: Multi-Skeleton Contrastive Learning for Human Action Recognition
CV and Pattern Recognition
Teaches computers to understand actions from different body poses.
Informative Sample Selection Model for Skeleton-based Action Recognition with Limited Training Samples
CV and Pattern Recognition
Teaches computers to recognize actions with less data.