SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation
By: Weihong Chen , Xuemiao Xu , Haoxin Yang and more
Potential Business Impact:
Makes 3D body tracking faster and more accurate.
Existing 3D Human Pose Estimation (HPE) methods achieve high accuracy but suffer from computational overhead and slow inference, while knowledge distillation methods fail to address spatial relationships between joints and temporal correlations in multi-frame inputs. In this paper, we propose Sparse Correlation and Joint Distillation (SCJD), a novel framework that balances efficiency and accuracy for 3D HPE. SCJD introduces Sparse Correlation Input Sequence Downsampling to reduce redundancy in student network inputs while preserving inter-frame correlations. For effective knowledge transfer, we propose Dynamic Joint Spatial Attention Distillation, which includes Dynamic Joint Embedding Distillation to enhance the student's feature representation using the teacher's multi-frame context feature, and Adjacent Joint Attention Distillation to improve the student network's focus on adjacent joint relationships for better spatial understanding. Additionally, Temporal Consistency Distillation aligns the temporal correlations between teacher and student networks through upsampling and global supervision. Extensive experiments demonstrate that SCJD achieves state-of-the-art performance. Code is available at https://github.com/wileychan/SCJD.
Similar Papers
JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation
Machine Learning (CS)
Teaches cars to see and understand roads better.
FastDDHPose: Towards Unified, Efficient, and Disentangled 3D Human Pose Estimation
CV and Pattern Recognition
Makes computers understand body poses better and faster.
Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation
CV and Pattern Recognition
Makes robots see objects better with less data.