HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation
By: Bing Han, Yuhua Huang, Pan Gao
Potential Business Impact:
Lets computers see people's bodies in 3D.
Monocular 3D human pose estimation (HPE) often encounters challenges such as depth ambiguity and occlusion during the 2D-to-3D lifting process. Additionally, traditional methods may overlook multi-scale skeleton features when utilizing skeleton structure information, which can negatively impact the accuracy of pose estimation. To address these challenges, this paper introduces a novel 3D pose estimation method, HyperDiff, which integrates diffusion models with HyperGCN. The diffusion model effectively captures data uncertainty, alleviating depth ambiguity and occlusion. Meanwhile, HyperGCN, serving as a denoiser, employs multi-granularity structures to accurately model high-order correlations between joints. This improves the model's denoising capability especially for complex poses. Experimental results demonstrate that HyperDiff achieves state-of-the-art performance on the Human3.6M and MPI-INF-3DHP datasets and can flexibly adapt to varying computational resources to balance performance and efficiency.
Similar Papers
HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation
CV and Pattern Recognition
Makes computers understand people's body movements better.
Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning
CV and Pattern Recognition
Makes 3D human pose guessing much faster.
DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation
CV and Pattern Recognition
Lets computers guess people's movements in 3D.