Score: 1

Cooperative Inference for Real-Time 3D Human Pose Estimation in Multi-Device Edge Networks

Published: April 3, 2025 | arXiv ID: 2504.03052v1

By: Hyun-Ho Choi , Kangsoo Kim , Ki-Ho Lee and more

Potential Business Impact:

Helps phones guess body poses better, faster.

Business Areas:

Indoor Positioning Navigation and Mapping

Accurate and real-time three-dimensional (3D) pose estimation is challenging in resource-constrained and dynamic environments owing to its high computational complexity. To address this issue, this study proposes a novel cooperative inference method for real-time 3D human pose estimation in mobile edge computing (MEC) networks. In the proposed method, multiple end devices equipped with lightweight inference models employ dual confidence thresholds to filter ambiguous images. Only the filtered images are offloaded to an edge server with a more powerful inference model for re-evaluation, thereby improving the estimation accuracy under computational and communication constraints. We numerically analyze the performance of the proposed inference method in terms of the inference accuracy and end-to-end delay and formulate a joint optimization problem to derive the optimal confidence thresholds and transmission time for each device, with the objective of minimizing the mean per-joint position error (MPJPE) while satisfying the required end-to-end delay constraint. To solve this problem, we demonstrate that minimizing the MPJPE is equivalent to maximizing the sum of the inference accuracies for all devices, decompose the problem into manageable subproblems, and present a low-complexity optimization algorithm to obtain a near-optimal solution. The experimental results show that a trade-off exists between the MPJPE and end-to-end delay depending on the confidence thresholds. Furthermore, the results confirm that the proposed cooperative inference method achieves a significant reduction in the MPJPE through the optimal selection of confidence thresholds and transmission times, while consistently satisfying the end-to-end delay requirement in various MEC environments.

An End-to-End Framework for Video Multi-Person Pose Estimation

CV and Pattern Recognition

Tracks people's movements in videos better.

1 Sep 2025 0

87%

Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks

Machine Learning (CS)

Makes phones faster and use less power.

24 Apr 2025 0

87%

DETRPose: Real-time end-to-end transformer model for multi-person pose estimation

CV and Pattern Recognition

Lets computers see and track people in real time.

16 Jun 2025 1

View PDF Login to Bookmark

Country of Origin

🇰🇷 🇨🇦 Canada, Korea, Republic of

Page Count

13 pages

Cooperative Inference for Real-Time 3D Human Pose Estimation in Multi-Device Edge Networks

Helps phones guess body poses better, faster.

Technical Abstract

An End-to-End Framework for Video Multi-Person Pose Estimation

Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks

DETRPose: Real-time end-to-end transformer model for multi-person pose estimation