Joint Optimization of Offloading, Batching and DVFS for Multiuser Co-Inference
By: Yaodan Xu, Sheng Zhou, Zhisheng Niu
Potential Business Impact:
Saves phone battery by sharing tasks with a server.
With the growing integration of artificial intelligence into mobile applications, mobile devices generate a substantial number of deep neural network (DNN) inference requests every day. Serving these requests is challenging because of limited on-device resources and strict latency requirements, and edge-device co-inference has therefore emerged as an effective paradigm for addressing these issues. In this study, we focus on a scenario in which multiple mobile devices offload inference tasks to an edge server equipped with a graphics processing unit (GPU). For finer-grained control over offloading and scheduling, inference tasks are partitioned into smaller sub-tasks. In addition, GPU batch processing is employed to boost throughput and improve energy efficiency. This work investigates the problem of minimizing total energy consumption while meeting hard latency constraints. We propose a low-complexity Joint Dynamic Voltage and Frequency Scaling (DVFS), Offloading, and Batching strategy (J-DOB) to solve this problem. The effectiveness of the proposed algorithm is validated through extensive experiments across varying numbers of users and deadline constraints. Results show that J-DOB can reduce energy consumption by up to 51.30% and 45.27% under identical and different deadlines, respectively, compared with local computing.
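To make the joint decision space more concrete, the LaTeX sketch below gives one possible simplified formulation of the problem described in the abstract. The notation (offloading indicators x_{i,k}, local frequencies f_i, batch schedule b, deadlines D_i, and the energy/delay terms) is an illustrative assumption, not the paper's exact model or symbols.

% Notional formulation (assumed notation; the paper's exact model may differ).
% x_{i,k}: binary offloading decision for sub-task k of device i
% f_i:     local CPU frequency selected via DVFS for device i
% b:       batch-size schedule at the edge GPU
% D_i:     hard latency deadline of device i
\begin{aligned}
\min_{\{x_{i,k}\},\,\{f_i\},\,b}\quad
  & \sum_{i=1}^{N}\Big( E_i^{\mathrm{cmp}}\big(f_i,\{x_{i,k}\}\big) + E_i^{\mathrm{tx}}\big(\{x_{i,k}\}\big) \Big) \\
\text{s.t.}\quad
  & T_i^{\mathrm{cmp}}(f_i) + T_i^{\mathrm{tx}} + T_i^{\mathrm{edge}}(b) \le D_i, \qquad \forall i, \\
  & x_{i,k}\in\{0,1\}, \qquad f_{\min}\le f_i\le f_{\max}, \qquad \forall i,k.
\end{aligned}

The intuition behind such a formulation: under a standard CMOS power model, local computation energy grows roughly quadratically with clock frequency while computation delay shrinks with it, which is the trade-off DVFS exploits; offloading instead pays transmission energy and waits for edge batching, which raises GPU utilization and energy efficiency at the cost of queueing delay. A joint strategy such as J-DOB balances these terms subject to each device's deadline.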
Similar Papers
Batching-Aware Joint Model Onloading and Offloading for Hierarchical Multi-Task Inference
Machine Learning (CS)
Lets phones do many smart jobs at once.
Joint Memory Frequency and Computing Frequency Scaling for Energy-efficient DNN Inference
Machine Learning (CS)
Saves phone power by adjusting chip speeds.
E4: Energy-Efficient DNN Inference for Edge Video Analytics Via Early-Exit and DVFS
CV and Pattern Recognition
Saves phone battery by making smart video analysis faster.