Joint Memory Frequency and Computing Frequency Scaling for Energy-efficient DNN Inference

Published: September 22, 2025 | arXiv ID: 2509.17970v2

By: Yunchu Han, Zhaojun Nan, Sheng Zhou, and more

Potential Business Impact:

Saves phone power by adjusting chip speeds.

Business Areas:
DSP Hardware

Deep neural networks (DNNs) have been widely applied in diverse applications, but on resource-constrained devices they inevitably incur high latency and energy overhead. To address this challenge, most researchers focus on the dynamic voltage and frequency scaling (DVFS) technique, which balances latency and energy consumption by changing the computing frequency of processors. However, the adjustment of memory frequency, which also plays a significant role in inference time and energy consumption, is usually ignored and not fully exploited for efficient DNN inference. In this paper, we first investigate the impact of jointly scaling memory frequency and computing frequency on inference time and energy consumption with a model-based, data-driven method. Then, using the fitted parameters of different DNN models, we give a preliminary analysis of the proposed model to examine the effects of adjusting memory frequency and computing frequency simultaneously. Finally, simulation results in local-inference and cooperative-inference cases further validate the effectiveness of jointly scaling the memory frequency and computing frequency to reduce the energy consumption of devices.
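To make the idea concrete, the joint scaling problem can be sketched as picking the (computing frequency, memory frequency) pair that minimizes energy subject to a latency deadline. The sketch below is illustrative only: the latency model (a compute-bound term scaling with 1/f_c plus a memory-bound term scaling with 1/f_m), the power model, and all coefficients (`a`, `b`, `k_c`, `k_m`, `p_static`) are hypothetical placeholders, not the paper's fitted parameters.

```python
# Hypothetical sketch of joint memory/computing frequency scaling.
# All model forms and coefficients are illustrative assumptions,
# not the fitted values from the paper.

def inference_time(f_c, f_m, a=2.0, b=1.0):
    """Latency model: a compute-bound term scaling with 1/f_c plus a
    memory-bound term scaling with 1/f_m (frequencies in GHz, time in ms)."""
    return a / f_c + b / f_m

def energy(f_c, f_m, k_c=0.5, k_m=0.2, p_static=0.1):
    """Energy model: dynamic power grows with frequency (assumed ~f_c^2 for
    compute, ~f_m for memory, plus static power), multiplied by latency."""
    power = k_c * f_c ** 2 + k_m * f_m + p_static
    return power * inference_time(f_c, f_m)

def best_setting(freqs_c, freqs_m, deadline_ms):
    """Exhaustive search over candidate frequency pairs: return the
    (energy, f_c, f_m) tuple with minimum energy that meets the deadline,
    or None if no pair is feasible."""
    feasible = [(energy(fc, fm), fc, fm)
                for fc in freqs_c for fm in freqs_m
                if inference_time(fc, fm) <= deadline_ms]
    return min(feasible) if feasible else None

if __name__ == "__main__":
    freqs_c = [0.5, 1.0, 1.5, 2.0]   # candidate computing frequencies (GHz)
    freqs_m = [0.8, 1.6, 3.2]        # candidate memory frequencies (GHz)
    result = best_setting(freqs_c, freqs_m, deadline_ms=4.0)
    if result:
        e, fc, fm = result
        print(f"f_c={fc} GHz, f_m={fm} GHz, energy={e:.3f}")
```

Under these toy coefficients, the search illustrates the paper's central point: the energy-optimal operating point is not at the highest memory frequency, so tuning both knobs jointly can beat computing-frequency DVFS alone.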

Page Count
6 pages

Category
Computer Science:
Machine Learning (CS)