To Offload or Not To Offload: Model-driven Comparison of Edge-native and On-device Processing
By: Nathan Ng, David Irwin, Ananthram Swami, and more
Potential Business Impact:
Helps decide when a device should process a task locally and when it should offload the task to an edge server.
Computational offloading is a promising approach for overcoming resource constraints on client devices by moving some or all of an application's computations to remote servers. With the advent of specialized hardware accelerators, client devices are now able to perform fast local processing of specific tasks, such as machine learning inference, reducing the need for offloading computations. However, edge servers with accelerators also offer faster processing for offloaded tasks than was previously possible. In this paper, we present an analytic and experimental comparison of on-device processing and edge offloading for a range of accelerator, network, and application workload scenarios, with the goal of understanding when to use local on-device processing and when to offload computations. We present models that leverage analytical queuing results to capture the effects of dynamic factors such as the performance gap between the device and edge server, network variability, server load, and multi-tenancy on the edge server. We experimentally demonstrate the accuracy of our models for a range of hardware and application scenarios and show that our models achieve a mean absolute percentage error of 2.2% compared to observed latencies. We use our models to develop an adaptive resource manager for intelligent offloading and show its efficacy in the presence of variable network conditions and dynamic multi-tenant edge settings.
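To make the trade-off concrete, the sketch below compares expected end-to-end latency for on-device processing against edge offloading using a simple M/M/1 queueing approximation. The M/M/1 form, the parameter names, and the numbers are illustrative assumptions, not the paper's exact model, which builds on more detailed analytical queuing results.

# Illustrative sketch: on-device vs. edge-offload latency under an
# M/M/1 queueing approximation. The model form, parameter names, and
# example numbers are assumptions for illustration only.

def mm1_latency(service_rate: float, arrival_rate: float) -> float:
    """Expected sojourn time W = 1 / (mu - lambda) for a stable M/M/1 queue."""
    if arrival_rate >= service_rate:
        return float("inf")  # queue is unstable; latency grows without bound
    return 1.0 / (service_rate - arrival_rate)

def offload_latency(network_rtt: float, edge_service_rate: float,
                    edge_arrival_rate: float) -> float:
    """Offloading pays the network round trip plus queueing at the edge.

    edge_arrival_rate aggregates load from all tenants sharing the server,
    so multi-tenancy shows up as a higher lambda and longer queueing delay.
    """
    return network_rtt + mm1_latency(edge_service_rate, edge_arrival_rate)

def should_offload(local_service_rate: float, local_arrival_rate: float,
                   network_rtt: float, edge_service_rate: float,
                   edge_arrival_rate: float) -> bool:
    """Offload only when the modeled edge latency beats local processing."""
    local = mm1_latency(local_service_rate, local_arrival_rate)
    remote = offload_latency(network_rtt, edge_service_rate, edge_arrival_rate)
    return remote < local

if __name__ == "__main__":
    # Example: the device serves 20 req/s; the edge accelerator serves
    # 200 req/s but carries 150 req/s of multi-tenant load; RTT is 15 ms.
    # Local: 1/(20-10) = 100 ms. Edge: 15 + 1/(200-150)*1000 = 35 ms.
    print(should_offload(local_service_rate=20.0, local_arrival_rate=10.0,
                         network_rtt=0.015,
                         edge_service_rate=200.0, edge_arrival_rate=150.0))

In this toy setting the shared edge server still wins because its accelerator's service rate dwarfs the device's; shrinking that performance gap, raising the network RTT, or adding tenant load flips the decision, which mirrors the dynamic factors the paper's models capture and its adaptive resource manager responds to.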
Similar Papers
Rethinking Inference Placement for Deep Learning across Edge and Cloud Platforms: A Multi-Objective Optimization Perspective and Future Directions
Distributed, Parallel, and Cluster Computing
Studies where to place deep learning inference across edge and cloud platforms, framed as a multi-objective optimization problem.
Onboard Optimization and Learning: A Survey
Machine Learning (CS)
Surveys techniques that let resource-constrained devices perform optimization and learning onboard.
Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks
Machine Learning (CS)
Uses asynchronous deep reinforcement learning to coordinate cooperative task offloading in mobile edge computing.