Heavy-traffic Optimality of Skip-the-Longest-Queues in Heterogeneous Service Systems
By: Yishun Luo, Martin Zubeldia
Potential Business Impact:
Smartly sends jobs to avoid long waits.
We consider a discrete-time parallel service system consisting of $n$ heterogeneous single server queues with infinite capacity. Jobs arrive to the system as an i.i.d. process with rate proportional to $n$, and must be immediately dispatched in the time slot that they arrive. The dispatcher is assumed to be able to exchange messages with the servers to obtain their queue lengths and make dispatching decisions, introducing an undesirable communication overhead. In this setting, we propose a ultra-low communication overhead load balancing policy dubbed $k$-Skip-the-$d$-Longest-Queues ($k$-SLQ-$d$), where queue lengths are only observed every $k(n-d)$ time slots and, between observations, incoming jobs are sent to a queue that is not one of the $d$ longest ones at the time that the queues were last observed. For this policy, we establish conditions on $d$ for it to be throughput optimal and we show that, under that condition, it is asymptotically delay-optimal in heavy-traffic for arbitrarily low communication overheads (i.e., for arbitrarily large $k$).
Similar Papers
Stability and Heavy-traffic Delay Optimality of General Load Balancing Policies in Heterogeneous Service Systems
Performance
Makes jobs go to the right computer faster.
Geometric lower bounds for the steady-state occupancy of processing networks with limited connectivity
Probability
Makes computer networks handle more tasks faster.
Online Distributed Queue Length Estimation
Data Structures and Algorithms
Lets computers guess how many things are waiting.