Q-Learning-Based Time-Critical Data Aggregation Scheduling in IoT
By: Van-Vi Vo , Tien-Dung Nguyen , Duc-Tai Le and more
Potential Business Impact:
Makes smart devices send information faster.
Time-critical data aggregation in Internet of Things (IoT) networks demands efficient, collision-free scheduling to minimize latency for applications like smart cities and industrial automation. Traditional heuristic methods, with two-phase tree construction and scheduling, often suffer from high computational overhead and suboptimal delays due to their static nature. To address this, we propose a novel Q-learning framework that unifies aggregation tree construction and scheduling, modeling the process as a Markov Decision Process (MDP) with hashed states for scalability. By leveraging a reward function that promotes large, interference-free batch transmissions, our approach dynamically learns optimal scheduling policies. Simulations on static networks with up to 300 nodes demonstrate up to 10.87% lower latency compared to a state-of-the-art heuristic algorithm, highlighting its robustness for delay-sensitive IoT applications. This framework enables timely insights in IoT environments, paving the way for scalable, low-latency data aggregation.
Similar Papers
Data Scheduling Algorithm for Scalable and Efficient IoT Sensing in Cloud Computing
Distributed, Parallel, and Cluster Computing
Makes smart devices send data faster, cheaper.
State-Aware IoT Scheduling Using Deep Q-Networks and Edge-Based Coordination
Networking and Internet Architecture
Saves power for smart gadgets by sharing tasks.
Dynamic and Distributed Routing in IoT Networks based on Multi-Objective Q-Learning
Distributed, Parallel, and Cluster Computing
Helps smart devices change priorities on the fly.