Score: 0

Deep Reinforcement Learning for Joint Time and Power Management in SWIPT-EH CIoT

Published: December 17, 2025 | arXiv ID: 2512.15062v1

By: Nadia Abdolkhani , Nada Abdel Khalek , Walaa Hamouda and more

This letter presents a novel deep reinforcement learning (DRL) approach for joint time allocation and power control in a cognitive Internet of Things (CIoT) system with simultaneous wireless information and power transfer (SWIPT). The CIoT transmitter autonomously manages energy harvesting (EH) and transmissions using a learnable time switching factor while optimizing power to enhance throughput and lifetime. The joint optimization is modeled as a Markov decision process under small-scale fading, realistic EH, and interference constraints. We develop a double deep Q-network (DDQN) enhanced with an upper confidence bound. Simulations benchmark our approach, showing superior performance over existing DRL methods.

Category
Electrical Engineering and Systems Science:
Signal Processing