JOURNAL ARTICLE

Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

Bo GuXu ZhangZiqi LinMamoun Alazab

Year: 2020 Journal:   IEEE Internet of Things Journal Vol: 8 (5)Pages: 3066-3074   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Ultrareliable and low-latency communication (URLLC) is a prerequisite for the successful implementation of the Internet of Controllable Things. In this article, we investigate the potential of deep reinforcement learning (DRL) for joint subcarrier-power allocation to achieve low latency and high reliability in a general form of device-to-device (D2D) networks, where each subcarrier can be allocated to multiple D2D pairs and each D2D pair is permitted to utilize multiple subcarriers. We first formulate the above problem as a Markov decision process and then propose a double deep $Q$ -network (DQN)-based resource allocation algorithm to learn the optimal policy in the absence of full instantaneous channel state information (CSI). Specifically, each D2D pair acts as a learning agent that adjusts its own subcarrier-power allocation strategy iteratively through interactions with the operating environment in a trial-and-error fashion. Simulation results demonstrate that the proposed algorithm achieves near-optimal performance in real time. It is worth mentioning that the proposed algorithm is especially suitable for cases where the environmental dynamics are not accurate and the CSI delay cannot be ignored.

Keywords:
Computer science Reinforcement learning Markov decision process Resource allocation Subcarrier Latency (audio) Q-learning Mathematical optimization Resource management (computing) Markov process Channel state information Distributed computing Computer network Channel (broadcasting) Artificial intelligence Wireless Orthogonal frequency-division multiplexing Telecommunications

Metrics

73
Cited By
7.80
FWCI (Field Weighted Citation Impact)
36
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Age of Information Optimization
Physical Sciences →  Computer Science →  Computer Networks and Communications
Advanced MIMO Systems Optimization
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Energy Harvesting in Wireless Networks
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.