Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

Bo Gu; Xu Zhang; Ziqi Lin; Mamoun Alazab

doi:10.1109/jiot.2020.3023111

ScienceGate Book Chapters

JOURNAL ARTICLE

Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

Bo Gu Xu Zhang Ziqi Lin Mamoun Alazab

Year: 2020 Journal: IEEE Internet of Things Journal Vol: 8 (5)Pages: 3066-3074 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/jiot.2020.3023111

Get Full-Text PDF Get Analytical Report

Abstract

Ultrareliable and low-latency communication (URLLC) is a prerequisite for the successful implementation of the Internet of Controllable Things. In this article, we investigate the potential of deep reinforcement learning (DRL) for joint subcarrier-power allocation to achieve low latency and high reliability in a general form of device-to-device (D2D) networks, where each subcarrier can be allocated to multiple D2D pairs and each D2D pair is permitted to utilize multiple subcarriers. We first formulate the above problem as a Markov decision process and then propose a double deep $Q$ -network (DQN)-based resource allocation algorithm to learn the optimal policy in the absence of full instantaneous channel state information (CSI). Specifically, each D2D pair acts as a learning agent that adjusts its own subcarrier-power allocation strategy iteratively through interactions with the operating environment in a trial-and-error fashion. Simulation results demonstrate that the proposed algorithm achieves near-optimal performance in real time. It is worth mentioning that the proposed algorithm is especially suitable for cases where the environmental dynamics are not accurate and the CSI delay cannot be ignored.

Keywords:

Computer science Reinforcement learning Markov decision process Resource allocation Subcarrier Latency (audio) Q-learning Mathematical optimization Resource management (computing) Markov process Channel state information Distributed computing Computer network Channel (broadcasting) Artificial intelligence Wireless Orthogonal frequency-division multiplexing Telecommunications

Metrics

Cited By

7.80

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Age of Information Optimization

Physical Sciences → Computer Science → Computer Networks and Communications

Advanced MIMO Systems Optimization

Physical Sciences → Engineering → Electrical and Electronic Engineering

Energy Harvesting in Wireless Networks

Physical Sciences → Engineering → Electrical and Electronic Engineering

Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

Abstract

Metrics

Citation History

Topics

Related Documents

Multiagent Deep-Reinforcement-Learning-Based Virtual Resource Allocation Through Network Function Virtualization in Internet of Things

Multi-Agent Deep Reinforcement Learning Based Resource Allocation for Ultra-Reliable Low-Latency Internet of Controllable Things

Multiagent Federated Reinforcement Learning for Resource Allocation in UAV-Enabled Internet of Medical Things Networks

Deep Reinforcement Learning-Based Resource Allocation for Satellite Internet of Things with Diverse QoS Guarantee

Deep reinforcement learning based computation offloading and resource allocation strategy for maritime internet of things