Multiagent Deep-Reinforcement-Learning-Based Virtual Resource Allocation Through Network Function Virtualization in Internet of Things

Hurmat Ali Shah; Lian Zhao

doi:10.1109/jiot.2020.3022572

JOURNAL ARTICLE

Year: 2020; Journal: IEEE Internet of Things Journal; Vol: 8 (5); Pages: 3410-3421; Publisher: Institute of Electrical and Electronics Engineers

Abstract

Resource allocation is a significant task in the emerging area of the Internet of Things (IoT). IoT devices are usually low-cost devices with limited computational power and limited capability for long-term communication. In this article, the network function virtualization (NFV) technique is used to access the resources of the network, and a reinforcement learning (RL) algorithm is used to solve the problem of resource allocation in IoT networks. The traffic of the IoT network uses the substrate network, which is available through NFV, for its data transmission. The data transmission needs of the IoT network are translated into virtual requests, and service function chains (SFCs) are mapped to the substrate network to serve the requests. The problem of SFC placement while meeting the system constraints of the IoT network is a nonconvex problem. In the proposed deep RL (DRL)-based resource allocation, the virtual layer acts as a common repository of the network resources. The optimization problem of SFC placement under the system constraints of IoT networks can be formulated as a Markovian decision process (MDP). The MDP problem is solved through a multiagent DRL algorithm in which each agent serves an SFC. Two Q-networks are considered: one Q-network solves the SFC placement problem, while the other updates the weights of the Q-network by keeping track of long-term policy changes. The virtual agents serving SFCs interact with the environment, receive rewards collectively, and update the policy by using the learned experiences. We show that the proposed scheme can solve the optimization problem of SFC placement through adequate design of the reward, state space, and action space. Simulation results demonstrate that the multiagent DRL scheme outperforms the reference schemes in terms of utility gained, as measured through different network parameters.
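The two-Q-network, shared-reward idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it uses small lookup tables as stand-ins for Q-networks, and it assumes that the second Q-network plays the role of a periodically synchronized target network, which the abstract does not state explicitly. The substrate model (two nodes with fixed capacity), the reward (number of virtual network functions placed per round), and every name and parameter below are hypothetical choices made for illustration.

```python
import random
from collections import defaultdict

# Hypothetical toy setup: 2 agents, each placing a 2-VNF chain on 2 nodes.
CFG = dict(num_agents=2, num_nodes=2, capacity=2, chain_len=2,
           alpha=0.1, gamma=0.9, epsilon=0.2, sync_every=25)


def run_episode(online, target, rng, cfg, learn=True):
    """Play one episode and return the total number of VNFs placed."""
    caps = [cfg["capacity"]] * cfg["num_nodes"]
    total = 0
    for step in range(cfg["chain_len"]):
        state = (step, tuple(caps))
        # Each agent picks a node, epsilon-greedy on its own online table.
        actions = []
        for q in online:
            if learn and rng.random() < cfg["epsilon"]:
                actions.append(rng.randrange(cfg["num_nodes"]))
            else:
                values = q[state]
                actions.append(max(range(cfg["num_nodes"]),
                                   key=lambda a: values[a]))
        # Apply placements in agent order; a full node rejects the VNF.
        reward = 0
        for a in actions:
            if caps[a] > 0:
                caps[a] -= 1
                reward += 1
        total += reward
        if learn:
            done = step == cfg["chain_len"] - 1
            next_state = (step + 1, tuple(caps))
            # Every agent receives the same collective reward and
            # bootstraps from its slowly changing target table.
            for q, q_t, a in zip(online, target, actions):
                boot = 0.0 if done else cfg["gamma"] * max(q_t[next_state])
                q[state][a] += cfg["alpha"] * (reward + boot - q[state][a])
    return total


def train(cfg, episodes=3000, seed=0):
    """Train independent learners that share one reward signal."""
    rng = random.Random(seed)

    def make_table():
        return defaultdict(lambda: [0.0] * cfg["num_nodes"])

    online = [make_table() for _ in range(cfg["num_agents"])]
    target = [make_table() for _ in range(cfg["num_agents"])]
    for ep in range(episodes):
        run_episode(online, target, rng, cfg)
        if (ep + 1) % cfg["sync_every"] == 0:
            # Periodic sync: copy the online values into the target table.
            for q, q_t in zip(online, target):
                q_t.clear()
                q_t.update({s: list(v) for s, v in q.items()})
    return online, target


online, target = train(CFG)
score = run_episode(online, target, random.Random(1), CFG, learn=False)
print("VNFs placed by greedy policy:", score,
      "of", CFG["num_agents"] * CFG["chain_len"])
```

Keeping a separate target table means the bootstrap term changes only at each synchronization, which is the usual motivation for a second network in DQN-style training. A realistic version would replace the tables with neural networks and the toy capacity model with the paper's actual constraints, neither of which is specified on this page.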

Keywords:

Computer science; Reinforcement learning; Resource allocation; Virtual network; Network virtualization; Distributed computing; Virtualization; Computer network; Optimization problem; Resource management (computing); Artificial intelligence; Cloud computing; Algorithm

Metrics

FWCI (Field Weighted Citation Impact): 6.61

Citation Normalized Percentile: 0.97

Topics

Software-Defined Networks and 5G

Physical Sciences → Computer Science → Computer Networks and Communications

IoT and Edge/Fog Computing

Physical Sciences → Computer Science → Computer Networks and Communications

Advanced Memory and Neural Computing

Physical Sciences → Engineering → Electrical and Electronic Engineering

Related Documents

Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things

AI‐Based Virtual Network Function Embedding for Internet of Things Using Parallel Deep Reinforcement Learning

Decentralized Resource Allocation-Based Multiagent Deep Learning in Vehicular Network

Multiagent Federated Deep-Reinforcement-Learning-Enabled Resource Allocation for an Air–Ground-Integrated Internet of Vehicles Network

Multiagent Federated Reinforcement Learning for Resource Allocation in UAV-Enabled Internet of Medical Things Networks