Task Offloading and Resource Allocation Using Deep Reinforcement Learning

Kaiyi Zhang

doi:10.20381/ruor-25749

ScienceGate Book Chapters

DISSERTATION

Task Offloading and Resource Allocation Using Deep Reinforcement Learning

Kaiyi Zhang

Year: 2020 University: uO Research (University of Ottawa) Publisher: University of Ottawa

DOI: 10.20381/ruor-25749

Get Full-Text PDF Get Analytical Report

Abstract

Rapid urbanization poses huge challenges to people's daily lives, such as traffic congestion, environmental pollution, and public safety. Mobile Internet of things (MIoT) applications serving smart cities bring the promise of innovative and enhanced public services such as air pollution monitoring, enhanced road safety and city resources metering and management. These applications rely on a number of energy constrained MIoT units (MUs) (e.g., robots and drones) to continuously sense, capture and process data and images from their environments to produce immediate adaptive actions (e.g., triggering alarms, controlling machinery and communicating with citizens). In this thesis, we consider a scenario where a battery constrained MU executes a number of time-sensitive data processing tasks whose arrival times and sizes are stochastic in nature. These tasks can be executed locally on the device, offloaded to one of the nearby edge servers or to a cloud data center within a mobile edge computing (MEC) infrastructure. We first formulate the problem of making optimal offloading decisions that minimize the cost of current and future tasks as a constrained Markov decision process (CMDP) that accounts for the constraints of the MU battery and the limited reserved resources on the MEC infrastructure by the application providers. Then, we relax the CMDP problem into regular Markov decision process (MDP) using Lagrangian primal-dual optimization. We then develop advantage actor-critic (A2C) algorithm, one of the model-free deep reinforcement learning (DRL) method to train the MU to solve the relaxed problem. The training of the MU can be carried-out once to learn optimal offloading policies that are repeatedly employed as long as there are no large changes in the MU environment. Simulation results are presented to show that the proposed algorithm can achieve performance improvement over offloading decisions schemes that aim at optimizing instantaneous costs.

Keywords:

Reinforcement learning Resource allocation Task (project management) Computer science Reinforcement Artificial intelligence Human–computer interaction Psychology Engineering Computer network Social psychology Systems engineering

Metrics

Cited By

0.51

FWCI (Field Weighted Citation Impact)

Refs

0.68

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Age of Information Optimization

Physical Sciences → Computer Science → Computer Networks and Communications

Task Offloading and Resource Allocation Using Deep Reinforcement Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Deep Reinforcement Learning-based Predictive Maintenance Task Offloading and Resource Allocation

Deep-reinforcement-learning–guided resource allocation and task offloading for 6G edge intelligence

Deep Reinforcement Learning Based Task Offloading and Resource Allocation in Small Cell MEC

Deep Reinforcement Learning-Based Task Offloading and Resource Allocation for Mobile Edge Computing

Intelligent Task Offloading: Reducing Delay and Offloading Failure Using Predictive Resource Allocation and Deep Reinforcement Learning (ITO-PDR)