JOURNAL ARTICLE

Deep Reinforcement Learning With Dueling DQN for Partial Computation Offloading and Resource Allocation in Mobile Edge Computing

Ehzaz Mustafa, Junaid Shuja, Faisal Rehman, Abdallah Namoun, Mazhar Ali, Abdullah Alourani

Year: 2025 | Journal: IEEE Access | Vol: 13 | Pages: 94319-94335 | Publisher: Institute of Electrical and Electronics Engineers

Abstract

Computation offloading transfers resource-intensive tasks from local Internet of Things (IoT) devices to powerful edge servers, which minimizes latency and reduces the computational load on IoT devices. Deep Reinforcement Learning (DRL) is widely utilized to optimize computation offloading decisions. However, previous studies fall short in two main ways: first, they do not collectively optimize the comprehensive state space, and second, their reliance on Q-learning and Deep Q Networks (DQN) makes it challenging for agents to discern the optimal action in large action spaces, as many actions may possess similar values. In this paper, we introduce a multi-branch Dueling Deep Q Network (MBDDQN) that tackles the challenges of high-dimensional state-action spaces and long-term cost optimization in dynamic environments. The Dueling DQN alleviates the complexity of simultaneous offloading and resource allocation decisions, with each branch independently controlling a subset of the decision variables to scale efficiently with an increasing number of IoT devices, thereby avoiding the combinatorial explosion of potential actions. Furthermore, we implement a long short-term memory (LSTM) network with distinct advantage-value layers to enhance both short-term action selection and long-term system cost estimation, as well as improve the temporal learning capacity of the model. Finally, we propose an innovative adaptive cost-weighting mechanism within the reward function to dynamically balance competing objectives, including energy consumption, latency, and bandwidth utilization. Unlike prior works that use fixed reward structures, we leverage weighted state-action advantage values to dynamically adjust the optimization variables. This approach also enables the agent to self-tune, allowing it to prioritize delay minimization in delay-sensitive scenarios and energy conservation in resource-constrained environments.
Simulation results demonstrate the superiority of the proposed scheme compared to benchmarks. For instance, MBDDQN reduces delay by 17.88% over DQN and 12.28% over DDPG. Additionally, regarding energy consumption, MBDDQN achieves a 10.1% improvement over DQN and a 7.64% enhancement over DDPG.
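The dueling architecture the abstract builds on separates a scalar state value V(s) from per-action advantages A(s, a), which helps the agent rank actions even when their Q-values are close. The following minimal NumPy sketch shows only the standard dueling aggregation, Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a'); it is a generic textbook formulation, not the authors' MBDDQN (the multi-branch layout, LSTM layers, and adaptive cost weighting are omitted).

```python
import numpy as np

def dueling_q_values(value, advantages):
    """Combine a scalar state value V(s) with per-action advantages A(s, a).

    Uses the standard dueling aggregation:
        Q(s, a) = V(s) + A(s, a) - mean_a' A(s, a')
    Subtracting the mean advantage keeps the V and A streams identifiable,
    since any constant shift between them would otherwise be unobservable in Q.
    """
    advantages = np.asarray(advantages, dtype=float)
    return value + advantages - advantages.mean()

# Example: three candidate offloading actions with similar raw advantages.
# The aggregation preserves their ranking while anchoring them to V(s).
q = dueling_q_values(value=2.0, advantages=[0.1, 0.3, 0.2])
best_action = int(np.argmax(q))  # the action with the highest advantage wins
```

Because the mean-subtraction only re-centers the advantage stream, the greedy action is always the one with the largest raw advantage, which is what makes the dueling form robust when many actions have near-identical values.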

Keywords:
Computer science, Reinforcement learning, Computation offloading, Mobile edge computing, Edge computing, Resource allocation, Resource management, Artificial intelligence, Distributed computing, Computer networks, Algorithms

Metrics

- Cited by: 9
- FWCI (Field-Weighted Citation Impact): 46.49
- References: 46
- Citation Normalized Percentile: 0.99 (top 1%)


Topics

IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications