JOURNAL ARTICLE

DNN Deployment, Task Offloading, and Resource Allocation for Joint Task Inference in IIoT

Wenhao FanZeyu ChenZhibo HaoYi SuFan WuBihua TangYuanan Liu

Year: 2022 Journal:   IEEE Transactions on Industrial Informatics Vol: 19 (2)Pages: 1634-1646   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Joint task inference, which fully utilizes end edge cloud cooperation, can effectively enhance the performance of deep neural network (DNN) inference services in the industrial internet of things (IIoT) applications. In this paper, we propose a novel joint resource management scheme for a multi task and multi service scenario consisting of multiple sensors, a cloud server, and a base station equipped with an edge server . A time slotted system model is proposed, incorporating DNN deployment, data size control, task offloading, computing resource allocation, and wireless channel allocation. Among them, the DNN deployment is to deploy proper DNNs on the edge server under its total resource constraint, and the data size control is to make trade off between task inference accuracy and task transmission delay through changing task da ta size. Our goal is to minimize the total cost including total task processing delay and total error inference penalty while guaranteeing long term task queue stability and all task inference accuracy requirements. Leveraging the Lyapunov optimization, we first transform the optimization problem into a deterministic problem for each time slot. Then, a deep deterministic policy gradient (DDPG) based deep reinforcement learning (DRL) algorithm is designed to provide the near optimal solution. We further desi gn a fast numerical method for the data size control sub problem to reduce the training complexity of the DRL model, and design a penalty mechanism to prevent frequent optimizations of DNN deployment. Extensive experiments are conducted by varying differen t crucial parameters. The superiority of our scheme is demonstrated in comparison with 3 other schemes.

Keywords:
Computer science Lyapunov optimization Resource allocation Cloud computing Inference Task (project management) Reinforcement learning Edge device Mobile edge computing Distributed computing Real-time computing Edge computing Enhanced Data Rates for GSM Evolution Software deployment Optimization problem Wireless Artificial intelligence Computer network Algorithm Engineering

Metrics

60
Cited By
12.85
FWCI (Field Weighted Citation Impact)
32
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

IoT and Edge/Fog Computing
Physical Sciences →  Computer Science →  Computer Networks and Communications
Age of Information Optimization
Physical Sciences →  Computer Science →  Computer Networks and Communications
Brain Tumor Detection and Classification
Life Sciences →  Neuroscience →  Neurology

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.