Joint Device Participation, Dataset Management, and Resource Allocation in Wireless Federated Learning via Deep Reinforcement Learning

Jinlian Chen; Jun Zhang; Nan Zhao; Yiyang Pei; Ying‐Chang Liang; Dusit Niyato

doi:10.1109/tvt.2023.3325843


JOURNAL ARTICLE


Year: 2023; Journal: IEEE Transactions on Vehicular Technology; Vol: 73 (3); Pages: 4505-4510; Publisher: Institute of Electrical and Electronics Engineers


Abstract

Federated Learning (FL) enables large-scale machine learning without uploading the private data of wireless devices. Because device resources are heterogeneous and limited, FL accuracy and latency depend substantially on device participation and training dataset size. In this letter, to strike a trade-off between FL accuracy and FL latency, a joint device participation, dataset management, and resource allocation (DPDMRA) optimization problem is investigated. To solve this non-convex optimization problem, a Markov decision process is formulated for resource-limited wireless FL. Moreover, because the action space is high-dimensional and continuous, a multi-agent softmax deep double deterministic policy gradients (MASD3) method is employed to obtain the optimal DPDMRA strategies. The double actor networks and the softmax operator are designed to alleviate the underestimation bias. Simulation results demonstrate that the proposed DRL method can obtain the globally optimal policy without complete information in a dynamic environment. Compared with the baseline schemes, the proposed MASD3 approach achieves larger system utility with better convergence performance.
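The abstract does not give the form of the softmax operator, so the following is only a minimal sketch of the Boltzmann softmax operator commonly used in softmax-based value estimation, not the authors' implementation. It assumes a finite set of sampled actions with hypothetical Q-value estimates; the names `softmax_value`, `q_samples`, and `beta` are illustrative. The sketch shows that the softmax-weighted value lies between the plain average and the maximum of the estimates, with `beta` controlling where it falls.

```python
import math

def softmax_value(q_values, beta):
    """Boltzmann softmax-weighted average of a list of Q-value estimates."""
    q_max = max(q_values)
    # Subtract the maximum before exponentiating for numerical stability.
    weights = [math.exp(beta * (q - q_max)) for q in q_values]
    total = sum(weights)
    return sum(w * q for w, q in zip(weights, q_values)) / total

# Hypothetical Q-value estimates for five sampled actions in one state.
q_samples = [1.0, 2.0, 0.5, 3.0, 2.5]

mean_q = sum(q_samples) / len(q_samples)    # beta = 0 recovers the plain average
max_q = max(q_samples)                      # large beta approaches the maximum
soft_q = softmax_value(q_samples, beta=1.0)  # lies between the two
```

A small `beta` pulls the estimate toward the average and a large `beta` pushes it toward the maximum, which is the knob such methods tune to trade off underestimation against overestimation.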

Keywords:

Reinforcement learning; Computer science; Softmax function; Markov decision process; Resource allocation; Wireless; Latency (audio); Resource management (computing); Artificial intelligence; Optimization problem; Distributed computing; Markov process; Machine learning; Mathematical optimization; Deep learning; Computer network; Algorithm; Telecommunications

Metrics

FWCI (Field Weighted Citation Impact): 1.53

Citation Normalized Percentile: 0.82

Topics

Privacy-Preserving Technologies in Data

Physical Sciences → Computer Science → Artificial Intelligence

Age of Information Optimization

Physical Sciences → Computer Science → Computer Networks and Communications

Vehicular Ad Hoc Networks (VANETs)

Physical Sciences → Engineering → Electrical and Electronic Engineering


Related Documents

Joint Client Selection and Bandwidth Allocation of Wireless Federated Learning by Deep Reinforcement Learning

Joint Device Scheduling and Resource Allocation for Latency Constrained Wireless Federated Learning

Deep Reinforcement Learning for Resource Allocation in Blockchain-Based Federated Learning

Joint UAV Deployment and Resource Allocation: A Personalized Federated Deep Reinforcement Learning Approach

Adaptive User Scheduling and Resource Allocation in Wireless Federated Learning Networks: A Deep Reinforcement Learning Approach