JOURNAL ARTICLE

Joint Device Participation, Dataset Management, and Resource Allocation in Wireless Federated Learning via Deep Reinforcement Learning

Jinlian ChenJun ZhangNan ZhaoYiyang PeiYing‐Chang LiangDusit Niyato

Year: 2023 Journal:   IEEE Transactions on Vehicular Technology Vol: 73 (3)Pages: 4505-4510   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Federated Learning (FL) enables large-scale machine learning without uploading the private data of wireless devices. Due to the heterogeneity and limitation of the devices' resources, the FL accuracy and latency substantially depend on the device participation and training dataset size. In this letter, to strike a trade-off between the FL accuracy and FL latency, a joint device participation, dataset management and resource allocation (DPDMRA) optimization problem is investigated. To solve the non-convex optimization problem, a Markov decision process is formulated for the resource-limited wireless FL. Moreover, due to the high dimensional continuous action space, a multi-agent softmax deep double deterministic policy gradients (MASD3) method is employed to obtain the optimal DPDMRA strategies. The double actor networks and softmax operator are designed to alleviate the underestimation bias. Simulation results demonstrate that the proposed DRL method can obtain the global optimal policy without complete information in the dynamic environment. Compared with the other baseline schemes, the proposed MASD3 approach can achieve the larger system utility with the better convergence performance.

Keywords:
Reinforcement learning Computer science Softmax function Markov decision process Resource allocation Wireless Latency (audio) Resource management (computing) Artificial intelligence Optimization problem Distributed computing Markov process Machine learning Mathematical optimization Deep learning Computer network Algorithm Telecommunications

Metrics

6
Cited By
1.53
FWCI (Field Weighted Citation Impact)
17
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Privacy-Preserving Technologies in Data
Physical Sciences →  Computer Science →  Artificial Intelligence
Age of Information Optimization
Physical Sciences →  Computer Science →  Computer Networks and Communications
Vehicular Ad Hoc Networks (VANETs)
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.