JOURNAL ARTICLE

Globally Optimal Multi-agent Reinforcement Learning Parameters in Distributed Task Assignment

Abstract

Large-scale simulation studies are necessary to study the learning behaviour of individual agents and the overall system dynamics. One reason is that planning algorithms to find optimal solutions to fully observable general decentralised Markov decision problems do not admit to polynomial-time worst-case complexity bounds. Additionally, agent interaction often implies a non-stationary environment which does not lend itself to asymptotically greedy policies. Therefore, policies with a constant level of exploration are required to be able to adapt continuously. This paper casts the application domain of distributed task assignment into the formalisms of queueing theory, complex networks and decentralised Markov decision problems to analyse the impact of the momentum of a standard back-propagation neural network function approximator and the discount factor of $SARSA(0)$ reinforcement learning and the $\epsilon$ parameter of the $\epsilon$-greedy policy. For this purpose large queueing networks of one thousand interacting agents are evolved. A Kriging metamodel is fitted and in combination with simulated annealing optimal operating conditions with respect to the total average response time are found. The insights gained from this study are significant in that they provide guidance in deploying large-scale distributed task assignment systems modelled as multi-agent queueing networks.

Keywords:
Reinforcement learning Computer science Markov decision process Queueing theory Mathematical optimization Partially observable Markov decision process Markov chain Rotation formalisms in three dimensions Markov process Distributed computing Artificial intelligence Markov model Machine learning Mathematics

Metrics

1
Cited By
0.38
FWCI (Field Weighted Citation Impact)
49
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Bandit Algorithms Research
Social Sciences →  Decision Sciences →  Management Science and Operations Research

Related Documents

JOURNAL ARTICLE

Task assignment in multi-agent games via reinforcement learning

Shangheng LiHao LiuZiming RenYafan LiDawei Liu

Journal:   Scientia Sinica Technologica Year: 2024 Vol: 55 (5)Pages: 906-913
JOURNAL ARTICLE

Distributed Task Offloading based on Multi-Agent Deep Reinforcement Learning

Shucheng HuTao RenJianwei NiuZheyuan HuGuoliang Xing

Journal:   2021 17th International Conference on Mobility, Sensing and Networking (MSN) Year: 2021 Pages: 575-583
JOURNAL ARTICLE

Cooperative task assignment in spatial crowdsourcing via multi-agent deep reinforcement learning

Pengcheng ZhaoXiang LiShang GaoXiaohui Wei

Journal:   Journal of Systems Architecture Year: 2022 Vol: 128 Pages: 102551-102551
© 2026 ScienceGate Book Chapters — All rights reserved.