Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning

Xindi Yang; Hao Zhang; Zhuping Wang

doi:10.1109/tnnls.2021.3054685

JOURNAL ARTICLE

Year: 2021
Journal: IEEE Transactions on Neural Networks and Learning Systems
Vol: 33 (8)
Pages: 3872-3883
Publisher: Institute of Electrical and Electronics Engineers

Abstract

This article investigates the optimal distributed consensus control problem for discrete-time multiagent systems with completely unknown dynamics and differing computational abilities. The problem can be viewed as solving nonzero-sum games with distributed reinforcement learning (RL), in which each agent is a player. First, to guarantee the real-time performance of learning algorithms, a data-based distributed control algorithm is proposed for multiagent systems using offline system interaction data sets. By utilizing the interaction data produced while a real-time system runs, the proposed algorithm improves system performance based on distributed policy gradient RL. Convergence and stability are guaranteed based on functional analysis and the Lyapunov method. Second, to address asynchronous learning caused by differing computational abilities in multiagent systems, the proposed algorithm is extended to an asynchronous version in which each agent decides whether to execute policy improvement independently of its neighbors. Furthermore, an actor-critic structure, which contains two neural networks, is developed to implement the proposed algorithm in both the synchronous and asynchronous cases. Based on the method of weighted residuals, the convergence and optimality of the neural networks are guaranteed by proving that the approximation errors converge to zero. Finally, simulations are conducted to show the effectiveness of the proposed algorithm.
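
To make the problem setting concrete, the following minimal sketch simulates a discrete-time consensus protocol on a small graph. It is not the policy gradient algorithm from the article: the single-integrator dynamics, the four-agent path graph, the fixed feedback gain, and the quadratic cost are all assumptions chosen for illustration, and the names `NEIGHBORS`, `local_errors`, `step`, and `simulate` are hypothetical. It shows only the local neighborhood error that a distributed controller of this kind would drive to zero.

```python
# Illustrative sketch only: a fixed-gain distributed consensus protocol,
# NOT the data-based policy gradient algorithm proposed in the article.

# Hypothetical undirected communication graph over 4 agents (adjacency lists).
NEIGHBORS = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}


def local_errors(x):
    """Local neighborhood error e_i = sum over neighbors j of (x_i - x_j)."""
    return [sum(x[i] - x[j] for j in NEIGHBORS[i]) for i in range(len(x))]


def step(x, gain):
    """One step of the assumed dynamics x_i(k+1) = x_i(k) + u_i(k),
    with distributed feedback u_i(k) = -gain * e_i(k)."""
    e = local_errors(x)
    return [xi - gain * ei for xi, ei in zip(x, e)]


def simulate(x0, gain=0.3, steps=200):
    """Run the protocol and accumulate an assumed quadratic cost
    sum over k and i of (e_i^2 + u_i^2)."""
    x, cost = list(x0), 0.0
    for _ in range(steps):
        e = local_errors(x)
        cost += sum(ei ** 2 + (gain * ei) ** 2 for ei in e)
        x = step(x, gain)
    return x, cost


if __name__ == "__main__":
    final_state, total_cost = simulate([1.0, -2.0, 4.0, 0.5])
    print(final_state, total_cost)
```

Because the assumed graph is undirected, the update preserves the average of the agent states, so every agent approaches the initial mean. In a learning-based design the fixed `gain` would be replaced by a policy that each agent improves from recorded interaction data.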

Keywords:

Reinforcement learning; Computer science; Asynchronous communication; Convergence (economics); Artificial neural network; Multi-agent system; Stability (learning theory); Consensus; Lyapunov function; Artificial intelligence; Mathematical optimization; Machine learning; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 5.92

Citation Normalized Percentile: 0.96

Topics

Adaptive Dynamic Programming Control

Physical Sciences → Computer Science → Computational Theory and Mathematics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Distributed Control Multi-Agent Systems

Physical Sciences → Computer Science → Computer Networks and Communications

Related Documents

Data-Driven Optimal Bipartite Consensus Control for Second-Order Multiagent Systems via Policy Gradient Reinforcement Learning

Optimal Bipartite Consensus Control for Nonlinear Multiagent Systems Based on Reinforcement Learning

Data-Based Optimal Couple-Group Consensus Control for Heterogeneous Multi-Agent Systems via Policy Gradient Reinforcement Learning

Recursive-Gradient-Based Data-Driven Consensus Control for Multiagent Systems

Data-Based Optimal Control of Multiagent Systems: A Reinforcement Learning Design Approach