Decentralized Multi-Agent Advantage Actor-Critic

Barnes, Scott

doi:10.6084/m9.figshare.13718365

JOURNAL ARTICLE

Year: 2021

Abstract

We present a decentralized advantage actor-critic algorithm that trains learning agents in parallel environments with synchronous gradient descent. The approach decorrelates the agents’ experiences, which stabilizes observations and eliminates the need for a replay buffer. It requires no knowledge of the other agents’ internal state during training or execution, and it runs on a single multi-core CPU.
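The abstract does not give implementation details, so the following is only a minimal sketch of the general idea, not the author's implementation. It assumes a hypothetical two-agent cooperative matrix game (reward 1 when both agents pick action 1, otherwise 0), a tabular softmax actor and a scalar critic per agent, and parallel environments that are simulated sequentially in one process rather than spread across CPU cores. The names `Agent`, `train`, `n_envs`, and the learning rates are illustrative assumptions. Each agent sees only its own actions and the reward, and each batch is discarded after one synchronous update, so no replay buffer is kept.

```python
import math
import random


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


class Agent:
    """One independent learner with its own actor and critic.

    Assumption: the toy game has a single state, so the actor is a list of
    action preferences and the critic is a single scalar value estimate.
    """

    def __init__(self, n_actions, lr_actor=0.1, lr_critic=0.1):
        self.prefs = [0.0] * n_actions
        self.value = 0.0
        self.lr_actor = lr_actor
        self.lr_critic = lr_critic

    def act(self, rng):
        probs = softmax(self.prefs)
        return rng.choices(range(len(probs)), weights=probs)[0]

    def update(self, actions, rewards):
        """One synchronous step: average gradients over all parallel envs."""
        probs = softmax(self.prefs)
        n = len(actions)
        grad = [0.0] * len(self.prefs)
        advantage_sum = 0.0
        for a, r in zip(actions, rewards):
            advantage = r - self.value  # one-step advantage, no bootstrapping
            advantage_sum += advantage
            for k in range(len(grad)):
                # Gradient of log pi(a) w.r.t. prefs[k] for a softmax policy.
                grad[k] += advantage * ((1.0 if k == a else 0.0) - probs[k])
        for k in range(len(grad)):
            self.prefs[k] += self.lr_actor * grad[k] / n
        self.value += self.lr_critic * advantage_sum / n


def train(n_envs=8, n_steps=500, seed=0):
    """Train two independent agents on the hypothetical cooperative game."""
    rng = random.Random(seed)
    agents = [Agent(n_actions=2), Agent(n_actions=2)]
    for _ in range(n_steps):
        per_agent_actions = [[] for _ in agents]
        rewards = []
        # Collect one transition from each (simulated) parallel environment.
        for _ in range(n_envs):
            joint = [agent.act(rng) for agent in agents]
            rewards.append(1.0 if all(a == 1 for a in joint) else 0.0)
            for i, a in enumerate(joint):
                per_agent_actions[i].append(a)
        # Each agent updates from its own actions and the reward only;
        # the batch is then thrown away (no replay buffer).
        for agent, actions in zip(agents, per_agent_actions):
            agent.update(actions, rewards)
    return agents


if __name__ == "__main__":
    for i, agent in enumerate(train()):
        print(i, softmax(agent.prefs), agent.value)
```

Averaging the gradient over a batch drawn from several environments at the same time step is what stands in for a replay buffer here: the samples in one update come from independent rollouts rather than from consecutive steps of a single trajectory. A real implementation would presumably replace the tabular actor and critic with neural networks and run the environments in separate worker processes.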

Keywords:

State (computer science), Key (lock), Stability (learning theory), Decentralised system, Class (philosophy)

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.37

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Neural Networks and Reservoir Computing

Physical Sciences → Computer Science → Artificial Intelligence

Adaptive Dynamic Programming Control

Physical Sciences → Computer Science → Computational Theory and Mathematics

Related Documents

Decentralized Counterfactual Multi-Agent Actor-Critic Algorithms

Decentralized Multi-Agent Actor-Critic with Generative Inference

Capacity-Limited Decentralized Actor-Critic for Multi-Agent Games

A New Advantage Actor-Critic Algorithm For Multi-Agent Environments

GTDE: Grouped Training with Decentralized Execution for Multi-agent Actor-Critic