Strategic Interaction Multi-Agent Deep Reinforcement Learning

Wenhong Zhou; Jie Li; Yiting Chen; Lincheng Shen

doi:10.1109/access.2020.3005734

ScienceGate Book Chapters

JOURNAL ARTICLE

Strategic Interaction Multi-Agent Deep Reinforcement Learning

Wenhong Zhou Jie Li Yiting Chen Lincheng Shen

Year: 2020 Journal: IEEE Access Vol: 8 Pages: 119000-119009 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/access.2020.3005734

Get Full-Text PDF Get Analytical Report

Abstract

Despite the proliferation of multi-agent deep reinforcement learning (MADRL), most existing typical methods do not scale well to the dynamics of agent populations. And as the population increases, the dimensional explosion of joint state-action and the complex interaction between agents make learning extremely cumbersome, which poses the scalability challenge for MADRL. This paper focuses on the scalability issue of MADRL with homogeneous agents. In a natural population, local interaction is a more feasible mode of interplay rather than global interaction. And inspired by the strategic interaction model in economics, we decompose the value function of each agent into the sum of the expected cumulative rewards of the interaction between the agent and each neighbor. This novel value function is decentralized and decomposable, which enables it to scale well to the dynamic changes in the number of large-scale agents. Hereby, the corresponding strategic interaction reinforcement learning algorithm (SIQ), is proposed to learn the optimal policy of each agent, wherein a neural network is employed to estimate the expected cumulative reward for the interaction between the agent and one of its neighbors. We test the validity of the proposed method in a mixed cooperative-competitive confrontation game through numerical experiments. Furthermore, the scalability comparison experiments illustrate that the scalability of the SIQ algorithm outperforms the independent learning and mean field reinforcement learning algorithms in multiple scenarios with different and dynamically changing numbers.

Keywords:

Reinforcement learning Computer science Scalability Population Artificial intelligence Artificial neural network Scale (ratio) Function (biology) Machine learning Mathematical optimization Mathematics

Metrics

Cited By

0.73

FWCI (Field Weighted Citation Impact)

Refs

0.76

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Experimental Behavioral Economics Studies

Social Sciences → Social Sciences → Safety Research

Adaptive Dynamic Programming Control

Physical Sciences → Computer Science → Computational Theory and Mathematics

Strategic Interaction Multi-Agent Deep Reinforcement Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Deep reinforcement learning for multi-agent interaction

Deep multi-agent reinforcement learning

AdvCommNet: Enhancing Multi-Agent Interaction in Deep Reinforcement Learning Systems

Federated, multi-agent, deep reinforcement learning

Deep Reinforcement Learning Multi-Agent Systems