Boyin Liu, Zhiqiang Pu, Yi Pan, Jianqiang Yi, Min Chen, Shijie Wang
In multi-agent reinforcement learning (MARL), agents must learn to cooperate by observing the environment and selecting actions that maximize their rewards. However, this learning process can be hampered by myopia, wherein agents' strategies fail to consider the long-term consequences of their actions. A primary cause of this problem is inaccurate estimation of the long-term value of each action. Socially, humans derive future expectation cognition from available information to anticipate potential future outcomes and adjust their actions accordingly, thereby avoiding myopia. Motivated by these insights, this paper proposes a novel framework called QFuture to address the myopia problem. Specifically, we first design a future expectation cognition module (FECM) in this framework to build future expectation cognition into the calculation of the individual action-value (IAV) and the joint action-value (JAV). We model future expectation cognition as random variables in FECM, whose representations are learned by maximizing their mutual information with the future trajectory given current information. Furthermore, a return-based regularizer is designed to reflect "expectation" and ensure informativeness in the future expectation representation module (FERM), which encodes the future trajectory. Experiments on StarCraft II micromanagement tasks and Google Research Football show that QFuture achieves state-of-the-art performance. Demonstrative videos are available at https://sites.google.com/view/qfuture.
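As a rough illustration of the mutual-information objective described above, the sketch below infers a latent "future expectation" from current information, encodes the actual future trajectory (standing in for FERM), ties the two together with an InfoNCE-style lower bound on their mutual information, and adds a return-prediction term as the return-based regularizer. All names (FutureExpectationSketch, expectation_net, future_encoder) and architectural choices here are illustrative assumptions, not the paper's actual implementation.

```python
# Hypothetical sketch of the FECM/FERM training signal, assuming an
# InfoNCE lower bound is used for the mutual-information term.
import torch
import torch.nn as nn
import torch.nn.functional as F

class FutureExpectationSketch(nn.Module):
    def __init__(self, obs_dim: int, traj_dim: int, z_dim: int = 32):
        super().__init__()
        # Infers the latent "future expectation" z from current information.
        self.expectation_net = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, z_dim))
        # Encodes the observed future trajectory (a stand-in for FERM).
        self.future_encoder = nn.Sequential(
            nn.Linear(traj_dim, 64), nn.ReLU(), nn.Linear(64, z_dim))
        # Head for the return-based regularizer.
        self.return_head = nn.Linear(z_dim, 1)

    def forward(self, current_info, future_traj, returns):
        z = self.expectation_net(current_info)   # (B, z_dim)
        f = self.future_encoder(future_traj)     # (B, z_dim)
        # InfoNCE: each latent should score highest against its own
        # future embedding among all futures in the batch; this is a
        # standard lower bound on I(z; future trajectory).
        logits = z @ f.t()                        # (B, B) similarities
        labels = torch.arange(z.size(0), device=z.device)
        mi_loss = F.cross_entropy(logits, labels)
        # Return-based regularizer: the future embedding must predict
        # the observed discounted return, keeping it informative.
        ret_loss = F.mse_loss(self.return_head(f).squeeze(-1), returns)
        return mi_loss + ret_loss
```

In this reading, the contrastive term aligns what the agent expects with what actually happens, while the return head prevents the future encoding from collapsing to features irrelevant to long-term value.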