JOURNAL ARTICLE

QFuture: Learning Future Expectation Cognition in Multiagent Reinforcement Learning

Boyin LiuZhiqiang PuYi PanJianqiang YiMin ChenShijie Wang

Year: 2023 Journal:   IEEE Transactions on Cognitive and Developmental Systems Vol: 16 (4)Pages: 1302-1314   Publisher: Institute of Electrical and Electronics Engineers

Abstract

In multi-agent reinforcement learning (MARL), agents must learn to cooperate by observing the environment and selecting actions that maximize their rewards. However, this learning process can be hampered by myopia, wherein agents' strategies fail to consider the long-term consequences of their actions. A primary reason for this problem is the inaccurate estimation of the long-term value of each action. Socially, humans derive future expectation cognition from available information to anticipate potential future outcomes and adjust their actions accordingly to avoid myopia. Motivated by these insights, this paper proposes a novel framework called QFuture to address the myopia problem. Specifically, we first design a future expectation cognition module (FECM) in this framework to build future expectation cognition in the calculation of individual actionvalue (IAV) and joint action-value (JAV). We model future expectation cognition as random variables in FECM, which learn representation by maximizing mutual information with the future trajectory based on current information. Furthermore, a return-based regularizer is designed to reflect "expectation" and ensure informativeness in the future expectation representation module (FERM) which encodes the future trajectory. Experiments on StarCraft II micromanagement tasks and Google Research Football show that QFuture achieves significant state-of-the-art performance. Demonstrative videos are available at https://sites.google.com/view/qfuture .

Keywords:
Reinforcement learning Computer science Cognition Artificial intelligence Representation (politics) Action (physics) Trajectory Machine learning Human–computer interaction Psychology

Metrics

6
Cited By
1.53
FWCI (Field Weighted Citation Impact)
53
Refs
0.83
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Neural Networks and Reservoir Computing
Physical Sciences →  Computer Science →  Artificial Intelligence
Explainable Artificial Intelligence (XAI)
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Cognition-Oriented Multiagent Reinforcement Learning

Tenghai QiuShiguang WuZhen LiuZhiqiang PuJianqiang YiYuqian ZhaoBiao Luo

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2024 Vol: 36 (6)Pages: 10736-10748
BOOK-CHAPTER

Multiagent Reinforcement Learning

Jonathan P. HowDong Ki KimSamir Wadhwania

Encyclopedia of Systems and Control Year: 2021 Pages: 1359-1367
BOOK-CHAPTER

Multiagent Reinforcement Learning

Jonathan P. HowDong Ki KimSamir Wadhwania

Encyclopedia of Systems and Control Year: 2020 Pages: 1-9
© 2026 ScienceGate Book Chapters — All rights reserved.