JOURNAL ARTICLE

Research on Multi-agent PPO Reinforcement Learning Algorithm based on Knowledge Graph

Abstract

At present, the reinforcement learning algorithm based on deep neural network has made great progress in the research of multi-agent interactive training methods. However, the training cost of the model increases significantly with the increase of network depth and the number of agents. Therefore, designing an efficient method about multi-agent reinforcement learning is the focus of current research. In this paper, the strategy network structure is the PPO which is the traditional reinforcement learning strategy gradient algorithm. The training data is characterized by knowledge graph and key information is extracted to ensure that the training can be completed by obtaining a small amount of data from the multi-agent interactive data. So we can effectively reduce the number of training iterations and improve the efficiency of convergence. This paper is based on knowledge graph and PPO algorithm to solve the multi-agent reinforcement learning problem. The research results can be applied in the field of unmanned cluster intelligent decision.

Keywords:
Reinforcement learning Computer science Artificial neural network Artificial intelligence Convergence (economics) Machine learning Graph Theoretical computer science

Metrics

2
Cited By
0.51
FWCI (Field Weighted Citation Impact)
9
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Technologies in Various Fields
Physical Sciences →  Computer Science →  Artificial Intelligence
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.