Intelligent Wargame Deduction Decision Method Based on Deep Reinforcement Learning

Shui HU

ScienceGate Book Chapters

JOURNAL ARTICLE

Intelligent Wargame Deduction Decision Method Based on Deep Reinforcement Learning

Shui HU

Year: 2023 Journal: DOAJ (DOAJ: Directory of Open Access Journals)

Get Full-Text PDF Get Analytical Report

Abstract

Wargame deduction is an important method for cultivating modern military commanders. Introducing artificial intelligence technology in wargame deduction can simplify organizational processes and improve deduction efficiency. Owing to the complex situational information and incomplete inference information, intelligent wargame based on machine learning often reduces the sample efficiency of autonomous decision-making models. This paper proposes an intelligent wargame deduction decision-making method based on deep reinforcement learning. In response to the efficiency issue of intelligent wargame deduction and combat decision-making, a baseline is introduced into the strategy network, and the training of the policy network is accelerated. Subsequently, derivation and proof are presented, and a method for updating the parameters of the policy network after adding the baseline is proposed. The process of introducing the state-value function in the wargame deduction environment into the model is analyzed. Construct a Low Advantage Policy-Value Network(LAPVN) model and its training framework for wargame deduction under traditional policy-value networks, and construct the model using battlefield situational awareness methods. In a wargame combat experimental environment that approximately conforms to military operational rules, the traditional policy-value network and LAPVN are compared for training. In 400 self-game training sessions, the loss value of the LAPVN model decreases from 5.3 to 2.3, and the convergence is faster than that of the traditional policy-value network. The KL divergence of the LAPVN model is very close to zero during the training process.

Keywords:

Construct (python library) Process (computing) Reinforcement learning Baseline (sea) Artificial neural network Function (biology) Situation awareness Term (time)

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.72

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Military Defense Systems Analysis

Physical Sciences → Engineering → Aerospace Engineering

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

AI and Multimedia in Education

Physical Sciences → Computer Science → Artificial Intelligence

Intelligent Wargame Deduction Decision Method Based on Deep Reinforcement Learning

Abstract

Metrics

Topics

Related Documents

Research on Deep Reinforcement Learning Exploration Strategy in Wargame Deduction

Intelligent Decision-Making and Human Language Communication Based on Deep Reinforcement Learning in a Wargame Environment

Intelligent Battlefield Situation Comprehension Method Based On Deep Learning in Wargame

A Deep Reinforcement Learning Based Intelligent Decision Method for UCAV Air Combat

Research on Self-Gaming Training Method of Wargame Based on Deep Reinforcement Learning