Jian Yang, Liangpei Wang, Jiale Han, C. L. Philip Chen, Yinlong Yuan, Zhu Liang Yu, Guoli Yang
Abstract In the domain of unmanned air combat, achieving efficient autonomous maneuvering decisions presents challenges. Deep Reinforcement Learning (DRL) is one approach to tackling this problem. The final performance of a DRL algorithm is directly affected by the design of the reward functions, and both performance and convergence speed suffer when reward weights are set unreasonably. Therefore, a method named Coupled Reward-Deep Reinforcement Learning (CR-DRL) is introduced to address this problem. Specifically, we propose a novel coupled-weight reward function for DRL within the air combat framework. The new reward function integrates angle and distance so that our DRL maneuver decision model can be trained faster and perform better than models that use conventional reward functions. Additionally, we establish a new competitive training framework designed to enhance the performance of our model against personalized opponents. The experimental results show that, within this training framework, our CR-DRL model outperforms the traditional model with fixed-weight reward functions, achieving a 6.3% increase in average reward in fixed scenarios and a 22.8% increase in changeable scenarios. Moreover, the performance of our model continually improves as the number of iterations increases, ultimately yielding a certain degree of generalization against similar opponents. Finally, we develop a real-time air combat simulation environment based on Unity3D, called Airfightsim, to demonstrate the performance of the proposed algorithm.
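The abstract contrasts a coupled-weight reward (where the angle and distance terms modulate each other) with a fixed-weight sum. As a minimal sketch of that idea, the snippet below lets distance control how much the angle term counts; the exact functional form, the names `coupled_reward`/`fixed_reward`, and the decay constant `d_max` are illustrative assumptions, not the paper's actual formula.

```python
import math

def coupled_reward(angle_deg: float, distance_m: float, d_max: float = 10000.0) -> float:
    """Illustrative coupled-weight reward (hypothetical form, not the paper's):
    the weight on the angle term shrinks with distance, so pointing advantage
    matters more at close range and closing distance matters more at long range."""
    r_angle = 1.0 - angle_deg / 180.0                  # 1.0 when nose-on to the target
    r_dist = 1.0 - min(distance_m, d_max) / d_max      # 1.0 at zero range
    w = math.exp(-distance_m / d_max)                  # coupling: distance sets the angle weight
    return w * r_angle + (1.0 - w) * r_dist

def fixed_reward(angle_deg: float, distance_m: float,
                 d_max: float = 10000.0, w: float = 0.5) -> float:
    """Conventional fixed-weight baseline for comparison: w never adapts."""
    r_angle = 1.0 - angle_deg / 180.0
    r_dist = 1.0 - min(distance_m, d_max) / d_max
    return w * r_angle + (1.0 - w) * r_dist
```

Under this sketch, both rewards peak at nose-on, zero-range geometry, but only the coupled variant re-weights the two terms as the engagement geometry changes.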