JOURNAL ARTICLE

Reinforcement-Learning-Based Multi-UAV Cooperative Search for Moving Targets in 3D Scenarios

Yifei LiuXiaoshuai LiJian WangFeiyu WeiJunan Yang

Year: 2024 Journal:   Drones Vol: 8 (8)Pages: 378-378   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Most existing multi-UAV collaborative search methods only consider scenarios of two-dimensional path planning or static target search. To be close to the practical scenario, this paper proposes a path planning method based on an action-mask-based multi-agent proximal policy optimization (AM-MAPPO) algorithm for multiple UAVs searching for moving targets in three-dimensional (3D) environments. In particular, a multi-UAV high–low altitude collaborative search architecture is introduced that not only takes into account the extensive detection range of high-altitude UAVs but also leverages the benefit of the superior detection quality of low-altitude UAVs. The optimization objective of the search task is to minimize the uncertainty of the search area while maximizing the number of captured moving targets. The path planning problem for moving target search in a 3D environment is formulated and addressed using the AM-MAPPO algorithm. The proposed method incorporates a state representation mechanism based on field-of-view encoding to handle dynamic changes in neural network input dimensions and develops a rule-based target capture mechanism and an action-mask-based collision avoidance mechanism to enhance the AM-MAPPO algorithm’s convergence speed. Experimental results demonstrate that the proposed algorithm significantly reduces regional uncertainty and increases the number of captured moving targets compared to other deep reinforcement learning methods. Ablation studies further indicate that the proposed action mask mechanism, target capture mechanism, and collision avoidance mechanism of the AM-MAPPO algorithm can improve the algorithm’s effectiveness, target capture capability, and UAVs’ safety, respectively.

Keywords:
Reinforcement learning Computer science Motion planning Convergence (economics) Artificial intelligence Collision avoidance Search and rescue Path (computing) Encoding (memory) Machine learning Collision Robot

Metrics

18
Cited By
9.54
FWCI (Field Weighted Citation Impact)
49
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robotic Path Planning Algorithms
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
UAV Applications and Optimization
Physical Sciences →  Engineering →  Aerospace Engineering
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering

Related Documents

JOURNAL ARTICLE

Multi-AUV Cooperative Search for Moving Targets Based on Multi-Agent Reinforcement Learning

Le LiR. AnZhaozhi GuoJian Gao

Journal:   Journal of Marine Science and Engineering Year: 2025 Vol: 13 (11)Pages: 2072-2072
JOURNAL ARTICLE

Multi-UAV Cooperative Searching and Tracking for Moving Targets Based on Multi-Agent Reinforcement Learning

Kai SuFeng Qian

Journal:   Applied Sciences Year: 2023 Vol: 13 (21)Pages: 11905-11905
JOURNAL ARTICLE

Multi-Agent Reinforcement Learning for Distributed Cooperative Targets Search

Yang SunZhenning WuQiyuan ZhangZongying ShiYisheng Zhong

Journal:   2021 IEEE International Conference on Unmanned Systems (ICUS) Year: 2021 Pages: 711-716
© 2026 ScienceGate Book Chapters — All rights reserved.