Reinforcement-Learning-Based Multi-UAV Cooperative Search for Moving Targets in 3D Scenarios

Yifei Liu; Xiaoshuai Li; Jian Wang; Feiyu Wei; Junan Yang

doi:10.3390/drones8080378

ScienceGate Book Chapters

JOURNAL ARTICLE

Reinforcement-Learning-Based Multi-UAV Cooperative Search for Moving Targets in 3D Scenarios

Yifei Liu Xiaoshuai Li Jian Wang Feiyu Wei Junan Yang

Year: 2024 Journal: Drones Vol: 8 (8)Pages: 378-378 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/drones8080378

Get Full-Text PDF Get Analytical Report

Abstract

Most existing multi-UAV collaborative search methods only consider scenarios of two-dimensional path planning or static target search. To be close to the practical scenario, this paper proposes a path planning method based on an action-mask-based multi-agent proximal policy optimization (AM-MAPPO) algorithm for multiple UAVs searching for moving targets in three-dimensional (3D) environments. In particular, a multi-UAV high–low altitude collaborative search architecture is introduced that not only takes into account the extensive detection range of high-altitude UAVs but also leverages the benefit of the superior detection quality of low-altitude UAVs. The optimization objective of the search task is to minimize the uncertainty of the search area while maximizing the number of captured moving targets. The path planning problem for moving target search in a 3D environment is formulated and addressed using the AM-MAPPO algorithm. The proposed method incorporates a state representation mechanism based on field-of-view encoding to handle dynamic changes in neural network input dimensions and develops a rule-based target capture mechanism and an action-mask-based collision avoidance mechanism to enhance the AM-MAPPO algorithm’s convergence speed. Experimental results demonstrate that the proposed algorithm significantly reduces regional uncertainty and increases the number of captured moving targets compared to other deep reinforcement learning methods. Ablation studies further indicate that the proposed action mask mechanism, target capture mechanism, and collision avoidance mechanism of the AM-MAPPO algorithm can improve the algorithm’s effectiveness, target capture capability, and UAVs’ safety, respectively.

Keywords:

Reinforcement learning Computer science Motion planning Convergence (economics) Artificial intelligence Collision avoidance Search and rescue Path (computing) Encoding (memory) Machine learning Collision Robot

Metrics

Cited By

9.54

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Robotic Path Planning Algorithms

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

UAV Applications and Optimization

Physical Sciences → Engineering → Aerospace Engineering

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

Reinforcement-Learning-Based Multi-UAV Cooperative Search for Moving Targets in 3D Scenarios

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-UAV Cooperative Search for Moving Targets: A Deep Reinforcement Learning Method

Multi-AUV Cooperative Search for Moving Targets Based on Multi-Agent Reinforcement Learning

Multi-UAV Cooperative Searching and Tracking for Moving Targets Based on Multi-Agent Reinforcement Learning

Sensing-Aware Cooperative Multi-Uav Search with Reinforcement Learning

Multi-Agent Reinforcement Learning for Distributed Cooperative Targets Search