JOURNAL ARTICLE

Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Xiaoru ZhaoRennong YangLiangsheng ZhongZhiwei Hou

Year: 2024 Journal:   Drones Vol: 8 (1)Pages: 18-18   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Dedicated to meeting the growing demand for multi-agent collaboration in complex scenarios, this paper introduces a parameter-sharing off-policy multi-agent path planning and the following approach. Current multi-agent path planning predominantly relies on grid-based maps, whereas our proposed approach utilizes laser scan data as input, providing a closer simulation of real-world applications. In this approach, the unmanned aerial vehicle (UAV) uses the soft actor–critic (SAC) algorithm as a planner and trains its policy to converge. This policy enables end-to-end processing of laser scan data, guiding the UAV to avoid obstacles and reach the goal. At the same time, the planner incorporates paths generated by a sampling-based method as following points. The following points are continuously updated as the UAV progresses. Multi-UAV path planning tasks are facilitated, and policy convergence is accelerated through sharing experiences among agents. To address the challenge of UAVs that are initially stationary and overly cautious near the goal, a reward function is designed to encourage UAV movement. Additionally, a multi-UAV simulation environment is established to simulate real-world UAV scenarios to support training and validation of the proposed approach. The simulation results highlight the effectiveness of the presented approach in both the training process and task performance. The presented algorithm achieves an 80% success rate to guarantee that three UAVs reach the goal points.

Keywords:
Motion planning Computer science Reinforcement learning Planner Process (computing) Grid Obstacle avoidance Path (computing) Task (project management) Convergence (economics) Real-time computing Artificial intelligence Simulation Distributed computing Robot Mobile robot Engineering Systems engineering

Metrics

29
Cited By
15.37
FWCI (Field Weighted Citation Impact)
46
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robotic Path Planning Algorithms
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
UAV Applications and Optimization
Physical Sciences →  Engineering →  Aerospace Engineering
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.