JOURNAL ARTICLE

Joint Trajectory and Communication Optimization for Heterogeneous Vehicles in Maritime SAR: Multi-Agent Reinforcement Learning

Chengjia LeiShaohua WuYi YangJiayin XueQinyu Zhang

Year: 2024 Journal:   IEEE Transactions on Vehicular Technology Vol: 73 (9)Pages: 12328-12344   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Nowadays, multiple types of equipment, including unmanned aerial vehicles (UAVs) and automatic surface vehicles (ASVs), have been deployed in maritime search and rescue (SAR). However, due to the lack of base stations (BSs), how to complete the rescue while maintaining the communication between vehicles is an unresolved challenge. In this paper, we design an efficient and fault-tolerant communication solution by jointly optimizing vehicles' trajectory, offloading scheduling, and routing topology for a heterogeneous vehicles system. First, we model several essential factors in maritime SAR, including the impact of ocean currents, the observational behavior of UAVs, the fault tolerance of relay networks, resource management of mobile edge computing (MEC), and energy consumption. A multi-objective optimization problem is formulated, aiming at minimizing time and energy consumption while increasing the fault tolerance of relay networks. Then, we transfer the objective into a decentralized partially observable Markov Decision Process (Dec-POMDP) and introduce multi-agent reinforcement learning (MARL) to search for a collaborative strategy. Specifically, two MARL approaches with different training styles are evaluated, and three techniques are added for improving performance, including sharing parameters , normalized generalized-advantage-estimation (GAE), and preserving-outputs-precisely-while-adaptively-rescaling-targets (Pop-Art). Experimental results demonstrate that our proposed approach, named heterogeneous vehicles multi-agent proximal policy optimization (HVMAPPO), outperforms other baselines in efficiency and fault tolerance of communication.

Keywords:
Reinforcement learning Trajectory Joint (building) Computer science Trajectory optimization Wireless Reinforcement Artificial intelligence Engineering Telecommunications Structural engineering

Metrics

5
Cited By
6.60
FWCI (Field Weighted Citation Impact)
47
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Satellite Communication Systems
Physical Sciences →  Engineering →  Aerospace Engineering
Maritime Navigation and Safety
Physical Sciences →  Engineering →  Ocean Engineering
UAV Applications and Optimization
Physical Sciences →  Engineering →  Aerospace Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.