Joint Trajectory and Communication Optimization for Heterogeneous Vehicles in Maritime SAR: Multi-Agent Reinforcement Learning

Chengjia Lei; Shaohua Wu; Yi Yang; Jiayin Xue; Qinyu Zhang

doi:10.1109/tvt.2024.3388499

ScienceGate Book Chapters

JOURNAL ARTICLE

Joint Trajectory and Communication Optimization for Heterogeneous Vehicles in Maritime SAR: Multi-Agent Reinforcement Learning

Chengjia Lei Shaohua Wu Yi Yang Jiayin Xue Qinyu Zhang

Year: 2024 Journal: IEEE Transactions on Vehicular Technology Vol: 73 (9)Pages: 12328-12344 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tvt.2024.3388499

Get Full-Text PDF Get Analytical Report

Abstract

Nowadays, multiple types of equipment, including unmanned aerial vehicles (UAVs) and automatic surface vehicles (ASVs), have been deployed in maritime search and rescue (SAR). However, due to the lack of base stations (BSs), how to complete the rescue while maintaining the communication between vehicles is an unresolved challenge. In this paper, we design an efficient and fault-tolerant communication solution by jointly optimizing vehicles' trajectory, offloading scheduling, and routing topology for a heterogeneous vehicles system. First, we model several essential factors in maritime SAR, including the impact of ocean currents, the observational behavior of UAVs, the fault tolerance of relay networks, resource management of mobile edge computing (MEC), and energy consumption. A multi-objective optimization problem is formulated, aiming at minimizing time and energy consumption while increasing the fault tolerance of relay networks. Then, we transfer the objective into a decentralized partially observable Markov Decision Process (Dec-POMDP) and introduce multi-agent reinforcement learning (MARL) to search for a collaborative strategy. Specifically, two MARL approaches with different training styles are evaluated, and three techniques are added for improving performance, including sharing parameters , normalized generalized-advantage-estimation (GAE), and preserving-outputs-precisely-while-adaptively-rescaling-targets (Pop-Art). Experimental results demonstrate that our proposed approach, named heterogeneous vehicles multi-agent proximal policy optimization (HVMAPPO), outperforms other baselines in efficiency and fault tolerance of communication.

Keywords:

Reinforcement learning Trajectory Joint (building) Computer science Trajectory optimization Wireless Reinforcement Artificial intelligence Engineering Telecommunications Structural engineering

Metrics

Cited By

6.60

FWCI (Field Weighted Citation Impact)

Refs

0.93

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Satellite Communication Systems

Physical Sciences → Engineering → Aerospace Engineering

Maritime Navigation and Safety

Physical Sciences → Engineering → Ocean Engineering

UAV Applications and Optimization

Physical Sciences → Engineering → Aerospace Engineering

Joint Trajectory and Communication Optimization for Heterogeneous Vehicles in Maritime SAR: Multi-Agent Reinforcement Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning

Joint Trajectory and Spectrum Optimization in Advanced Air Mobility via Multi-Agent Deep Reinforcement Learning

Joint Trajectory and Radio Resource Optimization for Autonomous Mobile Robots Exploiting Multi-Agent Reinforcement Learning

Joint trajectory and incentive optimization for privacy-preserving UAV crowdsensing via multi-agent federated reinforcement learning

Federated Multi-Agent Deep Reinforcement Learning for Cooperative Optimization Communication Resource in Heterogeneous Wireless Networks