Jiajia Chen, Bingqing Zhu, Mengyu Zhang, Xiang Ling, Xiaobo Ruan, Yifan Deng, Ning Guo
This study presents the first investigation of the problem of an autonomous vehicle (AV) merging into an existing platoon and proposes a cooperative control framework based on multi-agent deep reinforcement learning (MA-DRL). The MA-DRL architecture enables coordinated learning among multiple autonomous agents, addressing the multi-objective coordination challenge through synchronized control of platoon longitudinal acceleration and of AV steering and acceleration. To improve training efficiency, we develop a dual-layer multi-agent maximum Q-value proximal policy optimization (MAMQPPO) method, which extends the multi-agent PPO algorithm (a policy-gradient method with clipped, stable policy updates) by incorporating maximum Q-value action selection for platoon gap selection and discrete command generation, thereby simplifying training. Furthermore, a partially decoupled reward function (PD-Reward) is designed to properly guide the behaviors of both the AV and the platoon while accelerating network convergence. Comprehensive highway simulation experiments show that the proposed method reduces merging time by 37.69% (12.4 s vs. 19.9 s) and energy consumption by 58% (3.56 kWh vs. 8.47 kWh) compared with an existing baseline combining quintic-polynomial trajectory planning with PID (Proportional–Integral–Derivative) control.
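The abstract gives no implementation details, but the dual-layer structure it describes can be illustrated with a minimal sketch: an upper layer that picks a platoon gap greedily by maximum Q-value, and a lower layer whose PPO-style clipped surrogate objective governs the continuous control policy update. All names, dimensions, and the NumPy-only setup below are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of a dual-layer "max-Q gap selection + PPO update" scheme.
# Toy dimensions and function names are hypothetical, for illustration only.
import numpy as np

rng = np.random.default_rng(0)


def select_gap(q_values: np.ndarray) -> int:
    """Upper layer: greedy (maximum-Q-value) choice among candidate gaps."""
    return int(np.argmax(q_values))


def ppo_clip_objective(ratio: np.ndarray, advantage: np.ndarray,
                       eps: float = 0.2) -> float:
    """Lower layer: PPO clipped surrogate, averaged over a batch.

    ratio = pi_new(a|s) / pi_old(a|s); clipping the ratio to [1-eps, 1+eps]
    is what keeps PPO policy updates stable, the property the paper's
    MAMQPPO method builds on.
    """
    unclipped = ratio * advantage
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantage
    return float(np.mean(np.minimum(unclipped, clipped)))


# Toy usage: 3 candidate gaps in the platoon, a batch of 5 transitions.
q_gap = rng.normal(size=3)                     # hypothetical per-gap Q-values
gap = select_gap(q_gap)                        # discrete gap command
ratio = np.exp(rng.normal(scale=0.1, size=5))  # new/old policy probability ratios
adv = rng.normal(size=5)                       # advantage estimates
print(f"chosen gap index: {gap}, "
      f"surrogate objective: {ppo_clip_objective(ratio, adv):.4f}")
```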