JOURNAL ARTICLE

Multi-Agent Deep Reinforcement Learning Cooperative Control Model for Autonomous Vehicle Merging into Platoon in Highway

Jiajia ChenBingqing ZhuMengyu ZhangXiang LingXiaobo RuanYifan DengNing Guo

Year: 2025 Journal:   World Electric Vehicle Journal Vol: 16 (4)Pages: 225-225   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

This study presents the first investigation into the problem of autonomous vehicle (AV) merging into existing platoons, proposing a multi-agent deep reinforcement learning (MA-DRL)-based cooperative control framework. The developed MA-DRL architecture enables coordinated learning among multiple autonomous agents to address the multi-objective coordination challenge through synchronized control of platoon longitudinal acceleration, AV steering and acceleration. To enhance training efficiency, we develop a dual-layer multi-agent maximum Q-value proximal policy optimization (MAMQPPO) method, which extends the multi-agent PPO algorithm (a policy gradient method ensuring stable policy updates) by incorporating maximum Q-value action selection for platoon gap control and discrete command generation. This method simplifies the training process by using maximum Q-value action policy optimization to learn platoon gap selection and discrete action commands. Furthermore, a partially decoupled reward function (PD-Reward) is designed to properly guide the behavioral actions of both AVs and platoons while accelerating network convergence. Comprehensive highway simulation experiments show the proposed method reduces merging time by 37.69% (12.4 s vs. 19.9 s) and energy consumption by 58% (3.56 kWh vs. 8.47 kWh) compared to existing methods (the quintic polynomial-based + PID (Proportional–Integral–Differential)).

Keywords:
Platoon Reinforcement learning Computer science Control (management) Reinforcement Automotive engineering Artificial intelligence Engineering Structural engineering

Metrics

2
Cited By
7.43
FWCI (Field Weighted Citation Impact)
26
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Traffic control and management
Physical Sciences →  Engineering →  Control and Systems Engineering
Autonomous Vehicle Technology and Safety
Physical Sciences →  Engineering →  Automotive Engineering
Traffic Prediction and Management Techniques
Physical Sciences →  Engineering →  Building and Construction

Related Documents

BOOK-CHAPTER

Connected Autonomous Vehicle Platoon Control Through Multi-agent Deep Reinforcement Learning

Guangfei XuBing ChenGuangxian LiXiangkun He

Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering Year: 2022 Pages: 239-248
© 2026 ScienceGate Book Chapters — All rights reserved.