JOURNAL ARTICLE

Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Abstract

Reinforcement learning (RL) is attracting increasing interests in autonomous driving due to its potential to solve complex classification and control problems. However, existing RL algorithms are rarely applied to real vehicles for two predominant problems: behaviors are unexplainable, and they cannot guarantee safety under new scenarios. This paper presents a safe RL algorithm, called Parallel Constrained Policy Optimization (PCPO), for two autonomous driving tasks. PCPO extends today's common actor-critic architecture to a three-component learning framework, in which three neural networks are used to approximate the policy function, value function and a newly added risk function, respectively. Meanwhile, a trust region constraint is added to allow large update steps without breaking the monotonic improvement condition. To ensure the feasibility of safety constrained problems, synchronized parallel learners are employed to explore different state spaces, which accelerates learning and policy-update. The simulations of two scenarios for autonomous vehicles confirm we can ensure safety while achieving fast learning.

Keywords:
Reinforcement learning Computer science Reinforcement Artificial intelligence Engineering Structural engineering

Metrics

59
Cited By
5.29
FWCI (Field Weighted Citation Impact)
39
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Autonomous Vehicle Technology and Safety
Physical Sciences →  Engineering →  Automotive Engineering
Traffic control and management
Physical Sciences →  Engineering →  Control and Systems Engineering

Related Documents

JOURNAL ARTICLE

CVaR-Constrained Policy Optimization for Safe Reinforcement Learning

Qiyuan ZhangShu LengXiaoteng MaQihan LiuXueqian WangBin LiangYu LiuJun Yang

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2024 Vol: 36 (1)Pages: 830-841
JOURNAL ARTICLE

Game-Theoretic Constrained Policy Optimization for Safe Reinforcement Learning

Changxin ZhangXinglong ZhangYixing LanHao GaoXin Xu

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2025 Vol: 36 (10)Pages: 17990-18004
JOURNAL ARTICLE

Safe Reinforcement Learning-based Driving Policy Design for Autonomous Vehicles on Highways

Hung Duy NguyenKyoungseok Han

Journal:   International Journal of Control Automation and Systems Year: 2023 Vol: 21 (12)Pages: 4098-4110
© 2026 ScienceGate Book Chapters — All rights reserved.