Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Lu Wen; Jingliang Duan; Shengbo Eben Li; Shaobing Xu; Huei Peng

doi:10.1109/itsc45102.2020.9294262

ScienceGate Book Chapters

JOURNAL ARTICLE

Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Lu Wen Jingliang Duan Shengbo Eben Li Shaobing Xu Huei Peng

Year: 2020 Pages: 1-7

DOI: 10.1109/itsc45102.2020.9294262

Get Full-Text PDF Get Analytical Report

Abstract

Reinforcement learning (RL) is attracting increasing interests in autonomous driving due to its potential to solve complex classification and control problems. However, existing RL algorithms are rarely applied to real vehicles for two predominant problems: behaviors are unexplainable, and they cannot guarantee safety under new scenarios. This paper presents a safe RL algorithm, called Parallel Constrained Policy Optimization (PCPO), for two autonomous driving tasks. PCPO extends today's common actor-critic architecture to a three-component learning framework, in which three neural networks are used to approximate the policy function, value function and a newly added risk function, respectively. Meanwhile, a trust region constraint is added to allow large update steps without breaking the monotonic improvement condition. To ensure the feasibility of safety constrained problems, synchronized parallel learners are employed to explore different state spaces, which accelerates learning and policy-update. The simulations of two scenarios for autonomous vehicles confirm we can ensure safety while achieving fast learning.

Keywords:

Reinforcement learning Computer science Reinforcement Artificial intelligence Engineering Structural engineering

Metrics

Cited By

5.29

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Autonomous Vehicle Technology and Safety

Physical Sciences → Engineering → Automotive Engineering

Traffic control and management

Physical Sciences → Engineering → Control and Systems Engineering

Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Abstract

Metrics

Citation History

Topics

Related Documents

CVaR-Constrained Policy Optimization for Safe Reinforcement Learning

Game-Theoretic Constrained Policy Optimization for Safe Reinforcement Learning

Scalable Constrained Policy Optimization for Safe Multi-agent Reinforcement Learning

Constrained Policy Optimization Algorithm for Autonomous Driving via Reinforcement Learning

Safe Reinforcement Learning-based Driving Policy Design for Autonomous Vehicles on Highways