Sindhu Padakandla, K. J. Prabuchandran, S. Ganguly, Shalabh Bhatnagar
Applying reinforcement learning (RL) methods to real-world applications poses multiple challenges, the foremost being the safety of the system controlled by the learning agent and the efficiency of learning. An RL agent learns to control a system by exploring the available actions in various operating states. In some states, an exploratory action may drive the system into unsafe operation, posing hazards both to the system and to the humans supervising it. RL algorithms must therefore learn to control the system while respecting safety. In this work, we formulate the safe RL problem in a constrained off-policy setting that facilitates safe exploration by the RL agent. We then develop a sample-efficient algorithm utilizing the cross-entropy method. The proposed algorithm's safety performance is evaluated numerically on benchmark RL problems.