Safe Reinforcement Learning for Signal Temporal Logic Tasks Using Robust Control Barrier Functions

Jiandong Chen; Yuanyuan Zou; Shaoyuan Li

doi:10.23919/ccc58697.2023.10240080


JOURNAL ARTICLE

Year: 2023 Pages: 8627-8632


Abstract

In this paper, a reinforcement learning (RL) based control synthesis problem is studied for continuous systems under temporal logic tasks. Because such systems are expected to satisfy diverse safety and liveness properties, traditional RL algorithms do not work well: their rewards are designed heuristically and they offer no safety guarantee. In this work, a novel framework is proposed to synthesize a safe and optimal RL-based controller for temporal logic tasks. First, signal temporal logic (STL) is adopted to describe the tasks formally. Based on the robust semantics of STL formulas, a reference reward curve is designed that determines the reward from the current time instant and state. Then, a shield layer is designed using a robust control barrier function, which keeps the system within a safe set while the RL policy is trained. The proposed method yields a time-dependent policy for STL tasks in continuous state space. Several robot motion case studies demonstrate that this approach both ensures safety and guides exploration effectively during training.
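The two ingredients named in the abstract, STL robust semantics and a barrier-function shield, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a planar single-integrator robot (x_dot = u + d with a bounded disturbance ||d|| <= d_max), one circular obstacle, and a disc-shaped goal. All function names and parameters (`goal_margin`, `cbf_shield`, `alpha`, `d_max`) are hypothetical. The paper's reference reward curve is not reproduced here.

```python
import numpy as np

def goal_margin(traj, goal, radius):
    """Predicate margin radius - ||x - goal|| per sample; positive inside the goal disc."""
    return radius - np.linalg.norm(traj - goal, axis=1)

def rho_always(margins):
    """Robustness of 'always (margin > 0)' over a sampled window: the worst margin."""
    return float(np.min(margins))

def rho_eventually(margins):
    """Robustness of 'eventually (margin > 0)' over a sampled window: the best margin."""
    return float(np.max(margins))

def obstacle_barrier(x, center, radius):
    """Barrier h(x) = ||x - c||^2 - r^2; the safe set is {x : h(x) >= 0}."""
    d = x - center
    return float(d @ d - radius ** 2)

def cbf_shield(x, u_rl, center, radius, alpha=1.0, d_max=0.0):
    """Minimally change u_rl so the robust barrier condition holds.

    Assumed dynamics: x_dot = u + d with ||d|| <= d_max (illustrative only).
    Condition: grad_h(x) @ u - ||grad_h(x)|| * d_max + alpha * h(x) >= 0.
    Assumes x != center so the gradient is nonzero.
    """
    grad = 2.0 * (x - center)
    h = obstacle_barrier(x, center, radius)
    b = -alpha * h + np.linalg.norm(grad) * d_max  # constraint reads grad @ u >= b
    slack = grad @ u_rl - b
    if slack >= 0.0:
        return u_rl.copy()  # the RL action is already safe, pass it through
    # Closed-form projection of u_rl onto the half-space {u : grad @ u >= b}
    return u_rl - slack * grad / (grad @ grad)
```

In a training loop, the action proposed by the RL policy would pass through `cbf_shield` before being applied, and a robustness value such as `rho_eventually(goal_margin(...))` could serve as one possible reward signal. With a single affine constraint the quadratic program has the closed-form projection used above; several simultaneous constraints would generally need a QP solver.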

Keywords:

Reinforcement learning; Liveness; Computer science; Heuristic; Temporal logic; Set (abstract data type); Artificial intelligence; Temporal difference learning; Control logic; Controller (irrigation); State (computer science); State space; Function (biology); Semantics (computer science); Robot; Theoretical computer science; Algorithm; Programming language; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 0.25

Citation Normalized Percentile: 0.48

Topics

Real-time simulation and control systems

Physical Sciences → Engineering → Control and Systems Engineering

Formal Methods in Verification

Physical Sciences → Computer Science → Computational Theory and Mathematics

Safety Systems Engineering in Autonomy

Physical Sciences → Engineering → Safety, Risk, Reliability and Quality

Related Documents

Safe Reinforcement Learning Using Robust Control Barrier Functions

Control Barrier Functions for Signal Temporal Logic Tasks

Implicit Fixed-Time Convergence ISS Safe Control Barrier Functions for Signal Temporal Logic Tasks

Control Barrier Functions for Disjunctions of Signal Temporal Logic Tasks

Synthesis of Temporally-Robust Policies for Signal Temporal Logic Tasks using Reinforcement Learning