JOURNAL ARTICLE

Safe Reinforcement Learning for Signal Temporal Logic Tasks Using Robust Control Barrier Functions

Abstract

In this paper, a control synthesis problem based on reinforcement learning (RL) for continuous system under temporal logic tasks is studied. Since the systems are expected to satisfy diverse safety and liveness properties under temporal logic tasks, traditional RL algorithm does not work well due to the heuristic design of rewards and non-existent security guarantee. In this work, a novel framework is proposed to synthesize a safe and optimal controller based on RL for temporal logic tasks. First, signal temporal logic (STL) is adopted to formally describe the temporal logic tasks. Based on robust semantics of STL formula, a reference reward curve is designed to determine the reward according to the current instant and state. Then, the shield layer is designed by robust control barrier function which renders the system in a safe set when training the RL policy. In the proposed method, a time-dependent policy for STL tasks in continuous state space is achieved. We demonstrate that this approach both ensures safety and guides exploration effectively during training by several robot motion case studies.

Keywords:
Reinforcement learning Liveness Computer science Heuristic Temporal logic Set (abstract data type) Artificial intelligence Temporal difference learning Control logic Controller (irrigation) State (computer science) State space Function (biology) Semantics (computer science) Robot Theoretical computer science Algorithm Programming language Mathematics

Metrics

1
Cited By
0.25
FWCI (Field Weighted Citation Impact)
22
Refs
0.48
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Real-time simulation and control systems
Physical Sciences →  Engineering →  Control and Systems Engineering
Formal Methods in Verification
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Safety Systems Engineering in Autonomy
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality

Related Documents

JOURNAL ARTICLE

Safe Reinforcement Learning Using Robust Control Barrier Functions

Yousef EmamGennaro NotomistaPaul GlotfelterZsolt KiraMagnus Egerstedt

Journal:   IEEE Robotics and Automation Letters Year: 2022 Vol: 10 (3)Pages: 2886-2893
JOURNAL ARTICLE

Control Barrier Functions for Signal Temporal Logic Tasks

Lars LindemannDimos V. Dimarogonas

Journal:   IEEE Control Systems Letters Year: 2018 Vol: 3 (1)Pages: 96-101
JOURNAL ARTICLE

Implicit Fixed-Time Convergence ISS Safe Control Barrier Functions for Signal Temporal Logic Tasks

Ming LiZhiyong Sun

Journal:   2022 IEEE 17th International Conference on Control & Automation (ICCA) Year: 2022 Pages: 722-727
© 2026 ScienceGate Book Chapters — All rights reserved.