JOURNAL ARTICLE

Reinforcement Learning with Temporal Logic Constraints

Bengt LennartsonQing‐Shan Jia

Year: 2020 Journal:   IFAC-PapersOnLine Vol: 53 (4)Pages: 485-492   Publisher: Elsevier BV

Abstract

Reinforcement learning (RL) is an agent based AI learning method, where learning and optimization are combined. Dynamic programming is then performed iteratively, based on reward and next state observations from the system to be controlled. A brief survey of RL is given, followed by an evaluation of a recently proposed method to include temporal logic safety and liveness guarantees in RL, here combined with classical performance optimization. RL is based on Markov decision processes (MDPs), and to reduce the number of observations from the system, a modular MDP framework is proposed. In the learning process, it is then assumed that some parts of the system are represented by known MDP models, while other parts can be estimated by observations from the real system. Local information from the modular system may then be used to reduce the computational complexity, especially in the handling of safety properties.

Keywords:
Reinforcement learning Liveness Computer science Markov decision process Modular design Artificial intelligence State (computer science) Dynamic programming Machine learning Process (computing) Markov process Theoretical computer science Mathematics Algorithm Programming language

Metrics

5
Cited By
0.15
FWCI (Field Weighted Citation Impact)
26
Refs
0.57
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Formal Methods in Verification
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Advanced Software Engineering Methodologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software

Related Documents

JOURNAL ARTICLE

Probabilistically Guaranteed Satisfaction of Temporal Logic Constraints During Reinforcement Learning

Derya AksarayYasin YazıcıoğluAhmet Semi Asarkaya

Journal:   2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Year: 2021 Vol: 16 Pages: 6531-6537
JOURNAL ARTICLE

Reinforcement learning under temporal logic constraints as a sequence modeling problem

Daiying TianHao FangQingkai YangHaoyong YuWenyu LiangYan Wu

Journal:   Robotics and Autonomous Systems Year: 2022 Vol: 161 Pages: 104351-104351
© 2026 ScienceGate Book Chapters — All rights reserved.