Mingyu Cai, Mohammadhosein Hasanbeig, Shaoping Xiao, Alessandro Abate, Zhen Kan
This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDPs) with unknown transition probabilities over continuous state and action spaces. Linear temporal logic (LTL) is used to specify high-level tasks over an infinite horizon, which can be converted into a limit-deterministic generalized Büchi automaton (LDGBA) with several accepting sets. The novelty is to design an embedded product MDP (EP-MDP) between the LDGBA and the MDP by incorporating a synchronous tracking-frontier function that records unvisited accepting sets of the automaton and facilitates satisfaction of the accepting conditions. The proposed LDGBA-based reward-shaping and discounting schemes for model-free reinforcement learning (RL) depend only on the EP-MDP states and overcome the issue of sparse rewards. Rigorous analysis shows that any RL method that optimizes the expected discounted return is guaranteed to find an optimal policy whose traces maximize the satisfaction probability. A modular deep deterministic policy gradient (DDPG) algorithm is then developed to generate such policies over continuous state and action spaces. The performance of the framework is evaluated on an array of OpenAI Gym environments.
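To make the tracking-frontier idea concrete, here is a minimal Python sketch of how such a frontier update and its dense reward signal could work. This is an illustration under assumptions, not the paper's exact construction: the function names `update_frontier` and `shaped_reward`, the reset rule, and the reward value `r_accept` are hypothetical; the paper's precise EP-MDP definition and discounting scheme are given in the full text.

```python
# Hypothetical sketch of a tracking-frontier update for an EP-MDP.
# Accepting sets of the LDGBA are modeled as frozensets of automaton
# state ids; the frontier is the family of accepting sets not yet
# visited in the current round.

def update_frontier(q, frontier, all_accepting_sets):
    """Remove every tracked accepting set that contains automaton state q.

    When the frontier would become empty (all accepting sets visited in
    this round), reset it to the full family, excluding sets containing
    q, so progress toward the next round is tracked immediately.
    """
    visited = {F for F in frontier if q in F}
    if not visited:
        return frontier, False          # no progress on this step
    remaining = frontier - visited
    if not remaining:                   # round complete: reset the frontier
        remaining = frozenset(F for F in all_accepting_sets if q not in F)
    return remaining, True              # progress flag drives reward shaping

def shaped_reward(progressed, r_accept=1.0):
    """Dense reward: positive only when an unvisited accepting set is hit."""
    return r_accept if progressed else 0.0

# Example: an LDGBA with two accepting sets over states {0, 1, 2, 3}.
F_all = frozenset({frozenset({1}), frozenset({3})})
frontier = F_all
for q in [0, 1, 2, 3]:                  # automaton states along a run
    frontier, hit = update_frontier(q, frontier, F_all)
    print(q, shaped_reward(hit))        # rewards 0, 1, 0, 1
```

Because the reward depends only on the (EP-MDP state, frontier) pair, it stays Markovian in the product space, which is what lets an off-the-shelf RL method such as DDPG optimize it directly.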