JOURNAL ARTICLE

Plan-based reward shaping for reinforcement learning

Abstract

Reinforcement learning, while being a highly popular learning technique for agents and multi-agent systems, has so far encountered difficulties when applying it to more complex domains due to scaling-up problems. This paper focuses on the use of domain knowledge to improve the convergence speed and optimality of various RL techniques. Specifically, we propose the use of high-level STRIPS operator knowledge in reward shaping to focus the search for the optimal policy. Empirical results show that the plan-based reward shaping approach outperforms other RL techniques, including alternative manual and MDP-based reward shaping when it is used in its basic form. We show that MDP-based reward shaping may fail and successful experiments with STRIPS-based shaping suggest modifications which can overcome encountered problems. The STRIPS-based method we propose allows expressing the same domain knowledge in a different way and the domain expert can choose whether to define an MDP or STRIPS planning task. We also evaluate the robustness of the proposed STRIPS-based technique to errors in the plan knowledge.

Keywords:
Reinforcement learning Computer science Robustness (evolution) STRIPS Artificial intelligence Machine learning Domain (mathematical analysis) Plan (archaeology) Focus (optics) Domain knowledge Task (project management) Engineering Mathematics

Metrics

73
Cited By
2.79
FWCI (Field Weighted Citation Impact)
19
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
AI-based Problem Solving and Planning
Physical Sciences →  Computer Science →  Artificial Intelligence
Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering

Related Documents

JOURNAL ARTICLE

Plan-based reward shaping for multi-agent reinforcement learning

Sam DevlinDaniel Kudenko⋆

Journal:   The Knowledge Engineering Review Year: 2016 Vol: 31 (1)Pages: 44-58
JOURNAL ARTICLE

Reward Shaping Based Federated Reinforcement Learning

Yiqiu HuYun HuaWenyan LiuJun Zhu

Journal:   IEEE Access Year: 2021 Vol: 9 Pages: 67259-67267
JOURNAL ARTICLE

Reward Shaping for Model-Based Bayesian Reinforcement Learning

Hyeoneun KimWoosang LimKanghoon LeeYung‐Kyun NohKee-Eung Kim

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2015 Vol: 29 (1)
BOOK-CHAPTER

Multigrid Reinforcement Learning with Reward Shaping

Marek GrześDaniel Kudenko⋆

Lecture notes in computer science Year: 2008 Pages: 357-366
JOURNAL ARTICLE

Reward Shaping in Episodic Reinforcement Learning

Marek Grześ

Journal:   Adaptive Agents and Multi-Agents Systems Year: 2017 Pages: 565-573
© 2026 ScienceGate Book Chapters — All rights reserved.