Plan-based reward shaping for reinforcement learning

Marek Grześ; Daniel Kudenko⋆

doi:10.1109/is.2008.4670492

ScienceGate Book Chapters

JOURNAL ARTICLE

Plan-based reward shaping for reinforcement learning

Marek Grześ Daniel Kudenko⋆

Year: 2008 Pages: 10-22

DOI: 10.1109/is.2008.4670492

Get Full-Text PDF Get Analytical Report

Abstract

Reinforcement learning, while being a highly popular learning technique for agents and multi-agent systems, has so far encountered difficulties when applying it to more complex domains due to scaling-up problems. This paper focuses on the use of domain knowledge to improve the convergence speed and optimality of various RL techniques. Specifically, we propose the use of high-level STRIPS operator knowledge in reward shaping to focus the search for the optimal policy. Empirical results show that the plan-based reward shaping approach outperforms other RL techniques, including alternative manual and MDP-based reward shaping when it is used in its basic form. We show that MDP-based reward shaping may fail and successful experiments with STRIPS-based shaping suggest modifications which can overcome encountered problems. The STRIPS-based method we propose allows expressing the same domain knowledge in a different way and the domain expert can choose whether to define an MDP or STRIPS planning task. We also evaluate the robustness of the proposed STRIPS-based technique to errors in the plan knowledge.

Keywords:

Reinforcement learning Computer science Robustness (evolution) STRIPS Artificial intelligence Machine learning Domain (mathematical analysis) Plan (archaeology) Focus (optics) Domain knowledge Task (project management) Engineering Mathematics

Metrics

Cited By

2.79

FWCI (Field Weighted Citation Impact)

Refs

0.94

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

AI-based Problem Solving and Planning

Physical Sciences → Computer Science → Artificial Intelligence

Robot Manipulation and Learning

Physical Sciences → Engineering → Control and Systems Engineering

Plan-based reward shaping for reinforcement learning

Abstract

Metrics

Citation History

Topics

Related Documents

Plan-based reward shaping for multi-agent reinforcement learning

Reward Shaping Based Federated Reinforcement Learning

Reward Shaping for Model-Based Bayesian Reinforcement Learning

Multigrid Reinforcement Learning with Reward Shaping

Reward Shaping in Episodic Reinforcement Learning