DISSERTATION

Goal Space Planning with Reward Shaping

Abstract

Planning and goal-conditioned reinforcement learning aim to provide more efficient and scalable methods for complex, long-horizon tasks. These approaches break tasks into manageable subgoals and leverage prior knowledge to guide learning. However, learned models may predict inaccurate next states, and these errors compound over long-horizon rollouts. As a result, background planning with learned models is often worse than model-free alternatives, even though it uses significantly more memory and computation. Methods that plan in an abstract space, such as Goal-Space Planning, avoid these pitfalls by planning in the background with models that are abstract in both state and time. This thesis shows how potential-based reward shaping can propagate value and speed up learning with local, subgoal-conditioned models. We demonstrate the effectiveness of this approach for tabular, linear, and deep value-based learners, and study its sensitivity to changes in environment dynamics and in the chosen subgoals.
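To make the core mechanism concrete, the sketch below applies potential-based reward shaping, r' = r + gamma * phi(s') - phi(s) (Ng et al., 1999), to tabular Q-learning on a toy chain MDP. This is a minimal sketch under stated assumptions: the chain environment, the hand-coded potential phi, and all hyperparameters are illustrative, not the dissertation's actual setup. In Goal-Space Planning, the potential would presumably be derived from values backed up through the learned subgoal-conditioned models rather than hand-coded.

```python
import numpy as np

# Tabular Q-learning on a toy 10-state chain with potential-based reward
# shaping: r' = r + gamma * phi(s') - phi(s). Everything here (the chain,
# the hand-coded potential, the hyperparameters) is an illustrative
# assumption, not the dissertation's actual environment or models.

n_states, gamma, alpha, eps = 10, 0.99, 0.1, 0.1
goal = n_states - 1

def step(s, a):
    """Chain dynamics: action 0 moves left, action 1 moves right."""
    s_next = max(0, s - 1) if a == 0 else min(goal, s + 1)
    reward = 1.0 if s_next == goal else 0.0  # sparse terminal reward
    return s_next, reward, s_next == goal

def phi(s):
    """Potential: a crude value guess that rises toward the goal.
    Zero at the terminal state so episodic policy invariance holds.
    In Goal-Space Planning this role would be played by values computed
    with the subgoal-conditioned models."""
    return 0.0 if s == goal else gamma ** (goal - s)

Q = np.zeros((n_states, 2))
rng = np.random.default_rng(0)

for episode in range(500):
    s, done = 0, False
    while not done:
        a = int(rng.integers(2)) if rng.random() < eps else int(np.argmax(Q[s]))
        s_next, r, done = step(s, a)
        # Shaped reward: preserves the optimal policy while propagating
        # value toward the goal faster than the sparse reward alone.
        r_shaped = r + gamma * phi(s_next) - phi(s)
        target = r_shaped if done else r_shaped + gamma * np.max(Q[s_next])
        Q[s, a] += alpha * (target - Q[s, a])
        s = s_next

print("Greedy action per state (1 = toward goal):", np.argmax(Q, axis=1))
```

Because the shaping term telescopes along a trajectory, with phi zero at the terminal state the shaped return differs from the original by only the constant phi(s_0), so the optimal policy is unchanged; the per-step shaped rewards, however, immediately penalize moves away from the goal, which is what lets the potential propagate value long before the sparse terminal reward is ever observed.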

Keywords:
Reinforcement learning; goal-conditioned reinforcement learning; reward shaping; planning; subgoals; state abstraction; scalability

Topics

Reinforcement Learning in Robotics
AI-based Problem Solving and Planning
Machine Learning and Algorithms
