JOURNAL ARTICLE

Rewards Prediction-Based Credit Assignment for Reinforcement Learning With Sparse Binary Rewards

Minah SeoLuiz Felipe VecchiettiSangkeum LeeDongsoo Har

Year: 2019 Journal:   IEEE Access Vol: 7 Pages: 118776-118791   Publisher: Institute of Electrical and Electronics Engineers

Abstract

In reinforcement learning (RL), a reinforcement signal may be infrequent and delayed, not appearing immediately after the action that triggered the reward. To trace back what sequence of actions contributes to delayed rewards, e.g., credit assignment (CA), is one of the biggest challenges in RL. This challenge is aggravated under sparse binary rewards, especially when rewards are given only after successful completion of the task. To this end, a novel method consisting of key-action detection, among a sequence of actions to perform a task under sparse binary rewards, and CA strategy is proposed. The key-action defined as the most important action contributing to the reward is detected by a deep neural network that predicts future rewards based on the environment information. The rewards are re-assigned to the key-action and its adjacent actions, defined as adjacent-key-actions. Such re-assignment process enables increased success rate and convergence speed during training. For efficient re-assignment, two CA strategies are considered as part of proposed method. Proposed method is combined with hindsight experience replay (HER) for experiments in the OpenAI gym suite robotics environment. In the experiments, it is demonstrated that proposed method can detect key-actions and outperform the HER, increasing success rate and convergence speed, in the Fetch slide task, a type of task that is more exacting as compared to other tasks, but is addressed by few publications in the literature. From the experiments, a guideline for selecting CA strategy according to goal location is provided through goal distribution analysis with dot map.

Keywords:
Computer science Reinforcement learning Hindsight bias Key (lock) Artificial intelligence Task (project management) Machine learning Action (physics) Binary classification Artificial neural network Convergence (economics) Computer security Support vector machine Cognitive psychology

Metrics

48
Cited By
3.38
FWCI (Field Weighted Citation Impact)
48
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Neural and Behavioral Psychology Studies
Life Sciences →  Neuroscience →  Cognitive Neuroscience
Neural dynamics and brain function
Life Sciences →  Neuroscience →  Cognitive Neuroscience

Related Documents

JOURNAL ARTICLE

Intermittent Reinforcement Learning with Sparse Rewards

Prachi Pratyusha SahooKyriakos G. Vamvoudakis

Journal:   2022 American Control Conference (ACC) Year: 2022 Pages: 2709-2714
DISSERTATION

Reinforcement Learning with Sparse and Multiple Rewards

Simone Parisi

University:   TUbilio (Technical University of Darmstadt) Year: 2020
JOURNAL ARTICLE

Evolutionary reinforcement learning for sparse rewards

Shibei ZhuFrancesco BelardinelliBorja G. León

Journal:   Proceedings of the Genetic and Evolutionary Computation Conference Companion Year: 2021 Pages: 1508-1512
JOURNAL ARTICLE

Deep-Reinforcement-Learning-Based Autonomous UAV Navigation With Sparse Rewards

Chao WangJian WangJingjing WangXudong Zhang

Journal:   IEEE Internet of Things Journal Year: 2020 Vol: 7 (7)Pages: 6180-6190
© 2026 ScienceGate Book Chapters — All rights reserved.