Exploration With Task Information for Meta Reinforcement Learning

Peng Jiang; Shiji Song; Gao Huang

doi:10.1109/tnnls.2021.3121432

ScienceGate Book Chapters

JOURNAL ARTICLE

Exploration With Task Information for Meta Reinforcement Learning

Peng Jiang Shiji Song Gao Huang

Year: 2021 Journal: IEEE Transactions on Neural Networks and Learning Systems Vol: 34 (8)Pages: 4033-4046 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tnnls.2021.3121432

Get Full-Text PDF Get Analytical Report

Abstract

Meta reinforcement learning (meta-RL) is a promising technique for fast task adaptation by leveraging prior knowledge from previous tasks. Recently, context-based meta-RL has been proposed to improve data efficiency by applying a principled framework, dividing the learning procedure into task inference and task execution. However, the task information is not adequately leveraged in this approach, thus leading to inefficient exploration. To address this problem, we propose a novel context-based meta-RL framework with an improved exploration mechanism. For the existing exploration and execution problem in context-based meta-RL, we propose a novel objective that employs two exploration terms to encourage better exploration in action and task embedding space, respectively. The first term pushes for improving the diversity of task inference, while the second term, named action information, works as sharing or hiding task information in different exploration stages. We divide the meta-training procedure into task-independent exploration and task-relevant exploration stages according to the utilization of action information. By decoupling task inference and task execution and proposing the respective optimization objectives in the two exploration stages, we can efficiently learn policy and task inference networks. We compare our algorithm with several popular meta-RL methods on MuJoco benchmarks with both dense and sparse reward settings. The empirical results show that our method significantly outperforms baselines on the benchmarks in terms of sample efficiency and task performance.

Keywords:

Computer science Reinforcement learning Inference Task (project management) Artificial intelligence Meta learning (computer science) Context (archaeology) Machine learning

Metrics

Cited By

0.71

FWCI (Field Weighted Citation Impact)

Refs

0.76

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

EEG and Brain-Computer Interfaces

Life Sciences → Neuroscience → Cognitive Neuroscience

Advanced Memory and Neural Computing

Physical Sciences → Engineering → Electrical and Electronic Engineering

Exploration With Task Information for Meta Reinforcement Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Information-theoretic Task Selection for Meta-Reinforcement Learning

MAML2: meta reinforcement learning via meta-learning for task categories

Efficient Design Space Exploration with Multi-Task Reinforcement Learning

Meta Reinforcement Learning with Task Embedding and Shared Policy

Conservative Offline Meta-Reinforcement Learning with Task Similarity Measurement