Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck
Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin, Katja Hofmann
The ability of policies to generalize to new environments is key to the broad application of RL agents. A promising approach to preventing an agent's policy from overfitting to a limited set of training environments is to apply regularization techniques originally developed for supervised learning. However, there are stark differences between supervised learning and RL. We discuss these differences and propose modifications to existing regularization techniques to better adapt them to RL. In particular, we focus on regularization techniques that rely on injecting noise into the learned function, a family that includes some of the most widely used approaches, such as Dropout and Batch Normalization. To adapt these techniques to RL, we propose Selective Noise Injection (SNI), which maintains the regularizing effect of the injected noise while mitigating its adverse effect on gradient quality. Furthermore, we demonstrate that the Information Bottleneck (IB) is a particularly well-suited regularization technique for RL, as it is effective in the low-data regime encountered early in the training of RL agents. Combining the IB with SNI, we significantly outperform current state-of-the-art results, including on the recently proposed generalization benchmark CoinRun.
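To make the SNI idea concrete, below is a minimal PyTorch sketch, not the authors' released implementation: Dropout serves as the injected noise, and the policy-gradient loss is a convex combination of a noise-free pass (low-variance gradient) and a noisy pass (regularization). The names `PolicyNet`, `sni_policy_loss`, and the mixing weight `lam` are illustrative assumptions; the importance-sampling correction the paper applies to the noisy component is omitted for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PolicyNet(nn.Module):
    """Toy policy network with Dropout as the noise source."""
    def __init__(self, obs_dim, n_actions, p_drop=0.2):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU())
        self.drop = nn.Dropout(p_drop)   # the injected noise
        self.head = nn.Linear(64, n_actions)

    def forward(self, obs, noisy=True):
        h = self.body(obs)
        if noisy:                        # regularized (noisy) pass
            h = self.drop(h)
        return F.log_softmax(self.head(h), dim=-1)

def sni_policy_loss(model, obs, actions, advantages, lam=0.5):
    # SNI-style gradient mixture: lam * grad(noise-free) + (1 - lam) * grad(noisy).
    # The noise-free term keeps the gradient estimate low-variance; the noisy
    # term preserves the regularizing effect of the injected noise.
    logp_det = model(obs, noisy=False).gather(1, actions).squeeze(1)
    logp_noisy = model(obs, noisy=True).gather(1, actions).squeeze(1)
    loss_det = -(advantages * logp_det).mean()
    loss_noisy = -(advantages * logp_noisy).mean()
    return lam * loss_det + (1.0 - lam) * loss_noisy

# Toy usage: a random batch standing in for rollout data collected with the
# noise-free policy, as SNI prescribes for action selection.
model = PolicyNet(obs_dim=8, n_actions=4)
obs = torch.randn(32, 8)
actions = torch.randint(0, 4, (32, 1))
advantages = torch.randn(32)
loss = sni_policy_loss(model, obs, actions, advantages)
loss.backward()
```

The same template applies to the IB variant: there, the "noise" is the sampling of a stochastic latent code, and the noise-free pass would use the mean of the latent distribution instead of disabling Dropout.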