JOURNAL ARTICLE

Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

Abstract

The ability for policies to generalize to new environments is key to the broad application of RL agents. A promising approach to prevent an agent's policy from overfitting to a limited set of training environments is to apply regularization techniques originally developed for supervised learning. However, there are stark differences between supervised learning and RL. We discuss those differences and propose modifications to existing regularization techniques in order to better adapt them to RL. In particular, we focus on regularization techniques relying on the injection of noise into the learned function, a family that includes some of the most widely used approaches such as Dropout and Batch Normalization. To adapt them to RL, we propose Selective Noise Injection (SNI), which maintains the regularizing effect the injected noise has, while mitigating the adverse effects it has on the gradient quality. Furthermore, we demonstrate that the Information Bottleneck (IB) is a particularly well suited regularization technique for RL as it is effective in the low-data regime encountered early on in training RL agents. Combining the IB with SNI, we significantly outperform current state of the art results, including on the recently proposed generalization benchmark Coinrun.

Keywords:
Reinforcement learning Overfitting Computer science Artificial intelligence Bottleneck Regularization (linguistics) Machine learning Normalization (sociology) Artificial neural network

Metrics

56
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Adaptive Dynamic Programming Control
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Learning and generalization with the information bottleneck

Ohad ShamirSivan SabatoNaftali Tishby

Journal:   Theoretical Computer Science Year: 2010 Vol: 411 (29-30)Pages: 2696-2711
BOOK-CHAPTER

Learning and Generalization with the Information Bottleneck

Ohad ShamirSivan SabatoNaftali Tishby

Lecture notes in computer science Year: 2008 Pages: 92-107
JOURNAL ARTICLE

Federated learning via reweighting information bottleneck with domain generalization

Fangyu LiXuqiang ChenHan ZhuYongping DuHonggui Han

Journal:   Information Sciences Year: 2024 Vol: 677 Pages: 120825-120825
© 2026 ScienceGate Book Chapters — All rights reserved.