JOURNAL ARTICLE

Reversible Recurrent Neural Networks

Matthew MackayPaul VicolJimmy BaRoger Grosse

Year: 2018 Journal:   arXiv (Cornell University) Vol: 31 Pages: 9029-9040   Publisher: Cornell University

Abstract

Recurrent neural networks (RNNs) provide state-of-the-art performance in processing sequential data but are memory intensive to train, limiting the flexibility of RNN models which can be trained. Reversible RNNs---RNNs for which the hidden-to-hidden transition can be reversed---offer a path to reduce the memory requirements of training, as hidden states need not be stored and instead can be recomputed during backpropagation. We first show that perfectly reversible RNNs, which require no storage of the hidden activations, are fundamentally limited because they cannot forget information from their hidden state. We then provide a scheme for storing a small number of bits in order to allow perfect reversal with forgetting. Our method achieves comparable performance to traditional models while reducing the activation memory cost by a factor of 10--15. We extend our technique to attention-based sequence-to-sequence models, where it maintains performance while reducing activation memory cost by a factor of 5--10 in the encoder, and a factor of 10--15 in the decoder.

Keywords:
Recurrent neural network Computer science Sequence (biology) Forgetting Encoder Path (computing) Decoding methods Artificial intelligence Artificial neural network Backpropagation Flexibility (engineering) Algorithm

Metrics

21
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Time Series Analysis and Forecasting
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Recurrent neural networks

Stephen Grossberg

Journal:   Scholarpedia Year: 2013 Vol: 8 (2)Pages: 1888-1888
BOOK-CHAPTER

Recurrent Neural Networks

Bert Huang

Year: 2021
BOOK-CHAPTER

Recurrent Neural Networks

Mario V. WüthrichMichael Merz

Springer Actuarial Year: 2022 Pages: 381-406
BOOK-CHAPTER

Recurrent Neural Networks

Charų C. Aggarwal

Year: 2023 Pages: 265-304
BOOK-CHAPTER

Recurrent Neural Networks

Ke-Lin DuM. N. S. Swamy

Year: 2019 Pages: 351-371
© 2026 ScienceGate Book Chapters — All rights reserved.