Abstract

While there are many deep learning based approaches for single image compression, the field of end-to-end learned video coding has remained much less explored. Therefore, in this work we present an inter-frame compression approach for neural video coding that can seamlessly build up on different existing neural image codecs. Our end-to-end solution performs temporal prediction by optical flow based motion compensation in pixel space. The key insight is that we can increase both decoding efficiency and reconstruction quality by encoding the required information into a latent representation that directly decodes into motion and blending coefficients. In order to account for remaining prediction errors, residual information between the original image and the interpolated frame is needed. We propose to compute residuals directly in latent space instead of in pixel space as this allows to reuse the same image compression network for both key frames and intermediate frames. Our extended evaluation on different datasets and resolutions shows that the rate-distortion performance of our approach is competitive with existing state-of-the-art codecs.

Keywords:
Computer science Motion compensation Codec Artificial intelligence Residual frame Computer vision Key frame Data compression Intra-frame Motion estimation Reference frame Pixel Frame (networking)

Metrics

180
Cited By
11.44
FWCI (Field Weighted Citation Impact)
37
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multiscale (inter/intra-frame) fractal video coding

A. Bogdan

Year: 2002 Vol: 1 Pages: 760-764
JOURNAL ARTICLE

OPTIMAL INTER-FRAME ALIGNMENT FOR VIDEO COMPRESSION

Bruno CarpentieriJames A. Storer

Journal:   International Journal of Foundations of Computer Science Year: 1994 Vol: 05 (02)Pages: 165-177
JOURNAL ARTICLE

Neural Reference Synthesis for Inter Frame Coding

Dandan DingXiang GaoChenran TangZhan Ma

Journal:   IEEE Transactions on Image Processing Year: 2021 Vol: 31 Pages: 773-787
© 2026 ScienceGate Book Chapters — All rights reserved.