JOURNAL ARTICLE

An End-to-End Learning Framework for Video Compression

Guo LuXiaoyun ZhangWanli OuyangLi ChenZhiyong GaoDong Xu

Year: 2020 Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Vol: 43 (10)Pages: 3292-3308   Publisher: IEEE Computer Society

Abstract

Traditional video compression approaches build upon the hybrid coding framework with motion-compensated prediction and residual transform coding. In this paper, we propose the first end-to-end deep video compression framework to take advantage of both the classical compression architecture and the powerful non-linear representation ability of neural networks. Our framework employs pixel-wise motion information, which is learned from an optical flow network and further compressed by an auto-encoder network to save bits. The other compression components are also implemented by the well-designed networks for high efficiency. All the modules are jointly optimized by using the rate-distortion trade-off and can collaborate with each other. More importantly, the proposed deep video compression framework is very flexible and can be easily extended by using lightweight or advanced networks for higher speed or better efficiency. We also propose to introduce the adaptive quantization layer to reduce the number of parameters for variable bitrate coding. Comprehensive experimental results demonstrate the effectiveness of the proposed framework on the benchmark datasets.

Keywords:
Computer science Data compression Artificial intelligence End-to-end principle Encoder Data compression ratio Quantization (signal processing) Residual Image compression Computer vision Computer engineering Algorithm Image processing Image (mathematics)

Metrics

200
Cited By
12.39
FWCI (Field Weighted Citation Impact)
75
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Coding and Compression Technologies
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.