Learning-based End-to-End Video Compression Using Predictive Coding

Matheus Oliveira; Luiz Gustavo R. Martins; Henrique Costa Jung; Nilson Donizete Guerin; Renam Castro da Silva; Eduardo Peixoto; Bruno Macchiavello; Edson M. Hung; Vanessa Testoni; Pedro Garcia Freitas

doi:10.1109/sibgrapi54419.2021.00030

JOURNAL ARTICLE

Year: 2021 Pages: 160-167

Abstract

Driven by the growing demand for video applications, deep learning techniques have become an alternative for implementing end-to-end encoders that achieve practical compression rates. Conventional video codecs exploit both spatial and temporal correlation. However, due to restrictions such as computational complexity, they are commonly limited to linear transformations and translational motion estimation. Autoencoder models open the way to predictive end-to-end video codecs without such limitations. This paper presents a fully learning-based video codec that exploits spatial and temporal correlations. The codec extends the P-frame prediction approach presented in our previous work. I-frames are coded with a variational autoencoder with non-parametric entropy modeling. The inter-frame encoder combines an entropy model parameterized by a hyperprior with two other independent networks, responsible for motion estimation and residue prediction. Experimental results indicate that further improvements must be incorporated into our codec before it can outperform the all-intra coding setup of the traditional codecs High Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC).
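
The predictive (P-frame) coding loop that the abstract refers to can be illustrated with a minimal sketch. This is not the paper's architecture: as a labeled assumption, it replaces the learned motion-estimation and residue networks with the conventional tools the abstract contrasts them with (a single global integer translation found by exhaustive search, and a uniform quantizer for the residual), and it omits entropy coding entirely. All function names and parameters (`motion_compensate`, `encode_p_frame`, `step`, `search`) are hypothetical.

```python
def clamp(v, lo, hi):
    return max(lo, min(hi, v))

def motion_compensate(ref, dy, dx):
    """Predict a frame by shifting `ref` by (dy, dx), repeating edge pixels."""
    h, w = len(ref), len(ref[0])
    return [[ref[clamp(y - dy, 0, h - 1)][clamp(x - dx, 0, w - 1)]
             for x in range(w)] for y in range(h)]

def sad(a, b):
    """Sum of absolute differences between two frames of equal size."""
    return sum(abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))

def estimate_motion(ref, cur, search=2):
    """Exhaustive search for the global translation that best predicts `cur`."""
    candidates = [(dy, dx)
                  for dy in range(-search, search + 1)
                  for dx in range(-search, search + 1)]
    return min(candidates, key=lambda mv: sad(motion_compensate(ref, *mv), cur))

def encode_p_frame(ref, cur, step=4, search=2):
    """Return the motion vector and the uniformly quantized prediction residual."""
    mv = estimate_motion(ref, cur, search)
    pred = motion_compensate(ref, *mv)
    q = [[round((c - p) / step) for c, p in zip(rc, rp)]
         for rc, rp in zip(cur, pred)]
    return mv, q

def decode_p_frame(ref, mv, q, step=4):
    """Rebuild the frame from the reference, motion vector, and residual."""
    pred = motion_compensate(ref, *mv)
    return [[p + qi * step for p, qi in zip(rp, rq)]
            for rp, rq in zip(pred, q)]

# Toy 8x8 "I-frame" with distinct pixel values, and a second frame that is
# the first one moved down by 1 pixel and right by 2 pixels.
ref = [[y * 8 + x for x in range(8)] for y in range(8)]
cur = motion_compensate(ref, 1, 2)

mv, q = encode_p_frame(ref, cur)
recon = decode_p_frame(ref, mv, q)
```

In this toy case the motion model explains the second frame completely, so the residual is zero and only the motion vector would need to be transmitted. When the prediction is imperfect, the reconstruction error per pixel is bounded by half the quantizer step. A practical codec would also predict from the decoded reference rather than the original, so that encoder and decoder stay in sync.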

Keywords:

Codec; Computer science; Autoencoder; Encoder; Artificial intelligence; Data compression; Intra-frame; Motion estimation; Multiview Video Coding; Motion compensation; Computer vision; Inter frame; Reference frame; Deep learning; Decoding methods; Algorithm; Video tracking; Video processing; Frame (networking); Computer hardware

Metrics

FWCI (Field Weighted Citation Impact): 0.29

Citation Normalized Percentile: 0.57

Topics

Video Coding and Compression Technologies

Physical Sciences → Computer Science → Signal Processing

Advanced Image Processing Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

End-to-End Learning of Video Compression using Spatio-Temporal Autoencoders

Learning-Based End-to-End Video Compression with Spatial-Temporal Adaptation

An End-to-End Learning Framework for Video Compression

End-to-End Deep Video Compression Based on Hierarchical Temporal Context Learning

End-to-end Distributed Video Coding