FVC: An End-to-End Framework towards Deep Video Compression in Feature Space

Zhihao Hu; Dong Xu; Guo Lu; Wei Jiang; Wei Wang; Shan Liu

doi:10.1109/tpami.2022.3210652

ScienceGate Book Chapters

JOURNAL ARTICLE

FVC: An End-to-End Framework towards Deep Video Compression in Feature Space

Zhihao Hu Dong Xu Guo Lu Wei Jiang Wei Wang Shan Liu

Year: 2022 Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence Vol: 45 (4)Pages: 1-17 Publisher: IEEE Computer Society

DOI: 10.1109/tpami.2022.3210652

Get Full-Text PDF Get Analytical Report

Abstract

Deep video compression is attracting increasing attention from both deep learning and video processing community. Recent learning-based approaches follow the hybrid coding paradigm to perform pixel space operations for reducing redundancy along both spatial and temporal dimentions, which leads to inaccurate motion estimation or less effective motion compensation. In this work, we propose a feature-space video coding framework (FVC), which performs all major operations (i.e., motion estimation, motion compression, motion compensation and residual compression) in the feature space. Specifically, a new deformable compensation module, which consists of motion estimation, motion compression and motion compensation, is proposed for more effective motion compensation. In our deformable compensation module, we first perform motion estimation in the feature space to produce the motion information (i.e., the offset maps). Then the motion information is compressed by using the auto-encoder style network. After that, we use the deformable convolution operation to generate the predicted feature for motion compensation. Finally, the residual information between the feature from the current frame and the predicted feature from the deformable compensation module is also compressed in the feature space. Motivated by the conventional codecs, in which the blocks with different sizes are used for motion estimation, we additionally propose two new modules called resolution-adaptive motion coding (RaMC) and resolution-adaptive residual coding (RaRC) to automatically cope with different types of motion and residual patterns at different spatial locations. Comprehensive experimental results demonstrate that our proposed framework achieves the state-of-the-art performance on three benchmark datasets including HEVC, UVG and MCL-JCV.

Keywords:

Motion compensation Artificial intelligence Quarter-pixel motion Computer vision Computer science Motion estimation Block-matching algorithm Data compression Feature (linguistics) Motion field Pattern recognition (psychology) Video processing Video tracking

Metrics

Cited By

2.85

FWCI (Field Weighted Citation Impact)

Refs

0.90

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Coding and Compression Technologies

Physical Sciences → Computer Science → Signal Processing

FVC: An End-to-End Framework towards Deep Video Compression in Feature Space

Abstract

Metrics

Citation History

Topics

Related Documents

FVC: A New Framework towards Deep Video Compression in Feature Space

DVC: An End-To-End Deep Video Compression Framework

An End-to-End Learning Framework for Video Compression

Scale-Space Flow for End-to-End Optimized Video Compression

DRFC: An End-to-End Deep Dynamic RF Signal Compression Framework