JOURNAL ARTICLE

Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation

Mengshun HuKui JiangLiang LiaoZhixiang NieJing XiaoZheng Wang

Year: 2022 Journal:   Proceedings of the 30th ACM International Conference on Multimedia Pages: 2145-2153

Abstract

Most video frame interpolation (VFI) algorithms infer the intermediate frame with the help of adjacent frames through the cascaded motion estimation and content refinement.However, the intrinsic correlations between motion and content are barely investigated, commonly producing interpolated results with inconsistency and blurry contents.Specifically, we first discover a simple yet essential domain knowledge that contents and motions characteristics should be homogeneous to a certain degree from the same objects, and formulate the consistency into the loss function for model optimization. Based on this, we propose to learn the collaborative representation between motions and contents, and construct a novel progressive spatial-temporal Collaborative network (Prost-Net) for video frame interpolation.Specifically, we develop a content-guided motion module (CGMM) and a motion-guided content module (MGCM) for individual content and motion representation. In particular, the predicted motion in CGMM is used to guide the fusion and distillation of contents for intermediate frame interpolation, and vice versa. Furthermore, by considering collaborative strategy in a multi-scale framework, our Prost-Net progressively optimizes motions and contents in a coarse-to-fine manner, making it robust to various challenging scenarios (occlusion and large motions) in VFI. Extensive experiments on the benchmark datasets demonstrate that our method significantly outperforms state-of-the-art methods.

Keywords:
Computer science Motion interpolation Interpolation (computer graphics) Artificial intelligence Frame (networking) Benchmark (surveying) Motion (physics) Computer vision Representation (politics) Motion estimation Block-matching algorithm Object (grammar) Video tracking

Metrics

17
Cited By
1.17
FWCI (Field Weighted Citation Impact)
39
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Enhanced spatial-temporal freedom for video frame interpolation

Haodong LiHui YinZhihao LiuHua Huang

Journal:   Applied Intelligence Year: 2022 Vol: 53 (9)Pages: 10535-10547
JOURNAL ARTICLE

STDC-Net: A spatial-temporal deformable convolution network for conference video frame interpolation

Jinhui HuQianrui WangDengshi LiYu Gao

Journal:   Multimedia Tools and Applications Year: 2023 Vol: 83 (40)Pages: 88283-88302
JOURNAL ARTICLE

Progressive Motion Context Refine Network for Efficient Video Frame Interpolation

Lingtong KongJinfeng LiuJie Yang

Journal:   IEEE Signal Processing Letters Year: 2022 Vol: 29 Pages: 2338-2342
JOURNAL ARTICLE

Dual-Guided Video Frame Interpolation With Spatial-Temporal Global Attention

Baojun ZhouXinpeng HuangGongyang LiChao YangLiquan ShenPing An

Journal:   IEEE Transactions on Multimedia Year: 2025 Vol: 27 Pages: 7783-7795
© 2026 ScienceGate Book Chapters — All rights reserved.