JOURNAL ARTICLE

Learning Spatiotemporal Interactions for User-Generated Video Quality Assessment

Hanwei ZhuBaoliang ChenLingyu ZhuShiqi Wang

Year: 2022 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 33 (3)Pages: 1031-1042   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Distortions from spatial and temporal domains have been identified as the dominant factors that govern the visual quality. Though both have been studied independently in deep learning-based user-generated content (UGC) video quality assessment (VQA) by frame-wise distortion estimation and temporal quality aggregation, much less work has been dedicated to the integration of them with deep representations. In this paper, we propose a SpatioTemporal Interactive VQA (STI-VQA) model based upon the philosophy that video distortion can be inferred from the integration of both spatial characteristics and temporal motion, along with the flow of time. In particular, for each timestamp, both the spatial distortion explored by the feature statistics and local motion captured by feature difference are extracted and fed to a transformer network for the motion aware interaction learning. Meanwhile, the information flow of spatial distortion from the shallow layer to the deep layer is constructed adaptively during the temporal aggregation. The transformer network enjoys an advanced advantage for long-range dependencies modeling, leading to superior performance on UGC videos. Experimental results on five UGC video benchmarks demonstrate the effectiveness and efficiency of our STI-VQA model, and the source code will be available online at https://github.com/h4nwei/STI-VQA .

Keywords:
Timestamp Computer science Artificial intelligence Distortion (music) Optical flow Deep learning Video quality Geotagging Feature (linguistics) Data mining Information retrieval Real-time computing Image (mathematics) Computer network

Metrics

25
Cited By
3.09
FWCI (Field Weighted Citation Impact)
74
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image and Video Quality Assessment
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.