Multi-Swin Transformer Based Spatio-Temporal Information Exploration for Compressed Video Quality Enhancement

Li Yu; Shiyu Wu; Moncef Gabbouj

doi:10.1109/lsp.2024.3429008

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-Swin Transformer Based Spatio-Temporal Information Exploration for Compressed Video Quality Enhancement

Li Yu Shiyu Wu Moncef Gabbouj

Year: 2024 Journal: IEEE Signal Processing Letters Vol: 31 Pages: 1880-1884 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/lsp.2024.3429008

Get Full-Text PDF Get Analytical Report

Abstract

Spatio-temporal information plays an important role in compressed video quality enhancement. Most advanced studies use deformable convolution or Swin transformer to explore spatio-temporal information. However, deformable convolution based methods may incur inaccurate motion compensation due to the compression artifacts and limited receptive fields. The Swin transformer based approaches are unable to fully explore the spatio-temporal information, limited by its rigid window-based mechanism. To solve the above problems, we propose a novel multi-Swin transformer-based network for compressed video quality enhancement to better explore spatio-temporal information. The whole workflow consists of the Local Alignment (LA) Module, the Global Refinement Fusion (GRF) Module, and the Quality Enhancement (QE) Module. The LA module roughly perceives the local motion through the deformable fusion. Subsequently, the GRF module employs the proposed multi-Swin transformer to enhance the spatio-temporal perception. Finally, the QE module effectively restores the texture details across various scales. Extensive experimental results prove the effectiveness of the proposed method.

Keywords:

Computer science Artificial intelligence Computer vision Transformer Remote sensing Geology Engineering

Metrics

Cited By

2.12

FWCI (Field Weighted Citation Impact)

Refs

0.80

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Image and Video Quality Assessment

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image and Signal Denoising Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image Fusion Techniques

Physical Sciences → Engineering → Media Technology

Multi-Swin Transformer Based Spatio-Temporal Information Exploration for Compressed Video Quality Enhancement

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-Frame Compressed Video Quality Enhancement by Spatio-Temporal Information Balance

Spatio-Temporal Information Fusion Network for Compressed Video Quality Enhancement

Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement

Coarse-to-Fine Spatio-Temporal Information Fusion for Compressed Video Quality Enhancement

Compressed Video Enhancement with Spatio-Temporal Multi-Feature Extraction