Fully Cross-Attention Transformer for Guided Depth Super-Resolution

Ido Ariav; Israel Cohen

doi:10.3390/s23052723

ScienceGate Book Chapters

JOURNAL ARTICLE

Fully Cross-Attention Transformer for Guided Depth Super-Resolution

Ido Ariav Israel Cohen

Year: 2023 Journal: Sensors Vol: 23 (5)Pages: 2723-2723 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/s23052723

Get Full-Text PDF Get Analytical Report

Abstract

Modern depth sensors are often characterized by low spatial resolution, which hinders their use in real-world applications. However, the depth map in many scenarios is accompanied by a corresponding high-resolution color image. In light of this, learning-based methods have been extensively used for guided super-resolution of depth maps. A guided super-resolution scheme uses a corresponding high-resolution color image to infer high-resolution depth maps from low-resolution ones. Unfortunately, these methods still have texture copying problems due to improper guidance from color images. Specifically, in most existing methods, guidance from the color image is achieved by a naive concatenation of color and depth features. In this paper, we propose a fully transformer-based network for depth map super-resolution. A cascaded transformer module extracts deep features from a low-resolution depth. It incorporates a novel cross-attention mechanism to seamlessly and continuously guide the color image into the depth upsampling process. Using a window partitioning scheme, linear complexity in image resolution can be achieved, so it can be applied to high-resolution images. The proposed method of guided depth super-resolution outperforms other state-of-the-art methods through extensive experiments.

Keywords:

Upsampling Artificial intelligence Computer science Computer vision Image resolution Transformer Depth map Sub-pixel resolution Concatenation (mathematics) Color image Image processing Image (mathematics) Digital image processing Mathematics Voltage Engineering

Metrics

Cited By

2.00

FWCI (Field Weighted Citation Impact)

Refs

0.84

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image Processing Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Processing Techniques and Applications

Physical Sciences → Engineering → Media Technology

Fully Cross-Attention Transformer for Guided Depth Super-Resolution

Abstract

Metrics

Citation History

Topics

Related Documents

Fast Hierarchical Depth Super-Resolution via Guided Attention

Gradient Guided Depth Super-Resolution Using Attention-Based Cross Domain Interaction Module

Global Cross Attention Transformer for Image Super-Resolution

IGAF: Incremental Guided Attention Fusion for Depth Super-Resolution

Semantically Guided Efficient Attention Transformer for Face Super-Resolution Tasks