JOURNAL ARTICLE

High-Resolution Depth Maps Imaging via Attention-Based Hierarchical Multi-Modal Fusion

Zhiwei ZhongXianming LiuJunjun JiangDebin ZhaoZhiwen ChenXiangyang Ji

Year: 2021 Journal:   IEEE Transactions on Image Processing Vol: 31 Pages: 648-663   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Depth map records distance between the viewpoint and objects in the scene, which plays a critical role in many real-world applications. However, depth map captured by consumer-grade RGB-D cameras suffers from low spatial resolution. Guided depth map super-resolution (DSR) is a popular approach to address this problem, which attempts to restore a high-resolution (HR) depth map from the input low-resolution (LR) depth and its coupled HR RGB image that serves as the guidance. The most challenging issue for guided DSR is how to correctly select consistent structures and propagate them, and properly handle inconsistent ones. In this paper, we propose a novel attention-based hierarchical multi-modal fusion (AHMF) network for guided DSR. Specifically, to effectively extract and combine relevant information from LR depth and HR guidance, we propose a multi-modal attention based fusion (MMAF) strategy for hierarchical convolutional layers, including a feature enhancement block to select valuable features and a feature recalibration block to unify the similarity metrics of modalities with different appearance characteristics. Furthermore, we propose a bi-directional hierarchical feature collaboration (BHFC) module to fully leverage low-level spatial information and high-level structure information among multi-scale features. Experimental results show that our approach outperforms state-of-the-art methods in terms of reconstruction accuracy, running speed and memory efficiency.

Keywords:
Artificial intelligence Computer science RGB color model Feature (linguistics) Leverage (statistics) Computer vision Block (permutation group theory) Image resolution Modal Pattern recognition (psychology) Convolutional neural network Image fusion Depth map Image (mathematics) Mathematics

Metrics

52
Cited By
3.78
FWCI (Field Weighted Citation Impact)
79
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Latent Edge Guided Depth Super-Resolution Using Attention-Based Hierarchical Multi-Modal Fusion

Hui LanCheolkon Jung

Journal:   IEEE Access Year: 2024 Vol: 12 Pages: 114512-114526
JOURNAL ARTICLE

Multi-modal RGBD Attention Fusion for Dense Depth Estimation

Kirill ZhingalovAleksei KarpovIlya Makarov

Year: 2022 Vol: abs/1703.01398 Pages: 000109-000114
JOURNAL ARTICLE

Echo Depth Estimation via Attention-based Hierarchical Multi-scale Feature Fusion Network

Wenjie ZhangJun YinPeng YuYibo GuoXiaoheng JiangShaohui JinMingliang Xu

Journal:   ACM Transactions on Multimedia Computing Communications and Applications Year: 2025 Vol: 21 (8)Pages: 1-20
© 2026 ScienceGate Book Chapters — All rights reserved.