JOURNAL ARTICLE

Adaptive Context-Aware Multi-Modal Network for Depth Completion

Shanshan Zhao, Mingming Gong, Huan Fu, Dacheng Tao

Year: 2021 Journal: IEEE Transactions on Image Processing Vol: 30 Pages: 5264-5276 Publisher: Institute of Electrical and Electronics Engineers

Abstract

Depth completion aims to recover a dense depth map from sparse depth data and the corresponding single RGB image. The observed pixels provide significant guidance for recovering the depth of the unobserved pixels. However, due to the sparsity of the depth data, the standard convolution operation used by most existing methods cannot effectively model the observed contexts with depth values. To address this issue, we propose to adopt graph propagation to capture the observed spatial contexts. Specifically, we first construct multiple graphs at different scales from the observed pixels. Since the graph structure varies from sample to sample, we then apply an attention mechanism to the propagation, which encourages the network to model the contextual information adaptively. Furthermore, considering the multi-modality of the input data, we apply graph propagation to the two modalities separately to extract multi-modal representations. Finally, we introduce a symmetric gated fusion strategy to exploit the extracted multi-modal features effectively. The proposed strategy preserves the original information of one modality while absorbing complementary information from the other by learning adaptive gating weights. Our model, named Adaptive Context-Aware Multi-Modal Network (ACMNet), achieves state-of-the-art performance on two benchmarks, i.e., KITTI and NYU-v2, while having fewer parameters than the latest models. Our code is available at: https://github.com/sshan-zhao/ACMNet.
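The three ideas named in the abstract (a graph built from observed pixels, attention-weighted propagation over that graph, and symmetric gated fusion of the two modalities) can be illustrated with a small standard-library Python sketch. This is not the authors' implementation, which is available at the repository linked above. The function names, the k-nearest-neighbor graph construction, the dot-product attention score, the residual update, and the scalar gate weights are all assumptions chosen for illustration; the multi-scale graphs and learned parameters of the real model are omitted.

```python
import math


def knn_graph(pixels, k):
    """Link each observed pixel (row, col) to its k nearest observed pixels.

    Assumption: Euclidean distance in image coordinates defines the graph.
    """
    neighbors = []
    for i, (ri, ci) in enumerate(pixels):
        ranked = sorted(
            (math.hypot(ri - rj, ci - cj), j)
            for j, (rj, cj) in enumerate(pixels)
            if j != i
        )
        neighbors.append([j for _, j in ranked[:k]])
    return neighbors


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_propagate(features, neighbors):
    """One propagation step with per-node attention over graph neighbors.

    Assumption: attention scores are plain dot products and the aggregated
    message is added residually to the node's own feature.
    """
    updated = []
    for i, nbrs in enumerate(neighbors):
        scores = [sum(a * b for a, b in zip(features[i], features[j])) for j in nbrs]
        weights = softmax(scores)
        message = [
            sum(w * features[j][d] for w, j in zip(weights, nbrs))
            for d in range(len(features[i]))
        ]
        updated.append([own + msg for own, msg in zip(features[i], message)])
    return updated


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def symmetric_gated_fusion(depth_feat, rgb_feat,
                           w_depth=(0.0, 0.0, 0.0), w_rgb=(0.0, 0.0, 0.0)):
    """Fuse two feature vectors symmetrically.

    Each branch keeps its own feature and adds the other modality scaled by
    a gate. The gate weights (a, b, bias) are hypothetical stand-ins for
    parameters that would be learned.
    """
    fused_depth, fused_rgb = [], []
    for d, r in zip(depth_feat, rgb_feat):
        gate_d = sigmoid(w_depth[0] * d + w_depth[1] * r + w_depth[2])
        gate_r = sigmoid(w_rgb[0] * d + w_rgb[1] * r + w_rgb[2])
        fused_depth.append(d + gate_d * r)  # depth branch absorbs RGB cues
        fused_rgb.append(r + gate_r * d)    # RGB branch absorbs depth cues
    return fused_depth, fused_rgb


# Toy usage: four observed pixels forming two tight pairs.
pixels = [(0, 0), (0, 1), (5, 5), (5, 6)]
graph = knn_graph(pixels, k=1)
depth_feats = attention_propagate([[1.0], [2.0], [3.0], [4.0]], graph)
rgb_feats = attention_propagate([[0.5], [0.5], [1.5], [1.5]], graph)
fused = [symmetric_gated_fusion(d, r) for d, r in zip(depth_feats, rgb_feats)]
```

Because the graph is rebuilt from whichever pixels happen to be observed, its structure changes with every sample, which is why the sketch computes the attention weights from the features at run time instead of using fixed weights.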

Metrics

Cited By: 153
FWCI (Field Weighted Citation Impact): 12.68
Refs: 82
Citation Normalized Percentile: 0.99
Is in top 1%
Is in top 10%

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Multi-modal Characteristic Guided Depth Completion Network

Yong-Jin Lee, Seokjun Park, Beomgu Kang, HyunWook Park

Lecture notes in computer science Year: 2023 Pages: 593-607
JOURNAL ARTICLE

Structure-Aware Cross-Modal Transformer for Depth Completion

Linqing Zhao, Yi Wei, Jianqin Li, Jie Zhou, Jiwen Lu

Journal: IEEE Transactions on Image Processing Year: 2024 Vol: 33 Pages: 1016-1031
BOOK-CHAPTER

Multi-modal Context-Aware Network for Scene Graph Generation

Junjie Ye, Bing‐Kun Bao, Zhiyi Tan

Lecture notes in computer science Year: 2023 Pages: 335-347
JOURNAL ARTICLE

Context-Aware Multi-Modal Graph Attention Fusion Network for Adaptive Resource Allocation in Wireless Networks

Anoop Mohanakumar, Uma Srinivasan, Judy Simon, Nellore Kapileswar, K. Sutha

Journal: Journal of Trends in Computer Science and Smart Technology Year: 2025 Vol: 7 (2) Pages: 266-294