Wujie Zhou, Yuqi Cai, Liting Zhang, Weiqing Yan, Lu Yu
Mirror segmentation, an emerging task in computer vision, involves identifying and marking mirror regions in an image. Current mirror segmentation methods rely on fixed mirror elements as features for object segmentation. However, these methods do not account for the varied quality of feature images obtained under complex real-world conditions, leading to inaccurate segmentation results. To address these limitations, we propose a novel uncertainty-aware transformer localization network (UTLNet) for RGB-D mirror segmentation. Our approach draws inspiration from biomimicry, specifically the behavior pattern of human observation: we explore features from different viewpoints and focus on ambiguous features that are challenging to resolve during the encoding stage. Additionally, we employ graph convolution to construct complementary dual-modal fusion features. Furthermore, we design a multiscale interaction transformer module based on the shifted-window self-attention mechanism to acquire precise position information. In our experiments, the proposed UTLNet surpasses current state-of-the-art mirror segmentation methods as well as related task-specific methods, achieving superior performance across various evaluation scenarios.
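To illustrate the shifted-window self-attention idea mentioned above, here is a minimal sketch in NumPy: attention is computed only among tokens inside each non-overlapping window, and a cyclic shift before partitioning lets information cross window boundaries. The function name, identity Q/K/V projections, and parameters are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def window_self_attention(x, window=4, shift=0):
    """Self-attention restricted to non-overlapping windows of a feature map.

    x: (H, W, C) feature map; H and W must be divisible by `window`.
    shift: cyclic shift applied before partitioning (the "shifted window"
    variant). Identity Q/K/V projections keep the sketch minimal; a real
    module would use learned projections and multiple heads.
    """
    H, W, C = x.shape
    if shift:
        # Cyclically shift so that window boundaries move between layers.
        x = np.roll(x, (-shift, -shift), axis=(0, 1))
    out = np.empty_like(x)
    for i in range(0, H, window):
        for j in range(0, W, window):
            # Flatten one window into a sequence of tokens.
            win = x[i:i + window, j:j + window].reshape(-1, C)
            # Scaled dot-product attention within the window only.
            attn = softmax(win @ win.T / np.sqrt(C))
            out[i:i + window, j:j + window] = (attn @ win).reshape(window, window, C)
    if shift:
        # Undo the shift to restore the original spatial layout.
        out = np.roll(out, (shift, shift), axis=(0, 1))
    return out
```

Alternating `shift=0` and `shift=window // 2` across successive layers is the usual way such windowed attention propagates information globally while keeping the per-window cost constant.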