JOURNAL ARTICLE

UTLNet: Uncertainty-Aware Transformer Localization Network for RGB-Depth Mirror Segmentation

Wujie Zhou, Yuqi Cai, Liting Zhang, Weiqing Yan, Lu Yu

Year: 2023 Journal: IEEE Transactions on Multimedia Vol: 26 Pages: 4564-4574 Publisher: Institute of Electrical and Electronics Engineers

Abstract

Mirror segmentation, an emerging discipline in the field of computer vision, involves the identification and marking of mirrors in an image. Current mirror segmentation methods rely on fixed mirror elements as features for object segmentation. However, these methods do not account for the varied quality of feature images obtained under complex real-world conditions, leading to inaccurate segmentation results. To address these limitations, we propose a novel uncertainty-aware transformer localization network (UTLNet) for RGB-D mirror segmentation. Our approach draws inspiration from biomimicry, specifically the behavior pattern of human observation. We aim to explore features from different angles and focus on complex features that are challenging to determine during the coding stage. Additionally, we employ graph convolution to construct complementary dual-modal fusion features. Furthermore, we design a multiscale interaction transformer module using the shifted-window self-attention mechanism to acquire precise position information. In our experiments, the proposed UTLNet surpasses the current state-of-the-art mirror segmentation method as well as alternative task-specific methods. It achieves superior performance across various evaluation scenarios.

Keywords:
Computer science; Segmentation; Artificial intelligence; Computer vision; Image segmentation; Transformer; RGB color model; Segmentation-based object categorization; Scale-space segmentation; Fusion mechanism; Pattern recognition (psychology); Fusion; Voltage

Metrics

Cited By: 24
FWCI (Field Weighted Citation Impact): 4.37
Refs: 81
Citation Normalized Percentile: 0.94

Topics

Visual Attention and Saliency Detection
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Retinal Imaging and Analysis
Health Sciences → Medicine → Radiology, Nuclear Medicine and Imaging

Related Documents

BOOK-CHAPTER

Depth-Aware CNN for RGB-D Segmentation

Weiyue Wang, Ulrich Neumann

Lecture notes in computer science Year: 2018 Pages: 144-161

JOURNAL ARTICLE

Depth-Aware Transformer for Aerial Localization

Jianjun Lei, Demin Tu, Bo Peng, Jie Zhu, Zhe Zhang, Chong Wu, Qingming Huang

Journal: ACM Transactions on Multimedia Computing Communications and Applications Year: 2025 Vol: 22 (1) Pages: 1-16