JOURNAL ARTICLE

CalibNet: Dual-Branch Cross-Modal Calibration for RGB-D Salient Instance Segmentation

Jialun PeiTao JiangHe TangNian LiuYueming JinDeng-Ping FanPheng‐Ann Heng

Year: 2024 Journal:   IEEE Transactions on Image Processing Vol: 33 Pages: 4348-4362   Publisher: Institute of Electrical and Electronics Engineers

Abstract

In this study, we propose a novel approach for RGB-D salient instance segmentation using a dual-branch cross-modal feature calibration architecture called CalibNet. Our method simultaneously calibrates depth and RGB features in the kernel and mask branches to generate instance-aware kernels and mask features. CalibNet consists of three simple modules, a dynamic interactive kernel (DIK) and a weight-sharing fusion (WSF), which work together to generate effective instance-aware kernels and integrate cross-modal features. To improve the quality of depth features, we incorporate a depth similarity assessment (DSA) module prior to DIK and WSF. In addition, we further contribute a new DSIS dataset, which contains 1,940 images with elaborate instance-level annotations. Extensive experiments on three challenging benchmarks show that CalibNet yields a promising result, i.e., 58.0% AP with 320×480 input size on the COME15K-E test set, which significantly surpasses the alternative frameworks. Our code and dataset will be publicly available at: https://github.com/PJLallen/CalibNet.

Keywords:
Computer science Kernel (algebra) RGB color model Artificial intelligence Segmentation Salient Pattern recognition (psychology) Modal Set (abstract data type) Calibration Computer vision Mathematics

Metrics

17
Cited By
8.48
FWCI (Field Weighted Citation Impact)
85
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Video Quality Assessment
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

DcieNet: dual-branch cross-modal interaction enhanced RGB-D instance segmentation

Zhi LinMingen ZhongBingan YuanJiawei TanKang FanPengzhi Lin

Journal:   Engineering Research Express Year: 2025 Vol: 7 (4)Pages: 0452f4-0452f4
JOURNAL ARTICLE

Cross Modal Calibration for RGB-D Instance Segmentation

Aecheon JungSungeun Hong

Journal:   The Journal of Korean Institute of Information Technology Year: 2024 Vol: 22 (5)Pages: 13-22
JOURNAL ARTICLE

Dual-branch frequency-domain fusion for RGB-D tree trunks instance segmentation

Chunjiang YuYuanhang LiuJiangming KanYunhe ZhouRuifang DongXixuan Zhao

Journal:   Computers and Electronics in Agriculture Year: 2025 Vol: 240 Pages: 111159-111159
JOURNAL ARTICLE

A Three-Branch Cross-Modal Interactive Network for RGB-D Salient Defect Detection

Lisha CuiMing MaChaochao LiXiaoheng JiangZhiwen SongLiu-Yin FanMingliang Xu

Journal:   IEEE Transactions on Instrumentation and Measurement Year: 2025 Vol: 74 Pages: 1-11
© 2026 ScienceGate Book Chapters — All rights reserved.