Semantic segmentation of images promises numerous benefits for augmented reality applications. However, the scenes typical of such applications are challenging for current segmentation algorithms due to the high variability in object appearance and distribution. We propose a new cascaded loss fusion strategy that improves the training schedule of state-of-the-art real-time RGB-D semantic segmentation architectures. Specifically, we employ methods developed in the context of multi-task learning to address the multi-class and multi-loss learning problems in semantic segmentation. Through our quantitative evaluation on the NYUv2 [3] and SUNRGB-D [4] benchmark datasets, we show improvements over state-of-the-art approaches. Furthermore, our approach also improves results qualitatively, both on the benchmark datasets and on our own recordings of scenarios typical for head-mounted cameras.
Zongwei Wu, Zhuyun Zhou, Guillaume Allibert, Christophe Stolz, Cédric Demonceaux, Chao Ma