DMFNet: Deep Multi-Modal Fusion Network for RGB-D Indoor Scene Segmentation

Jianzhong Yuan; Wujie Zhou; Ting Luo

doi:10.1109/access.2019.2955101

JOURNAL ARTICLE

Year: 2019 Journal: IEEE Access Vol: 7 Pages: 169350-169358 Publisher: Institute of Electrical and Electronics Engineers

Abstract

Indoor scene segmentation is a difficult task in computer vision. We propose an indoor scene segmentation framework, called DMFNet, that combines RGB data with complementary depth information. We use a squeeze-and-excitation residual network as the encoder to extract features from the RGB and depth data simultaneously, and fuse them in the decoder. Multiple average pooling layers and transposed convolution layers process the encoded outputs, and their results are fused over several decoder layers. To optimize the network parameters, we use a pyramid supervision training scheme, which applies supervised learning at different decoder layers to prevent vanishing gradients. We evaluated the proposed DMFNet on the NYU Depth V2 dataset, which consists of 1449 cluttered indoor scenes, and achieved results competitive with state-of-the-art methods.
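The pyramid supervision idea in the abstract, supervising several decoder layers rather than only the final output, can be illustrated with a minimal sketch. The abstract does not give the loss function, the number of supervised layers, or how the ground truth is resized, so everything below is an assumption made for illustration: per-pixel cross-entropy, equal weighting of the scales, nearest-neighbour downsampling of the label map, and the hypothetical names `downsample_labels`, `cross_entropy`, and `pyramid_supervision_loss`. It is not the authors' implementation.

```python
import math


def downsample_labels(labels, factor):
    """Nearest-neighbour downsampling of a 2-D label map by an integer factor.

    Assumption: the abstract does not say how labels are matched to the
    lower-resolution decoder outputs.
    """
    return [row[::factor] for row in labels[::factor]]


def cross_entropy(logits, labels):
    """Mean per-pixel cross-entropy (assumed loss, not stated in the abstract).

    logits: H x W x C nested lists of raw class scores.
    labels: H x W nested lists of integer class indices.
    """
    total, count = 0.0, 0
    for logit_row, label_row in zip(logits, labels):
        for scores, target in zip(logit_row, label_row):
            peak = max(scores)  # subtract the max for numerical stability
            log_sum = peak + math.log(sum(math.exp(s - peak) for s in scores))
            total += log_sum - scores[target]
            count += 1
    return total / count


def pyramid_supervision_loss(side_outputs, labels):
    """Sum the loss over decoder outputs at several resolutions.

    side_outputs: list of (logits, factor) pairs, where factor says how many
    times smaller the logits map is than the full-resolution label map.
    Equal weighting of the scales is an assumption.
    """
    return sum(
        cross_entropy(logits, downsample_labels(labels, factor))
        for logits, factor in side_outputs
    )


# Toy example: a 4 x 4 label map with two classes, supervised at full and
# half resolution. The logits are placeholders, not real network outputs.
labels = [[0, 0, 1, 1],
          [0, 0, 1, 1],
          [1, 1, 0, 0],
          [1, 1, 0, 0]]
full = [[[0.0, 0.0] for _ in range(4)] for _ in range(4)]
half = [[[0.0, 0.0] for _ in range(2)] for _ in range(2)]
loss = pyramid_supervision_loss([(full, 1), (half, 2)], labels)
```

Because every supervised layer contributes its own loss term, gradients reach the deeper decoder layers directly rather than only through the final output, which is the motivation the abstract gives for the scheme.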

Keywords:

Computer science; Artificial intelligence; Computer vision; Modal; RGB color model; Segmentation; Sensor fusion; Fusion

Metrics

FWCI (Field Weighted Citation Impact): 1.28

Citation Normalized Percentile: 0.84

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Remote Sensing and LiDAR Applications

Physical Sciences → Environmental Science → Environmental Engineering

Related Documents

CMPFFNet: Cross-Modal and Progressive Feature Fusion Network for RGB-D Indoor Scene Semantic Segmentation

Multi‐modal deep network for RGB‐D segmentation of clothes

MAPNet: Multi-modal attentive pooling network for RGB-D indoor scene classification

CFANet: The Cross-Modal Fusion Attention Network for Indoor RGB-D Semantic Segmentation

Cascading attention enhancement network for RGB-D indoor scene segmentation