JOURNAL ARTICLE

DMFNet: Deep Multi-Modal Fusion Network for RGB-D Indoor Scene Segmentation

Jianzhong YuanWujie ZhouTing Luo

Year: 2019 Journal:   IEEE Access Vol: 7 Pages: 169350-169358   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Indoor scene segmentation is a difficult task in computer vision. We propose an indoor scene segmentation framework, called DFMNet, incorporating RGB and complementary depth information to establish indoor scene segmentation. We use the squeeze-and-excitation residual network as encoder to simultaneously extract features from RGB and depth data and fuse them in the decoder. Multiple average pooling layers and transposed convolution layers are used to process the encoded outputs and fuse their outputs over several decoder layers. To optimize the network parameters, we use a pyramid supervision training scheme, which applies supervised learning over different layers in the decoder to prevent vanishing gradients. We evaluated the proposed DFMNet on the NYU Depth V2 dataset, which consists of 1449 cluttered indoor scenes, achieving competitive results compared to state-of-the-art methods.

Keywords:
Computer science Artificial intelligence Computer vision Modal RGB color model Segmentation Sensor fusion Fusion

Metrics

39
Cited By
1.28
FWCI (Field Weighted Citation Impact)
68
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering

Related Documents

JOURNAL ARTICLE

CMPFFNet: Cross-Modal and Progressive Feature Fusion Network for RGB-D Indoor Scene Semantic Segmentation

Wujie ZhouYuxiang XiaoWeiqing YanLu Yu

Journal:   IEEE Transactions on Automation Science and Engineering Year: 2023 Vol: 21 (4)Pages: 5523-5533
JOURNAL ARTICLE

Multi‐modal deep network for RGB‐D segmentation of clothes

Boris JoukovskyPengpeng HuAdrian Munteanu

Journal:   Electronics Letters Year: 2020 Vol: 56 (9)Pages: 432-435
JOURNAL ARTICLE

CFANet: The Cross-Modal Fusion Attention Network for Indoor RGB-D Semantic Segmentation

Longtao WuDan WeiChang‐An Xu

Journal:   Journal of Imaging Year: 2025 Vol: 11 (6)Pages: 177-177
© 2026 ScienceGate Book Chapters — All rights reserved.