2.5D Convolution for RGB-D Semantic Segmentation

Yajie Xing; Jingbo Wang; Xiaokang Chen; Gang Zeng

doi:10.1109/icip.2019.8803757

ScienceGate Book Chapters

JOURNAL ARTICLE

2.5D Convolution for RGB-D Semantic Segmentation

Yajie Xing Jingbo Wang Xiaokang Chen Gang Zeng

Year: 2019 Pages: 1410-1414

DOI: 10.1109/icip.2019.8803757

Get Full-Text PDF Get Analytical Report

Abstract

Convolutional neural networks (CNN) have achieved great success in RGB semantic segmentation. RGB-D images provide additional depth information, which can improve segmentation performance. To take full advantages of the 3D geometry relations provided by RGB-D images, in this paper, we propose 2.5D convolution, which mimics one 3D convolution kernel by several masked 2D convolution kernels. Our 2.5D convolution can effectively process spatial relations between pixels in a manner similar to 3D convolution while still sampling pixels on 2D plane, and thus saves computational cost. And it can be seamlessly incorporated into pretrained CNNs. Experiments on two challenging RGB-D semantic segmentation benchmarks NYUDv2 and SUN-RGBD validate the effectiveness of our approach.

Keywords:

RGB color model Artificial intelligence Convolution (computer science) Computer science Kernel (algebra) Convolutional neural network Segmentation Computer vision Pixel Image segmentation Pattern recognition (psychology) Artificial neural network Mathematics

Metrics

Cited By

2.57

FWCI (Field Weighted Citation Impact)

Refs

0.92

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

2.5D Convolution for RGB-D Semantic Segmentation

Abstract

Metrics

Citation History

Topics

Related Documents

DCANet: Differential convolution attention network for RGB-D semantic segmentation

RGB-D Image Semantic Segmentation Based on Multi-Modal Adaptive Convolution

3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation

Project-and-fuse: Improving RGB-D semantic segmentation via graph convolution networks

Shape-Aware Convolution with Convolutional Kernel Attention for RGB-D Image Semantic Segmentation