Yajie Xing, Jingbo Wang, Xiaokang Chen, Gang Zeng
Convolutional neural networks (CNNs) have achieved great success in RGB semantic segmentation. RGB-D images provide additional depth information, which can improve segmentation performance. To take full advantage of the 3D geometric relations provided by RGB-D images, we propose 2.5D convolution, which mimics one 3D convolution kernel with several masked 2D convolution kernels. Our 2.5D convolution processes spatial relations between pixels in a manner similar to 3D convolution while still sampling pixels on the 2D plane, which saves computational cost, and it can be seamlessly incorporated into pretrained CNNs. Experiments on two challenging RGB-D semantic segmentation benchmarks, NYUDv2 and SUN-RGBD, validate the effectiveness of our approach.
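The idea of replacing one 3D kernel with several masked 2D kernels can be illustrated with a minimal single-channel sketch. It is not the authors' implementation. The names `plane_index`, `conv25d`, and `step`, the choice of three depth planes, and the rule that a neighbour belongs to the plane whose centre lies within `step / 2` of its depth difference to the centre pixel are all assumptions made for illustration.

```python
def plane_index(depth_diff, step):
    """Map a depth difference to plane 0 (nearer), 1 (same) or 2 (farther).

    Returns None when the neighbour lies outside all three planes, in which
    case it is masked out of every kernel (an assumption of this sketch).
    """
    for k, offset in enumerate((-1, 0, 1)):
        if abs(depth_diff - offset * step) <= step / 2:
            return k
    return None


def conv25d(feat, depth, kernels, step):
    """Hypothetical 2.5D convolution on one channel, 3x3 window, no padding.

    feat, depth: H x W nested lists (feature map and per-pixel depth).
    kernels: three 3x3 nested lists, one 2D kernel per depth plane.
    step: assumed thickness of one depth plane, in the units of `depth`.
    """
    h, w = len(feat), len(feat[0])
    out = [[0.0] * (w - 2) for _ in range(h - 2)]
    for i in range(1, h - 1):
        for j in range(1, w - 1):
            acc = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    # Depth of the neighbour relative to the centre pixel
                    diff = depth[i + di][j + dj] - depth[i][j]
                    k = plane_index(diff, step)
                    if k is not None:
                        # Only the kernel of the matching plane sees this pixel
                        acc += kernels[k][di + 1][dj + 1] * feat[i + di][j + dj]
            out[i - 1][j - 1] = acc
    return out
```

Pixels are still sampled on the regular 2D grid. Depth only decides which of the three 2D kernels weights each neighbour, which is what lets the operation imitate a 3x3x3 kernel at roughly the cost of 2D convolution. On a flat depth map every neighbour falls in the middle plane, so the sketch reduces to an ordinary 3x3 convolution with `kernels[1]`.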
Lizhi Bai, Jun Yang, Chunqi Tian, Yaoru Sun, Maoyu Mao, Yanjun Xu, XU Wei-rong
Qi-Chao Sun, Qing En, Lijuan Duan, Yuanhua Qiao
Yunlu Chen, Thomas Mensink, Efstratios Gavves
Xiaoyan Jiang, Bohan Wang, Xiaoyun Wan, Shanshan Chen, Hamido Fujita, Hanan Abd. Al Juaid
Kun Zhou, Zejun ZHANG, Xu Tang, Wen Xu, Jianxiao Xie, Changbing Tang