MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition

Anran Wang; Jianfei Cai; Jiwen Lu; Tat‐Jen Cham

doi:10.1109/iccv.2015.134

ScienceGate Book Chapters

JOURNAL ARTICLE

MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition

Anran Wang Jianfei Cai Jiwen Lu Tat‐Jen Cham

Year: 2015 Pages: 1125-1133

DOI: 10.1109/iccv.2015.134

Get Full-Text PDF Get Analytical Report

Abstract

Most of the feature-learning methods for RGB-D object recognition either learn features from color and depth modalities separately, or simply treat RGB-D as undifferentiated four-channel data, which cannot adequately exploit the relationship between different modalities. Motivated by the intuition that different modalities should contain not only some modal-specific patterns but also some shared common patterns, we propose a multi-modal feature learning framework for RGB-D object recognition. We first construct deep CNN layers for color and depth separately, and then connect them with our carefully designed multi-modal layers, which fuse color and depth information by enforcing a common part to be shared by features of different modalities. In this way, we obtain features reflecting shared properties as well as modal-specific properties in different modalities. The information of the multi-modal learning frameworks is back-propagated to the early CNN layers. Experimental results show that our proposed multi-modal feature learning method outperforms state-of-the-art approaches on two widely used RGB-D object benchmark datasets.

Keywords:

Computer science RGB color model Artificial intelligence Modal Modalities Feature (linguistics) Pattern recognition (psychology) Feature learning Object (grammar) Deep learning Computer vision Benchmark (surveying)

Metrics

101

Cited By

7.72

FWCI (Field Weighted Citation Impact)

Refs

0.98

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-modal deep feature learning for RGB-D object detection

Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition

RGB-D Scene Recognition via Spatial-Related Multi-Modal Feature Learning

Multi-channel feature dictionaries for RGB-D object recognition

Unsupervised Feature Learning for RGB-D Based Object Recognition