JOURNAL ARTICLE

MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition

Abstract

Most of the feature-learning methods for RGB-D object recognition either learn features from color and depth modalities separately, or simply treat RGB-D as undifferentiated four-channel data, which cannot adequately exploit the relationship between different modalities. Motivated by the intuition that different modalities should contain not only some modal-specific patterns but also some shared common patterns, we propose a multi-modal feature learning framework for RGB-D object recognition. We first construct deep CNN layers for color and depth separately, and then connect them with our carefully designed multi-modal layers, which fuse color and depth information by enforcing a common part to be shared by features of different modalities. In this way, we obtain features reflecting shared properties as well as modal-specific properties in different modalities. The information of the multi-modal learning frameworks is back-propagated to the early CNN layers. Experimental results show that our proposed multi-modal feature learning method outperforms state-of-the-art approaches on two widely used RGB-D object benchmark datasets.

Keywords:
Computer science RGB color model Artificial intelligence Modal Modalities Feature (linguistics) Pattern recognition (psychology) Feature learning Object (grammar) Deep learning Computer vision Benchmark (surveying)

Metrics

101
Cited By
7.72
FWCI (Field Weighted Citation Impact)
49
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-modal deep feature learning for RGB-D object detection

Xiangyang XuYuncheng LiGangshan WuJiebo Luo

Journal:   Pattern Recognition Year: 2017 Vol: 72 Pages: 300-313
JOURNAL ARTICLE

Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition

Anran WangJiwen LuJianfei CaiTat‐Jen ChamGang Wang

Journal:   IEEE Transactions on Multimedia Year: 2015 Vol: 17 (11)Pages: 1887-1898
JOURNAL ARTICLE

RGB-D Scene Recognition via Spatial-Related Multi-Modal Feature Learning

Zhitong XiongYuan YuanQi Wang

Journal:   IEEE Access Year: 2019 Vol: 7 Pages: 106739-106747
JOURNAL ARTICLE

Multi-channel feature dictionaries for RGB-D object recognition

Mina ChongJun LiJian SongXiaodong LanQiming Li

Journal:   Ninth International Conference on Graphic and Image Processing (ICGIP 2017) Year: 2018 Vol: 57 Pages: 160-160
BOOK-CHAPTER

Unsupervised Feature Learning for RGB-D Based Object Recognition

Liefeng BoXiaofeng RenDieter Fox

Springer tracts in advanced robotics Year: 2013 Pages: 387-402
© 2026 ScienceGate Book Chapters — All rights reserved.