RGB-D object recognition with multimodal deep convolutional neural networks

Mohammad Muntasir Rahman; Yanhao Tan; Jian Xue; Ke Lü

doi:10.1109/icme.2017.8019538

ScienceGate Book Chapters

JOURNAL ARTICLE

RGB-D object recognition with multimodal deep convolutional neural networks

Mohammad Muntasir Rahman Yanhao Tan Jian Xue Ke Lü

Year: 2017 Pages: 991-996

DOI: 10.1109/icme.2017.8019538

Get Full-Text PDF Get Analytical Report

Abstract

Object recognition from RGB-D images has become a hot topic and gained a significant popularity in recent years due to its numerous applications. In this paper, we propose a novel multimodal deep convolutional neural networks architecture for RGB-D object recognition which composed of three streams with two different types of deep CNNs, where each stream can separately learn from each modality. Finally, we propose a combined architecture of joint network of these three streams to classify the objects. Compared to RGB data, RGB-D images provide additional depth information that can be represented as depth colorization methods or surface normals. Our goal is to exploit both colorization and surface normals information to encode depth images. We show that by utilizing both colorization and surface normals of depth images combined with RGB significantly can improves the classification accuracy. We evaluate our model on one of the most challenging RGB-D object dataset and achieves comparable performance to state-of-the-art methods.

Keywords:

RGB color model Artificial intelligence Computer science Convolutional neural network Computer vision Pattern recognition (psychology) Deep learning Object (grammar) Modality (human–computer interaction) Cognitive neuroscience of visual object recognition Feature extraction

Metrics

Cited By

2.67

FWCI (Field Weighted Citation Impact)

Refs

0.92

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Industrial Vision Systems and Defect Detection

Physical Sciences → Engineering → Industrial and Manufacturing Engineering

3D Surveying and Cultural Heritage

Physical Sciences → Earth and Planetary Sciences → Geology

RGB-D object recognition with multimodal deep convolutional neural networks

Abstract

Metrics

Citation History

Topics

Related Documents

RGB-D Object Recognition Using Deep Convolutional Neural Networks

RGB-D-Based Object Recognition Using Multimodal Convolutional Neural Networks: A Survey

Revisiting Deep Convolutional Neural Networks for RGB-D Based Object Recognition

RGB-D Based Multimodal Convolutional Neural Networks for Spacecraft Recognition

Human action recognition using RGB-D sensor and deep convolutional neural networks