JOURNAL ARTICLE

Learning hierarchical sparse features for RGB-(D) object recognition

Liefeng BoXiaofeng RenDieter Fox

Year: 2014 Journal:   The International Journal of Robotics Research Vol: 33 (4)Pages: 581-599   Publisher: SAGE Publishing

Abstract

Recently introduced RGB-D cameras are capable of providing high quality synchronized videos of both color and depth. With its advanced sensing capabilities, this technology represents an opportunity to significantly increase the capabilities of object recognition. It also raises the problem of developing expressive features for the color and depth channels of these sensors. In this paper we introduce hierarchical matching pursuit (HMP) for RGB-D data. As a multi-layer sparse coding network, HMP builds feature hierarchies layer by layer with an increasing receptive field size to capture abstract representations from raw RGB-D data. HMP uses sparse coding to learn codebooks at each layer in an unsupervised way and builds hierarchical feature representations from the learned codebooks in conjunction with orthogonal matching pursuit, spatial pooling and contrast normalization. Extensive experiments on various datasets indicate that the features learned with our approach enable superior object recognition results using linear support vector machines.

Keywords:
Artificial intelligence Computer science RGB color model Pattern recognition (psychology) Normalization (sociology) Neural coding Pooling Feature (linguistics) Computer vision Matching pursuit Feature extraction Coding (social sciences) Discriminative model Compressed sensing Mathematics

Metrics

63
Cited By
18.78
FWCI (Field Weighted Citation Impact)
80
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Sparse and Compressive Sensing Techniques
Physical Sciences →  Engineering →  Computational Mechanics
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.