JOURNAL ARTICLE

Discriminative feature learning for efficient RGB-D object recognition

Abstract

© 2015 IEEE. This paper presents an efficient approach to recognize objects captured with an RGB-D sensor. The proposed approach uses a Bag-of-Words (BOW) model to learn feature representations from raw RGB-D point clouds in a weakly supervised manner. To this end, we introduce a novel method based on randomized clustering trees to learn visual vocabularies which are fast to compute and more discriminative compared to the vocabularies generated by classical methods such as k-means. We show that, when combined with standard spatial pooling strategies, our proposed approach yields a powerful feature representation for RGB-D object recognition. Our extensive experimental evaluation on two challenging RGB-D object datasets and live video streams from Kinect shows that our learned features result in superior object recognition accuracies compared with the state-of-the-art methods.

Keywords:
Artificial intelligence Discriminative model Computer science RGB color model Pattern recognition (psychology) Object (grammar) Feature (linguistics) Feature learning Cluster analysis Cognitive neuroscience of visual object recognition Computer vision Pooling Point cloud Feature extraction Representation (politics) 3D single-object recognition Object detection

Metrics

8
Cited By
3.64
FWCI (Field Weighted Citation Impact)
41
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.