Unsupervised Joint Feature Learning and Encoding for RGB-D Scene Labeling

Anran Wang; Jiwen Lu; Jianfei Cai; Gang Wang; Tat‐Jen Cham

doi:10.1109/tip.2015.2465133

ScienceGate Book Chapters

JOURNAL ARTICLE

Unsupervised Joint Feature Learning and Encoding for RGB-D Scene Labeling

Anran Wang Jiwen Lu Jianfei Cai Gang Wang Tat‐Jen Cham

Year: 2015 Journal: IEEE Transactions on Image Processing Vol: 24 (11)Pages: 4459-4473 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tip.2015.2465133

Get Full-Text PDF Get Analytical Report

Abstract

Most existing approaches for RGB-D indoor scene labeling employ hand-crafted features for each modality independently and combine them in a heuristic manner. There has been some attempt on directly learning features from raw RGB-D data, but the performance is not satisfactory. In this paper, we propose an unsupervised joint feature learning and encoding (JFLE) framework for RGB-D scene labeling. The main novelty of our learning framework lies in the joint optimization of feature learning and feature encoding in a coherent way, which significantly boosts the performance. By stacking basic learning structure, higher level features are derived and combined with lower level features for better representing RGB-D data. Moreover, to explore the nonlinear intrinsic characteristic of data, we further propose a more general joint deep feature learning and encoding (JDFLE) framework that introduces the nonlinear mapping into JFLE. The experimental results on the benchmark NYU depth dataset show that our approaches achieve competitive performance, compared with the state-of-the-art methods, while our methods do not need complex feature handcrafting and feature combination and can be easily applied to other data sets.

Keywords:

Artificial intelligence Computer science Feature (linguistics) Encoding (memory) RGB color model Pattern recognition (psychology) Feature learning Benchmark (surveying) Feature extraction Unsupervised learning Machine learning

Metrics

Cited By

5.01

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Remote Sensing and LiDAR Applications

Physical Sciences → Environmental Science → Environmental Engineering

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Unsupervised Joint Feature Learning and Encoding for RGB-D Scene Labeling

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-modal Unsupervised Feature Learning for RGB-D Scene Labeling

Unsupervised feature learning for 3D scene labeling

Feature learning for RGB-D scene understanding

Unsupervised Feature Learning for RGB-D Image Classification

Unsupervised Feature Learning for RGB-D Based Object Recognition