JOURNAL ARTICLE

Detecting object affordances with Convolutional Neural Networks

Abstract

We present a novel and real-time method to detect \nobject affordances from RGB-D images. Our method trains \na deep Convolutional Neural Network (CNN) to learn deep \nfeatures from the input data in an end-to-end manner. The CNN \nhas an encoder-decoder architecture in order to obtain smooth \nlabel predictions. The input data are represented as multiple \nmodalities to let the network learn the features more effectively. \nOur method sets a new benchmark on detecting object affordances, improving the accuracy by 20% in comparison with \nthe state-of-the-art methods that use hand-designed geometric \nfeatures. Furthermore, we apply our detection method on a \nfull-size humanoid robot (WALK-MAN) to demonstrate that \nthe robot is able to perform grasps after efficiently detecting \nthe object affordances.

Keywords:
Affordance Computer science Artificial intelligence Convolutional neural network Humanoid robot Benchmark (surveying) Object (grammar) Computer vision RGB color model Encoder Object detection Deep learning Robot Pattern recognition (psychology) Human–computer interaction

Metrics

171
Cited By
16.67
FWCI (Field Weighted Citation Impact)
46
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
Robotic Locomotion and Control
Physical Sciences →  Engineering →  Biomedical Engineering
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.