Abstract

Localizing functional regions of objects or affordances is an important aspect of scene understanding and relevant for many robotics applications. In this work, we introduce a pixel-wise annotated affordance dataset of 3090 images containing 9916 object instances. Since parts of an object can have multiple affordances, we address this by a convolutional neural network for multilabel affordance segmentation. We also propose an approach to train the network from very few keypoint annotations. Our approach achieves a higher affordance detection accuracy than other weakly supervised methods that also rely on keypoint annotations or image annotations as weak supervision.

Keywords:
Affordance Computer science Artificial intelligence Convolutional neural network Object (grammar) Segmentation Pattern recognition (psychology) Computer vision Machine learning Human–computer interaction

Metrics

87
Cited By
8.39
FWCI (Field Weighted Citation Impact)
53
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
Image and Object Detection Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Weakly Supervised Multimodal Affordance Grounding for Egocentric Images

Lingjing XuYang GaoWenfeng SongAimin Hao

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2024 Vol: 38 (6)Pages: 6324-6332
BOOK-CHAPTER

INTRA: Interaction Relationship-Aware Weakly Supervised Affordance Grounding

Ji Ha JangHoigi SeoSe Young Chun

Lecture notes in computer science Year: 2024 Pages: 18-34
JOURNAL ARTICLE

Weakly Supervised Action Detection

Parthipan SivaTao Xiang

Year: 2011 Pages: 65.1-65.0
© 2026 ScienceGate Book Chapters — All rights reserved.