JOURNAL ARTICLE

Object-Centric Scene Representations Using Active Inference

Abstract

Representing a scene and its constituent objects from raw sensory data is a core ability for enabling robots to interact with their environment. In this letter, we propose a novel approach to scene understanding, leveraging an object-centric generative model that enables an agent to infer object category and pose in an allocentric reference frame using active inference, a neuro-inspired framework for action and perception. To evaluate the behavior of an active vision agent, we also propose a new benchmark in which, given a target viewpoint of a particular object, the agent must find the best matching viewpoint in a workspace with randomly positioned objects in 3D. We demonstrate that our active inference agent is able to balance epistemic foraging and goal-driven behavior, and quantitatively outperforms both supervised and reinforcement learning baselines by more than a factor of two in terms of success rate.
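The balance between epistemic foraging and goal-driven behavior described above is the hallmark of expected-free-energy minimization in active inference. The following is a minimal, hypothetical sketch (not the paper's implementation) of how an agent might score candidate viewpoints: each viewpoint's expected free energy combines an epistemic term (expected information gain about the object category) and an instrumental term (expected log-preference over observations), and the agent favors viewpoints with low expected free energy. All sizes, the likelihood tensor `A`, and the preference vector `C` are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

rng = np.random.default_rng(0)

# Hypothetical sizes: 4 candidate viewpoints, 3 object categories, 2 observation outcomes.
n_views, n_states, n_obs = 4, 3, 2

# q_s: current (approximate posterior) belief over the object category.
q_s = np.array([0.5, 0.3, 0.2])

# A[v, s, o]: likelihood of observing outcome o from viewpoint v given category s
# (randomly sampled here purely for illustration).
A = rng.dirichlet(np.ones(n_obs), size=(n_views, n_states))

# C: log-preferences over observations; the "goal" observation is preferred.
C = np.log(np.array([0.9, 0.1]))

G = np.zeros(n_views)
for v in range(n_views):
    q_o = q_s @ A[v]                       # predicted observation distribution
    # Posterior over categories for each possible observation (Bayes' rule).
    post = A[v] * q_s[:, None]             # shape (n_states, n_obs), unnormalized
    post = post / post.sum(axis=0, keepdims=True)
    # Epistemic value: expected KL between posterior and prior beliefs,
    # i.e. the expected information gain about the category.
    kl = (post * np.log(post / q_s[:, None])).sum(axis=0)
    epistemic = q_o @ kl
    # Instrumental value: expected log-preference of the predicted observation.
    instrumental = q_o @ C
    G[v] = -(epistemic + instrumental)     # expected free energy (lower is better)

p_view = softmax(-G)                       # distribution over next viewpoints
best = int(np.argmin(G))
```

Early on, when `q_s` is flat, the epistemic term dominates and the agent explores informative viewpoints; as beliefs sharpen, the instrumental term takes over and drives it toward the preferred (goal) observation.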

Keywords:
Inference, Computer science, Artificial intelligence, Object, Benchmark, Workspace, Generative model, Active perception, Representation, Active vision, Matching, Perception, Machine learning, Computer vision, Robot, Mathematics

Metrics

Cited by: 5
FWCI (Field-Weighted Citation Impact): 1.59
References: 70
Citation Normalized Percentile: 0.72

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
