JOURNAL ARTICLE

Multi-view Fusion for Multi-level Robotic Scene Understanding

Yunzhi LinJonathan TremblayStephen TyreePatricio A. VelaStan Birchfield

Year: 2021 Journal:   2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Pages: 6817-6824

Abstract

We present a system for multi-level scene awareness for robotic manipulation. Given a sequence of camera-inhand RGB images, the system calculates three types of information: 1) a point cloud representation of all the surfaces in the scene, for the purpose of obstacle avoidance. 2) the rough pose of unknown objects from categories corresponding to primitive shapes (e.g., cuboids and cylinders), and 3) full 6-DoF pose of known objects. By developing and fusing recent techniques in these domains, we provide a rich scene representation for robot awareness. We demonstrate the importance of each of these modules, their complementary nature, and the potential benefits of the system in the context of robotic manipulation.

Keywords:
Computer vision Artificial intelligence Computer science Point cloud Representation (politics) Context (archaeology) Robot RGB color model Point (geometry) Obstacle Mathematics Geography

Metrics

25
Cited By
9.84
FWCI (Field Weighted Citation Impact)
71
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology
Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering

Related Documents

JOURNAL ARTICLE

Leveraging Multi-View Information for Scene Understanding

ELKHOULY, MOHAMED DAHY ABDELZAHER

Journal:   CINECA IRIS Institutial Research Information System (University of Genoa) Year: 2021
JOURNAL ARTICLE

Multi-view Consistent 3D Panoptic Scene Understanding

Xianzhu LiuXin SunHaozhe XieZonglin LiRu LiShengping Zhang

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2025 Vol: 39 (6)Pages: 5613-5621
DISSERTATION

Efficient multi-level scene understanding in videos

Buyu Liu

University:   ANU Open Research (Australian National University) Year: 2016
JOURNAL ARTICLE

Semantic Scene Completion through Multi-Level Feature Fusion

Ruochong FuHang WuMengxiang HaoYubin Miao

Journal:   2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Year: 2022 Pages: 8399-8406
© 2026 ScienceGate Book Chapters — All rights reserved.