Multi-view Fusion for Multi-level Robotic Scene Understanding

Yunzhi Lin; Jonathan Tremblay; Stephen Tyree; Patricio A. Vela; Stan Birchfield

doi:10.1109/iros51168.2021.9635994

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-view Fusion for Multi-level Robotic Scene Understanding

Yunzhi Lin Jonathan Tremblay Stephen Tyree Patricio A. Vela Stan Birchfield

Year: 2021 Journal: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Pages: 6817-6824

DOI: 10.1109/iros51168.2021.9635994

Get Full-Text PDF Get Analytical Report

Abstract

We present a system for multi-level scene awareness for robotic manipulation. Given a sequence of camera-inhand RGB images, the system calculates three types of information: 1) a point cloud representation of all the surfaces in the scene, for the purpose of obstacle avoidance. 2) the rough pose of unknown objects from categories corresponding to primitive shapes (e.g., cuboids and cylinders), and 3) full 6-DoF pose of known objects. By developing and fusing recent techniques in these domains, we provide a rich scene representation for robot awareness. We demonstrate the importance of each of these modules, their complementary nature, and the potential benefits of the system in the context of robotic manipulation.

Keywords:

Computer vision Artificial intelligence Computer science Point cloud Representation (politics) Context (archaeology) Robot RGB color model Point (geometry) Obstacle Mathematics Geography

Metrics

Cited By

9.84

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

3D Surveying and Cultural Heritage

Physical Sciences → Earth and Planetary Sciences → Geology

Robot Manipulation and Learning

Physical Sciences → Engineering → Control and Systems Engineering

Multi-view Fusion for Multi-level Robotic Scene Understanding

Abstract

Metrics

Citation History

Topics

Related Documents

Leveraging Multi-View Information for Scene Understanding

Multi-view Consistent 3D Panoptic Scene Understanding

Efficient multi-level scene understanding in videos

Multi-level scene understanding via hierarchical classification

Semantic Scene Completion through Multi-Level Feature Fusion