3D Semantic Scene Completion With Multi-scale Feature Maps and Masked Autoencoder

Sangmin Park; Jong-Eun Ha

doi:10.5302/j.icros.2023.23.0143

ScienceGate Book Chapters

JOURNAL ARTICLE

3D Semantic Scene Completion With Multi-scale Feature Maps and Masked Autoencoder

Sangmin Park Jong-Eun Ha

Year: 2023 Journal: Journal of Institute of Control Robotics and Systems Vol: 29 (12)Pages: 966-972

DOI: 10.5302/j.icros.2023.23.0143

Get Full-Text PDF Get Analytical Report

Abstract

Autonomous systems require a profound understanding of their surroundings, encompassing both semantic and 3D geometry. This study focuses on advancing 3D semantic scene completion approaches using a camera. Building upon the foundation laid by VoxFormer [1], which is recognized for its state-of-the-art performance in 3D semantic scene completion, our approach involves two distinct stages. In the initial stage, scene completion is done with depth images, while in the second stage, the final 3D scene completion is performed using masked autoencoder. To enhance the performance of VoxFormer, we introduced two key modifications. First, we modified the first stage using multi-scale feature maps. Second, we further modified the first stage using a masked autoencoder. Experimental results, based on the adapted VoxFormer model in both stages are presented. Our two proposed approaches exhibit notable improvements, particularly in the context of small objects. However, these enhancements warrant further investigation for optimization and refinement.

Keywords:

Autoencoder Computer science Artificial intelligence Feature (linguistics) Context (archaeology) Semantic feature Stage (stratigraphy) Scale (ratio) Key (lock) Natural language processing Computer vision Pattern recognition (psychology) Deep learning Cartography Geography Linguistics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.18

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Computer Graphics and Visualization Techniques

Physical Sciences → Computer Science → Computer Graphics and Computer-Aided Design

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

3D Semantic Scene Completion With Multi-scale Feature Maps and Masked Autoencoder

Abstract

Metrics

Topics

Related Documents

Multi-Head Multi-Scale Feature Fusion Network for Semantic Scene Completion

Semantic Scene Completion through Multi-Level Feature Fusion

Semantic Scene Completion with 2D and 3D Feature Fusion

MAESR360: Masked autoencoder-based 360-degree video streaming via multi-scale feature fusion

Multi-Scale Multi-Feature Context Modeling for Scene Recognition in the Semantic Manifold