JOURNAL ARTICLE

Semantic Scene Completion via Semantic-Aware Guidance and Interactive Refinement Transformer

Haihong Xiao, Wenxiong Kang, Hao Liu, Yuqiong Li, Ying He

Year: 2024; Journal: IEEE Transactions on Circuits and Systems for Video Technology; Vol: 35 (5); Pages: 4212-4225; Publisher: Institute of Electrical and Electronics Engineers

Abstract

Predicting per-voxel occupancy status and corresponding semantic labels in 3D scenes is pivotal to 3D intelligent perception in autonomous driving. In this paper, we propose a novel semantic scene completion framework that can generate complete 3D volumetric semantics from a single image at a low cost. To the best of our knowledge, this is the first endeavor specifically aimed at mitigating the negative impacts of incorrect voxel query proposals caused by erroneous depth estimates and enhancing interactions for positive ones in camera-based semantic scene completion tasks. Specifically, we present a straightforward yet effective Semantic-aware Guided (SAG) module, which seamlessly integrates with task-related semantic priors to facilitate effective interactions between image features and voxel query proposals in a plug-and-play manner. Furthermore, we introduce a set of learnable object queries to better perceive objects within the scene. Building on this, we propose an Interactive Refinement Transformer (IRT) block, which iteratively updates voxel query proposals to enhance the perception of semantics and objects within the scene by leveraging the interaction between object queries and voxel queries through query-to-query cross-attention. Extensive experiments demonstrate that our method outperforms existing state-of-the-art approaches, achieving overall mIoU improvements of 0.30 and 2.74 on the SemanticKITTI and SSCBench-KITTI-360 validation sets, respectively, while also showing superior performance in small object generation.
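
The query-to-query cross-attention mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' IRT block: it assumes a single attention head, no learned projections, a plain residual update, and toy 2-dimensional queries. The names `cross_attend` and `refine` and the parameter `num_iters` are hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(queries, keys_values):
    """Single-head scaled dot-product cross-attention with a residual update.

    Each row of `queries` attends over all rows of `keys_values`, which serve
    as both keys and values (assumption: no learned projections).
    """
    dim = len(queries[0])
    scale = 1.0 / math.sqrt(dim)
    updated = []
    for q in queries:
        # Similarity of this query to every key.
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys_values]
        weights = softmax(scores)
        # Attention-weighted sum of the values.
        context = [
            sum(w * kv[d] for w, kv in zip(weights, keys_values))
            for d in range(dim)
        ]
        # Residual connection: add the context to the original query.
        updated.append([qd + cd for qd, cd in zip(q, context)])
    return updated


def refine(voxel_queries, object_queries, num_iters=2):
    """Iteratively update voxel queries from object queries (hypothetical loop)."""
    for _ in range(num_iters):
        voxel_queries = cross_attend(voxel_queries, object_queries)
    return voxel_queries


voxel_queries = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # 3 toy voxel query proposals
object_queries = [[2.0, 0.0], [0.0, 2.0]]             # 2 toy object queries
refined = refine(voxel_queries, object_queries)
```

In this toy setup each voxel query is pulled toward the object query it most resembles, while a voxel query equally similar to both objects receives an equal mix of the two.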

Keywords:
Computer science, Semantic computing, Transformer, Artificial intelligence, Semantic technology, Natural language processing, Semantic compression, Information retrieval, Semantic Web

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 82
Citation Normalized Percentile: 0.26

Topics

Video Analysis and Summarization
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Time Series Analysis and Forecasting
Physical Sciences → Computer Science → Signal Processing
Data Visualization and Analytics
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Geometry-semantic aware for monocular 3D Semantic Scene Completion

Zonghao Lu, Bing Cao, Shuyin Xia, Qinghua Hu

Journal: Pattern Recognition; Year: 2024; Vol: 158; Pages: 111030

JOURNAL ARTICLE

Instance-Aware Monocular 3D Semantic Scene Completion

Haihong Xiao, Hongbin Xu, Wenxiong Kang, Yuqiong Li

Journal: IEEE Transactions on Intelligent Transportation Systems; Year: 2024; Vol: 25 (7); Pages: 6543-6554

JOURNAL ARTICLE

2D Semantic-Guided Semantic Scene Completion

Xianzhu Liu, Haozhe Xie, Shengping Zhang, Hongxun Yao, Rongrong Ji, Liqiang Nie, Dacheng Tao

Journal: International Journal of Computer Vision; Year: 2024; Vol: 133 (3); Pages: 1306-1325