JOURNAL ARTICLE

Multimodal Fusion and Data Augmentation for 3D Semantic Segmentation

Dong HeFurqan AbidJong-Hwan Kim

Year: 2022 Journal:   2022 22nd International Conference on Control, Automation and Systems (ICCAS) Pages: 1143-1148

Abstract

Since modern autonomous driving (AD) platforms offer a variety of sensors, it is intuitive to leverage complementary data from multimodal sensors to produce reliable 3D semantic segmentation. However, due to the information loss and the sub-optimized fusion in multimodal fusion methods, LiDAR-only methods currently occupy the top positions in the leaderboard of datasets. In this paper, we focus on two aspects to improve the LiDAR-camera fusion semantic segmentation performance, namely data augmentation and fusion strategy. First, we propose an novel data augmentation by refining point-image patches. Second, we design an attention fusion block for the dual-branch segmentation network by considering the modality gap between LiDAR and RGB camera. Experiments on nuScences indicate that our proposed method outperforms the baseline methods on key classes.

Keywords:
Computer science Segmentation Leverage (statistics) Artificial intelligence Lidar Computer vision RGB color model Fusion Key (lock) Sensor fusion Image fusion Focus (optics) Block (permutation group theory) Image segmentation Image (mathematics) Remote sensing

Metrics

2
Cited By
0.14
FWCI (Field Weighted Citation Impact)
42
Refs
0.46
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.