JOURNAL ARTICLE

Multi-Modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception

Yong ZhouZeming XieJiaqi ZhaoWenliang DuRui YaoAbdulmotaleb El Saddik

Year: 2024 Journal:   ACM Transactions on Multimedia Computing Communications and Applications Vol: 20 (10)Pages: 1-20   Publisher: Association for Computing Machinery

Abstract

Point cloud segmentation is essential for scene understanding, which provides advanced information for many applications, such as autonomous driving, robots, and virtual reality. To improve the accuracy and robustness of point cloud segmentation, many researchers have attempted to fuze camera images to complement the color and texture information. The common fusion strategy is the combination of convolutional operations with concatenation, element-wise addition or element-wise multiplication. However, conventional convolutional operators tend to confine the fusion of modal features within their receptive fields, which can be incomplete and limited. In addition, the inability of encoder–decoder segmentation networks to explicitly perceive segmentation boundary information results in semantic ambiguity and classification errors at object edges. These errors are further amplified in point cloud segmentation tasks, significantly affecting the accuracy of point cloud segmentation. To address the above issues, we propose a novel self-attention multi-modal fusion semantic segmentation network for point cloud semantic segmentation. Firstly, to effectively fuze different modal features, we propose a self-cross fusion module (SCF), which models long-range modality dependencies and transfers complementary image information to the point cloud to fully leverage the modality-specific advantages. Secondly, we design the salience refinement module (SR), which calculates the importance of channels in the feature maps and global descriptors to enhance the representation capability of salient modal features. Finally, we propose the local-aware anisotropy loss measure the element-level importance in the data and explicitly provide boundary information for the model, which alleviates the inherent semantic ambiguity problem in segmentation networks. Extensive experiments on two benchmark datasets demonstrate that our proposed method surpasses current state-of-the-art methods.

Keywords:
Point cloud Modal Lidar Segmentation Computer science Salience (neuroscience) Perception Boundary (topology) Artificial intelligence Computer vision Remote sensing Geography Mathematics Psychology Mathematical analysis

Metrics

5
Cited By
1.94
FWCI (Field Weighted Citation Impact)
39
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-guided feature refinement for point cloud semantic segmentation with weakly supervision

Yufan WangQunfei ZhaoZeyang Xia

Journal:   Knowledge-Based Systems Year: 2025 Vol: 311 Pages: 113050-113050
JOURNAL ARTICLE

MSCSeg: Multi-scale contextual network for LiDAR point cloud semantic segmentation

Yan ZhouYichao FanHaibin ZhouRichard Irampaye

Journal:   Signal Image and Video Processing Year: 2025 Vol: 19 (13)
JOURNAL ARTICLE

Cross-modal semantic transfer for point cloud semantic segmentation

Zhen CaoXiaoxin MiBo QiuZhipeng CaoChen LongXinrui YanChao ZhengZhen DongBisheng Yang

Journal:   ISPRS Journal of Photogrammetry and Remote Sensing Year: 2025 Vol: 221 Pages: 265-279
JOURNAL ARTICLE

Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation

Kyungmin Kim

Journal:   Sensors Year: 2024 Vol: 24 (23)Pages: 7840-7840
© 2026 ScienceGate Book Chapters — All rights reserved.