JOURNAL ARTICLE

MFSM-Net: Multimodal Feature Fusion for the Semantic Segmentation of Urban-Scale Textured 3D Meshes

Xinjie HaoJiahui WangWei LengRongting ZhangGuangyun Zhang

Year: 2025 Journal:   Remote Sensing Vol: 17 (9)Pages: 1573-1573   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The semantic segmentation of textured 3D meshes is a critical step in constructing city-scale realistic 3D models. Compared to colored point clouds, textured 3D meshes have the advantage of high-resolution texture image patches embedded on each mesh face. However, existing studies predominantly focus on their geometric structures, with limited utilization of these high-resolution textures. Inspired by the binocular perception of humans, this paper proposes a multimodal feature fusion network based on 3D geometric structures and 2D high-resolution texture images for the semantic segmentation of textured 3D meshes. Methodologically, the 3D feature extraction branch computes the centroid coordinates and face normals of mesh faces as initial 3D features, followed by a multi-scale Transformer network to extract high-level 3D features. The 2D feature extraction branch employs orthographic views of city scenes captured from a top-down perspective and uses a U-Net to extract high-level 2D features. To align features across 2D and 3D modalities, a Bridge view-based alignment algorithm is proposed, which visualizes the 3D mesh indices to establish pixel-level associations with orthographic views, achieving the precise alignment of multimodal features. Experimental results demonstrate that the proposed method achieves competitive performance in city-scale textured 3D mesh semantic segmentation, validating the effectiveness and potential of the cross-modal fusion strategy.

Keywords:
Computer science Polygon mesh Segmentation Scale (ratio) Fusion Artificial intelligence Feature (linguistics) Computer vision Computer graphics (images) Cartography Geography

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
46
Refs
0.13
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

3D Shape Modeling and Analysis
Physical Sciences →  Engineering →  Computational Mechanics
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

SEMANTIC SEGMENTATION OF URBAN TEXTURED MESHES THROUGH POINT SAMPLING

Grégoire GrzeczkowiczBruno Vallet

Journal:   ISPRS annals of the photogrammetry, remote sensing and spatial information sciences Year: 2022 Vol: V-2-2022 Pages: 177-184
JOURNAL ARTICLE

Semantic segmentation of 3D textured meshes for urban scene analysis

Mohammad RouhaniFlorent LafargePierre Alliez

Journal:   ISPRS Journal of Photogrammetry and Remote Sensing Year: 2016 Vol: 123 Pages: 124-139
JOURNAL ARTICLE

Improving Semantic Image Segmentation via Label Fusion in Semantically Textured Meshes

Florian FerversTimo BreuerGregor StachowiakSebastian BullingerChristoph BodensteinerMichael Arens

Journal:   Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications Year: 2022 Pages: 509-516
JOURNAL ARTICLE

DynaFusion: Dynamic Feature Fusion for Multimodal 3D Semantic Segmentation

Tae-Ho KimJoohee Kim

Journal:   IEEE Access Year: 2025 Vol: 13 Pages: 213140-213150
© 2026 ScienceGate Book Chapters — All rights reserved.