JOURNAL ARTICLE

STUCNET – SWIN TRANSFORMER-V2 UNET FOR CRACK SEGMENTATION NETWORK

Abstract

Automatic crack detection on road surfaces is an important task for supporting the quality control of road infrastructure in transportation. Various methods have been proposed for crack segmentation, but their accuracy is still limited. To improve the effectiveness of crack detection, we propose the Swin Transformer v2 U-Net for Crack Segmentation Network model (STUCNet) for crack recognition. The proposed model combines the advantages of the Swin Transformer v2 into the encoding module of the U-Net-based architecture to enhance the quality of semantic image segmentation. Specifically, the model integrates Swin Transformer v2 with shifted windows as the encoder to extract contextual features for crack segmentation. The symmetric decoder is based on a convolutional neural network with attention designed to perform upsampling operations to restore the spatial resolution of the feature map. We evaluate the STUCNet model on a large dataset containing cracks collected in different contexts. Compared to current advanced models, the proposed method achieves the state-of-the-art (SOTA) results for crack segmentation.

Keywords:
Segmentation Computer science Upsampling Transformer Encoder Convolutional neural network Artificial intelligence Deep learning Computer vision Pattern recognition (psychology) Engineering Voltage Image (mathematics)

Metrics

1
Cited By
0.20
FWCI (Field Weighted Citation Impact)
26
Refs
0.44
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Infrastructure Maintenance and Monitoring
Physical Sciences →  Engineering →  Civil and Structural Engineering
Asphalt Pavement Performance Evaluation
Physical Sciences →  Engineering →  Civil and Structural Engineering
Concrete Corrosion and Durability
Physical Sciences →  Engineering →  Civil and Structural Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.