JOURNAL ARTICLE

Swin transformer adaptation into YOLOv7 for road damage detection

Riyandi Banovbi Putera IrsalFitri UtaminingrumKohichi Ogata

Year: 2024 Journal:   Bulletin of Electrical Engineering and Informatics Vol: 13 (4)Pages: 2527-2536   Publisher: Institute of Advanced Engineering and Science (IAES)

Abstract

Highways are an important component of any country. However, some highways in Indonesia endanger users while maintaining road safety. Crack detection early in the deterioration process can prevent further damage and lower maintenance costs. A recent study sought to develop a method for detecting road damage by combining the road damage detection (RDD) dataset with generative adversarial network technology and data augmentation to improve training. The current study aims to broaden the you only look once (YOLO) framework by incorporating the Swin Transformer into the chiral stationary phases (CSP) component of YOLOv7, with the goal of improving object detection accuracy in a variety of visual scenarios. The study compares the performance of various object detection models with varying parameters and configurations, such as YOLOv5l, YOLOv6l, YOLOv7-tiny, YOLOv7, and YOLOv7x. YOLOv5l has 46 million parameters and 108 billion floating point operations per second (FLOPS), whereas YOLOv6l has 59.5 million parameters and 150 billion FLOPS. With 31 million parameters and 140 billion FLOPS, the YOLOv7-swin model performs best with mean average precision (mAP), mAP_0.50 of 0.47. and mAP_0.5:0.95 of 0.232. The experimental results show that our YOLOv7-swin model outperforms both YOLOv7x and YOLOv7-tiny. The proposed model significantly improves object detection accuracy while keeping complexity and performance in balance.

Keywords:
FLOPS Transformer Computer science Object detection Component (thermodynamics) Artificial intelligence Real-time computing Data mining Machine learning Engineering Pattern recognition (psychology)

Metrics

3
Cited By
1.59
FWCI (Field Weighted Citation Impact)
24
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Infrastructure Maintenance and Monitoring
Physical Sciences →  Engineering →  Civil and Structural Engineering
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

STA-YOLOv7: Swin-Transformer-Enabled YOLOv7 for Road Damage Detection

Dong Zhang

Journal:   Computer Science and Application Year: 2023 Vol: 13 (05)Pages: 1157-1165
BOOK-CHAPTER

Improved YOLOv7 for Road Damage Detection

Dongmei ZhangZhijie Xu

Lecture notes in electrical engineering Year: 2023 Pages: 559-567
JOURNAL ARTICLE

Road Damage Detection and Classification with YOLOv7

Vung PhamDu NguyenChristopher Donan

Journal:   2022 IEEE International Conference on Big Data (Big Data) Year: 2022 Pages: 6416-6423
© 2026 ScienceGate Book Chapters — All rights reserved.