Enhancing Container Damage Detection with improved YOLOv5 Model: Integrating Swin Transformer

Jiahao Chen

doi:10.65286/icic.v20i1.41531

JOURNAL ARTICLE

Year: 2024 Pages: 670-685

Abstract

Container damage detection is critical to the safety of port logistics transportation. Container damage is diverse and includes small-scale damage (e.g., holes, dents, scratches). Traditional object detection algorithms used for container damage detection suffer from low accuracy and high miss rates on small-scale objects. This paper proposes an improved YOLOv5 model for container damage detection, based on the Transformer self-attention mechanism. To capture global and long-range relationships in damage images, two layers of Swin Transformer blocks are added to the YOLOv5 backbone network. The PANet in the YOLOv5 neck is replaced with BiFPN, which enhances the fusion of multi-scale features in damage images while reducing computational complexity and information loss. Furthermore, the Focaler-IoU loss function is used to improve the balance of features extracted from different samples in the dataset. The training set is clustered with the KMeans algorithm to obtain 9 initial anchor boxes better suited to the container damage dataset. Experimental results on the COCO and official Tianjin Port container damage datasets show that the improved model achieves an mAP of 95.4%, outperforming common object detection algorithms such as Fast-RCNN and YOLOv5.
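The anchor-clustering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which distance measure or initialization the author used, so the choices here are assumptions: boxes are compared by width-height IoU (a common choice in YOLO-style anchor clustering), the initial centers are random samples, and the box sizes are synthetic. The names `wh_iou` and `kmeans_anchors` are hypothetical helpers, not part of YOLOv5.

```python
import numpy as np

def wh_iou(boxes, anchors):
    """IoU between (N, 2) box sizes and (K, 2) anchor sizes, with all
    rectangles aligned at a shared corner (only width/height matter)."""
    inter = (np.minimum(boxes[:, None, 0], anchors[None, :, 0])
             * np.minimum(boxes[:, None, 1], anchors[None, :, 1]))
    area_b = boxes[:, 0] * boxes[:, 1]
    area_a = anchors[:, 0] * anchors[:, 1]
    return inter / (area_b[:, None] + area_a[None, :] - inter)

def kmeans_anchors(boxes, k=9, iters=100, seed=0):
    """Cluster (width, height) pairs into k anchors.
    Assumption: each box joins the anchor with the highest IoU, and each
    anchor is updated to the mean size of its members."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(boxes), size=k, replace=False)
    anchors = boxes[idx].astype(float)
    for _ in range(iters):
        assign = wh_iou(boxes, anchors).argmax(axis=1)
        new = anchors.copy()
        for j in range(k):
            members = boxes[assign == j]
            if len(members):          # keep the old anchor if a cluster is empty
                new[j] = members.mean(axis=0)
        if np.allclose(new, anchors): # converged
            break
        anchors = new
    # Sort by area so small anchors come first
    return anchors[np.argsort(anchors.prod(axis=1))]

# Synthetic (width, height) pairs in pixels; NOT the paper's dataset
rng = np.random.default_rng(42)
boxes = rng.uniform(8, 320, size=(500, 2))
anchors = kmeans_anchors(boxes, k=9)
print(np.round(anchors, 1))
```

In a YOLOv5-style detector, the 9 resulting anchors are typically split into three groups of three, ordered by area, with one group assigned to each detection scale.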

Related Documents

YOLOv5 UAV Feature Detection with Swin-Transformer

SF-YOLOv5: Improved YOLOv5 with swin transformer and fusion-concat method for multi-UAV detection

DenseSPH-YOLOv5: An automated damage detection model based on DenseNet and Swin-Transformer prediction head-enabled YOLOv5 with attention mechanism

Enhancing YOLOv5 with Swin Transformer and Multi-Scale Attention for Improved Helmet Detection in Power Grid Construction Sites

ST-CA YOLOv5: Improved YOLOv5 Based on Swin Transformer and Coordinate Attention for Surface Defect Detection