JOURNAL ARTICLE

Object Detector with Multi-head Self-attention and Multi-scale Fusion

Abstract

Aiming at the problems that the existing object detection network cannot fully extract the internal correlation of features and the multi-scale feature fusion is insufficient, an object detector that introduces multi-head self-attention and multi-scale fusion is proposed. Firstly, the multi-head self-attention mechanism is introduced, and the internal correlation of the extracted features is strengthened through the multi-head self-attention mechanism, and the dependence of the features on the external information is reduced. Secondly, the DenseASPP module is integrated in the designed network, and the ability of multi-scale feature fusion is improved through dilated convolution with different dilatation rate. The experiments of the proposed object detection algorithm on the PASCAL VOC 2007 dataset show that the overall accuracy is greatly improved compared with other advanced algorithms.

Keywords:
Pascal (unit) Computer science Artificial intelligence Fusion Convolution (computer science) Detector Pattern recognition (psychology) Head (geology) Feature (linguistics) Object (grammar) Object detection Fusion mechanism Scale (ratio) Computer vision Artificial neural network

Metrics

3
Cited By
0.37
FWCI (Field Weighted Citation Impact)
28
Refs
0.56
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.