JOURNAL ARTICLE

Image Recognition based on Multi-scale Feature Fusion Transformer

Zhefeng ZhuKe QiWenbin ChenYicong ZhouPeiyue LiZhenxian Liu

Year: 2022 Journal:   2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA) Pages: 7-13

Abstract

Aiming at the problem that image recognition based on transformers has low image recognition rate due to ignoring local information of image blocks, an image recognition framework based on multi-scale Feature Fusion Transformer (FFT) is proposed, where the FFT block is designed to fuse feature information of different scales, and the residual attention module is introduced to emphasize feature channels and feature regions of interest. The FFT framework not only avoids the problem of vision transformer internal structure and local information loss of image feature blocks but also captures richer detailed features, which effectively improves the image recognition rate. A large number of experiments are performed on common image recognition datasets Tiny-ImageNet, CIFAR-10 and CIFAR-100, and the recognition accuracy can reach 57.81%, 82.04% and 56.98%, respectively, which are significantly higher than the mainstream image recognition algorithms.

Keywords:
Artificial intelligence Computer science Pattern recognition (psychology) Fast Fourier transform Feature (linguistics) Fuse (electrical) Feature extraction Computer vision Image fusion Transformer Block (permutation group theory) Image (mathematics) Mathematics Engineering Algorithm

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
9
Refs
0.07
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Multi-scale Feature Fusion Based Dongba Character Recognition

Haini LuoDan XuBing YangHaoyuan Zhang

Journal:   2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE) Year: 2020 Pages: 1571-1575
JOURNAL ARTICLE

Multi-Scale Transformer-Based Feature Combination for Image Retrieval

Carlos Roig MariDavid Varas GonzálezElisenda Bou‐Balust

Journal:   2022 IEEE International Conference on Image Processing (ICIP) Year: 2022 Vol: 30 Pages: 3166-3170
© 2026 ScienceGate Book Chapters — All rights reserved.