Image Recognition based on Multi-scale Feature Fusion Transformer

Zhefeng Zhu; Ke Qi; Wenbin Chen; Yicong Zhou; Peiyue Li; Zhenxian Liu

doi:10.1109/icaica54878.2022.9844458

Year: 2022
Journal: 2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)
Pages: 7-13

Abstract

Transformer-based image recognition tends to have a low recognition rate because it ignores the local information within image blocks. To address this problem, an image recognition framework based on a multi-scale Feature Fusion Transformer (FFT) is proposed. The FFT block is designed to fuse feature information at different scales, and a residual attention module is introduced to emphasize feature channels and feature regions of interest. The FFT framework avoids the loss of internal structure and local information of image feature blocks that occurs in the vision transformer, and it also captures richer detailed features, which effectively improves the recognition rate. Extensive experiments on the common image recognition datasets Tiny-ImageNet, CIFAR-10 and CIFAR-100 yield recognition accuracies of 57.81%, 82.04% and 56.98%, respectively, which are significantly higher than those of mainstream image recognition algorithms.
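The two ideas named in the abstract, fusing features from several scales and re-weighting them with a residual attention mask, can be illustrated with a small sketch. This is not the authors' FFT block: the abstract gives no architectural details, so the pooling-based fusion, the `(1 + mask) * x` residual form, and all function names (`avg_pool`, `upsample`, `fuse_scales`, `residual_attention`) are assumptions chosen for illustration, working on a single 2D feature map in plain Python.

```python
import math

def avg_pool(grid, k):
    """Average-pool a square 2D list with a k x k window and stride k."""
    n = len(grid) // k
    return [[sum(grid[i * k + a][j * k + b] for a in range(k) for b in range(k)) / (k * k)
             for j in range(n)]
            for i in range(n)]

def upsample(grid, k):
    """Nearest-neighbour upsampling by an integer factor k."""
    return [[grid[i // k][j // k] for j in range(len(grid[0]) * k)]
            for i in range(len(grid) * k)]

def fuse_scales(grid, scales=(1, 2, 4)):
    """Hypothetical fusion: average the map with coarser copies of itself.

    Each scale k is pooled down and then upsampled back to full size,
    so the grid side length must be divisible by every k in `scales`.
    """
    size = len(grid)
    fused = [[0.0] * size for _ in range(size)]
    for k in scales:
        level = upsample(avg_pool(grid, k), k)
        for i in range(size):
            for j in range(size):
                fused[i][j] += level[i][j] / len(scales)
    return fused

def residual_attention(grid):
    """Assumed residual form: out = (1 + mask) * x, with a sigmoid soft mask.

    The mask here is sigmoid(x - mean), a stand-in for a learned mask branch.
    """
    flat = [v for row in grid for v in row]
    mean = sum(flat) / len(flat)
    return [[v * (1.0 + 1.0 / (1.0 + math.exp(-(v - mean)))) for v in row]
            for row in grid]

# Usage: a 4 x 4 feature map fused over three scales, then re-weighted.
feature_map = [[float(i * 4 + j) for j in range(4)] for i in range(4)]
output = residual_attention(fuse_scales(feature_map))
```

Because the mask is added to an identity term, a region with a mask value near zero keeps its original response instead of being suppressed, which is the usual motivation for the residual form. A learned model would replace the fixed pooling and the mean-centred sigmoid with trainable layers.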

Keywords:

Artificial intelligence; Computer science; Pattern recognition (psychology); Fast Fourier transform; Feature (linguistics); Fuse (electrical); Feature extraction; Computer vision; Image fusion; Transformer; Block (permutation group theory); Image (mathematics); Mathematics; Engineering; Algorithm

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.07

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image Fusion Techniques

Physical Sciences → Engineering → Media Technology

Related Documents

CNN-Transformer Based Image Emotion Recognition with Multi-Scale Feature Enhancement

Transformer-based multi-scale gradient feature fusion for low-light image enhancement

TFNet: Transformer-Based Multi-Scale Feature Fusion Forest Fire Image Detection Network

Multi-scale Feature Fusion Based Dongba Character Recognition

Multi-Scale Transformer-Based Feature Combination for Image Retrieval