Shibao Li, Yixuan Liu, Zhaoyu Wang, Xuerong Cui, Yunwu Zhang, Z. Jiao, Jinze Zhu
The Transformer is one of the mainstream architectures in computer vision. Most Transformer-based designs focus on spatial attention and on reducing the computational cost incurred by high-resolution images, but pay little attention to modeling channel dependencies or to the cost incurred by a large number of channels. In this paper, we propose a new channel-window-based self-attention mechanism and apply two consecutive Transformer layers, with a channel-permuting layer in between, to capture global channel dependencies, which greatly reduces the computational complexity caused by a large number of channels. We also propose a new linear layer for channel attention that eliminates the need for positional bias in the Transformer. The proposed method can be conveniently appended in parallel to existing image classification architectures with minimal modification. We demonstrate its feasibility on state-of-the-art Transformer-based image classification architectures and improve the results on ImageNet-1K. The code will be publicly available on GitHub.
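The mechanism the abstract describes (self-attention restricted to windows along the channel axis, with a channel permutation between two consecutive layers so that information propagates across windows) can be sketched as follows. This is a minimal NumPy illustration under stated assumptions: the function names, the window size, and the single-head, projection-free attention are hypothetical simplifications, not the paper's exact formulation.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def channel_window_attention(x, window):
    """Self-attention among channels within non-overlapping channel windows.

    x: (L, C) feature map flattened over spatial positions.
    Each channel (a length-L vector) acts as a token; attention is
    restricted to windows of `window` channels, so the score matrices
    are (window x window) instead of (C x C).
    """
    L, C = x.shape
    assert C % window == 0, "C must be divisible by the window size"
    out = np.empty_like(x)
    xt = x.T  # (C, L): channels as tokens
    for s in range(0, C, window):
        w = xt[s:s + window]                    # (window, L)
        attn = softmax(w @ w.T / np.sqrt(L))    # (window, window) scores
        out[:, s:s + window] = (attn @ w).T     # mix channels in the window
    return out

def permute_channels(x, window):
    """Channel shuffle: interleave channel groups so the next layer's
    windows contain channels drawn from different previous windows."""
    L, C = x.shape
    g = C // window
    return x.reshape(L, g, window).transpose(0, 2, 1).reshape(L, C)

# Two consecutive windowed layers with a permutation in between
# approximate global channel mixing at windowed cost.
x = np.random.randn(16, 8)   # 16 spatial positions, 8 channels
y = channel_window_attention(x, window=4)
y = permute_channels(y, window=4)
y = channel_window_attention(y, window=4)
```

After the shuffle, every size-4 window in the second layer sees channels originating from both windows of the first layer, which is how two layers can capture global channel dependencies without a full C-by-C attention matrix.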