JOURNAL ARTICLE

Transformer With Linear-Window Attention for Feature Matching

Zhiwei ShenBin KongXiaoyu Dong

Year: 2023 Journal:   IEEE Access Vol: 11 Pages: 121202-121211   Publisher: Institute of Electrical and Electronics Engineers

Abstract

A transformer can capture long-term dependencies through an attention mechanism, and hence, can be applied to various vision tasks. However, its secondary computational complexity is a major obstacle in vision tasks that require accurate predictions. To address this limitation, this study introduces linear-window attention (LWA), a new attention model for a vision transformer. The transformer computes self-attention that is restricted to nonoverlapping local windows and represented as a linear dot product of kernel feature mappings. Furthermore, the computational complexity of each window is reduced to linear from quadratic using the constraint property of matrix products. In addition, we applied the LWA to feature matching to construct a coarse-to-fine-level detector-free feature matching method, called transformer with linear-window attention for feature matching TRLWAM. At the coarse level, we extracted the dense pixel-level matches, and at the fine level, we obtained the final matching results via multi-head multilayer perceptron refinement. We demonstrated the effectiveness of LWA through Replace experiments. The results showed that the TRLWAM could extract dense matches from low-texture or repetitive pattern regions in indoor environments, and exhibited excellent results with a low computational cost for MegaDepth and HPatches datasets. We believe the proposed LWA can provide new conceptions for transformer applications in visual tasks.

Keywords:
Computer science Transformer Artificial intelligence Computational complexity theory Feature extraction Pattern recognition (psychology) Computer vision Algorithm Voltage

Metrics

1
Cited By
0.18
FWCI (Field Weighted Citation Impact)
84
Refs
0.44
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

ParaFormer: Parallel Attention Transformer for Efficient Feature Matching

Xiaoyong LuYaping YanBin KangСонглин Ду

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2023 Vol: 37 (2)Pages: 1853-1860
JOURNAL ARTICLE

WinMRSI: Feature Matching With Window Attention for Multimodal Remote Sensing Image

Yide DiYun LiaoYunan LiuHao ZhouKaijun ZhuMingyu LuQing DuanJunhui Liu

Journal:   IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Year: 2025 Vol: 18 Pages: 14615-14629
© 2026 ScienceGate Book Chapters — All rights reserved.