Xiang Yu, Ying Qian, Guodong Jin, Zhe Geng, Daiyin Zhu
Multi-view Synthetic Aperture Radar (SAR) imagery provides rich information for target recognition, but fusing features from unaligned multi-view images remains challenging for existing methods. Conventional early-fusion methods rely on image registration, a process that is computationally intensive and can introduce feature distortions. More recent registration-free approaches built on the Transformer architecture are constrained by standard position encodings, which were not designed to represent the rotational relationships among multi-view SAR images and can therefore cause spatial ambiguity. To address this limitation, we propose a registration-free fusion framework based on a spatially aware Transformer with two key components: (1) a multi-view polar-coordinate position encoding that models the geometric relationships of patches both within and across views in a unified coordinate system; and (2) a spatially aware self-attention mechanism that injects this geometric information as a learnable inductive bias. Experiments on our self-developed FAST-Vehicle dataset, which provides full 360° azimuthal coverage, show that our method outperforms both registration-based strategies and Transformer baselines that use conventional position encodings. These results indicate that, for multi-view SAR fusion, explicitly modeling the underlying geometric relationships with a suitable position encoding is an effective alternative to physical image registration or generic, single-image position encodings.
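The abstract's first component can be loosely illustrated as follows: express each patch center in polar coordinates, offset the angular coordinate by the view's azimuth so all views share one frame, and form pairwise radial/angular differences that an attention layer could map to a learnable bias. This is a minimal sketch under our own assumptions (grid size, azimuth handling, difference tables); it is not the paper's implementation.

```python
import numpy as np

def polar_positions(grid, view_azimuth_deg):
    """Polar coordinates of patch centers in one view; the angle is shifted
    by the view's azimuth so all views land in a shared coordinate system.
    (Hypothetical simplification of the multi-view polar position encoding.)"""
    ys, xs = np.meshgrid(np.arange(grid), np.arange(grid), indexing="ij")
    cy = cx = (grid - 1) / 2.0                    # image center
    dy, dx = ys - cy, xs - cx
    r = np.hypot(dy, dx).ravel()                  # radial distance per patch
    theta = np.arctan2(dy, dx).ravel()            # in-view angle per patch
    theta = theta + np.deg2rad(view_azimuth_deg)  # rotate into the shared frame
    return r, np.mod(theta, 2 * np.pi)

def pairwise_geometry(view_azimuths_deg, grid=4):
    """Stack patches from all views and return pairwise (|dr|, wrapped dtheta)
    tables; a spatially aware attention could embed these as a learnable bias
    added to the attention logits."""
    rs, ts = [], []
    for az in view_azimuths_deg:
        r, t = polar_positions(grid, az)
        rs.append(r)
        ts.append(t)
    r = np.concatenate(rs)
    t = np.concatenate(ts)
    dr = np.abs(r[:, None] - r[None, :])                   # radial difference
    dt = np.angle(np.exp(1j * (t[:, None] - t[None, :])))  # wrapped angle diff
    return dr, dt
```

With two views at 0° and 90° azimuth, corresponding patches have zero radial difference and a wrapped angular difference of -90°, so the cross-view rotation is represented explicitly rather than left for registration to remove.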