Unsupervised domain adaptation (UDA) in videos is a challenging task that remains largely unexplored compared to image-based UDA techniques. Although vision transformers (ViT) achieve state-of-the-art performance in many computer vision tasks, their use in video domain adaptation has received little attention. Our key idea is to use transformer layers as a feature encoder and to incorporate spatial and temporal transferability relationships into the attention mechanism. We then develop the Transferable-guided Attention (TransferAttn) framework, which exploits the capacity of the transformer to adapt cross-domain knowledge across different backbones. To improve the transferability of ViT, we introduce a novel and effective module, the Domain Transferable-guided Attention Block (DTAB), which compels ViT to focus on the spatio-temporal transferability relationships among video frames by replacing the self-attention mechanism with a transferability attention mechanism. Experiments conducted on the UCF-HMDB and Kinetics-NEC Drone datasets, with different backbones such as I3D and STAM, show that TransferAttn outperforms state-of-the-art approaches. We also demonstrate that DTAB yields performance gains when applied to other ViT-based methods for video UDA.
André Sacilotti, Samuel Felipe dos Santos, Nicu Sebe, Jurandy Almeida
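The abstract states that DTAB replaces standard self-attention with a transferability-guided attention over video frames, but it does not give the exact formulation. The sketch below is a minimal, illustrative interpretation, assuming (as in prior transferable-attention work) that per-frame transferability is estimated from the entropy of a small domain discriminator and used to re-weight the attention output; the module and variable names (`FrameDomainDiscriminator`, `TransferabilityGuidedAttention`) are hypothetical and not taken from the paper.

```python
import torch
import torch.nn as nn


class FrameDomainDiscriminator(nn.Module):
    """Predicts, per frame token, the probability of belonging to the source domain."""

    def __init__(self, dim: int, hidden: int = 256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(inplace=True), nn.Linear(hidden, 1)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (B, T, D)
        return torch.sigmoid(self.net(x)).squeeze(-1)    # (B, T)


class TransferabilityGuidedAttention(nn.Module):
    """Self-attention over frame tokens, re-weighted by per-frame transferability.

    Assumption: frames whose domain is hard to discriminate (high discriminator
    entropy) are more transferable and therefore receive larger weights.
    """

    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.discriminator = FrameDomainDiscriminator(dim)
        self.norm = nn.LayerNorm(dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (B, T, D)
        # Standard self-attention across the temporal dimension.
        attn_out, _ = self.attn(x, x, x)

        # Per-frame transferability: entropy of the domain prediction,
        # normalized to [0, 1] (Bernoulli entropy is at most log 2).
        p = self.discriminator(x)                                   # (B, T)
        entropy = -(p * (p + 1e-8).log() + (1 - p) * (1 - p + 1e-8).log())
        w = (entropy / torch.log(torch.tensor(2.0))).unsqueeze(-1)  # (B, T, 1)

        # Residual connection with transferability-weighted attention output.
        return self.norm(x + w * attn_out)


if __name__ == "__main__":
    frames = torch.randn(2, 16, 512)   # (batch, frames, embedding dim)
    block = TransferabilityGuidedAttention(dim=512)
    print(block(frames).shape)         # torch.Size([2, 16, 512])
```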