Adversarial Transformers for Weakly Supervised Object Localization

Meng Meng; Tianzhu Zhang; Zhe Zhang; Yongdong Zhang; Feng Wu

doi:10.1109/tip.2022.3220055

ScienceGate Book Chapters

JOURNAL ARTICLE

Adversarial Transformers for Weakly Supervised Object Localization

Meng Meng Tianzhu Zhang Zhe Zhang Yongdong Zhang Feng Wu

Year: 2022 Journal: IEEE Transactions on Image Processing Vol: 31 Pages: 7130-7143 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tip.2022.3220055

Get Full-Text PDF Get Analytical Report

Abstract

Weakly supervised object localization (WSOL) aims at localizing objects with only image-level labels, which has better scalability and practicability than fully supervised methods. However, without pixel-level supervision, existing methods tend to generate rough localization maps, which hinders localization performance. To alleviate this problem, we propose an adversarial transformer network (ATNet), which aims to obtain a well-learned localization model with pixel-level pseudo labels. The proposed ATNet enjoys several merits. First, we design an object transformer ( G ) that can generate localization maps and pseudo labels effectively and dynamically, and a part transformer ( D ) to accurately discriminate detailed local differences between localization maps and pseudo labels. Second, we propose to train G and D via an adversarial process, where G can generate more accurate localization maps approaching pseudo labels to fool D . To the best of our knowledge, this is the first work to explore transformers with adversarial training to obtain a well-learned localization model for WSOL. Extensive experiments with four backbones on two standard benchmarks demonstrate that our ATNet achieves favorable performance against state-of-the-art WSOL methods. Besides, our adversarial training can provide higher robustness against adversarial attacks.

Keywords:

Adversarial system Artificial intelligence Computer science Scalability Robustness (evolution) Transformer Pixel Pattern recognition (psychology) Computer vision Engineering

Metrics

Cited By

1.37

FWCI (Field Weighted Citation Impact)

104

Refs

0.79

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Adversarial Robustness in Machine Learning

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Adversarial Transformers for Weakly Supervised Object Localization

Abstract

Metrics

Citation History

Topics

Related Documents

Adversarial Complementary Learning for Weakly Supervised Object Localization

Entropy guided adversarial model for weakly supervised object localization

Leveraging Transformers for Weakly Supervised Object Localization in Unconstrained Videos

Weakly Supervised Object Localization Using Self-Paced Pyramid Adversarial Learning

Discriminative Sampling of Proposals in Self-Supervised Transformers for Weakly Supervised Object Localization