JOURNAL ARTICLE

Discriminative Sampling of Proposals in Self-Supervised Transformers for Weakly Supervised Object Localization

Abstract

Drones are employed in a growing number of visual recognition applications. A recent development in cell tower inspection is drone-based asset surveillance, where the autonomous flight of a drone is guided by localizing objects of interest in successive aerial images. In this paper, we propose a method to train deep weakly-supervised object localization (WSOL) models based only on image-class labels to locate object with high confidence. To train our localizer, pseudo labels are efficiently harvested from a self-supervised vision transformers (SSTs). However, since SSTs decompose the scene into multiple maps containing various object parts, and do not rely on any explicit super-visory signal, they cannot distinguish between the object of interest and other objects, as required WSOL. To address this issue, we propose leveraging the multiple maps generated by the different transformer heads to acquire pseudo-labels for training a deep WSOL model. In particular, a new Discriminative Proposals Sampling (DiPS) method is introduced that relies on a CNN classifier to identify discriminative regions. Then, foreground and background pixels are sampled from these regions in order to train a WSOL model for generating activation maps that can accurately localize objects belonging to a specific class. Empirical results 1 1 Our code is available: https://github.com/shakeebmurtaza/dips on the challenging TelDrone dataset indicate that our proposed approach can outperform state-of-art methods over a wide range of threshold values over produced maps. We also computed results on CUB dataset, showing that our method can be adapted for other tasks.

Keywords:
Discriminative model Artificial intelligence Computer science Classifier (UML) Pattern recognition (psychology) Object detection Computer vision Drone Pixel Object (grammar) Transformer Engineering

Metrics

14
Cited By
2.55
FWCI (Field Weighted Citation Impact)
66
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Adversarial Transformers for Weakly Supervised Object Localization

Meng MengTianzhu ZhangZhe ZhangYongdong ZhangFeng Wu

Journal:   IEEE Transactions on Image Processing Year: 2022 Vol: 31 Pages: 7130-7143
JOURNAL ARTICLE

Recurrent self-optimizing proposals for weakly supervised object detection

Ming ZhangBing Zeng

Journal:   Neural Computing and Applications Year: 2022 Vol: 35 (1)Pages: 757-771
JOURNAL ARTICLE

Generalized Weakly Supervised Object Localization

Dingwen ZhangGuangyu GuoWenyuan ZengLei LiJunwei Han

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2022 Vol: 35 (4)Pages: 5395-5406
© 2026 ScienceGate Book Chapters — All rights reserved.