JOURNAL ARTICLE

Zero-Shot Object Detection With Transformers

Abstract

Deep learning has significantly improved the precision of object detection with abundant labeled data. However, collecting and labeling sufficient data is extremely hard. Zero-shot object detection (ZSD) has been proposed to solve this problem which aims to simultaneously recognize and localize both seen and unseen objects. Recently, the transformer and its variant architectures have shown their effectiveness over conventional methods in many natural language processing and computer vision tasks. In this paper, we study the ZSD task and develop a new framework named zero-shot object detection with transformers (ZSDTR). ZSDTR consists of the head network, transformer encoder, transformer decoder, and the vision-semantic-attention trail network. We find that the transformer is very effective for improving the ability to recall unseen objects and the tail performs well for discriminating seen and unseen objects. To the best of our knowledge, our ZSDTR is the first method to use the transformer in ZSD task. Extensive experimental results on various zero-shot object detection benchmarks show that our ZSDTR outperforms the current state-of-the-art methods.

Keywords:
Transformer Computer science Encoder Object detection Artificial intelligence Computer vision Pattern recognition (psychology) Engineering Electrical engineering Voltage

Metrics

6
Cited By
0.41
FWCI (Field Weighted Citation Impact)
27
Refs
0.61
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Zero-Shot Object Detection with Textual Descriptions

Zhihui LiLina YaoXiaoqin ZhangXianzhi WangSalil S. KanhereHuaxiang Zhang

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2019 Vol: 33 (01)Pages: 8690-8697
JOURNAL ARTICLE

Zero-Shot Camouflaged Object Detection

Haoran LiChun-Mei FengYong XuTao ZhouLina YaoXiaojun Chang

Journal:   IEEE Transactions on Image Processing Year: 2023 Vol: 32 Pages: 5126-5137
JOURNAL ARTICLE

Zero-Shot Object Detection with Multi-label Context

Yongxian WeiYong Ma

Journal:   Proceedings/Proceedings of the ... International Conference on Software Engineering and Knowledge Engineering Year: 2022 Vol: 2022 Pages: 142-146
© 2026 ScienceGate Book Chapters — All rights reserved.