Semi-DETR: Semi-Supervised Object Detection with Detection Transformers

Jiacheng Zhang; Xiangru Lin; Wei Zhang; Kuo Wang; Xiao Tan; Junyu Han; Errui Ding; Jingdong Wang; Guanbin Li

doi:10.1109/cvpr52729.2023.02280

JOURNAL ARTICLE

Year: 2023 Pages: 23809-23818

Abstract

We analyze the DETR-based framework on semi-supervised object detection (SSOD) and observe that (1) the one-to-one assignment strategy generates incorrect matching when the pseudo ground-truth bounding box is inaccurate, leading to training inefficiency; (2) DETR-based detectors lack deterministic correspondence between the input query and its prediction output, which hinders the applicability of the consistency-based regularization widely used in current SSOD methods. We present Semi-DETR, the first transformer-based end-to-end semi-supervised object detector, to tackle these problems. Specifically, we propose a Stage-wise Hybrid Matching strategy that combines the one-to-many assignment and one-to-one assignment strategies to improve the training efficiency of the first stage and thus provide high-quality pseudo labels for the training of the second stage. Besides, we introduce a Cross-view Query Consistency method to learn the semantic feature invariance of object queries from different views while avoiding the need to find deterministic query correspondence. Furthermore, we propose a Cost-based Pseudo Label Mining module to dynamically mine more pseudo boxes based on the matching cost of pseudo ground truth bounding boxes for consistency training. Extensive experiments on all SSOD settings of both COCO and Pascal VOC benchmark datasets show that our Semi-DETR method outperforms all state-of-the-art methods by clear margins.
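The stage-wise idea in the abstract (one-to-many assignment in an early stage, one-to-one assignment afterwards) can be illustrated with a toy sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names, the `switch_step` and `k` parameters, the plain top-k rule for one-to-many assignment, and the brute-force search standing in for the Hungarian algorithm that DETR-style detectors normally use. The cost matrix is taken as given; how the paper computes matching cost is not stated in the abstract.

```python
from itertools import permutations


def one_to_many(cost, k):
    """Assign each pseudo box its k lowest-cost predictions.

    cost[i][j] is the matching cost between pseudo box i and prediction j.
    A prediction may be picked by several boxes here; a real system would
    need a rule to resolve such conflicts (not shown).
    """
    pairs = []
    for i, row in enumerate(cost):
        ranked = sorted(range(len(row)), key=row.__getitem__)
        pairs.extend((i, j) for j in ranked[:k])
    return pairs


def one_to_one(cost):
    """Assign each pseudo box exactly one distinct prediction, minimizing
    total cost. Brute force over permutations: toy sizes only, and it
    assumes there are at least as many predictions as pseudo boxes.
    """
    n_boxes, n_preds = len(cost), len(cost[0])
    best = min(
        permutations(range(n_preds), n_boxes),
        key=lambda p: sum(cost[i][p[i]] for i in range(n_boxes)),
    )
    return [(i, best[i]) for i in range(n_boxes)]


def stagewise_hybrid_match(cost, step, switch_step, k=2):
    """Use one-to-many matching before switch_step, one-to-one after."""
    if step < switch_step:
        return one_to_many(cost, k)
    return one_to_one(cost)


if __name__ == "__main__":
    # 2 pseudo boxes, 4 predictions (hypothetical costs).
    cost = [
        [0.9, 0.1, 0.2, 0.8],
        [0.3, 0.15, 0.7, 0.6],
    ]
    print(stagewise_hybrid_match(cost, step=0, switch_step=10))
    print(stagewise_hybrid_match(cost, step=10, switch_step=10))
```

In this toy example the early stage gives each pseudo box two candidate predictions, so a slightly inaccurate pseudo box still supervises more than one query, while the later stage returns a single prediction per box, which is what allows a DETR-style detector to drop duplicate-removal post-processing.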

Keywords:

Computer science; Pascal (unit); Object detection; Ground truth; Data mining; Artificial intelligence; Tuple; Benchmark (surveying); Transformer; Machine learning; Pattern recognition (psychology); Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 9.64

Citation Normalized Percentile: 0.98

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection

Omni-DETR: Omni-Supervised Object Detection with Transformers

Efficient Semi-DETR: Real Time End-to-End Semi-Supervised Object Detection

Semi-supervised Object Detection with Unlabeled Data