Leonardo BlangerNina S. T. HirataXiaoyi Jiang
We address the problem of training Object Detection models using significantly less bounding box annotated images. For that, we take advantage of cheaper and more abundant image classification data. Our proposal consists in automatically generating artificial detection samples, with no need of expensive detection level supervision, using images with classification labels only. We also detail a pretraining initialization strategy for detection architectures using these artificially synthesized samples, before finetuning on real detection data, and experimentally show how this consistently leads to more data efficient models. With the proposed approach, we were able to effectively use only classification data to improve results on the harder and more supervision hungry object detection problem. We achieve results equivalent to those of the full data scenario using only a small fraction of the original detection data for Face, Bird, and Car detection.
Leonardo BlangerNina S. T. HirataXiaoyi Jiang
Leonardo BlangerNina S. T. HirataXiaoyi Jiang
Blanger, LeonardoHirata, Nina S. T.Xiaoyi Jiang
Minhajul Arifin BadhonIan Stavness
Jeffri Murrugarra-LlerenaLucas N. KirstenCláudio R. Jung