Activity Driven Weakly Supervised Object Detection

Zhenheng Yang; Dhruv Mahajan; Deepti Ghadiyaram; Ram Nevatia; Vignesh Ramanathan

doi:10.1109/cvpr.2019.00303

ScienceGate Book Chapters

JOURNAL ARTICLE

Activity Driven Weakly Supervised Object Detection

Zhenheng Yang Dhruv Mahajan Deepti Ghadiyaram Ram Nevatia Vignesh Ramanathan

Year: 2019 Pages: 2912-2921

DOI: 10.1109/cvpr.2019.00303

Get Full-Text PDF Get Analytical Report

Abstract

Weakly supervised object detection aims at reducing the amount of supervision required to train detection models. Such models are traditionally learned from images/videos labelled only with the object class and not the object bounding box. In our work, we try to leverage not only the object class labels but also the action labels associated with the data. We show that the action depicted in the image/video can provide strong cues about the location of the associated object. We learn a spatial prior for the object dependent on the action (e.g. "ball" is closer to "leg of the person" in "kicking ball"), and incorporate this prior to simultaneously train a joint object detection and action classification model. We conducted experiments on both video datasets and image datasets to evaluate the performance of our weakly supervised object detection model. Our approach outperformed the current state-of-the-art (SOTA) method by more than 6% in mAP on the Charades video dataset.

Keywords:

Artificial intelligence Object detection Computer science Minimum bounding box Leverage (statistics) Object (grammar) Bounding overwatch Computer vision Pattern recognition (psychology) Viola–Jones object detection framework Machine learning Image (mathematics) Face detection Facial recognition system

Metrics

Cited By

2.67

FWCI (Field Weighted Citation Impact)

Refs

0.92

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Activity Driven Weakly Supervised Object Detection

Abstract

Metrics

Citation History

Topics

Related Documents

Activity and Relationship Modeling Driven Weakly Supervised Object Detection

Weakly Supervised Open-Vocabulary Object Detection

Misclassification in Weakly Supervised Object Detection

Weakly Supervised Video Salient Object Detection

Weakly-supervised Human-object Interaction Detection