Flow-Guided Feature Aggregation for Video Object Detection

Xizhou Zhu; Yujie Wang; Jifeng Dai; Lu Yuan; Yichen Wei

doi:10.1109/iccv.2017.52

ScienceGate Book Chapters

JOURNAL ARTICLE

Flow-Guided Feature Aggregation for Video Object Detection

Xizhou Zhu Yujie Wang Jifeng Dai Lu Yuan Yichen Wei

Year: 2017 Pages: 408-417

DOI: 10.1109/iccv.2017.52

Get Full-Text PDF Get Analytical Report

Abstract

Extending state-of-the-art object detectors from image to video is challenging. The accuracy of detection suffers from degenerated object appearances in videos, e.g., motion blur, video defocus, rare poses, etc. Existing work attempts to exploit temporal information on box level, but such methods are not trained end-to-end. We present flow-guided feature aggregation, an accurate and end-to-end learning framework for video object detection. It leverages temporal coherence on feature level instead. It improves the per-frame features by aggregation of nearby features along the motion paths, and thus improves the video recognition accuracy. Our method significantly improves upon strong singleframe baselines in ImageNet VID [33], especially for more challenging fast moving objects. Our framework is principled, and on par with the best engineered systems winning the ImageNet VID challenges 2016, without additional bells-and-whistles. The code would be released.

Keywords:

Computer science Artificial intelligence Computer vision Motion blur Feature (linguistics) Object detection Exploit Optical flow Object (grammar) Frame (networking) Coherence (philosophical gambling strategy) Motion (physics) Video tracking Feature extraction Image (mathematics) Pattern recognition (psychology)

Metrics

655

Cited By

28.22

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Flow-Guided Feature Aggregation for Video Object Detection

Abstract

Metrics

Citation History

Topics

Related Documents

Attention-Guided Disentangled Feature Aggregation for Video Object Detection

Guided Sampling Based Feature Aggregation for Video Object Detection

Boundary-guided video salient object detection with multi-feature aggregation

Relation-Guided Multi-stage Feature Aggregation Network for Video Object Detection

FIFA3D: Flow-Guided Feature Aggregation for Temporal Three-Dimensional Object Detection