FIFA3D: Flow-Guided Feature Aggregation for Temporal Three-Dimensional Object Detection

Ruiqi Ma; Chunwei Wang; Chi Chen; Yihan Zeng; Bijun Li; Qin Zou; Qingqiu Huang; Xinge Zhu; Hang Xu

doi:10.3390/rs17030380

ScienceGate Book Chapters

JOURNAL ARTICLE

FIFA3D: Flow-Guided Feature Aggregation for Temporal Three-Dimensional Object Detection

Ruiqi Ma Chunwei Wang Chi Chen Yihan Zeng Bijun Li Qin Zou Qingqiu Huang Xinge Zhu Hang Xu

Year: 2025 Journal: Remote Sensing Vol: 17 (3)Pages: 380-380 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/rs17030380

Get Full-Text PDF Get Analytical Report

Abstract

Detecting accurate 3D bounding boxes from LiDAR point clouds is crucial for autonomous driving. Recent studies have shown the superiority of the performance of multi-frame 3D detectors, yet eliminating the misalignment across frames and effectively aggregating spatiotemporal information are still challenging problems. In this paper, we present a novel flow-guided feature aggregation scheme for 3D object detection (FIFA3D) to align cross-frame information. FIFA3D first leverages optical flow with supervised signals to model the pixel-to-pixel correlations between sequential frames. Considering the sparse nature of bird’s-eye-view feature maps, an additional classification branch is adopted to provide explicit pixel-wise clues. Meanwhile, we utilize multi-scale feature maps and predict flow in a coarse-to-fine manner. With guidance from the estimated flow, historical features can be well aligned to the current situation, and a cascade fusion strategy is introduced to benefit the following detection. Extensive experiments show that FIFA3D surpasses the single-frame baseline with remarkable margins of +10.8% mAPH and +6.8% mAP on the Waymo and nuScenes validation datasets and performs well compared with state-of-the-art methods.

Keywords:

Computer science Feature (linguistics) Remote sensing Artificial intelligence Geology

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.01

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

FIFA3D: Flow-Guided Feature Aggregation for Temporal Three-Dimensional Object Detection

Abstract

Metrics

Topics

Related Documents

Flow-Guided Feature Aggregation for Video Object Detection

Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection

Attention-Guided Disentangled Feature Aggregation for Video Object Detection

Guided Sampling Based Feature Aggregation for Video Object Detection

Boundary-Guided Feature Aggregation Network for Salient Object Detection