BEVTransFusion: LiDAR-Camera Fusion Under Bird’s-Eye-View for 3D Object Detection with Transformers

Kaiqi Feng; Yu Zhang

doi:10.1109/prml59573.2023.10348338

ScienceGate Book Chapters

JOURNAL ARTICLE

BEVTransFusion: LiDAR-Camera Fusion Under Bird’s-Eye-View for 3D Object Detection with Transformers

Kaiqi Feng Yu Zhang

Year: 2023 Pages: 21-28

DOI: 10.1109/prml59573.2023.10348338

Get Full-Text PDF Get Analytical Report

Abstract

Recently, there is growing research interest in extracting Bird's-Eye-View (BEV) features from images and LiDAR to improve 3D object detection. However, existing methods mainly combine the features mechanically, which limits the utilization of BEV features. To address this limitation, we draw inspiration from TransFusion and design a two-layer transformer decoder to fuse LiDAR and camera BEV features. By doing so, we can omit the steps of reference point backprojection and feature sampling, which results in better correlation between the fused LiDAR and image features and higher robustness to the calibration matrix. Furthermore, we add 3D position encoding to the BEV features to compensate for the lack of height information. We also propose an length-width-height modulated attention mechanism to incorporate scale information. We also perform comprehensive experiments to verify the effectiveness of our methods.

Keywords:

Lidar Computer vision Artificial intelligence Computer science Transformer Object detection Fusion Remote sensing Engineering Geography Pattern recognition (psychology) Electrical engineering Voltage

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.18

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Industrial Vision Systems and Defect Detection

Physical Sciences → Engineering → Industrial and Manufacturing Engineering

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

BEVTransFusion: LiDAR-Camera Fusion Under Bird’s-Eye-View for 3D Object Detection with Transformers

Abstract

Metrics

Topics

Related Documents

Lift-Attend-Splat: Bird’s-eye-view camera-lidar fusion using transformers

CL-fusionBEV: 3D object detection method with camera-LiDAR fusion in Bird’s Eye View

CPMFusion: LiDAR-camera fusion framework for 3D object detection in bird’s eye view space

Free Space Detection Using Camera-LiDAR Fusion in a Bird’s Eye View Plane

Radar–Camera Fusion in Perspective View and Bird’s Eye View for 3D Object Detection