Shuyao He, Yue Zhu, Yushan Dong, Hao Qin, Yuhong Mo
In this project, we present a novel approach to depth perception using a monocular camera by incorporating information from both RGB and LiDAR modalities. Our primary objective is to investigate the performance and effectiveness of different techniques for generating accurate depth estimates. We first implemented a Swin Transformer-based depth estimation model and evaluated its performance on the KITTI dataset, which contains RGB images and their corresponding ground-truth depth maps. Next, we proposed an RGB-LiDAR fusion model. We performed the necessary preprocessing steps on the dataset, such as resizing, normalization, and data augmentation, and trained both models with identical configurations for a fair comparison. We evaluated the models on the test set using metrics such as mean absolute error (MAE) and root mean squared error (RMSE). Our results demonstrate that the proposed RGB-LiDAR fusion model achieves superior depth estimation performance compared to the original Swin Transformer-based model, indicating the potential benefits of RGB-LiDAR fusion for monocular depth perception tasks. This study offers valuable insights into the strengths and weaknesses of combining RGB and LiDAR inputs and lays the foundation for future research in monocular depth perception, aiming to further improve model architectures and training techniques.
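For reference, the two evaluation metrics named above can be computed as in the following minimal sketch. This is an illustrative implementation, not the authors' evaluation code; it assumes the predicted and ground-truth depth maps are NumPy arrays of equal shape and that unannotated ground-truth pixels are encoded as zero, as is common for KITTI.

```python
import numpy as np

def depth_errors(pred: np.ndarray, gt: np.ndarray) -> tuple[float, float]:
    """Compute MAE and RMSE over valid ground-truth pixels.

    Assumption: invalid (unannotated) pixels in the KITTI ground
    truth are encoded as 0 and are excluded from the evaluation.
    """
    mask = gt > 0                       # keep only pixels with LiDAR ground truth
    diff = pred[mask] - gt[mask]
    mae = float(np.abs(diff).mean())            # mean absolute error
    rmse = float(np.sqrt((diff ** 2).mean()))   # root mean squared error
    return mae, rmse
```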