JOURNAL ARTICLE

Efficiently Fusing Sparse Lidar for Enhanced Self-Supervised Monocular Depth Estimation

Abstract

Monocular self-supervised depth estimation with a low-cost sensor is the mainstream solution to gathering dense depth maps for robots and autonomous driving. In this paper, based on the philosophy "less is more" (i.e., focusing only on valid pixels in sparse LiDAR), we propose a novel framework, Efficient Sparse Depth (EffisDepth), for predicting dense depth. The Sparse Feature Extractor (SFE) embedded in the proposed framework effectively handles sparse LiDAR by forming sparse tensors. The Slender Group Block (SGB) is the main building block in SFE, which extracts features from sparse tensors via a structure of two branches. Extensive experiments show that our method achieves state-of-the-art performance on the KITTI benchmark, demonstrating the effectiveness of each proposed component and the self-supervised learning framework.

Keywords:
Lidar Benchmark (surveying) Artificial intelligence Computer science Monocular Block (permutation group theory) Feature (linguistics) Sparse matrix Component (thermodynamics) Pattern recognition (psychology) Extractor Pixel Feature extraction Computer vision Mathematics Remote sensing Engineering Geology

Metrics

2
Cited By
0.36
FWCI (Field Weighted Citation Impact)
29
Refs
0.51
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Optical measurement and interference techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.