Guanghui Wu, Hao Liu, Longguang Wang, Kunhong Li, Yulan Guo, Zengping Chen
Self-supervised multi-frame depth estimation outperforms single-frame approaches by exploiting not only appearance information but also geometric information. A common practice in multi-frame methods is to employ feature-metric bundle adjustment (FBA) to refine the depth map initialized from a single-frame prior. However, FBA cannot always provide effective residual updates due to unreliable matching costs, which are corrupted by thin texture, occlusion, and especially object motion. To tackle this problem, we propose a context-aware transformer (CAT) that refines the corrupted matching costs by leveraging spatial context information. Specifically, the CAT adaptively aggregates matching costs according to the spatial affinity inferred from local appearance context, producing reliable contextual costs for FBA. Moreover, we design a motion-aware regularization loss that provides supervision for regions containing moving objects, making the CAT competent for dynamic scenes. Extensive experiments and analyses on the KITTI and Cityscapes datasets demonstrate the effectiveness and superior generalization capability of our approach.