JOURNAL ARTICLE

Self-Supervised Multi-Frame Monocular Depth Estimation for Dynamic Scenes

Guanghui WuHao LiuLongguang WangKunhong LiYulan GuoZengping Chen

Year: 2023 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 34 (6)Pages: 4989-5001   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Self-supervised multi-frame depth estimation outperforms single-frame approaches by utilizing not only appearance information, but also geometric information. A common practice for multi-frame methods is to employ feature-metric bundle adjustment (FBA) to refine depth map initialized from the single-frame prior. However, FBA cannot always provide effective residual updates due to unreliable matching costs, which are corrupted by thin texture, occlusion, and especially object motion. To tackle this problem, we propose a context-aware transformer (CAT) to refine the corrupted matching costs by leveraging the spatial context information. Specifically, the CAT adaptively aggregates matching costs according to the spatial affinity inferred from local appearance context, and produces reliable contextual costs for FBA. Moreover, we design a motion-aware regularization loss to provide supervision for regions with moving objects, making CAT competent for dynamic scenes. Extensive experiments and analyses on the KITTI and Cityscapes datasets demonstrate the effectiveness and superior generalization capability of our approach.

Keywords:
Artificial intelligence Computer science Computer vision Inter frame Motion estimation Feature matching Matching (statistics) Monocular Context (archaeology) Pattern recognition (psychology) Frame (networking) Feature extraction Reference frame Mathematics

Metrics

11
Cited By
2.00
FWCI (Field Weighted Citation Impact)
80
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Optical measurement and interference techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering

Related Documents

JOURNAL ARTICLE

EDS-Depth: Enhancing Self-Supervised Monocular Depth Estimation in Dynamic Scenes

Shangshu YuMeiqing WuSiew-Kei LamChangshuo WangRuiping Wang

Journal:   IEEE Transactions on Intelligent Transportation Systems Year: 2025 Vol: 26 (4)Pages: 5585-5597
JOURNAL ARTICLE

LungDepth: Self‐Supervised Multi‐Frame Monocular Depth Estimation for Bronchoscopy

Jingsheng XuBo GuanJianchang ZhaoBo YiJianmin Li

Journal:   International Journal of Medical Robotics and Computer Assisted Surgery Year: 2025 Vol: 21 (1)Pages: e70050-e70050
JOURNAL ARTICLE

SC-DepthV3: Robust Self-Supervised Monocular Depth Estimation for Dynamic Scenes

Libo SunJia-Wang BianHuangying ZhanWei YinIan ReidChunhua Shen

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2023 Vol: 46 (1)Pages: 497-508
© 2026 ScienceGate Book Chapters — All rights reserved.