JOURNAL ARTICLE

SelfVIO: Self-supervised deep monocular Visual–Inertial Odometry and depth estimation

Abstract

In the last decade, numerous supervised deep learning approaches have been proposed for visual-inertial odometry (VIO) and depth map estimation, which require large amounts of labelled data. To overcome the data limitation, self-supervised learning has emerged as a promising alternative that exploits constraints such as geometric and photometric consistency in the scene. In this study, we present a novel self-supervised deep learning-based VIO and depth map recovery approach (SelfVIO) using adversarial training and self-adaptive visual-inertial sensor fusion. SelfVIO learns the joint estimation of 6 degrees-of-freedom (6-DoF) ego-motion and a depth map of the scene from unlabelled monocular RGB image sequences and inertial measurement unit (IMU) readings. The proposed approach is able to perform VIO without requiring IMU intrinsic parameters and/or extrinsic calibration between the IMU and the camera. We provide comprehensive quantitative and qualitative evaluations of the proposed framework and compare its performance with state-of-the-art VIO, VO, and visual simultaneous localization and mapping (VSLAM) approaches on the KITTI, EuRoC and Cityscapes datasets. Detailed comparisons show that SelfVIO outperforms state-of-the-art VIO approaches in terms of pose estimation and depth recovery, making it a promising approach among existing methods in the literature.

Keywords:
Artificial intelligence; Inertial measurement unit; Computer science; Computer vision; Odometry; Monocular; Deep learning; Visual odometry; Simultaneous localization and mapping; RGB color model; Pose; Robot; Mobile robot

Metrics

Cited By: 94
FWCI (Field Weighted Citation Impact): 26.72
Refs: 196
Citation Normalized Percentile: 1.00
Is in top 1%
Is in top 10%

Topics

Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
3D Surveying and Cultural Heritage
Physical Sciences →  Earth and Planetary Sciences →  Geology
