Feiqiang Lin, Ze Ji, Changyun Wei, Raphael Grech
Most reinforcement learning (RL)-based approaches to mapless point-goal navigation assume that ground-truth robot poses are available, which is unrealistic in real-world applications. In this work, we remove this assumption and instead rely on observation-based localisation algorithms, such as Lidar-based or visual odometry, for robot self-pose estimation. Although these algorithms have achieved promising performance and are robust in many harsh environments, they can fail to track the robot's location when the observations perceived along its trajectory are insufficient or ambiguous. Using such localisation algorithms therefore introduces new, unstudied problems for mapless navigation. This work proposes a new RL-based algorithm with which robots learn to navigate in a way that prevents localisation failures and avoids getting trapped in local-minimum regions. This ability is learned through two techniques proposed in this work: a reward metric that punishes behaviours resulting in localisation failures, and a reconfigured state representation that combines the current observation with historical trajectory information, transforming the problem from a partially observable Markov decision process (POMDP) into a Markov decision process (MDP) so as to avoid local minima.
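The two techniques in the abstract can be sketched in code. The snippet below is a minimal illustration, not the paper's actual implementation: all constants (`LOC_FAIL_PENALTY`, `LOC_ERROR_THRESHOLD`, etc.), function names, and the specific shaping terms are hypothetical assumptions chosen only to show the idea of penalising localisation failure and of augmenting the state with trajectory history.

```python
import numpy as np

# Hypothetical constants -- the paper does not state its exact values.
GOAL_REWARD = 100.0          # bonus for reaching the goal
LOC_FAIL_PENALTY = -50.0     # punishment when localisation diverges
STEP_PENALTY = -0.05         # small per-step cost encouraging progress
LOC_ERROR_THRESHOLD = 0.5    # metres; estimate-vs-truth gap treated as failure

def reward(reached_goal, est_pose, true_pose, prev_dist, curr_dist):
    """Shaped reward that also punishes localisation failure.

    During training in simulation, the true pose is available, so the
    drift of the odometry estimate can be measured and penalised.
    """
    if reached_goal:
        return GOAL_REWARD
    # Localisation failure: the pose estimate has drifted too far.
    if np.linalg.norm(est_pose - true_pose) > LOC_ERROR_THRESHOLD:
        return LOC_FAIL_PENALTY
    # Progress term: positive when the robot moved closer to the goal.
    return (prev_dist - curr_dist) + STEP_PENALTY

def make_state(scan, goal_vec, history):
    """Augment the current observation with recent trajectory history,
    moving the decision problem from a POMDP towards an MDP."""
    return np.concatenate([scan, goal_vec, np.ravel(history)])
```

In this sketch the drift-based penalty can only be computed in simulation, where ground truth exists; at deployment time the learned policy no longer needs it, which is consistent with the abstract's goal of navigating without assuming ground-truth poses.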