End-to-End Autonomous Navigation Based on Deep Reinforcement Learning with a Survival Penalty Function

Shyr-Long Jeng; Chienhsun Chiang

doi:10.3390/s23208651

ScienceGate Book Chapters

JOURNAL ARTICLE

End-to-End Autonomous Navigation Based on Deep Reinforcement Learning with a Survival Penalty Function

Shyr-Long Jeng Chienhsun Chiang

Year: 2023 Journal: Sensors Vol: 23 (20)Pages: 8651-8651 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/s23208651

Get Full-Text PDF Get Analytical Report

Abstract

An end-to-end approach to autonomous navigation that is based on deep reinforcement learning (DRL) with a survival penalty function is proposed in this paper. Two actor–critic (AC) frameworks, namely, deep deterministic policy gradient (DDPG) and twin-delayed DDPG (TD3), are employed to enable a nonholonomic wheeled mobile robot (WMR) to perform navigation in dynamic environments containing obstacles and for which no maps are available. A comprehensive reward based on the survival penalty function is introduced; this approach effectively solves the sparse reward problem and enables the WMR to move toward its target. Consecutive episodes are connected to increase the cumulative penalty for scenarios involving obstacles; this method prevents training failure and enables the WMR to plan a collision-free path. Simulations are conducted for four scenarios—movement in an obstacle-free space, in a parking lot, at an intersection without and with a central obstacle, and in a multiple obstacle space—to demonstrate the efficiency and operational safety of our method. For the same navigation environment, compared with the DDPG algorithm, the TD3 algorithm exhibits faster numerical convergence and higher stability in the training phase, as well as a higher task execution success rate in the evaluation phase.

Keywords:

Nonholonomic system Reinforcement learning Obstacle avoidance Computer science Penalty method Obstacle Mobile robot Motion planning Intersection (aeronautics) Function (biology) Collision avoidance Control theory (sociology) Path (computing) Mathematical optimization Simulation Robot Collision Artificial intelligence Engineering Mathematics Control (management) Transport engineering

Metrics

Cited By

2.18

FWCI (Field Weighted Citation Impact)

Refs

0.86

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Robotic Path Planning Algorithms

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Autonomous Vehicle Technology and Safety

Physical Sciences → Engineering → Automotive Engineering

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

End-to-End Autonomous Navigation Based on Deep Reinforcement Learning with a Survival Penalty Function

Abstract

Metrics

Citation History

Topics

Related Documents

End-to-end Multi-Objective Deep Reinforcement Learning for Autonomous Navigation

End-to-End Autonomous Navigation with Modular Architecture and Model-Based Reinforcement Learning

End-to-End Autonomous Driving Decision Based on Deep Reinforcement Learning

End-to-End Deep Reinforcement Learning for Image-Based UAV Autonomous Control

Deep Reinforcement Learning-Based End-to-End Navigation of Mobile Robots with Reward Shaping