Spacecraft Rendezvous Guidance Method Based on Safe Reinforcement Learning

XING Linquan, XIAO Yingmin, YANG Zhibin, WEI Zhengmin, ZHOU Yong, GAO Saijun

JOURNAL ARTICLE

Year: 2023
Journal: DOAJ (Directory of Open Access Journals)

Abstract

As spacecraft rendezvous and docking tasks grow more complex, the demands on their efficiency, autonomy, and reliability are increasing. In recent years, applying reinforcement learning to spacecraft rendezvous guidance has become an active international research frontier. Obstacle avoidance is critical for safe spacecraft rendezvous, yet general reinforcement learning algorithms do not impose safety constraints during exploration, which makes the design of a spacecraft rendezvous guidance policy challenging. This paper proposes a spacecraft rendezvous guidance method based on safe reinforcement learning. First, a Markov model of autonomous spacecraft rendezvous in collision-avoidance scenarios is designed, and a reward mechanism based on obstacle warning and collision-avoidance constraints is proposed, establishing a safe reinforcement learning framework for solving the rendezvous guidance policy. Second, within this framework, guidance policies are generated using two deep reinforcement learning algorithms: proximal policy optimization (PPO) and deep deterministic policy gradient (DDPG). Experimental results show that the method effectively avoids obstacles and completes the rendezvous with high accuracy. In addition, the performance and generalization ability of the two algorithms are analyzed, further demonstrating the effectiveness of the proposed method.
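To make the idea of a reward mechanism with obstacle warning and a collision-avoidance constraint more concrete, the following is a minimal sketch of what such a reward function might look like. It is not the authors' formulation: the function name `shaped_reward`, the radii, the tolerance, and every reward magnitude are hypothetical values chosen only for illustration, and the abstract does not specify any of them.

```python
import math

# All numeric values are illustrative assumptions, not taken from the paper.
WARNING_RADIUS = 50.0     # m, hypothetical distance at which the obstacle warning starts
COLLISION_RADIUS = 10.0   # m, hypothetical keep-out radius around the obstacle
DOCKING_TOLERANCE = 1.0   # m, hypothetical distance at which rendezvous counts as complete


def shaped_reward(chaser, target, obstacle):
    """Return (reward, done) for one step of a hypothetical rendezvous MDP.

    Each argument is a 3-D position given as a tuple of floats.
    """
    d_target = math.dist(chaser, target)
    d_obstacle = math.dist(chaser, obstacle)

    # Collision-avoidance constraint: entering the keep-out sphere
    # gives a large penalty and terminates the episode.
    if d_obstacle <= COLLISION_RADIUS:
        return -100.0, True

    # Rendezvous completed within tolerance.
    if d_target <= DOCKING_TOLERANCE:
        return 100.0, True

    # Dense guidance term: being closer to the target is better.
    reward = -0.01 * d_target

    # Obstacle warning: the penalty grows linearly from 0 at the warning
    # radius to -1 at the collision radius.
    if d_obstacle < WARNING_RADIUS:
        reward -= (WARNING_RADIUS - d_obstacle) / (WARNING_RADIUS - COLLISION_RADIUS)

    return reward, False
```

In a setup like this, the `(reward, done)` pair would be returned by the environment at every step, and the same environment could be paired with either a PPO or a DDPG learner. The warning term gives the agent a graded signal before the hard constraint is violated, which is one common way to discourage unsafe exploration.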

Keywords:

Rendezvous, Reinforcement learning, Spacecraft, Collision avoidance, Markov decision process, Obstacle avoidance, Trajectory, Obstacle

Related Documents

Meta-reinforcement learning for adaptive spacecraft guidance during finite-thrust rendezvous missions

Run-Time Assured Reinforcement Learning for Safe Spacecraft Rendezvous with Obstacle Avoidance

Autonomous Rendezvous Guidance via Deep Reinforcement Learning

Guidance Trajectories for Spacecraft Rendezvous

Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning