Spatio-Temporal Transformer-Based Reinforcement Learning for Robot Crowd Navigation

Haodong He; Hao Fu; Qiang Wang; Shuai Zhou; Wei Liu; Chen Yang

doi:10.1109/robio58561.2023.10355042

Year: 2023 Pages: 1-7

Abstract

Ensuring that robots move safely and adhere to social norms in dynamic human environments is a crucial step towards autonomous robot decision-making. Existing work generally uses two separate modules, connected in series, to capture spatial and temporal interactions respectively. However, such designs make it harder to exploit spatio-temporal features fully and to reduce the conservatism of the navigation policy. In light of this, this paper proposes a spatio-temporal Transformer-based policy optimization algorithm that preserves human-robot interactions more effectively. Specifically, a gated embedding mechanism is introduced to fuse the spatial and temporal representations by integrating both modalities at the feature level. A Transformer is then leveraged to encode the spatio-temporal semantic information, with the aim of finding the optimal navigation policy. Finally, the combination of the spatio-temporal Transformer and self-adjusting policy entropy significantly reduces the conservatism of the navigation policy. Experimental results demonstrate the superiority of the proposed algorithm over state-of-the-art methods.
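The abstract does not give the exact form of the gated embedding mechanism. As a rough illustration only, the sketch below shows one common way to fuse two feature vectors at the feature level: a sigmoid gate computed from the concatenated spatial and temporal features, followed by an elementwise convex combination. The function name `gated_fusion`, the weight layout, and the random inputs are assumptions made for this example and are not taken from the paper.

```python
import math
import random


def sigmoid(x: float) -> float:
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(spatial, temporal, weights, bias):
    """Fuse two equal-length feature vectors with an elementwise gate.

    Hypothetical formulation (not confirmed by the paper):
        gate  = sigmoid(W [spatial; temporal] + b)
        fused = gate * spatial + (1 - gate) * temporal
    `weights` has d rows of length 2d; `bias` has length d.
    """
    joint = list(spatial) + list(temporal)
    fused = []
    for i, (s, t) in enumerate(zip(spatial, temporal)):
        pre_activation = sum(w * x for w, x in zip(weights[i], joint)) + bias[i]
        gate = sigmoid(pre_activation)
        # Each output lies between the spatial and temporal value.
        fused.append(gate * s + (1.0 - gate) * t)
    return fused


# Illustrative inputs: random features and small random gate weights.
random.seed(0)
d = 4
spatial = [random.gauss(0.0, 1.0) for _ in range(d)]
temporal = [random.gauss(0.0, 1.0) for _ in range(d)]
weights = [[random.gauss(0.0, 0.1) for _ in range(2 * d)] for _ in range(d)]
bias = [0.0] * d

fused = gated_fusion(spatial, temporal, weights, bias)
print(fused)
```

Because the gate lies strictly between 0 and 1, each fused feature is a weighted mix of its spatial and temporal counterparts. In a learned model the gate weights would be trained jointly with the policy rather than sampled at random.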

Keywords:

Computer science; Reinforcement learning; ENCODE; Robot; Transformer; Artificial intelligence; Embedding; Mobile robot; Entropy (arrow of time); Leverage (statistics); Machine learning; Computer vision; Engineering

Metrics

FWCI (Field Weighted Citation Impact): 1.09

Citation Normalized Percentile: 0.73

Topics

Evacuation and Crowd Dynamics

Physical Sciences → Engineering → Ocean Engineering

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Robot Crowd Navigation Incorporating Spatio-Temporal Information Based on Deep Reinforcement Learning

Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning

Memory-based crowd-aware robot navigation using deep reinforcement learning

Deep Reinforcement Learning Based Mobile Robot Navigation in Crowd Environments

Robot Crowd Navigation Based on Spatio-Temporal Interaction Graphs and Danger Zones