JOURNAL ARTICLE

Anchor-based Multi-modal Transformer Network for Pedestrian Trajectory and Intention Prediction

Abstract

Accurate prediction of pedestrian behavior is of paramount importance in enhancing both traffic efficiency and pedestrian safety for autonomous vehicles. However, due to the inherent unpredictability of pedestrian movements, achieving accurate predictions is challenging. In recent years, researchers have explored various approaches to address this challenge, focusing on multimodal trajectory prediction and crossing intention prediction. In this study, we propose an innovative method called the Anchor-based Multimodal Transformer Network for Pedestrian Trajectory and Intention Prediction. Our proposed approach adopts a multi-task learning framework that combines trajectory prediction and intention prediction, leading to improved performance in both tasks. Moreover, this integration enables a more comprehensive understanding of pedestrians' future movements, empowering autonomous vehicles to make more informed decisions. Additionally, to capture the diverse range of future possibilities for pedestrians, we introduce learnable intention anchors. Each anchor serves as a reference point and facilitates the prediction of a specific future modality. These anchors are iteratively refined by leveraging historical context in our multi-layer decoder, resulting in enhanced multimodal trajectory and intention predictions. Experimental result shows that our model achieves the state-of-the-art performance on JAAD dataset. Specifically, our method improves trajectory prediction by 9% and intention prediction by 11%. By providing comprehensive and accurate predictions of pedestrian behavior for automated vehicles, our model contributes to enhancing traffic efficiency and pedestrian safety.

Keywords:
Pedestrian Modal Transformer Trajectory Computer science Engineering Transport engineering Electrical engineering Voltage Physics Materials science

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
20
Refs
0.21
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Autonomous Vehicle Technology and Safety
Physical Sciences →  Engineering →  Automotive Engineering
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Traffic and Road Safety
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality

Related Documents

JOURNAL ARTICLE

Multi-Modal Pedestrian Crossing Intention Prediction with Transformer-Based Model

Ting-Wei WangShang‐Hong Lai

Journal:   APSIPA Transactions on Signal and Information Processing Year: 2024 Vol: 13 (5)
JOURNAL ARTICLE

Stochastic Non-Autoregressive Transformer-Based Multi-Modal Pedestrian Trajectory Prediction for Intelligent Vehicles

Xiaobo ChenHuanjia ZhangFuwen DengJun LiangJian Yang

Journal:   IEEE Transactions on Intelligent Transportation Systems Year: 2023 Vol: 25 (5)Pages: 3561-3574
© 2026 ScienceGate Book Chapters — All rights reserved.