As the development of autonomous driving technology continues, pedestrian safety is becoming an increasingly important issue. The ability of an autonomous car to accurately predict whether a pedestrian will cross the road is essential for ensuring their safety, as the vehicle can slow down in time or stop to avoid any potential accidents. However, predicting pedestrian behavior is a complex task influenced by various environmental and contextual factors. To deal with this issue, we propose a novel method, Crossing Intention Prediction based on feature Fusion modules (CIPF) that combines eight different input features extracted from both pedestrians and vehicles through three fusion modules using RNN layers and attention mechanisms. We demonstrated state-of-the-art performance of prediction accuracy in the PIE dataset, which is the most widely used for pedestrian crossing intention prediction. We also demonstrated the superiority of the performance of our CIPF network through qualitative and quantitative analysis. In particular, we also performed ablation studies on the verification of the effectiveness of the eight input features, the validity of VGG encoders, and performance comparison of our CIPF over time by adjusting the prediction time.
Yasaman SalehiMehdi EzojiFarzam Mohammadpourmir
Jieru GuoYutong DingAoshang Tian
Xiaofei ZhangXiaolan WangWeiwei ZhangYansong WangXintian LiuDan Wei
Biao YangJun ZhuChuan HuZhitao YuHongyu HuRongrong Ni