Huaqi ZhaoSu WangXiang PengJeng‐Shyang PanRui WangXiaomin Liu
Although semantic segmentation is widely employed in autonomous driving, its performance in segmenting road surfaces falls short in complex traffic environments. This study proposes a frequency-based semantic segmentation with a transformer (FSSFormer) based on the sensitivity of semantic segmentation to frequency information. Specifically, we propose a weight-sharing factorized attention to select important frequency features that can improve the segmentation performance of overlapping targets. Moreover, to address boundary information loss, we used a cross-attention method combining spatial and frequency features to obtain further detailed pixel information. To improve the segmentation accuracy in complex road scenarios, we adopted a parallel-gated feedforward network segmentation method to encode the position information. Extensive experiments demonstrate that the mIoU of FSSFormer increased by 2% compared with existing segmentation methods on the Cityscapes dataset.
I. V. SgibnevА.А. СорокинB. V. VishnyakovYu. V. Vizilter
Suvash SharmaJohn E. BallBo TangDaniel W. CarruthMatthew DoudeMuhammad Aminul Islam
Usha DivakarlaRamyashree BhatSuraj B MadagaonkarD. V. PranavChaithanya ShyamK. Chandrashekar
Jingwei YangSicen GuoMohammud Junaid BocusQijun ChenRui Fan