Haoxue ZhangGang XieLinjuan LiXinlin XieJinchang Ren
Convolutional Neural Networks (CNNs), transformers, and the hybrid methods have been significant application in remote sensing. However, existing methods are limited in effectively modeling frequency domain information, which affects their ability to capture detailed information. Therefore, we propose a frequency-domain guided feature coupled mechanism and a global-local feature integration method (FGNet) for semantic segmentation. Specifically, a frequency-domain guided Swin transformer (FGSwin) is designed by introducing dilation group convolution, Fast Fourier Transform (FFT) and learnable weights to enhance the expression capability of frequency-domain and space-domain, local and global features, simultaneously. In addition, a global-local feature integration module (GLFI) is proposed for aggregating features to further enhance the discrimination of each category. Comprehensive experimental results demonstrate that, compared to existing methods, the proposed method achieves superior performance in terms of mean intersection over union (mIoU), reaching 71.46% and 74.04% on the ISPRS Potsdam and Vaihingen, two widely used datasets.
Xin LiFeng XuHongmin GaoFan LiuXin Lyu
Yinghao LinShihao ZhaoYuye WangYi XieBaojun Qiao
Youda MoHuihui LiXiangling XiaoHuimin ZhaoXiaoyong LiuJin Zhan