Jiajun MaNian PanDongxiao YinDi WangJin Zhou
Infrared small-target detection encounters significant challenges due to a low image signal-to-noise ratio, limited target size, and complex background noise. To address the issues of sparse feature loss for small targets during the down-sampling phase of the traditional U-Net network and the semantic gap in the feature fusion process, a multilevel feature extraction and fusion attention network (MEFA-Net) is designed. Specifically, the dilated direction-sensitive convolution block (DDCB) is devised to collaboratively extract local detail features, contextual features, and Gaussian salient features via ordinary convolution, dilated convolution and parallel strip convolution. Furthermore, the encoder attention fusion module (EAF) is employed, where spatial and channel attention weights are generated using dual-path pooling to achieve the adaptive fusion of deep and shallow layer features. Lastly, an efficient up-sampling block (EUB) is constructed, integrating a hybrid up-sampling strategy with multi-scale dilated convolution to refine the localization of small targets. The experimental results confirm that the proposed algorithm model surpasses most existing recent methods. Compared with the baseline, the intersection over union (IoU) and probability of detection Pd of MEFA-Net on the IRSTD-1k dataset are increased by 2.25% and 3.05%, respectively, achieving better detection performance and a lower false alarm rate in complex scenarios.
Zhen ZuoXiaozhong TongJunyu WeiShaojing SuPeng WuRunze GuoBei Sun
Yidan ZhangChunlei LiYundong LiuZhoufeng LiuRuimin Yang
Keyan WangXueyan WuPeicheng ZhouZuntian ChenRui ZhangLiyun YangYunsong Li
Tianlei MaHao WangJing LiangJinzhu PengQi MaZhiqiang Kai
Zehao ZhaoWeining ChenShiwei DongYaohong ChenHao Wang