K Martynenko Boris, Kriuk Fedor, Karthik Periyasamy
Attention mechanisms have become a fundamental component of deep learning, including in computer vision. The key idea behind attention in computer vision is to help the model focus on the relevant spatial regions of the input image rather than treating all regions equally. Traditional attention mechanisms in computer vision often produce attention maps with inconsistent distributions, and the resulting sharp transitions degrade the model's focus and lead to poor generalization on complex shapes. This spatial incoherence is particularly pronounced in semantic segmentation, where accurate pixel-level predictions require a detailed understanding of the spatial relationships within the image. In this paper, we propose Smooth Attention, an attention mechanism for convolutional neural networks that addresses spatial inconsistency in attention maps through multidimensional spatial smoothing. We conduct a series of experiments to evaluate the effectiveness of the proposed mechanism and demonstrate its superior performance compared to traditional methods.
Xin Zuo, Jianyong Jiang, Jifeng Shen, Wankou Yang
Kumar Rajamani, Sahana D. Gowda, Vishwa Nedunoori Tej, Srividya Tirunellai Rajamani
Hyungjoon Kim, Hyeonwoo Kim, Seongkuk Cho, Eenjun Hwang
Weihao Weng, Xin Zhu, Mianxiong Dong