Kai ZhaoBinbing HeShiju PanYuan Zhu
Factors like object's appearance changes, occlusions, and background changes have a negative impact on template matching tasks. To suppress this effect and improve template matching accuracy, we propose a Siamese network with multi-scale feature fusion and dual attention module in this paper. A multi-scale feature fusion module is introduced to extract shallow structural and deep semantic features, and learnable weight coefficients are used to adaptively fuse multi-scale features to improve representation capabilities of the object. In order to improve feature recognition and eliminate redundancy caused by feature fusion, a dual attention mechanism module is introduced, which applies weights to features from separate channels and spatial positions. Finally, the features are normalized, and the cross-correlation is calculated to produce a similarity score map and locate the best matching region. When compared to the two state-of-the-art CNN-based template matching algorithms, QATM and Deep-DIM, the experimental results show that the algorithm in this paper improves template matching accuracy by 4.72 percent and 2.30 percent, respectively, and has the least time-consuming.
Xu ChenCaiping XiHongyu LuXiang Xu
Gaihua WangGan XinQingcheng CaoQianyu Zhai
Xue ShangjieWenjin YaoYang Wenjun