Qiwei WangShouhong WanLihua YueChe Wang
Bag-of-words is a classical method for image classification. The core problem is how to count the frequency of the visual words and what visual words to select. In this paper, we propose a visual attention based bag-of-words model (VABOW model) for image classification task. The VABOW model utilizes visual attention method to generate a saliency map, and uses the saliency map as a weighted matrix to instruct the statistic process for the frequency of the visual words. On the other hand, the VABOW model combines shape, color and texture cues and uses L1 regularization logistic regression method to select the most relevant and most efficient features. We compare our approach with traditional bag-of-words based method on two datasets, and the result shows that our VABOW model outperforms the state-of-the-art method for image classification.
Huadong SunXu ZhangXiaowei HanXuesong JinZhijie Zhao
Zainab N. SultaniBan N. Dhannoon
Andrea MelloniPaolo BestaginiA. CostanzoMauro BarniMarco TagliasacchiStefano Tubaro