JOURNAL ARTICLE

Contextual Bag-of-Words for Visual Categorization

Teng LiTao MeiIn-So KweonXian‐Sheng Hua

Year: 2010 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 21 (4)Pages: 381-392   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Bag-of-words (BOW), which represents an image by the histogram of local patches on the basis of a visual vocabulary, has attracted intensive attention in visual categorization due to its good performance and flexibility. Conventional BOW neglects the contextual relations between local patches due to its Naïve Bayesian assumption. However, it is well known that contextual relations play an important role for human beings to recognize visual categories from their local appearance. This paper proposes a novel contextual bag-of-words (CBOW) representation to model two kinds of typical contextual relations between local patches, i.e., a semantic conceptual relation and a spatial neighboring relation. To model the semantic conceptual relation, visual words are grouped on multiple semantic levels according to the similarity of class distribution induced by them, accordingly local patches are encoded and images are represented. To explore the spatial neighboring relation, an automatic term extraction technique is adopted to measure the confidence that neighboring visual words are relevant. Word groups with high relevance are used and their statistics are incorporated into the BOW representation. Classification is taken using the support vector machine with an efficient kernel to incorporate the relational information. The proposed approach is extensively evaluated on two kinds of visual categorization tasks, i.e., video event and scene categorization. Experimental results demonstrate the importance of contextual relations of local patches and the CBOW shows superior performance to conventional BOW.

Keywords:
Artificial intelligence Computer science Bag-of-words model Spatial relation Categorization Natural language processing Pattern recognition (psychology) Vocabulary Bag-of-words model in computer vision Visual Word Image retrieval Image (mathematics)

Metrics

146
Cited By
8.96
FWCI (Field Weighted Citation Impact)
46
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Scene categorization via contextual visual words

Jianzhao QinN.H.C. Yung

Journal:   Pattern Recognition Year: 2009 Vol: 43 (5)Pages: 1874-1888
JOURNAL ARTICLE

Region Contextual Visual Words for scene categorization

Chang LiuDe XuSonghe Feng

Journal:   Expert Systems with Applications Year: 2011 Vol: 38 (9)Pages: 11591-11597
JOURNAL ARTICLE

Contextual Bag-of-Words for Robust Visual Tracking

Fanxiang ZengYuefeng JiMartin D. Levine

Journal:   IEEE Transactions on Image Processing Year: 2017 Vol: 27 (3)Pages: 1433-1447
© 2026 ScienceGate Book Chapters — All rights reserved.