JOURNAL ARTICLE

Discovery of Collocation Patterns: from Visual Words to Visual Phrases

Abstract

A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a “bag-of-words” representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e.visual phrase lexicon, whereavisual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning. 1.

Keywords:
Lexicon Computer science Phrase Artificial intelligence Natural language processing Categorization Object (grammar) Visual Word Set (abstract data type) Word (group theory) Metric (unit) Representation (politics) Automatic summarization Similarity (geometry) Cluster analysis Pattern recognition (psychology) Linguistics Image (mathematics)

Metrics

237
Cited By
16.20
FWCI (Field Weighted Citation Impact)
30
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.