JOURNAL ARTICLE

Aggregating Local Image Descriptors into Compact Codes

H. JegouFlorent PerronninMatthijs DouzeJorge SánchezPatrick PérezC. Schmid

Year: 2011 Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Vol: 34 (9)Pages: 1704-1716   Publisher: IEEE Computer Society

Abstract

This paper addresses the problem of large-scale image search. Three constraints have to be taken into account: search accuracy, efficiency, and memory usage. We first present and evaluate different ways of aggregating local image descriptors into a vector and show that the Fisher kernel achieves better performance than the reference bag-of-visual words approach for any given vector dimension. We then jointly optimize dimensionality reduction and indexing in order to obtain a precise vector comparison as well as a compact representation. The evaluation shows that the image representation can be reduced to a few dozen bytes while preserving high accuracy. Searching a 100 million image data set takes about 250 ms on one processor core.

Keywords:
Fisher kernel Search engine indexing Kernel (algebra) Computer science Artificial intelligence Dimensionality reduction Pattern recognition (psychology) Representation (politics) Byte Dimension (graph theory) Image (mathematics) Support vector machine Set (abstract data type) Curse of dimensionality Kernel method Computer vision Mathematics

Metrics

1533
Cited By
54.74
FWCI (Field Weighted Citation Impact)
44
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Robotics and Sensor-Based Localization
Physical Sciences →  Engineering →  Aerospace Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.