JOURNAL ARTICLE

Scalable forest hashing for fast similarity search

Abstract

Indexing images and videos using binary hash bits has shown promising results for fast similarity search. Existing datadriven hashing methods learn compact hash codes from the data, but usually with the cost of generating unbalanced hash buckets, thus affecting the search efficiency. We propose a novel data-driven hashing method called forest hashing, which utilizes multiple tree structures to perform data hashing. By leveraging the index structure of trees, we can significantly improve the hashing efficacy by generating balanced hash buckets. Moreover, forest hashing naturally supports scalable coding where more trees can improve the coding quality with a longer code. Last but not the least, our forest hashing can be easily extended for semantic search by integrating semi-supervised label information. Experiments on two benchmark datasets show favorable results compared with the state-of-the-art hashing methods.

Keywords:
Dynamic perfect hashing Universal hashing Double hashing Computer science Hash table Linear hashing Hash function Locality-sensitive hashing Scalability Feature hashing Consistent hashing Nearest neighbor search Search engine indexing Binary code Ternary search tree Data mining Benchmark (surveying) Artificial intelligence Tree structure Binary number Binary tree Algorithm Database Mathematics

Metrics

12
Cited By
2.41
FWCI (Field Weighted Citation Impact)
31
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.