ForestHash: Semantic Hashing With Shallow Random Forests and Tiny Convolutional Networks
Qiang Qiu, Jose Lezama, Alex Bronstein, Guillermo Sapiro

TL;DR
This paper introduces ForestHash, a novel semantic hashing method combining shallow random forests with tiny CNNs, achieving high-quality image retrieval and classification with efficient, compact representations.
Contribution
It proposes a new hashing scheme embedding lightweight CNNs into shallow random forests with an information-theoretic code aggregation, improving similarity preservation and code uniqueness.
Findings
Outperforms state-of-the-art hashing methods in image retrieval.
Achieves classification performance comparable to deeper models.
Produces compact, scalable hash codes for large datasets.
Abstract
Hash codes are efficient data representations for coping with the ever growing amounts of data. In this paper, we introduce a random forest semantic hashing scheme that embeds tiny convolutional neural networks (CNN) into shallow random forests, with near-optimal information-theoretic code aggregation among trees. We start with a simple hashing scheme, where random trees in a forest act as hashing functions by setting `1' for the visited tree leaf, and `0' for the rest. We show that traditional random forests fail to generate hashes that preserve the underlying similarity between the trees, rendering the random forests approach to hashing challenging. To address this, we propose to first randomly group arriving classes at each tree split node into two groups, obtaining a significantly simplified two-class classification problem, which can be handled using a light-weight CNN weak…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
