Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval
Morris Florek, David Tschirschwitz, Bj\"orn Barz, Volker Rodehorst

TL;DR
This paper introduces a computationally efficient universal image feature extractor trained on a multi-domain dataset, achieving near state-of-the-art retrieval performance with significantly fewer parameters.
Contribution
Developed a resource-efficient training framework and dataset for universal image feature extraction, outperforming similar methods with fewer parameters.
Findings
Achieved 0.721 mMP@5 on Google Universal Image Embedding Challenge
Placed second on the leaderboard, 0.7% behind the top method
Outperformed previous state-of-the-art by 3.3% with similar computational resources
Abstract
Current image retrieval systems often face domain specificity and generalization issues. This study aims to overcome these limitations by developing a computationally efficient training framework for a universal feature extractor that provides strong semantic image representations across various domains. To this end, we curated a multi-domain training dataset, called M4D-35k, which allows for resource-efficient training. Additionally, we conduct an extensive evaluation and comparison of various state-of-the-art visual-semantic foundation models and margin-based metric learning loss functions regarding their suitability for efficient universal feature extraction. Despite constrained computational resources, we achieve near state-of-the-art results on the Google Universal Image Embedding Challenge, with a mMP@5 of 0.721. This places our method at the second rank on the leaderboard, just…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsSparse Evolutionary Training
