Efficient and Discriminative Image Feature Extraction for Universal   Image Retrieval

Morris Florek; David Tschirschwitz; Bj\"orn Barz; Volker Rodehorst

arXiv:2409.13513·cs.CV·September 23, 2024

Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval

Morris Florek, David Tschirschwitz, Bj\"orn Barz, Volker Rodehorst

PDF

Open Access 1 Repo

TL;DR

This paper introduces a computationally efficient universal image feature extractor trained on a multi-domain dataset, achieving near state-of-the-art retrieval performance with significantly fewer parameters.

Contribution

Developed a resource-efficient training framework and dataset for universal image feature extraction, outperforming similar methods with fewer parameters.

Findings

01

Achieved 0.721 mMP@5 on Google Universal Image Embedding Challenge

02

Placed second on the leaderboard, 0.7% behind the top method

03

Outperformed previous state-of-the-art by 3.3% with similar computational resources

Abstract

Current image retrieval systems often face domain specificity and generalization issues. This study aims to overcome these limitations by developing a computationally efficient training framework for a universal feature extractor that provides strong semantic image representations across various domains. To this end, we curated a multi-domain training dataset, called M4D-35k, which allows for resource-efficient training. Additionally, we conduct an extensive evaluation and comparison of various state-of-the-art visual-semantic foundation models and margin-based metric learning loss functions regarding their suitability for efficient universal feature extraction. Despite constrained computational resources, we achieve near state-of-the-art results on the Google Universal Image Embedding Challenge, with a mMP@5 of 0.721. This places our method at the second rank on the leaderboard, just…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

morrisfl/unifex
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques

MethodsSparse Evolutionary Training