SearchAD: Large-Scale Rare Image Retrieval Dataset for Autonomous Driving

Felix Embacher; Jonas Uhrig; Marius Cordts; Markus Enzweiler

arXiv:2604.08008·cs.CV·April 10, 2026

SearchAD: Large-Scale Rare Image Retrieval Dataset for Autonomous Driving

Felix Embacher, Jonas Uhrig, Marius Cordts, Markus Enzweiler

PDF

1 Repo

TL;DR

SearchAD is a large-scale dataset designed for semantic image retrieval of rare driving scenarios, facilitating research in autonomous driving safety and long-tail perception.

Contribution

It introduces a comprehensive dataset with annotations for rare classes, enabling diverse retrieval tasks and benchmarking for autonomous driving applications.

Findings

01

Text-based retrieval methods outperform image-based ones.

02

Spatial visual features align well with language in zero-shot models.

03

Fine-tuning improves retrieval performance, but results are still limited.

Abstract

Retrieving rare and safety-critical driving scenarios from large-scale datasets is essential for building robust autonomous driving (AD) systems. As dataset sizes continue to grow, the key challenge shifts from collecting more data to efficiently identifying the most relevant samples. We introduce SearchAD, a large-scale rare image retrieval dataset for AD containing over 423k frames drawn from 11 established datasets. SearchAD provides high-quality manual annotations of more than 513k bounding boxes covering 90 rare categories. It specifically targets the needle-in-a-haystack problem of locating extremely rare classes, with some appearing fewer than 50 times across the entire dataset. Unlike existing benchmarks, which focused on instance-level retrieval, SearchAD emphasizes semantic image retrieval with a well-defined data split, enabling text-to-image and image-to-image retrieval,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://iis-esslingen.github.io/searchad
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.