FOCUS: Familiar Objects in Common and Uncommon Settings
Priyatham Kattakinda, Soheil Feizi

TL;DR
FOCUS is a dataset designed to evaluate how well deep image classifiers generalize to uncommon settings, revealing biases and limitations in current models through detailed analysis.
Contribution
The paper introduces FOCUS, a new dataset for stress-testing model generalization to uncommon settings, and analyzes model performance and feature usage in these scenarios.
Findings
Models perform significantly worse on uncommon settings.
Deep features often rely on spurious correlations.
Dataset reveals biases in current image classifiers.
Abstract
Standard training datasets for deep learning often contain objects in common settings (e.g., "a horse on grass" or "a ship in water") since they are usually collected by randomly scraping the web. Uncommon and rare settings (e.g., "a plane on water", "a car in snowy weather") are thus severely under-represented in the training data. This can lead to an undesirable bias in model predictions towards common settings and create a false sense of accuracy. In this paper, we introduce FOCUS (Familiar Objects in Common and Uncommon Settings), a dataset for stress-testing the generalization power of deep image classifiers. By leveraging the power of modern search engines, we deliberately gather data containing objects in common and uncommon settings in a wide range of locations, weather conditions, and time of day. We present a detailed analysis of the performance of various popular image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and Data Classification
