Diverse, Difficult, and Odd Instances (D2O): A New Test Set for Object Classification

Ali Borji

arXiv:2301.12527 · cs.CV · January 31, 2023



TL;DR

The paper introduces D2O, a new diverse and challenging test set for object classification, designed to better evaluate model generalization and reveal weaknesses in current AI systems.

Contribution

D2O is a novel test set with diverse, real-world images that differ from existing datasets, highlighting limitations of current models and APIs in object recognition.

Findings

01

Models achieve around 60% accuracy on D2O, much lower than on ImageNet.

02

Popular vision APIs perform poorly on D2O categories like faces, cars, and cats.

03

D2O's varied difficulty levels make it a strong predictor of model performance.

Abstract

Test sets are an integral part of evaluating models and gauging progress in object recognition, and more broadly in computer vision and AI. Existing test sets for object recognition, however, suffer from shortcomings such as bias towards the ImageNet characteristics and idiosyncrasies (e.g., ImageNet-V2), being limited to certain types of stimuli (e.g., indoor scenes in ObjectNet), and underestimating the model performance (e.g., ImageNet-A). To mitigate these problems, we introduce a new test set, called D2O, which is sufficiently different from existing test sets. Images are a mix of generated images as well as images crawled from the web. They are diverse, unmodified, and representative of real-world scenarios and cause state-of-the-art models to misclassify them with high confidence. To emphasize generalization, our dataset by design does not come paired with a training set. It…
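The abstract's claim that models misclassify images "with high confidence" can be made concrete with a small evaluation sketch. The function name `evaluate`, the 0.9 confidence threshold, and the toy probabilities below are illustrative assumptions, not part of the paper or its repository; the sketch only shows one plausible way to report top-1 accuracy alongside the rate of confident errors on a test-only dataset.

```python
from typing import Sequence, Tuple


def evaluate(
    probs: Sequence[Sequence[float]],
    labels: Sequence[int],
    conf_threshold: float = 0.9,  # assumed cutoff for "high confidence"
) -> Tuple[float, float]:
    """Return (top-1 accuracy, fraction of examples misclassified
    with predicted-class probability >= conf_threshold)."""
    if len(probs) != len(labels) or len(labels) == 0:
        raise ValueError("probs and labels must be non-empty and equal in length")

    correct = 0
    confident_errors = 0
    for p, y in zip(probs, labels):
        # Index of the highest-probability class (top-1 prediction).
        pred = max(range(len(p)), key=p.__getitem__)
        if pred == y:
            correct += 1
        elif p[pred] >= conf_threshold:
            confident_errors += 1

    n = len(labels)
    return correct / n, confident_errors / n


# Toy, made-up class probabilities for four images over three classes.
probs = [
    [0.95, 0.03, 0.02],  # predicted 0, true 0: correct
    [0.10, 0.85, 0.05],  # predicted 1, true 1: correct
    [0.92, 0.05, 0.03],  # predicted 0, true 2: confident error
    [0.40, 0.35, 0.25],  # predicted 0, true 1: low-confidence error
]
labels = [0, 1, 2, 1]

accuracy, confident_error_rate = evaluate(probs, labels)
print(f"top-1 accuracy: {accuracy:.2f}")
print(f"confident error rate: {confident_error_rate:.2f}")
```

Reporting the confident error rate separately from accuracy distinguishes a model that is merely unsure on hard images from one that is wrong and overconfident.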


Code & Models

Repositories

aliborji/d2o
Official


Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

Methods: Test