Few-Class Arena: A Benchmark for Efficient Selection of Vision Models   and Dataset Difficulty Measurement

Bryan Bo Cao; Lawrence O'Gorman; Michael Coss; Shubham Jain

arXiv:2411.01099·cs.CV·March 4, 2025

Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement

Bryan Bo Cao, Lawrence O'Gorman, Michael Coss, Shubham Jain

PDF

Open Access 1 Repo

TL;DR

Few-Class Arena (FCA) is a comprehensive benchmark designed to evaluate and select efficient vision models specifically for applications involving only a few classes, addressing a gap in current many-class datasets.

Contribution

The paper introduces FCA, a new benchmark and difficulty measure for few-class image classification, enabling better model evaluation and selection in real-world scenarios.

Findings

01

ResNet performance varies significantly across few-class subsets.

02

The difficulty measure correlates with class similarity and impacts model accuracy.

03

FCA is adaptable and facilitates future research in few-class vision tasks.

Abstract

We propose Few-Class Arena (FCA), as a unified benchmark with focus on testing efficient image classification models for few classes. A wide variety of benchmark datasets with many classes (80-1000) have been created to assist Computer Vision architectural evolution. An increasing number of vision models are evaluated with these many-class datasets. However, real-world applications often involve substantially fewer classes of interest (2-10). This gap between many and few classes makes it difficult to predict performance of the few-class applications using models trained on the available many-class datasets. To date, little has been offered to evaluate models in this Few-Class Regime. We conduct a systematic evaluation of the ResNet family trained on ImageNet subsets from 2 to 1000 classes, and test a wide spectrum of Convolutional Neural Networks and Transformer architectures over ten…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fewclassarena/fca
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfrared Target Detection Methodologies · Advanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Average Pooling · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout