Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement
Bryan Bo Cao, Lawrence O'Gorman, Michael Coss, Shubham Jain

TL;DR
Few-Class Arena (FCA) is a comprehensive benchmark designed to evaluate and select efficient vision models specifically for applications involving only a few classes, addressing a gap in current many-class datasets.
Contribution
The paper introduces FCA, a new benchmark and difficulty measure for few-class image classification, enabling better model evaluation and selection in real-world scenarios.
Findings
ResNet performance varies significantly across few-class subsets.
The difficulty measure correlates with class similarity and impacts model accuracy.
FCA is adaptable and facilitates future research in few-class vision tasks.
Abstract
We propose Few-Class Arena (FCA), as a unified benchmark with focus on testing efficient image classification models for few classes. A wide variety of benchmark datasets with many classes (80-1000) have been created to assist Computer Vision architectural evolution. An increasing number of vision models are evaluated with these many-class datasets. However, real-world applications often involve substantially fewer classes of interest (2-10). This gap between many and few classes makes it difficult to predict performance of the few-class applications using models trained on the available many-class datasets. To date, little has been offered to evaluate models in this Few-Class Regime. We conduct a systematic evaluation of the ResNet family trained on ImageNet subsets from 2 to 1000 classes, and test a wide spectrum of Convolutional Neural Networks and Transformer architectures over ten…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Advanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Average Pooling · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout
