CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
Leonie Bossemeyer, Samuel Heinrich, Grant Van Horn, Oisin Mac Aodha

TL;DR
CleverBirds is a large-scale benchmark dataset designed to evaluate models' ability to trace and predict human knowledge progression in fine-grained bird species recognition through extensive multiple-choice quizzes.
Contribution
Introduces CleverBirds, a comprehensive dataset for visual knowledge tracing in fine-grained classification, enabling new research on learning patterns and individual differences.
Findings
Tracking knowledge is challenging across subgroups and question types.
Contextual information variably improves prediction accuracy.
CleverBirds is among the largest benchmarks for visual knowledge tracing.
Abstract
Mastering fine-grained visual recognition, essential in many expert domains, can require that specialists undergo years of dedicated training. Modeling the progression of such expertize in humans remains challenging, and accurately inferring a human learner's knowledge state is a key step toward understanding visual learning. We introduce CleverBirds, a large-scale knowledge tracing benchmark for fine-grained bird species recognition. Collected by the citizen-science platform eBird, it offers insight into how individuals acquire expertize in complex fine-grained classification. More than 40,000 participants have engaged in the quiz, answering over 17 million multiple-choice questions spanning over 10,000 bird species, with long-range learning patterns across an average of 400 questions per participant. We release this dataset to support the development and evaluation of new methods for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSpecies Distribution and Climate Change · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
