A Testbed for Cross-Dataset Analysis

Tatiana Tommasi; Tinne Tuytelaars; Barbara Caputo

arXiv:1402.5923·cs.CV·February 25, 2014·47 cites

Open Access

TL;DR

This paper introduces a comprehensive testbed that consolidates twelve image datasets into a single corpus, facilitating large-scale analysis of dataset biases and their impact on visual recognition system generalization.

Contribution

The paper builds a unified dataset repository and analysis framework for studying dataset bias across multiple visual recognition datasets, intended as a resource for future research.

Findings

01. Organized twelve datasets into a single corpus

02. Provided a feature repository for dataset analysis

03. Facilitated large-scale bias analysis in visual recognition

Abstract

Since its beginning, visual recognition research has tried to capture the huge variability of the visual world in several image collections. The number of available datasets is still growing, together with the number of samples per object category. However, this trend does not correspond directly to an increase in the generalization capabilities of the developed recognition systems. Each collection tends to have its own specific characteristics and to cover only some aspects of the visual world: these biases often limit the effectiveness of methods defined and tested separately on each image set. Our work takes a first step towards the analysis of the dataset bias problem on a large scale. We organize twelve existing databases into a single corpus and present the visual community with a useful feature repository for future research.
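To make the dataset bias idea in the abstract concrete, the following is a minimal sketch of a cross-dataset evaluation: train on one collection, test on every collection, and compare in-dataset with cross-dataset accuracy. It is not the paper's protocol or data. The two synthetic "datasets", the `offset` parameter standing in for a collection-specific bias, the nearest-centroid classifier, and all function names are assumptions chosen purely for illustration.

```python
import random


def make_dataset(rng, offset, n_per_class=200, std=0.5):
    """Two classes along the x-axis.

    `offset` is a hypothetical dataset-specific shift that plays the role
    of the collection's bias; it is an assumption of this sketch.
    """
    samples = []
    for label, center in enumerate((-2.0, 2.0)):
        for _ in range(n_per_class):
            x = rng.gauss(center + offset, std)
            y = rng.gauss(0.0, std)
            samples.append(((x, y), label))
    return samples


def fit_centroids(samples):
    """Nearest-centroid 'training': average the points of each class."""
    sums = {}
    for (x, y), label in samples:
        sx, sy, n = sums.get(label, (0.0, 0.0, 0))
        sums[label] = (sx + x, sy + y, n + 1)
    return {label: (sx / n, sy / n) for label, (sx, sy, n) in sums.items()}


def predict(centroids, point):
    """Return the label whose centroid is closest to `point`."""
    return min(
        centroids,
        key=lambda label: (point[0] - centroids[label][0]) ** 2
        + (point[1] - centroids[label][1]) ** 2,
    )


def accuracy(centroids, samples):
    hits = sum(predict(centroids, point) == label for point, label in samples)
    return hits / len(samples)


def cross_dataset_matrix(offsets, seed=0):
    """Train on each dataset, test on every dataset (held-out samples)."""
    rng = random.Random(seed)
    train = {name: make_dataset(rng, off) for name, off in offsets.items()}
    test = {name: make_dataset(rng, off) for name, off in offsets.items()}
    models = {name: fit_centroids(samples) for name, samples in train.items()}
    return {
        tr: {te: accuracy(models[tr], test[te]) for te in test}
        for tr in models
    }


# Rows are training datasets, columns are test datasets.
matrix = cross_dataset_matrix({"A": 0.0, "B": 3.0})
for train_name, row in matrix.items():
    print(train_name, {name: round(acc, 2) for name, acc in row.items()})
```

In this toy setting the diagonal entries (train and test on the same collection) should be much higher than the off-diagonal ones, which is the kind of gap a cross-dataset analysis is meant to expose.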

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications