fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings

Tillmann Ohm; Andres Karjus; Mikhail Tamm; Maximilian Schich

arXiv:2406.01278·cs.CV·November 12, 2025

fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings

Tillmann Ohm, Andres Karjus, Mikhail Tamm, Maximilian Schich

PDF

Open Access 1 Repo

TL;DR

This paper introduces fruit-SALAD, a balanced and diverse dataset of 10,000 fruit images across styles and categories, to systematically evaluate and compare how different models perceive visual similarity in images.

Contribution

The creation of Style Aligned Artwork Datasets (SALADs), specifically fruit-SALAD, providing a benchmark for analyzing semantic and style similarity perception in image embeddings.

Findings

01

Models show varied sensitivity to semantic and style features.

02

The dataset reveals differences in similarity weights across models.

03

SALAD enables controlled, quantitative comparison of similarity perception.

Abstract

The notion of visual similarity is essential for computer vision, and in applications and studies revolving around vector embeddings of images. However, the scarcity of benchmark datasets poses a significant hurdle in exploring how these models perceive similarity. Here we introduce Style Aligned Artwork Datasets (SALADs), and an example of fruit-SALAD with 10,000 images of fruit depictions. This combined semantic category and style benchmark comprises 100 instances each of 10 easy-to-recognize fruit categories, across 10 easy distinguishable styles. Leveraging a systematic pipeline of generative image synthesis, this visually diverse yet balanced benchmark demonstrates salient differences in semantic category and style similarity weights across various computational models, including machine learning models, feature extraction algorithms, and complexity measures, as well as conceptual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tillmannohm/fruit-SALAD
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAesthetic Perception and Analysis · Generative Adversarial Networks and Image Synthesis