SommBench: Assessing Sommelier Expertise of Language Models

William Brach; Tomas Bedej; Jacob Nielsen; Jacob Pichna; Juraj Bedej; Eemeli Saarensilta; Julie Dupouy; Gianluca Barmina; Andrea Blasi Núñez; Peter Schneider-Kamp; Kristian Košťál; Michal Ries; Lukas Galke Poech

arXiv:2603.12117 · cs.CL · March 13, 2026

TL;DR

SommBench is a multilingual benchmark that tests whether language models can emulate sommelier-level sensory judgment through three tasks: wine theory question answering, wine feature completion, and food-wine pairing.

Contribution

The paper introduces an expert-developed multilingual benchmark for assessing sommelier expertise in language models, targeting sensory and cultural understanding rather than basic knowledge alone.

Findings

01. High performance on wine theory questions (up to 97%)

02. Moderate success on feature completion (up to 65%)

03. Challenging food-wine pairing task with MCC between 0 and 0.39
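
For readers unfamiliar with the metric in the third finding, MCC (Matthews correlation coefficient) ranges from -1 to 1, where 0 corresponds to chance-level prediction and 1 to perfect agreement. The sketch below computes it for binary labels. It is an illustration only: treating food-wine pairing as a binary "good pairing / bad pairing" decision is an assumption made here, and the function name `mcc` and the toy label lists are hypothetical, not taken from the paper.

```python
import math


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (0 or 1).

    Assumption: each example is a single binary judgment, e.g. whether
    a given wine suits a given dish. This is an illustrative setup, not
    necessarily the evaluation protocol used by SommBench.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Conventional choice when a row or column of the confusion
        # matrix is empty (e.g. the model predicts only one class).
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical toy labels: 1 = suitable pairing, 0 = unsuitable pairing.
gold = [1, 1, 0, 0]
perfect = [1, 1, 0, 0]
coin_flip_like = [1, 0, 1, 0]
always_yes = [1, 1, 1, 1]

print(mcc(gold, perfect))         # 1.0
print(mcc(gold, coin_flip_like))  # 0.0
print(mcc(gold, always_yes))      # 0.0
```

Unlike plain accuracy, MCC stays at 0 for a model that always answers "yes", which is why a range of 0 to 0.39 indicates that the pairing task is far from solved.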

Abstract

With the rapid advances of large language models, it becomes increasingly important to systematically evaluate their multilingual and multicultural capabilities. Previous cultural evaluation benchmarks focus mainly on basic cultural knowledge that can be encoded in linguistic form. Here, we propose SommBench, a multilingual benchmark to assess sommelier expertise, a domain deeply grounded in the senses of smell and taste. While language models learn about sensory properties exclusively through textual descriptions, SommBench tests whether this textual grounding is sufficient to emulate expert-level sensory judgment. SommBench comprises three main tasks: Wine Theory Question Answering (WTQA), Wine Feature Completion (WFC), and Food-Wine Pairing (FWP). SommBench is available in multiple languages: English, Slovak, Swedish, Finnish, German, Danish, Italian, and Spanish. This helps separate…

Taxonomy

Topics: Olfactory and Sensory Function Studies · Multisensory perception and integration · Nutritional Studies and Diet