January Food Benchmark (JFB): A Public Benchmark Dataset and Evaluation Suite for Multimodal Food Analysis

Amir Hosseinian; Ashkan Dehghani Zahedani; Umer Mansoor; Noosheen Hashemi; Mark Woodward

arXiv:2508.09966·cs.CV·August 14, 2025

January Food Benchmark (JFB): A Public Benchmark Dataset and Evaluation Suite for Multimodal Food Analysis

Amir Hosseinian, Ashkan Dehghani Zahedani, Umer Mansoor, Noosheen Hashemi, Mark Woodward

PDF

TL;DR

The paper introduces the January Food Benchmark, a new dataset and evaluation framework for multimodal food analysis, enabling standardized assessment of AI models in nutritional analysis.

Contribution

It provides a public dataset, a comprehensive benchmarking framework, and baseline results, advancing standardized evaluation in automated nutritional analysis.

Findings

01

Specialized model achieves an Overall Score of 86.2

02

12.1-point improvement over general-purpose models

03

Provides a valuable dataset and evaluation tools for future research

Abstract

Progress in AI for automated nutritional analysis is critically hampered by the lack of standardized evaluation methodologies and high-quality, real-world benchmark datasets. To address this, we introduce three primary contributions. First, we present the January Food Benchmark (JFB), a publicly available collection of 1,000 food images with human-validated annotations. Second, we detail a comprehensive benchmarking framework, including robust metrics and a novel, application-oriented overall score designed to assess model performance holistically. Third, we provide baseline results from both general-purpose Vision-Language Models (VLMs) and our own specialized model, january/food-vision-v1. Our evaluation demonstrates that the specialized model achieves an Overall Score of 86.2, a 12.1-point improvement over the best-performing general-purpose configuration. This work offers the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.