Informatics for Food Processing

Gordana Ispirova; Michael Sebek; Giulia Menichetti

arXiv:2505.17087·cs.CL·April 7, 2026


TL;DR

This chapter reviews the evolution of food processing classification and introduces AI-driven computational methods, including FoodProX and language models, to improve accuracy and scalability in food informatics.

Contribution

It presents AI approaches, namely FoodProX and large language models, that address the subjectivity and reproducibility limitations of traditional food processing classification frameworks.

Findings

01 FoodProX accurately infers processing levels from nutrient data.

02 Language models effectively embed food descriptions for predictive tasks.

03 Multimodal AI models can classify foods at scale using diverse data sources.

Abstract

This chapter explores the evolution, classification, and health implications of food processing, while emphasizing the transformative role of machine learning, artificial intelligence (AI), and data science in advancing food informatics. It begins with a historical overview and a critical review of traditional classification frameworks such as NOVA, Nutri-Score, and SIGA, highlighting their strengths and limitations, particularly the subjectivity and reproducibility challenges that hinder epidemiological research and public policy. To address these issues, the chapter presents novel computational approaches, including FoodProX, a random forest model trained on nutrient composition data to infer processing levels and generate a continuous FPro score. It also explores how large language models like BERT and BioBERT can semantically embed food descriptions and ingredient lists for…
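To make the idea of a continuous FPro score concrete, the sketch below shows one way a classifier's per-class probabilities could be collapsed into a single number between 0 and 1. It assumes four processing classes ordered from least to most processed, as in NOVA, and it assumes the weighting `(1 - p_least + p_most) / 2`; the abstract does not state the formula, so both the function name `fpro_score` and the weighting are illustrative assumptions rather than the chapter's definition.

```python
from typing import Sequence


def fpro_score(class_probs: Sequence[float]) -> float:
    """Collapse four class probabilities into one score in [0, 1].

    class_probs is assumed to be ordered from the least processed class
    to the most processed class (hypothetical convention for this sketch).
    """
    if len(class_probs) != 4:
        raise ValueError("expected exactly four class probabilities")
    if any(p < 0.0 for p in class_probs):
        raise ValueError("probabilities must be non-negative")
    if abs(sum(class_probs) - 1.0) > 1e-6:
        raise ValueError("probabilities must sum to 1")

    p_least = class_probs[0]  # probability of the least processed class
    p_most = class_probs[3]   # probability of the most processed class

    # Assumed weighting: the score rises with confidence in the most
    # processed class and falls with confidence in the least processed one.
    return (1.0 - p_least + p_most) / 2.0


if __name__ == "__main__":
    # Made-up probabilities for illustration only, not results from the chapter.
    print(fpro_score([0.7, 0.2, 0.1, 0.0]))
    print(fpro_score([0.0, 0.1, 0.2, 0.7]))
```

In practice the probabilities would come from a trained model, for example the class-probability output of a random forest fitted on nutrient composition data. Under this assumed weighting, a food predicted with certainty to be unprocessed scores 0 and one predicted with certainty to be in the most processed class scores 1.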
