An Explainable Unsupervised-to-Supervised Machine Learning Framework for Dietary Pattern Discovery Using UK National Dietary Survey Data
Wing Yi Yu, Chun Yin Chiu

TL;DR
This paper introduces an explainable machine learning framework that identifies and interprets dietary patterns from UK survey data, aiding dietetic assessment and counseling.
Contribution
It combines unsupervised clustering with supervised interpretation to discover and explain dietary patterns using UK dietary survey data.
Findings
Identified four interpretable dietary patterns with K-means clustering.
Supervised classifier achieved high test performance (macro-F1 = 0.963).
SHAP analysis linked predictions to meaningful dietary drivers.
Abstract
Clinical dietary assessment can generate detailed but high-dimensional nutrient and food-group information that is difficult to translate quickly into counselling priorities. This paper proposes an explainable unsupervised-to-supervised machine learning framework for discovering, reproducing and interpreting dietary patterns using public UK National Diet and Nutrition Survey data. Adult participants aged 19 years and above from NDNS Years 12-15 were represented using 25 energy-adjusted nutrient and food-group features. K-means, Gaussian Mixture Models and Agglomerative Clustering were compared across k = 2-8, with stability and dietetic interpretability used alongside internal validation metrics. The selected K-means k = 4 solution identified four interpretable dietary patterns: high fat/meat and sodium, higher fibre fruit-vegetable micronutrient, high free-sugar snacks and sugary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
