# Data-Driven Soft Sensing for Raw Milk Ethanol Stability Prediction

**Authors:** Song Shen, Xiaodong Song, Haohan Ding, Xiaohui Cui, Zhenqi Xie, Huadi Huang, Guanjun Dong

PMC · DOI: 10.3390/s26030903 · Sensors (Basel, Switzerland) · 2026-01-30

## TL;DR

This paper introduces a non-destructive method to predict raw milk ethanol stability using data from routine measurements, avoiding traditional lab tests.

## Contribution

A novel soft sensing model using autoencoders and TabNet with data augmentation to predict ethanol stability from routine dairy data.

## Key findings

- The model achieved 92.57% accuracy in predicting ethanol stability from raw milk data.
- A TabDDPM-based method improved performance by addressing class imbalance in the dataset.
- The approach shows strong potential for real-world dairy quality monitoring.

## Abstract

Ethanol stability is an important indicator for evaluating the quality and heat-processing suitability of raw milk. Traditional ethanol stability testing relies on destructive laboratory procedures, which are not suitable for large-scale industrial use. In contrast, parameters such as protein, fat, lactose and other basic compositional indicators are already measured routinely in dairy plants through sensor-based or spectroscopic systems. This provides the basis for developing a non-destructive soft sensing approach for ethanol stability. In this study, a soft sensing model was developed to predict ethanol stability from commonly monitored raw-milk intake indicators. An autoencoder was used to examine feature correlations and select variables with stronger relevance to ethanol stability. TabNet was then applied to build the classification model, and a TabDDPM-based data generation method was introduced to address class imbalance in the dataset. The proposed model was trained and tested using three years of industrial raw-milk intake data from a dairy company. It achieved an accuracy of 92.57% and a recall of 90.26% for identifying ethanol-unstable samples. These results demonstrate the model’s strong potential for practical engineering applications in real-world dairy quality monitoring.

## Full-text entities

- **Chemicals:** lactose (MESH:D007785), Ethanol (MESH:D000431)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12899035/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12899035/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12899035/full.md

---
Source: https://tomesphere.com/paper/PMC12899035