# Machine Learning-Enabled Rapid Assessment of Plant-Based Protein Digestibility Through Physicochemical Profiles

**Authors:** Meichen Liu, Ruoyan Zhang, Hao Yin, Yu Zhong, Yapeng Fang, Cuixia Sun, Yun Deng

PMC · DOI: 10.3390/foods14223874 · Foods · 2025-11-13

## TL;DR

This paper introduces a machine learning model that quickly predicts how easily plant-based proteins can be digested, using just a small sample and key physicochemical features.

## Contribution

A novel deep learning framework that enables rapid and minimal-sample prediction of plant protein digestibility.

## Key findings

- The model reduced digestibility assessment time by 80% and required only 1/100th of the sample size.
- Key predictors of digestibility include α-helix content, random coil content, and solubility.
- The model achieved high accuracy (R² = 0.91) when validated against independent datasets.

## Abstract

Plant-based proteins offer sustainable alternatives to animal sources, yet their lower digestibility remains a critical barrier to widespread applications. Current digestibility assessment methods require days of analysis and gram-scale samples, creating significant bottlenecks in protein optimization workflows. This study developed an ensembled deep learning framework that transforms digestibility prediction from a resource-intensive process to a rapid, minimal-sample assessment. By systematically characterizing 23 diverse plant protein isolates across multiple physicochemical dimensions, we trained a feedforward neural network based on augmented data. Our model identified α-helix content, random coil content, and solubility as key digestibility indicators. This insight enabled the construction of a streamlined three-feature model that reduced assessment time by 80% while requiring only one-hundredth of standard sample amounts. When validated against independent published datasets, the model achieved rational prediction accuracy, with an R2 = 0.91. These findings establish a transformative framework for accelerating plant protein development, enabling rapid screening of novel sources and targeted modification strategies to enhance nutritional bioavailability, ultimately advancing sustainable food system transitions.

## Full-text entities

- **Diseases:** Bean (MESH:C536240), obesity (MESH:D009765), FNN (MESH:D015441), injury to (MESH:D014947), lactose intolerance (MESH:D007787)
- **Chemicals:** HCl (MESH:D006851), Hydrogen (MESH:D006859), H2SO4 (MESH:C033158), NaOH (MESH:D012972), cholesterol (MESH:D002784), SC (MESH:D012538), water (MESH:D014867), E (MESH:D004540), methyl red (MESH:C008492), nitrogen (MESH:D009584), amino acid (MESH:D000596), urea (MESH:D014508), bromocresol green (MESH:D001961), K2SO4 (MESH:C031512), PC (MESH:C053518), Coomassie Brilliant Blue (MESH:C004692), TCA (MESH:D014238), bile salts (MESH:D001647), NaCl (MESH:D012965), CaCl2(H2O)2 (-), Disulfide (MESH:D004220), ammonia (MESH:D000641), salt (MESH:D012492), phosphate (MESH:D010710), n-hexane (MESH:C026385)
- **Species:** Sesamum indicum (beniseed, species) [taxon 4182], Bacillus sp. SA (species) [taxon 1168094], PX clade (clade) [taxon 569578], Oryza sativa (Asian cultivated rice, species) [taxon 4530], Paenibacillus sp. EA2 (species) [taxon 1167411], Homo sapiens (human, species) [taxon 9606], Vicia faba (broad bean, species) [taxon 3906], Powellomyces sp. EA (species) [taxon 252690], Glycine max (soybean, species) [taxon 3847], Cicer arietinum (chickpea, species) [taxon 3827]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12651773/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12651773/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/PMC12651773/full.md

---
Source: https://tomesphere.com/paper/PMC12651773