# Application of Multivariate Adaptive Regression Splines to Estimate Fatty Liver Index in Healthy Young Taiwanese Men

**Authors:** Po-Chung Chen, Chung-Chi Yang, Dee Pei, Ta-Wei Chu, Jyh-Gang Leu

PMC · DOI: 10.3390/diagnostics16050795 · Diagnostics · 2026-03-07

## TL;DR

This study uses a statistical method called MARS to better predict fatty liver index in young healthy men from Taiwan.

## Contribution

The study introduces a MARS-based predictive equation for fatty liver index that outperforms traditional linear regression.

## Key findings

- MARS modeling showed lower estimation errors compared to multiple linear regression for predicting fatty liver index.
- The MARS-derived equation explains 62.7% of the variance in fatty liver index.
- CRP was identified as the most influential predictor of fatty liver index in this population.

## Abstract

Background: Non-alcoholic fatty liver disease (NAFLD) represents the most widespread chronic liver disorder globally, impacting roughly 30% of the general population. Numerous factors have been linked to NAFLD, including obesity, type 2 diabetes, diet, physical inactivity, age, sex, genetic factors, and metabolic syndrome. Previous research predominantly treated NAFLD as a categorical outcome, providing less granular data compared to the continuous fatty liver index (FLI). This investigation enrolled healthy young Taiwanese men and applied multivariate adaptive regression spline (MARS) modeling to develop a predictive equation. Our aims were twofold: 1. To assess the predictive accuracy of traditional multiple linear regression (MLR) versus MARS. 2. To construct a MARS-derived equation for estimating FLI in this demographic. Methods: Data originated from the Taiwan MJ Cohort, comprising 5496 men aged 20–50 years not using medications for metabolic syndrome. MARS was used to formulate the FLI estimation equation. Model performance was compared using symmetric mean absolute percentage error (SMAPE), relative absolute error (RAE), root relative squared error (RRSE), and root mean squared error (RMSE). Results: Evaluation indicated that MARS yielded lower estimation errors than MLR, demonstrating its superior performance. The derived equation is: FLI = 65.224 − 0.436 × B1 − 0.490 × B2 + 0.252 × B3 − 2.962 × B4 + 2.231 × B5 − 0.292 × B6 + 0.189 × B7 − 0.361 × B8 − 0.699 × B9 + 0.160 × B10 − 2.715 × B11 + 0.799 × B12 − 0.153 × B13 + 0.084 × B14 − 35.274 × B15 − 4.424 × B16. Conclusions: Using MLR as a benchmark, our analysis revealed that MARS delivered better predictive performance. The presented equation explains 62.7% of the variance in FLI (r2 = 0.627). Based on standardized variable importance scores (nsubsets metric), CRP emerged as the most influential predictor, followed by WBC, UA, HDL-C, AST, age, ALT, FPG, SBP, and LDL in this cohort of healthy young Taiwanese men.

## Linked entities

- **Chemicals:** UA (PubChem CID 16040291), ALT (PubChem CID 10219674)
- **Diseases:** Non-alcoholic fatty liver disease (MONDO:0013209), metabolic syndrome (MONDO:0000816), type 2 diabetes (MONDO:0005148)

## Full-text entities

- **Genes:** CRP (C-reactive protein) [NCBI Gene 1401] {aka PTX1}, SLC17A5 (solute carrier family 17 member 5) [NCBI Gene 26503] {aka AST, ISSD, NSD, SD, SIALIN, SIASD}
- **Diseases:** metabolic syndrome (MESH:D024821), liver disorder (MESH:D017093), Fatty Liver (MESH:D005234), obesity (MESH:D009765), type 2 diabetes (MESH:D003924), NAFLD (MESH:D065626)
- **Chemicals:** B6 (-), B12 (MESH:C034730)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12984924/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12984924/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/PMC12984924/full.md

---
Source: https://tomesphere.com/paper/PMC12984924