# A machine-learning clustering approach for reference interval estimation of liver enzymes from hospital laboratory big-data

**Authors:** Prakruti Dash, Saurav Nayak

PMC · DOI: 10.6026/973206300211069 · Bioinformation · 2025-05-31

## TL;DR

This paper uses machine learning and big data from hospital labs to determine accurate reference intervals for liver enzymes AST and ALT.

## Contribution

A novel combination of clustering and outlier detection methods is applied to estimate liver enzyme reference intervals from real-world data.

## Key findings

- DBSCAN with Tukey's fences or Local Outlier Factor performed best in covering validation data.
- Estimated AST and ALT reference intervals are 15-41 U/L and 11-46 U/L, respectively.

## Abstract

It is of interest to establish clinically valid reference intervals (RIs) for the liver enzymes aspartate transaminase (AST) and
Alanine aminotransferase (ALT) using a combination of unsupervised machine learning clustering and robust outlier detection applied to
real-world laboratory big data. Four outlier detection methods were each combined with four clustering algorithms to identify homogeneous
subgroups and the largest cluster from each combination was used to estimate RIs based on percentile cut-offs. Among the tested
combinations, DBSCAN with Tukey's fences or Local Outlier Factor achieved optimal performance, covering 100% of the validation data. The
widest intervals were derived using Local Outlier Factor, while Isolation Forest yielded the narrowest. Ultimately, the study estimated
the reference intervals for AST and ALT to be 15-41 U/L and 11-46 U/L, respectively.

## Full-text entities

- **Genes:** SLC17A5 (solute carrier family 17 member 5) [NCBI Gene 26503] {aka AST, ISSD, NSD, SD, SIALIN, SIASD}, GPT (glutamic--pyruvic transaminase) [NCBI Gene 2875] {aka AAT1, ALT, ALT1, GPT1, SGPT}

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12357660/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/PMC12357660/full.md

---
Source: https://tomesphere.com/paper/PMC12357660