# Urine Metabolomics and Machine Learning Identify Metabolic Features and Potential Biomarkers of HTLV-1-Associated Myelopathy (HAM)

**Authors:** Lorena Abreu Fernandes, Youko Nukui, Rosa Maria Marcusso, Michel Elyas Jung Haziot, Augusto César Penalva de Oliveira, Jorge Casseb, Patricia Bianca Clissa, Ana Olivia de Souza, Silas G. Villas-Boas, Sabri Saeed Sanabani

PMC · DOI: 10.3390/ijms27041827 · International Journal of Molecular Sciences · 2026-02-14

## TL;DR

This study uses urine metabolomics and machine learning to find noninvasive biomarkers for a neuroinflammatory disease caused by HTLV-1.

## Contribution

The novel use of untargeted urine metabolomics and machine learning to identify potential biomarkers for HTLV-1-associated myelopathy.

## Key findings

- 175 metabolites were identified, with 86 showing significant differences across groups.
- Machine learning models achieved robust separation of HAM from other groups using specific metabolites.
- Key metabolites like histidine and 4-hydroxyphenylacetic acid showed reduced levels in HAM with high diagnostic accuracy.

## Abstract

Human T-cell lymphotropic virus type 1 (HTLV-1) can cause HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP), a progressive neuroinflammatory disease that lacks noninvasive biomarkers. We used untargeted urine metabolomics with machine learning to profile 113 participants (39 with HAM, 17 with intermediate syndrome, 33 asymptomatic carriers, and 24 healthy controls). Gas chromatography–mass spectrometry identified 175 metabolites, 86 of which showed significant differences (fold change > 2, FDR p < 0.05). Multivariate analyses revealed distinct but partially overlapping metabolic profiles: sPLS-DA captured a reproducible yet moderately discriminative signal, while nonlinear machine learning models (Random Forest and SVM) achieved robust group separation, with HAM displaying a distinct metabolic signature. Key discriminators included Unknown_151, Unknown_127, histidine, alanine, and 4-hydroxyphenylacetic acid, which showed marked reductions in HAM and yielded ROC AUCs of 0.855–0.871. Pathway and disease enrichment analyses highlighted disturbances in amino acid metabolism, particularly beta-alanine and aromatic amino acids, along with disease signatures related to inherited amino acid handling disorders such as hyperlysinemia. These results demonstrate that urinary metabolomics combined with machine learning can identify potential noninvasive biomarkers for HAM and provide novel insights into HTLV-1-associated pathophysiology.

## Linked entities

- **Chemicals:** histidine (PubChem CID 773), alanine (PubChem CID 239), 4-hydroxyphenylacetic acid (PubChem CID 127)
- **Diseases:** hyperlysinemia (MONDO:0009388)

## Full-text entities

- **Genes:** CNTN2 (contactin 2) [NCBI Gene 6900] {aka AXT, EPEO5, FAME5, TAG-1, TAX, TAX1}
- **Diseases:** HTLV 1 infection (MESH:D006800), tyrosinemia (MESH:D020176), fatigue (MESH:D005221), familial hyperlysinemia (MESH:D020167), neurological disease (MESH:D020271), hyperdibasic aminoaciduria I (MESH:C562687), carbamoyl phosphate synthetase deficiency (MESH:D020165), Urinary dysfunction (MESH:D001745), Myelopathy (MESH:D013118), metabolic disorders (MESH:D008659), mitochondrial dysfunction (MESH:D028361), HC (MESH:D000067329), injury to (MESH:D014947), neurodegeneration (MESH:D019636), chronic inflammation (MESH:D007249), ATL (MESH:D015459), neuroinflammation (MESH:D000090862), diabetes (MESH:D003920), tumors (MESH:D009369), amino acid metabolism disorders (MESH:D000592), phenylketonuria (MESH:D010661), kidney disease (MESH:D007674), chronic (MESH:D002908), infectious diseases (MESH:D003141), incontinence (MESH:D014549), MS (MESH:D009103), urine retention (MESH:D016055), ASC (MESH:D015490), HAM (MESH:D015493), digestive disorders (MESH:D004066), IS (MESH:D001924)
- **Chemicals:** spermine (MESH:D013096), stearic acid (MESH:C031183), tyrosine (MESH:D014443), water (MESH:D014867), NaOH (MESH:D012972), dipeptide (MESH:D004151), pyrimidine (MESH:C030986), glutamate (MESH:D018698), glutaric acid (MESH:C035736), biotin (MESH:D001710), aromatic amino acid (MESH:D024322), methyl chloroformate (MESH:C014667), proline (MESH:D011392), suberic acid (MESH:C005738), methanol (MESH:D000432), metal (MESH:D008670), adipic acid (MESH:C029900), dihydrouracil (MESH:C007419), alanine (MESH:D000409), L-histidine (MESH:D006639), nitrogen (MESH:D009584), uracil (MESH:D014498), anserine (MESH:D000861), lipid (MESH:D008055), cysteine (MESH:D003545), citrate (MESH:D019343), glutathione (MESH:D005978), Beta-alanine (MESH:D015091), polyamine (MESH:D011073), spermidine (MESH:D013095), tryptophan (MESH:D014364), lysine (MESH:D008239), caffeine (MESH:D002110), 3-aminopropanal (MESH:C050862), pyridine (MESH:C023666), Helium (MESH:D006371), 3-hydroxy fatty acids (-), 4-hydroxyphenylacetic acid (MESH:C008070), arginine (MESH:D001120), acrolein (MESH:D000171), TCA (MESH:D014238), CoA (MESH:D003065), propanoate (MESH:D011422), phenylalanine (MESH:D010649), Amino acids (MESH:D000596), aspartate (MESH:D001224), 2-aminoadipic acid (MESH:D015074)
- **Species:** Human T-cell leukemia virus type I (no rank) [taxon 11908], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12940815/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12940815/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC12940815/full.md

---
Source: https://tomesphere.com/paper/PMC12940815