# A Machine Learning-Guided Approach for Identifying Potential HCAR1 Antagonists in Lactate-Driven Cancers

**Authors:** Letícia Vivas Carvalho, Núbia Seyffert, Roberto Meyer, Sandeep Tiwari, Thiago Luiz de Paula Castro

PMC · DOI: 10.1021/acsomega.5c09253 · ACS Omega · 2026-02-03

## TL;DR

This study uses machine learning to identify potential HCAR1 antagonists, which could help treat lactate-driven cancers by targeting a receptor involved in tumor progression and drug resistance.

## Contribution

A novel SVM model and framework integrating conformational docking and SHAP analysis to identify and interpret HCAR1 antagonists.

## Key findings

- The SVM model achieved 79.3% accuracy and an AUC of 0.94 in classifying HCAR1 ligands as agonists or antagonists.
- Polar, rigid, and aromatic substructures were identified as key features for HCAR1 antagonist selectivity.
- Three compounds, including two FDA-approved drugs, showed promising antagonistic potential based on model predictions and docking scores.

## Abstract

GPR81 (HCAR1) is a lactate-sensing G protein-coupled
receptor (GPCR)
involved in tumor progression, immune evasion, and therapeutic resistance
across various cancers. Despite their clinical relevance and druggable
nature, selective HCAR1 antagonists have yet to be identified. This
study aimed to construct a statistically significant Support Vector
Machine (SVM) model for binary classification (agonists versus antagonists)
of HCAR1’s potential ligands and the prioritization of molecular
substructures driving antagonism and receptor selectivity. An SVM
model was trained on 144 ligands (66 agonists, 78 antagonists), listed
in the IUPHAR/BPS Guide to Pharmacology, from 12 structurally related
Class A GPCRs (HCAR1, HCAR2, HCAR3, OXER1, GPR35, SUCNR1, P2Y2, MCHR1,
OPRD1, AGTR1, ADORA2A, and ADRA1A). Their ligands were encoded using
physicochemical descriptors, 2048-bit ECFP4 fingerprints, and ΔAffinity
scores from molecular docking to active and inactive receptor conformations.
The data set was split 80/20 for training and testing, respectively,
with hyperparameters (C, γ) being optimized
via 5-fold cross-validation. SHAP analysis was performed for feature
interpretation. The final SVM model achieved a test set accuracy of
79.3%, with a sensitivity of 69.2% and specificity of 87.5%. The ROC
analysis yielded an AUC of 0.94, while bootstrapping confirmed robust
performance with a mean AUC of 0.874 and a 95% confidence interval
[0.711, 1.000]. SHAP analysis highlighted polar, rigid, and aromatic
substructures as selectivity-driving features. We applied the model
to screen 3,377 compounds from natural products, synthetic libraries,
and FDA-approved drugs, prioritizing potential HCAR1 ligands with
antagonist-like features. Based on ΔAffinity, off-target scores,
and prediction confidence, Ketanserin, Cryptopyranmoscatone A1 diacetate,
and Cefuroxime emerged as reference ligands with promising antagonistic
potential, two of which are FDA-approved drugs. Rather than representing
final hits, these molecules illustrate how structural and electronic
features can favor the stabilization of inactive states in HCAR1.
Overall, this work presents a proof-of-concept framework that integrates
conformational docking, machine learning, and substructure interpretation
to elucidate the chemical and structural determinants of HCAR1 antagonism.
The findings provide fragment-level insights that may guide future
bioisosteric and fragment-based design of selective antagonists for
lactate-driven tumors.

## Linked entities

- **Genes:** HCAR1 (hydroxycarboxylic acid receptor 1) [NCBI Gene 27198], HCAR1 (hydroxycarboxylic acid receptor 1) [NCBI Gene 27198], HCAR2 (hydroxycarboxylic acid receptor 2) [NCBI Gene 338442], HCAR3 (hydroxycarboxylic acid receptor 3) [NCBI Gene 8843], OXER1 (oxoeicosanoid receptor 1) [NCBI Gene 165140], GPR35 (G protein-coupled receptor 35) [NCBI Gene 2859], SUCNR1 (succinate receptor 1) [NCBI Gene 56670], P2RY2 (purinergic receptor P2Y2) [NCBI Gene 5029], MCHR1 (melanin concentrating hormone receptor 1) [NCBI Gene 2847], OPRD1 (opioid receptor delta 1) [NCBI Gene 4985], AGTR1 (angiotensin II receptor type 1) [NCBI Gene 185], ADORA2A (adenosine A2a receptor) [NCBI Gene 135], ADRA1A (adrenoceptor alpha 1A) [NCBI Gene 148]
- **Chemicals:** Ketanserin (PubChem CID 3822), Cefuroxime (PubChem CID 5479529)

## Full-text entities

- **Genes:** ADORA2A (adenosine A2a receptor) [NCBI Gene 135] {aka A2aR, ADORA2, RDC8}, P2RY2 (purinergic receptor P2Y2) [NCBI Gene 5029] {aka HP2U, P2RU1, P2U, P2U1, P2UR, P2Y2}, MCHR1 (melanin concentrating hormone receptor 1) [NCBI Gene 2847] {aka GPR24, MCH-1R, MCH1R, SLC-1, SLC1}, GNAI1 (G protein subunit alpha i1) [NCBI Gene 2770] {aka Gi, HG1B, NEDHISB}, ITIH2 (inter-alpha-trypsin inhibitor heavy chain 2) [NCBI Gene 3698] {aka H2P, ITI-HC2, SHAP}, TPM3 (tropomyosin 3) [NCBI Gene 7170] {aka CAPM1, CFTD, CMYO4A, CMYO4B, CMYP4A, CMYP4B}, Pdcd1 (programmed cell death 1) [NCBI Gene 18566] {aka Ly101, PD-1, Pdc1}, HCAR2 (hydroxycarboxylic acid receptor 2) [NCBI Gene 338442] {aka GPR109A, HCA2, HM74a, HM74b, NIACR1, PUMAG}, VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422] {aka L-VEGF, MVCD1, VEGF, VPF}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, IGKV2D-29 (immunoglobulin kappa variable 2D-29) [NCBI Gene 28882] {aka A2a, A2c, IGKV2D29}, AGTR1 (angiotensin II receptor type 1) [NCBI Gene 185] {aka AG2S, AGTR1B, AT1, AT1AR, AT1B, AT1BR}, ADRA1A (adrenoceptor alpha 1A) [NCBI Gene 148] {aka ADRA1C, ADRA1L1, ALPHA1AAR}, HTR2A (5-hydroxytryptamine receptor 2A) [NCBI Gene 3356] {aka 5-HT2A, HTR2}, ABL1 (ABL proto-oncogene 1, non-receptor tyrosine kinase) [NCBI Gene 25] {aka ABL, BCR-ABL, CHDSKM, JTK7, bcr/abl, c-ABL}, SLC18A2 (solute carrier family 18 member A2) [NCBI Gene 6571] {aka PKDYS2, SVAT, SVMT, VAT2, VMAT2}, GPR35 (G protein-coupled receptor 35) [NCBI Gene 2859], Hcar1 (hydrocarboxylic acid receptor 1) [NCBI Gene 243270] {aka Gpr81}, HCA1 (Hypercalciuria, absorptive, 1) [NCBI Gene 266790] {aka AH, HCA}, HCAR1 (hydroxycarboxylic acid receptor 1) [NCBI Gene 27198] {aka FKSG80, GPR104, GPR81, HCA1, LACR1, TA-GPCR}, HCAR3 (hydroxycarboxylic acid receptor 3) [NCBI Gene 8843] {aka GPR109B, HCA3, HM74, PUMAG, Puma-g}, OXER1 (oxoeicosanoid receptor 1) [NCBI Gene 165140] {aka GPCR, GPR170, TG1019}, SUCNR1 (succinate receptor 1) [NCBI Gene 56670] {aka GPR91}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}, CD274 (CD274 molecule) [NCBI Gene 29126] {aka ADMIO5, B7-H, B7H1, PD-L1, PDCD1L1, PDCD1LG1}, OPRD1 (opioid receptor delta 1) [NCBI Gene 4985] {aka DOP, DOR, DOR1, OPRD}, CXCR4 (C-X-C motif chemokine receptor 4) [NCBI Gene 7852] {aka CD184, D2S201E, FB22, HM89, HSY3RR, LCR1}
- **Diseases:** Lactate-Driven Cancers (MESH:D009369), PDAC (MESH:D021441), RBF (MESH:D020425), oncogenes (MESH:D000074723), tumorigenesis (MESH:D063646), colorectal cancer (MESH:D015179)
- **Chemicals:** fluorobenzene (MESH:D005464), D-lactic acid (MESH:D019344), carboxylic acids (MESH:D002264), diamine (MESH:D003959), ketone (MESH:D007659), ester (MESH:D004952), 4-hydroxybutanoate (MESH:D012978), carbon (MESH:D002244), succinate (MESH:D019802), allylbenzene (MESH:C102347), Reserpine (MESH:D012110), oxygen (MESH:D010100), 3,5-dihydroxybenzoic acid (MESH:C076950), indole alkaloid (MESH:D026121), water (MESH:D014867), amide (MESH:D000577), yohimban (MESH:D046948), alkene (MESH:D000475), nicotinic acid (MESH:D009525), TMBA (MESH:C063620), amino acids (MESH:D000596), carbonyl diamides (MESH:D014508), amines (MESH:D000588), Methyl ketone (MESH:D000096), fatty acids (MESH:D005227), Vinca alkaloid (MESH:D014748), BraCoLi (-), furan (MESH:C039281), Cefuroxime (MESH:D002444), beta-lactam (MESH:D047090), Hydrogen (MESH:D006859), glucose (MESH:D005947), ether (MESH:D004986), 3-chloro-5-hydroxybenzoic acid (MESH:C000726152), reserpic acid (MESH:C046455), Ketanserin (MESH:D007650)
- **Species:** Catharanthus roseus (chatas, species) [taxon 4058], Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606], Rauvolfia serpentina (devilpepper, species) [taxon 4060]
- **Mutations:** Y268A, Arg240, A2A, Tyr268, R240A, AUC of 0

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12917714/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12917714/full.md

## References

64 references — full list in the complete paper: https://tomesphere.com/paper/PMC12917714/full.md

---
Source: https://tomesphere.com/paper/PMC12917714